CN110247795A - A kind of cloud net resource service chain method of combination and system based on intention - Google Patents

A kind of cloud net resource service chain method of combination and system based on intention Download PDF

Info

Publication number
CN110247795A
CN110247795A CN201910461367.6A CN201910461367A CN110247795A CN 110247795 A CN110247795 A CN 110247795A CN 201910461367 A CN201910461367 A CN 201910461367A CN 110247795 A CN110247795 A CN 110247795A
Authority
CN
China
Prior art keywords
service
layout
cost
intention
net resource
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910461367.6A
Other languages
Chinese (zh)
Other versions
CN110247795B (en
Inventor
郭少勇
喻鹏
邱雪松
贺文晨
李文萃
申京
邵苏杰
徐思雅
亓峰
丰雷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Information and Telecommunication Branch of State Grid Henan Electric Power Co Ltd
Original Assignee
Beijing University of Posts and Telecommunications
Information and Telecommunication Branch of State Grid Henan Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications, Information and Telecommunication Branch of State Grid Henan Electric Power Co Ltd filed Critical Beijing University of Posts and Telecommunications
Priority to CN201910461367.6A priority Critical patent/CN110247795B/en
Publication of CN110247795A publication Critical patent/CN110247795A/en
Application granted granted Critical
Publication of CN110247795B publication Critical patent/CN110247795B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network

Abstract

The embodiment of the invention provides a kind of cloud net resource service chain method of combination and system based on intention, method include: to provide end-to-end service to cloud net resource based on preset northbound interface frame of reference;Based on the service chaining cradle of deeply study, online layout is carried out to the end-to-end service and dynamic adjusts, wherein in the online layout and dynamic adjustment, solves default multi-goal Optimization Model to minimize service chaining layout cost and delay.A kind of cloud net resource service chain method of combination and system based on intention provided in an embodiment of the present invention, the preset northbound interface frame of reference of offer and the SFC cradle based on DRL are provided, and a multi-goal Optimization Model is constructed, to reduce long term service chain layout cost to the maximum extent.

Description

A kind of cloud net resource service chain method of combination and system based on intention
Technical field
The present invention relates to field of communication technology more particularly to a kind of cloud net resource service chain method of combination based on intention And system.
Background technique
The rapid growth of Internet of Things service with different QoS requirement brings quickly delivery and QoS to network operator Guarantee the huge challenge of aspect.Network function virtualizes (Network Function Virtualization, NFV) and software Define the pass that network (Software Defined Networking, SDN) has become flexible resource distribution and dynamic Service supply Key technology.But both technologies still need application manual operation to define service model and Configuration network details, in turn Need high professional qualification administrator and a large amount of time.It is unfavorable that these manual tasks have raising reliability and quick offer service Effect.Therefore, it is proposed to which the network (the Intent Based Networking, IBN) based on intention is to simplify low-level configuration And accelerate service offering.
A critical aspects for supporting the service provision based on intention are north orientations unrelated with supplier and unrelated with technology Interface (northbound interface, NBI), for customer language to be converted to service chaining (Service Function Chain, SFC) abstract definition.Another committed step is the online layout of the abstract definition based on SFC model, to realize Requirement drive, the service offering mode of adjust automatically.
However the above method is there is still a need for obtaining complete network details in advance to obtain globally optimal solution, but these are accurate Information be generally difficult to collect.Therefore a kind of cloud net resource service chain method of combination based on intention is needed now to solve State problem.
Summary of the invention
To solve the above-mentioned problems, the embodiment of the present invention provides one kind and overcomes the above problem or at least be partially solved A kind of cloud net resource service chain method of combination and system based on intention of the above problem.
The first aspect embodiment of the present invention provides a kind of cloud net resource service chain method of combination based on intention, comprising:
Based on preset northbound interface frame of reference, end-to-end service is provided to cloud net resource;
Based on the service chaining cradle of deeply study, online layout and dynamic are carried out to the end-to-end service Adjustment, wherein in the online layout and dynamic adjustment, solve default multi-goal Optimization Model to minimize service Chain layout cost and delay.
Wherein, the multi-goal Optimization Model indicates are as follows:
min{cost(server)+cost(link)}
Wherein, cost (server) is the cost of the relevant cost of server resource, the forwarding of cost (link) flow, C1, C2,C3,C4,C5,C6,C7For resource constraint.
Wherein, described to solve default multi-goal Optimization Model to minimize service chaining layout cost and delay, it wraps It includes:
The optimal solution of the multi-goal Optimization Model is obtained based on the preset double-deck depth Q network algorithm.
Wherein, described that the optimal of the multi-goal Optimization Model is obtained based on the preset double-deck depth Q network algorithm Solution, comprising:
Operation flow is initialized;
Based on the preset double-deck depth Q network, arranging service is carried out to the operation flow after initialization.
It is wherein, described that operation flow is initialized, comprising:
The target protocol for concentrating random selection to meet the requirements from Cloud Server;
Based on shortest path selection algorithm, the target routing plan between VNF is determined;
Calculate the layout expense of all service chainings.
Wherein, described based on the preset double-deck depth Q network, arranging service, packet are carried out to the operation flow after initialization It includes:
To the double-deck depth Q network inputs state after state space initialization;
It obtains the corresponding movement of input state and calculates target Q value;
Input state is updated based on gradient descent method until reaching preset termination condition.
The second aspect embodiment of the present invention also provides a kind of cloud net resource service chain arranging system based on intention, comprising:
Service module provides end-to-end service to cloud net resource for being based on preset northbound interface frame of reference;
Layout adjust module, the service chaining cradle for being learnt based on deeply, to the end-to-end service into The online layout of row and dynamic adjust, wherein in the online layout and dynamic adjustment, solve default multi-objective optimization question Model is to minimize service chaining layout cost and delay.
The embodiment of the invention provides a kind of electronic equipment for the third aspect, comprising:
Processor, memory, communication interface and bus;Wherein, the processor, memory, communication interface pass through described Bus completes mutual communication;The memory is stored with the program instruction that can be executed by the processor, the processor Described program instruction is called to be able to carry out the above-mentioned cloud net resource service chain method of combination based on intention.
The embodiment of the invention provides a kind of non-transient computer readable storage medium, the non-transient meters for fourth aspect Calculation machine readable storage medium storing program for executing stores computer instruction, and it is above-mentioned based on intention that the computer instruction executes the computer Cloud net resource service chain method of combination.
A kind of cloud net resource service chain method of combination and system based on intention provided in an embodiment of the present invention, by mentioning For preset northbound interface frame of reference and the SFC cradle based on DRL, and construct a multi-objective optimization question mould Type, to reduce long term service chain layout cost to the maximum extent.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, below will to embodiment or Attached drawing needed to be used in the description of the prior art is briefly described, it should be apparent that, the accompanying drawings in the following description is this Some embodiments of invention without creative efforts, may be used also for those of ordinary skill in the art To obtain other drawings based on these drawings.
Fig. 1 is a kind of cloud net resource service chain method of combination process signal based on intention provided in an embodiment of the present invention Figure;
Fig. 2 is the train epochs schematic diagram under different learning rates provided in an embodiment of the present invention;
Fig. 3 is the train epochs schematic diagram under algorithms of different provided in an embodiment of the present invention;
Fig. 4 is the average delay schematic diagram of algorithms of different provided in an embodiment of the present invention;
Fig. 5 is the totle drilling cost schematic diagram of algorithms of different provided in an embodiment of the present invention;
Fig. 6 is a kind of cloud net resource service chain arranging system structural representation based on intention provided in an embodiment of the present invention Figure;
Fig. 7 is the structural block diagram of electronic equipment provided in an embodiment of the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical solution in the embodiment of the present invention is explicitly described, it is clear that described embodiment is this hair Bright a part of the embodiment, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art exist Every other embodiment obtained under the premise of creative work is not made, shall fall within the protection scope of the present invention.
Fig. 1 is a kind of cloud net resource service chain method of combination process signal based on intention provided in an embodiment of the present invention Figure, as shown in Figure 1, comprising:
101, it is based on preset northbound interface frame of reference, provides end-to-end service to cloud net resource;
102, the service chaining cradle based on deeply study, to the end-to-end service carry out online layout and Dynamic adjusts, wherein in the online layout and dynamic adjustment, solves default multi-goal Optimization Model to minimize Service chaining layout cost and delay.
It should be noted that the application scenarios of the embodiment of the present invention are to be in Internet of Things how to realize to cloud net resource Service chaining layout, in this scenario, the embodiment of the present invention proposes an IBN frame of reference and sets to manage Internet of Things basis It applies and provides end-to-end service across multiple domains.Specifically, in a step 101, IBN frame of reference provided in an embodiment of the present invention It include: VNF manager (VNF Manager, VNFM) and NFV composer (NFV Orchestrator, NFVO), management and control Plane processed and data plane.VNF manager and NFV composer are for allowing client to state IR using human-readable language, so Declaratively strategy it will be converted to high-level service by the NBI based on intention afterwards and be abstracted, such as VNF property, qos feature and threshold value. Plane is managed and controlled including virtualization architecture manager (VIM) and controller, VIM is by high-level abstractions policy mappings Low level services chain layout strategy coordinates SDN_C and cloud controller (Cloud_C) to automate SFC layout.Data plane is used for It places and flows by southbound interface (Southbound Interface, SBI) receiving control message, and for the VNF in cloud domain Amount routing provides physical resource, and the sensor and actuator of Internet of Things domain are responsible for data collection.So pass through aforementioned present invention IBN frame of reference provided by embodiment can be realized management Internet of Things infrastructure and provide end-to-end service across multiple domains.
Further, in a step 102, the embodiment of the invention provides a kind of service chainings based on deeply study Cradle, i.e. the SFC cradle based on DRL.The SFC cradle of the DRL can pass through NBI and net based on intention Network learning model obtains SFC abstract model and environment detail.Be then based on DRL SFC cradle can tuning controller, The corresponding actions of current network are realized by the specific SBI of technology, meanwhile, network provides the feedback in relation to rewarding or punishing, To promote the SFC cradle of DRL to adjust its behavior, by effectively training so that the SFC cradle realization of DRL is optimal Strategy, cloud net resource service chain layout scheme needed for the optimal policy, that is, embodiment of the present invention.Wherein, optimal plan is being obtained During slightly, the embodiment of the present invention establishes a multi-goal Optimization Model to minimize SFC layout cost and prolong Late, final required cloud net resource service chain layout scheme is obtained by the optimal solution to the multi-goal Optimization Model.
Wherein, in the multi-goal Optimization Model that the embodiment of the present invention is established, SFC is by triple s={ (vso, vde)s,Fs,rsIndicate, wherein (vso,vde)sIndicate s source node to and destination node pair. vsoGenerating has data transmission speed Rate is rsFlow.FsIndicate specific SFC information, the attribute including VNF, sequence and connectivity.Needed for Cloud Server The f ∈ F of CPU, memory source and VNFsProcessing postpone by cpuf, memfAnd dfIt indicates.Virtual chain between the u and w of VNF Connect byIt indicates.Define DsTo postpone threshold value.
So the physical network in cloud domain is indicated by weighted undirected graph G=(N, L), and wherein N and L respectively indicates node and has Wired link.Node is divided into two classes: the first kind is the interchanger for converting flow, and the second class is for hosts virtual machine v ∈ V Cloud Server.Server and the quantity of link are indicated with M and H.Cloud Server v has based on the CPU by placing VNF example Calculation and memory source, these examples are respectively by Capcpu(v) and Capmem(v) it indicates.Physical link l between node i and jijTool There is maximum data transfer rate bijWith transmission delay dij Indicate that the f of the VNF in s has been mapped to Cloud Server, Otherwise it is Indicate the virtual link in sIt has been mapped to physical link, has been otherwise 0.
VNF example needs the memory source in CPU computing resource and Cloud Server.It needs to consider load balancing, because of dimension Load balancing between shield server and link to avoid flow congestion and can further increase network cost efficiency.Therefore, originally Inventive embodiments propose two load balancing factor ΦvAnd Θij, it is used to indicate the load condition of network, their value and money Source utilization rate has positive correlation.ΦvIt calculates as follows:
Wherein α111It is positive parameter, for Φ in Setup Cost calculating processvValue.ΦvIt is UvLinear function also It is whether exponential function depends on UvRange.UvThe weighted sum for indicating CPU and memory usage, is calculated by following formula:
Wherein, epAnd emRepresent the weight of CPU and memory usage, ep+em=1.The relevant cost of server resource by Following formula calculates:
The unit price of CPU and memory source uses c respectively1And c2It indicates.Next consider the forwarding cost in flow routing, bear Carry balance factor ΘijIt calculates as follows:
Wherein α222It is positive parameter and is used to adjust ΘijValue.UijIndicate link lijThe use of middle transmission rate Rate, calculation are as follows:
The cost calculation mode of flow forwarding is as follows:
Wherein, c3Indicate the unit price of link transmission rate.(el·Θij+ed·dij/Ds) represent ΘijWeighted sum delay dij, el+ed=1.Cost (link) is consisted of three parts: Θij, fixed delay dijAnd unit price.It can from above-mentioned calculating formula Out, node or link with larger surplus resources have relatively low cost.
The totle drilling cost Cost_total of SFC layout process calculates as follows:
Cost_total=cost (server)+cost (link);
Resource constraint is obtained by following formula:
Deferred constraint is obtained by following formula:
To establish the multi-goal Optimization Model for being intended to improve cost efficiency and guaranteed qos:
min{cost(server)+cost(link)}
A kind of cloud net resource service chain method of combination and system based on intention provided in an embodiment of the present invention, by mentioning For preset northbound interface frame of reference and the SFC cradle based on DRL, and construct a multi-objective optimization question mould Type, to reduce long term service chain layout cost to the maximum extent.
On the basis of the above embodiments, described to solve default multi-goal Optimization Model to minimize service chaining volume Line up this and delay, comprising:
The optimal solution of the multi-goal Optimization Model is obtained based on preset bilayer depth Q network (DDQN) algorithm.
For the multi-goal Optimization Model proposed in above-described embodiment, the embodiment of the present invention devises a bilayer Depth Q network algorithm finds out the optimal solution of multi-goal Optimization Model.
Specifically, optimization problem is expressed as Markovian decision process { ST, A, Rd, P } by the embodiment of the present invention, Middle ST indicates that state space, A indicate motion space, and Rd is defined as reward function, and P is state transition probability.It is defined as follows:
State space are as follows: each agency is sometime having corresponding layout scheme.The state is defined as owning The satisfaction degree of the qos requirement of SFC, and calculated by following formula:
ST={ st1,sts,...,stK};
Wherein, sts={ 0,1 }, K are the quantity of SFC.sts=1 indicates that the delay of SFC is wanted under current layout scheme Asking can be met.Otherwise, sts=0.Stateful quantity be 2K
Motion space are as follows: between two states of SFC conversion indicate by take action to change VNF placement or Routing.Behavior aggregate A is defined as follows shown in formula:
Wherein, X is designed to the actions available collection that VNF is placed in SFC operation flow.It, can be in addition, if providing X The routing between VNF is obtained by shortest path first.
Y is designed to the actions available collection of the routing of the flow between VNF.Therefore, the operation that VNF is placed and flow routes Space representation is A={ X, Y }.The movement number of s is 2M+H
Reward are as follows: if agent takes certain action, state stsTransfer to new state st's.Acting on behalf of s can also be with Rd is rewarded in acquisition immediatelys(as, st, st'), it is defined as stsIt is transferred to st'sWhen reduce cost.
Rds(as, st, st') and=cost (sts)-cost(st's);
Wherein cost (sts) and cost (st's) represent state stsAnd st'sLayout expense and.By accumulating long-term prize Encourage Rds(as, st, st') and highest cost efficiency may be implemented, according to current state, tactful π can obtain SFC and will take Corresponding action.Optimizing behavior isQs(st a) is defined as state-function of movement, and indicates specified shape The expected accumulation discount reward of state-movement.Qs(st a) is indicated are as follows:
Wherein, γ is discount factor, indicates importance of the following reward in study.It, can be with according to Bellman equation It obtains as follows optimal
Ps(as, st, st') and it indicates from state st to the transition probability of state st'.Therefore, it can be obtained most based on above formula Dominant strategyAnd it indicates are as follows:
In practice, it is generally difficult to obtain accurate transition probability.Therefore, Q study be designed to based on available information with Iterative manner finds optimal solution, and it updates Q value function using following equation:
Wherein, δ is learning efficiency, influences Qs(st, turnover rate a).
It is understood that Q study completes iteration based on Q value table, so if state and motion space are very big, then It is difficult to obtain optimal solution.In order to overcome this weakness, depth Q network (DQN) provided in an embodiment of the present invention passes through depth nerve Network (DNN) rather than the approximate Q value function of Q value table.DNN can be considered as the depth map with multiple process layers.θ is indicated The weight of these layers, and declined by gradient and updated.The approximation for the value function that DQN is used is calculated by following formula:
Qs(st,a,θ)≈Qs(st,a)。
In addition to this, DQN is reset with pinpoint target network using experience and eliminates data dependency.Define target network To be based on weight θ-Calculate target Q value.θ and θ-Between difference be that θ updates in each iteration, but θ-In fixed number of times It is updated in iteration.Target Q value function is given by:
The loss function of DQN is defined as mean square error, is calculated by following formula:
L (θ)=E [(Target_Qs-Qs(st,a,θ))2];
In each iteration, need to update weight θ according to gradientMinimize loss function.Renewal function by Following formula calculates:
It is emphasized that DQN and Q study all carries out calculating Target_Q using maximal functions, which results in over-evaluate Problem.As an improvement, DDQN finds the corresponding actions with maximum Q value in current network first, rather than directly in target The maximum Q value of everything in network:
amax(st', θ)=argmaxa'∈AQs(st',a',θ);
Then selected movement a is usedmax(st', θ) rewrites Target_Qs.New Target_Q in DDQNsBy following formula meter It calculates:
Similarly, L (θ) and θ ' is also required to update together in DDQN.
Specifically, bilayer depth Q network algorithm provided by the embodiment of the present invention may include two parts, first part For operation flow initialization, operation flow initialization be may comprise steps of:
1, from σf,sRandomly choose f ∈ FsFeasible placement schemesσf,sIt indicates for f ∈ FsFeasible cloud service Device collection.
2, the routing plan between VNF is obtained by shortest path first.
3, the layout expense of all SFC is calculated.
Second part is the arranging service using DDQN network, which may comprise steps of:
1, state space is initialized to ST={ st1,sts,...,stK}。
2, for SFCs, it is by stsQ value network Q is added to as inputs(sts,a,θ)。
3, it is acted by ε-greedy strategy acquisition.The strategy is respectively with probability ε and 1- ε selection random action and best Movement.
4, the conversion of all SFC is stored in experience and resets in memory.
5, each agency samples (st, a, Rd, st') from ER, and according to whether calculates its target Q value for end-state.
6, they execute gradient decline step (Target_Q relative to the θ of Q value networks-Qs(st,a,θ))2.In every ufθ After step, θ is replaced by θ.
If 7, { 1,1 ..., 1 } current state ST=, training will be terminated.
In summary the DDQN algorithm that process can be seen that design of the embodiment of the present invention can obtain multiple-objection optimization and ask The optimal solution of model is inscribed, and the algorithm has better cost efficiency and convergence, and can require with guaranteed qos, makes flow It is balanced.
In order to verify the performance of proposed method of the embodiment of the present invention, the embodiment of the present invention has carried out emulation experiment.Specifically , the embodiment of the present invention utilizes the cloud net being made of 30 nodes (10 Cloud Servers and 20 interchangers) and 50 links Network emulates mentioned algorithm.The maximum data transfer rate of link is fixed as 1Gbps.The transmission delay of link is 1- 3ms.The CPU and memory source of server are respectively set to 32 and 100-200GB.Each SFC needs 2-4 VNF, data Transmission rate is 20-50Mbps.Each VNF needs 2-4 CPU and 5-10GB memory source, and processing delay is 2-5ms. The structure of DNN includes the hidden layer of three neural networks being fully connected, and has 64,32,32 neurons.Mainly from algorithm Constringency performance optimizes performance these two aspects to be emulated.
Assessing constringency performance under different learning rates first: Fig. 2 is different learning rates provided in an embodiment of the present invention Under train epochs schematic diagram, as shown in Fig. 2, tool there are three types of learning rate DDQN algorithm when episode starts have it is huge Big train epochs.Train epochs tend to decline with the increase of episode, and which reflects the good convergence performances of DDQN. On the other hand, learning rate is the key factor of constringency performance.By taking EP=90 as an example, the DDQN of δ=0.001 needs 92 training Step.As a comparison, δ=0.01 and the algorithm of δ=0.1 only need 40 and 26 steps to be obtained with optimal solution.
Then the constringency performance under more different nitrification enhancements, Fig. 3 are different calculations provided in an embodiment of the present invention again Train epochs schematic diagram under method, as shown in Figure 3.As can be seen that Q study has lower convergence at different episode Can, because measure needed for it eliminates data dependency is less.On the contrary, DQN and DDQN establish experience playback and pinpoint target Network solves data dependence, therefore their training step learns less than Q.With Q-learning, the instruction of DQN, DDQN For practicing step, respectively 51,32,26.Therefore, compared with DQN, solve the problems, such as to over-evaluate be also DDQN advantage.
Further, the embodiment of the present invention assesses average retardation as a comparison with following two algorithm, total reward and negative Carry equilibrium state.QoS driving placement algorithm (QPA): it obtains the end-to-end path of SFC first, then expand to path it On, to minimize cost and delay, while meeting resource requirement.Random fit Placement (RPA): the placement of VNF is with random The form of fitting executes, to consider to meet the constrained all schemes of institute, and randomly chooses one of them, then also random Select path therein.Fig. 4 is the average delay schematic diagram of algorithms of different provided in an embodiment of the present invention, as shown in figure 4, by It is seldom in the quantity of SFC request, therefore the average retardation of four kinds of algorithms is lower when starting.With the increase of SFC, delay with Different amplitudes and increase.When SFC quantity is 200, the average retardation of DDQN, DQN, QPA and RPA are respectively 38ms, 43ms, 48ms, 56ms.Due to the randomness of RPA, the delay performance of RPA is poor.Although DQN and QPA examine delay minimization Including worry, but they have ignored the influence of load balance, this may cause network congestion.In contrast, it is asked in different SFC It asks under quantity, DDQN has better delay performance.
Fig. 5 is the totle drilling cost schematic diagram of algorithms of different provided in an embodiment of the present invention, as shown in figure 5, in different number Under SFC, the totle drilling cost of DDQN, DQN, QPA are always lower than RPA, because they all consider the cost minimization of objective function.Example Such as, when SFC quantity is 300, cost the ratio QPA, DQN, DDQN high 16% of RPA, 20%, 27%.In these three algorithms, DDQN has optimal cost efficiency, over-evaluates problem because it overcomes and lays particular emphasis on load balance, QPA and DQN have ignored this A bit.Therefore, DDQN can obtain optimal delay and cost in SFC layout process.
For load balancing state, by taking SFCs=300 as an example, the variance ratio RPA, SPA of DDQN junctor usage and DQN low 62%, 55%, 41%.Equally, the variation of DDQN server utilization rate is also lower than 81%, 65%, 48%.It is compiled in SFC It arranges in Optimized model, ΦvAnd ΘijDesigned for maintenance network balance, so that DDQN is it is possible to prevente effectively from network congestion.
Fig. 6 is a kind of cloud net resource service chain arranging system structural representation based on intention provided in an embodiment of the present invention Figure, as shown in Figure 6, comprising: service module 601 and layout adjust module 602, in which:
Service module 601 is used to be based on preset northbound interface frame of reference, provides end-to-end service to cloud net resource;
The service chaining cradle that layout adjustment module 602 is used to learn based on deeply, to the end-to-end service It carries out online layout and dynamic adjusts, wherein in the online layout and dynamic adjustment, solve default multiple-objection optimization and ask Model is inscribed to minimize service chaining layout cost and delay.
Specifically how can be used for executing by service module 601 and layout adjustment module 602 shown in FIG. 1 based on intention Cloud net resource service chain method of combination embodiment technical solution, it is similar that the realization principle and technical effect are similar, no longer superfluous herein It states.
A kind of cloud net resource service chain arranging system based on intention provided in an embodiment of the present invention, it is default by providing Northbound interface frame of reference and SFC cradle based on DRL, and a multi-goal Optimization Model is constructed, with most Reduce to limits long term service chain layout cost.
On the basis of the above embodiments, the multi-goal Optimization Model indicates are as follows:
min{cost(server)+cost(link)}
Wherein, cost (server) is the cost of the relevant cost of server resource, the forwarding of cost (link) flow, C1, C2,C3,C4,C5,C6,C7For resource constraint.
On the basis of the above embodiments, the layout adjustment module includes:
DDQN unit, for obtaining the multi-goal Optimization Model based on the preset double-deck depth Q network algorithm Optimal solution.
On the basis of the above embodiments, the DDQN unit includes:
Initialization section, for being initialized to operation flow;
Arranging service part, for carrying out industry to the operation flow after initialization based on the preset double-deck depth Q network Business layout.
On the basis of the above embodiments, the initialization section is specifically used for:
The target protocol for concentrating random selection to meet the requirements from Cloud Server;
Based on shortest path selection algorithm, the target routing plan between VNF is determined;
Calculate the layout expense of all service chainings.
On the basis of the above embodiments, the arranging service part is specifically used for:
To the double-deck depth Q network inputs state after state space initialization;
It obtains the corresponding movement of input state and calculates target Q value;
Input state is updated based on gradient descent method until reaching preset termination condition.
The embodiment of the present invention provides a kind of electronic equipment, comprising: at least one processor;And it is logical with the processor Believe at least one processor of connection, in which:
Fig. 7 is the structural block diagram of electronic equipment provided in an embodiment of the present invention, referring to Fig. 7, the electronic equipment, comprising: Processor (processor) 701, communication interface (Communications Interface) 702, memory (memory) 703 With bus 704, wherein processor 701, communication interface 702, memory 703 complete mutual communication by bus 704.Place Reason device 701 can call the logical order in memory 703, to execute following method: be referred to based on preset northbound interface Framework provides end-to-end service to cloud net resource;Based on the service chaining cradle of deeply study, to described end-to-end Service carries out online layout and dynamic adjusts, wherein in the online layout and dynamic adjustment, it is excellent to solve default multiple target Change problem model to minimize service chaining layout cost and delay.
The embodiment of the present invention discloses a kind of computer program product, and the computer program product is non-temporary including being stored in Computer program on state computer readable storage medium, the computer program include program instruction, when described program instructs When being computer-executed, computer is able to carry out method provided by above-mentioned each method embodiment, for example, based on default Northbound interface frame of reference, to cloud net resource provide end-to-end service;Service chaining layout frame based on deeply study Frame carries out online layout to the end-to-end service and dynamic adjusts, wherein in the online layout and dynamic adjustment, asks Default multi-goal Optimization Model is solved to minimize service chaining layout cost and delay.
The embodiment of the present invention provides a kind of non-transient computer readable storage medium, and the non-transient computer is readable to deposit Storage media stores computer instruction, and the computer instruction executes the computer provided by above-mentioned each method embodiment Method, for example, be based on preset northbound interface frame of reference, provide end-to-end service to cloud net resource;It is strong based on depth The service chaining cradle that chemistry is practised carries out online layout to the end-to-end service and dynamic adjusts, wherein it is described In line layout and dynamic adjustment, default multi-goal Optimization Model is solved to minimize service chaining layout cost and delay.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment It can realize by means of software and necessary general hardware platform, naturally it is also possible to pass through hardware.Based on such reason Solution, substantially the part that contributes to existing technology can embody above-mentioned technical proposal in the form of software products in other words Out, which may be stored in a computer readable storage medium, such as ROM/RAM, magnetic disk, CD, It uses including some instructions so that a computer equipment (can be personal computer, server or the network equipment etc.) is held Method described in certain parts of each embodiment of row or embodiment.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;To the greatest extent Present invention has been described in detail with reference to the aforementioned embodiments for pipe, those skilled in the art should understand that: it is still It is possible to modify the technical solutions described in the foregoing embodiments, or part of technical characteristic is equally replaced It changes;And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution Spirit and scope.

Claims (9)

1. a kind of cloud net resource service chain method of combination based on intention characterized by comprising
Based on preset northbound interface frame of reference, end-to-end service is provided to cloud net resource;
Based on the service chaining cradle of deeply study, online layout is carried out to the end-to-end service and dynamic adjusts, Wherein, in the online layout and dynamic adjustment, default multi-goal Optimization Model is solved to minimize service chaining layout Cost and delay.
2. the cloud net resource service chain method of combination according to claim 1 based on intention, which is characterized in that more mesh Mark optimization problem model is expressed as:
min{cost(server)+cost(link)}
Wherein, cost (server) is the cost of the relevant cost of server resource, the forwarding of cost (link) flow, C1,C2,C3, C4,C5,C6,C7For resource constraint.
3. the cloud net resource service chain method of combination according to claim 1 based on intention, which is characterized in that the solution Multi-goal Optimization Model is preset to minimize service chaining layout cost and delay, comprising:
The optimal solution of the multi-goal Optimization Model is obtained based on the preset double-deck depth Q network algorithm.
4. the cloud net resource service chain method of combination according to claim 3 based on intention, which is characterized in that described to be based on Preset bilayer depth Q network algorithm obtains the optimal solution of the multi-goal Optimization Model, comprising:
Operation flow is initialized;
Based on the preset double-deck depth Q network, arranging service is carried out to the operation flow after initialization.
5. the cloud net resource service chain method of combination according to claim 4 based on intention, which is characterized in that described to industry Business process is initialized, comprising:
The target protocol for concentrating random selection to meet the requirements from Cloud Server;
Based on shortest path selection algorithm, the target routing plan between VNF is determined;
Calculate the layout expense of all service chainings.
6. the cloud net resource service chain method of combination according to claim 4 based on intention, which is characterized in that described to be based on Preset bilayer depth Q network, carries out arranging service to the operation flow after initialization, comprising:
To the double-deck depth Q network inputs state after state space initialization;
It obtains the corresponding movement of input state and calculates target Q value;
Input state is updated based on gradient descent method until reaching preset termination condition.
7. a kind of cloud net resource service chain arranging system based on intention characterized by comprising
Service module provides end-to-end service to cloud net resource for being based on preset northbound interface frame of reference;
Layout adjusts module, and the service chaining cradle for being learnt based on deeply carries out the end-to-end service Line layout and dynamic adjust, wherein in the online layout and dynamic adjustment, solve default multi-goal Optimization Model with Minimize service chaining layout cost and delay.
8. a kind of electronic equipment including memory, processor and stores the calculating that can be run on a memory and on a processor Machine program, which is characterized in that the processor is realized as described in any one of claim 1 to 6 when executing described program based on meaning The step of cloud net resource service chain method of combination of figure.
9. a kind of non-transient computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer journey The cloud net resource service chain method of combination as described in any one of claim 1 to 6 based on intention is realized when sequence is executed by processor The step of.
CN201910461367.6A 2019-05-30 2019-05-30 Intent-based cloud network resource service chain arranging method and system Active CN110247795B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910461367.6A CN110247795B (en) 2019-05-30 2019-05-30 Intent-based cloud network resource service chain arranging method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910461367.6A CN110247795B (en) 2019-05-30 2019-05-30 Intent-based cloud network resource service chain arranging method and system

Publications (2)

Publication Number Publication Date
CN110247795A true CN110247795A (en) 2019-09-17
CN110247795B CN110247795B (en) 2020-09-25

Family

ID=67885322

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910461367.6A Active CN110247795B (en) 2019-05-30 2019-05-30 Intent-based cloud network resource service chain arranging method and system

Country Status (1)

Country Link
CN (1) CN110247795B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111865686A (en) * 2020-07-20 2020-10-30 北京百度网讯科技有限公司 Cloud product capacity expansion method, device, equipment and storage medium
CN112040011A (en) * 2020-06-30 2020-12-04 北京来也网络科技有限公司 Simulation training method and device based on RPA and AI, computing equipment and medium
CN112583629A (en) * 2019-09-30 2021-03-30 华为技术有限公司 Information processing method, related equipment and computer storage medium
CN112637064A (en) * 2020-12-31 2021-04-09 广东电网有限责任公司电力调度控制中心 Service chain arranging method based on improved depth-first search algorithm
CN114827284A (en) * 2022-04-21 2022-07-29 中国电子技术标准化研究院 Service function chain arrangement method and device in industrial Internet of things and federal learning system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107438983A (en) * 2014-09-22 2017-12-05 沃尔廷控股公司 Determine software defined network, storage and the physics and the compiler and method of virtual resource that calculate
US20180123974A1 (en) * 2015-09-28 2018-05-03 Centurylink Intellectual Property Llc Intent-Based Services Orchestration
CN108536144A (en) * 2018-04-10 2018-09-14 上海理工大学 A kind of paths planning method of fusion dense convolutional network and competition framework
CN108600101A (en) * 2018-03-21 2018-09-28 北京交通大学 A kind of network for the optimization of end-to-end time delay performance services cross-domain method of combination
CN108900419A (en) * 2018-08-17 2018-11-27 北京邮电大学 Route decision method and device based on deeply study under SDN framework
CN109245916A (en) * 2018-08-15 2019-01-18 西安电子科技大学 A kind of the cloud access net system and method for intention driving
CN109358971A (en) * 2018-10-30 2019-02-19 电子科技大学 Quick and load balancing service function chain dispositions method in dynamic network environment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107438983A (en) * 2014-09-22 2017-12-05 沃尔廷控股公司 Determine software defined network, storage and the physics and the compiler and method of virtual resource that calculate
US20180123974A1 (en) * 2015-09-28 2018-05-03 Centurylink Intellectual Property Llc Intent-Based Services Orchestration
CN108600101A (en) * 2018-03-21 2018-09-28 北京交通大学 A kind of network for the optimization of end-to-end time delay performance services cross-domain method of combination
CN108536144A (en) * 2018-04-10 2018-09-14 上海理工大学 A kind of paths planning method of fusion dense convolutional network and competition framework
CN109245916A (en) * 2018-08-15 2019-01-18 西安电子科技大学 A kind of the cloud access net system and method for intention driving
CN108900419A (en) * 2018-08-17 2018-11-27 北京邮电大学 Route decision method and device based on deeply study under SDN framework
CN109358971A (en) * 2018-10-30 2019-02-19 电子科技大学 Quick and load balancing service function chain dispositions method in dynamic network environment

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
FRANCO CALLEGATI: "Performance of Intent-based Virtualized Network Infrastructure Management", 《IEEE ICC 2017 SAC SYMPOSIUM SDN & NFV TRACK》 *
HAMED RAHMANI KHEZRI: "Deep Q-Learning for Dynamic Reliability Aware NFV-Based Service Provisioning", 《谷歌学术》 *
NGUYEN CONG LUONG: "Applications of Deep Reinforcement Learning in communication and networking: A Survey", 《IEEE》 *
WENCHEN HE: "Markov Approximation Method for Optimal Service Orchestration in IoT Network", 《IEEE ACCESS》 *
左国玉: "基于动作注意策略的树形DDQN目标候选区域提取方法", 《电子与信息学报》 *
肖逸凯: "基于软件定义网络的虚拟网络功能服务链在线调度技术", 《万方学位论文》 *
马骋乾: "强化学习研究综述", 《指挥控制与仿真》 *
魏亮: "基于强化学习的服务链映射算法,", 《通信学报》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112583629A (en) * 2019-09-30 2021-03-30 华为技术有限公司 Information processing method, related equipment and computer storage medium
CN112583629B (en) * 2019-09-30 2022-06-10 华为技术有限公司 Information processing method, related equipment and computer storage medium
CN112040011A (en) * 2020-06-30 2020-12-04 北京来也网络科技有限公司 Simulation training method and device based on RPA and AI, computing equipment and medium
CN112040011B (en) * 2020-06-30 2023-06-09 北京来也网络科技有限公司 Simulation training method, device, computing equipment and medium based on RPA and AI
CN111865686A (en) * 2020-07-20 2020-10-30 北京百度网讯科技有限公司 Cloud product capacity expansion method, device, equipment and storage medium
CN112637064A (en) * 2020-12-31 2021-04-09 广东电网有限责任公司电力调度控制中心 Service chain arranging method based on improved depth-first search algorithm
CN114827284A (en) * 2022-04-21 2022-07-29 中国电子技术标准化研究院 Service function chain arrangement method and device in industrial Internet of things and federal learning system
CN114827284B (en) * 2022-04-21 2023-10-03 中国电子技术标准化研究院 Service function chain arrangement method and device in industrial Internet of things and federal learning system

Also Published As

Publication number Publication date
CN110247795B (en) 2020-09-25

Similar Documents

Publication Publication Date Title
CN110247795A (en) A kind of cloud net resource service chain method of combination and system based on intention
Liu et al. Dynamic service function chain orchestration for NFV/MEC-enabled IoT networks: A deep reinforcement learning approach
CN110365514B (en) SDN multistage virtual network mapping method and device based on reinforcement learning
CN109768940A (en) The flow allocation method and device of multi-service SDN network
Rjoub et al. Trust-driven reinforcement selection strategy for federated learning on IoT devices
Yuan et al. A Q-learning-based approach for virtual network embedding in data center
CN110022230A (en) The parallel dispositions method of service chaining and device based on deeply study
CN108111335A (en) A kind of method and system dispatched and link virtual network function
Wang et al. Optimal deployment of cloudlets based on cost and latency in Internet of Things networks
CN110198280A (en) A kind of SDN link allocation method based on BP neural network
WO2023216489A1 (en) Computing power network node evaluation and operation method and apparatus
Wang et al. A reinforcement learning approach for online service tree placement in edge computing
Dalgkitsis et al. Dynamic resource aware VNF placement with deep reinforcement learning for 5G networks
Li et al. MEC-based dynamic controller placement in SD-IoV: A deep reinforcement learning approach
Liu et al. Multi-objective optimization service function chain placement algorithm based on reinforcement learning
Yamansavascilar et al. Deepedge: A deep reinforcement learning based task orchestrator for edge computing
Dalgkitsis et al. SCHE2MA: Scalable, energy-aware, multidomain orchestration for beyond-5G URLLC services
CN113543160B (en) 5G slice resource allocation method, device, computing equipment and computer storage medium
CN110233763B (en) Virtual network embedding algorithm based on time sequence difference learning
Faraji-Mehmandar et al. A self-learning approach for proactive resource and service provisioning in fog environment
CN115225512B (en) Multi-domain service chain active reconfiguration mechanism based on node load prediction
Rădulescu et al. Analysing congestion problems in multi-agent reinforcement learning
CN113596138B (en) Heterogeneous information center network cache allocation method based on deep reinforcement learning
Jin et al. Resource management of cloud-enabled systems using model-free reinforcement learning
Xia et al. Learn to optimize: Adaptive VNF provisioning in mobile edge clouds

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant