CN112543119A - Service function chain reliability deployment method based on deep reinforcement learning - Google Patents

Service function chain reliability deployment method based on deep reinforcement learning Download PDF

Info

Publication number
CN112543119A
CN112543119A CN202011359654.5A CN202011359654A CN112543119A CN 112543119 A CN112543119 A CN 112543119A CN 202011359654 A CN202011359654 A CN 202011359654A CN 112543119 A CN112543119 A CN 112543119A
Authority
CN
China
Prior art keywords
service function
function chain
node
deployment
reinforcement learning
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011359654.5A
Other languages
Chinese (zh)
Other versions
CN112543119B (en
Inventor
王珂
曲桦
赵季红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian Jiaotong University
Original Assignee
Xian Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian Jiaotong University filed Critical Xian Jiaotong University
Priority to CN202011359654.5A priority Critical patent/CN112543119B/en
Publication of CN112543119A publication Critical patent/CN112543119A/en
Application granted granted Critical
Publication of CN112543119B publication Critical patent/CN112543119B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/0803Configuration setting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a service function chain reliability deployment method based on deep reinforcement learning, and belongs to the field of communication technology and machine learning. The method can realize the fault-tolerant service function chain deployment by respectively solving the two problems of priority perception and reliability deployment of the service function chain, and adapts to the complex and dynamic service function chain deployment request by utilizing the advantage of deep reinforcement learning. The method selects the backup scheme of the virtual network function on the service function chain by dynamically sensing the priority of the service function chain, and simultaneously determines the deployment node and the backup node of the virtual network function in the service function chain by utilizing deep reinforcement learning, thereby realizing the reliable deployment of the service function chain. The method and the device consider the reliability of the service function chain, improve the effective utilization rate of network resources, consider the transmission delay and the load balance of the nodes and the links, and are suitable for the dynamic complex service function chain deployment request scene.

Description

Service function chain reliability deployment method based on deep reinforcement learning
Technical Field
The invention belongs to the field of communication technology and machine learning, and relates to a service function chain reliability deployment method based on deep reinforcement learning.
Background
With the advent of the 5G era, the number and diversity of network service requests has increased dramatically, creating a significant challenge for the reliable deployment of service function chains. In the operator network of today, a large number of network functions, such as firewalls, NATs, DPIs, etc., are distributed and deployed on different server nodes, and the network functions are combined in a certain order, and this Function sequence is called a Service Function Chain (SFC).
A traditional service function chain deployment manner relies on a routing policy formulated by a service operator, so that traffic flows pass through a plurality of virtual network functions in a certain order to provide a required network service. This approach has certain drawbacks: the reliability deployment requirement of a service function chain is difficult to meet due to the lack of certain fault tolerance measures; the resource demand of different service function chains is different, so that the priority is different, and a single strategy is difficult to meet all the deployment requirements; the virtual network function is pre-deployed on the server node, and is difficult to meet the dynamic complex service function chain deployment requirement, and the like. With the rapid development of Software Defined Networking (SDN) and Network Function Virtualization (NFV) technologies, a service Function chain based on an SDN/NFV fusion architecture receives a lot of attention. The SDN/NFV network joint architecture takes an SDN controller as a control plane of a network and NFV as a novel universal infrastructure as a data plane, combines the advantages of control and forwarding separation, centralized control, network resource abstraction, network programmability and the like, can get rid of the locking of deeply entrenched manufacturers, and flexibly and quickly provides an elastic network as required so as to better meet diversified application scenes and differentiated service quality requirements.
Aiming at the problem of dynamic reliability deployment of service function chains, the existing deployment methods have some defects. For example, the chinese patent invention "CN 109586982B" proposes a virtual network function backup method, which determines the backup requirement of each function node by determining the evaluation value of each function node in the deployed service function chain. However, the method is difficult to perform real-time function node backup on a dynamically arrived service function chain deployment request, and does not perform priority division on the service function chain, which is easy to cause network resource waste. The Chinese invention patent "CN 110460465A" proposes a service function chain deployment method facing mobile edge calculation, which adopts Q-learning algorithm in reinforcement learning to deploy. However, the method needs to maintain a huge Q-value table, which wastes a lot of CPU performance and cannot accurately determine the unknown conditions in the sample. The chinese invention patent "CN 111669291A" proposes a virtualized network service function chain deployment method based on deep reinforcement learning, which realizes service function chain deployment with minimum cost by respectively solving two problems of virtual function placement and traffic routing. However, the method does not consider the possible failure condition of the functional node, and the fault tolerance and the reliable deployment of the service function chain are difficult to ensure.
Therefore, an effective solution to the problem of reliable deployment of dynamically complex service function chains is still lacking.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention aims to provide a service function chain reliability deployment method based on deep reinforcement learning, which solves the problem of reliability deployment of a dynamic complex service function chain in the context of network function virtualization.
In order to achieve the purpose, the invention adopts the following technical scheme to realize the purpose:
the invention discloses a service function chain reliability deployment method based on deep reinforcement learning, which comprises the following steps:
1) processing a service function chain deployment request;
2) the method comprises the steps of carrying out priority division on a service function chain, and selecting a backup scheme of a virtual network function in the corresponding service function chain according to the priority of the service function chain;
3) and determining a deployment node and a backup node of the virtual network function in the service function chain by applying deep reinforcement learning, and realizing the reliable deployment of the service function chain.
Preferably, in the step 2), the service function chain is prioritized, and the method specifically includes the following steps:
a) respectively calculating the total CPU requirement r of each virtual network function in the service function chaincpuAnd the total bandwidth requirement r of each virtual linkbw
b) Calculating rcpuTaking the weight of the largest possible demand, r is calculatedbwWeight to the maximum possible demand;
c) according to rcpuThe weight sum r which accounts for the largest possible demandbwCalculating the average weight of the weight occupying the maximum possible demand;
d) when the average weight obtained in the step c) is judged to be more than 2/3, dividing the service function chain into a service function chain with high priority;
when the average weight obtained in the step c) is judged to be less than 1/3, dividing the service function chain into service function chains with low priority;
otherwise, the service function chain with the medium priority is divided.
It is further preferred that each virtual network function in a service function chain is backed up when classified as a high priority service function chain.
Further preferably, when a service function chain of a medium priority is divided, a node to be deployed is selected for a virtual network function in the service function chain, and importance of the selected node to be deployed is divided;
when the nodes to be deployed with high importance are divided, backing up virtual network functions in corresponding service function chains when the nodes to be deployed are divided;
otherwise, the virtual network function in the corresponding service function chain is not backed up when the node is to be deployed.
Preferably, the importance degree division is performed on the selected nodes to be deployed, and the method specifically includes the following steps:
respectively calculating the node degree and the CPU occupation number of the selected nodes to be deployed;
respectively calculating the weight of the node degree of the selected node to be deployed in the maximum node degree and the weight of the CPU occupation number in the total CPU number;
calculating average weight according to the weight of the node degree in the maximum node degree and the weight of the CPU occupation number in the total CPU number;
and fourthly, when the average weight obtained in the step III of judging is more than 0.5, the high importance degree is divided.
It is further preferred that virtual network functions in a service function chain are not backed up when classified as a low priority service function chain.
Preferably, in step 3), the deployment of the virtual network function in the service function chain by applying deep reinforcement learning specifically includes the following steps:
i) setting deep reinforcement learning parameters comprising a state set, an action set and a feedback function;
wherein the state set is S (t) ═ Cr,Br,Finfo);
In the formula, CrRepresenting the amount of CPU resource remaining in each node, BrRepresenting the amount of remaining bandwidth resources of each link, FinfoRepresenting the type and deployment position of the last virtual network function;
the action set is A (t) ═ Vd,Vb);
In the formula, VdLocation of deployment node, V, representing virtual network functionbA backup node location representing a virtual network function;
the feedback function is
Figure BDA0002803653460000041
In the formula, L represents the maximum transmission time delay between adjacent virtual network functions, C represents the maximum CPU resource occupation amount of all nodes, B represents the maximum bandwidth resource occupation amount of all links, and alpha, beta and gamma are weights;
ii) initializing a neural network and an experience replay set D;
iii) selecting the node with the minimum load to deploy or backup the first virtual network function in the service function chain to obtain the current Cr、BrAnd FinfoInformation and setting an initial state S (t);
iv) selecting deployment and backup node pairs (V) for the current state S (t) according to an epsilon-greedy methodd,Vb) After executing the action, obtain updated Cr、BrAnd FinfoInformation and go to the next state S (t + 1);
v) calculating according to the feedback function R (t)Feed back the value R, and apply the four tuples (S (t), (V)d,Vb) S (t +1), R) is stored in the experience playback set D, and S (t) is set as S (t + 1);
vi) updating the neural network parameters by gradient back propagation according to the samples in D.
Further preferably, in step iv), if the node pair (V) is not reservedd,Vb) If yes, backup node VbIs set to empty.
Further preferably, after step vi) is completed, if it is determined that the last virtual network function is performed, the deep reinforcement learning is ended; otherwise, re-executing steps iv) -vi).
Preferably, in step 1), the service function chain reliability deployment request includes a virtual network function type, a virtual network function sequence, a CPU resource requirement of the virtual network function, and a bandwidth resource requirement of the virtual link.
Preferably, after step 3) is completed, if it is determined that the last service function chain reliability deployment request is received, the reliability deployment method is ended; otherwise, re-executing the step 1) to the step 3).
Compared with the prior art, the invention has the following beneficial effects:
the invention discloses a service function chain reliability deployment method based on deep reinforcement learning, which determines the priority of a service function chain resource demand and determines a backup scheme of a virtual network function by dynamically sensing the service function chain resource demand, and aims to improve the reliability of the service function chain, improve the effective utilization rate of network resources, reduce transmission delay and improve the load balance of nodes and links. Meanwhile, the deployment node and the backup node are determined simultaneously by utilizing the deep reinforcement learning, and the advantages of the deep reinforcement learning are utilized to adapt to the complex and dynamic service function chain deployment request. Therefore, the method realizes the service function chain deployment with fault tolerance by respectively solving two problems of priority perception and reliability deployment of the service function chain while realizing the reliability service function chain deployment facing to the priority perception, and realizes the dynamic deployment request of the service function chain by utilizing a deep reinforcement learning method.
Furthermore, the method determines the priority of the service function chain by dynamically sensing the CPU and bandwidth requirements of the service function chain, so that a backup scheme of the virtual network function on the service function chain is selected, the survivability of the service function chain with higher priority can be improved, the overall deployment reliability of the service function chain is further improved, and the condition of large-scale failure of the service function chain caused by physical network faults is prevented.
Furthermore, the invention determines the importance of the node by sensing the node degree of the node and the CPU resource occupation number, so as to select the backup scheme of the virtual network function on the service function chain, thereby improving the effective utilization rate of the network resource, and avoiding the resource waste caused by the fact that the non-important virtual network function on the service function chain with the medium priority occupies the node backup resource.
In conclusion, the invention realizes the purposes of improving the reliability of the service function chain, improving the effective utilization rate of network resources, reducing transmission delay, improving the load balance of nodes and links, and is suitable for the dynamic and complex service function chain deployment request scene.
Drawings
FIG. 1 is a general flow chart of the algorithm of the present invention;
FIG. 2 is a flow chart of a service function chain priority awareness algorithm of the present invention;
FIG. 3 is a flowchart of an importance degree division algorithm for nodes to be deployed according to the present invention;
FIG. 4 is a flowchart of an algorithm of a deep reinforcement learning deployment module of the present invention;
FIG. 5 is a service function chain deployment diagram of one embodiment of the invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
The invention is described in detail below with reference to the following figures and specific embodiments:
referring to fig. 1, it can be seen that the method for deploying reliability based on Deep Reinforcement Learning (DRL) in a priority-aware Service Function Chain (SFC) disclosed in the present invention includes the following steps:
s1, the service orchestrator responds to the service function chain deployment request;
s2, carrying out priority division on the service function chain, selecting a corresponding Virtual Network Function (VNF) backup scheme according to the priority, if the service function chain is of a medium priority, executing S3, and if the service function chain is of a medium priority, executing S4;
s3, selecting nodes to be deployed according to the virtual network function in the service function chain, and dividing the importance of the nodes to be deployed;
s4, applying deep reinforcement learning to perform reliable deployment on the virtual network function in the service function chain;
and S5, the service orchestrator judges whether the current service function chain is the last service function chain reliability deployment request, if so, the service function chain reliability deployment request is ended, and if not, the service orchestrator re-executes S1.
Further, the service function chain deployment request in step S1 includes a virtual network function type, a virtual network function sequence, a CPU resource requirement of the virtual network function, a bandwidth resource requirement of the virtual link, and the like.
Referring to fig. 2, further, the step S2 includes:
s2.1, respectively calculating the total CPU requirement r of each virtual network function in the service function chaincpuAnd the total bandwidth requirement r of each virtual linkbw
S2.2, respectively calculating rcpu、rbwWeight to the maximum possible demand;
s2.3, judging whether the average weight of the two weights obtained in the S2.2 is larger than 2/3, if so, outputting the service function chain with high priority and ending;
and S2.4, judging whether the average weight is smaller than 1/3, if so, outputting the service function chain with low priority and ending, otherwise, outputting the service function chain with medium priority and executing S3.
Referring to fig. 3, further, when the service function chain is divided into service function chains with medium priority, a node to be deployed is selected for a virtual network function in the service function chain, and importance of the selected node to be deployed is divided, where the step S3 includes:
s3.1, respectively calculating the node degree and the CPU occupation number of the selected nodes to be deployed;
s3.2, respectively calculating the weight of the node degree of the selected node to be deployed in the maximum node degree and the weight of the CPU occupation number in the total CPU number;
and S3.3, judging whether the average weight of the weights obtained in the S3.2 is greater than 0.5 or not, if so, judging that the output node is high in importance and ending, and otherwise, judging that the output node is low in importance and ending.
Referring to fig. 4, further, the step S4 includes:
s4.1, setting deep reinforcement learning parameters:
the method comprises the following steps: the state set S is denoted as S (t) ═ Cr,Br,Finfo);
The action set a is represented by a (t) ═ Vd,Vb);
The feedback function R is expressed as
Figure BDA0002803653460000081
In the formula, CrRepresenting the amount of CPU resource remaining in each node, BrRepresenting the amount of remaining bandwidth resources of each link, FinfoRepresenting the type and deployment location, V, of the last VNFdDeployment node location, V, on behalf of VNFbRepresenting the backup node position of the VNF, L representing the maximum transmission time delay between adjacent VNFs, C representing the maximum CPU resource occupation amount of all nodes, B representing the maximum bandwidth resource occupation amount of all links, and alpha, beta and gamma are weights;
s4.2, initializing a neural network and an experience playback set D;
s4.3, selecting the node with the minimum load to deploy or backup the first virtual network function in the service function chain to obtain the current Cr、BrAnd FinfoInformation and setting an initial state S (t);
s4.4, selecting deployment and backup node pairs (V) for the current state S (t) according to an epsilon-greedy methodd,Vb) (if not, backup node VbSet to null), get updated C after performing the actionr、BrAnd FinfoInformation and go to the next state S (t + 1);
s4.5, calculating the current feedback value R according to the feedback function, and combining the four tuples (S (t), (A)a,Ab) S (t +1), R) is stored in the experience playback set D, and S (t) is set as S (t + 1);
s4.6, updating the neural network parameters through gradient back propagation according to the samples in the D;
and S4.7, judging whether the current VNF is the last VNF, if so, ending, and otherwise, executing S4.4 again.
The following detailed description of the present invention will be made with reference to the accompanying drawings and examples.
In this section, a detailed description will be given of a reliability deployment method based on deep reinforcement learning in a priority-aware SFC with reference to the above drawings, where the types and resource requirements of virtual network functions and virtual links are shown in table 1:
TABLE 1 resource requirement settings
Figure BDA0002803653460000091
As shown in fig. 5, the present invention is illustrated in one embodiment:
a service function chain deployment request consists of three virtual network functions of a firewall, DPI and load balancing, the function sequence is firewall-DPI-load balancing, and a service orchestrator responds to the request.
The CPU and bandwidth requirements of the service function chain are calculated to be 6 units and 3 units respectively, the weights of the CPU and the bandwidth requirements occupying the maximum requirements are calculated to be 0.6 and 0.3 respectively, the average weight is 0.45, and the output service function chain is of medium priority.
The node degrees and the CPU occupation numbers of the nodes A, B, C to be deployed with the three virtual network functions are obtained through calculation, and the importance weight is obtained through calculation, in this example, it is assumed that the node B is a high importance node, and the node A, C is a low importance node, so that the DPI network function to be deployed on the node B needs to be backed up.
And sequentially deploying the firewall to the node A according to the deep reinforcement learning deployment module, deploying the DPI to the node B and backing up to the node D, and then deploying the load to the node C in a balanced manner, wherein the current service function chain reliability deployment request is processed.
And then the service orchestrator judges whether the current service function chain is the reliable deployment request of the last service function chain, if so, the service orchestrator finishes the operation, and if not, the service orchestrator continues to process the reliable deployment requests of the subsequent service function chains in sequence.
In summary, the present invention relates to a deep reinforcement learning and network function virtualization technology, and in particular, to a reliability deployment method based on deep reinforcement learning in a priority-aware service function chain, which is used for solving a problem of reliability service function chain deployment in a network function virtualization background. The method can realize the fault-tolerant service function chain deployment by respectively solving the two problems of priority perception and reliability deployment of the service function chain, and adapts to the complex and dynamic service function chain deployment request by utilizing the advantage of deep reinforcement learning. The method selects the backup scheme of the virtual network function on the service function chain by dynamically sensing the priority of the service function chain, and simultaneously determines the deployment node and the backup node of the virtual network function in the service function chain by utilizing deep reinforcement learning, thereby realizing the reliable deployment of the service function chain. The priority of the service function chain can be determined by dynamically sensing the CPU and the bandwidth requirements of the service function chain, and the node importance can be determined by sensing the node degree of the node and the CPU resource occupation number. The invention considers the reliability of the service function chain, improves the effective utilization rate of network resources, namely the proportion between resources occupied by the deployment and the backup of the service function chain, simultaneously considers the transmission delay and the load balance of nodes and links, and is suitable for the dynamic complex service function chain deployment request scene.
The above-mentioned contents are only for illustrating the technical idea of the present invention, and the protection scope of the present invention is not limited thereby, and any modification made on the basis of the technical idea of the present invention falls within the protection scope of the claims of the present invention.

Claims (10)

1. A service function chain reliability deployment method based on deep reinforcement learning is characterized by comprising the following steps:
1) processing a service function chain deployment request;
2) the method comprises the steps of carrying out priority division on a service function chain, and selecting a backup scheme of a virtual network function in the corresponding service function chain according to the priority of the service function chain;
3) and determining a deployment node and a backup node of the virtual network function in the service function chain by applying deep reinforcement learning, and realizing the reliable deployment of the service function chain.
2. The method for deploying the service function chain reliability based on the deep reinforcement learning according to claim 1, wherein in the step 2), the service function chain is prioritized, and specifically comprises the following steps:
a) respectively calculateTotal CPU requirement r for each virtual network function in a service function chaincpuAnd the total bandwidth requirement r of each virtual linkbw
b) Calculating rcpuTaking the weight of the largest possible demand, r is calculatedbwWeight to the maximum possible demand;
c) according to rcpuThe weight sum r which accounts for the largest possible demandbwCalculating the average weight of the weight occupying the maximum possible demand;
d) when the average weight obtained in the step c) is judged to be more than 2/3, dividing the service function chain into a service function chain with high priority;
when the average weight obtained in the step c) is judged to be less than 1/3, dividing the service function chain into service function chains with low priority;
otherwise, the service function chain with the medium priority is divided.
3. The deep reinforcement learning-based service function chain reliability deployment method according to claim 2, wherein when a service function chain with a high priority is classified, each virtual network function in the service function chain is backed up.
4. The service function chain reliability deployment method based on deep reinforcement learning according to claim 2, characterized in that when a service function chain of a medium priority is divided, a node to be deployed is selected for a virtual network function in the service function chain, and importance of the selected node to be deployed is divided;
when the nodes to be deployed with high importance are divided, backing up virtual network functions in corresponding service function chains when the nodes to be deployed are divided;
otherwise, the virtual network function in the corresponding service function chain is not backed up when the node is to be deployed.
5. The service function chain reliability deployment method based on deep reinforcement learning according to claim 4, wherein the importance degree division is performed on the selected nodes to be deployed, and the method specifically comprises the following steps:
respectively calculating the node degree and the CPU occupation number of the selected nodes to be deployed;
respectively calculating the weight of the node degree of the selected node to be deployed in the maximum node degree and the weight of the CPU occupation number in the total CPU number;
calculating average weight according to the weight of the node degree in the maximum node degree and the weight of the CPU occupation number in the total CPU number;
and fourthly, when the average weight obtained in the step III of judging is more than 0.5, the high importance degree is divided.
6. The deep reinforcement learning-based service function chain reliability deployment method according to claim 2, wherein when classified as a low-priority service function chain, virtual network functions in the service function chain are not backed up.
7. The method for deploying the service function chain reliability based on the deep reinforcement learning according to claim 1, wherein in the step 3), the deep reinforcement learning is applied to deploy a virtual network function in the service function chain, and specifically includes the following steps:
i) setting deep reinforcement learning parameters comprising a state set, an action set and a feedback function;
wherein the state set is S (t) ═ Cr,Br,Finfo);
In the formula, CrRepresenting the amount of CPU resource remaining in each node, BrRepresenting the amount of remaining bandwidth resources of each link, FinfoRepresenting the type and deployment position of the last virtual network function;
the action set is A (t) ═ Vd,Vb);
In the formula, VdLocation of deployment node, V, representing virtual network functionbA backup node location representing a virtual network function;
the feedback function is
Figure FDA0002803653450000031
In the formula, L represents the maximum transmission time delay between adjacent virtual network functions, C represents the maximum CPU resource occupation amount of all nodes, B represents the maximum bandwidth resource occupation amount of all links, and alpha, beta and gamma are weights;
ii) initializing a neural network and an experience replay set D;
iii) selecting the node with the minimum load to deploy or backup the first virtual network function in the service function chain to obtain the current Cr、BrAnd FinfoInformation and setting an initial state S (t);
iv) selecting deployment and backup node pairs (V) for the current state S (t) according to an epsilon-greedy methodd,Vb) After executing the action, obtain updated Cr、BrAnd FinfoInformation and go to the next state S (t + 1);
v) calculating the current feedback value R according to the feedback function R (t), and adding the four tuples (S (t), (V)d,Vb) S (t +1), R) is stored in the experience playback set D, and S (t) is set as S (t + 1);
vi) updating the neural network parameters by gradient back propagation according to the samples in D.
8. The deep reinforcement learning-based service function chain reliability deployment method as claimed in claim 7, wherein in step iv), if node pairs (V) are not reserved, the node pairs (V) are not reservedd,Vb) If yes, backup node VbIs set to empty.
9. The method for deploying the service function chain reliability based on the deep reinforcement learning of claim 7, wherein after the step vi) is completed, if the last virtual network function is determined, the deep reinforcement learning is ended; otherwise, re-executing steps iv) -vi).
10. The method for deploying the service function chain reliability based on the deep reinforcement learning according to claim 1, wherein after the step 3) is completed, if the last service function chain reliability deployment request is determined, the reliability deployment method is ended; otherwise, re-executing the step 1) to the step 3).
CN202011359654.5A 2020-11-27 2020-11-27 Service function chain reliability deployment method based on deep reinforcement learning Active CN112543119B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011359654.5A CN112543119B (en) 2020-11-27 2020-11-27 Service function chain reliability deployment method based on deep reinforcement learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011359654.5A CN112543119B (en) 2020-11-27 2020-11-27 Service function chain reliability deployment method based on deep reinforcement learning

Publications (2)

Publication Number Publication Date
CN112543119A true CN112543119A (en) 2021-03-23
CN112543119B CN112543119B (en) 2022-02-18

Family

ID=75015185

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011359654.5A Active CN112543119B (en) 2020-11-27 2020-11-27 Service function chain reliability deployment method based on deep reinforcement learning

Country Status (1)

Country Link
CN (1) CN112543119B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114172937A (en) * 2022-01-19 2022-03-11 重庆邮电大学 Dynamic service function chain arrangement method and system based on deep reinforcement learning
CN114944985A (en) * 2022-05-11 2022-08-26 重庆大学 Reliability-oriented service function chain backup method in resource heterogeneous environment
CN115086175A (en) * 2022-04-26 2022-09-20 北京邮电大学 Resource optimization method and device
CN115174393A (en) * 2022-06-27 2022-10-11 福州大学 Dynamic adjustment method of service function chain based on in-band network telemetry

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105141617A (en) * 2015-09-14 2015-12-09 上海华为技术有限公司 Deploying and adjusting method for service functions among data centers and deploying and adjusting device for service functions among data centers
EP3121997A1 (en) * 2015-07-20 2017-01-25 Koninklijke KPN N.V. Service provisioning in a communication network
CN108134843A (en) * 2018-01-26 2018-06-08 重庆邮电大学 A kind of service function chain dispositions method under 5G-C-RAN scenes
CN108260169A (en) * 2018-01-26 2018-07-06 重庆邮电大学 A kind of service function chain dynamic deployment method ensured based on QoS
CN108965024A (en) * 2018-08-01 2018-12-07 重庆邮电大学 A kind of virtual network function dispatching method of the 5G network slice based on prediction
CN109412963A (en) * 2018-11-16 2019-03-01 电子科技大学 A kind of service function chain dispositions method split based on stream
CN109842528A (en) * 2019-03-19 2019-06-04 西安交通大学 A kind of dispositions method of the service function chain based on SDN and NFV
CN110190987A (en) * 2019-05-08 2019-08-30 南京邮电大学 Based on backup income and the virtual network function reliability dispositions method remapped
CN110213369A (en) * 2019-06-03 2019-09-06 电子科技大学 A kind of service function chain autopatch system and its method of combination
CN110521169A (en) * 2017-04-20 2019-11-29 思科技术公司 Strategy for service chaining guarantees
CN111147307A (en) * 2019-12-30 2020-05-12 重庆邮电大学 Service function chain reliable deployment method based on deep reinforcement learning
CN111475252A (en) * 2020-03-06 2020-07-31 重庆邮电大学 Virtual network function deployment optimization algorithm based on deep reinforcement learning
CN111669291A (en) * 2020-06-03 2020-09-15 北京理工大学 Virtualized network service function chain deployment method based on deep reinforcement learning

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3121997A1 (en) * 2015-07-20 2017-01-25 Koninklijke KPN N.V. Service provisioning in a communication network
CN105141617A (en) * 2015-09-14 2015-12-09 上海华为技术有限公司 Deploying and adjusting method for service functions among data centers and deploying and adjusting device for service functions among data centers
CN110521169A (en) * 2017-04-20 2019-11-29 思科技术公司 Strategy for service chaining guarantees
CN108260169A (en) * 2018-01-26 2018-07-06 重庆邮电大学 A kind of service function chain dynamic deployment method ensured based on QoS
CN108134843A (en) * 2018-01-26 2018-06-08 重庆邮电大学 A kind of service function chain dispositions method under 5G-C-RAN scenes
CN108965024A (en) * 2018-08-01 2018-12-07 重庆邮电大学 A kind of virtual network function dispatching method of the 5G network slice based on prediction
CN109412963A (en) * 2018-11-16 2019-03-01 电子科技大学 A kind of service function chain dispositions method split based on stream
CN109842528A (en) * 2019-03-19 2019-06-04 西安交通大学 A kind of dispositions method of the service function chain based on SDN and NFV
CN110190987A (en) * 2019-05-08 2019-08-30 南京邮电大学 Based on backup income and the virtual network function reliability dispositions method remapped
CN110213369A (en) * 2019-06-03 2019-09-06 电子科技大学 A kind of service function chain autopatch system and its method of combination
CN111147307A (en) * 2019-12-30 2020-05-12 重庆邮电大学 Service function chain reliable deployment method based on deep reinforcement learning
CN111475252A (en) * 2020-03-06 2020-07-31 重庆邮电大学 Virtual network function deployment optimization algorithm based on deep reinforcement learning
CN111669291A (en) * 2020-06-03 2020-09-15 北京理工大学 Virtualized network service function chain deployment method based on deep reinforcement learning

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114172937A (en) * 2022-01-19 2022-03-11 重庆邮电大学 Dynamic service function chain arrangement method and system based on deep reinforcement learning
CN114172937B (en) * 2022-01-19 2023-12-29 广州市宝思信息科技有限公司 Dynamic service function chain arrangement method and system based on deep reinforcement learning
CN115086175A (en) * 2022-04-26 2022-09-20 北京邮电大学 Resource optimization method and device
CN115086175B (en) * 2022-04-26 2023-06-30 北京邮电大学 Resource optimization method and device
CN114944985A (en) * 2022-05-11 2022-08-26 重庆大学 Reliability-oriented service function chain backup method in resource heterogeneous environment
CN114944985B (en) * 2022-05-11 2023-11-24 重庆大学 Reliability-oriented service function chain backup method in resource heterogeneous environment
CN115174393A (en) * 2022-06-27 2022-10-11 福州大学 Dynamic adjustment method of service function chain based on in-band network telemetry
CN115174393B (en) * 2022-06-27 2023-09-22 福州大学 Service function chain dynamic adjustment method based on in-band network telemetry

Also Published As

Publication number Publication date
CN112543119B (en) 2022-02-18

Similar Documents

Publication Publication Date Title
CN112543119B (en) Service function chain reliability deployment method based on deep reinforcement learning
Zhang et al. Adaptive interference-aware VNF placement for service-customized 5G network slices
CN107770096B (en) SDN/NFV network dynamic resource allocation method based on load balancing
CN112738820B (en) Dynamic deployment method and device of service function chain and computer equipment
CN108965014B (en) QoS-aware service chain backup method and system
CN108768877B (en) Distribution method and device of burst traffic and proxy server
CN105279027B (en) A kind of virtual machine deployment method and device
CN109547358B (en) Method for constructing time-sensitive network slice
CN108600101B (en) Network service cross-domain arrangement method aiming at end-to-end time delay performance optimization
CN107967164B (en) Method and system for live migration of virtual machine
CN112769905B (en) NUMA (non uniform memory access) architecture based high-performance network card performance optimization method under Feiteng platform
CN112631766B (en) Dynamic adjustment method and device for project environment resources
CN113904923A (en) Service function chain joint optimization method based on software defined network
CN107070752B (en) Testing method and testing system for long connection capacity
CN112698947B (en) GPU resource flexible scheduling method based on heterogeneous application platform
CN110086726A (en) A method of automatically switching Kubernetes host node
WO2017181936A1 (en) Network resource deployment method, apparatus, controller, and core network device
CN107360235B (en) Task migration method based on reliability grading
CN112073237A (en) Large-scale target network construction method in cloud edge architecture
CN115189910A (en) Network digital twin-based deliberate attack survivability evaluation method
CN117560376A (en) Task processing method and device, storage medium and electronic device
CN111786887A (en) Data forwarding method, apparatus, computing device, and medium executed by control device
CN113259440A (en) Load balancing-based dynamic deployment method and system for virtual network functions
CN102546652B (en) System and method for server load balancing
CN115225496B (en) Mobile perception service unloading fault-tolerant method based on edge computing environment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant