Disclosure of Invention
To solve the above technical problems, the invention targets the rich, multidimensional full-link data produced by edge computing and the large back-end center. It constructs a virtual guidance environment suitable for deep reinforcement learning and, drawing on data, knowledge and services, studies a process-optimization self-learning intelligent model algorithm that supports rapid iterative trial and error and finds the optimal business process guidance path. The technical solution that achieves this purpose is a business process guidance self-learning optimization algorithm comprising the following specific steps:
first, business processes are organized and adapted based on mined patterns: a comprehensive-knowledge-driven similar-process classification algorithm and joint-learning-based mining of per-class business operation patterns are applied, and the adaptive organization of each business process is completed on the basis of the mined patterns;
second, global business process intelligent guidance is performed based on the operation map and data convergence, using an operation map and data convergence algorithm based on a co-coupled neural network, a business process intelligent guidance algorithm based on shortest-path optimization over a complex graph structure, and a centrality algorithm for the hierarchical network constructed from the graph network;
finally, a guidance environment is constructed from the full-link multidimensional data of the intelligent edge and the large back-end center and combined with deep reinforcement learning to complete process self-learning optimization under full-link multidimensional data composite guidance.
As an improvement, the comprehensive-knowledge-driven similar-process classification algorithm proceeds as follows:
(1) A process model is established using a Petri net;
(2) Using the relative positions and logical relations of the elements in the process, the mapping strategy for the elements of the process model (transitions and places) is adjusted iteratively to find the correspondences between elements of different processes;
(3) Once a stable optimal mapping function is obtained, the similarity coefficient between whole processes is calculated;
(4) The business process similarity coefficient is defined through the structural similarity coefficient of element pairs (ESS) and the global structural similarity coefficient (SSM). Both ignore extrinsic task properties, including but not limited to text labels, assign specific quantifiable values to logical structures of the process such as concurrency, asynchrony and selection, and describe the degree of structural similarity between elements and between processes;
(5) In business process similarity measurement, a coarse-grained calculation method is adopted: through automated study of business process similarity measurement based on internal structure, the overall degree of similarity among all mapped elements is quantitatively analyzed and calculated on the basis of the optimal mapping function obtained by adjustment, and a process similarity coefficient in [0, 1] is finally determined;
(6) Different process versions generated in different periods are compared and evaluated, the mutation positions and mutation types are located, and a management knowledge base is established, so that when new changes arise the evolution efficiency of the process can be improved, or different changes can be promoted to adapt to changes in the environment.
As an improvement, joint-learning-based mining of per-class business operation patterns represents each event sequence as a time graph, mines sequential patterns with a time-interval sequential pattern mining algorithm, and finally clusters and partitions the results.
As an improvement, the operation map and data convergence algorithm based on the co-coupled neural network describes the operating mechanism of business operations with a Bayesian network, and realizes convergence and fused representation learning of the operation map, knowledge and data through a cross-training mechanism based on parameter sharing;
the nodes of the network correspond to the conditions and constraints in the ontology description of the process business network operating mechanism, to plan fragments, and to the actions and entities involved in a plan; the edges of the network correspond to the logical relations between the concepts represented by the nodes, including the dependencies of a plan and its sub-concepts on other actions, plan fragments, conditions and constraints in the ontology description; in the ontology description, a plan fragment contains the relations between entities and actions as well as the temporal relations within the plan;
As an improvement, the business process intelligent guidance algorithm based on shortest-path optimization over a complex graph structure constructs a hierarchical network from the graph network and applies an iterative calculation method on that hierarchical network;
when the hierarchical network is constructed from the graph network, for each level from low to high, all nodes in the network are initially treated as common nodes and the following iteration is carried out:
(1) the common node with the largest degree is selected as a central node, and its common neighbors (neighbors that are not super nodes) are aggregated with it into a super node;
(2) the edges between nodes are rebuilt: every edge pointing to a member of a super node is redirected to the super node, and multiple edges between the same pair of nodes are merged into one;
(3) the aggregation continues until every node in the network has been aggregated into a super node, which ends the aggregation iteration for the current level; the resulting super nodes then serve as the common nodes of the next-level hierarchical network, which reduces its scale;
(4) if the number of nodes in the current hierarchical network falls below a given threshold, the iterative aggregation stops and construction of the hierarchical network is complete; the network obtained is the highest-level hierarchical network.
As an improvement, the iterative calculation method builds the hierarchical network by repeatedly aggregating central nodes and their neighbors until the original network is reduced to a small highest-level hierarchical network, and then iteratively calculates the shortest path between any two nodes of the original network from the approximate distance between each node and its central node at every level;
Define $d^{i}(s,t)$ as the approximate distance from node s to node t in the i-th level hierarchical network. The shortest-path distance $d(s,t)$ from s to t in the original network is obtained from the approximate distance $d^{0}(s,t)$, and each distance $d^{i}(s,t)$ is computed iteratively from the approximate distance $d^{i+1}(\cdot,\cdot)$ (where $i \ge 0$), which gives the iterative calculation of $d^{i}(s,t)$:
$$d^{i}(s,t) = d^{i}(s,c_s) + d^{i+1}(c_s,c_t) + d^{i}(c_t,t)$$
In the above formula, $c_s$ and $c_t$ are the central nodes of nodes s and t, and $d^{i}(s,c)$ denotes the approximate distance from node s to its central node c.
As an improvement, the approximate distance $d^{i}(s,c)$ is obtained as follows. While constructing the hierarchical network, the algorithm selects the common node with the largest degree as a central node and aggregates its common neighbors (neighbors that are not super nodes) into a super node, which serves as a common node in the next-level hierarchical network.
The radius $r_{i+1}$ of a node in the (i+1)-th level hierarchical network is defined from the approximate distance $d^{i}(s,c)$ from node s to the central node c in the i-th level hierarchical network. A node in the level-0 hierarchical network has radius 0, and its distance to the central node is 1; a node in the level-1 hierarchical network has radius 1, and its distance to the central node is 3; and so on.
The central nodes $c_s$ and $c_t$ of the i-th level hierarchical network correspond to the common nodes $s_{i+1}$ and $t_{i+1}$ of the (i+1)-th level hierarchical network. If the radius of a node in the i-th level hierarchical network is $r_i$, the radius of a node in the (i+1)-th level hierarchical network is $r_{i+1} = 2r_i + 1$. The mean node radius $r_i$ of the i-th level hierarchical network is then obtained approximately:
[formula not reproduced in the source text]
where k is the number of networks of different scales in the hierarchical network. The approximate distance $d^{i}(s,c)$ from node s to the central node c in the i-th level hierarchical network is calculated by:
[formula not reproduced in the source text]
Combining the two formulas gives the iterative calculation formula for the approximate distance $d^{i}(s,t)$:
[formula not reproduced in the source text]
where $d^{k-1}(s,t)$, the approximate distance between nodes s and t in the highest-level hierarchical network, equals the actual distance calculated by Dijkstra's algorithm. The margin between nodes s and t in the highest-level hierarchical network is approximately $2r_k + 1 = 4k - 5$.
As an improvement, the centrality algorithm for the hierarchical network constructed from the graph network comprises a closeness centrality algorithm and a betweenness calculation method. The closeness centrality algorithm builds on the shortest-path approximation algorithm: using the hierarchical network and the iterative calculation method, it computes for every node the sum of the shortest-path distances to all other nodes, which gives the closeness value of every node; all nodes in the network are then ranked by this value, and the result is the closeness centrality of the nodes in the network;
the betweenness calculation method first iteratively aggregates the original network by constructing the hierarchical network until the highest-level hierarchical network is obtained; it then computes the shortest paths between the nodes of the highest-level hierarchical network with Dijkstra's algorithm and, from these paths, counts the number of shortest paths passing through each node of the highest-level hierarchical network.
As an improvement, the guidance environment is constructed from the full-link multidimensional data of the intelligent edge and the large back-end center as follows:
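according to the definition of the value function, the value function of a policy π, with discount factor γ and initial-state distribution D, is:
$$E_{s_0 \sim D}\left[V^{\pi}(s_0)\right] = E\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t) \,\middle|\, \pi\right] = w \cdot E\left[\sum_{t=0}^{\infty} \gamma^{t} \phi(s_t) \,\middle|\, \pi\right]$$
where R(s) is the unknown reward function, generally a function of the state. Because the reward function is unknown, it is approximated parametrically by function approximation in the form $R(s) = w \cdot \phi(s)$, where $\phi(s)$ is a vector of basis functions, for example a polynomial basis or a Fourier basis. Inverse reinforcement learning solves for the coefficients w of the reward function.
The feature expectation is defined as:
$$\mu(\pi) = E\left[\sum_{t=0}^{\infty} \gamma^{t} \phi(s_t) \,\middle|\, \pi\right]$$
Given m expert trajectories, the feature expectation of the expert policy is estimated as:
$$\hat{\mu}_E = \frac{1}{m} \sum_{i=1}^{m} \sum_{t=0}^{\infty} \gamma^{t} \phi\left(s_t^{(i)}\right)$$
The goal is to find a policy whose performance is close to that of the expert policy. Feature expectations measure the quality of a policy, so a policy $\tilde{\pi}$ behaves like the expert policy when the following inequality holds:
$$\left\| \mu(\tilde{\pi}) - \mu_E \right\|_2 \le \epsilon$$
When the inequality holds, for any weights with $\|w\|_1 \le 1$ the value functions satisfy:
$$\left| E\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t) \,\middle|\, \pi_E\right] - E\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t) \,\middle|\, \tilde{\pi}\right] \right| = \left| w^{T}\mu_E - w^{T}\mu(\tilde{\pi}) \right| \le \|w\|_2 \left\| \mu_E - \mu(\tilde{\pi}) \right\|_2 \le \epsilon$$
The standard form of the objective function is:
$$\max_{t,\,w} \; t$$
$$\text{s.t.}\quad w^{T}\mu_E \ge w^{T}\mu^{(j)} + t,\quad j = 0, \ldots, i-1$$
$$\|w\|_2 \le 1$$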
The expert policy forms one class and the other policies form another; solving for the parameters amounts to finding a separating hyperplane that distinguishes the expert policy from the other policies while maximizing the margin between the two classes.
Beneficial effects: the invention provides a business process guidance self-learning optimization algorithm that uses comprehensive knowledge to classify the business processes of a large-scale business network at combined coarse and fine granularity, and adopts a simplified representation for sets of similar business operations. It also combines the business requirements of the specific scenario, fusing and distributing prior knowledge and scenario-specific data into the business process so that the basic operation patterns are satisfied. Specific business processes are guided from a global perspective to achieve the optimal operation path at minimum cost. By combining business operations bound to knowledge and data, the algorithm searches for the optimal path in a large-scale graph structure, controls the length of the business link, shields unnecessary business fields, and realizes task-oriented point-to-point business guidance.
Detailed Description
The invention is further described below with reference to examples.
A business process guiding self-learning optimization algorithm comprises the following specific steps:
first, business processes are organized and adapted based on mined patterns: a comprehensive-knowledge-driven similar-process classification algorithm and joint-learning-based mining of per-class business operation patterns are applied, and the adaptive organization of each business process is completed on the basis of the mined patterns;
second, global business process intelligent guidance is performed based on the operation map and data convergence, using an operation map and data convergence algorithm based on a co-coupled neural network, a business process intelligent guidance algorithm based on shortest-path optimization over a complex graph structure, and a centrality algorithm for the hierarchical network constructed from the graph network;
finally, a guidance environment is constructed from the full-link multidimensional data of the intelligent edge and the large back-end center and combined with deep reinforcement learning to complete process self-learning optimization under full-link multidimensional data composite guidance.
(I) Implementation of business process organization and adaptive technology based on mined patterns
(1) Implementation of the comprehensive-knowledge-driven similar-process classification algorithm
1) A process model is established using a Petri net;
2) Using the relative positions and logical relations of the elements in the process, the mapping strategy for the elements of the process model (transitions and places) is adjusted iteratively to find the correspondences between elements of different processes;
3) Once an optimal mapping function that has stabilized is obtained, the similarity coefficient between whole processes is calculated;
4) The business process similarity coefficient is defined by introducing the element structure similarity coefficient (ESS) of element pairs and the global structure similarity coefficient (SSM). Extrinsic differences between tasks (such as text labels) are ignored, specific quantifiable values are assigned to logical structures of the process such as concurrency, asynchrony and selection, and the degree of structural similarity between elements and between processes is described.
Element structure similarity coefficient (ESS, Element Structure Similarity): given two process models modeled as Petri nets, $M_1 = (P_1, T_1, A_1)$ and $M_2 = (P_2, T_2, A_2)$, consider an element p of $M_1$ ($p \in P_1 \cup T_1$) and an element q of $M_2$ ($q \in P_2 \cup T_2$). The similarity coefficient of p and q is expressed as:
[formula not reproduced in the source text]
$S_{pq}$ denotes the similarity of element p to element q. $L_p$ and $L_q$ denote the numbers of left-side sets corresponding to p and q, and $R_p$ and $R_q$ denote the numbers of right-side sets corresponding to p and q. Analyzing the similarity of p and q requires considering both the commonalities and the differences of the business processes $M_1$ and $M_2$. $S_{pq}$ is determined by the process structures immediately to the left and right of p and q, and depends on the number of intersections and on the structural similarity ratio established by the mapping function. The left-side structure contributes a similarity term and a difference term $\gamma_{pq}$; the right-side structure contributes a similarity term and a difference term $\delta_{pq}$. α and β are the weights of the left and right structures in the similarity coefficient; equal weights α = 0.5, β = 0.5 are preferred.
Global structure similarity coefficient (SSM, Structure Similarity Modulus): the similarity coefficient between processes $M_1$ and $M_2$ is defined to lie in [0, 1] and is expressed as:
[formula not reproduced in the source text]
The larger the value of S, the more similar the pair of processes. The optimal place mapping set is denoted $f_P$ and the optimal transition mapping set is denoted $f_T$. η and θ are the weights of places and transitions respectively: if the similarity of transitions matters more in the process, θ should be given the larger value; otherwise η should be given the larger value. For ease of presentation, equal weights η = 0.5, θ = 0.5 are used here. m and n are the lengths of $f_P$ and $f_T$ respectively.
5) Through automated study of business process similarity measurement based on internal structure, the overall degree of similarity among all mapped elements is quantitatively analyzed and calculated on the basis of the optimal mapping function, and a process similarity coefficient in [0, 1] is finally determined. Different process versions generated in different periods are compared and evaluated, the mutation positions and mutation types are located, and a management knowledge base is established, so that when new changes arise in the future the evolution efficiency of the process can be improved, or different changes can be promoted to adapt to changes in the environment.
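As an illustration of how a process-level coefficient in [0, 1] could be aggregated from element-pair similarities, the following minimal sketch assumes a weighted average over the optimal mappings f_P and f_T with weights η and θ. The weighted-average form, the function name `global_similarity` and the sample values are assumptions made for illustration; they are not taken from the SSM formula of the invention.

```python
from typing import Dict, Tuple

Pair = Tuple[str, str]  # (element of M1, mapped element of M2)


def global_similarity(
    place_map: Dict[Pair, float],
    transition_map: Dict[Pair, float],
    eta: float = 0.5,
    theta: float = 0.5,
) -> float:
    """Aggregate element-pair similarities S_pq into one process-level score.

    ASSUMPTION: a weighted mean over f_P (places) and f_T (transitions);
    this form is hypothetical and only illustrates the role of eta and theta.
    """
    m, n = len(place_map), len(transition_map)
    place_part = sum(place_map.values()) / m if m else 0.0
    transition_part = sum(transition_map.values()) / n if n else 0.0
    return eta * place_part + theta * transition_part


# Hypothetical element-pair similarities, each already in [0, 1].
f_P = {("p1", "q1"): 1.0, ("p2", "q2"): 0.5}
f_T = {("t1", "u1"): 0.75, ("t2", "u2"): 0.25}
score = global_similarity(f_P, f_T)
print(score)
```

With η + θ = 1 and every element-pair value in [0, 1], the result of this sketch also stays in [0, 1].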
(2) Implementation of joint-learning-based mining of per-class business operation patterns
1) Each event sequence is represented as a time graph;
2) Sequential patterns are mined with a time-interval sequential pattern mining algorithm;
3) The results are clustered and partitioned.
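The three steps above can be sketched as follows. This is a simplified stand-in, not the mining algorithm of the invention: it only considers consecutive event pairs, discretizes the time gap into two hypothetical buckets (`short`, `long`), and groups sequences by the frequent patterns they contain. All names and the sample log are assumptions.

```python
from collections import Counter
from typing import Dict, List, Tuple

Event = Tuple[str, float]          # (activity label, timestamp)
Pattern = Tuple[str, str, str]     # (activity, gap bucket, next activity)


def interval_bucket(gap: float, short: float = 10.0) -> str:
    """Discretize a time gap; the 10-unit cut-off is an arbitrary assumption."""
    return "short" if gap <= short else "long"


def sequence_patterns(seq: List[Event]) -> set:
    """Time-annotated transitions between consecutive events of one sequence."""
    return {(a, interval_bucket(tb - ta), b) for (a, ta), (b, tb) in zip(seq, seq[1:])}


def mine_interval_patterns(sequences: List[List[Event]], min_support: int) -> Dict[Pattern, int]:
    """Keep patterns that occur in at least min_support sequences."""
    support = Counter()
    for seq in sequences:
        support.update(sequence_patterns(seq))
    return {p: c for p, c in support.items() if c >= min_support}


def cluster_by_patterns(sequences: List[List[Event]], patterns: Dict[Pattern, int]):
    """Group sequence indices by the set of frequent patterns they contain."""
    clusters = {}
    for idx, seq in enumerate(sequences):
        key = frozenset(sequence_patterns(seq) & set(patterns))
        clusters.setdefault(key, []).append(idx)
    return clusters


logs = [
    [("open", 0), ("review", 5), ("approve", 30)],
    [("open", 0), ("review", 8), ("reject", 12)],
    [("open", 0), ("review", 40), ("approve", 45)],
]
patterns = mine_interval_patterns(logs, min_support=2)
clusters = cluster_by_patterns(logs, patterns)
```

In this sample only the pattern "open, then review after a short gap" reaches the support threshold, so the first two sequences fall into one cluster and the third into another.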
(3) Implementation of the business process adaptive organization algorithm based on mined patterns
(II) Implementation of global business process intelligent guidance technology based on the operation map and data convergence
(1) Implementation of the operation map and data convergence algorithm based on the co-coupled neural network
1) A Bayesian network describes the operating mechanism of business operations
1.1) The nodes of the network correspond to the conditions and constraints in the ontology description of the process business network operating mechanism, and to concepts such as plan fragments and the actions and entities involved in a plan. Each of these concepts is assigned a prior probability P(C|SupC) obtained from experience, statistics or subjective judgment, where C denotes a concept or class node in the network.
1.2) The edges of the network correspond to the logical relations between the concepts represented by the nodes, including the dependencies of a plan and its sub-concepts on other actions, plan fragments, conditions and constraints in the ontology description of the process business network operating mechanism. In the ontology description these comprise the containment relations between plan fragments and entities and actions, the relations between actions and sub-actions, and the temporal relations within a plan. These relations supply the conditional probability P(C|SupC), where SupC denotes the upper-level (parent) node of node C.
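To illustrate how the conditional probabilities P(C|SupC) attached to the edges can be combined, the following sketch assumes a tree-shaped network in which each concept has a single upper-level node and each probability refers to the concept holding given that its parent holds. The concept names and probability values are hypothetical.

```python
from typing import Dict, Optional

# ASSUMPTION: each concept C has at most one upper-level node SupC (a tree).
parent: Dict[str, Optional[str]] = {
    "plan": None,                 # root concept, no SupC
    "plan_fragment": "plan",
    "action": "plan_fragment",
}

# P(C | SupC); for the root this is simply the prior P(C). Values are made up.
cond_prob: Dict[str, float] = {
    "plan": 0.5,
    "plan_fragment": 0.5,
    "action": 0.25,
}


def chain_probability(node: Optional[str]) -> float:
    """Probability that a concept and all of its ancestors hold.

    Multiplies P(C | SupC) along the path from the node up to the root,
    which is the chain rule for a tree-structured Bayesian network.
    """
    prob = 1.0
    while node is not None:
        prob *= cond_prob[node]
        node = parent[node]
    return prob


print(chain_probability("action"))
```

A general Bayesian network allows several parents per node and full conditional probability tables; the single-parent chain is used here only to keep the sketch short.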
2) A cross-training mechanism based on parameter sharing realizes convergence and fused representation learning of the operation map, knowledge and data
2.1) The fused representation vector matrix U replaces $U_w$ and $U_s$ in the base models, establishing the co-coupled neural network;
2.2) The left and right representation learning models are trained alternately; U is shared by the two models, that is, it is passed back and forth between them during training;
2.3) After repeated iterations, node vector representations that fuse both kinds of information are obtained, together with the corresponding maximized objective function.
(2) Implementation of the business process intelligent guidance algorithm based on shortest-path optimization over a complex graph structure
1) Constructing the hierarchical network from the graph network
For each level of the hierarchical network, from low to high, all nodes in the network are initially treated as common nodes and the following iteration is carried out:
1.1) The common node with the largest degree is selected as a central node, and its common neighbors (neighbors that are not super nodes) are aggregated with it into a super node;
1.2) The edges between nodes are rebuilt: every edge pointing to a member of a super node is redirected to the super node, and multiple edges between the same pair of nodes are merged into one;
1.3) The aggregation continues until every node in the network has been aggregated into a super node, which ends the aggregation iteration for the current level; the resulting super nodes then serve as the common nodes of the next-level hierarchical network, which reduces its scale;
1.4) If the number of nodes in the current hierarchical network falls below a given threshold, the iterative aggregation stops and construction of the hierarchical network is complete; the network obtained is the highest-level hierarchical network.
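Steps 1.1) to 1.4) can be sketched as follows for an undirected graph stored as adjacency sets. The function names, the tie-breaking rule (smallest node id among equal degrees) and the default threshold are assumptions added for illustration; each super node is labeled with the id of its central node.

```python
from typing import Dict, List, Set, Tuple

Graph = Dict[int, Set[int]]


def aggregate_level(graph: Graph) -> Tuple[Graph, Dict[int, int]]:
    """Aggregate one level; returns the next-level graph and node -> super node."""
    unassigned = set(graph)
    member_of: Dict[int, int] = {}
    while unassigned:
        # 1.1) common node with the largest degree (ties: smallest id, an assumption)
        center = max(unassigned, key=lambda v: (len(graph[v]), -v))
        group = {center} | (graph[center] & unassigned)
        for v in group:
            member_of[v] = center
        unassigned -= group  # 1.3) repeat until every node is in a super node
    # 1.2) redirect edges to super nodes; the sets merge parallel edges
    next_graph: Graph = {c: set() for c in set(member_of.values())}
    for u, neighbours in graph.items():
        for v in neighbours:
            cu, cv = member_of[u], member_of[v]
            if cu != cv:
                next_graph[cu].add(cv)
                next_graph[cv].add(cu)
    return next_graph, member_of


def build_hierarchy(graph: Graph, threshold: int = 3) -> Tuple[List[Graph], List[Dict[int, int]]]:
    """1.4) keep aggregating until the current level has fewer than threshold nodes."""
    levels, memberships = [graph], []
    while len(levels[-1]) >= threshold:
        next_graph, member_of = aggregate_level(levels[-1])
        if len(next_graph) == len(levels[-1]):
            break  # nothing was merged, e.g. a graph without edges
        levels.append(next_graph)
        memberships.append(member_of)
    return levels, memberships


# A path 0-1-2-3-4-5 as a small example network.
path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3, 5}, 5: {4}}
levels, memberships = build_hierarchy(path)
```

On this path the first pass forms the super nodes {0, 1, 2}, {3, 4} and {5}, and the second pass merges them into a single highest-level node.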
2) Iterative calculation method on the constructed hierarchical network
The algorithm builds the hierarchical network by repeatedly aggregating central nodes and their neighbors until the original network is reduced to a very small highest-level hierarchical network. On the basis of this hierarchical network, the iterative calculation method computes the shortest path between any two nodes of the original network from the approximate distance between each node and its central node at every level.
Define $d^{i}(s,t)$ as the approximate distance from node s to node t in the i-th level hierarchical network. The shortest-path distance $d(s,t)$ from s to t in the original network is obtained from the approximate distance $d^{0}(s,t)$, and each distance $d^{i}(s,t)$ is computed iteratively from the approximate distance $d^{i+1}(\cdot,\cdot)$ (where $i \ge 0$), which gives the iterative calculation of $d^{i}(s,t)$:
$$d^{i}(s,t) = d^{i}(s,c_s) + d^{i+1}(c_s,c_t) + d^{i}(c_t,t)$$
In the above formula, $c_s$ and $c_t$ are the central nodes of nodes s and t, and $d^{i}(s,c)$ denotes the approximate distance from node s to its central node c.
While constructing the hierarchical network, the algorithm selects the common node with the largest degree as a central node and aggregates its common neighbors (neighbors that are not super nodes) into a super node, which serves as a common node in the next-level hierarchical network. The radius $r_{i+1}$ of a node in the (i+1)-th level hierarchical network can be defined from the approximate distance $d^{i}(s,c)$ from node s to the central node c in the i-th level hierarchical network. A node in the level-0 hierarchical network has radius 0, and its distance to the central node is 1; a node in the level-1 hierarchical network has radius 1, and its distance to the central node is 3; and so on.
The central nodes $c_s$ and $c_t$ of the i-th level hierarchical network correspond to the common nodes $s_{i+1}$ and $t_{i+1}$ of the (i+1)-th level hierarchical network. If the radius of a node in the i-th level hierarchical network is $r_i$, the radius of a node in the (i+1)-th level hierarchical network is $r_{i+1} = 2r_i + 1$. The mean node radius $r_i$ of the i-th level hierarchical network can then be obtained approximately:
[formula not reproduced in the source text]
where k is the number of networks of different scales in the hierarchical network. The approximate distance $d^{i}(s,c)$ from node s to the central node c in the i-th level hierarchical network is calculated by:
[formula not reproduced in the source text]
Combining the two formulas gives the iterative calculation formula for the approximate distance $d^{i}(s,t)$:
[formula not reproduced in the source text]
where $d^{k-1}(s,t)$, the approximate distance between nodes s and t in the highest-level hierarchical network, equals the actual distance calculated by Dijkstra's algorithm. The margin between nodes s and t in the highest-level hierarchical network may be approximated as $2r_k + 1 = 4k - 5$.
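The level-by-level calculation can be sketched as a short recursion, assuming the recurrence d^i(s,t) = d^i(s,c_s) + d^{i+1}(c_s,c_t) + d^i(c_t,t) and an exact distance at the highest level. The hierarchy below is written out by hand for a path 0-1-2-3-4-5 with central nodes 1, 3 and 5; expressing the top-level distances in original-network hops is an assumption of this sketch, as are all names.

```python
from typing import Callable, Dict, List


def approx_distance(
    s: int,
    t: int,
    level: int,
    center_of: List[Dict[int, int]],
    dist_to_center: List[Dict[int, int]],
    top_distance: Callable[[int, int], int],
) -> int:
    """d^i(s,t) = d^i(s,c_s) + d^(i+1)(c_s,c_t) + d^i(c_t,t)."""
    if s == t:
        return 0
    if level == len(center_of):
        # Highest level: use the exact distance (e.g. computed by Dijkstra).
        return top_distance(s, t)
    c_s, c_t = center_of[level][s], center_of[level][t]
    return (
        dist_to_center[level][s]
        + approx_distance(c_s, c_t, level + 1, center_of, dist_to_center, top_distance)
        + dist_to_center[level][t]
    )


# Level 0 of a path 0-1-2-3-4-5: super nodes {0,1,2}, {3,4}, {5}.
center_of = [{0: 1, 1: 1, 2: 1, 3: 3, 4: 3, 5: 5}]
dist_to_center = [{0: 1, 1: 0, 2: 1, 3: 0, 4: 1, 5: 0}]

# ASSUMPTION: top-level distances between central nodes, in original hops.
TOP = {frozenset((1, 3)): 2, frozenset((3, 5)): 2, frozenset((1, 5)): 4}


def top_distance(a: int, b: int) -> int:
    return TOP[frozenset((a, b))]


d_0_5 = approx_distance(0, 5, 0, center_of, dist_to_center, top_distance)
d_2_3 = approx_distance(2, 3, 0, center_of, dist_to_center, top_distance)
```

For nodes 0 and 5 the sketch returns the true distance, while for the adjacent nodes 2 and 3 it returns a larger value because the route is forced through both central nodes; this is the kind of error the approximation accepts in exchange for working on a much smaller network.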
(3) Implementation of the centrality algorithm for the hierarchical network constructed from the graph network
1) Closeness centrality algorithm
1.1) On the basis of the shortest-path approximation algorithm, the hierarchical network and the iterative calculation method are used to compute, for every node, the sum of the shortest-path distances to all other nodes, which gives the closeness value of every node;
1.2) All nodes in the network are ranked by this value; the result is the closeness centrality of the nodes in the network.
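The ranking in steps 1.1) and 1.2) can be sketched as below. For brevity the sketch uses exact breadth-first-search distances on an unweighted graph in place of the hierarchical approximate distances; that substitution and the function names are assumptions, and the ranking logic is the same for either distance source.

```python
from collections import deque
from typing import Dict, List, Set, Tuple

Graph = Dict[int, Set[int]]


def bfs_distances(graph: Graph, source: int) -> Dict[int, int]:
    """Hop distances from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in graph[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist


def closeness_ranking(graph: Graph) -> Tuple[List[int], Dict[int, int]]:
    """Rank nodes by the sum of their distances to all other nodes.

    A smaller distance sum means a more central node.
    """
    totals = {v: sum(bfs_distances(graph, v).values()) for v in graph}
    ranking = sorted(graph, key=lambda v: (totals[v], v))
    return ranking, totals


path = {0: {1}, 1: {0, 2}, 2: {1, 3}, 3: {2, 4}, 4: {3}}
ranking, totals = closeness_ranking(path)
```

On the five-node path the middle node has the smallest distance sum and is ranked first.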
2) Betweenness calculation method
2.1) The original network is iteratively aggregated by constructing the hierarchical network until the highest-level hierarchical network is obtained;
2.2) Dijkstra's algorithm is used to compute the shortest paths between the nodes of the highest-level hierarchical network, and from these paths the number of shortest paths passing through each node of the highest-level hierarchical network is counted.
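Step 2.2) can be sketched as follows for a small weighted network standing in for the highest-level hierarchical network. Dijkstra's algorithm is extended to count the number of shortest paths to each node, and a node v lies on a shortest path from s to t exactly when d(s,v) + d(v,t) = d(s,t). The graph, the positive integer weights and the function names are assumptions for illustration.

```python
import heapq
from itertools import combinations
from typing import Dict, Tuple

WeightedGraph = Dict[int, Dict[int, int]]  # node -> {neighbour: positive weight}


def dijkstra_counts(graph: WeightedGraph, source: int) -> Tuple[Dict[int, int], Dict[int, int]]:
    """Shortest distances from source and the number of shortest paths (sigma)."""
    dist, sigma = {source: 0}, {source: 1}
    heap, done = [(0, source)], set()
    while heap:
        d, u = heapq.heappop(heap)
        if u in done:
            continue  # stale heap entry
        done.add(u)
        for v, w in graph[u].items():
            nd = d + w
            if v not in dist or nd < dist[v]:
                dist[v], sigma[v] = nd, sigma[u]
                heapq.heappush(heap, (nd, v))
            elif nd == dist[v]:
                sigma[v] += sigma[u]  # another shortest path reaches v via u
    return dist, sigma


def paths_through(graph: WeightedGraph) -> Dict[int, int]:
    """For each node, count the shortest paths between other pairs that pass through it."""
    info = {v: dijkstra_counts(graph, v) for v in graph}
    count = {v: 0 for v in graph}
    for s, t in combinations(graph, 2):
        dist_s, sigma_s = info[s]
        if t not in dist_s:
            continue  # s and t are disconnected
        for v in graph:
            if v in (s, t) or v not in dist_s:
                continue
            dist_v, sigma_v = info[v]
            if t in dist_v and dist_s[v] + dist_v[t] == dist_s[t]:
                count[v] += sigma_s[v] * sigma_v[t]
    return count


top = {0: {1: 1}, 1: {0: 1, 2: 1, 3: 1}, 2: {1: 1}, 3: {1: 1, 4: 1}, 4: {3: 1}}
through = paths_through(top)
```

Dividing each count by the number of shortest paths between the corresponding pair would give the usual normalized betweenness; the sketch stops at the raw counts because that is what step 2.2) describes.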
(III) Implementation of the process self-learning algorithm based on full-link multidimensional data composite guidance
(1) Guidance environment construction based on the full-link multidimensional data of the intelligent edge and the large back-end center
A virtual process guidance environment is constructed from full-link operation data using inverse reinforcement learning.
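According to the definition of the value function, the value function of a policy π, with discount factor γ and initial-state distribution D, is:
$$E_{s_0 \sim D}\left[V^{\pi}(s_0)\right] = E\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t) \,\middle|\, \pi\right] = w \cdot E\left[\sum_{t=0}^{\infty} \gamma^{t} \phi(s_t) \,\middle|\, \pi\right]$$
where the unknown reward function is approximated as $R(s) = w \cdot \phi(s)$ with basis functions $\phi(s)$.
The feature expectation is defined as:
$$\mu(\pi) = E\left[\sum_{t=0}^{\infty} \gamma^{t} \phi(s_t) \,\middle|\, \pi\right]$$
Note that the feature expectation depends on the policy π: different policies have different feature expectations.
Given m expert trajectories, the feature expectation of the expert policy can be estimated by definition as:
$$\hat{\mu}_E = \frac{1}{m} \sum_{i=1}^{m} \sum_{t=0}^{\infty} \gamma^{t} \phi\left(s_t^{(i)}\right)$$
The goal is to find a policy that behaves like the expert policy. Feature expectations measure the quality of a policy, so finding a policy $\tilde{\pi}$ that behaves like the expert policy amounts to finding one whose feature expectation is close to that of the expert policy, that is, one for which the following inequality holds:
$$\left\| \mu(\tilde{\pi}) - \mu_E \right\|_2 \le \epsilon$$
When the inequality holds, for any weights with $\|w\|_1 \le 1$ the value functions satisfy:
$$\left| E\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t) \,\middle|\, \pi_E\right] - E\left[\sum_{t=0}^{\infty} \gamma^{t} R(s_t) \,\middle|\, \tilde{\pi}\right] \right| = \left| w^{T}\mu_E - w^{T}\mu(\tilde{\pi}) \right| \le \|w\|_2 \left\| \mu_E - \mu(\tilde{\pi}) \right\|_2 \le \epsilon$$
The standard form of the objective function is:
$$\max_{t,\,w} \; t$$
$$\text{s.t.}\quad w^{T}\mu_E \ge w^{T}\mu^{(j)} + t,\quad j = 0, \ldots, i-1$$
$$\|w\|_2 \le 1$$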
The expert policy forms one class and the other policies form another; solving for the parameters amounts to finding a separating hyperplane that distinguishes the expert policy from the other policies while maximizing the margin between the two classes.
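The estimate of the expert feature expectation and the margin step can be sketched as follows. The sketch assumes finite trajectories, a one-hot feature map over three states, and a discount factor of 0.5; it solves the margin problem only for the special case of a single competing policy, where the maximizer is the normalized difference of the two feature expectations. The general case with several constraints needs a quadratic programming solver, which is not shown. All names and data are hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple


def feature_expectation(
    trajectories: Sequence[Sequence[int]],
    phi: Callable[[int], List[float]],
    gamma: float,
) -> List[float]:
    """Average discounted feature sum over m trajectories."""
    m = len(trajectories)
    mu = [0.0] * len(phi(trajectories[0][0]))
    for trajectory in trajectories:
        for t, state in enumerate(trajectory):
            for k, feature in enumerate(phi(state)):
                mu[k] += (gamma ** t) * feature / m
    return mu


def max_margin_single(mu_expert: List[float], mu_other: List[float]) -> Tuple[List[float], float]:
    """Maximize t subject to w.(mu_E - mu) >= t and ||w||_2 <= 1.

    With one constraint the optimum is w = diff / ||diff||_2 and t = ||diff||_2
    (Cauchy-Schwarz). Assumes the two feature expectations differ.
    """
    diff = [a - b for a, b in zip(mu_expert, mu_other)]
    norm = math.sqrt(sum(d * d for d in diff))
    return [d / norm for d in diff], norm


def one_hot(state: int) -> List[float]:
    """Hypothetical feature map phi(s) over three states."""
    return [1.0 if state == k else 0.0 for k in range(3)]


expert_runs = [[0, 1, 2], [0, 2, 2]]   # made-up expert state sequences
learner_runs = [[0, 1, 1]]             # made-up trajectory of another policy

mu_expert = feature_expectation(expert_runs, one_hot, gamma=0.5)
mu_learner = feature_expectation(learner_runs, one_hot, gamma=0.5)
w, margin = max_margin_single(mu_expert, mu_learner)
```

The resulting w gives reward weights under which the expert outperforms the other policy by the largest possible margin, which matches the two-class separation described above.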
(2) Combining deep reinforcement learning with the virtual business process environment constructed from full-link data realizes self-learning and self-optimization of business process guidance.
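As a much reduced illustration of learning a guidance path inside a virtual process environment, the following sketch uses tabular Q-learning with a learning rate of 1 and exhaustive sweeps over all state-action pairs. This stands in for a deep network with exploratory trial and error and is not the method of the invention. The process graph, the reward values and all names are assumptions.

```python
from typing import Callable, Dict, List

# Hypothetical virtual process environment: state -> reachable next steps.
transitions: Dict[str, List[str]] = {
    "start": ["a", "b"],
    "a": ["c"],
    "b": ["goal"],
    "c": ["goal"],
    "goal": [],
}


def reward(next_state: str) -> float:
    """ASSUMED reward: +10 for reaching the goal, -1 for any other step."""
    return 10.0 if next_state == "goal" else -1.0


def train(
    transitions: Dict[str, List[str]],
    reward: Callable[[str], float],
    gamma: float = 0.5,
    sweeps: int = 10,
) -> Dict[str, Dict[str, float]]:
    """Q-learning update with learning rate 1 in a deterministic environment."""
    q = {s: {a: 0.0 for a in nxt} for s, nxt in transitions.items()}
    for _ in range(sweeps):
        for s, actions in transitions.items():
            for a in actions:  # the action is the choice of the next step
                future = max(q[a].values(), default=0.0)
                q[s][a] = reward(a) + gamma * future
    return q


def greedy_path(q: Dict[str, Dict[str, float]], state: str) -> List[str]:
    """Follow the highest-valued action until a terminal state is reached."""
    path = [state]
    while q[state]:
        state = max(q[state], key=q[state].get)
        path.append(state)
    return path


q_values = train(transitions, reward)
best = greedy_path(q_values, "start")
```

In this toy environment the learned values prefer the shorter route through step b, since each extra step costs reward and delays the goal.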
The above examples describe only a few embodiments of the invention in detail and are not to be construed as limiting its scope. Those skilled in the art can make several variations and modifications without departing from the spirit of the invention, all of which fall within its scope of protection. Accordingly, the scope of protection of the invention is determined by the appended claims.