CN110673486A - Multi-spacecraft pursuit and escape control method based on dynamic game theory - Google Patents

Multi-spacecraft pursuit and escape control method based on dynamic game theory Download PDF

Info

Publication number
CN110673486A
CN110673486A CN201911003658.7A CN201911003658A CN110673486A CN 110673486 A CN110673486 A CN 110673486A CN 201911003658 A CN201911003658 A CN 201911003658A CN 110673486 A CN110673486 A CN 110673486A
Authority
CN
China
Prior art keywords
spacecraft
equation
game
model
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911003658.7A
Other languages
Chinese (zh)
Other versions
CN110673486B (en
Inventor
师鹏
徐添
邓忠民
张冉
赵育善
王逍
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Beijing University of Aeronautics and Astronautics
Original Assignee
Beijing University of Aeronautics and Astronautics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Aeronautics and Astronautics filed Critical Beijing University of Aeronautics and Astronautics
Priority to CN201911003658.7A priority Critical patent/CN110673486B/en
Publication of CN110673486A publication Critical patent/CN110673486A/en
Application granted granted Critical
Publication of CN110673486B publication Critical patent/CN110673486B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B13/00Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
    • G05B13/02Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
    • G05B13/04Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
    • G05B13/042Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Automation & Control Theory (AREA)
  • Processing Or Creating Images (AREA)

Abstract

The invention relates to a multi-spacecraft pursuit and escape control method based on a dynamic game theory, which is characterized in that according to the space environment of a spacecraft, all perturbation forces are ignored, a multi-spacecraft dynamics model, also called a dynamics equation, is established in a gravity field at the center of the earth, and the model comprises three roles of a target spacecraft, a defense spacecraft and an attack spacecraft; determining a payment function of the spacecraft and a termination target set of a pursuit and escape game according to the role relationship of each spacecraft in the dynamic model, thereby establishing a multi-spacecraft game model; processing by using a dynamic game theory to obtain a two-point edge value equation, wherein the two-point edge value equation comprises a differential equation of a co-modal vector, a new kinetic equation represented by the co-modal vector and a cross-section equation of time; aiming at the two-point edge value equation obtained from the established two-point edge value problem, the solution of the equation is obtained by solving by a method combining a particle swarm algorithm and a nonlinear programming method, so that the optimal control strategy and the optimal track for the spacecraft to pursue are obtained.

Description

Multi-spacecraft pursuit and escape control method based on dynamic game theory
Technical Field
The invention relates to a multi-spacecraft pursuit and escape control method based on a dynamic game theory, and belongs to the technical field of spacecraft control.
Background
Similar to ground pursuit, for pursuit or counterpursuit purposes, a relative motion scene in which at least two spacecrafts participate under the action of control force is called space pursuit of the spacecrafts. The space pursuit technology plays an important role in the aspects of space safety of the spacecraft, on-orbit service and the like. Due to the inconsistency of task targets among the spacecrafts, the control strategy of the spacecrafts can be analyzed by adopting the idea of non-cooperative dynamic game. So-called dynamic gaming (ISAACS r. differential Games [ M ]. New York: John Wiley and Sons,1965:1-5.) is that during the course of a participant's confrontation, at least one participant can use state information from previous processes to determine the specific action at the current moment, and if the goals between the participants are not exactly the same, it is uncooperative. Since the idea of the game theory can simultaneously consider the control quantity of a plurality of spacecrafts, the research on the application of the game theory in pursuing escape problems is concerned by a plurality of scholars.
Hayon (HAYOUN S Y, SHIMA T.A two-on-one input impulse-evaluation with bound controls [ J ]. Journal of Optimization principles and Applications,2017,174(3):837 and 857.) based on linear model studies two tracing parties, one escape party' S multiple chase one form problem, gives a closed-loop control strategy under the condition of control bound;
rusnak (RUSNAK I. the lady, the bases and the body guard-a two times technical gain [ J ]. IFAC Proceedings Volumes,2005,38(1):441 and 446.) when studying the game problem of multi-person participation, the participant is divided into two time courses, the defense period comprises a protector and a guarded, the attack period is an attacker, the attacker considers that the discussion can be carried out under two time courses, the problem is analyzed by taking the possible interception of the protector to the attacker as a node, and the solution by using the multi-objective optimization method is provided. Yifang Liu (LIU Y F, Li R F, WANG SQ. objective Three-dimensional spatial using semi-direct interaction with non-linear programming [ C ]// Proceedings of 20162 nd International Conference on Control Science and Systems Engineering, Piscataway, NJ: IEEE Press,2016:217-222.) et al propose an escape game model that includes Three spacecrafts, define a termination target set and a spacecraft index function applicable to the model, and propose a solution model using Pontani (CONWAY B A, PONTANI M. numerical solution of the Three-dimensional objective output-space [ J ] of the Journal of the spacecraft and the optimal trajectory solving method of the trajectory in the equation of 2. the method of solving the spacecraft and the optimal trajectory in the equation of 2. the equation of Joonass 2. the method of solving the spacecraft and the optimal trajectory in the equation of the spacecraft. The linear model adopted by Hayon is used for processing the situation of relatively close distance, and although the game of multiple spacecrafts does not contain defense spacecrafts, the role is limited; although the Rusnak model comprises three pursuing and preventing parties, the proposed method for analyzing under two time histories can lead to complex index function situations and is complicated in the using process; the Yifong Liu et al model is a simplified version of a multi-spacecraft game model because the protected object, namely the target spacecraft, in the model has no power, and the relative position relationship of the terminals of the three spacecrafts is used as an index function to simplify the solving process, but the index function and the terminal target set are repeated.
In a word, the existing model for researching multi-spacecraft pursuit is not complete, the multi-spacecraft game is split into a plurality of double-spacecraft games, modeling solving is carried out under a plurality of time courses, the model is complex, most of solving strategies need initial value guessing, and convergence is difficult to guarantee.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the multi-spacecraft pursuit control method based on the dynamic game theory is simple in form compared with the traditional model, the solving method does not need initial value guessing, and the obtained control law can effectively realize interception or tracking.
The technical scheme of the invention is as follows: a multi-spacecraft pursuit escape control method based on a dynamic game theory comprises the following steps:
the method comprises the steps that firstly, according to the space environment of the orbit height of a spacecraft, all perturbation forces are ignored, a multi-spacecraft dynamics model, also called a dynamics equation, is established in an earth central gravitational field, and the model comprises three roles of a target spacecraft, a defense spacecraft and an attack spacecraft; determining a payment function of the spacecraft and a termination target set of a pursuit and escape game according to the role relationship of each spacecraft in the dynamic model, thereby establishing a multi-spacecraft game model;
secondly, processing the multi-spacecraft game model established in the first step by using a dynamic game theory to obtain a two-point boundary value equation, wherein the two-point boundary value equation comprises a differential equation of a co-modal vector, a new kinetic equation expressed by the co-modal vector and a cross-section equation of time;
and thirdly, solving the two-point edge value equation obtained in the second step by combining a particle swarm algorithm and a nonlinear programming method aiming at the two-point edge value problem established in the second step to obtain a solution of the equation, so as to obtain an optimal control strategy and an optimal track for the spacecraft to pursue.
In the first step, the game model of the multi-spacecraft is as follows:
(1) the dynamic equation of the spacecraft is:
Figure BDA0002242080510000033
wherein, i ═ D, E, A stand for defending spacecraft, escaping spacecraft and attacking spacecraft respectively, mu ═ 3.986005 × 1014m3/s2As the gravitational constant, r represents the spacecraft vectorThe mode of the diameter, x, y and z are three-axis position components of the spacecraft, T represents the unit mass control quantity of the spacecraft, theta and alpha represent the control direction angle of the spacecraft, and the specific direction is defined as follows: theta is an included angle between the controlled variable and the direction of the z axis under the inertial system, and alpha is an included angle between the projection of the controlled variable on the x-y plane and the y axis.
(2) According to the role relationship of the spacecraft, the payment function is confirmed as follows:
Figure BDA0002242080510000034
wherein, t0,tfRepresents the start and end time of the game, rAEAnd rDARespectively representing the relative distance of the attacking and target spacecraft and the distance between the defending and attacking spacecraft, k1And k3Weight, k, representing payment2And k4Representing the rate of change of payment with distance.
(3) The target set of spacecraft is as follows:
ψ(rE(tf),rD(tf),rA(tf),tf)=F11)F22)=0
Figure BDA0002242080510000035
I1={(rA(tf),rE(tf))|||rA(tf)-rE(tf)||<ε1}
I2={(rD(tf),rA(tf))|||rD(tf)-rA(tf)||<ε2}
wherein r isi(tf) Terminal position vector, ξ, representing spacecraft i12Respectively representing combinations of terminal positions of attack-target spacecraft and attack-defense spacecraft, I1,I2Set of terminal positions of attack-tolerant target spacecraft and set of terminal positions of attack-tolerant defense spacecraft to satisfy acquisition relationship, epsilon1≥0,ε2And more than or equal to 0 respectively represents the capture range of the attacking spacecraft and the defending spacecraft.
The two-point edge value equation of the second step is as follows:
Figure BDA0002242080510000041
Figure BDA0002242080510000042
x(tf) Should satisfy the terminal target set
Wherein x represents the position and velocity vector of the spacecraft, λ is the co-modal vector, t is time, f (-) is a new spacecraft dynamical equation represented by the co-modal vector,representing a collaborative equation, and deriving from game theory, psi' is a boundary form of a terminal target set, and x (t)f) Representing the terminal value of the state.
The third step is specifically realized as follows:
(1) the method comprises the following steps of obtaining a rough solution of a two-point boundary value equation by utilizing a particle swarm algorithm, wherein the fitness function of the algorithm is as follows:
Figure BDA0002242080510000045
wherein, ciBoundary constraint and time-transverse condition, k, representing the two-point boundary equation in the second stepi> 0 denotes the weight coefficient of each constraint.
Particle renewal was as follows:
Figure BDA0002242080510000046
Figure BDA0002242080510000047
Figure BDA0002242080510000048
wherein w represents an inertial weight, wmax,wminRepresenting the upper and lower bounds of the inertial weight, n representing the current evolutionary algebra, M representing the total evolutionary algebra, c1,c2Represents the acceleration constant, r1,r2Is a random number from 0 to 1 and,
Figure BDA0002242080510000049
is the individual extremum in the d dimension of the ith example, gdIs a global extremum in the d-th dimension.
(2) And (3) performing fine optimization on the coarse solution of the two-point boundary value equation by using a nonlinear programming tool box, wherein the fitness functions of the objective function and the particle swarm function are the same, so that the numerical solution of the equation is obtained.
(3) And (4) bringing the solution back to an initial value problem equation, and solving an optimal control strategy and an optimal track for the spacecraft to pursue.
Compared with the prior art, the invention has the advantages that:
(1) according to the invention, the three-dimensional space multi-spacecraft game model is established under a unified time history, and a simpler and clearer payment function form can be obtained compared with a multi-time history model, so that formula deduction under different time histories is omitted, and the model is easier to understand.
(2) The payment function adopted by the invention is based on the instantaneous relative position relationship of the spacecraft, so that the target party and the defense party can be effectively matched, and the model establishment is closer to the reality.
(3) The invention omits the initial guess of the solving result and improves the convergence of the solving.
Drawings
FIG. 1 is a flow chart of a method implementation of the present invention;
FIG. 2 is a schematic diagram of what coordinate system is according to the present invention;
FIG. 3 is a diagram of the spacecraft motion trajectory of the present invention;
FIG. 4 is a diagram of the relative distance of the defense spacecraft and the target spacecraft of the present invention;
FIG. 5 is a diagram of the relative distances of the attacking and defending spacecraft of the present invention;
fig. 6 shows a spacecraft control azimuth angle in accordance with the present invention.
Detailed Description
The present invention will be described in detail below with reference to the accompanying drawings and examples.
As shown in fig. 1, the invention relates to a multi-spacecraft pursuit control method based on a dynamic game theory, which is used for establishing a game model with a maneuvering capability of a target spacecraft under a uniform time history aiming at a game system comprising three spacecrafts, wherein a control strategy and an optimal trajectory of the spacecraft are obtained by adopting a new index function and a solving method in the model, and the method comprises the following steps:
the invention establishes a multi-spacecraft dynamics model in an earth central gravity field, gives a payment function form of each spacecraft and a termination target set of a pursuit escape game, obtains a first-order condition and a second-order condition of each spacecraft control strategy by using necessary conditions existing in saddle point solution, converts the game problem into a two-point boundary value equation, and solves the solution of the equation by using a method combining a particle swarm algorithm and a nonlinear programming method, thereby obtaining an optimal control strategy and an optimal track of the pursuit escape of the spacecraft.
1. Spacecraft attack and defense model
The model comprises a target spacecraft, an attack spacecraft and a defense spacecraft, wherein the target spacecraft is a capture target of the attack spacecraft, and the defense spacecraft attempts to intercept the attack spacecraft in order to protect the target spacecraft. And the game is terminated when the attacking spacecraft successfully captures the target spacecraft or the defending spacecraft intercepts the attacking spacecraft.
1.1 kinetic model
Establishing a dynamic model of the spacecraft in a spring equinox geocentric inertial coordinate system: as shown in FIG. 2, the origin of coordinates is located at the center of the earth, with the X-axis pointing in the direction of the spring's point and the Z-axis perpendicular to the equatorial plane pointing in the North Pole direction.
Only the gravity of the earth center is considered, all perturbation and the mass consumption of the spacecraft are ignored, and the dynamic equation form of the spacecraft is obtained as follows:
Figure BDA0002242080510000051
where i ═ E, D, a represents the target, defense and attack spacecraft. mu-3.986005X 1014m3/s2The method comprises the following steps of defining parameters of a control quantity as an earth gravity constant, u as the control quantity of a unit mass, T, theta and alpha, wherein T represents the magnitude of the control quantity, theta and alpha represent control direction angles of a spacecraft, and the specific directions are defined as follows: theta is an included angle between the controlled variable and the direction of the z axis under the inertial system, and alpha is an included angle between the projection of the controlled variable on the x-y plane and the y axis. The spacecraft control quantity T is assumed to be constant in the whole game time, so that the control parameter of the game is the direction angle of thrust.
If the control quantity of the target spacecraft is always larger than that of the attack spacecraft, the target spacecraft cannot be captured as long as the control direction is proper, and the terminal target set loses meaning, so the following assumptions are made:
TD>TA>TE(2)
the kinetic equation can be developed as:
Figure BDA0002242080510000061
wherein x, y, z are the three-axis position components of the spacecraft. The state quantities of the three spacecrafts are expressed in a unified state as follows:
Figure BDA0002242080510000062
wherein any one of the components
Figure BDA0002242080510000063
The above kinetic equation can be described as:
Figure BDA0002242080510000064
1.2 terminating the target set
Considering duration t of a gamefUnknown, so it is necessary to determine a terminal target set, and when the relative position between the spacecrafts meets the set requirement, the game is terminated, and the target set is established by the relative position relationship of the terminals:
ψ(rE(tf),rD(tf),rA(tf),tf)=F1(ξ)F2(ξ)=0 (5)
wherein the function used is defined as:
Figure BDA0002242080510000065
in the formula, ri(tf) Terminal position vector, ξ, representing spacecraft i12Respectively representing combinations of terminal positions of attack-target spacecraft and attack-defense spacecraft, I1,I2Set of terminal positions of attack-tolerant target spacecraft and set of terminal positions of attack-tolerant defense spacecraft to satisfy acquisition relationship, epsilon1≥0,ε2And > 0 represents the capture radius of the attacking spacecraft and the defending spacecraft. From the above equation, the terminal time definition of the countermeasures can be obtained:
Figure BDA0002242080510000071
2. spacecraft pursuit strategy solving
2.1 Payment function of people in bureau
Payment functions are the targets that each spacecraft expects to achieve, and commonly used payment functions include terminal distance constraints, countermeasure time, fuel consumption and the like. However, because the invention researches the multi-spacecraft game, the countermeasure time is not a contradiction point among the spacecrafts, and because the thrust of the spacecraft per unit mass is assumed to be constant in the game process, the fuel consumption and the countermeasure time are consistent in nature, and the terminal distance constraint and the terminal target set are repeated in action and form, so that the conventional pay function cannot meet the game model of the multi-spacecraft.
In the invention, indexes containing the instantaneous position relation between spacecrafts are used as payment functions, and the following instantaneous payment forms are firstly defined:
Figure BDA0002242080510000072
in the formula, rAEAnd rDARespectively representing the relative distance of the attacking and target spacecraft and the distance between the defending and attacking spacecraft, k1And k3Represents a paid weight representing the measure of the attacking spacecraft as it is away from the defending spacecraft or continues to track the target spacecraft, k2And k4Representing the rate of change of payment with distance.
In the game process, the attack spacecraft evaluates the relative position relation between the target spacecraft and the defense spacecraft, so as to select whether to capture the target spacecraft preferentially or avoid being intercepted preferentially. On the contrary, the target spacecraft and the defense spacecraft hope to cooperate to complete the capture of the attack spacecraft, at least to delay the capture of the target spacecraft, and the pay function of the game is expressed as:
Figure BDA0002242080510000073
therefore, a complete three-spacecraft pursuit prevention model is defined, and in order to apply the dynamic game theory to process the problem, a Nash equilibrium solution and the necessary conditions for the Nash equilibrium solution are required to be introduced to convert the game problem into a two-point boundary value equation.
2.2 Nash equilibrium solution and constraint processing
In a single-target game involving n players, with siIndicates the respective benefit, c ═ c1,c2,···,ci,···,cn) Represents a set of control strategy combinations, wherein ciRepresenting the control strategy selected by the ith participant, C representing the set of selectable control strategy combinations, then a set of strategy combinations
Figure BDA0002242080510000074
Is a Nash equilibrium, if and only if
Figure BDA0002242080510000075
Comprises the following steps:
in the formula, c-iRepresenting the control strategy of other participants than the ones in the ith office,
Figure BDA0002242080510000081
representing the optimal strategy selection for the person in the ith office.
To obtain a control strategy for a spacecraft, a hamiltonian is introduced:
H=L+λTf(x,u,t)=L+λA TfA(xA,uA,t)+λD TfD(xD,uD,t)+λE TfE(xE,uE,t) (11)
wherein, λ represents a co-modal vector, and according to the necessary condition existing in the saddle point, the differential equation of the co-modal vector is:
Figure BDA0002242080510000082
the three spacecraft collaborative states can be analyzed through an equationThe results of the vector differential equations are similar in form, the components of the collaborative vectors of each spacecraft in three axes are also similar, and only the differential equation of the collaborative vectors of the defense spacecraft is solved below, so that H is the1=L,H2=λTf(x,u,t),
Figure BDA0002242080510000083
Figure BDA0002242080510000084
Then there are:
Figure BDA0002242080510000085
Figure BDA0002242080510000086
it can be seen that the co-modal vector is a function of the state quantity, which is noted as:
Figure BDA0002242080510000087
and (3) the final value of the co-modal vector is related to the terminal target set, and the terminal constraint represented by the formula (5) is converted as follows:
ψ′=(||rA(tf)-rE(tf)||-ε1)·(||rD(tf)-rA(tf)||-ε2)=0 (16)
the above form is a boundary form of the terminal target, which means that when the distance between the attacking spacecraft and the target spacecraft is exactly epsilon1Or the distance between the attacking spacecraft and the defending spacecraft is exactly epsilon2. According to the game theory, the magnitude of the final value of the covariance vector along the external normal direction of the boundary of the terminal target set is represented by the partial derivative of the boundary of the target set:
Figure BDA0002242080510000091
similarly, the covariance end values of the spacecraft are similar, wherein the covariance end value of the attacking spacecraft is:
Figure BDA0002242080510000092
in the same way, the coordination final values of the other two spacecrafts can be obtained, and the following rules are observed and found:
λP,x(tf)+λE,x(tf)+λD,x(tf)=0
λP,y(tf)+λE,y(tf)+λD,y(tf)=0 (19)
λP,z(tf)+λE,z(tf)+λD,z(tf)=0
for the saddle-point strategy solution, it must be satisfied:
Figure BDA0002242080510000093
if the control has no constraint requirement, the first order condition for the optimum control of the spacecraft can be obtained by the equation:
Figure BDA0002242080510000094
suppose sin θiNot equal to 0, then there are
Figure BDA0002242080510000095
By substituting an arbitrary α into equation (21), two θ can be obtained, thereby obtaining four combinations of steering angles, so it is necessary to obtain a second-order condition of the steering equation:
Figure BDA0002242080510000101
Figure BDA0002242080510000102
Figure BDA0002242080510000103
Figure BDA0002242080510000104
for an attacking spacecraft, its hessian matrix h should satisfy: h is more than or equal to 0; the defense spacecraft and the target spacecraft need to meet: h is less than or equal to 0. At any instant, only a pair of combinations of control direction angles satisfy the first and second order conditions simultaneously. The obtained control amount equation is recorded as:
Figure BDA0002242080510000105
in the case where formula (4) is substituted by formula (22), the following are provided:
Figure BDA0002242080510000106
the state at the initial time is given as:
ψ0(xA,0,xD,0,xE,0,t0)=0 (24)
the countermeasure time is free, and then the time cross-section condition is:
Figure BDA0002242080510000107
the dynamic game problem involving three space vehicles is converted to a two-point equation by equations (15), (17), (23) - (25). In this equation, the unknown variables comprise the state vectors x of the three spacecraftA,xD,xEAnd their control quantities uA,uD,uECo-modal vector λADEAnd a game time tf
3. Solving strategy based on particle swarm and downhill simplex method
For a multi-spacecraft game model, whether the result is converged or not and the precision of the result is very sensitive to the initial value of the co-modal vector lambda, so the method adopts a particle swarm algorithm which is an optimization algorithm without giving out an initial guess value. However, the calculation accuracy of the particle swarm is limited by the population number and the evolution times, and the calculation accuracy is considered by integrating time and accuracy, so that excessive particles and algebras cannot be adopted, so that the convergence accuracy cannot be ensured by the algorithm under certain conditions, and further optimization is further performed by adopting a downhill simplex method on the basis of the particle swarm algorithm, and the method is a nonlinear programming method under the condition of processing no constraint. In the process of solving the established model, the downhill simplex method can directly adopt an index function of a particle swarm algorithm, so that the time and the difficulty of constraint processing are saved.
1) Fitness function
In the established two-point boundary value equation, the related constraint summation is taken as a fitness function of the particle swarm optimization:
Figure BDA0002242080510000111
wherein, ciThe constraints are represented and are composed of equations (16), (17) and (25). k is a radical ofi> 0 represents the weight of each coefficient, ncRepresenting the number of constraints.
2) Arrangement of particles
The dimension of the particle comprises three spacecraft covariance initial values (6 x 3) and a game time tfIn total 19 dimensions. The limit value of each dimension of the particle cannot be determined, the co-modal vector is processed in a unitization mode, and the particle swarm optimization adopts a dynamic interval mode to process: initial guess interval (a) for a given particlei,bi) Wherein i ═ 1,2, ·,19 denotes the particle dimension, N particles are randomly generated in this interval, and if some element j of the current globally optimal particle exceeds (a) in the particle updating processj,bj) Taking expansion factor gamma > 1, expanding the interval to obtain new element limit (gamma a)j,γ*bj) And regenerating M particles under the limit, selecting N particles with higher fitness from the new N + M particles, and continuously iterating to correct the limit range.
For element tfSetting a small quantity delta as a time lower limit, wherein the limit is not changed in the iteration process, and the following steps are carried out:
4. simulation setup
In order to verify the effectiveness of the established game model and the adopted solving strategy, the pursuit model comprising three spacecrafts is subjected to simulation calculation. Firstly, in order to avoid generating a sick matrix due to too large magnitude difference of each parameter of an equation, the equation is unitized, and the equator radius R of the earth is takenEFor a Distance Unit (DU), a Time Unit (TU) is defined as a value such that the earth's gravity constant μ is 1, and there are:
Figure BDA0002242080510000113
wherein g is the earth gravitational acceleration. The thrust vector of the spacecraft is TA=0.04g,TD=0.1g,TEThe acquisition range of the spacecraft was set to 0.01 g. Orbit element for initial state of spacecraftThe elements in the formula represent a semi-major axis of the orbit, eccentricity, inclination angle of the orbit, right ascension of the ascending intersection, amplitude angle of the perigee and angle of the perigee respectively. If the orbit is a circular orbit, the horizontal latitude amplitude angle u is represented by omega + M. The initial orbit elements of the spacecraft are:
TABLE 1 spacecraft initial orbit element
Figure BDA0002242080510000115
In the payment of the spacecraft, the values of all coefficients of the instantaneous payment are as follows: k is a radical of1=k2=k3k 41, the particle group size is 150, the iteration frequency is 200 generations, the simplex method is iterated 3000 times, and the precision requirement is 10-8
The invention establishes a three-dimensional space game model containing a plurality of spacecrafts, which divides the game process of the spacecrafts into a uniform time history and avoids the sectional representation of an index function. The analysis method based on the dynamic game can simultaneously consider the control strategies of a plurality of spacecrafts and is closer to the antagonism requirement in the scene.
The model adopts an index containing the instantaneous relative position information of the spacecraft, and the index promotes the connection between the target spacecraft and the defense spacecraft to form certain matching; the attacker is also prompted to evaluate the importance of determining the two sets of games.
In the process of solving the model, the method can adopt a mode of combining the particle swarm algorithm and the downhill simplex to calculate, modifies the traditional particle swarm algorithm in the method for processing the particle boundary, uses the form of a dynamic interval, and avoids strict guess of the particle interval.
The simulation results are shown in fig. 3 and 4: as can be seen from fig. 3, due to the setting of the initial state and the control condition, the defending spacecraft successfully intercepts the attacking spacecraft, the interception time is 856.9s, and the relative distance of the terminal is 1.16 m. Fig. 4 shows the distance change between the target spacecraft and the defense spacecraft when the target spacecraft makes the optimal maneuver and natural motion, and it can be seen that the distance between the target spacecraft and the defense spacecraft is closer under the optimal maneuver, because the target spacecraft can give a greater payment function pressure to the attack spacecraft by approaching the defense spacecraft under the condition of ensuring the self-security, thereby forming an effective cooperation with the attack spacecraft.
Fig. 5 shows the relative distance change between the attacking and defending spacecraft, and it can be seen that the three-axis components all converge almost to 0. Fig. 6 is a time history of the three spacecraft control direction angles, the curve is generally smooth, there is a sudden change near the game termination time, because the co-modal vector with respect to speed is close to zero at this time, and the direction angle solved according to equation (21) has a singularity at this time, which should be solved using the law of lobida.
In a word, the invention constructs a game model under the same time course through the analysis of the problem of the spacecraft pursuit and prevention, solves a multilateral optimal control problem by utilizing the dynamic game theory analysis, provides optimal control strategies and optimal motion tracks which accord with the benefits of the spacecraft in the model, has simple model and fitting reality, does not need initial guess in the solving method, and has wide adaptability.
Although particular embodiments of the present invention have been described above, it will be appreciated by those skilled in the art that these are merely examples and that many variations or modifications may be made to these embodiments without departing from the principles and implementations of the invention, the scope of which is therefore defined by the appended claims.

Claims (4)

1. A multi-spacecraft pursuit escape control method based on a dynamic game theory is characterized by comprising the following steps:
firstly, establishing a dynamic model of a plurality of spacecrafts in an earth center gravitational field, wherein the model is also called a dynamic equation and comprises three roles of a target spacecraft, a defense spacecraft and an attack spacecraft; determining a payment function of the spacecraft and a termination target set of a pursuit and escape game according to the role relationship of each spacecraft in the dynamic model, thereby establishing a multi-spacecraft game model;
secondly, processing the multi-spacecraft game model established in the first step by using a dynamic game theory to obtain a two-point boundary value equation, wherein the two-point boundary value equation comprises a differential equation of a co-modal vector, a new kinetic equation expressed by the co-modal vector and a cross-section equation of time;
and thirdly, solving the two-point edge value equation obtained in the second step by combining a particle swarm algorithm and a nonlinear programming method aiming at the two-point edge value problem established in the second step to obtain a solution of the equation, so as to obtain an optimal control strategy and an optimal track for the spacecraft to pursue.
2. The multi-spacecraft escape pursuit control method based on the dynamic game theory as claimed in claim 1, characterized in that: in the first step, the multi-spacecraft gaming model is as follows:
(1) the dynamic equation of the spacecraft is:
Figure FDA0002242080500000011
Figure FDA0002242080500000012
Figure FDA0002242080500000013
wherein i ═ D, E, a respectively represent the defense spacecraft, the target spacecraft and the attack spacecraft, μ ═ 3.986005 × 1014m3/s2The method is characterized in that the method is an earth gravity constant, r represents a mode of a radial of a spacecraft, x, y and z are three-axis position components of the spacecraft, T represents the unit mass control quantity of the spacecraft, theta and alpha represent control direction angles of the spacecraft, and the specific directions are defined as follows: theta is an included angle between the controlled variable and the direction of the z axis under the inertial system, and alpha is an included angle between the projection of the controlled variable on the x-y plane and the y axis;
(2) according to the role relationship of each spacecraft in the dynamic model, the payment function is as follows:
wherein, t0,tfRepresents the start and end time of the game, rAEAnd rDARespectively representing the relative distance of the attacking and target spacecraft and the distance between the defending and attacking spacecraft, k1And k3Weight, k, representing payment2And k4Represents the rate of change of payment with distance;
(3) the set of termination targets for the spacecraft is as follows:
ψ(rE(tf),rD(tf),rA(tf),tf)=F11)F22)=0
Figure FDA0002242080500000021
Figure FDA0002242080500000022
I1={(rA(tf),rE(tf))|||rA(tf)-rE(tf)||<ε1}
I2={(rD(tf),rA(tf))|||rD(tf)-rA(tf)||<ε2}
wherein r isi(tf) Terminal position vector, ξ, representing spacecraft i12Respectively representing combinations of terminal positions of attack-target spacecraft and attack-defense spacecraft, I1,I2Set of terminal positions of attack-tolerant target spacecraft and set of terminal positions of attack-tolerant defense spacecraft to satisfy acquisition relationship, epsilon1≥0,ε2More than or equal to 0 respectively represents the capture range of the attacking spacecraft and the defending spacecraft;
and a multi-spacecraft game model is formed by a kinetic equation, a payment function and a termination target set.
3. The multi-spacecraft escape pursuit control method based on the dynamic game theory as claimed in claim 1, characterized in that: in the second step, the two-point edge value equation obtained by the two-point edge value problem comprises the following steps:
differential equation of the co-modal vector:
Figure FDA0002242080500000023
wherein x represents the position and velocity vector of the spacecraft, the final value needs to meet the termination target set in the first step, lambda is a co-modal vector, and t is time;
new kinetic equations represented by the co-modal vectors:
Figure FDA0002242080500000024
the equation is derived by jointly deriving a kinetic equation and a game theory in the first step;
cross-sectional equation of time:
Figure FDA0002242080500000025
wherein, x (t)f) The terminal value representing the state, ψ' is a boundary form of the termination target set, expressed as follows:
ψ′=(||rA(tf)-rE(tf)||-ε1)·(||rD(tf)-rA(tf)||-ε2)=0
wherein r isi(tf) Representing the terminal position vector, ε, of a spacecraft i1≥0,ε2More than or equal to 0 respectively represents the capture range of the attacking spacecraft and the defending spacecraft;
the above equations together form a two-point edge value equation.
4. The multi-spacecraft escape pursuit control method based on the dynamic game theory as claimed in claim 1, characterized in that: the third step is specifically realized as follows:
(1) the method comprises the following steps of obtaining a rough solution of a two-point boundary value equation by utilizing a particle swarm algorithm, wherein the fitness function of the algorithm is as follows:
Figure FDA0002242080500000031
wherein, ciBoundary constraint and time-transverse condition, k, representing the two-point boundary equation in the second stepi> 0 represents the weight coefficient of each constraint;
particle renewal was as follows:
Figure FDA0002242080500000032
Figure FDA0002242080500000033
Figure FDA0002242080500000034
wherein w represents an inertial weight, wmax,wminRepresenting the upper and lower bounds of the inertial weight, n representing the current evolutionary algebra, M representing the total evolutionary algebra, c1,c2Represents the acceleration constant, r1,r2Is a random number from 0 to 1 and,
Figure FDA0002242080500000035
is the individual extremum of the ith particle in the d dimension, gdIs a global extremum in the d-th dimension;
(2) utilizing a nonlinear programming tool box to carry out fine optimization on the rough solution of the two-point boundary value equation, wherein the target function is the same as the fitness function of the particle swarm function, thereby obtaining the numerical solution of the equation, namely the solution is the initial value lambda of the covariance vector lambda0
(3) The solution takes the initial equation, which is in the form:
Figure FDA0002242080500000037
x(0)=x0,λ(0)=λ0
wherein x represents the position and velocity vector of the spacecraft, λ is the co-modal vector, t is time, x0Is the initial position and velocity, λ, of the spacecraft0I.e. the numerical solution of the equation of two-point edge values, f (-) i.e. the new formThe kinetic equation of the formula (I) is,
Figure FDA0002242080500000038
namely a collaborative equation;
thereby numerically solving the optimal control strategy theta for spacecraft pursuiti、αiAnd an optimal trajectory xi
Wherein, theta and alpha are control direction angles of the spacecraft, and i represents a defense spacecraft, a target spacecraft and an attack spacecraft.
CN201911003658.7A 2019-10-22 2019-10-22 Multi-spacecraft pursuit and escape control method based on dynamic game theory Active CN110673486B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911003658.7A CN110673486B (en) 2019-10-22 2019-10-22 Multi-spacecraft pursuit and escape control method based on dynamic game theory

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911003658.7A CN110673486B (en) 2019-10-22 2019-10-22 Multi-spacecraft pursuit and escape control method based on dynamic game theory

Publications (2)

Publication Number Publication Date
CN110673486A true CN110673486A (en) 2020-01-10
CN110673486B CN110673486B (en) 2021-03-09

Family

ID=69083467

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911003658.7A Active CN110673486B (en) 2019-10-22 2019-10-22 Multi-spacecraft pursuit and escape control method based on dynamic game theory

Country Status (1)

Country Link
CN (1) CN110673486B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111679592A (en) * 2020-06-22 2020-09-18 中国人民解放军国防科技大学 Spacecraft pursuit and escape game closed-loop semi-physical simulation system and method
CN113190033A (en) * 2021-05-20 2021-07-30 北京理工大学 Method for quickly judging rendezvous in spacecraft flight game
CN113221365A (en) * 2021-05-20 2021-08-06 北京理工大学 Method for processing fuel consumption constraint in spacecraft flight game
CN113282097A (en) * 2021-06-04 2021-08-20 中国人民解放军战略支援部队航天工程大学 Control method for non-spherical perturbation error of relative position of GEO game spacecraft
CN113353289A (en) * 2021-04-25 2021-09-07 北京控制工程研究所 Autonomous driving and separating method and device for space game and storage medium
CN113552872A (en) * 2021-01-28 2021-10-26 北京理工大学 Pursuit and escape game decision method for chaser at different speeds
CN114415730A (en) * 2022-03-21 2022-04-29 南京航空航天大学 Intelligent planning method for escape trajectory of spacecraft
CN116258317A (en) * 2022-12-09 2023-06-13 哈尔滨工业大学 Satellite formation anti-monitoring optimal control method, device and medium based on game theory
CN116449714A (en) * 2023-04-20 2023-07-18 四川大学 Multi-spacecraft pursuit Game track control method
CN117332684A (en) * 2023-09-25 2024-01-02 同济大学 Optimal capturing method under multi-spacecraft chase-escaping game based on reinforcement learning
CN118052150A (en) * 2024-04-11 2024-05-17 中国科学院空天信息创新研究院 Spacecraft chase game strategy solving method, device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006060214A2 (en) * 2004-11-18 2006-06-08 The Regents Of The University Of California Apparatus and methods for manipulation and optimization of biological systems
US20060241824A1 (en) * 2005-01-06 2006-10-26 Junichiro Kawaguchi Method of injecting plurality of spacecrafts into different orbits individually
CN107613561A (en) * 2017-10-21 2018-01-19 桂林理工大学 A kind of method for three-dimensionally positioning network node of wireless sensor based on exchange SCE PSO algorithms
CN108519958A (en) * 2018-02-05 2018-09-11 中国人民解放军国防科技大学 Method for analyzing and constructing spacecraft pursuit escape boundary grating and judging and capturing escape area
CN109085754A (en) * 2018-07-25 2018-12-25 西北工业大学 A kind of spacecraft neural network based is pursued and captured an escaped prisoner game method
CN109238287A (en) * 2018-09-06 2019-01-18 中国人民解放军国防科技大学 Spacecraft escape path planning method and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006060214A2 (en) * 2004-11-18 2006-06-08 The Regents Of The University Of California Apparatus and methods for manipulation and optimization of biological systems
US20060241824A1 (en) * 2005-01-06 2006-10-26 Junichiro Kawaguchi Method of injecting plurality of spacecrafts into different orbits individually
CN107613561A (en) * 2017-10-21 2018-01-19 桂林理工大学 A kind of method for three-dimensionally positioning network node of wireless sensor based on exchange SCE PSO algorithms
CN108519958A (en) * 2018-02-05 2018-09-11 中国人民解放军国防科技大学 Method for analyzing and constructing spacecraft pursuit escape boundary grating and judging and capturing escape area
CN109085754A (en) * 2018-07-25 2018-12-25 西北工业大学 A kind of spacecraft neural network based is pursued and captured an escaped prisoner game method
CN109238287A (en) * 2018-09-06 2019-01-18 中国人民解放军国防科技大学 Spacecraft escape path planning method and system

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
YE, D 等: "Satellite proximate interception vector guidance based on differential games", 《CHINESE JOURNAL OF AERONAUTICS》 *
刘胜男: "多对多卫星拦截的任务规划与轨道优化", 《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》 *
吴其昌 等: "基于生存型微分对策的航天器追逃策略及数值求解", 《控制与信息技术》 *

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111679592B (en) * 2020-06-22 2023-04-07 中国人民解放军国防科技大学 Spacecraft pursuit and escape game closed-loop semi-physical simulation system and method
CN111679592A (en) * 2020-06-22 2020-09-18 中国人民解放军国防科技大学 Spacecraft pursuit and escape game closed-loop semi-physical simulation system and method
CN113552872A (en) * 2021-01-28 2021-10-26 北京理工大学 Pursuit and escape game decision method for chaser at different speeds
CN113552872B (en) * 2021-01-28 2024-05-14 北京理工大学 Method for controlling pursuit and escape of pursuit robot at different speeds
CN113353289B (en) * 2021-04-25 2022-12-13 北京控制工程研究所 Autonomous driving and separating method and device for space game and storage medium
CN113353289A (en) * 2021-04-25 2021-09-07 北京控制工程研究所 Autonomous driving and separating method and device for space game and storage medium
CN113221365A (en) * 2021-05-20 2021-08-06 北京理工大学 Method for processing fuel consumption constraint in spacecraft flight game
CN113190033A (en) * 2021-05-20 2021-07-30 北京理工大学 Method for quickly judging rendezvous in spacecraft flight game
CN113190033B (en) * 2021-05-20 2022-05-10 北京理工大学 Method for quickly judging rendezvous in spacecraft flight game
CN113282097B (en) * 2021-06-04 2022-07-29 中国人民解放军战略支援部队航天工程大学 Control method for relative position non-spherical perturbation error of GEO game spacecraft
CN113282097A (en) * 2021-06-04 2021-08-20 中国人民解放军战略支援部队航天工程大学 Control method for non-spherical perturbation error of relative position of GEO game spacecraft
CN114415730B (en) * 2022-03-21 2022-10-11 南京航空航天大学 Intelligent planning method for escape trajectory of spacecraft
CN114415730A (en) * 2022-03-21 2022-04-29 南京航空航天大学 Intelligent planning method for escape trajectory of spacecraft
CN116258317A (en) * 2022-12-09 2023-06-13 哈尔滨工业大学 Satellite formation anti-monitoring optimal control method, device and medium based on game theory
CN116258317B (en) * 2022-12-09 2023-10-31 哈尔滨工业大学 Satellite formation anti-monitoring optimal control method, device and medium based on game theory
CN116449714A (en) * 2023-04-20 2023-07-18 四川大学 Multi-spacecraft pursuit Game track control method
CN116449714B (en) * 2023-04-20 2024-01-23 四川大学 Multi-spacecraft pursuit game track control method
CN117332684A (en) * 2023-09-25 2024-01-02 同济大学 Optimal capturing method under multi-spacecraft chase-escaping game based on reinforcement learning
CN117332684B (en) * 2023-09-25 2024-04-26 同济大学 Optimal capturing method under multi-spacecraft chase-escaping game based on reinforcement learning
CN118052150A (en) * 2024-04-11 2024-05-17 中国科学院空天信息创新研究院 Spacecraft chase game strategy solving method, device and storage medium

Also Published As

Publication number Publication date
CN110673486B (en) 2021-03-09

Similar Documents

Publication Publication Date Title
CN110673486B (en) Multi-spacecraft pursuit and escape control method based on dynamic game theory
CN108549233B (en) Unmanned aerial vehicle air combat maneuver game method with intuitive fuzzy information
CN108827312B (en) Cooperative game path planning method based on neural network and artificial potential field
CN113093802B (en) Unmanned aerial vehicle maneuver decision method based on deep reinforcement learning
Kim et al. Look-angle-shaping guidance law for impact angle and time control with field-of-view constraint
CN106970530B (en) Model-free preset performance control method for autonomous sight intersection of space non-cooperative targets
CN114003050B (en) Active defense guidance method of three-body countermeasure strategy based on differential game
CN111367314B (en) Space non-cooperative target cooperative capturing method based on multi-spacecraft formation
Ryoo An autonomous control of fuzzy-PD controller for quadcopter
Kung et al. Missile guidance algorithm design using particle swarm optimization
Nikusokhan et al. Closed-form optimal cooperative guidance law against random step maneuver
CN115329594B (en) Large-scale missile cluster attack and defense confrontation simulation acceleration method and system
CN116680509A (en) Dynamic matching method for multi-spacecraft escape-tracking game task
CN116225049A (en) Multi-unmanned plane wolf-crowd collaborative combat attack and defense decision algorithm
CN116858039A (en) Hypersonic aircraft game guidance method, system, equipment and medium
CN113552872A (en) Pursuit and escape game decision method for chaser at different speeds
Moosapour et al. A novel nonlinear robust guidance law design based on SDRE technique
Menon et al. Guidance laws for spacecraft pursuit-evasion and rendezvous
CN115892519A (en) Spacecraft control method for short-distance spacecraft orbit pulse game
CN110162084A (en) Cruising missile group system formation control method based on congruity theory
Golestani et al. Fractional order sliding mode guidance law: Improving performance and robustness
Bahari et al. Intelligent fading memory for high maneuvering target tracking
Altaher et al. Intercepting a superior missile: Trajectory optimization approach to a pursuit-evasion game
Yang et al. Adaptive super-twisting sliding mode back-stepping control for hypersonic flight vehicle with impact angle constraint and autopilot dynamics
CN116222310B (en) Two-pair synchronous region coverage interception method based on RBF_G in three-dimensional space

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant