Disclosure of Invention
In order to solve the problems, the invention provides a collaborative control method and a collaborative control system for a convoy task based on an obstacle environment and bounded input, wherein the convoy task and an obstacle avoidance task are considered, NSB control is adopted in the design of an outer ring, and a generated speed vector is used as a reference value of the inner ring. In the inner ring, an IRPD-SMC technology based on an arctan function and an adaptive RBFNNS is provided, so that the robot is ensured to follow a reference track and zero steady-state error is realized.
In some embodiments, the following technical scheme is adopted:
the convoy task cooperative control method based on the obstacle environment and bounded input comprises the following steps:
describing a physical model of the convoy task by adopting a multi-Euler-Lagrange system;
an inner ring and outer ring control structure is adopted, the outer ring adopts a behavior control method based on a space, and expected speed and an expected motion track required by an inner ring physical model are generated; the inner ring is based on an improved proportional derivative sliding mode control (IRPD-SMC) method of an adaptive radial basis function neural network, so that each physical model can track expected speed and expected motion trail under the condition of interference and parameter uncertainty, and zero steady-state error and bounded input are realized.
The inner ring is based on the improved proportional derivative sliding mode control method of the adaptive radial basis function neural network, and the control law is as follows:
wherein k is
αiIs a position error dependent gain for canceling position errors; k is a radical of
βiIs a differential related gain used for predicting the overall response trend and preventing the system from acting too violently; λ is the approximate proportional gain; k is a radical of
iIs the robust term gain, κ is a constant;
is the reference torque ρ
iEstimated value of e
iIs a position tracking error,
Is the velocity tracking error, s
iIs a slip form surface.
In other embodiments, the following technical solutions are adopted:
a convoy mission cooperative control system based on obstacle environment and bounded input comprises: the controller adopts an inner ring and outer ring control structure, the outer ring adopts a behavior control method based on a space, and expected speed and an expected motion track required by an inner ring physical model are generated; the inner ring is based on an improved proportional derivative sliding mode control method of the adaptive radial basis function neural network, so that each physical model can track expected speed and expected motion trail under the condition of interference and parameter uncertainty, and zero steady-state error and bounded input are realized.
In other embodiments, the following technical solutions are adopted:
a terminal device comprising a processor and a computer-readable storage medium, the processor being configured to implement instructions; the computer readable storage medium stores a plurality of instructions adapted to be loaded by a processor and to perform the above-described obstacle environment and bounded input based convoy mission cooperative control method.
In other embodiments, the following technical solutions are adopted:
a computer-readable storage medium having stored therein a plurality of instructions adapted to be loaded by a processor of a terminal device and to execute the aforementioned obstacle environment and bounded input-based convoy mission cooperative control method.
Compared with the prior art, the invention has the beneficial effects that:
under the condition that model uncertainty, interference and obstacles exist, the coordination control problem of the multi-Euler Lagrange system under the bounded input escort task is researched; a robust hierarchical control structure consisting of NSB and IRPD-SMC is designed; in order to avoid obstacles and complete a convoying task, NSB control is adopted in the outer ring design, and the speed and the track required by the inner ring are generated; IRPD-SMC is proposed in inner loop design to track the desired trajectory and speed. Thus, the controller compensates for unknown disturbances and parameter uncertainty well, ensures bounded control inputs, and achieves fast convergence, robustness, and zero steady-state error. Finally, all robots in p-dimensional space can achieve uniform distribution around the target and robust delivery of the target at a specified distance while avoiding obstacles (where p ≧ 2 is a positive integer).
In practice, the actuators are limited in their ability and in many cases they may not be able to produce the desired torque large enough to cause a reduction in the performance of the control system. Therefore, the practical problem of bounded control input is considered, the input is ensured to be of limited amplitude, the performance of the control system can be ensured to be stable, and the control system can be really applied in practice.
The adaptive radial basis function neural network adopted in the design of the controller has strong learning capability and robustness, interference is well compensated, and zero steady-state error is finally realized.
The controller proposed by the present invention is simple, model-free and capable of providing a continuous control signal, making it easy to implement in practice.
Example one
In one or more embodiments, disclosed is a convoy mission cooperative control method based on an obstacle environment and bounded input, referring to fig. 1, including:
describing a physical model of the convoy task by adopting a multi-Euler-Lagrange system;
an inner ring and outer ring control structure is adopted, the outer ring adopts a behavior control method based on a space, and expected speed and an expected motion track required by an inner ring physical model are generated; the inner ring is based on an improved proportional derivative sliding mode control method of the adaptive radial basis function neural network, so that each physical model can track expected speed and expected motion trail under the condition of interference and parameter uncertainty, and zero steady-state error and bounded input are realized.
In order to avoid obstacles and form convoy formation, the embodiment adopts the behavior based on empty space (NSB) control in the design of the outer ring controller, and generates the speed required by the inner ring. In the design of an inner ring, a proportional derivative sliding mode control (IRPD-SMC) method based on an improved adaptive Radial Basis Function Neural Network (RBFNNs) has the following robustness under the condition of interference and parameter uncertainty, and realizes zero steady-state error and bounded input. Finally, all robots in p-dimensional space can achieve uniform distribution around the target and robust delivery of the target at a specified distance while avoiding obstacles (where p ≧ 2 is a positive integer).
The stability and convergence of the system are strictly proved by using the Lyapunov stability theory, and the effectiveness of the proposed control strategy is verified by comparing with PD-SMC, APD-SMC and ASMC simulation experiments in two-dimensional and three-dimensional space.
The method of the present embodiment will be described in detail below.
Consider a set of n mobile robots, whose dynamics can be described as an euler-lagrange system, expressed as:
wherein M is
i(q
i)∈R
p×pIs a positive definite inertia matrix, q
i∈R
pIs a vector of coordinates in a broad sense,
is the vector and centrifugal moment of Coriolis, g
i(q
i) Is the moment of gravity, τ
iIs the control input vector for robot i,
is an unknown disturbance.
The Euler-Lagrange system is assumed to have the following properties:
property 1 is bounded: for any one i, there is a normal number
m i,k
CiAnd k
giSo that
And C
i(x,y)||≤k
CiFor all vectors, | y | |
And g
i(q
i)||≤k
giThis is true.
Property 2 skew symmetry:
is diagonally symmetrical.
Property 3 dynamic parameter linearization: for all vectors
Is a regression vector, and Θ
iIs a common parameter vector associated with the ith robot.
Properties 4: assumption of disturbance force
Is that the material is bounded by the surface,
ξ therein
i>0。
1. Outer loop controller design
NSB control is used in the outer loop to combine three different tasks, define the final motion of the robot and generate the required speed. The proposed IRPD-SMC is used in the inner loop of the multi-euler lagrange system to compensate for unknown disturbances, parameter uncertainty and to guarantee bounded inputs etc. The whole control system diagram is shown in figure 1.
The convoying task with the obstacle avoidance requirement is decomposed into three different subtasks, namely obstacle avoidance subtasks, the subtasks that the robot is uniformly distributed around the target and the subtasks that the robot maintains on the surface of a sphere or a hyper-sphere with the target as the center, and the two next subtasks belong to the convoying task.
The expected speed of the convoy mission with the obstacle avoidance requirement is designed as
Wherein the content of the first and second substances,
representing the desired speed required by the obstacle avoidance subtask,
and
is the desired speed required for the convoy mission.
In the obstacle avoidance subtask, each robot is virtualized into a space
Is surrounded by
Represents the position of the current obstacle for the ith robot, and B
i,oRepresents f
i,1d=d
iRegion of where d
iIs the minimum allowable safe distance between the ith robot and the obstacle.
In the form of a jacobian matrix,
is the unit vector pointing to the nearest obstacle, α
i,1>0 is a state dependent gain.
The obstacle avoidance task function and the task error function are respectively expressed as:
and
it is noted that the robot may be able to move the robot in a direction that is substantially perpendicular to the obstacle, if and only if the robot is sufficiently close to the obstacle,
when the obstacle avoidance task is not activated,
this task is built independently for each robot, rather than an accumulated task function.
In the subtasks where the robots are evenly distributed around the target, taking the planar case as an example, the ideal formation is a regular n-polygon and all the robots are eventually distributed at the vertices of the regular polygon.
The expected task function and the task function error are respectively as follows:
here, the first and second liquid crystal display panels are,
is the ideal distance between two adjacent robots.
k
jIs an index that identifies the robot at the jth location along the circle, not necessarily the jth robot. The desired speed of the task is
The corresponding Jacobian matrix is
The pseudo inverse is
A constant positive definite matrix representing the gain.
In a subtask where the robot is maintained on a sphere or hypersphere surface centered on the target, the task function, the desired task function and the task function error are respectively:
the desired speed of this task is:
the corresponding jacobian matrix is:
and Λ
2A constant positive definite matrix of gains is defined similarly.
Note 1.1. under three-dimensional space and p-dimensional (p >3) space, the problem of how to distribute points on spheres and hyperspheres is considered a Thomson problem. Many scholars have studied this problem and concluded that there is certainly a suitable distance.
Note 1.2. Once any robot is out of control and enters the virtual region BETA of another roboti,oThe robot will be considered an obstacle, which the rest of the robots must avoid. If two or more obstacles are considered simultaneously, the nearest obstacle will be processed first.
2. Inner loop controller design
In order to solve the problems of model uncertainty, external interference and the like and realize zero steady-state error tracking and bounded input, the IRPD-SMC is provided in an inner ring. The improved PD controller is used for stabilizing an equivalent part and providing a bounded input, the SMC part is used for compensating an external disturbance and a system uncertainty part, and the RBFNNS is used for estimating parameters unknown to the system.
(1)IRPD-SMC
Part of the control involves designing the controller and having each robot track the target trajectory
This trajectory may be formed byIn formula (2)
The result of the integration is that,
and similarly defined, they are bounded functions. Defining equivalent partial state quantities
Here, the first and second liquid crystal display panels are,
and
respectively, position tracking error and velocity tracking error.
γi=diag(γi,1,γi,2...γi,p) Is a positive diagonal matrix defined as a sliding mode constant, and a sliding mode surface is defined as:
wherein the content of the first and second substances,
based on property 3, the reference moment is described as follows:
as can be seen,
unknown and difficult to determine because it contains perturbations and uncertain dynamics that are difficult to obtain. Therefore, RBFNNS is used to estimate the unknown and compensate for the interference.
Adaptive RBFNNS can be written as
X
i=[x
i,1,x
i,2...x
i,m]
TIs an input to the computer system that is,
is a weight matrix, v is the number of neurons in the hidden layer,
is an activation function, one of which is commonly used:
ci,jis the center of the neuron, σi,jIs the width of the gaussian function.
The weight update rate is designed as follows:
μiis a positive fixed diagonal gain matrix.
There is an optimal RBFNNS to learn the reference moment ρiSo that
Is the optimal weight vector, ε
iIs provided withAnd (4) a bound nerve approximation error.
From the equations (6) and (9), it can be obtained
Wherein the content of the first and second substances,
the input of BBFNNS in the embodiment is selected as
By using estimated terms
The RPD-SMC control law is designed as follows:
in order to solve the problem of input bounding, the RPD-SMC control law needs to be improved. Currently, two typical saturation functions are available to ensure that the control input is bounded, a hyperbolic tangent function and an arctangent function, respectively. The curves of both functions are shown in fig. 2.
As shown in FIG. 2, the range of the arctangent function arctan (kx) is compared to the range (-1,1) of the hyperbolic tangent function tanh (kx)
Larger and for the same value of κ, (κ being a constant), the arc tan (κ x) function may approach saturation in a more gradual manner. The smaller the value of κ, the smaller the zero-crossing slope of the arc tan (κ x) function, the more nearly linearly the function approaches saturation. To better describe the direct proportional relationship between x and y, the arc tan (kx) function is typically multiplied by a certain amount of gain. As shown in fig. 3, by gain 5.5, y
1Having an approximately proportional characteristic, i.e. y
1=5.5*arctan(0.2*x)。
Through the analysis, under the condition of fully considering actuator saturation, an arctangent function is designed to improve the PD part of the control law, so that an IRPD-SMC control law is proposed.
Wherein k isαiIs a position error dependent gain for canceling position errors; k is a radical ofβiIs a differential related gain used for predicting the overall response trend and preventing the system from acting too violently; λ is the approximate proportional gain; k is a radical ofiIs the robust term gain. All gains are positive.
Note 2.1 this embodiment does not consider the saturation behavior of the actuator, but instead presents a bounded input that can produce a finite magnitude. In practical applications, actuator saturation can be successfully prevented by appropriately adjusting the controller parameters.
Note 2.2 it can be seen that the proposed IRPD-SMC algorithm only correlates with the desired track signal
And error signal
It is related. Therefore, the proposed controller is model independent.
3. Stability analysis
From equation (4), one can obtain
From equations (1), (5) and (12), the following equations can be obtained:
theorem 3.1 consider the Eulerian Lagrangian system in equation (1), the IRPD-SMC control law designed by equation (11), and the weight update rate of equation (8). Provided that the control gain satisfies k, under the assumption that the properties 1-4 holdi>εi,maxRegardless of the interference and the uncertainty of the system, the following conclusions can be drawn.
1. If there is no conflict among the three tasks, the three tasks can be completed simultaneously, the whole system is globally and gradually stable, and the tracking error is
Converge to 0.
2. If the obstacle avoidance task is activated and conflicts with the convoy task, simultaneously setting the gain as
Wherein
Is based on robustness to noise. The obstacle avoidance task can be performed first, the system is globally asymptotically stable, and the tracking error is
Converge to 0.
And (3) proving that: lyapunov function V is selected as follows
V=V1+V2
Wherein, η
1,η
2And η
3Are positive design parameters, and
it is easy to conclude that V is a positive function. Base ofFrom equations (11) and (13), the following equations can be obtained:
will V1Differentiating time while substituting equations (4), (8) and (15) yields:
if k is
i>ε
i,maxThat is to say
Is negatively determined and can be derived
It follows that the inner ring subsystem is globally asymptotically stable and the tracking error e
iAnd
converge to 0.
Note 3.1. to eliminate chatter, introducing a hyperbolic tangent function, tanh (-), instead of a discontinuous sign (-), equation (11) can be modified:
wherein the content of the first and second substances,
will V2Differentiating the time to obtain:
two cases are discussed separately below: conflicting tasks and non-conflicting tasks;
suppose that between every two tasksWithout conflict, i.e.
Therefore, it is obtained from equation (18):
it can be concluded that if there is no conflict between each pair of tasks, the outer ring subsystem is globally asymptotically stable, and the tracking error e
iAnd
converge to 0. If the obstacle avoidance task is activated and conflicts with the convoy task, V
2This can be written as:
wherein the content of the first and second substances,
and P ═ P
ij]I, j ═ 1,2,3, the submatrices of which are:
for arbitrary
Applying 2| ab ≦ a in equation (20)
2+b
2Obtaining:
wherein p is
ij,mAnd p
ij,mThe upper and lower limits of the induction sub-block of P are indicated, respectively.
Because | | J
1||=||J
2||=||J
3||=1,p
22,m=
p 22,M0 and p
33,m=p
33,M=0,
And
will lose control, in which case V
2Reset to
At the same time, the method can obtain,
it is also obtained that the outer ring subsystem is globally asymptotically stable.
In summary, the conclusion of theorem 3.1 holds true, meaning that the obstacle avoidance task has a higher priority and is executed first when it conflicts with the convoy task, second when no conflict exists, the convoy task is executed again, since NSB is a kinematics that works with a desired velocity rather than a desired position, a rational setting α is requiredi,1So that the velocity error dominates the position error.
By substituting equation (2) into equation (4) and considering that every two tasks conflict with each other, then
And
it can be removed. Thus, it is possible to provide
It is possible to obtain:
the inequality (22) is taken as an equation and the norm is taken on both sides, which can be obtained:
by selecting
Wherein
The design of (2) is based on robustness to noise, so that the function of the obstacle avoidance task can ensure that the robot is far away from the obstacle.
Note 3.2. if only two subtasks of the convoy task conflict, the robot only needs to complete the first two higher level tasks. No collision will occur, at this time
4. Simulation (Emulation)
In the embodiment, the comparison tests with APD-SMC, ASMC and PD-SMC show the superiority of the controller.
The kinetic equation for the robot is set as:
five robots are considered in the experiment of the two-dimensional space, six robots are adopted in the experiment of the three-dimensional space, and the system parameter is set to be M i1 and C i0. The outer loop parameters in two and three dimensions are shown in tables 1 and 2, respectively, and the controller parameters for ASMC, PD-SMC, APD-SMC and proposed IRPD-SMC were adjusted by trial and error, with the final selected parameter values shown in table 3.
TABLE 1
Two-dimensional spatial outer loop parameter values
TABLE 2
Three-dimensional space outer loop parameter value
TABLE 3
Control parameters of different controllers
Case 1. in two-dimensional space, the initial positions of the five robots are q, respectively
1(0)=[5,10]
T,q
2(0)=[-5,5]
T,q
3(0)=[-5,-5]
T,q
4(0)=[5,-10]
TAnd q is
5(0)=[5,0]
T. The trajectory of the target is set to [3+0.1t,0 ═ c]
TThe external disturbance parameter is
And
the values of the parameters for the outer and inner rings are shown in tables 1 and 3, respectively.
The trajectories of the five robots and the target in two-dimensional space are shown in fig. 4. Under the action of different controllers, the distance between the robot and the target, the distance between two adjacent robots, the position tracking error and the velocity tracking error are respectively shown in fig. 5(a) -8 (d). It can clearly be seen that there is a significant difference in the performance of these controllers when interference occurs. With the controller proposed herein, the control characteristics are significantly better than other controllers, as shown in fig. 5(a) - (d), fig. 5(a) is the distance between the robot and the target, fig. 5(b) is the distance between two adjacent robots, fig. 5(c) is the position tracking error, fig. 5(d) is the velocity tracking error, due to the learning ability and strong robustness of RBFNNS, the disturbances are well compensated and finally zero steady-state error is achieved.
In contrast, as shown in fig. 6(a) - (d), the PD-SMC controller cannot achieve zero steady-state error, and it can also be seen that the PD-SMC is limited in compensating for the interference. Sustained perturbation that is not changed when t-40S is introduced
At all times, the PD-SMC controller will always have tracking errors and all robots will never arrive on the reference trajectory anymore.
Finally, the control inputs of the IRPD-SMC are shown in fig. 9, and fig. 9 shows the control inputs of 5 robots when the IRPD-SMC is used, and it can be seen that the control inputs are small and do not exceed 4N. Likewise, by replacing the sign (-) function with the tanh (-) function, the tremor problem is solved and the control signal is continuous and physically realizable.
Case 2. to verify the noise immunity and robustness of the proposed controller, we performed the following simulation experiments and analyzed the results. In the two-dimensional space, the initial positions of the five robots and the set values of the outer and inner loop parameters are the same as in
case 1. The target locus is [3+0.1t, sin0.1t%]
TThe noise was considered and simulated by a gaussian function with a mean of 0 and a standard deviation of 0.2. The interference value is defined as
Where i is 1.
Fig. 10 shows the trajectories of five robots and targets when the IRPD-SMC technique is used in two dimensions for case 2. FIGS. 11(a) -13(d) show simulation experimental results for different controllers in the presence of interference and Gaussian noise. Compared to the controllers of fig. 12(a) -13(d), the proposed controller IRPD-SMC can effectively suppress noise and ensure position and velocity tracking errors of significantly less than 0.02m and 0.06m/s, respectively.
And 3, verifying the effectiveness of the proposed control law in the three-dimensional space under the condition that Gaussian noise and interference exist and the obstacle avoidance activity is in an activated state. In three-dimensional space, the initial positions of six robots are set to q
1(0)=[-10,1,0]
T,q
2(0)=[-1,-10,0.3]
T,q
3(0)=[10,0,1]
T,q
4(0)=[0,0.5,10]
T,q
5(0)=[0,10,0.3]
TAnd q is
6(0)=[0,0,-10]
T. The target locus is [0.1t,3+0.1t, sin0.1t%]
TThe position of the obstacle is q
0=[15,3,-4.5]
T. Interference value is set to
The parameters of the noise and the controller are set as in
case 2, and the parameters of the outer loop controller are set as in table 2.
The simulation results of the proposed IRPD-SMC algorithm are shown in fig. 14-16 (d). Obviously, the proposed control strategy can make the robot change its own position to avoid obstacles and collisions when an obstacle avoidance task occurs. Further, as is clear from fig. 17(a) -18(d), APD-SMC and ASMC are less accurate in control than IRPD-SMC.
In summary, the simulation results verify that the control effect of the control method provided by the embodiment is obviously better than that of the PD-SMC, the APD-SMC, and the ASMC no matter whether the obstacle avoidance task is in the activated state or not. Due to the learning capability of RBFNNS and the strong robustness of SMC, IRPD-SMC has strong robustness to various interferences and noises. Furthermore, the IRPD-SMC may provide limited control inputs to prevent actuator saturation.