CN108897315A

CN108897315A - A kind of Multi-agent Team Formation

Info

Publication number: CN108897315A
Application number: CN201810600769.5A
Authority: CN
Inventors: 邹文成; 周超; 钱科威; 向峥嵘; 黄月影
Original assignee: Nanjing University of Science and Technology
Current assignee: Nanjing University of Science and Technology
Priority date: 2018-06-12
Filing date: 2018-06-12
Publication date: 2018-11-27

Abstract

The invention discloses a kind of Multi-agent Team Formations, and in particular to a kind of Multi-agent Team Formation with action selection function, reliability be difficult to ensure the problems such as poor with the robustness for solving system in traditional formation method.The specific steps are：Step 1: setting detection income calculation method relevant to position；Step 2: defining the concrete behavior of robot；Step 3: determining the number of channels of basal ganglion, basal ganglion channel pattern is established, initializes relevant parameter；Step 4：Correct the channel pattern parameter of basal ganglion.The present invention is for Collaborative Control of forming into columns in multirobot Detection task.

Description

A multi-robot formation method

技术领域technical field

本发明涉及机器人的编队领域，具体涉及一种具有行为选择功能的多机器人编队方法。The invention relates to the field of robot formation, in particular to a multi-robot formation method with a behavior selection function.

背景技术Background technique

机器人代替人完成在复杂、危险的环境下的重复性工作，加速了社会的发展。近年来社会市场对机器人的功能的提出了新的需求，机器人需要完成更为复杂的检测任务甚至作业任务，但是面对复杂的，高效率的、并行完成的任务时，单一的机器人无法胜任，需要多机器人协同。Robots replace humans to complete repetitive tasks in complex and dangerous environments, accelerating the development of society. In recent years, the social market has put forward new requirements for the functions of robots. Robots need to complete more complex detection tasks and even homework tasks. However, when faced with complex, efficient, and parallel tasks, a single robot is not competent. Multi-robot collaboration is required.

机器人编队控制是一种多个机器人在达到目标队形的过程中，既能组成目标队形，又可以适应具体的环境约束的一种控制方法。现有的常用的方法主要有基于领航—跟随者方法、虚拟结构法、基于行为的方法、基于图论、势能法的方法。Robot formation control is a control method in which multiple robots can form a target formation and adapt to specific environmental constraints in the process of reaching the target formation. The existing commonly used methods are mainly based on the leader-follower method, the virtual structure method, the behavior-based method, the graph theory-based method, and the potential energy method.

其中领航者—跟随者方法的基本思想是在多机器人组成的编队系统中，可以存在一个或者多个领航机器人，其他非领航者机器人即为跟随机器人，跟随机器人以相对其领航机器人的位置相对距离、相对角度中作为输入控制量，使得跟随机器人与领航机器人的相对位置无限逼近目标值。领航跟随方法的控制结构比较简单，但是由于领航机器人和跟随机器人之间没有位置反馈，并且是领航者机器人单点控制，导致容易出现机器人掉队等情况，系统的鲁棒性较差。The basic idea of the leader-follower method is that in a multi-robot formation system, there can be one or more leader robots, and other non-leader robots are follower robots, and the relative distance between the follower robot and its leader robot position is , The relative angle is used as the input control quantity, so that the relative position of the follower robot and the leader robot approaches the target value infinitely. The control structure of the pilot-following method is relatively simple, but since there is no position feedback between the pilot robot and the following robot, and the single-point control of the pilot robot, it is easy for the robot to fall behind, and the robustness of the system is poor.

虚拟结构法的基本思想是将多机器人组成的系统看成一个假想的刚体结构，各个机器人在参考坐标系下的坐标不变,即机器人之间的相对位置不变。The basic idea of the virtual structure method is to regard the system composed of multiple robots as a hypothetical rigid body structure, and the coordinates of each robot in the reference coordinate system remain unchanged, that is, the relative positions between the robots remain unchanged.

基于行为的方法的基本思想是将多机器人编队的各个环节看成由单个机器人的多个基本行为构成，只需要研究各个基本行为的控制方法即可通过基本行为的组合控制多机器人形成编队。基本行为一般包括目标跟踪、避免障碍、避免碰撞、编队生成和编队保持等。基于行为的方法由各个机器人相互感知控制，系统比较容易实现分布式控制，具有很好的鲁棒性。但是由于无法结合准确的数学模型分析，导致不能十分有效的保证系统的可靠性。The basic idea of the behavior-based method is to regard each link of the multi-robot formation as composed of multiple basic behaviors of a single robot, and only need to study the control method of each basic behavior to control the formation of multi-robots through the combination of basic behaviors. Basic behaviors generally include target tracking, obstacle avoidance, collision avoidance, formation generation and formation maintenance, etc. The behavior-based method is controlled by mutual perception of each robot. The system is relatively easy to realize distributed control and has good robustness. However, due to the inability to combine accurate mathematical model analysis, the reliability of the system cannot be guaranteed very effectively.

势能法势场法是与上而所述传统方法不巧的另一种队形形成控制方法，它将队形形成的所有步骤合为一起，通过构建合近的势函数，由势场函数来决定机器人的控制律将其他机器人对其不同的势力作用，结合期望的队形图形，使机器人朝预期的队形图运动，当各自到达编队图时，整个系统的势能玻小。在队形形成过程中，队形目标点是动态变化的，属于动态队形形成方法，机器人是通过不断调整相互之间的距离而达到一个平衡态；但队形形成的快速性和可控性无法把握，由于是按照相对距离的编队图来形成队形，那么说明机器人之问的相互相对位置其实是事先确定的，这就使机器人的目标位置可能不是最佳的，又由于整个队形目标上保持一个相对位置固定的编队图，假设一个机器人未能达到指定队形点，其他机器人将一直处于动态搜寻最小势能点的状态并一直运动。Potential energy method Potential field method is another method of formation formation control that is different from the traditional method mentioned above. It combines all the steps of formation formation. By constructing a close potential function, the potential field function can Determine the control law of the robot to act on different forces of other robots, combined with the expected formation figure, so that the robots move towards the expected formation figure. When each reaches the formation figure, the potential energy of the whole system will decrease. In the formation formation process, the formation target point is dynamically changed, which belongs to the dynamic formation formation method, and the robots reach an equilibrium state by constantly adjusting the distance between each other; but the rapidity and controllability of formation formation Unable to grasp, because the formation is formed according to the formation diagram of the relative distance, it means that the relative positions of the robots are actually determined in advance, which makes the target position of the robot may not be optimal, and because the entire formation target Maintain a relatively fixed formation diagram, assuming that one robot fails to reach the designated formation point, other robots will always be in the state of dynamically searching for the minimum potential energy point and keep moving.

针对上述问题，提出一种基于行为选择的多机器人编队方法。Aiming at the above problems, a multi-robot formation method based on behavior selection is proposed.

发明内容Contents of the invention

本发明的目的在于解决的多机器人编队过程中系统的可靠性与鲁棒性无法保证的问题。The purpose of the present invention is to solve the problem that the reliability and robustness of the system cannot be guaranteed in the multi-robot formation process.

为了解决上述技术问题，本发明提供一种基于行为选择的多机器人编队方法，通过引入一种基底神经节行为选择编队策略，增强系统的可靠性与鲁棒性。In order to solve the above technical problems, the present invention provides a multi-robot formation method based on behavior selection, which enhances the reliability and robustness of the system by introducing a basal ganglia behavior selection formation strategy.

具体技术方案为：The specific technical solutions are:

综合考虑机器人向形成编队目标位置的运动的路程之和dis_sum，机器人的适宜移动速度v_i，机器人调整到面向目标点姿态的时间T_adjust，机器人运动到目标点的时间T_move，机器人在下个采样周期后到达位置的检测收益，进行行为选择。Comprehensively consider the sum dis _sum of the movement distance of the robot to the formation target position, the suitable moving speed v _i of the robot, the time T _adjust for the robot to adjust to the attitude towards the target point, and the time T _move for the robot to move to the target point. The detection income of the arrival position after the sampling period is used for behavior selection.

T_cost＝T_move+T_adjust T _cost = T _move + T _adjust

步骤一、设定与位置相关的检测收益计算方法。Step 1, setting a detection revenue calculation method related to the location.

(1)目标探测位置收益区设定：目标探测位置收益区设定具体是如下的条件：收益区为以目标探测中心为圆形的数块扇形区域。(1) Setting of profit area of target detection position: The setting of profit area of target detection position is specifically as follows: the profit area is several fan-shaped areas with the center of target detection as a circle.

(2)检测收益计算方法：在距离收益区较近的扇形内，设定该机器人的该检测点收益为与目标点重要程度相关的固定收益；在距离收益区较远的圆环内，设定该机器人的该检测点收益为与机器人与目标点之间距离成反比的动态收益。并在检测完成后，当前作业周期的检测收益将清零。(2) Calculation method of detection income: in the sector closer to the income area, set the detection point income of the robot as a fixed income related to the importance of the target point; in the circle farther from the income area, set The detection point income of the robot is determined to be a dynamic income inversely proportional to the distance between the robot and the target point. And after the detection is completed, the detection income of the current job cycle will be cleared.

步骤二、定义机器人的具体行为：根据机器人的具体结构与功能定义机器人在行为选择完成后可能会出现的具体行为。对各个行为的激励抑制关系进行限制与设定分析，若某一机器人行为可被激发，则采用基底神经节通道模型进行验证；若被激发的机器人行为可以通过验证，则执行此行为。Step 2. Define the specific behavior of the robot: according to the specific structure and function of the robot, define the specific behavior that the robot may appear after the behavior selection is completed. The incentive-inhibition relationship of each behavior is restricted and analyzed. If a certain robot behavior can be stimulated, the basal ganglia channel model is used for verification; if the stimulated robot behavior can pass the verification, this behavior is executed.

步骤三、确定基底神经节的通道数量，建立基底神经节通道模型，初始化相关参数。Step 3: Determine the number of channels of the basal ganglia, establish a channel model of the basal ganglia, and initialize relevant parameters.

基底神经节数学模型包括纹状体、苍白球外核、底丘脑核、核团苍白球内核。The mathematical model of the basal ganglia includes the striatum, the outer nucleus of the globus pallidus, the subthalamic nucleus, and the inner nucleus of the globus pallidus.

(1)建立行为通道的数学模型：每个通道用一个漏积分神经元表示：(1) Establish a mathematical model of behavioral channels: each channel is represented by a leaky integral neuron:

其中，x是神经元的状态，u为神经元的输入，y为神经元的输出，H为阶跃函数，其余为模型参数。Among them, x is the state of the neuron, u is the input of the neuron, y is the output of the neuron, H is the step function, and the rest are model parameters.

y_i ^C＝S_i y _i ^C = S _i

大脑皮层整合、加工得到结合步骤一中各个因素的综合重要性指标S_i，其中i为通道号，y_i ^C表征大脑皮层中第i个通道的输出。The cerebral cortex is integrated and processed to obtain a comprehensive importance index S _i combining all factors in step 1, where i is the channel number, and y _i ^C represents the output of the i-th channel in the cerebral cortex.

纹状体D1的状态可以描述为:The state of striatum D1 can be described as:

其中，u_i ^SD1是纹状体D1，i通道的神经元输入，a_i ^SD1是该通道神经元的状态，y_i ^SD1是该通道神经元的输出，w_CSD1是大脑皮层到纹状体D1的权重。1+λ描述的是多巴胺对纹状体D1的激励作用，ε_SD1为纹状体D1的输出阈值。Among them, u _i ^SD1 is the striatum D1, the neuron input of the i channel, a _i ^SD1 is the state of the channel neuron, y _i ^SD1 is the output of the channel neuron, w _CSD1 is the cerebral cortex to the striatum D1 the weight of. 1+λ describes the stimulating effect of dopamine on striatum D1, and ε _SD1 is the output threshold of striatum D1.

纹状体D2的状态可以描述为:The state of striatum D2 can be described as:

其中，u_i ^SD2是纹状体D2，i通道的神经元输入，a_i ^SD2是该通道神经元的状态，y_i ^SD2是该通道神经元的输出，w_CSD2是大脑皮层到纹状体D2的权重,1-λ描述的是多巴胺对纹状体D2的抑制作用，ε_SD2为纹状体D2的输出阈值。Among them, u _i ^SD2 is the striatum D2, the neuron input of the i channel, a _i ^SD2 is the state of the channel neuron, y _i ^SD2 is the output of the channel neuron, w _CSD2 is the cerebral cortex to the striatum D2 The weights, 1-λ describe the inhibitory effect of dopamine on striatal D2, and ε _SD2 is the output threshold of striatal D2.

苍白球外核可以描述为：The outer nucleus of the globus pallidum can be described as:

其中，u_i ^GPe是苍白球外核，i通道的神经元输入，a_i ^GPe是该通道神经元的状态，y_i ^GPe是该通道神经元的输出，w_SD2GPe是纹状体D2到苍白球外核的权重,ε_GPe是苍白球外核的输出阈值。Among them, u _i ^GPe is the outer nucleus of the globus pallidus, the neuron input of the i channel, a _i ^GPe is the state of the neuron of the channel, y _i ^GPe is the output of the neuron of the channel, w _SD2GPe is the striatum D2 to the globus pallidum The weight of the outer nucleus, _εGPe is the output threshold of the outer nucleus of the globus pallidum.

底丘脑核的模型可描述为:The model of the subthalamic nucleus can be described as:

其中，u_i ^STN是底丘脑核，i通道的神经元输入，a_i ^STN是该通道神经元的状态，y_i ^STN是该通道神经元的输出，w_GPeSTN是苍白球外核到底丘脑核的权重,ε_STN是底丘脑核的输出阈值。Among them, u _i ^STN is the subthalamic nucleus, the neuron input of the i channel, a _i ^STN is the state of the channel neuron, y _i ^STN is the output of the channel neuron, w _GPeSTN is the subthalamic nucleus of the external nucleus of the globus pallidum Weight, ε _STN is the output threshold of the subthalamic nucleus.

根据解剖学上研究，底丘脑核对于苍白球内核的投射十分的分散，因此在描述苍白球内核的数学模型时，令苍白球内核的输入包括其他左右通道的底丘脑核出。According to anatomical studies, the projection of the subthalamic nucleus to the globus pallidus nucleus is very scattered. Therefore, when describing the mathematical model of the globus pallidus nucleus, the input of the globus pallidus nucleus includes the output of the subthalamic nucleus of other left and right channels.

描述为：described as:

其中，u_i ^GPi是苍白球内核，i通道的神经元输入，a_i ^GPi是该通道神经元的状态，y_i ^GPi是该通道神经元的输出，w_SD1GPi是纹状体D1到苍白球内核的权重,w_STNGPi为底丘脑核到苍白球内核的输出权重，ε_GPi是苍白球内核的输出阈值，需要注意的是，w_CSD1、w_CSD2、w_STNGPi为正值表征激励性连接、w_SD1GPi、w_SD2GPe、w_GPeSTN为负值，表征抑制性连接。Among them, u _i ^GPi is the globus pallidus kernel, the neuron input of the i channel, a _i ^GPi is the state of the neuron of the channel, y _i ^GPi is the output of the neuron of the channel, w _SD1GPi is the striatum D1 to the globus pallidus kernel , w _STNGPi is the output weight from the subthalamus nucleus to the globus pallidus kernel, ε _GPi is the output threshold of the globus pallidus kernel, it should be noted that w _CSD1 , w _CSD2 , and w _STNGPi are positive values representing the incentive connection, w _SD1GPi , w _SD2GPe , w _GPeSTN are negative values, representing inhibitory connections.

步骤四：校正基底神经节的通道模型参数：当机器人的整体运行情况在以用户偏好为标准的条件下有所偏差时，进行对步骤三中模型各项参数的校正。Step 4: Correct the channel model parameters of the basal ganglia: When the overall operation of the robot deviates from the user preference as the standard, correct the parameters of the model in step 3.

本发明与现有技术相比，其显著优点在于：(1)引入了一种基于行为选择的动态机器人编队方法，有效解决了现有编队检测方法中鲁棒性不强，可靠性不强等缺点；(2)通过开放参数与各项权重修改通道，使得本发明具有更加广泛的通用性；(2)引入目标检测收益变量，可以在整体上提升多机器人进行检测任务的收益，间接提升了系统运行效率。Compared with the prior art, the present invention has significant advantages in that: (1) It introduces a dynamic robot formation method based on behavior selection, which effectively solves the problems of weak robustness and low reliability in the existing formation detection methods. Disadvantages; (2) The present invention has wider versatility by opening parameters and various weight modification channels; (2) Introducing target detection income variables can improve the income of multi-robots for detection tasks as a whole, indirectly improving System operating efficiency.

附图说明Description of drawings

图1为本发明机器人检测任务检测收益范围示意图。Fig. 1 is a schematic diagram of the detection profit range of the robot detection task in the present invention.

图2为本发明基底神经节示意图。Fig. 2 is a schematic diagram of the basal ganglia of the present invention.

具体实施方式Detailed ways

本发明基于行为选择的多机器人编队方法，包括以下步骤：The multi-robot formation method based on behavior selection of the present invention comprises the following steps:

其中η_profit为当前目标点的动态收益，η_k为当前目标点的固定收益，r为机器人距离目标检测点的距离，r₁为固定检测收益区与目标检测点的最短距离，r₂为动态收益区与目标检测点的最长距离。Among them, η _profit is the dynamic profit of the current target point, η _k is the fixed profit of the current target point, r is the distance between the robot and the target detection point, r ₁ is the shortest distance between the fixed detection profit area and the target detection point, and r ₂ is the dynamic The longest distance between the revenue zone and the target detection point.

步骤二、定义机器人的具体行为：根据机器人的具体结构与功能定义机器人在行为选择完成后可能会出现的具体行为。对各个行为的激励抑制关系进行限制与设定分析，若某一机器人行为可被激发，则采用基底神经节通道模型进行验证；若被激发的机器人行为可以通过验证，则执行此行为。设计机器人姿态行为有顺时针转动、顺时针转动、前进、后退、搜索、回程六种。Step 2. Define the specific behavior of the robot: according to the specific structure and function of the robot, define the specific behavior that the robot may appear after the behavior selection is completed. The incentive-inhibition relationship of each behavior is restricted and analyzed. If a certain robot behavior can be stimulated, the basal ganglia channel model is used for verification; if the stimulated robot behavior can pass the verification, this behavior is executed. There are six kinds of posture behaviors for the design robot: clockwise rotation, clockwise rotation, forward, backward, search, and return.

其中s_i表征各个行为的重要程度，η_i ^k为待整定常数。Among them, s _i represents the importance of each behavior, and η _i ^k is a constant to be adjusted.

y_i ^C＝S_i y _i ^C = S _i

其中，u_i ^GPe是苍白球外核，i通道的神经元输入，a_i ^GPe是该通道神经元的状态，y_i ^GPe是该通道神经元的输出，w_SD2GPe是纹状体D2到苍白球外核的权重,ε_GPe是苍白球外核的输出阈值。Among them, u _i ^GPe is the outer nucleus of the globus pallidus, the neuron input of the i channel, a _i ^GPe is the state of the neuron of the channel, y _i ^GPe is the output of the neuron of the channel, w _SD2GPe is the striatum D2 to the globus pallidum The weight of the outer nucleus, _εGPe is the output threshold of the outer nucleus of the globus pallidus.

描述为：described as:

其中，u_i ^GPi是苍白球内核，i通道的神经元输入，a_i ^GPi是该通道神经元的状态，y_i ^GPi是该通道神经元的输出，w_SD1GPi是纹状体D1到苍白球内核的权重,w_STNGPi为底丘脑核到苍白球内核的输出权重，εGPi是苍白球内核的输出阈值，值需要注意的是，w_CSD1、w_CSD2、w_STNGPi为正值表征激励性连接、w_SD1GPi、w_SD2GPe、w_GPeSTN为负值，表征抑制性连接。Among them, u _i ^GPi is the globus pallidus kernel, the neuron input of the i channel, a _i ^GPi is the state of the neuron of the channel, y _i ^GPi is the output of the neuron of the channel, w _SD1GPi is the striatum D1 to the globus pallidus kernel weight, w _STNGPi is the output weight from the subthalamic nucleus to the globus pallidus core, εGPi is the output threshold of the globus pallidus core, the values need to be noted that w _CSD1 , w _CSD2 , and w _STNGPi are positive values representing incentive connections, w _SD1GPi , w _SD2GPe , w _GPeSTN are negative values, representing inhibitory connections.

最终按照苍白球内核输出对机器人行为进行选定。Finally, the behavior of the robot is selected according to the output of the globus pallidum kernel.

本发明也可在不同功能，不同数量规模，不同检测目标，不同运动方式与目标检测方式的机器人集群中应用。以上对本发明的具体实施例进行了描述。需要理解的是，本发明并不局限于上述特定实施方式，本领域技术人员可以在权利要求的范围内作出各种变形或修改，这并不影响本发明的实质内容。The present invention can also be applied in robot clusters with different functions, different numbers and scales, different detection targets, different motion modes and target detection modes. Specific embodiments of the present invention have been described above. It should be understood that the present invention is not limited to the specific embodiments described above, and those skilled in the art may make various changes or modifications within the scope of the claims, which do not affect the essence of the present invention.

Claims

1. A multi-robot formation method is characterized in that, comprising the following steps:

Step 1: Set the location-related detection revenue calculation method;

Step 2: Define the specific behavior of the robot;

Step 3: Determine the number of channels of the basal ganglia, establish a channel model of the basal ganglia, and initialize relevant parameters;

Step 4: Calibrate the channel model parameters of the basal ganglia.

2. The multi-robot formation method according to claim 1, wherein said step 1 is specifically:

Comprehensively consider the sum dis _sum of the movement distance of the robot to the formation target position, the suitable moving speed v _i of the robot, the time T _adjust for the robot to adjust to the attitude towards the target point, and the time T _move for the robot to move to the target point. The detection income of the arrival position after the sampling period can be obtained as:

T _cost = T _move + T _adjust ;

(1) Setting of target detection position profit area: the specific conditions are: the profit area is several fan-shaped areas with the target detection center as a circle;

(2) Calculation method of detection income: in the sector closer to the income area, set the detection point income of the robot as a fixed income related to the importance of the target point; in the circle farther from the income area, set The detection point income of the robot is determined to be a dynamic income inversely proportional to the distance between the robot and the target point. After the detection is completed, the detection income of the current job is cleared.

3. The multi-robot formation method according to claim 1, characterized in that, the step 2 is specifically: according to the specific structure and function of the robot, define the specific behaviors that the robot may appear after the behavior selection is completed, and for each behavior The incentive-inhibition relationship is restricted and analyzed. If a certain robot behavior can be stimulated, it will be verified using the basal ganglia channel model; if the stimulated robot behavior can pass the verification, this behavior will be executed.

4. The multi-robot formation method according to claim 1, characterized in that, in said step 3, the mathematical model of the basal ganglia includes striatum, globus pallidus outer nucleus, subthalamic nucleus, and nucleus pallidus inner nucleus.

5. multi-robot formation method according to claim 1, is characterized in that, described step 3 is specifically: (1) establishes the mathematical model of behavior channel: each channel represents with a leaky integral neuron:

In the above formula, x is the state of the neuron, u is the input of the neuron, y is the output of the neuron, H is a step function, and the rest are model parameters;

y _i ^C = S _i

The integration and processing of the cerebral cortex results in the comprehensive importance index S _i combining the various factors in step 1, where i is the channel number, and y _i ^C represents the output of the i-th channel in the cerebral cortex; the state of the striatum D1 can be described as :

In the above formula, u _i ^SD1 is the striatum D1, i is the neuron input of the channel, a _i ^SD1 is the state of the channel neuron, y _i ^SD1 is the output of the channel neuron, w _CSD1 is the ε _SD1 is the output threshold of striatum D1; the state of striatum D2 can be described as:

In the above formula, u _i ^SD2 is the striatum D2, i is the neuron input of the channel, a _i ^SD2 is the state of the channel neuron, y _i ^SD2 is the output of the channel neuron, w _CSD2 is the The weight of striatum D2, 1-λ describes the inhibitory effect of dopamine on striatum D2, and ε _SD2 is the output threshold of striatum D2; the outer nucleus of globus pallidus can be described as:

In the above formula, u _i ^GPe is the outer nucleus of globus pallidum, i is the neuron input of the channel, a _i ^GPe is the state of the channel neuron, y _i ^GPe is the output of the channel neuron, w _SD2GPe is the striatum D2 The weight to the external nucleus of the globus pallidum, _εGPe is the output threshold of the external nucleus of the globus pallidus; the model of the subthalamic nucleus can be described as:

In the above formula, u _i ^STN is the subthalamic nucleus, i is the neuron input of the channel, a _i ^STN is the state of the channel neuron, y _i ^STN is the output of the channel neuron, w _GPeSTN is the end of the outer nucleus of the globus pallidus The weight of the thalamic nucleus, ε _STN is the output threshold of the subthalamic nucleus; if the input of the globus pallidus nucleus includes the output of the subthalamic nucleus of other left and right channels, we can get:

In the above formula, u _i ^GPi is the globus pallidus kernel, i is the neuron input of the channel, a _i ^GPi is the state of the channel neuron, y _i ^GPi is the output of the channel neuron, w _SD1GPi is the The weight of the globus pallidus kernel, w _STNGPi is the output weight from the subthalamus nucleus to the globus pallidus kernel, ε _GPi is the output threshold of the globus pallidus kernel, w _CSD1 , w _CSD2 , and w _STNGPi are positive values, which represent incentive connections, w _SD1GPi , Negative values of w _SD2GPe and w _GPeSTN indicate inhibitory connections.

6. The multi-robot formation method according to claim 1, characterized in that the step 4 is specifically: when the overall operation of the robot deviates from the user preference as the standard, the model in step 3 Carry out calibration of various parameters.