CN116256972A

CN116256972A - Six-foot robot man-machine instruction combination optimization method

Info

Publication number: CN116256972A
Application number: CN202211508724.8A
Authority: CN
Inventors: 尤波; 董正; 李佳钰; 庄天扬
Original assignee: Harbin University of Science and Technology
Current assignee: Harbin University of Science and Technology
Priority date: 2022-11-29
Filing date: 2022-11-29
Publication date: 2023-06-13
Anticipated expiration: 2042-11-29
Also published as: CN116256972B

Abstract

The invention belongs to the technical field of foot robot driving control. The invention discloses a hexapod robot man-machine instruction combination optimization method, which solves the problem of optimization distribution of a driver and a robot decision instruction in the driving process of the hexapod robot. According to the hexapod robot man-machine instruction combination optimization method, an allocation scheme between a decision instruction of a driver and a decision instruction of the hexapod robot is used as a state of a finite state machine, when constraint conditions of a stability margin, energy consumption and collision times of the hexapod robot in a running process exceed a set threshold value, state transition is triggered, and transition probability is increased in the state transition process to improve flexibility of state transition. The method can acquire the optimal driving instruction combination between the driver and the robot in real time, and effectively improves the running performance of the hexapod robot and the adaptability to emergency conditions.

Description

Six-foot robot man-machine instruction combination optimization method

Technical Field

The invention belongs to the technical field of foot robot driving operation, and particularly relates to a collaborative control method capable of efficiently and rapidly combining and optimizing driving instructions of people and machines.

Background

The foot type mobile robot has stronger adaptability and trafficability to the terrain environment compared with the wheeled or crawler type mobile robot by virtue of rich gait and flexible movement modes when facing complex terrain, the foot type mobile robot has wide application in the fields of space detection, disaster relief and rescue, weaponry and the like due to the characteristics of the foot type robot, and also has a plurality of burdens on driving operation of a driver due to complex movement parameters to be controlled by multiple degrees of freedom, and the driver has to make real-time decisions on external environment changes and self-states of a machine body in the driving process, so that the driver can certainly operate improperly, the accuracy of decision is affected, and a good running strategy cannot be obtained by solely depending on an autonomous algorithm of the robot when facing complex and changeable environments, besides the robot is controlled fully automatically or manually, the robot can also work with people with different auxiliary and automatic levels, and the man-machine cooperative decision can play respective advantages and make up for the advantages of the people and the machine, so that how to carry out reasonable man-machine cooperative work ensures that the foot type robot has a stable running state to be solved.

The finite state machine is a mathematical calculation model for representing finite states and transfer and action behaviors among the states, has wide application in the fields of computer science, circuit system design, software engineering and the like, is applied to the aspect of autonomous decision of unmanned vehicles at present due to the characteristics of limited states and flexible conversion among the states, and is often used for determining specific work division of people and machines in the aspect of man-machine cooperative work, and the contents of division of work in different occasions are also changed in real time, so that the characteristics of flexible conversion of the finite state machine are brought into play in consideration of how the finite state machine is applied to the field of man-machine cooperative work, and the best division of work can be determined in real time between man-machines, so that the problem of man-machine cooperative work is solved.

Disclosure of Invention

The invention aims to provide a hexapod robot man-machine instruction combination optimization method, which can realize optimizing distribution of driving instructions between a driver and the hexapod robot, so that the hexapod robot has a better driving effect.

The method comprises the steps of recording instructions of a driver and recording driving instructions of the hexapod robot under the position coordinates of specific terrains; the robot reproduces the instruction of the driver, and obtains the driving instruction information of the driver at the position according to the current position coordinate of the hexapod robot so as to realize autonomous decision; establishing an offline evaluation function to determine with which driving scheme combination the hexapod robot should travel in the current terrain environment; the on-line real-time man-machine driving instruction scheme distribution of the hexapod robot is realized by utilizing the finite state machine, so that the driving scheme combination can be timely adjusted when a driver or the hexapod robot has improper driving behaviors.

The invention is realized by the following technical scheme:

the six-foot robot man-machine instruction combination optimization method specifically comprises the following steps:

step 1: the method comprises the steps of (1) recording a driver instruction, wherein as shown in fig. 1, the driver makes a decision according to current environment information, and meanwhile, the gait, stride, step height, speed and pitch angle information of the hexapod robot and position coordinate information of the hexapod robot during instruction recording are subjected to data persistence;

step 2: the robot reproduces the instruction of the driver, as shown in fig. 1, the hexapod robot obtains the driving instruction information of gait, stride, step height, speed and pitch angle of the driver at the position in the step 1 according to the information of the current terrain and the current position coordinate, so as to realize autonomous decision;

step 3: establishing an offline evaluation function to determine which driving scheme combination the hexapod robot should travel in the current terrain environment, taking the stability margin, energy consumption and collision times of the hexapod robot as constraint conditions, determining model parameters by using a Gaussian kernel formula, enumerating the stability margin, energy consumption and collision times data of the hexapod robot traveling under all driving scheme combinations, calculating the score of the driving scheme combination by using the offline evaluation function, and finally determining which man-machine driving instruction distribution mode the hexapod robot will travel according to the score;

step 4: the on-line real-time man-machine driving instruction scheme distribution of the hexapod robot is realized by utilizing the finite state machine, so that the driving scheme combination can be timely adjusted when a driver or the hexapod robot has improper driving behaviors. The allocation scheme between the decision instruction of the driver and the decision instruction of the hexapod robot is used as the state of the finite state machine, when the constraint condition exceeds the set threshold value to be the state triggering condition, the state transition process is increased by the transition probability, so that the flexibility of state transition is improved, and the adaptability of the hexapod robot to the complex environment is improved.

The beneficial effects of the invention are as follows:

according to the six-legged robot man-machine instruction combination optimization method, on one hand, the problem of decision conflict when both man-machine parties make decisions can be realized, and the respective decision results are optimized and combined to form a new decision result; on the other hand, the collaborative driving control method can lead the man-machine side to give the driving authority to the party when the driving is lost, thereby ensuring the driving fault tolerance; finally, the optimal instruction distribution of the man-machine can be realized, and the running performance of the hexapod robot is improved.

Drawings

FIG. 1 is a schematic diagram of a driver instruction recording and reproducing structure

FIG. 2 is an offline decision flow chart

FIG. 3 is an online decision flow chart

Detailed Description

Embodiments of the present invention are described in further detail below with reference to the accompanying drawings and examples.

One embodiment of the invention: a six-foot robot man-machine instruction combination optimization method comprises the following steps:

(1): by analyzing the topographical features and geometric features of the topography, the common irregular topography can be generalized into four typical features: flat terrain, sloping field terrain, obstacle terrain, and gully terrain under which a driver commonly uses driving instructions including: when describing the terrains, the method of describing the grid map is adopted, the grid map information is divided into a plurality of layers, one layer is used for recording the height information of the terrains, the other layers are used for recording the instruction information of a driver, the association of the terrains and the driver instructions is realized, the adoption recording frequency of each instruction is reasonably set, the driving instruction using frequencies under the 4 terrains are summarized according to the driving experience of the driver, the parameters of the same terrains are further divided, the distance of the flat terrains, the gradient of the sloping terrains, the height of the obstacle terrains and the width of the gully terrains are divided into three grades of large, medium and small, the using frequency of the driver operating instructions under different terrains is divided into three grades of high, medium and low, and therefore the recording frequency of the driving instructions with high using frequency can be improved under a certain terrains, and the recording frequency of the driving instructions with low using frequency is reduced.

(2): according to the corresponding relation between the topography recorded by the driver and the instruction, the complex autonomous decision process of the robot can be simplified into a reproduction process of the recorded instruction, when the robot runs in the topography environment recorded by the driver, the driving instruction recorded in the corresponding grid can be obtained according to the current position, and the robot uses the driving instruction as a decision result of the robot.

(3): FIG. 2 is a flow chart of offline decision, in which stability margin, energy consumption and collision number data of different decision instruction combinations under the terrain are first obtained, and since 3 constraint conditions are different in unit and cannot be calculated subsequently, normalization processing is required, and then the stability margin R is used _st Energy consumption R _en And number of collisions R _co Establishing an evaluation function E _c ＝aR _st +bR _en +cR _co The weight parameters corresponding to the constraint conditions are a, b and c, the size relation of the evaluation function parameters can be determined according to different terrain environments, the robot hardly collides when driving on a flat ground, and the fluctuation of stability is small, so that the consumption of energy is mainly considered on the flat ground, the weight of the stability is inferior, and the weight of the collision is the least; when driving on rough obstacle terrain, mainly considering collision with the obstacle, secondly stability and thirdly energy consumption; when the robot runs on a sloping field, the posture of the robot body changes when the robot goes up and down the sloping field, so that the requirement on stability is highest, the energy consumption is the energy consumption, and the collision times are the first time; when the robot passes through the gully, the stability requirement is maximum because the pose and the foot falling position of the robot body are to be adjusted, the collision problem of the shank is considered, and finally the energy consumption is considered, and three constraint conditions are considered to be specificThe weight values of the type of terrain have obvious magnitude relation, the weight values of the type of terrain are also related to the terrain parameters, and the Gaussian kernel function is utilized to determine the weight values of the constraint function

The magnitude relation of the weight of the constraint function under the terrain can be expressed according to the distances from the constraint condition under different terrains to the central position, the specific weight value of the constraint function is related to the magnitude of sigma, the sigma can be determined by using the parameters of the terrain, and the larger the slope of the terrains is, the smaller the sigma value is; the longer the distance to be travelled under flat terrain, the smaller the value of sigma; the greater the number of obstacles under the obstacle topography, the smaller the value of sigma; the wider the width of the trench σ, the smaller the value of the trench.

(4) In the method, in order to make a decision process more real-time and efficient, a finite state machine method is used to realize dynamic decision between man-machine driving instructions, as shown in fig. 3, an online decision flow chart is shown, states of the state machine are distribution combinations of driving instructions, when a constraint condition of a robot body is changed to an unfavorable direction due to a parameter controlled by a driver or the robot, a control authority of the parameter is required to be removed, namely, state transition occurs, a transition condition between states is related to the constraint condition of the robot, specifically, the current stability margin is too low, energy consumption is too high, collision times are all triggering conditions of state transition, in order to make the transition process more flexible, the probability of transition is transferred from the fixed probability, the probability of occurrence under the current driving instruction can be represented by the driving instruction ratio, the probability of occurrence between the current state and the next state is simplified according to the state transition probability, and the current state transition probability can be represented by the driving instruction under the independent state, and the probability of the driving instruction can be obtained under the condition that the current state transition probability is represented by the driving instruction, and the driving instruction can be represented by the driving instruction is obtained under the condition of the following the driving instruction:

event a represents the current state and event B represents the next state. />

Claims

1. The six-foot robot man-machine instruction combination optimization method is characterized by comprising the following steps of:

step 1: the method comprises the steps of recording instructions of a driver, recording the instructions of the driver of a six-foot robot in the positions of specific training terrains (wherein the specific training terrains refer to regular obstacle terrains such as sloping lands, step ravines and the like matched with the dimensions of the robot, coordinates of foot ends and mass centers of an organism in a three-dimensional fixed coordinate system), and aiming at different using frequencies of different instructions of the driver under different training terrains, dynamically changing the recording frequencies of the different instructions according to the training terrains, so that the using frequencies and the recording frequencies of the instructions are in positive correlation;

step 2: the robot reproduces the instruction of the driver, and according to the current position coordinates of the six-foot robot, the driving instruction sequences of the gait, the stride, the step height, the speed and the pitch angle of the driver at the position in the step 1 are fetched, and the group of instruction sequences are automatically executed to realize the autonomous decision of the robot;

step 3: according to the combined data of the man-machine driving instructions acquired offline under different training terrains, evaluating the preselected combination of the man-machine driving instructions entering the different training terrains by combining an offline decision-making evaluation function;

step 4: the conversion of the man-machine driving instruction combination is realized by using a finite state machine under different states, and the optimal man-machine instruction combination under the terrain is dynamically selected.

2. An offline decision-making evaluation function according to claim 1, step 3, characterized in that: taking the stability margin, the energy consumption and the collision times of the hexapod robot as constraint conditions for evaluating the running performance of the hexapod robot, and establishing an offline evaluation function:

e=arst+brn+crco (1) where E represents an offline evaluation score, and Rst, ren, rco is a result obtained by normalizing and unifying correlations of a stability margin, energy consumption and collision times of the hexapod robot, respectively, in consideration of differences of magnitude and correlations of three evaluation dimensions, and a, b and c are weight parameters corresponding to the stability margin, the energy consumption and the collision times, respectively, and since weights of constraint conditions change with different training terrains, a gaussian kernel function is used to calculate the weight parameters:

the process of the weight parameter can be classified into: taking a cluster of straight lines parallel to the radial central axis of the Gaussian kernel function, intersecting the Gaussian function curve, wherein the longitudinal coordinate value of the intersection point is a weight value, x is expressed as the distance between constraint conditions and the central axis, sigma is used for expressing the magnitude grade of the terrain parameter, the steeper the grade is, the steeper the terrain is, and the magnitude of the weight parameter expressed by y can be determined according to the values of x and sigma;

the weight of the three evaluation indexes is determined according to the type difference of the terrain, and then the values of the six-foot robot constraint conditions under different instruction combinations are brought into the evaluation function, so that the instruction combination with the highest offline evaluation score can be obtained to enter the terrain.

3. The method for realizing the combined conversion of the man-machine driving instructions of the hexapod robot in different states by using the finite state machine according to the step 4 of the claim 1 is characterized in that: taking a combination of driving instructions of a driver and the six-legged robot as a state of a finite state machine; the transition condition between the states is related to the constraint condition of the offline function in the step 3, and when the stability margin of the hexapod robot is too low, the energy consumption is too high and the collision times are too high in the running process, the state transition is triggered; and taking the duty ratio of the driving scheme combination as the probability of the driving scheme combination according to the constraint condition of each driving scheme combination in the step 3:

pst in _i 、Pen _i 、Pco _i To generate a stability margin energy consumption and probability of each state under the crash frequency conversion condition, it is equal to the value Rst of the constraint condition under each state _i 、Ren _i 、Rco _i And the ratio of the total constraint condition to the total constraint condition in each state is n, and the number of states is considered to be mutually independent, so that the probability of occurrence of the driving scheme combination is the probability of transition between the driving scheme combinations, thereby improving the flexibility of state transition and enabling the hexapod robot to have stronger adaptability to emergency conditions.