CN114454157A - Local track adjustment and man-machine sharing control method and system suitable for robot - Google Patents
Local track adjustment and man-machine sharing control method and system suitable for robot Download PDFInfo
- Publication number
- CN114454157A CN114454157A CN202111575047.7A CN202111575047A CN114454157A CN 114454157 A CN114454157 A CN 114454157A CN 202111575047 A CN202111575047 A CN 202111575047A CN 114454157 A CN114454157 A CN 114454157A
- Authority
- CN
- China
- Prior art keywords
- robot
- human
- time
- module
- track
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 40
- 230000003993 interaction Effects 0.000 claims abstract description 58
- 238000011156 evaluation Methods 0.000 claims abstract description 16
- 238000004590 computer program Methods 0.000 claims abstract description 8
- 239000011159 matrix material Substances 0.000 claims description 50
- 238000005457 optimization Methods 0.000 claims description 47
- 239000013598 vector Substances 0.000 claims description 32
- 230000004888 barrier function Effects 0.000 claims description 12
- 238000009826 distribution Methods 0.000 claims description 12
- 230000001186 cumulative effect Effects 0.000 claims description 11
- 230000007613 environmental effect Effects 0.000 claims description 11
- 238000001727 in vivo Methods 0.000 claims description 11
- 238000005096 rolling process Methods 0.000 claims description 11
- 238000012216 screening Methods 0.000 claims description 8
- 239000000126 substance Substances 0.000 claims description 8
- 238000013016 damping Methods 0.000 claims description 6
- 230000005484 gravity Effects 0.000 claims description 6
- 238000003860 storage Methods 0.000 claims description 6
- 238000010586 diagram Methods 0.000 description 5
- 238000002324 minimally invasive surgery Methods 0.000 description 4
- 238000001356 surgical procedure Methods 0.000 description 3
- 208000007536 Thrombosis Diseases 0.000 description 2
- 210000004204 blood vessel Anatomy 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 210000003437 trachea Anatomy 0.000 description 2
- 241000282412 Homo Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000630 rising effect Effects 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
Images
Classifications
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1656—Programme controls characterised by programming, planning systems for manipulators
- B25J9/1664—Programme controls characterised by programming, planning systems for manipulators characterised by motion, path, trajectory planning
- B25J9/1666—Avoiding collision or forbidden zones
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61B—DIAGNOSIS; SURGERY; IDENTIFICATION
- A61B34/00—Computer-aided surgery; Manipulators or robots specially adapted for use in surgery
- A61B34/30—Surgical robots
- A61B34/35—Surgical robots for telesurgery
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1602—Programme controls characterised by the control system, structure, architecture
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1674—Programme controls characterised by safety, monitoring, diagnostic
- B25J9/1676—Avoiding collision or forbidden zones
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1679—Programme controls characterised by the tasks executed
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1679—Programme controls characterised by the tasks executed
- B25J9/1689—Teleoperation
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B25—HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
- B25J—MANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
- B25J9/00—Programme-controlled manipulators
- B25J9/16—Programme controls
- B25J9/1694—Programme controls characterised by use of sensors other than normal servo-feedback from position, speed or acceleration sensors, perception control, multi-sensor controlled systems, sensor fusion
- B25J9/1697—Vision controlled systems
Abstract
The invention provides a local track adjustment and man-machine sharing control method and system suitable for a robot, which are used for improving the autonomy of a surgical robot and converting the relation between the robot and the robot from a master-slave mode to a cooperative mode. When the difference between the instruction of the human and the reference track of the robot is large, the robot can locally and actively adjust the reference track of the robot by combining the virtual interaction force of the human; when the difference between the human and the robot is small, the instructions of the human and the robot are comprehensively considered, the human-computer mixed cost function is dynamically adjusted based on the system safety evaluation index, the optimal control quantity is calculated, and the human-computer sharing control is realized. The invention also provides a corresponding computer program storage medium and a robot.
Description
Technical Field
The invention relates to the technical field of teleoperation surgical robots, in particular to a local track adjustment and man-machine sharing control method and system suitable for a robot. In particular to a local track adjustment and man-machine sharing control method and system suitable for teleoperation and suitable for a robot and a surgical robot thereof.
Background
The minimally invasive surgery robot integrates advanced intelligent robot technology into clinical surgery, fully exerts the advantages of high stability, flexible operability, motion accuracy and the like of the robot in surgery tasks, greatly reduces surgery intensity of surgeons, and avoids the risk of improving misoperation probability caused by continuous high-intensity work.
The minimally invasive surgery robot is always the key input direction of all countries in the world, and related research results are continuously promoted to be new: the da Vinci surgical robot system is the most famous, and is continuously optimized and updated, so that the performance of the da Vinci surgical robot system is more remarkable in the aspects of the operation dexterity, the safety interaction and the like of a mechanical arm. Surgical robot development in china has focused on this decade: such as "Shen Jian Hua Tuo" minimally invasive surgery robot developed by Shanghai university of transportation; "Miaomanus" series of robots developed by Tianjin university; "Huaqun-II" type minimally invasive surgery robot developed by Harbin university of industry, etc.
In the aspect of control of surgical robots, most of the current surgical robot systems are in a master-slave mode, that is, a doctor remotely controls the motion of a slave-end mechanical arm by operating a teleoperation rod, so that the automation degree is relatively low, the workload of the doctor is relatively high, and the technical level of the doctor is still relatively high. On the other hand, due to the complexity and diversity of surgical tasks, it is not possible to perform surgical tasks in a short time using fully automatically controlled robots, so the concept of man-machine sharing is more applicable to current surgical robot systems, which changes the relationship of human and robot from master-slave to cooperative, with the motion of the robot being determined by both human and robot.
The existing idea of man-machine sharing is largely applied to the control aspect, i.e. shared control. One limitation of current systems, however, is that humans cannot influence the future desired trajectory that the robot originally set through the teleoperational device, which also indicates that the robot is not sufficiently predictive of human intent. In addition, the prior art still can not adjust the control ratio of people and robot according to actual conditions developments well, and the human-computer control has not been fused with higher degree of automation yet.
Disclosure of Invention
Aiming at the defects in the prior art, the invention aims to provide a local track adjustment and man-machine sharing control method and system suitable for a robot.
The invention provides a local track adjustment and man-machine sharing control method suitable for a robot, which comprises any one or more of the following steps:
step S0, determining an initial reference trajectory based on the one-time trajectory plan: generating a feasible trajectory set based on the desired end position and the ambient environment information obtained from the sensor data; screening an optimal track from the current feasible track set as a reference track for one-time planning;
step S1, a step of local trajectory re-planning taking into account human intent: the method comprises the steps that a human being transmits a motion instruction to a robot through teleoperation equipment, whether human intention is strong or not is judged through virtual interaction force, and when the human intention is strong, the robot adjusts a local reference track;
step S2, the step of adjusting the human-computer control weight based on the system safety evaluation index: evaluating the safety of the current robot configuration through sensor data, constructing an evaluation index representing the safety of the system, and dynamically adjusting the human-computer control weight based on the evaluation index;
step S3, model prediction control based on man-machine mixed cost function: constructing a hybrid cost function based on the control cost of the robot and the human; and calculating to obtain an optimal control instruction through a model prediction controller of a hybrid cost function, so as to realize man-machine sharing control.
Preferably, the step S1 includes:
step S1.1: establishing a virtual force model representing human interaction force;
step S1.2: judging whether the human intention is strong or not through the virtual interaction force; if so, the robot does not adjust the reference track; (ii) a If not, the robot adjusts the local reference track;
the step S2 includes:
step 2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
step S2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the step S3 includes:
step S3.1: establishing a state space expression of the slave end system based on a dynamic model of the slave end system;
step S3.2: respectively calculating error vectors of the robot and the human according to expected instructions of the robot and the human;
step S3.3: at any moment k, calculating a robot cost function;
step S3.4: at any time k, calculating a human cost function;
step S3.5: establishing the k mixing cost function at the moment based on the index representing the system safety;
step S3.6: forming a control problem based on a model prediction control framework: at any time k, the optimized time domain is t ═ k, k +1, the time domain is divided into P discrete time periods with equal time step, and the objective of the rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system; the system safety constraint is that the distance between the position of the current robot and the nearest barrier is equal to the distance between the current robot and the nearest boundary;
step S3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
Preferably, in the step S0:
determining a target configuration of a robot in Cartesian space as xfinalInitial position x0Desired trajectory duration T, with the goal of planning a path from initial configuration to target configuration xfinalThe optimum trajectory is the reference trajectory xd(t), t is a time variable;
obtaining obstacle position distribution in vivo environment by sensing deviceAnd a motion boundary(ii) a Generation of feasible trajectories using fast-expanding random tree RRT algorithmCollection(ii) a An optimal track is selected from the feasible track set as a reference track x for one-time planning through the following optimization targetsd(t):
Wherein p represents a feasible track, and the optimization index is composed of three parts respectively used for representing the shortest path, avoiding obstacles to the maximum degree, avoiding boundaries to the maximum degree, and alphal,αo,αbAll are normal numbers and are used for adjusting the proportion of the three parts.
Preferably, the step S1 includes:
step S1.1: establishing a virtual force model representing human interaction force:
Mm,Dm,Kmrespectively representing an inertia matrix, a damping matrix and a rigidity matrix;
Fhcharacterizing a virtual interaction force applied by a human to the slave robot through the teleoperational device;
step S1.2: judging whether the human intention is strong through the virtual interaction force, wherein the judgment method comprises the following steps:
setting a threshold value deltaiI 1.. m, which is the corresponding virtual interaction force FhThe lower limit of the ith component,real-time monitoring of virtual interaction force F applied by a human to a slave robot via a teleoperational devicehThe value of (d);
in step S1.2, the step of adjusting the local reference trajectory includes:
step S1.2.1: determining a local track range t epsilon [ t ] to be adjusteds,tf]T is a time variable, ts、tfRespectively representing the starting time and the ending time of the local track;
Wherein the content of the first and second substances,xdii is 1, …, m isxd(t) the ith component, δ being the time interval of the selected discrete point;
step S1.2.3: for the distance from the current position to xd(tf) The local track is re-planned to generate a local feasible track set
Step S1.2.4: for a local feasible trajectory gammadLocal trace energy E (γ)d) Is gammadAdjusted trajectory energy:
α is a normal number;
r is a positive definite symmetric matrix;
from the set of feasible trajectories based on the representation adjusted trajectory energy indexSelecting a feasible track with the minimum energy of the adjusting track as an optimal track gammadAs an adjusted reference trajectory;
the step S2 includes:
step S2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
step S2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the step S2.1 comprises the steps of:
step S2.1.1: obstacle position distribution in-vivo environment obtained based on sensing dataAnd a motion boundarySubscript i represents a serial number;
step S2.1.2: constructing a current moment representation movable margin vector dres(k):
γd(k-1) is the reference trajectory position at the previous time k-1, and if it is determined in step S1.2 that the human intention is strong, the reference trajectory γ isdTo the adjusted desired trajectory, otherwise, to the reference trajectory gammadObtaining an initial reference track for one-time planning; k represents a time;
step S2.1.3: calculating an actual offset d (k) as a current position configuration x (k-1) of the robot and a corresponding expected value gammadDistance of (k-1):
d(k)=||x(k-1)-γd(k-1)||
step S2.1.4: defining a saturation function dsat:[0,dres]→(0,dres)
μ1,μ2,θ2And xi is a parameter of the Richards curve;
dmax(k)=min{d(k),dres(k) the parameter value determines the shape of the curve and is a preset value set according to the actual situation;
step S2.1.5: establishing an index lambda (k) representing the safety of the system:
the step S3 includes:
step S3.1: establishing a slave end system state space expression based on a dynamic model of the slave end system:
Mx,Cx,Gxthe method comprises the following steps that an inertia matrix, a Coriolis force centrifugal force matrix and a gravity matrix of the end robot in a Cartesian space are respectively set, and J is a Jacobian matrix from a robot joint space to the Cartesian space; the control input is the joint moment input tau of the slave end robot; wherein the state variableThe system configuration and its derivatives;
step S3.2: respectively calculating error vectors of the robots according to expected instructions of the robots, wherein the error vector of the robot is delta xr=x-γdThe human error vector is Δ xh=x-xhd(ii) a Wherein, γdIs the desired cartesian spatial configuration position of the robot; x is the number ofhdThe configuration position in the expected Cartesian space of the person is obtained by converting force input of the person through the main-end interaction device;
step S3.3: at any time k, a robot cost function C is calculatedr(k):
Q1r,Q2r,Q3rAre all positive definite matrixes;
step S3.4: at any time k, a human cost function is computed:
Q1h,Q2hare all positive definite matrixes;
step S3.5: establishing a k-time hybrid cost function based on an index lambda (k) representing the system safety:
C(k)=λ(k)Ch(k)+(1-λ(k))Cr(k)
step S3.6: based on the model predictive control framework, the following control problem is formed:
z(k+1|k)=A(k)z(k)+B(k)τ(k)+C(k)
the control problem is: at any time k, the time domain is optimized to be t ═ k, k +1]Dividing the time-domain-based data into P discrete time periods with equal time step, wherein the goal of rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system, and A (k), B (k) and C (k) are corresponding discrete coefficient matrixes; the system security constraint is hoAnd hbRespectively representing the distance between the position of the current robot and the nearest barrier and the nearest boundary, and ensuring that the distance is not less than the corresponding threshold value deltao,δb;
Step S3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
The invention provides a local track adjustment and man-machine sharing control system suitable for a robot, which comprises any one or more of the following modules:
module M0, step of determining an initial reference trajectory based on a trajectory plan: generating a feasible trajectory set based on the desired end position and the ambient environment information obtained from the sensor data; screening an optimal track from the current feasible track set as a reference track for one-time planning;
module M1, step of local trajectory re-planning taking into account the person's intention: the method comprises the steps that a human being transmits a motion instruction to a robot through teleoperation equipment, whether human intention is strong or not is judged through virtual interaction force, and when the human intention is strong, the robot adjusts a local reference track;
module M2, step of human-machine control weight adjustment based on system security assessment indicators: evaluating the safety of the current robot configuration through sensor data, constructing an evaluation index representing the safety of the system, and dynamically adjusting the human-computer control weight based on the evaluation index;
module M3, model predictive control step based on human-machine hybrid cost function: constructing a hybrid cost function based on the control cost of the robot and the human; and calculating to obtain an optimal control instruction through a model prediction controller of a hybrid cost function, so as to realize man-machine sharing control.
Preferably, said module M1 comprises:
module M1.1: establishing a virtual force model representing human interaction force;
module M1.2: judging whether the human intention is strong or not through the virtual interaction force; if so, the robot does not adjust the reference track; (ii) a If not, the robot adjusts the local reference track;
the module M2 includes:
step 2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
module M2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the module M3 includes:
module M3.1: establishing a state space expression of the slave end system based on a dynamic model of the slave end system;
module M3.2: respectively calculating error vectors of the robot and the human according to expected instructions of the robot and the human;
module M3.3: at any moment k, calculating a robot cost function;
module M3.4: at any time k, calculating a human cost function;
module M3.5: establishing the k mixing cost function at the moment based on the index representing the system safety;
module M3.6: forming a control problem based on a model prediction control framework: at any time k, the optimized time domain is t ═ k, k +1, the time domain is divided into P discrete time periods with equal time step, and the objective of the rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system; the system safety constraint is that the distance between the position of the current robot and the nearest barrier is equal to the distance between the current robot and the nearest boundary;
module M3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
Preferably, in said module M0:
determining a target configuration of a robot in Cartesian space as xfinalInitial position x0Desired trajectory duration T, with the goal of planning a path from initial configuration to target configuration xfinalThe optimum trajectory is the reference trajectory xd(t), t is a time variable;
obtaining obstacle position distribution in vivo environment by sensing deviceAnd a motion boundary(ii) a Generation of feasible trajectory sets using fast-spanning random tree RRT algorithm(ii) a An optimal track is selected from the feasible track set as a reference track x for one-time planning through the following optimization targetsd(t):
Wherein, p isPossible tracks are shown, and the optimization index is composed of three parts which are respectively used for representing the shortest path, avoiding obstacles to the maximum degree and avoiding boundaries to the maximum degree, alphal,αo,αbAll are normal numbers and are used for adjusting the proportion of the three parts.
Preferably, said module M1 comprises:
module M1.1: establishing a virtual force model representing human interaction force:
Mm,Dm,Kmrespectively representing an inertia matrix, a damping matrix and a rigidity matrix;
Fhcharacterizing a virtual interaction force applied by a human to the slave robot through the teleoperational device;
module M1.2: whether the human intention is strong or not is judged through the virtual interaction force, and the judgment system is as follows:
setting a threshold value deltaiI 1.. m, which is the corresponding virtual interaction force FhThe lower limit of the ith component monitors the virtual interaction force F applied to the slave robot by the human through the teleoperation equipment in real timehThe value of (d);
in block M1.2, the step of local reference trajectory adjustment comprises:
module M1.2.1: determining a local track range t epsilon [ t ] to be adjusteds,tf]T is a time variable, ts、tfRespectively representing the starting time and the ending time of the local track;
Wherein the content of the first and second substances,are respectivelyxd(t) the ith component, δ being the time interval of the selected discrete point;
module M1.2.3: for the distance from the current position to xd(tf) Re-planning the local track to generate a local feasible track set
Module M1.2.4: for a local feasible trajectory gammadLocal trace energy E (γ)d) Is gammadThe adjusted track energy is:
α is a normal number;
r is a positive definite symmetric matrix;
from the set of feasible trajectories based on the representation adjusted trajectory energy indexSelecting a feasible track with the minimum energy of the adjusting track as an optimal track gammadAs an adjusted reference trajectory;
the module M2 includes:
module M2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
module M2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the module M2.1 comprises the following steps:
module M2.1.1: obstacle position distribution in-vivo environment obtained based on sensing dataAnd a motion boundarySubscript i represents a serial number;
module M2.1.2: constructing a current moment representation movable margin vector dres(k):
γd(k-1) is the reference track position at the previous time k-1, and if the human intention is judged to be strong in the module M1.2, the reference track gamma isdTo the adjusted desired trajectory, otherwise, to the reference trajectory gammadObtaining an initial reference track for the primary planning; k represents a time;
module M2.1.3: calculating an actual offset d (k) as a current position configuration x (k-1) of the robot and a corresponding expected value gammadDistance of (k-1):
d(k)=||x(k-1)-γd(k-1)||
module M2.1.4: defining a saturation function dsat:[0,dres]→(0,dres)
μ1,μ2,θ2And xi is a parameter of the Richards curve;
dmax(k)=min{d(k),dres(k) the parameter value determines the shape of the curve and is a preset value set according to the actual situation;
module M2.1.5: establishing an index lambda (k) for representing the safety of the system:
the module M3 includes:
module M3.1: establishing a slave end system state space expression based on a dynamic model of the slave end system:
Mx,Cx,Gxthe method comprises the following steps that an inertia matrix, a Coriolis force centrifugal force matrix and a gravity matrix of the end robot in a Cartesian space are respectively set, and J is a Jacobian matrix from a robot joint space to the Cartesian space; the control input is the joint moment input tau of the slave end robot; wherein the state variableThe system configuration and its derivatives;
module M3.2: respectively calculating error vectors of the robots according to expected instructions of the robots, wherein the error vector of the robot is delta xr=x-γdThe human error vector is Δ xh=x-xhd(ii) a Wherein, γdIs the desired cartesian spatial configuration position of the robot; x is the number ofhdThe configuration position in the expected Cartesian space of the person is obtained by converting force input of the person through the main-end interaction device;
module M3.3: at any time k, a robot cost function C is calculatedr(k):
Q1r,Q2r,Q3rAre all positive definite matrixes;
module M3.4: at any time k, a human cost function is computed:
Q1h,Q2hare all positive definite matrixes;
module M3.5: establishing a k-time hybrid cost function based on an index lambda (k) representing the system safety:
C(k)=λ(k)Ch(k)+(1-λ(k))Cr(k)
module M3.6: based on the model predictive control framework, the following control problems are formed:
z(k+1|k)=A(k)z(k)+B(k)τ(k)+C(k)
the control problem is: at any time k, the time domain is optimized to t ═ k, k +1]Dividing the time-domain-based data into P discrete time periods with equal time step, wherein the goal of rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system, and A (k), B (k) and C (k) are corresponding discrete coefficient matrixes; the system security constraint is hoAnd hbRespectively representing the distance between the position of the current robot and the nearest barrier and the nearest boundary, and ensuring that the distance is not less than the corresponding threshold value deltao,δb;
Module M3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
According to the present invention, there is provided a computer readable storage medium storing a computer program, which when executed by a processor, implements the steps of the method for robot local trajectory adjustment and human-machine sharing control.
According to the invention, the robot comprises the local track adjustment and man-machine sharing control system suitable for the robot, or comprises the computer readable storage medium storing the computer program.
Compared with the prior art, the invention has the following beneficial effects:
1. the invention can further improve the autonomy of the surgical robot and change the relation between the robot and the robot from a master-slave mode to a cooperative mode.
2. In the invention, when the motion instruction provided by the person to the slave robot through the teleoperation device is greatly different from the reference track of the robot, the robot locally and actively adjusts the reference track of the robot by combining the virtual interaction force of the person, and simultaneously avoids the following two possible situations: firstly, the robot still masters a larger control right and ignores the intention of the person; secondly, the palm of a person holds a larger weight, continuous active operation is carried out, the workload of an operator is increased, and meanwhile, the safety in the operation cannot be well ensured;
3. the invention comprehensively considers the instructions of both the human and the machine, evaluates the safety degree of the current slave end mechanical arm configuration in real time, constructs the safety of a corresponding index quantification system, and dynamically adjusts the control proportion of the human and the robot: when the safety is higher, the human is given a larger weight, otherwise, the robot has a larger weight, and the human-computer control is fused with a higher automation degree.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments with reference to the following drawings:
fig. 1 is a schematic diagram of the principle of the present invention.
Fig. 2 is an overall block diagram of the present invention.
Fig. 3 is a Richards curve.
FIG. 4 shows the system safety indexes λ and dsatThe mapping relationship of (2).
FIG. 5 is a block diagram of a shared control module.
Detailed Description
The present invention will be described in detail with reference to specific examples. The following examples will assist those skilled in the art in further understanding the invention, but are not intended to limit the invention in any way. It should be noted that it would be obvious to those skilled in the art that various changes and modifications can be made without departing from the spirit of the invention. All falling within the scope of the present invention.
The invention provides a local track adjustment and man-machine sharing control method suitable for a robot, which comprises the following steps:
step S0: determining an initial reference trajectory based on the one-time trajectory plan;
step S1: a step of local trajectory re-planning taking into account the intention of the person;
step S2: adjusting the human-computer control weight based on the system safety evaluation index;
step S3: a step of model predictive control based on a man-machine hybrid cost function;
the invention can further improve the autonomy of the surgical robot and change the relation between the robot and the robot from a master-slave mode to a cooperative mode. When the difference between the instruction of the person and the reference track of the robot is large, the robot can locally and actively adjust the reference track of the robot by combining the virtual interaction force of the person; when the difference between the human and the robot is small, instructions of both the human and the robot are comprehensively considered, a human-computer mixed cost function is dynamically adjusted based on the system safety evaluation index, the optimal control quantity is calculated, and human-computer sharing control is realized.
In this embodiment, a schematic diagram and an overall block diagram of the method are respectively shown in fig. 1 and fig. 2, and the method includes the following steps:
a step of determining an initial reference trajectory based on a trajectory plan, denoted as step S0, specifically: determining the target configuration of the robot in Cartesian space as x according to the requirements of the surgical taskfinalInitial position x0If the desired trajectory duration T is desired, then the goal of this step is to plan a path from the initial configuration to the target configuration xfinalThe optimum trajectory is the reference trajectory xdAnd (t), wherein t is a time variable and has a unit of seconds. Firstly, the position distribution of obstacles (organs, tissues, blood clots and the like) in the internal environment is obtained through sensing equipment such as an external CT, an ultrasonic probe, an endoscope and the likeAnd the boundary of motion (trachea, blood vessel, etc.)Generating a set of feasible trajectories using a fast-expanding random tree (RRT) algorithm(ii) a Optimizing the objective byScreening out an optimal track from the feasible track set as a reference track x for one-time planningd(t):
Wherein p represents a feasible track, and the optimization index is composed of three parts respectively used for representing the shortest path, avoiding obstacles to the maximum degree, avoiding boundaries to the maximum degree, and alphal,αo,αbAll are normal numbers and are used for adjusting the proportion of the three parts.
The step of local trajectory re-planning, which takes human intent into consideration, is denoted as step S1, specifically: the method comprises the steps that a human transmits virtual interaction force to a slave-end robot by operating a master-end teleoperation device, whether the operation intention of the current human is strong or not is judged by a virtual interaction force system, when the virtual interaction force is larger than a certain threshold value, the human control intention is strong, and the robot locally adjusts an original reference track;
a step of adjusting the human-computer control weight based on the system security evaluation index, which is denoted as step S2, specifically: evaluating whether the current robot configuration is safe in the environment through sensor data, constructing a system safety index based on the concept of movable allowance, quantifying the safety degree of the current system, and dynamically adjusting the human-computer control weight based on the index;
the model predictive control step based on the human-computer hybrid cost function is denoted as step S3, and specifically: constructing a discrete state space expression based on a slave end system dynamic model; respectively calculating the cost of the robot and the human according to the expectation of the robot and the human, constructing a mixed cost function by combining with the system safety index, further forming a model prediction control problem, and realizing human-computer sharing control by iteratively solving an optimal control instruction.
Step S0 is to obtain an initial reference trajectory of the robot through a planning method, and the initial reference trajectory may also be directly set manually according to actual task requirements.
The step S1 includes:
step S1.1: establishing a virtual force model representing human interaction force:
wherein the content of the first and second substances,representing the current position of the end of the robot arm,to the corresponding desired value. FhA virtual interaction force applied by a human to the slave robot through the teleoperational device is characterized. M is a group ofm,Dm,KmRespectively representing an inertia matrix, a damping matrix and a rigidity matrix.
Step S1.2: judging whether the human intention is strong through the virtual interaction force, wherein the judgment method comprises the following steps:
setting a threshold value deltaiI 1.. m, which is the corresponding virtual interaction force FhThe lower limit of the ith component monitors the virtual interaction force F applied to the slave robot by the human through the teleoperation equipment in real timehThe value of (d);
1) if it isFh≤δiIf so, the deviation between the current expected command of the person and the expected motion of the robot is small, and the robot does not adjust the reference track;
2) if it isFh>δiIf the difference between the current expected command of the robot and the expected motion of the robot is large, the robot adjusts the local reference track.
In step S1.2, the step of adjusting the local reference trajectory includes:
step S1.2.1: determining a local track range t epsilon [ t ] to be adjusteds,tf]T is timeThe time variable, in seconds, ts、tfRespectively representing the starting time and the ending time of the local track;
Wherein the content of the first and second substances,xdii is 1, …, m isxd(t) the ith component, δ being the time interval of the selected discrete point;
step S1.2.3: for the distance from the current position to xd(tf) The local track is re-planned, and a local feasible track set is searched and generated through an RRT algorithm
Step S1.2.4: for a local feasible trajectory gammadLocal trace energy E (γ)d) Is gammadThe adjusted track energy is:
wherein the content of the first and second substances,in order to obtain the original local energy of the track, is a column vector of all 1, fhi、γdiAre respectively Fh、γdα is a positive constant, and R is a positive definite symmetric matrix. It can be observed that the adjusted trajectory energy consists of three parts, namely a first original local trajectory energy, a second adjusted trajectory energy which is a work done by a human being, and a third adjusted trajectory energy which is a square norm of the matrix R;
from the set of feasible trajectories based on the representation adjusted trajectory energy indexSelecting an optimal track gammadAs adjusted reference trajectories, the screening criteria were as follows:
it can be seen that the screening criterion is actually to select a feasible trajectory with the minimum energy of the adjustment trajectory, since the optimization objective contains invariantAnd removing all terms contained in the invariant quantity is the screening standard. The selection of the matrix R can be determined by actual conditions, and here, we provide a selection mode based on a Minimum-jerk model:
because the Minimum-jerk model can more accurately express the motion trail of the human body, the selection mode ensures that the adjusted trail is more in line with the motion habit of the human body.
The step S2 includes:
step S2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of a system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
s2.2, taking the system safety index as a basis for adjusting the man-machine control weight;
the step S2.1 comprises the steps of:
step S2.1.1: location distribution of obstacles (organs, tissues, blood clots, etc.) in an in vivo environment based on sensed dataAnd the boundary of motion (trachea, blood vessel, etc.);
Step S2.1.2: constructing a current moment representation movable margin vector dres(k):
Wherein, gamma isd(k-1) is the reference trajectory position at the previous time, and if it is determined in step S1.2 that the human intention is strong, the reference trajectory γ isdTo the adjusted desired trajectory, otherwise, to the reference trajectory gammadObtaining an initial reference track for one-time planning; k represents a time;
step S2.1.3: calculating the actual offset as the distance between the current position configuration of the robot and the corresponding expected value:
d(k)=||x(k-1)-γd(k-1)||
step S2.1.4: a saturation function d is defined using the Richards curve as shown in FIG. 3sat:[0,dres]→(0,dres)
Wherein d ismax(k)=min{d(k),dres(k)},μ1,μ2,θ2Xi is the parameter of the Richards curve, and the parameter value determines the shape of the curve according toActual situation set predetermined value. Theta2The rising rate of the saturation function is determined for positive real numbers, xi is positive real numbers, the degree of curvature of the curve is determined, mu1Is a normal number close to 1, determines the position of the upper asymptote of the curve, mu2Is a normal number, the length of the curve front end lag;
step S2.1.5: establishing an index lambda (k) representing the safety of the system:
the index and dsatThe mapping relationship of (2) is shown in fig. 4.
The step S3 is that the structure of the human-machine sharing control module is shown in fig. 5, and includes:
step S3.1: establishing a slave end system state space expression based on a dynamic model of the slave end system:
wherein the state variableThe system configuration and its derivatives; the control input is the joint moment input tau of the slave end robot; m is a group ofx,Cx,GxThe method comprises the following steps that an inertia matrix, a Coriolis force centrifugal force matrix and a gravity matrix of the end robot in a Cartesian space are respectively set, and J is a Jacobian matrix from a robot joint space to the Cartesian space;
step S3.2, error vectors of the robot and the human are respectively calculated according to expected instructions of the robot and the human, wherein the error vector of the robot is delta xr=x-γdThe human error vector is Δ xh=x-xhd(ii) a Wherein,γdThe configuration position in the expected Cartesian space of the robot can be obtained by the paths of track planning and the like; x is the number ofhdIs the expected configuration position in the Cartesian space of the human, and is obtained by the force input conversion of the human through the main-end interaction device.
Step S3.3: at any time k, a robot cost function is calculated:
the above equation is composed of two parts of a quadratic form (first two terms) of the error vector of the robot and its derivative and a quadratic form (third term) of the robot control input, where Q1r,Q2r,Q3rAre all positive definite matrixes;
step S3.4: at any time k, a human cost function is computed:
the above equation consists of a quadratic form of the human error vector and its derivatives, where Q1h,Q2hAre all positive definite matrixes;
step S3.5: establishing a k-time hybrid cost function based on an index lambda (k) representing the system safety:
C(k)=λ(k)Ch(k)+(1-λ(k))Cr(k)
step S3.6: based on the model predictive control framework, the following control problem is formed:
z(k+1|k)=A(k)z(k)+B(k)τ(k)+C(k)
the control problem is: at any time k, the time domain is optimized to t ═ k, k +1]Dividing the time-domain-based data into P discrete time periods with equal time step, wherein the goal of rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system, and A (k), B (k) and C (k) are corresponding discrete coefficient matrixes; the system security constraint is hoAnd hbRespectively representing the distance between the position of the current robot and the nearest barrier and the nearest boundary, and ensuring that the distance is not less than the corresponding threshold value deltao,δb;
And step 3.7, solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k. There are many alternative methods for solving the Quadratic Programming (QP) optimization problem, including active set, interior point method, first-order optimization method, etc., and there are also many open QP solution libraries that can be directly called at present.
The invention further provides a local trajectory adjusting and man-machine sharing control system suitable for the robot, and a person skilled in the art can realize the local trajectory adjusting and man-machine sharing control system suitable for the robot by executing the step flow of the local trajectory adjusting and man-machine sharing control method suitable for the robot, namely, the method can be understood as a preferred embodiment of the local trajectory adjusting and man-machine sharing control system suitable for the robot.
The invention provides a local track adjustment and man-machine sharing control system suitable for a robot, which comprises any one or more of the following modules:
module M0, step of determining an initial reference trajectory based on a trajectory plan: generating a feasible trajectory set based on the desired end position and the ambient environment information obtained from the sensor data; screening an optimal track from the current feasible track set as a reference track for one-time planning;
module M1, step of local trajectory re-planning taking into account the person's intention: the method comprises the steps that a human being transmits a motion instruction to a robot through teleoperation equipment, whether human intention is strong or not is judged through virtual interaction force, and when the human intention is strong, the robot adjusts a local reference track;
module M2, step of human-machine control weight adjustment based on system security assessment indicators: evaluating the safety of the current robot configuration through sensor data, constructing an evaluation index representing the safety of the system, and dynamically adjusting the human-computer control weight based on the evaluation index;
module M3, model predictive control step based on human-machine hybrid cost function: constructing a hybrid cost function based on the control cost of the robot and the human; and calculating to obtain an optimal control instruction through a model prediction controller of a hybrid cost function, so as to realize man-machine sharing control.
Preferably, said module M1 comprises:
module M1.1: establishing a virtual force model representing human interaction force;
module M1.2: judging whether the human intention is strong or not through the virtual interaction force; if so, the robot does not adjust the reference track; (ii) a If not, the robot adjusts the local reference track;
the module M2 includes:
step 2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
module M2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the module M3 includes:
module M3.1: establishing a state space expression of the slave end system based on a dynamic model of the slave end system;
module M3.2: respectively calculating error vectors of the robot and the human according to expected instructions of the robot and the human;
module M3.3: at any moment k, calculating a robot cost function;
module M3.4: at any time k, calculating a human cost function;
module M3.5: establishing the k mixing cost function at the moment based on the index representing the system safety;
module M3.6: forming a control problem based on a model prediction control framework: at any time k, the optimized time domain is t ═ k, k +1, the time domain is divided into P discrete time periods with equal time step, and the objective of the rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system; the system safety constraint is that the distance between the position of the current robot and the nearest barrier is equal to the distance between the current robot and the nearest boundary;
module M3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
Preferably, in said module M0:
determining a target configuration of a robot in Cartesian space as xfinalInitial position x0Desired trajectory duration T, with the goal of planning a path from initial configuration to target configuration xfinalThe optimum trajectory is the reference trajectory xd(t), t is a time variable;
obtaining obstacle position distribution in vivo environment by sensing deviceAnd a motion boundary(ii) a Generation of feasible trajectory sets using fast-spanning random tree RRT algorithm(ii) a An optimal track is selected from the feasible track set as a reference track x for one-time planning through the following optimization targetsd(t):
Wherein, p isPossible tracks are shown, and the optimization index is composed of three parts which are respectively used for representing the shortest path, avoiding obstacles to the maximum degree and avoiding boundaries to the maximum degree, alphal,αo,αbAll are normal numbers and are used for adjusting the proportion of the three parts.
Preferably, said module M1 comprises:
module M1.1: establishing a virtual force model representing human interaction force:
Mm,Dm,Kmrespectively representing an inertia matrix, a damping matrix and a rigidity matrix;
Fhcharacterizing a virtual interaction force applied by a human to the slave robot through the teleoperational device;
module M1.2: whether the human intention is strong or not is judged through the virtual interaction force, and the judgment system is as follows:
setting a threshold value deltaiI 1.. m, which is the corresponding virtual interaction force FhThe lower limit of the ith component is used for monitoring the virtual interaction force F applied to the slave robot by the human through the teleoperation equipment in real timehThe value of (d);
in module M1.2, the step of adjusting the local reference trajectory comprises:
module M1.2.1: determining a local track range t epsilon [ t ] to be adjusteds,tf]T is a time variable, ts、tfRespectively representing the starting time and the ending time of the local track;
Wherein the content of the first and second substances,xdii is 1, …, m isxd(t) the ith component, δ being the time interval of the selected discrete point;
module M1.2.3: for the distance from the current position to xd(tf) Re-planning the local track to generate a local feasible track set
Module M1.2.4: for a local feasible trajectory gammadLocal trace energy E (γ)d) Is gammadThe adjusted track energy is:
α is a normal number;
r is a positive definite symmetric matrix;
from the set of feasible trajectories based on the representation adjusted trajectory energy indexSelecting a feasible track with the minimum energy of the adjusting track as an optimal track gammadAs an adjusted reference trajectory;
the module M2 includes:
module M2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
module M2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the module M2.1 comprises the following steps:
module M2.1.1: obstacle position distribution in-vivo environment obtained based on sensing dataAnd a motion boundarySubscript i represents a serial number;
module M2.1.2: constructing a representative movable margin vector d at the current momentres(k):
γd(k-1) is the reference track position at the previous time k-1, and if the human intention is judged to be strong in the module M1.2, the reference track gamma isdTo the adjusted desired trajectory, otherwise, the reference trajectory gammadObtaining an initial reference track for one-time planning; k represents a time;
module M2.1.3: calculating an actual offset d (k) as a current position configuration x (k-1) of the robot and a corresponding expected value gammadDistance of (k-1):
d(k)=||x(k-1)-γd(k-1)||
module M2.1.4: defining a saturation function dsat:[0,dres]→(0,dres)
μ1,μ2,θ2And xi is a parameter of the Richards curve;
dmax(k)=min{d(k),dres(k) the parameter value determines the shape of the curve and is a preset value set according to the actual situation;
module M2.1.5: establishing an index lambda (k) representing the safety of the system:
the module M3 includes:
module M3.1: establishing a slave end system state space expression based on a dynamic model of the slave end system:
Mx,Cx,Gxthe method comprises the following steps that an inertia matrix, a Coriolis force centrifugal force matrix and a gravity matrix of the end robot in a Cartesian space are respectively set, and J is a Jacobian matrix from a robot joint space to the Cartesian space; the control input is the joint moment input tau of the slave end robot; wherein the state variableThe system configuration and its derivatives;
module M3.2: respectively calculating error vectors of the robots according to expected instructions of the robots, wherein the error vector of the robot is delta xr=x-γdThe human error vector is Δ xh=x-xhd(ii) a Wherein, γdIs the desired cartesian spatial configuration position of the robot; x is the number ofhdThe configuration position in the expected Cartesian space of the person is obtained by converting the force input of the person through the main-end interaction device;
module M3.3: at any time k, a robot cost function C is calculatedr(k):
Q1r,Q2r,Q3rAre all positive definite matrixes;
module M3.4: at any time k, a human cost function is computed:
Q1h,Q2hare all positive definite matrixes;
module M3.5: establishing a k-time hybrid cost function based on an index lambda (k) representing the system safety:
C(k)=λ(k)Ch(k)+(1-λ(k))Cr(k)
module M3.6: based on the model predictive control framework, the following control problem is formed:
z(k+1|k)=A(k)z(k)+B(k)τ(k)+C(k)
the control problem is: at any time k, the time domain is optimized to t ═ k, k +1]Dividing the time-domain-based data into P discrete time periods with equal time step, wherein the goal of rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system, and A (k), B (k) and C (k) are corresponding discrete coefficient matrixes; the system security constraint is hoAnd hbRespectively representing the distance between the position of the current robot and the nearest barrier and the nearest boundary, and ensuring that the distance is not less than the corresponding threshold value deltao,δb;
Module M3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
Those skilled in the art will appreciate that, in addition to implementing the systems, apparatus, and various modules thereof provided by the present invention in purely computer readable program code, the same procedures can be implemented entirely by logically programming method steps such that the systems, apparatus, and various modules thereof are provided in the form of logic gates, switches, application specific integrated circuits, programmable logic controllers, embedded microcontrollers and the like. Therefore, the system, the device and the modules thereof provided by the present invention can be considered as a hardware component, and the modules included in the system, the device and the modules thereof for implementing various programs can also be considered as structures in the hardware component; modules for performing various functions may also be considered to be both software programs for performing the methods and structures within hardware components.
According to the present invention, there is provided a computer readable storage medium storing a computer program, which when executed by a processor, implements the steps of the method for robot local trajectory adjustment and human-machine sharing control.
According to the invention, the robot comprises the local track adjustment and man-machine sharing control system suitable for the robot, or comprises the computer readable storage medium storing the computer program.
The foregoing description of specific embodiments of the present invention has been presented. It is to be understood that the present invention is not limited to the specific embodiments described above, and that various changes or modifications may be made by one skilled in the art within the scope of the appended claims without departing from the spirit of the invention. The embodiments and features of the embodiments of the present application may be combined with each other arbitrarily without conflict.
Claims (10)
1. A local track adjustment and man-machine sharing control method suitable for a robot is characterized by comprising any one or more of the following steps:
step S0, determining an initial reference trajectory based on the one-time trajectory plan: generating a feasible trajectory set based on the desired end position and the ambient environment information obtained from the sensor data; screening an optimal track from the current feasible track set as a reference track for one-time planning;
step S1, a step of local trajectory re-planning taking into account human intent: the method comprises the steps that a human being transmits a motion instruction to a robot through teleoperation equipment, whether human intention is strong or not is judged through virtual interaction force, and when the human intention is strong, the robot adjusts a local reference track;
step S2, the step of adjusting the human-computer control weight based on the system safety evaluation index: evaluating the safety of the current robot configuration through sensor data, constructing an evaluation index representing the safety of the system, and dynamically adjusting the human-computer control weight based on the evaluation index;
step S3, model prediction control based on man-machine mixed cost function: constructing a hybrid cost function based on the control cost of the robot and the human; and calculating to obtain an optimal control instruction through a model prediction controller of a hybrid cost function, so as to realize man-machine sharing control.
2. The method for robot local trajectory adjustment and human-machine sharing control according to claim 1,
the step S1 includes:
step S1.1: establishing a virtual force model representing human interaction force;
step S1.2: judging whether the human intention is strong or not through the virtual interaction force; if so, the robot does not adjust the reference track; (ii) a If not, the robot adjusts the local reference track;
the step S2 includes:
step 2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
step S2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the step S3 includes:
step S3.1: establishing a state space expression of the slave end system based on a dynamic model of the slave end system;
step S3.2: respectively calculating error vectors of the robot and the human according to expected instructions of the robot and the human;
step S3.3: at any moment k, calculating a robot cost function;
step S3.4: at any time k, calculating a human cost function;
step S3.5: establishing the k mixing cost function at the moment based on the index representing the system safety;
step S3.6: forming a control problem based on a model prediction control framework: at any time k, the optimized time domain is t ═ k, k +1, the time domain is divided into P discrete time periods with equal time step, and the objective of the rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system; the system safety constraint is that the distance between the position of the current robot and the nearest barrier is equal to the distance between the current robot and the nearest boundary;
step S3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
3. The method for robot local trajectory adjustment and human-machine sharing control according to claim 2, wherein in step S0:
determining the target configuration of the robot in Cartesian space as xfinalInitial position x0Desired trajectory duration T, with the goal of planning a path from initial configuration to target configuration xfinalThe optimum trajectory is the reference trajectory xd(t), t is a time variable;
obtaining obstacle position distribution in vivo environment by sensing deviceAnd a motion boundaryGeneration of feasible trajectory sets using fast-spanning random tree RRT algorithmAn optimal track is selected from the feasible track set as a reference track x for one-time planning through the following optimization targetsd(t):
Wherein p represents a feasible track, and the optimization index comprises three parts, which are respectively used for representing the shortest path, avoiding obstacles to the maximum degree, avoiding boundaries to the maximum degree, and alphal,αo,αbAll are normal numbers and are used for adjusting the proportion of the three parts.
4. The method for robot local trajectory adjustment and human-machine sharing control according to claim 2,
the step S1 includes:
step S1.1: establishing a virtual force model representing human interaction force:
Mm,Dm,Kmrespectively representing an inertia matrix, a damping matrix and a rigidity matrix;
Fhcharacterizing a virtual interaction force applied by a human to the slave robot through the teleoperational device;
step S1.2: judging whether the human intention is strong through the virtual interaction force, wherein the judgment method comprises the following steps:
setting a threshold value deltaiI 1.. m, which is the corresponding virtual interaction force FhThe lower limit of the ith component monitors the virtual interaction force F applied to the slave robot by the human through the teleoperation equipment in real timehThe value of (d);
in step S1.2, the step of adjusting the local reference trajectory includes:
step S1.2.1:determining a local track range t epsilon [ t ] to be adjusteds,tf]T is a time variable, ts、tfRespectively representing the starting time and the ending time of the local track;
Wherein the content of the first and second substances,are respectivelyxd(t) the ith component, δ being the time interval of the selected discrete point;
step S1.2.3: for the distance from the current position to xd(tf) Re-planning the local track to generate a local feasible track set
Step S1.2.4: for a local feasible trajectory gammadLocal trace energy E (γ)d) Is gammadThe adjusted track energy is:
α is a normal number;
r is a positive definite symmetric matrix;
from the set of feasible trajectories based on the representation adjusted trajectory energy indexSelecting a feasible track with the minimum energy of the adjusting track as an optimal track gammadAs an adjusted reference trajectory;
the step S2 includes:
step S2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
step S2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the step S2.1 comprises the steps of:
step S2.1.1: obstacle position distribution in-vivo environment obtained based on sensing dataAnd a motion boundarySubscript i represents a serial number;
step S2.1.2: constructing a current moment representation movable margin vector dres(k):
γd(k-1) is the reference trajectory position at the previous time k-1, and if it is determined in step S1.2 that the human intention is strong, the reference trajectory γ isdTo the adjusted desired trajectory, otherwise, to the reference trajectory gammadObtaining an initial reference track for one-time planning; k represents a time;
step S2.1.3: calculating an actual offset d (k) as a current position configuration x (k-1) of the robot and a corresponding expected value gammadDistance of (k-1):
d(k)=||x(k-1)-γd(k-1)||
step S2.1.4: defining a saturation function dsat:[0,dres]→(0,dres)
μ1,μ2,θ2And xi is a parameter of the Richards curve;
dmax(k)=min{d(k),dres(k) the parameter value determines the shape of the curve and is a preset value set according to the actual situation;
step S2.1.5: establishing an index lambda (k) representing the safety of the system:
the step S3 includes:
step S3.1: establishing a slave end system state space expression based on a dynamic model of the slave end system:
Mx,Cx,Gxthe method comprises the following steps that an inertia matrix, a Coriolis force centrifugal force matrix and a gravity matrix of the end robot in a Cartesian space are respectively set, and J is a Jacobian matrix from a robot joint space to the Cartesian space; the control input is the joint moment input tau of the slave end robot; wherein the state variableThe system configuration and its derivatives;
step S3.2: respectively calculating error vectors of the robots according to expected instructions of the robots, wherein the error vector of the robot is delta xr=x-γdThe human error vector is Δ xh=x-xhd(ii) a Wherein, γdIs the desired cartesian spatial configuration position of the robot; x is the number ofhdThe configuration position in the expected Cartesian space of the person is obtained by converting force input of the person through the main-end interaction device;
step S3.3: at any time k, a robot cost function C is calculatedr(k):
Q1r,Q2r,Q3rAre all positive definite matrixes;
step S3.4: at any time k, a human cost function is computed:
Q1h,Q2hare all positive definite matrixes;
step S3.5: establishing a k-time hybrid cost function based on an index lambda (k) representing the system safety:
C(k)=λ(k)Ch(k)+(1-λ(k))Cr(k)
step S3.6: based on the model predictive control framework, the following control problem is formed:
z(k+1|k)=A(k)z(k)+B(k)τ(k)+C(k)
the control problem is: at any time k, the time domain is optimized to t ═ k, k +1]Dividing the time-domain-based data into P discrete time periods with equal time step, wherein the goal of rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system, and A (k), B (k) and C (k) are corresponding discrete coefficient matrixes; the system security constraint is hoAnd hbRespectively representing the distance between the position of the current robot and the nearest barrier and the nearest boundary, and ensuring that the distance is not less than the corresponding threshold value deltao,δb;
Step S3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
5. A local track adjustment and man-machine sharing control system suitable for a robot is characterized by comprising any one or more of the following modules:
module M0, step of determining an initial reference trajectory based on a trajectory plan: generating a feasible trajectory set based on the desired end position and the ambient environment information obtained from the sensor data; screening an optimal track from the current feasible track set as a reference track for one-time planning;
module M1, step of local trajectory re-planning taking into account the person's intention: the method comprises the steps that a human being transmits a motion instruction to a robot through teleoperation equipment, whether human intention is strong or not is judged through virtual interaction force, and when the human intention is strong, the robot adjusts a local reference track;
module M2, step of human-machine control weight adjustment based on system security assessment indicators: evaluating the safety of the current robot configuration through sensor data, constructing an evaluation index representing the safety of the system, and dynamically adjusting the human-computer control weight based on the evaluation index;
module M3, model predictive control step based on human-machine hybrid cost function: constructing a hybrid cost function based on the control cost of the robot and the human; and calculating to obtain an optimal control instruction through a model prediction controller of a hybrid cost function, so as to realize man-machine sharing control.
6. The system for robot local trajectory adjustment and human-machine sharing control according to claim 1,
the module M1 includes:
module M1.1: establishing a virtual force model representing human interaction force;
module M1.2: judging whether the human intention is strong or not through the virtual interaction force; if so, the robot does not adjust the reference track; (ii) a If not, the robot adjusts the local reference track;
the module M2 includes:
step 2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
module M2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the module M3 includes:
module M3.1: establishing a state space expression of the slave end system based on a dynamic model of the slave end system;
module M3.2: respectively calculating error vectors of the robot and the human according to expected instructions of the robot and the human;
module M3.3: at any moment k, calculating a robot cost function;
module M3.4: at any time k, calculating a human cost function;
module M3.5: establishing the k mixing cost function at the moment based on the index representing the system safety;
module M3.6: forming a control problem based on a model prediction control framework: at any time k, the optimized time domain is t ═ k, k +1, the time domain is divided into P discrete time periods with equal time step, and the objective of the rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system; the system safety constraint is the distance between the position of the current robot and the nearest barrier and the nearest boundary;
module M3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
7. The system for robot local trajectory adjustment and human-machine sharing control according to claim 6, wherein in said module M0:
determining a target configuration of a robot in Cartesian space as xfinalInitial position x0Desired trajectory duration T, with the goal of planning a path from initial configuration to target configuration xfinalThe optimum trajectory is the reference trajectory xd(t), t is a time variable;
obtaining obstacle position distribution in vivo environment by sensing deviceAnd a motion boundaryGeneration of feasible trajectory sets using fast-expanding random tree RRT algorithmAn optimal track is selected from the feasible track set as a reference track x for one-time planning through the following optimization targetsd(t):
Wherein p represents a feasible track, the optimization index consists of three parts which are respectively used for representing the shortest path, avoiding barriers to the maximum extent and avoiding boundaries to the maximum extent,αl,αo,αball are normal numbers and are used for adjusting the proportion of the three parts.
8. The system for robot local trajectory adjustment and human-machine sharing control according to claim 6,
the module M1 includes:
module M1.1: establishing a virtual force model representing human interaction force:
Mm,Dm,Kmrespectively representing an inertia matrix, a damping matrix and a rigidity matrix;
Fhcharacterizing a virtual interaction force applied by a human to the slave robot through the teleoperational device;
module M1.2: whether the human intention is strong or not is judged through the virtual interaction force, and the judgment system is as follows:
setting a threshold value deltaiI 1.. m, which is the corresponding virtual interaction force FhThe lower limit of the ith component monitors the virtual interaction force F applied to the slave robot by the human through the teleoperation equipment in real timehThe value of (d);
in block M1.2, the step of local reference trajectory adjustment comprises:
module M1.2.1: determining a local track range t epsilon [ t ] to be adjusteds,tf]T is a time variable, ts、tfRespectively representing the starting time and the ending time of the local track;
Wherein the content of the first and second substances,are respectivelyxd(t) the ith component, δ being the time interval of the selected discrete point;
module M1.2.3: for the distance from the current position to xd(tf) Re-planning the local track to generate a local feasible track set
Module M1.2.4: for a local feasible trajectory gammadLocal trace energy E (γ)d) Is gammadThe adjusted track energy is:
α is a normal number;
r is a positive definite symmetric matrix;
from the set of feasible trajectories based on the representation adjusted trajectory energy indexSelecting a feasible track with the minimum energy of the adjusting track as an optimal track gammadAs an adjusted reference trajectory;
the module M2 includes:
module M2.1: sensing environmental information in real time based on data of a sensor, and establishing an index representing the safety of the system; wherein the environment information comprises a distance from a boundary and a distance from an obstacle;
module M2.2: taking the system safety index as a basis for adjusting the man-machine control weight;
the module M2.1 comprises the following steps:
module M2.1.1: obstacle position distribution in-vivo environment obtained based on sensing dataAnd a motion boundarySubscript i represents a serial number;
module M2.1.2: constructing a current moment representation movable margin vector dres(k):
γd(k-1) is the reference track position at the previous time k-1 ifIf the module M1.2 judges that the human intention is stronger, the reference track gamma is determineddTo the adjusted desired trajectory, otherwise, to the reference trajectory gammadObtaining an initial reference track for one-time planning; k represents a time;
module M2.1.3: calculating an actual offset d (k) as a current position configuration x (k-1) of the robot and a corresponding expected value gammadDistance of (k-1):
d(k)=||x(k-1)-γd(k-1)||
module M2.1.4: defining a saturation function dsat:[0,dres]→(0,dres)
μ1,μ2,θ2And xi is a parameter of the Richards curve;
dmax(k)=min{d(k),dres(k) the parameter value determines the shape of the curve and is a preset value set according to the actual situation;
module M2.1.5: establishing an index lambda (k) representing the safety of the system:
the module M3 includes:
module M3.1: establishing a slave end system state space expression based on a dynamic model of the slave end system:
Mx,Cx,Gxrespectively, the slave robot is in Cartesian spaceAn inertia matrix, a Coriolis force centrifugal force matrix and a gravity matrix among the three matrixes, wherein J is a Jacobian matrix from a robot joint space to a Cartesian space; the control input is the joint moment input tau of the slave end robot; wherein the state variableThe system configuration and its derivatives;
module M3.2: respectively calculating error vectors of the robots according to expected instructions of the robots, wherein the error vector of the robot is delta xr=x-γdThe human error vector is Δ xh=x-xhd(ii) a Wherein, gamma isdIs the desired cartesian spatial configuration position of the robot; x is the number ofhdThe configuration position in the expected Cartesian space of the person is obtained by converting force input of the person through the main-end interaction device;
module M3.3: at any time k, a robot cost function C is calculatedr(k):
Q1r,Q2r,Q3rAre all positive definite matrixes;
module M3.4: at any time k, a human cost function is computed:
Q1h,Q2hare all positive definite matrixes;
module M3.5: establishing a k-time hybrid cost function based on an index lambda (k) representing the system safety:
C(k)=λ(k)Ch(k)+(1-λ(k))Cr(k)
module M3.6: based on the model predictive control framework, the following control problem is formed:
z(k+1|k)=A(k)z(k)+B(k)τ(k)+C(k)
the control problem is: at any time k, the time domain is optimized to t ═ k, k +1]Dividing the time-domain-based data into P discrete time periods with equal time step, wherein the goal of rolling optimization is to minimize the cumulative mixed cost function in the time domain; the prediction model is a state space expression established based on a dynamic model of the slave end system, and A (k), B (k) and C (k) are corresponding discrete coefficient matrixes; the system security constraint is hoAnd hbRespectively representing the distance between the position of the current robot and the nearest barrier and the nearest boundary, and ensuring that the distance is not less than the corresponding threshold value deltao,δb;
Module M3.7: and solving the optimization problem in each optimization time domain t ═ k, k +1, and calculating the optimal control quantity input at the time k.
9. A computer-readable storage medium storing a computer program, wherein the computer program, when executed by a processor, implements the steps of the method for robot local trajectory adjustment and human-machine sharing control of any one of claims 1 to 4.
10. A robot comprising the local trajectory adjustment and human-machine sharing control system for a robot according to any one of claims 5 to 8, or comprising the computer-readable storage medium of claim 9 having a computer program stored thereon.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111575047.7A CN114454157A (en) | 2021-12-21 | 2021-12-21 | Local track adjustment and man-machine sharing control method and system suitable for robot |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111575047.7A CN114454157A (en) | 2021-12-21 | 2021-12-21 | Local track adjustment and man-machine sharing control method and system suitable for robot |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114454157A true CN114454157A (en) | 2022-05-10 |
Family
ID=81405596
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111575047.7A Pending CN114454157A (en) | 2021-12-21 | 2021-12-21 | Local track adjustment and man-machine sharing control method and system suitable for robot |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114454157A (en) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150201711A1 (en) * | 2014-01-23 | 2015-07-23 | Nike, Inc. | Automated assembly and stitching of shoe parts |
CN108388270A (en) * | 2018-03-21 | 2018-08-10 | 天津大学 | Cluster unmanned plane track posture cooperative control method towards security domain |
CN110996825A (en) * | 2017-07-13 | 2020-04-10 | 直观外科手术操作公司 | System and method for switching control authority between multiple instrument arms |
CN111805546A (en) * | 2020-07-20 | 2020-10-23 | 中国人民解放军国防科技大学 | Human-multi-robot sharing control method and system based on brain-computer interface |
WO2021077725A1 (en) * | 2019-10-21 | 2021-04-29 | 南京航空航天大学 | System and method for predicting motion state of surrounding vehicle based on driving intention |
CN113370210A (en) * | 2021-06-23 | 2021-09-10 | 华北科技学院(中国煤矿安全技术培训中心) | Robot active collision avoidance system and method |
CN214758698U (en) * | 2020-09-23 | 2021-11-19 | 谭军 | Mobile phone 5G network remote sharing control real fish fishing robot |
-
2021
- 2021-12-21 CN CN202111575047.7A patent/CN114454157A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150201711A1 (en) * | 2014-01-23 | 2015-07-23 | Nike, Inc. | Automated assembly and stitching of shoe parts |
CN110996825A (en) * | 2017-07-13 | 2020-04-10 | 直观外科手术操作公司 | System and method for switching control authority between multiple instrument arms |
CN108388270A (en) * | 2018-03-21 | 2018-08-10 | 天津大学 | Cluster unmanned plane track posture cooperative control method towards security domain |
WO2021077725A1 (en) * | 2019-10-21 | 2021-04-29 | 南京航空航天大学 | System and method for predicting motion state of surrounding vehicle based on driving intention |
CN111805546A (en) * | 2020-07-20 | 2020-10-23 | 中国人民解放军国防科技大学 | Human-multi-robot sharing control method and system based on brain-computer interface |
CN214758698U (en) * | 2020-09-23 | 2021-11-19 | 谭军 | Mobile phone 5G network remote sharing control real fish fishing robot |
CN113370210A (en) * | 2021-06-23 | 2021-09-10 | 华北科技学院(中国煤矿安全技术培训中心) | Robot active collision avoidance system and method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP3705074A1 (en) | Systems and methods for controlling movement of a surgical tool along a predefined path | |
Su et al. | Fuzzy approximation-based task-space control of robot manipulators with remote center of motion constraint | |
KR102603224B1 (en) | Systems and methods for robotic surgery | |
CN108451643B (en) | Surgical manipulator capable of controlling a surgical instrument in multiple modes | |
Haidegger et al. | Cascade control for telerobotic systems serving space medicine | |
Qureshi et al. | A supervisory on-line tuned fuzzy logic based sliding mode control for robotics: An application to surgical robots | |
US9662138B2 (en) | Puncture control system and method therefor | |
CN112428278A (en) | Control method and device of mechanical arm and training method of man-machine cooperation model | |
JP2008238396A (en) | Apparatus and method for generating and controlling motion of robot | |
Weede et al. | Towards cognitive medical robotics in minimal invasive surgery | |
Wang et al. | A PSO-optimized fuzzy reinforcement learning method for making the minimally invasive surgical arm cleverer | |
Rosen et al. | Autonomous operation in surgical robotics | |
Bauzano et al. | A minimally invasive surgery robotic assistant for HALS–SILS techniques | |
CN114454157A (en) | Local track adjustment and man-machine sharing control method and system suitable for robot | |
CN116619369A (en) | Sharing control method based on teleoperation flexible mechanical arm and application thereof | |
Fracczak et al. | Share control of surgery robot master manipulator guiding tool along the standard path | |
Cursi et al. | Task accuracy enhancement for a surgical macro-micro manipulator with probabilistic neural networks and uncertainty minimization | |
CN107168105B (en) | Virtual surgery hybrid control system and verification method thereof | |
Razjigaev et al. | Optimal Vision-Based Orientation Steering Control for a 3-D Printed Dexterous Snake-Like Manipulator to Assist Teleoperation | |
Tervo et al. | Towards human skill adaptive manual control | |
CN116077089B (en) | Multimode safety interaction method and device for ultrasonic scanning robot | |
Dominici et al. | Compensation for 3D physiological motion in robotic-assisted surgery using a predictive force controller. Experimental results | |
Zheng et al. | Adaptive variable impedance position/force tracking control of fracture reduction robot | |
Jiang et al. | Human-Machine Interaction Methods for Minimally Invasive Surgical Robotic Arms | |
Estebanez et al. | Maneuvers recognition system for laparoscopic surgery |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |