JP7378640B2

JP7378640B2 - Robot control device and robot control method

Info

Publication number: JP7378640B2
Application number: JP2022563069A
Authority: JP
Inventors: 暁生斎藤
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2021-03-24
Filing date: 2021-03-24
Publication date: 2023-11-13
Anticipated expiration: 2041-03-24
Also published as: DE112021007371T5; JPWO2022201377A1; WO2022201377A1; CN116940906A

Description

The present disclosure relates to a robot control device and a robot control method for controlling a manipulator.

On factory production lines and the like, there is a pick-and-place process in which parts and products (hereinafter referred to as "workpieces") are gripped and transported by a manipulator. In pick-and-place, if the operating speed and acceleration of the robot are not appropriate, excessive inertial force and moment act on the workpiece and on the gripping unit of the manipulator, and the workpiece may fall. To address this problem, techniques have been proposed that determine appropriate operating conditions by taking into account the force and moment generated on the workpiece.

Patent Document 1 discloses a simulation device that runs a simulation using a simulation model of a robot including an elastic holding body and, when the load moment generated on the holding body is larger than a threshold, changes the execution conditions of the simulation so that the load moment becomes equal to or less than the threshold.

Patent Document 1: Japanese Patent Application Publication No. 2019-141939

Non-Patent Document 1: D. Verscheure et al., "Time-Optimal Path Tracking for Robots: A Convex Optimization Approach," IEEE Trans. Automatic Control, 2009
Non-Patent Document 2: B. Amos et al., "OptNet: Differentiable Optimization as a Layer in Neural Networks," Proceedings of the 34th International Conference on Machine Learning, 2017

In Patent Document 1, the simulation execution conditions must be changed repeatedly until the load moment becomes equal to or less than the threshold, so it takes time to find appropriate execution conditions that account for the force and moment generated on the workpiece. In addition, the only execution condition that can be adjusted automatically is the maximum acceleration, and it is difficult to adjust operating conditions with many degrees of freedom, such as the motion trajectory and speed profile of the robot. As a result, the operating time from when the manipulator starts moving until it grips the workpiece cannot be shortened.

The present disclosure has been made to solve the above problems. Its object is to provide a robot control device and a robot control method that can quickly obtain a command trajectory and a speed profile for a manipulator that satisfy constraint conditions, such as those on the force and moment generated on a workpiece, and that shorten the operating time.

The robot control device according to the present disclosure includes: a speed calculation unit that calculates a speed profile of a manipulator based on a preset command trajectory for the manipulator, constraint conditions on the manipulator, and an evaluation index based on the operating time of the manipulator; a gradient calculation unit that calculates, based on the speed profile, the gradient of the operating time with respect to the command trajectory and outputs it as gradient information; a command trajectory correction unit that corrects the command trajectory based on the gradient information to obtain a corrected command trajectory; and a control unit that controls the manipulator so that it follows the corrected command trajectory.

The robot control method according to the present disclosure includes: a step of calculating a speed profile of a manipulator based on a preset command trajectory for the manipulator, constraint conditions on the manipulator, and an evaluation index based on the operating time of the manipulator; a step of calculating, based on the speed profile, the gradient of the operating time with respect to the command trajectory to obtain gradient information; a step of correcting the command trajectory based on the gradient information to obtain a corrected command trajectory; and a step of controlling the manipulator so that it follows the corrected command trajectory.

According to the present disclosure, the robot control device and the robot control method calculate the speed profile based on the constraint conditions on the manipulator and the evaluation index based on the operating time of the manipulator, and correct the command trajectory based on the gradient of the operating time with respect to the command trajectory. A corrected command trajectory and a speed profile that satisfy the constraint conditions and shorten the operating time can therefore be obtained quickly.

FIG. 1 is a block diagram showing an example of the robot control device in Embodiment 1.
FIG. 2 is a diagram showing an example of a configuration including the robot control device in Embodiments 1 to 4.
FIG. 3 is a block diagram showing an example of the speed calculation unit in Embodiments 1 to 4.
FIG. 4 is a block diagram showing an example of the gradient calculation unit in Embodiments 1 to 4.
FIG. 5 is a block diagram showing an example of the control unit in Embodiments 1 to 4.
FIG. 6 is a flowchart showing an example of the operation of the robot control device in Embodiment 1.
FIG. 7 is a block diagram showing an example of the robot control device in Embodiment 2.
FIG. 8 is a block diagram showing an example of the command trajectory correction unit in Embodiment 2.
FIG. 9 is a flowchart showing an example of the operation of the robot control device in Embodiment 2.
FIG. 10 is a block diagram showing an example of the robot control device in Embodiment 3.
FIG. 11 is a flowchart showing an example of the operation of the robot control device in Embodiment 3.
FIG. 12 is a block diagram showing an example of the robot control device in Embodiment 4.
FIG. 13 is a flowchart showing an example of the operation of the robot control device in Embodiment 4.
FIG. 14 is a diagram showing the hardware configuration of the robot control device in Embodiments 1 to 4.

Embodiment 1.
FIG. 1 is a block diagram showing an example of the robot control device 100 in Embodiment 1. It shows the robot control device 100, the actuators 110, the manipulator 1, and the workpiece 112. FIG. 2 is a diagram showing an example of a configuration including the robot control device 100 in Embodiment 1. The robot control device 100 controls the actuators 110 installed at the joints of the manipulator 1, which is a vertically articulated robot, so that the gripping unit 111 installed at the tip of the manipulator 1 performs a pick-and-place operation on the workpiece 112 to be gripped. The peripheral environment 113 is, for example, a camera, which outputs video of the manipulator 1 performing the pick-and-place operation to a display device (not shown).

The robot control device 100 includes a constraint storage unit 2, a command trajectory storage unit 3, a speed calculation unit 4, a gradient calculation unit 5, a command trajectory correction unit 6, a command point sequence calculation unit 7, and a control unit 8.

The constraint storage unit 2 stores preset parameters of the constraint conditions on the manipulator 1. A constraint condition is a condition on at least one of the joint angular velocity, joint angular acceleration, joint torque, hand speed, and hand acceleration of the manipulator 1, the force generated on the object gripped by the manipulator 1 (the workpiece 112), and the moment generated on the gripped object. For example, when the constraint condition concerns the hand speed v_h, the constraint condition is "v_min ≤ v_h ≤ v_max", where v_min is the lower limit and v_max is the upper limit of the hand speed v_h. The parameters of the constraint condition in this case are v_min and v_max. The constraint conditions and their parameters are not limited to these. For example, the constraint condition may be "|v_h| ≤ v_th" with the parameter v_th, where |v_h| is the absolute value of the hand speed v_h and v_th is a threshold for the hand speed v_h.
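The two constraint forms above can be sketched as small data containers. This is a minimal illustration only: the class names, the scalar treatment of the hand speed, and the numeric values are assumptions and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RangeConstraint:
    """Hypothetical container for the parameters v_min and v_max."""
    lower: float
    upper: float

    def is_satisfied(self, value: float) -> bool:
        # Checks v_min <= v_h <= v_max.
        return self.lower <= value <= self.upper


@dataclass(frozen=True)
class AbsConstraint:
    """Hypothetical container for the single parameter v_th."""
    threshold: float

    def is_satisfied(self, value: float) -> bool:
        # Checks |v_h| <= v_th.
        return abs(value) <= self.threshold


# Illustrative values only (assumed units: m/s).
hand_speed_limit = RangeConstraint(lower=-0.5, upper=0.5)
print(hand_speed_limit.is_satisfied(0.3))
```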

Although the case where the parameters of the constraint conditions are preset has been described, the robot control device 100 may include a gripping constraint learning unit (not shown). The gripping constraint learning unit may use machine learning to learn the parameters of the constraint conditions for at least one of the force and the moment generated on the workpiece 112, and store the learned parameters in the constraint storage unit 2. Specifically, the gripping constraint learning unit acquires, with a sensor (not shown), the force and moment generated on the workpiece 112 when the manipulator 1 grips the workpiece 112, and learns the parameters of the constraint conditions from the acquired values.

The command trajectory storage unit 3 stores a preset command trajectory for the manipulator 1. For example, when the command trajectory is given as a spline curve, the command trajectory storage unit 3 stores pairs consisting of the position of each waypoint on the spline curve and the value of the curve parameter (a value from 0 to 1) at that position. Alternatively, the command trajectory storage unit 3 may store only the positions of the waypoints on the spline curve. In this case, the values of the curve parameter are calculated from, for example, the distances between waypoints on the spline curve. Besides a spline curve, the command trajectory may also be given as a B-spline curve, a Bezier curve, or the like. The command trajectory storage unit 3 also stores the corrected command trajectory from the command trajectory correction unit 6, described later. At that time, the preset command trajectory is discarded and the corrected command trajectory is stored in its place.
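As a sketch of how curve parameter values could be derived when only waypoint positions are stored, the following uses chord-length parameterization, that is, straight-line distances between consecutive waypoints. The text only says the values are calculated "from the distances between waypoints", so the choice of chord length and the function name are assumptions.

```python
import math


def chord_length_parameters(waypoints):
    """Return one parameter value in [0, 1] per waypoint,
    proportional to the cumulative straight-line distance."""
    if len(waypoints) < 2:
        raise ValueError("at least two waypoints are required")
    cumulative = [0.0]
    for p, q in zip(waypoints, waypoints[1:]):
        cumulative.append(cumulative[-1] + math.dist(p, q))
    total = cumulative[-1]
    if total == 0.0:
        raise ValueError("waypoints must not all coincide")
    return [c / total for c in cumulative]


# Pairs of (waypoint position, parameter value), as the storage unit might hold them.
waypoints = [(0.0, 0.0), (3.0, 4.0), (3.0, 9.0)]
stored = list(zip(waypoints, chord_length_parameters(waypoints)))
print(stored)
```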

The speed calculation unit 4 calculates the speed profile of the manipulator 1 based on the preset command trajectory for the manipulator 1, the constraint conditions on the manipulator 1, and the evaluation index based on the operating time of the manipulator 1. That is, based on the command trajectory from the command trajectory storage unit 3 and the parameters of the constraint conditions from the constraint storage unit 2, the speed calculation unit 4 calculates a speed profile along the command trajectory that shortens the operating time of the manipulator 1 within the range of the constraint conditions. Alternatively, when the robot control device 100 includes the gripping constraint learning unit, the speed calculation unit 4 calculates the speed profile based on the command trajectory and the parameters of the constraint conditions learned by the gripping constraint learning unit. Here, the evaluation index based on the operating time of the manipulator 1 is the evaluation function used by the profile calculation unit 44, described later. This evaluation function is expressed, via the curve parameter, as a function of the acceleration and velocity of the manipulator 1. The speed profile represents the change over time of the speed of each joint as the manipulator 1 moves along the command trajectory. The speed calculation unit 4 calculates the speed profile by solving an optimization problem that minimizes the operating time expressed by the evaluation function.

FIG. 3 is a block diagram showing an example of the speed calculation unit 4 in Embodiment 1. The speed calculation unit 4 includes a command interpolation calculation unit 41, a dynamics calculation unit 42, a constraint coefficient calculation unit 43, and a profile calculation unit 44.

The command interpolation calculation unit 41 uses a preset natural number N to interpolate the curve of the command trajectory from the command trajectory storage unit 3 at N points, and calculates the position at each interpolation point on the command trajectory together with the first and second derivatives of the command trajectory with respect to the curve parameter. For example, when the command trajectory is given as a spline curve, the joint position (or joint angle) of each axis of the manipulator 1 is expressed as a piecewise cubic polynomial in the curve parameter. Differentiating the polynomial therefore yields not only the position (or angle) at each interpolation point but also the first and second derivatives with respect to the curve parameter.
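The derivative calculation for one cubic segment can be sketched as follows. For brevity this treats a single polynomial segment over the parameter range [0, 1] rather than a full piecewise spline, and the function names and coefficient layout are assumptions.

```python
def eval_cubic(coeffs, s):
    """Evaluate q(s) = c0 + c1*s + c2*s^2 + c3*s^3 and its first and
    second derivatives with respect to the curve parameter s."""
    c0, c1, c2, c3 = coeffs
    q = c0 + s * (c1 + s * (c2 + s * c3))
    dq = c1 + s * (2.0 * c2 + 3.0 * c3 * s)
    ddq = 2.0 * c2 + 6.0 * c3 * s
    return q, dq, ddq


def interpolate(coeffs, n):
    """Sample one segment at n evenly spaced parameter values in [0, 1]."""
    if n < 2:
        raise ValueError("n must be at least 2")
    return [eval_cubic(coeffs, i / (n - 1)) for i in range(n)]


# Position, first derivative, and second derivative at N = 5 interpolation points.
for q, dq, ddq in interpolate((1.0, 2.0, 3.0, 4.0), 5):
    print(q, dq, ddq)
```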

The dynamics calculation unit 42 performs the kinematics calculation and the dynamics calculation for the manipulator 1 using the position, first derivative, and second derivative at each interpolation point from the command interpolation calculation unit 41, and outputs the kinematics calculation results and the dynamics calculation results. The kinematics calculation computes the velocity, acceleration, angular velocity, and angular acceleration of each joint, each link, the gripping unit 111, and so on of the manipulator 1 from the velocity, acceleration, angular velocity, and angular acceleration of each joint of the manipulator 1. The dynamics calculation computes the torque generated at each joint of the manipulator 1 and the force and moment generated at the gripping unit 111 from the velocity, acceleration, angular velocity, and angular acceleration of each joint, each link, the gripping unit 111, and so on of the manipulator 1.

The constraint coefficient calculation unit 43 calculates the coefficients of the constraint conditions based on the kinematics calculation results and dynamics calculation results from the dynamics calculation unit 42 and the parameters of the constraint conditions from the constraint storage unit 2. For example, when the constraint condition on the force generated at the gripping unit 111 is expressed by formula (1), the coefficients of the constraint condition are a, b, and c. That is, the coefficients of a constraint condition are the coefficients of the individual terms in a relational expression containing multiple variables. Here, the variables are, for example, the acceleration u and the velocity x at the interpolation point in formula (1).
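To illustrate what such coefficients can look like, the sketch below assumes a constraint of the linear form a*u + b*x + c ≤ 0, with u taken as the path acceleration d²s/dt² and x as the squared path velocity (ds/dt)², as in common time-optimal path-tracking formulations. Both the linear form and these variable definitions are assumptions here, not a restatement of formula (1). Under them, a joint acceleration can be written as q''(t) = q'(s)*u + q''(s)*x by the chain rule, so the upper bound q''(t) ≤ acc_max gives a = q'(s), b = q''(s), c = -acc_max.

```python
def joint_accel_coefficients(dq, ddq, acc_max):
    """Coefficients (a, b, c) of a*u + b*x + c <= 0 for the bound
    joint acceleration <= acc_max, where dq and ddq are the first and
    second derivatives of the joint position with respect to s.
    Assumes u = d^2 s/dt^2 and x = (ds/dt)^2 (hypothetical convention)."""
    return dq, ddq, -acc_max


def is_satisfied(coeffs, u, x, tol=1e-12):
    """Check the assumed linear constraint at one interpolation point."""
    a, b, c = coeffs
    return a * u + b * x + c <= tol


coeffs = joint_accel_coefficients(dq=2.0, ddq=-1.0, acc_max=5.0)
print(is_satisfied(coeffs, u=2.0, x=1.0))
```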

The profile calculation unit 44 calculates the speed profile based on the coefficients of the constraint conditions from the constraint coefficient calculation unit 43. That is, the profile calculation unit 44 calculates the speed profile of the manipulator 1 along the command trajectory and the operating time of the manipulator 1 by an optimization calculation based on the evaluation function, which is based on the operating time of the manipulator 1, and the coefficients of the constraint conditions.

The method by which the constraint coefficient calculation unit 43 calculates the coefficients of the constraint conditions and the method by which the profile calculation unit 44 calculates the speed profile are described in Non-Patent Document 1.

As described above, the speed calculation unit 4 can calculate a speed profile that shortens the operating time within the range of the constraint conditions set in the constraint storage unit 2. In addition, by including the force and moment generated on the workpiece 112 in the constraint conditions, problems during pick-and-place of the workpiece 112, such as the workpiece 112 falling or excessive force acting on the workpiece 112, can be suppressed. Furthermore, because the method described in Non-Patent Document 1 is a convex optimization method, for which a globally optimal solution is easy to obtain, the speed profile can be calculated quickly.
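As a rough illustration of a speed profile that shortens the operating time under constraints, the sketch below uses a simple forward-backward pass. It is not the convex optimization of Non-Patent Document 1. It assumes a uniform grid of the curve parameter, a per-point upper bound on x = (ds/dt)², and one symmetric bound on the path acceleration; all names are hypothetical.

```python
import math


def speed_profile(x_max, ds, u_max):
    """Forward-backward pass over x_i = (ds/dt)^2 on a uniform grid.

    x_max: per-point upper bound on x (for example from velocity limits)
    ds:    grid spacing of the curve parameter
    u_max: symmetric bound on the path acceleration |d^2 s/dt^2|
    The motion starts and ends at rest. Returns (x, total_time).
    """
    n = len(x_max)
    if n < 3 or ds <= 0.0 or u_max <= 0.0 or min(x_max[1:-1]) <= 0.0:
        raise ValueError("need n >= 3 and positive ds, u_max, interior bounds")
    x = list(x_max)
    x[0] = 0.0
    x[-1] = 0.0
    # Forward pass: limit acceleration, using x[i+1] - x[i] = 2 * u * ds.
    for i in range(n - 1):
        x[i + 1] = min(x[i + 1], x[i] + 2.0 * u_max * ds)
    # Backward pass: limit deceleration in the same way.
    for i in range(n - 2, -1, -1):
        x[i] = min(x[i], x[i + 1] + 2.0 * u_max * ds)
    # Operating time, with x assumed linear in s between grid points.
    total = sum(
        2.0 * ds / (math.sqrt(x[i]) + math.sqrt(x[i + 1])) for i in range(n - 1)
    )
    return x, total


x, total_time = speed_profile([4.0] * 5, ds=0.25, u_max=2.0)
print(x, total_time)
```

In this example the velocity bound is never active, so the profile accelerates over the first half of the path and decelerates over the second half, and the total time equals 2*sqrt(1/2) = sqrt(2) for a parameter length of 1.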

Returning to FIG. 1, the gradient calculation unit 5 calculates the gradient of the operating time of the manipulator 1 with respect to the command trajectory based on the speed profile from the speed calculation unit 4, and outputs it as gradient information. Specifically, the gradient calculation unit 5 uses automatic differentiation to differentiate the operating time of the manipulator 1, obtained from the profile calculation unit 44, with respect to the command trajectory, thereby calculating the gradient of the operating time with respect to the command trajectory, and outputs it as gradient information.

FIG. 4 is a block diagram showing an example of the gradient calculation unit 5 in Embodiment 1. The gradient calculation unit 5 includes a command interpolation gradient calculation unit 51, a dynamics gradient calculation unit 52, a constraint coefficient gradient calculation unit 53, and a profile gradient calculation unit 54. The command interpolation gradient calculation unit 51, the dynamics gradient calculation unit 52, the constraint coefficient gradient calculation unit 53, and the profile gradient calculation unit 54 calculate the gradients of the calculations performed by the command interpolation calculation unit 41, the dynamics calculation unit 42, the constraint coefficient calculation unit 43, and the profile calculation unit 44, respectively.

The profile gradient calculation unit 54 calculates the gradient of the operating time with respect to the coefficients of the constraint conditions by calculating the gradient with respect to the coefficients of the optimization problem, for example by a method similar to that described in Non-Patent Document 2.

The constraint coefficient gradient calculation unit 53 takes as input the gradient with respect to the coefficients of the constraint conditions from the profile gradient calculation unit 54 and, by differentiating the calculation procedure of the constraint coefficient calculation unit 43 according to the chain rule, calculates the gradient of the operating time with respect to the kinematics calculation results and the gradient of the operating time with respect to the dynamics calculation results.

The dynamics gradient calculation unit 52 takes as input the gradient of the operating time with respect to the kinematics calculation results and the gradient of the operating time with respect to the dynamics calculation results from the constraint coefficient gradient calculation unit 53 and, by differentiating the calculation procedure of the dynamics calculation unit 42 according to the chain rule, calculates the gradients of the operating time with respect to the position at each interpolation point on the command trajectory, with respect to the first derivative of the command trajectory in the curve parameter, and with respect to the second derivative of the command trajectory in the curve parameter.

The command interpolation gradient calculation unit 51 takes as input, from the dynamics gradient calculation unit 52, the gradients with respect to the position at each interpolation point on the command trajectory, the first derivative of the command trajectory in the curve parameter, and the second derivative of the command trajectory in the curve parameter and, by differentiating the calculation procedure of the command interpolation calculation unit 41 according to the chain rule, calculates the gradient of the operating time with respect to the command trajectory. The command interpolation gradient calculation unit 51 outputs this gradient as the gradient information.
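The backward chain described above, in which each gradient unit differentiates its forward counterpart and passes the result upstream, can be illustrated with a two-stage toy pipeline. The stages are stand-ins invented for illustration (a path length followed by a constant-speed travel time); they are not the calculations of units 41 to 44.

```python
import math


def path_length(y):
    """Forward stage 1 (toy): length of the polyline (0,0) -> (1,y) -> (2,0)."""
    return 2.0 * math.sqrt(1.0 + y * y)


def travel_time(length, speed):
    """Forward stage 2 (toy): operating time at constant speed."""
    return length / speed


def travel_time_grad_length(speed):
    """Backward of stage 2: derivative of the time with respect to the length."""
    return 1.0 / speed


def path_length_grad_y(y):
    """Backward of stage 1: derivative of the length with respect to y."""
    return 2.0 * y / math.sqrt(1.0 + y * y)


def time_gradient(y, speed):
    """Chain rule applied in the reverse order of the forward pass."""
    upstream = travel_time_grad_length(speed)  # gradient from the last stage
    return upstream * path_length_grad_y(y)    # propagated back to the waypoint


y, speed = 0.75, 2.0
print(travel_time(path_length(y), speed), time_gradient(y, speed))
```

A central finite difference of the forward pipeline is a convenient check for a hand-written backward pass of this kind.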

Returning to FIG. 1, the command trajectory correction unit 6 corrects the command trajectory based on the gradient information from the gradient calculation unit 5 so that the operating time of the manipulator 1 decreases, and outputs the result as the corrected command trajectory. The command trajectory correction unit 6 corrects the command trajectory using any one of gradient descent, the conjugate gradient method, and a quasi-Newton method based on the gradient information. Examples of gradient descent include steepest descent, the momentum method, and the accelerated gradient method. The corrected command trajectory is stored in the command trajectory storage unit 3.
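Two of the update rules named above can be sketched as follows, treating the command trajectory as a flat list of waypoint coordinates. The step size, the momentum coefficient, and the function names are assumptions chosen for illustration.

```python
def steepest_descent_step(waypoints, gradient, step_size):
    """Move each coordinate against the gradient of the operating time."""
    return [w - step_size * g for w, g in zip(waypoints, gradient)]


def momentum_step(waypoints, gradient, velocity, step_size, momentum=0.9):
    """Momentum method: accumulate a running update direction."""
    new_velocity = [momentum * v - step_size * g for v, g in zip(velocity, gradient)]
    new_waypoints = [w + v for w, v in zip(waypoints, new_velocity)]
    return new_waypoints, new_velocity


# One steepest-descent correction with an illustrative gradient.
corrected = steepest_descent_step([1.0, 2.0], [0.5, -1.0], step_size=0.1)
print(corrected)
```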

Because the command trajectory correction unit 6 corrects the command trajectory based on the gradient information so that the operating time of the manipulator 1 decreases, the operating time of the manipulator 1 can be shortened. In addition, because the command trajectory correction unit 6 uses the gradient information, the command trajectory can be corrected efficiently and quickly.

The command point sequence calculation unit 7 calculates a command point sequence for each predetermined sampling period based on the corrected command trajectory stored in the command trajectory storage unit 3 and the speed profile from the speed calculation unit 4. The sampling period is the period at which the control unit 8, described later, calculates the current value for the actuators 110. For example, when the corrected command trajectory is given as a spline curve, the command point sequence calculation unit 7 converts between time and the curve parameter of the corrected command trajectory based on the speed profile, and combines this with the piecewise polynomial of the spline curve of the corrected command trajectory, which takes the curve parameter as input, to calculate the sequence of position commands for the manipulator 1 at each sampling instant.
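The conversion between time and the curve parameter can be sketched as below. The sketch assumes a uniform parameter grid starting at s = 0, a speed profile given as x_i = (ds/dt)² at the grid points, and linear interpolation of s between grid points; the `curve` callable stands in for the spline evaluation, and all names are hypothetical.

```python
import math
from bisect import bisect_right


def grid_times(x, ds):
    """Cumulative time at each grid point, with x[i] = (ds/dt)^2 there."""
    times = [0.0]
    for x0, x1 in zip(x, x[1:]):
        times.append(times[-1] + 2.0 * ds / (math.sqrt(x0) + math.sqrt(x1)))
    return times


def command_points(curve, x, ds, sample_period):
    """Evaluate curve(s) at every sample instant k * sample_period
    that does not exceed the total operating time."""
    times = grid_times(x, ds)
    points = []
    k = 0
    while k * sample_period <= times[-1]:
        t = k * sample_period
        # Index of the grid interval containing t.
        i = min(bisect_right(times, t) - 1, len(times) - 2)
        frac = (t - times[i]) / (times[i + 1] - times[i])
        s = (i + frac) * ds  # linear interpolation of the curve parameter
        points.append(curve(s))
        k += 1
    return points


# Constant path speed of 1, so the curve parameter equals the elapsed time.
print(command_points(lambda s: 2.0 * s, x=[1.0, 1.0, 1.0], ds=0.5, sample_period=0.25))
```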

The control unit 8 controls the manipulator 1 so that it follows the corrected command trajectory. That is, the control unit 8 controls the manipulator 1 so that it follows the command point sequence from the command point sequence calculation unit 7.

FIG. 5 is a block diagram showing an example of the control unit 8 in Embodiment 1. The control unit 8 includes a feedforward control unit 81, a feedback control unit 82, and a current value calculation unit 83.

The feedforward control unit 81 applies filter processing, such as smoothing, to the command point sequence from the command point sequence calculation unit 7 and outputs the result as the smoothed command point sequence. The feedforward control unit 81 also applies a modeled inverse transfer function to the command point sequence from the command point sequence calculation unit 7 to calculate and output the feedforward value of the current to be input to the actuators 110. The inverse transfer function is the inverse of the transfer function of the actuators 110, which are the controlled objects. When a disturbance occurring at the actuators 110 can be detected by a sensor, the feedforward control unit 81 may apply the modeled inverse transfer function to the disturbance signal. In this case, the feedforward control unit 81 includes the result of applying the inverse transfer function to the disturbance signal in the feedforward value of the current that it outputs. An example of a disturbance is vibration caused by a worker touching the manipulator 1.

The feedback control unit 82 performs feedback control so that the actuators 110 follow the smoothed command point sequence from the feedforward control unit 81, and calculates and outputs the feedback value of the current to be input to the actuators 110.

The current value calculation unit 83 calculates the current value to be input to the actuators 110 based on the feedforward value of the current from the feedforward control unit 81 and the feedback value of the current from the feedback control unit 82.
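One common way to combine the two values is to add them and limit the result. The text does not state how the current value calculation unit 83 combines them, so the summation, the clamping, and the PD feedback law below are all assumptions made for illustration.

```python
def feedback_current(target, measured, target_rate, measured_rate, kp, kd):
    """PD feedback on the tracking error (a hypothetical choice of feedback law)."""
    return kp * (target - measured) + kd * (target_rate - measured_rate)


def actuator_current(feedforward, feedback, limit):
    """Sum the feedforward and feedback values, then clamp to +/- limit.
    Both the sum and the clamp are assumptions, not taken from the patent."""
    total = feedforward + feedback
    return max(-limit, min(limit, total))


fb = feedback_current(target=1.0, measured=0.5, target_rate=0.0,
                      measured_rate=0.0, kp=4.0, kd=0.1)
print(actuator_current(feedforward=1.0, feedback=fb, limit=2.5))
```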

FIG. 6 is a flowchart showing an example of the operation of the robot control device 100 in Embodiment 1. That is, FIG. 6 is a flowchart showing an example of the robot control method in Embodiment 1.

図６に示すように、図示しない手段によりロボット制御が開始されると、速度計算部４は、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。 As shown in FIG. 6, when robot control is started by means not shown, the speed calculation unit 4 calculates the speed profile of the manipulator 1 based on a preset command trajectory for the manipulator 1, the constraint conditions regarding the manipulator 1, and an evaluation index based on the operation time of the manipulator 1 (step ST1).

勾配計算部５は、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算して勾配情報として出力する（ステップＳＴ２）。 The gradient calculation unit 5 calculates the gradient of the operation time of the manipulator 1 with respect to the command trajectory and outputs it as gradient information (step ST2).

指令軌跡補正部６は、勾配情報に基づいて指令軌跡を補正して補正指令軌跡として出力する（ステップＳＴ３）。 The command trajectory correction unit 6 corrects the command trajectory based on the gradient information and outputs it as a corrected command trajectory (step ST3).

指令軌跡記憶部３は、補正指令軌跡を記憶する（ステップＳＴ４）。 The command trajectory storage section 3 stores the corrected command trajectory (step ST4).

指令点列計算部７は、補正指令軌跡と速度プロファイルとに基づいて、指令点列を計算する（ステップＳＴ５）。 The command point sequence calculation unit 7 calculates a command point sequence based on the corrected command trajectory and the speed profile (step ST5).

制御部８は、補正指令軌跡に対しマニピュレータ１が追従するようアクチュエータ１１０を制御する（ステップＳＴ６）。 The control unit 8 controls the actuator 110 so that the manipulator 1 follows the corrected command trajectory (step ST6).

図示しない手段により、ロボットの制御を継続するか否かが判定される（ステップＳＴ７）。 By means not shown, it is determined whether or not to continue controlling the robot (step ST7).

ステップＳＴ７の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、ロボットの制御が継続される。ステップＳＴ７の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。ロボットの制御が終了するのは、例えばマニピュレータ１が補正指令軌跡の終点にたどり着いた場合である。あるいは、マニピュレータ１が異常動作したと判定された場合である。この判定は、図示しない手段により行われる。 If the determination in step ST7 is "Yes", the process returns to step ST6 and control of the robot continues. If the determination in step ST7 is "No", control of the robot ends. Control of the robot ends, for example, when the manipulator 1 reaches the end point of the corrected command trajectory, or when it is determined that the manipulator 1 has operated abnormally. This determination is made by means not shown.
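The order of steps ST1 to ST7 can be summarized as a short Python skeleton. This is only a sketch of the control flow: the function `run_robot_control` and its callable parameters are hypothetical stand-ins for the units described above, and the computation inside each unit is deliberately left to the caller.

```python
def run_robot_control(trajectory, calc_speed, calc_gradient, correct,
                      calc_points, control_step, should_continue):
    """Run steps ST1 to ST7 once, looping between ST6 and ST7."""
    profile = calc_speed(trajectory)               # ST1: speed profile
    gradient = calc_gradient(trajectory, profile)  # ST2: gradient information
    corrected = correct(trajectory, gradient)      # ST3: corrected command trajectory
    stored = list(corrected)                       # ST4: store the corrected trajectory
    points = calc_points(stored, profile)          # ST5: command point sequence
    cycles = 0
    while True:
        control_step(points, cycles)               # ST6: drive the actuator
        cycles += 1
        if not should_continue(points, cycles):    # ST7: "No" ends control
            break
    return stored, cycles
```

Passing the units in as callables keeps the skeleton independent of how each calculation is implemented, which mirrors the block-diagram structure of the device.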

以上で説明した実施の形態１によれば、マニピュレータ１に関する制約条件とマニピュレータ１の動作時間とに基づく評価指標に基づいて速度プロファイルを計算し、動作時間の指令軌跡に関する勾配に基づいて指令軌跡を補正するため、制約条件を満たし、動作時間が短くなるようなマニピュレータ１の補正指令軌跡と速度プロファイルとを高速に求めることができる。 According to the first embodiment described above, the speed profile is calculated based on an evaluation index that is based on the constraint conditions regarding the manipulator 1 and the operation time of the manipulator 1, and the command trajectory is corrected based on the gradient of the operation time with respect to the command trajectory. Therefore, a corrected command trajectory and a speed profile for the manipulator 1 that satisfy the constraint conditions and shorten the operation time can be obtained at high speed.
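The idea of correcting a trajectory along the gradient of the operation time can be shown on a toy problem. The sketch below assumes, purely for illustration, a planar trajectory given as waypoints and an operation time equal to path length divided by a constant speed limit `v_max`; the time-optimal speed profile under the full constraint set described in this disclosure is not modeled. The function names are hypothetical.

```python
import math


def operation_time(waypoints, v_max):
    """Toy operation time: total path length traversed at constant speed v_max."""
    return sum(math.dist(a, b) for a, b in zip(waypoints, waypoints[1:])) / v_max


def time_gradient(waypoints, v_max):
    """Gradient of the toy operation time with respect to each interior waypoint."""
    grads = [(0.0, 0.0)] * len(waypoints)  # start and end points stay fixed
    for i in range(1, len(waypoints) - 1):
        gx = gy = 0.0
        for nb in (waypoints[i - 1], waypoints[i + 1]):
            d = math.dist(waypoints[i], nb)
            if d > 0.0:
                gx += (waypoints[i][0] - nb[0]) / d
                gy += (waypoints[i][1] - nb[1]) / d
        grads[i] = (gx / v_max, gy / v_max)
    return grads


def correct_trajectory(waypoints, grads, step):
    """One gradient-descent correction: move each waypoint against its gradient."""
    return [(x - step * gx, y - step * gy)
            for (x, y), (gx, gy) in zip(waypoints, grads)]
```

For the waypoints `[(0, 0), (1, 1), (2, 0)]`, one correction step pulls the middle waypoint toward the straight line between the end points, which shortens the toy operation time.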

実施の形態２． Embodiment 2.
実施の形態２では、周辺環境１１３で予め取得されるマニピュレータ１周辺の周辺情報に基づいて、指令軌跡を生成する。 In the second embodiment, a command trajectory is generated based on surrounding information around the manipulator 1 acquired in advance by the surrounding environment 113.

図７は、実施の形態２におけるロボット制御装置１００ａの一例を示すブロック図である。図７は、ロボット制御装置１００ａがロボット制御装置１００の構成要素に加え、周辺環境情報記憶部９と、指令軌跡生成部１０と、速度プロファイル記憶部１１とを備える点で、図１とは異なる。また、図７は、指令軌跡補正部６の代わりに指令軌跡補正部６ａを備える点で、図１とは異なる。周辺環境情報記憶部９、指令軌跡生成部１０、速度プロファイル記憶部１１および指令軌跡補正部６ａ以外は、図１に示すものと同じであるため、説明を省略する。 FIG. 7 is a block diagram showing an example of the robot control device 100a in the second embodiment. FIG. 7 differs from FIG. 1 in that the robot control device 100a includes a surrounding environment information storage unit 9, a command trajectory generation unit 10, and a speed profile storage unit 11 in addition to the components of the robot control device 100. FIG. 7 also differs from FIG. 1 in that a command trajectory correction unit 6a is provided instead of the command trajectory correction unit 6. The components other than the surrounding environment information storage unit 9, the command trajectory generation unit 10, the speed profile storage unit 11, and the command trajectory correction unit 6a are the same as those shown in FIG. 1, so their description is omitted.

周辺環境情報記憶部９は、マニピュレータ１周辺の周辺情報を記憶する。具体的には、周辺環境情報記憶部９は、周辺環境１１３によって取得されたマニピュレータ１周辺の障害物の位置および形状などの情報を周辺情報として記憶する。周辺情報を記憶するためのデータ構造としては、例えば点群、ボクセル、ポリゴンメッシュ、および直方体などの基本形状でもよいし、複数の基本形状を組み合わせたものでもよい。また、データ構造として、後に説明する距離関数計算部６１が行う距離計算などの高速化が図れるバウンディングボリューム階層であってもよい。 The surrounding environment information storage section 9 stores surrounding information around the manipulator 1. Specifically, the surrounding environment information storage unit 9 stores information such as the position and shape of obstacles around the manipulator 1 acquired from the surrounding environment 113 as surrounding information. The data structure for storing peripheral information may be a basic shape such as a point cloud, a voxel, a polygon mesh, a rectangular parallelepiped, or a combination of a plurality of basic shapes. Furthermore, the data structure may be a bounding volume hierarchy that can speed up distance calculations performed by the distance function calculation unit 61, which will be described later.

指令軌跡生成部１０は、周辺環境情報記憶部９で記憶された周辺情報に基づいて、指令軌跡を生成する。具体的には、指令軌跡生成部１０は、周辺情報に基づいて、マニピュレータ１が周辺の障害物と干渉しないような指令軌跡を生成し、指令軌跡記憶部３で記憶されている指令軌跡の更新を行う。指令軌跡を生成するアルゴリズムとして、例えばＲＲＴ（Ｒａｐｉｄｌｙ－ｅｘｐｌｏｒｉｎｇＲａｎｄｏｍＴｒｅｅ）およびＰＲＭ（ＰｒｏｂａｂｉｌｉｓｔｉｃＲｏａｄＭａｐ）などを用いてもよい。 The command trajectory generation unit 10 generates a command trajectory based on the surrounding information stored in the surrounding environment information storage unit 9. Specifically, based on the surrounding information, the command trajectory generation unit 10 generates a command trajectory along which the manipulator 1 does not interfere with surrounding obstacles, and updates the command trajectory stored in the command trajectory storage unit 3. As an algorithm for generating the command trajectory, for example, RRT (Rapidly-exploring Random Tree) or PRM (Probabilistic RoadMap) may be used.

指令軌跡生成部１０で生成される指令軌跡は、予め設定された初期軌道ではなく、マニピュレータ１周辺の障害物との干渉を避けるような指令軌跡であり、自律的に生成されるものである。このため、指令軌跡生成部１０は、例えば指令軌跡の始点と終点とを入力するだけで、指令軌跡を自動的に生成することができる。 The command trajectory generated by the command trajectory generation unit 10 is not a preset initial trajectory, but is a command trajectory that avoids interference with obstacles around the manipulator 1, and is generated autonomously. Therefore, the command trajectory generation unit 10 can automatically generate a command trajectory by simply inputting, for example, the start point and end point of the command trajectory.
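As one illustration of generating a collision-free trajectory from only a start point and an end point, the following is a minimal RRT sketch. It is a textbook-style simplification, not the generation procedure of this disclosure: it plans for a point in a 2-D square workspace with circular obstacles rather than in the joint space of a manipulator, and the function names and parameter values (`step`, `goal_bias`, `goal_tol`, `max_iter`, `seed`) are assumptions.

```python
import math
import random


def collides(p, obstacles):
    """True if point p lies inside any circular obstacle (cx, cy, radius)."""
    return any(math.dist(p, (cx, cy)) <= r for cx, cy, r in obstacles)


def segment_free(a, b, obstacles, checks=5):
    """Check evenly spaced points on segment a-b (including b) for collisions."""
    for t in range(1, checks + 1):
        q = (a[0] + (b[0] - a[0]) * t / checks, a[1] + (b[1] - a[1]) * t / checks)
        if collides(q, obstacles):
            return False
    return True


def rrt(start, goal, obstacles, bounds=(0.0, 10.0), step=0.5,
        goal_tol=0.5, goal_bias=0.2, max_iter=2000, seed=0):
    """Grow a tree from start; return a path ending within goal_tol of goal, or None."""
    rng = random.Random(seed)
    nodes, parent = [start], {0: None}
    for _ in range(max_iter):
        # With probability goal_bias, steer toward the goal instead of a random sample.
        if rng.random() < goal_bias:
            sample = goal
        else:
            sample = (rng.uniform(*bounds), rng.uniform(*bounds))
        i_near = min(range(len(nodes)), key=lambda i: math.dist(nodes[i], sample))
        near = nodes[i_near]
        d = math.dist(near, sample)
        if d == 0.0:
            continue
        scale = min(step, d) / d  # extend by at most one step toward the sample
        new = (near[0] + (sample[0] - near[0]) * scale,
               near[1] + (sample[1] - near[1]) * scale)
        if not segment_free(near, new, obstacles):
            continue
        nodes.append(new)
        parent[len(nodes) - 1] = i_near
        if math.dist(new, goal) <= goal_tol:
            path, i = [], len(nodes) - 1
            while i is not None:  # walk back to the root
                path.append(nodes[i])
                i = parent[i]
            return path[::-1]
    return None
```

A path produced this way is typically jagged and longer than necessary, which is consistent with using it only as an initial command trajectory that is corrected afterwards.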

指令軌跡補正部６ａは、指令軌跡記憶部３で記憶された指令軌跡と、周辺環境情報記憶部９で記憶された周辺情報と、勾配計算部５からの勾配情報とに基づいて、指令軌跡を補正し、補正指令軌跡として出力する。補正指令軌跡は、指令軌跡記憶部３で記憶される。 The command trajectory correction unit 6a corrects the command trajectory based on the command trajectory stored in the command trajectory storage unit 3, the surrounding information stored in the surrounding environment information storage unit 9, and the gradient information from the gradient calculation unit 5, and outputs it as a corrected command trajectory. The corrected command trajectory is stored in the command trajectory storage unit 3.

図８は、実施の形態２における指令軌跡補正部６ａの一例を示すブロック図である。指令軌跡補正部６ａは、距離関数計算部６１と、バリア関数計算部６２と、バリア関数勾配計算部６３と、指令軌跡補正値計算部６４とを備える。 FIG. 8 is a block diagram showing an example of the command trajectory correction section 6a in the second embodiment. The command trajectory correction section 6a includes a distance function calculation section 61, a barrier function calculation section 62, a barrier function gradient calculation section 63, and a command trajectory correction value calculation section 64.

距離関数計算部６１は、指令軌跡記憶部３で記憶された指令軌跡と、周辺環境情報記憶部９で記憶された周辺情報とに基づいて、指令軌跡と周辺の障害物との距離関数の値を計算する。 The distance function calculation unit 61 calculates the value of the distance function between the command trajectory and surrounding obstacles based on the command trajectory stored in the command trajectory storage unit 3 and the surrounding information stored in the surrounding environment information storage unit 9.
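A distance function of this kind can be sketched for the simplest obstacle representation mentioned earlier, the rectangular parallelepiped. The sketch assumes axis-aligned boxes given as `(box_min, box_max)` corner pairs and evaluates the distance only at the trajectory waypoints; both choices, and the function names, are assumptions for illustration.

```python
import math


def point_box_distance(p, box_min, box_max):
    """Euclidean distance from point p to an axis-aligned box (0 inside the box)."""
    # Per axis, take how far p lies outside the [lo, hi] interval.
    return math.sqrt(sum(max(lo - x, 0.0, x - hi) ** 2
                         for x, lo, hi in zip(p, box_min, box_max)))


def trajectory_distances(waypoints, boxes):
    """Distance from each waypoint to its nearest box obstacle."""
    return [min(point_box_distance(p, lo, hi) for lo, hi in boxes)
            for p in waypoints]
```

With many obstacles, the inner `min` over all boxes is the part a bounding volume hierarchy would accelerate by skipping boxes that cannot be the nearest one.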

バリア関数計算部６２は、距離関数の値がある値以下になると関数の値が発散するようなバリア関数を構成し、距離関数計算部６１からの距離関数の値に基づいて、バリア関数の値を計算する。 The barrier function calculation unit 62 constructs a barrier function whose value diverges when the value of the distance function falls to or below a certain value, and calculates the value of the barrier function based on the value of the distance function from the distance function calculation unit 61.
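One function with the stated property is the inverse barrier sketched below. The disclosure does not specify the form of the barrier function, so the formula `mu / (d - d_min)` and the parameter values `d_min` and `mu` are assumptions chosen for illustration.

```python
import math


def barrier(d, d_min=0.05, mu=0.01):
    """Inverse barrier: grows without bound as d approaches d_min from above."""
    if d <= d_min:
        return math.inf  # at or inside the safety margin
    return mu / (d - d_min)
```

Far from obstacles the value is close to zero, so the barrier barely influences the trajectory there; near the margin `d_min` it dominates any finite gain in operation time.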

バリア関数勾配計算部６３は、バリア関数計算部６２からのバリア関数の値の指令軌跡に関する勾配を計算する。バリア関数勾配計算部６３は、例えば自動微分を用いて勾配を計算する。 The barrier function gradient calculation section 63 calculates the gradient of the barrier function value from the barrier function calculation section 62 with respect to the command trajectory. The barrier function gradient calculation unit 63 calculates the gradient using automatic differentiation, for example.

指令軌跡補正値計算部６４は、勾配計算部５からのマニピュレータ１の動作時間の指令軌跡に関する勾配と、バリア関数勾配計算部６３からのバリア関数の値の指令軌跡に関する勾配とを合わせて勾配情報とし、この勾配情報に基づいて、動作時間とバリア関数の値との和が小さくなるように、指令軌跡の補正値を計算する。 The command trajectory correction value calculation unit 64 combines the gradient of the operation time of the manipulator 1 with respect to the command trajectory from the gradient calculation unit 5 and the gradient of the barrier function value with respect to the command trajectory from the barrier function gradient calculation unit 63 into gradient information. Based on this gradient information, it calculates a correction value for the command trajectory so that the sum of the operation time and the barrier function value becomes smaller.
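The combined correction can be illustrated with a small planar example that minimizes the sum of a toy operation time and a barrier term. Several simplifications are assumed here and are not part of the disclosure: the operation time is path length divided by a constant `v_max`, the obstacle is a single circle `(cx, cy, r)`, the barrier is the inverse form `mu / (d - d_min)` evaluated only at waypoints, and the gradient is obtained by central finite differences rather than by automatic differentiation, simply to keep the sketch free of external libraries.

```python
import math


def total_cost(waypoints, obstacle, v_max=1.0, d_min=0.05, mu=0.01):
    """Toy operation time plus the barrier value summed over all waypoints."""
    cx, cy, r = obstacle
    time = sum(math.dist(a, b) for a, b in zip(waypoints, waypoints[1:])) / v_max
    barrier = 0.0
    for x, y in waypoints:
        d = math.hypot(x - cx, y - cy) - r  # clearance to the obstacle surface
        if d <= d_min:
            return math.inf
        barrier += mu / (d - d_min)
    return time + barrier


def cost_gradient(waypoints, obstacle, eps=1e-6):
    """Central-difference gradient of total_cost for interior waypoints only."""
    grads = [(0.0, 0.0)] * len(waypoints)
    for i in range(1, len(waypoints) - 1):
        g = []
        for axis in range(2):
            plus = [list(p) for p in waypoints]
            minus = [list(p) for p in waypoints]
            plus[i][axis] += eps
            minus[i][axis] -= eps
            g.append((total_cost(plus, obstacle) - total_cost(minus, obstacle)) / (2 * eps))
        grads[i] = tuple(g)
    return grads


def correct_once(waypoints, obstacle, step=0.01):
    """One descent step on time + barrier; the end points are left unchanged."""
    grads = cost_gradient(waypoints, obstacle)
    last = len(waypoints) - 1
    return [p if i in (0, last) else (p[0] - step * g[0], p[1] - step * g[1])
            for i, (p, g) in enumerate(zip(waypoints, grads))]
```

In the case of a waypoint that skims the obstacle, the barrier gradient outweighs the time gradient, so the step moves the waypoint away from the obstacle even though the path becomes slightly longer.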

図７に戻り、速度プロファイル記憶部１１は、速度計算部４からの速度プロファイルを記憶する。ロボット制御装置１００ａが速度プロファイル記憶部１１を備えることにより、速度計算部４が速度プロファイルを計算するタイミングを任意に設定することができる。すなわち、制御部８がアクチュエータ１１０を制御する直前だけでなく、例えばマニピュレータ１が動作していない間、あるいはマニピュレータ１が他の動作を行っている間などに速度プロファイルを計算することができる。 Returning to FIG. 7, the speed profile storage section 11 stores the speed profile from the speed calculation section 4. Since the robot control device 100a includes the speed profile storage section 11, the timing at which the speed calculation section 4 calculates the speed profile can be arbitrarily set. That is, the velocity profile can be calculated not only immediately before the control unit 8 controls the actuator 110, but also while the manipulator 1 is not operating or while the manipulator 1 is performing another operation.

指令点列計算部７は、指令軌跡記憶部３で記憶された補正指令軌跡と、速度プロファイル記憶部１１で記憶された速度プロファイルとに基づいて、所定のサンプリング周期毎の指令点列を計算する。 The command point sequence calculation unit 7 calculates a command point sequence for each predetermined sampling period based on the corrected command trajectory stored in the command trajectory storage unit 3 and the speed profile stored in the speed profile storage unit 11.
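Sampling a trajectory at a fixed period according to a speed profile can be sketched as follows. The sketch assumes a polyline trajectory and a speed profile given as a function of time, `speed_at(t)`; the disclosure does not state how the profile is parameterized, so this representation, the forward-Euler integration of the travelled distance, and the names used are assumptions.

```python
import math


def command_points(waypoints, speed_at, dt, max_steps=100000):
    """Sample a polyline every dt seconds while advancing at speed_at(t)."""
    segments = list(zip(waypoints, waypoints[1:]))
    lengths = [math.dist(a, b) for a, b in segments]
    total = sum(lengths)

    def position(s):
        """Point at arc length s along the polyline."""
        for (a, b), length in zip(segments, lengths):
            if length > 0.0 and s <= length:
                t = s / length
                return tuple(pa + (pb - pa) * t for pa, pb in zip(a, b))
            s -= length
        return tuple(waypoints[-1])

    points, s, k = [], 0.0, 0
    while s < total and k < max_steps:
        points.append(position(s))
        s += max(speed_at(k * dt), 0.0) * dt  # distance covered in one period
        k += 1
    points.append(tuple(waypoints[-1]))  # always finish exactly at the end point
    return points
```

The `max_steps` guard exists only so that a profile that stays at zero speed cannot loop forever in this sketch.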

図９は、実施の形態２におけるロボット制御装置１００ａの動作の一例を示すフローチャートである。すなわち、図９は、実施の形態２におけるロボット制御方法の一例を示すフローチャートである。図９のステップＳＴ１からステップＳＴ７は、図６のステップＳＴ１からステップＳＴ７と同じであるため、ここでは詳細説明を省略する。 FIG. 9 is a flowchart showing an example of the operation of the robot control device 100a in the second embodiment. That is, FIG. 9 is a flowchart showing an example of the robot control method in the second embodiment. Steps ST1 to ST7 in FIG. 9 are the same as steps ST1 to ST7 in FIG. 6, so detailed explanation will be omitted here.

図９に示すように、図示しない手段によりロボット制御が開始されると、指令軌跡生成部１０は、周辺環境情報記憶部９で記憶された周辺情報に基づいて、指令軌跡を生成する（ステップＳＴ８）。 As shown in FIG. 9, when robot control is started by means not shown, the command trajectory generation unit 10 generates a command trajectory based on the surrounding information stored in the surrounding environment information storage unit 9 (step ST8).

速度計算部４は、マニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。 The speed calculation unit 4 calculates the speed profile of the manipulator 1 based on the command trajectory to the manipulator 1, the constraint conditions regarding the manipulator 1, and the evaluation index based on the operation time of the manipulator 1 (step ST1).

速度プロファイル記憶部１１は、速度プロファイルを記憶する（ステップＳＴ９）。 The speed profile storage unit 11 stores the speed profile (step ST9).

指令軌跡補正部６ａは、バリア関数の値の指令軌跡に関する勾配を計算し、ステップＳＴ２により計算された勾配と合わせたものを勾配情報として出力する（ステップＳＴ１０）。 The command trajectory correction unit 6a calculates the gradient of the barrier function value with respect to the command trajectory, combines it with the gradient calculated in step ST2, and outputs the result as gradient information (step ST10).

指令軌跡補正部６ａは、勾配情報に基づいて指令軌跡を補正して補正指令軌跡として出力する（ステップＳＴ３）。 The command trajectory correction section 6a corrects the command trajectory based on the gradient information and outputs the corrected command trajectory (step ST3).

ステップＳＴ７の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、ロボットの制御が継続される。ステップＳＴ７の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。 If the determination in step ST7 is "Yes", the process returns to step ST6 and control of the robot is continued. If the determination in step ST7 is "No", control of the robot ends.

以上で説明した実施の形態２によれば、指令軌跡生成部１０が周辺環境１１３からの周辺情報に基づいて指令軌跡を生成するため、障害物との干渉を避けつつ動作時間が短くなるようなマニピュレータ１の補正指令軌跡と速度プロファイルとを高速に求めることができる。 According to the second embodiment described above, the command trajectory generation unit 10 generates the command trajectory based on the surrounding information from the surrounding environment 113. Therefore, a corrected command trajectory and a speed profile for the manipulator 1 that shorten the operation time while avoiding interference with obstacles can be obtained at high speed.

実施の形態３． Embodiment 3.
実施の形態３では、タブレットなどの入出力装置１２を用いて、マニピュレータ１の制御を行う。 In the third embodiment, the manipulator 1 is controlled using an input/output device 12 such as a tablet.

図１０は、実施の形態３におけるロボット制御装置１００ｂの一例を示すブロック図である。図１０は、ロボット制御装置１００ｂが入出力装置１２と図示しない手段により接続されている点で、図１とは異なる。なお、ロボット制御装置１００ｂは、実施の形態１におけるロボット制御装置１００と同じ構成であるため、説明を省略する。なお、ロボット制御装置１００ｂは、実施の形態２におけるロボット制御装置１００ａと同じ構成であってもよい。 FIG. 10 is a block diagram showing an example of a robot control device 100b in the third embodiment. FIG. 10 differs from FIG. 1 in that the robot control device 100b is connected to the input/output device 12 by means not shown. Note that the robot control device 100b has the same configuration as the robot control device 100 in Embodiment 1, so a description thereof will be omitted. Note that the robot control device 100b may have the same configuration as the robot control device 100a in the second embodiment.

入出力装置１２は、マニピュレータ１の動作情報と周辺環境１１３から取得される周辺情報とを画面に表示する。動作情報は、例えばマニピュレータ１が動作している映像のことである。入出力装置１２は、作業者が入力した動作情報をロボット制御装置１００ｂに出力する。例えば、入出力装置１２は、画面上のタッチパネルあるいは音声インターフェースを通じて、作業者にマニピュレータ１の動作の始点と終点とを入力させてもよいし、作業者がタブレットの画面を指でトレースすることで、マニピュレータ１の把持部１１１の大まかな指令軌跡を入力させてもよい。この指令軌跡は、指令軌跡記憶部３で記憶される。なお、作業者は、マニピュレータ１が動作中に入出力装置１２を用いて停止させることもできる。この場合、入出力装置１２は、画面上に「動作停止」の表示をさせ、作業者にその表示をタッチさせることで、作業停止の命令をロボット制御装置１００ｂへ送信する。これにより、ロボット制御装置１００ｂは、マニピュレータ１の動作を停止させる。 The input/output device 12 displays operation information of the manipulator 1 and surrounding information acquired from the surrounding environment 113 on a screen. The operation information is, for example, video of the manipulator 1 in operation. The input/output device 12 outputs operation information entered by the worker to the robot control device 100b. For example, the input/output device 12 may let the worker enter the start point and end point of the operation of the manipulator 1 through a touch panel on the screen or a voice interface, or may let the worker enter a rough command trajectory for the gripping unit 111 of the manipulator 1 by tracing the tablet screen with a finger. This command trajectory is stored in the command trajectory storage unit 3. The worker can also use the input/output device 12 to stop the manipulator 1 while it is operating. In this case, the input/output device 12 displays "stop operation" on the screen and, when the worker touches that display, transmits a stop command to the robot control device 100b. The robot control device 100b then stops the operation of the manipulator 1.

図１１は、実施の形態３におけるロボット制御装置１００ｂの動作の一例を示すフローチャートである。すなわち、図１１は、実施の形態３におけるロボット制御方法の一例を示すフローチャートである。図１１のステップＳＴ１からステップＳＴ６は、図６のステップＳＴ１からステップＳＴ６と同じであるため、ここでは詳細説明を省略する。 FIG. 11 is a flowchart showing an example of the operation of the robot control device 100b in the third embodiment. That is, FIG. 11 is a flowchart showing an example of the robot control method in the third embodiment. Steps ST1 to ST6 in FIG. 11 are the same as steps ST1 to ST6 in FIG. 6, so detailed explanation will be omitted here.

図１１に示すように、図示しない手段によりロボット制御が開始されると、速度計算部４は、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。 As shown in FIG. 11, when robot control is started by means not shown, the speed calculation unit 4 calculates the speed profile of the manipulator 1 based on a preset command trajectory for the manipulator 1, the constraint conditions regarding the manipulator 1, and an evaluation index based on the operation time of the manipulator 1 (step ST1).

図示しない手段により、ロボットの制御を継続するか否かが判定される（ステップＳＴ１１）。 By means not shown, it is determined whether or not to continue controlling the robot (step ST11).

ステップＳＴ１１の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、マニピュレータ１の制御が継続される。ステップＳＴ１１の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。ロボットの制御が終了するのは、例えばマニピュレータ１が補正指令軌跡の終点にたどり着いた場合である。あるいは、マニピュレータ１が異常動作したと判定された場合である。あるいは、入出力装置１２から「動作停止」の命令がロボット制御装置１００ｂへ送信された場合である。この判定は、図示しない手段により行われる。 If the determination in step ST11 is "Yes", the process returns to step ST6 and control of the manipulator 1 continues. If the determination in step ST11 is "No", control of the robot ends. Control of the robot ends, for example, when the manipulator 1 reaches the end point of the corrected command trajectory, when it is determined that the manipulator 1 has operated abnormally, or when a "stop operation" command is transmitted from the input/output device 12 to the robot control device 100b. This determination is made by means not shown.

以上で説明した実施の形態３によれば、入出力装置１２を用いてマニピュレータ１の制御を行うことで、マニピュレータ１の動作を可視化することができる。 According to the third embodiment described above, the operation of the manipulator 1 can be visualized by controlling the manipulator 1 using the input/output device 12.

実施の形態４． Embodiment 4.
実施の形態４では、マニピュレータ１が移動架台１３の上に設置された状態で、マニピュレータ１の制御を行う。 In the fourth embodiment, the manipulator 1 is controlled while it is installed on a movable frame 13.

図１２は、実施の形態４におけるロボット制御装置１００ｃの一例を示すブロック図である。図１２は、マニピュレータ１が移動架台１３の上に設置される点で、図１とは異なる。また、図１２は、ロボット制御装置１００ｃが制約条件記憶部２の代わりに制約条件記憶部２ｃを備える点、速度計算部４の代わりに速度計算部４ｃを備える点、および勾配計算部５の代わりに勾配計算部５ｃを備える点で、図１とは異なる。制約条件記憶部２ｃ、速度計算部４ｃおよび勾配計算部５ｃ以外は、図１に示すものと同じであるため、説明を省略する。なお、ロボット制御装置１００ｃは、実施の形態１におけるロボット制御装置１００をベースとしているが、実施の形態２におけるロボット制御装置１００ａあるいは実施の形態３におけるロボット制御装置１００ｂをベースとしてもよい。 FIG. 12 is a block diagram showing an example of a robot control device 100c in the fourth embodiment. FIG. 12 differs from FIG. 1 in that the manipulator 1 is installed on a movable frame 13. FIG. 12 also differs from FIG. 1 in that the robot control device 100c includes a constraint storage unit 2c instead of the constraint storage unit 2, a speed calculation unit 4c instead of the speed calculation unit 4, and a gradient calculation unit 5c instead of the gradient calculation unit 5. The components other than the constraint storage unit 2c, the speed calculation unit 4c, and the gradient calculation unit 5c are the same as those shown in FIG. 1, so their description is omitted. Note that the robot control device 100c is based on the robot control device 100 in the first embodiment, but may instead be based on the robot control device 100a in the second embodiment or the robot control device 100b in the third embodiment.

制約条件記憶部２ｃは、マニピュレータ１に関して予め設定された制約条件のパラメータを記憶する。制約条件は、マニピュレータ１の関節角速度、関節角加速度、関節トルク、手先速度、手先加速度、前記マニピュレータ１が把持する対象で発生する力、および前記対象で発生するモーメントに加え、マニピュレータ１が設置される移動架台１３に与える反力、および反力のトルク成分のうち少なくとも１つに関する条件を含む。 The constraint storage unit 2c stores parameters of constraint conditions set in advance for the manipulator 1. The constraint conditions include a condition on at least one of the following: the joint angular velocity, joint angular acceleration, joint torque, hand velocity, and hand acceleration of the manipulator 1, the force generated in the object gripped by the manipulator 1, and the moment generated in the object, as well as the reaction force applied to the movable frame 13 on which the manipulator 1 is installed and the torque component of the reaction force.

なお、制約条件のパラメータが予め設定される場合について説明したが、ロボット制御装置１００ｃが図示しない反力制約学習部を備えてもよい。反力制約学習部は、マニピュレータ１が設置される移動架台１３に与える反力および反力のトルク成分のうち少なくとも１つについて、機械学習を用いて制約条件のパラメータを学習し、得られた制約条件のパラメータを制約条件記憶部２ｃに記憶してもよい。具体的には、反力制約学習部は、マニピュレータ１の動作中に、移動架台１３に与える反力および反力のトルク成分を図示しないセンサによって取得し、取得した値に基づいて、制約条件のパラメータとして学習する。 Although the case where the constraint parameters are set in advance has been described, the robot control device 100c may include a reaction force constraint learning unit (not shown). The reaction force constraint learning unit may use machine learning to learn constraint parameters for at least one of the reaction force applied to the movable frame 13 on which the manipulator 1 is installed and the torque component of the reaction force, and may store the obtained constraint parameters in the constraint storage unit 2c. Specifically, while the manipulator 1 is operating, the reaction force constraint learning unit acquires the reaction force applied to the movable frame 13 and the torque component of the reaction force with a sensor (not shown), and learns the constraint parameters based on the acquired values.

速度計算部４ｃは、指令軌跡記憶部３からの指令軌跡と、制約条件記憶部２ｃからの制約条件のパラメータとに基づいて、制約条件の範囲内でマニピュレータ１の動作時間を短くする指令軌跡上の速度プロファイルを計算する。 Based on the command trajectory from the command trajectory storage unit 3 and the constraint parameters from the constraint storage unit 2c, the speed calculation unit 4c calculates a speed profile on the command trajectory that shortens the operation time of the manipulator 1 within the range of the constraint conditions.

速度計算部４ｃの構成は、図３に示す速度計算部４の構成と同じであるが、動力学計算部４２の動力学計算に、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分の計算を含める点、および制約条件係数計算部４３が反力および反力のトルク成分に関する制約条件も含めて制約条件の係数を計算する点が、速度計算部４とは異なる。なお、ロボット制御装置１００ｃが反力制約学習部を備える場合、速度計算部４ｃは、指令軌跡と、反力制約学習部で学習した制約条件のパラメータとに基づいて、速度プロファイルを計算する。 The configuration of the speed calculation unit 4c is the same as that of the speed calculation unit 4 shown in FIG. 3, but it differs from the speed calculation unit 4 in two respects: the dynamics calculation of the dynamics calculation unit 42 includes calculation of the reaction force applied to the movable frame 13 during operation of the manipulator 1 and the torque component of the reaction force, and the constraint coefficient calculation unit 43 calculates the coefficients of the constraint conditions including the constraint conditions on the reaction force and the torque component of the reaction force. When the robot control device 100c includes the reaction force constraint learning unit, the speed calculation unit 4c calculates the speed profile based on the command trajectory and the constraint parameters learned by the reaction force constraint learning unit.
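The effect of adding a reaction force constraint to the speed profile calculation can be seen in a one-dimensional toy model. The sketch below assumes a single moving mass accelerated along a straight line, so that the reaction force on the frame is mass times acceleration, and a symmetric trapezoidal speed profile; the multi-joint dynamics calculation of this disclosure is not reproduced, and the function names and parameters are hypothetical.

```python
import math


def effective_accel_limit(a_max, mass, reaction_force_max):
    """Tightest acceleration bound: actuator limit or reaction force limit F_max / m."""
    return min(a_max, reaction_force_max / mass)


def min_move_time(distance, v_max, a_lim):
    """Shortest rest-to-rest time for a symmetric trapezoidal speed profile."""
    d_ramp = v_max ** 2 / a_lim  # distance needed to reach v_max and brake again
    if distance <= d_ramp:
        return 2.0 * math.sqrt(distance / a_lim)  # triangular profile, v_max not reached
    return 2.0 * v_max / a_lim + (distance - d_ramp) / v_max
```

With `a_max = 4`, `mass = 10`, and `reaction_force_max = 20`, the effective limit drops to 2, so a move over distance 1 at `v_max = 1` takes 1.5 instead of 1.25: the reaction force bound costs some time, and the optimization then looks for the shortest time that still respects it.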

勾配計算部５ｃは、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算し、勾配情報として出力する。 The gradient calculation unit 5c calculates the gradient of the operation time of the manipulator 1 with respect to the command trajectory and outputs it as gradient information.

勾配計算部５ｃの構成は、図４に示す勾配計算部５の構成と同じであるが、制約条件係数勾配計算部５３が、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分も考慮して、動作時間の運動学計算結果に関する勾配、および動作時間の動力学計算結果に関する勾配を計算する点が、勾配計算部５とは異なる。また、プロファイル勾配計算部５４が、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分も考慮して勾配を計算する点が、勾配計算部５とは異なる。 The configuration of the gradient calculation unit 5c is the same as that of the gradient calculation unit 5 shown in FIG. 4, but it differs from the gradient calculation unit 5 in that the constraint coefficient gradient calculation unit 53 calculates the gradient of the operation time with respect to the kinematic calculation result and the gradient of the operation time with respect to the dynamics calculation result while also taking into account the reaction force applied to the movable frame 13 during operation of the manipulator 1 and the torque component of the reaction force. It also differs from the gradient calculation unit 5 in that the profile gradient calculation unit 54 calculates its gradient while also taking into account the reaction force applied to the movable frame 13 during operation of the manipulator 1 and the torque component of the reaction force.

以上のように、ロボット制御装置１００ｃが制約条件記憶部２ｃ、速度計算部４ｃおよび勾配計算部５ｃを備えることで、移動架台１３への反力に起因する振動を抑制することができ、マニピュレータ１の動作時間を短くすることができる。 As described above, because the robot control device 100c includes the constraint storage unit 2c, the speed calculation unit 4c, and the gradient calculation unit 5c, vibration caused by the reaction force on the movable frame 13 can be suppressed and the operation time of the manipulator 1 can be shortened.

図１３は、実施の形態４におけるロボット制御装置１００ｃの動作の一例を示すフローチャートである。すなわち、図１３は、実施の形態４におけるロボット制御方法の一例を示すフローチャートである。図１３のステップＳＴ１からステップＳＴ７は、図６のステップＳＴ１からステップＳＴ７と同じであるため、ここでは詳細説明を省略する。 FIG. 13 is a flowchart showing an example of the operation of the robot control device 100c in the fourth embodiment. That is, FIG. 13 is a flowchart showing an example of the robot control method in the fourth embodiment. Steps ST1 to ST7 in FIG. 13 are the same as steps ST1 to ST7 in FIG. 6, so detailed explanation will be omitted here.

図１３に示すように、図示しない手段によりロボット制御が開始されると、速度計算部４ｃは、予め設定されたマニピュレータ１への指令軌跡と、マニピュレータ１に関する制約条件と、マニピュレータ１の動作時間に基づく評価指標とに基づいて、マニピュレータ１の速度プロファイルを計算する（ステップＳＴ１）。速度計算部４ｃ内の動力学計算部４２は、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分の計算を含めて運動学計算と動力学計算とを行う。また、速度計算部４ｃ内の制約条件係数計算部４３は、移動架台１３に与える反力および反力のトルク成分に関する制約条件も含めて、制約条件の係数を計算する。 As shown in FIG. 13, when robot control is started by means not shown, the speed calculation unit 4c calculates the speed profile of the manipulator 1 based on a preset command trajectory for the manipulator 1, the constraint conditions regarding the manipulator 1, and an evaluation index based on the operation time of the manipulator 1 (step ST1). The dynamics calculation unit 42 in the speed calculation unit 4c performs the kinematic calculation and the dynamics calculation, including calculation of the reaction force applied to the movable frame 13 during operation of the manipulator 1 and the torque component of the reaction force. The constraint coefficient calculation unit 43 in the speed calculation unit 4c calculates the coefficients of the constraint conditions, including the constraint conditions on the reaction force applied to the movable frame 13 and the torque component of the reaction force.

勾配計算部５ｃは、マニピュレータ１の動作時間の指令軌跡に関する勾配を計算して勾配情報として出力する（ステップＳＴ２）。勾配計算部５ｃ内の制約条件係数勾配計算部５３は、マニピュレータ１の動作時に移動架台１３に与える反力および反力のトルク成分も考慮して、動作時間の運動学計算結果に関する勾配、および動作時間の動力学計算結果に関する勾配を計算する。また、勾配計算部５ｃ内のプロファイル勾配計算部５４は、移動架台１３に与える反力および反力のトルク成分も考慮して、動作時間の制約条件の係数に関する勾配を計算する。 The gradient calculation unit 5c calculates the gradient of the operation time of the manipulator 1 with respect to the command trajectory and outputs it as gradient information (step ST2). The constraint coefficient gradient calculation unit 53 in the gradient calculation unit 5c calculates the gradient of the operation time with respect to the kinematic calculation result and the gradient of the operation time with respect to the dynamics calculation result, also taking into account the reaction force applied to the movable frame 13 during operation of the manipulator 1 and the torque component of the reaction force. The profile gradient calculation unit 54 in the gradient calculation unit 5c calculates the gradient of the operation time with respect to the coefficients of the constraint conditions, also taking into account the reaction force applied to the movable frame 13 and the torque component of the reaction force.

ステップＳＴ１１の判定が「Ｙｅｓ」の場合は、処理はステップＳＴ６に戻り、マニピュレータ１の制御が継続される。ステップＳＴ１１の判定が「Ｎｏ」の場合は、ロボットの制御が終了する。 If the determination in step ST11 is "Yes", the process returns to step ST6, and control of the manipulator 1 is continued. If the determination in step ST11 is "No", control of the robot ends.

以上で説明した実施の形態４によれば、マニピュレータ１の動作中に移動架台１３に与える反力および反力のトルク成分を考慮することで、移動架台１３の振動を抑制することができ、マニピュレータ１の動作時間を短くすることができる。 According to the fourth embodiment described above, by taking into account the reaction force applied to the movable frame 13 during operation of the manipulator 1 and the torque component of the reaction force, vibration of the movable frame 13 can be suppressed and the operation time of the manipulator 1 can be shortened.

なお、実施の形態４において、ロボット制御装置１００ｃが行うマニピュレータ１の制御は、移動架台１３に対しても適用できる。また、マニピュレータ１が移動架台１３に設置される場合に限定されず、マニピュレータ１が図示しない固定架台に設置される場合にも適用できる。 Note that in the fourth embodiment, the control of the manipulator 1 performed by the robot control device 100c can also be applied to the movable gantry 13. Furthermore, the present invention is not limited to the case where the manipulator 1 is installed on the movable pedestal 13, but can also be applied when the manipulator 1 is installed on a fixed pedestal (not shown).

実施の形態１から４におけるロボット制御装置１００，１００ａ，１００ｂおよび１００ｃおよびロボット制御方法は、垂直多関節ロボットであるマニピュレータ１以外にも適用できる。例えば、水平多関節ロボットなどの任意の軸構成のマニピュレータ１にも適用できる。また、マニピュレータ１以外の産業装置にも適用できる。 The robot control devices 100, 100a, 100b, and 100c and the robot control method in Embodiments 1 to 4 can be applied to other than manipulator 1, which is a vertically articulated robot. For example, it can be applied to a manipulator 1 with any axis configuration, such as a horizontal articulated robot. Moreover, it can be applied to industrial devices other than the manipulator 1.

ここで、実施の形態１から４におけるロボット制御装置１００，１００ａ，１００ｂおよび１００ｃのハードウェア構成について説明する。ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの各機能は、処理回路によって実現し得る。処理回路は、少なくとも１つのプロセッサと少なくとも１つのメモリとを備える。 Here, the hardware configurations of robot control devices 100, 100a, 100b, and 100c in Embodiments 1 to 4 will be described. Each function of robot control devices 100, 100a, 100b, and 100c can be realized by a processing circuit. The processing circuit includes at least one processor and at least one memory.

図１４は、実施の形態１から４におけるロボット制御装置１００，１００ａ，１００ｂおよび１００ｃのハードウェア構成を示す図である。ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃは、図１４（ａ）に示すプロセッサ２００およびメモリ２０１によって実現することができる。プロセッサ２００は、例えばＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ、中央処理装置、処理装置、演算装置、マイクロプロセッサ、マイクロコンピュータ、プロセッサ、ＤＳＰ（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ）ともいう）またはシステムＬＳＩ（ＬａｒｇｅＳｃａｌｅＩｎｔｅｇｒａｔｉｏｎ）である。 FIG. 14 is a diagram showing the hardware configuration of the robot control devices 100, 100a, 100b, and 100c in Embodiments 1 to 4. The robot control devices 100, 100a, 100b, and 100c can be realized by the processor 200 and the memory 201 shown in FIG. 14(a). The processor 200 is, for example, a CPU (Central Processing Unit; also called a central processing device, processing device, arithmetic device, microprocessor, microcomputer, processor, or DSP (Digital Signal Processor)) or a system LSI (Large Scale Integration).

メモリ２０１は、例えばＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、フラッシュメモリ、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＥＥＰＲＯＭ（登録商標）（ＥｌｅｃｔｒｉｃａｌｌｙＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲｅａｄ－ＯｎｌｙＭｅｍｏｒｙ）などの不揮発性または揮発性の半導体メモリ、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、またはＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）などである。 The memory 201 is, for example, a nonvolatile or volatile semiconductor memory such as a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an EPROM (Erasable Programmable Read Only Memory), or an EEPROM (registered trademark) (Electrically Erasable Programmable Read-Only Memory), or an HDD (Hard Disk Drive), a magnetic disk, a flexible disk, an optical disk, a compact disk, a mini disk, or a DVD (Digital Versatile Disk).

ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの各部の機能は、ソフトウェアなど（ソフトウェア、ファームウェア、またはソフトウェアとファームウェア）により実現される。ソフトウェアなどはプログラムとして記述され、メモリ２０１に格納される。プロセッサ２００は、メモリ２０１で記憶されているプログラムを読み出して実行することにより、各部の機能を実現する。すなわち、このプログラムは、ロボット制御装置１００，１００ａ，１００ｂおよび１００ｃの手順または方法をコンピュータに実行させるものであると言える。 The functions of each part of the robot control devices 100, 100a, 100b, and 100c are realized by software or the like (software, firmware, or software and firmware). Software and the like are written as programs and stored in the memory 201. The processor 200 reads and executes programs stored in the memory 201 to realize the functions of each section. That is, this program can be said to cause a computer to execute the procedures or methods of the robot control devices 100, 100a, 100b, and 100c.

The program executed by the processor 200 may be stored, as a file in an installable or executable format, on a computer-readable storage medium and provided as a computer program product. The program executed by the processor 200 may also be provided to the robot control devices 100, 100a, 100b, and 100c via a network such as the Internet.

The robot control devices 100, 100a, 100b, and 100c may also be realized by the dedicated processing circuit 202 shown in FIG. 14(b). When the processing circuit 202 is dedicated hardware, it is, for example, a single circuit, a composite circuit, a programmed processor, a parallel-programmed processor, an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a combination of these.

The description above covers configurations in which the functions of the components of the robot control devices 100, 100a, 100b, and 100c are realized either entirely by software or the like or entirely by hardware. The configuration is not limited to these, however: some of the components of the robot control devices 100, 100a, 100b, and 100c may be realized by software or the like while the others are realized by dedicated hardware.

1 manipulator; 2, 2c constraint condition storage unit; 3 command trajectory storage unit; 4, 4c speed calculation unit; 41 command interpolation calculation unit; 42 dynamics calculation unit; 43 constraint condition coefficient calculation unit; 44 profile calculation unit; 5, 5c gradient calculation unit; 51 command interpolation gradient calculation unit; 52 dynamics gradient calculation unit; 53 constraint condition coefficient gradient calculation unit; 54 profile gradient calculation unit; 6, 6a command trajectory correction unit; 61 distance function calculation unit; 62 barrier function calculation unit; 63 barrier function gradient calculation unit; 64 command trajectory correction value calculation unit; 7 command point sequence calculation unit; 8 control unit; 81 feedforward control unit; 82 feedback control unit; 83 current value calculation unit; 9 surrounding environment information storage unit; 10 command trajectory generation unit; 110 actuator; 111 gripping unit; 112 workpiece; 113 surrounding environment; 13 moving frame; 200 processor; 201 memory; 202 processing circuit.

Claims

1. A robot control device comprising:
a speed calculation unit that calculates a speed profile of a manipulator based on a preset command trajectory for the manipulator, constraint conditions regarding the manipulator, and an evaluation index based on an operation time of the manipulator;
a gradient calculation unit that calculates, based on the speed profile, a gradient of the operation time with respect to the command trajectory and outputs the gradient as gradient information;
a command trajectory correction unit that corrects the command trajectory based on the gradient information to obtain a corrected command trajectory; and
a control unit that controls the manipulator to follow the corrected command trajectory.

2. The robot control device according to claim 1, further comprising a command trajectory generation unit that generates the command trajectory based on peripheral information around the manipulator.

3. The robot control device according to claim 1 or 2, wherein the constraint conditions include a condition on at least one of a joint angular velocity of the manipulator, a joint angular acceleration of the manipulator, a joint torque of the manipulator, a hand speed of the manipulator, a hand acceleration of the manipulator, a force generated in an object gripped by the manipulator, and a moment generated in the object.

4. The robot control device according to claim 3, further comprising a grip constraint learning unit that learns, using machine learning, parameters of the constraint condition on at least one of the force and the moment,
wherein the speed calculation unit calculates the speed profile based on the parameters learned by the grip constraint learning unit.

5. The robot control device according to any one of claims 1 to 4, wherein the constraint conditions include a condition on at least one of a reaction force applied to a pedestal on which the manipulator is installed and a torque component of the reaction force.

6. The robot control device according to claim 5, further comprising a reaction force constraint learning unit that learns, using machine learning, parameters of the constraint condition on at least one of the reaction force and the torque component of the reaction force,
wherein the speed calculation unit calculates the speed profile based on the parameters learned by the reaction force constraint learning unit.

7. The robot control device according to any one of claims 1 to 6, wherein the speed calculation unit includes:
a command interpolation calculation unit that calculates a position at an interpolation point on the command trajectory, together with first-order and second-order derivatives with respect to a parameter of the command trajectory;
a dynamics calculation unit that performs a kinematic calculation and a dynamic calculation of the manipulator using the position, the first-order derivative, and the second-order derivative, and outputs a kinematic calculation result and a dynamic calculation result;
a constraint condition coefficient calculation unit that calculates coefficients of the constraint conditions based on the kinematic calculation result, the dynamic calculation result, and the constraint conditions; and
a profile calculation unit that calculates the speed profile based on the coefficients of the constraint conditions.
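
The speed-profile calculation described in the preceding claim can be pictured with a deliberately simplified Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the manipulator is replaced by a point moving along a 2D polyline, the constraint conditions are reduced to a speed cap `v_max` and a tangential acceleration cap `a_max`, acceleration is taken as constant within each segment, and the names `speed_profile` and `operation_time` are hypothetical.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def speed_profile(points: Sequence[Point], v_max: float, a_max: float) -> List[float]:
    """Speed at each waypoint of a polyline that starts and ends at rest.

    Toy model (assumption): a point mass with a speed cap and a tangential
    acceleration cap stands in for the manipulator's constraint conditions.
    Consecutive waypoints must be distinct.
    """
    if len(points) < 3:
        raise ValueError("need at least three waypoints")
    seg = [math.dist(points[i], points[i + 1]) for i in range(len(points) - 1)]
    v = [v_max] * len(points)
    v[0] = 0.0
    v[-1] = 0.0
    # Forward pass: acceleration limit, v_next^2 <= v^2 + 2 * a_max * ds.
    for i in range(len(seg)):
        v[i + 1] = min(v[i + 1], math.sqrt(v[i] ** 2 + 2.0 * a_max * seg[i]))
    # Backward pass: the same limit applied to deceleration.
    for i in range(len(seg) - 1, -1, -1):
        v[i] = min(v[i], math.sqrt(v[i + 1] ** 2 + 2.0 * a_max * seg[i]))
    return v


def operation_time(points: Sequence[Point], v: Sequence[float]) -> float:
    """Travel time, assuming constant acceleration within each segment."""
    total = 0.0
    for i in range(len(points) - 1):
        ds = math.dist(points[i], points[i + 1])
        total += 2.0 * ds / (v[i] + v[i + 1])  # ds divided by the mean speed
    return total
```

For three collinear waypoints spaced 1 m apart with `v_max = 10` and `a_max = 2`, the acceleration cap governs: the sketch returns speeds `[0.0, 2.0, 0.0]` and an operation time of 2 s.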

8. The robot control device according to item 1, wherein the gradient calculation unit calculates the gradient of the operation time with respect to the command trajectory by differentiating the operation time with respect to the command trajectory using automatic differentiation, and outputs the gradient as the gradient information.
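
To make the idea of differentiating the operation time by automatic differentiation concrete, here is a minimal forward-mode sketch in Python using dual numbers. It is only an illustration under stated assumptions, not the patented implementation: the operation time is modelled as path length divided by a constant speed for a path from (0, 0) through a single via point (1, y) to (2, 0), and the names `Dual`, `operation_time`, and `time_and_gradient` are hypothetical.

```python
from __future__ import annotations

import math
from dataclasses import dataclass
from typing import Tuple, Union

Number = Union[float, "Dual"]


@dataclass(frozen=True)
class Dual:
    """Dual number: a value plus its derivative with respect to one input."""

    val: float
    dot: float = 0.0

    @staticmethod
    def lift(x: Number) -> "Dual":
        return x if isinstance(x, Dual) else Dual(float(x))

    def __add__(self, other: Number) -> "Dual":
        o = Dual.lift(other)
        return Dual(self.val + o.val, self.dot + o.dot)

    __radd__ = __add__

    def __mul__(self, other: Number) -> "Dual":
        o = Dual.lift(other)
        # Product rule.
        return Dual(self.val * o.val, self.dot * o.val + self.val * o.dot)

    __rmul__ = __mul__

    def __truediv__(self, other: Number) -> "Dual":
        o = Dual.lift(other)
        # Quotient rule.
        return Dual(self.val / o.val,
                    (self.dot * o.val - self.val * o.dot) / (o.val * o.val))

    def sqrt(self) -> "Dual":
        r = math.sqrt(self.val)
        return Dual(r, self.dot / (2.0 * r))


def operation_time(via_y: Dual, speed: float = 1.0) -> Dual:
    """Toy operation time (assumption): path length / constant speed."""
    leg = (1.0 + via_y * via_y).sqrt()  # each leg spans 1 in x and via_y in y
    return (leg + leg) / speed


def time_and_gradient(via_y: float) -> Tuple[float, float]:
    """Operation time and its derivative with respect to the via point."""
    out = operation_time(Dual(via_y, 1.0))  # seed derivative dy/dy = 1
    return out.val, out.dot
```

In this toy model the closed form is T(y) = 2·sqrt(1 + y²), so `time_and_gradient(1.0)` should agree with T = 2√2 and dT/dy = √2 up to rounding. A real command trajectory has many parameters, for which reverse-mode automatic differentiation is usually the more economical choice.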

9. The robot control device according to …, wherein the command trajectory correction unit corrects the command trajectory based on the gradient information by using any one of a gradient descent method, a conjugate gradient method, and a quasi-Newton method.
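
Of the three methods named in this claim, gradient descent is the simplest to illustrate. The Python sketch below is a toy under stated assumptions, not the claimed implementation: the command trajectory is a list of via-point heights `ys` at fixed, evenly spaced x positions, the operation time is path length divided by a constant speed, the gradient is written out analytically, and the names `path_time`, `path_time_gradient`, and `correct_trajectory` are hypothetical.

```python
import math
from typing import List, Sequence


def path_time(ys: Sequence[float], dx: float = 1.0, speed: float = 1.0) -> float:
    """Toy operation time (assumption): polyline length / constant speed."""
    length = sum(math.hypot(dx, ys[i + 1] - ys[i]) for i in range(len(ys) - 1))
    return length / speed


def path_time_gradient(ys: Sequence[float], dx: float = 1.0,
                       speed: float = 1.0) -> List[float]:
    """Analytic gradient of path_time with respect to every height in ys."""
    grad = [0.0] * len(ys)
    for i in range(len(ys) - 1):
        dy = ys[i + 1] - ys[i]
        d = dy / (math.hypot(dx, dy) * speed)
        grad[i + 1] += d  # raising the far end lengthens the segment if dy > 0
        grad[i] -= d      # raising the near end shortens it if dy > 0
    return grad


def correct_trajectory(ys: Sequence[float], step: float = 0.2,
                       iterations: int = 200) -> List[float]:
    """Fixed-step gradient descent; the start and goal points stay fixed."""
    out = list(ys)
    for _ in range(iterations):
        grad = path_time_gradient(out)
        for i in range(1, len(out) - 1):
            out[i] -= step * grad[i]
    return out
```

Starting from `[0.0, 1.0, 0.0]`, the corrected trajectory flattens toward a straight line and the toy operation time falls from 2√2 toward 2. The fixed step size is chosen by hand for this example; a conjugate gradient or quasi-Newton method would replace the update inside the loop.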

10. A robot control method comprising:
calculating a speed profile of a manipulator based on a preset command trajectory for the manipulator, constraint conditions regarding the manipulator, and an evaluation index based on an operation time of the manipulator;
calculating, based on the speed profile, a gradient of the operation time with respect to the command trajectory to obtain gradient information;
correcting the command trajectory based on the gradient information to obtain a corrected command trajectory; and
controlling the manipulator to follow the corrected command trajectory.