JPH06114768A

JPH06114768A - Robot control device

Info

Publication number: JPH06114768A
Application number: JP28526892A
Authority: JP
Inventors: Yukinori Kakazu; 侑昇嘉数; Katsuhiro Komuro; 克弘小室; Takao Yoneda; 孝夫米田
Original assignee: Toyoda Koki KK
Current assignee: Toyoda Koki KK
Priority date: 1992-09-29
Filing date: 1992-09-29
Publication date: 1994-04-26

Abstract

PURPOSE: To provide a robot control device that handles error factors which could not previously be taken into account, improves correction accuracy, and eliminates the need for on-site teaching. CONSTITUTION: A target position calculated by an offline teaching host system 60 is received by the target position Mm' holding unit 51 of a robot control device 50, and the target position Mm' is corrected by a correction unit 52 using the output of a neural network. A robot control unit 54 controls a robot 10 based on the target position corrected using the output of the neural network.

Description

Detailed Description of the Invention

[0001]

[Field of Industrial Application] The present invention relates to a robot control device, and more particularly to a robot control device in an offline teaching system that corrects the position of a robot using a neural network.

[0002]

[Prior Art] Absolute positioning errors in industrial robots are caused by dimensional errors of the robot arm due to machining errors, assembly errors, thermal distortion and the like, and by deflection of each joint under the arm's own weight and the like. In a conventional offline teaching system, an industrial robot is controlled as shown in FIG. 11. First, on the offline teaching system 700 side, a target position is generated (701) and this target position Mm is held (702). This value is then corrected by a mathematical model that takes arm length, assembly angle errors and the like into account (703) to calculate a corrected target position Mm', which is held by writing it to, for example, a floppy disk (704). When the operator operates the robot 710, the floppy disk is transferred to the robot control device 750. The robot control device 750 reads the corrected target position Mm' written on the floppy disk (751) and controls the robot 710 (754).

[0003]

[Problem to Be Solved by the Invention] However, even when the offline teaching system corrects the target position Mm with a mathematical model that takes error factors into account, the causes of robot positioning error are diverse and difficult to express completely in a mathematical model, so the correction accuracy was limited.

[0004] Furthermore, because the absolute positioning accuracy of the robot is low as described above, teaching correction at the place of actual use is required when setting up the robot control device of an offline teaching system. Setting up the robot control device therefore takes time and a great deal of labor, which increases cost.

[0005] The present invention was made to solve the above problems, and its object is to provide a robot control device that has high correction accuracy and requires no on-site teaching correction.

[0006]

[Means for Solving the Problem] To solve the above problems, the invention is a robot control device for an offline teaching system, characterized by comprising: target position holding means that receives a robot control target position calculated by the offline teaching system; a neural network that takes the target position as input and outputs a correction amount; correction means that corrects the calculated target position using the output of the neural network; and control means that controls the robot based on the target position corrected by the neural network.

[0007]

[Operation] With the above means, the target position holding means receives the target position calculated by the offline teaching system, the correction means corrects the calculated target position using the neural network, and the control means controls the robot based on the target position corrected by the neural network.

[0008]

[Embodiment] A robot control device according to this embodiment is described below with reference to the drawings. First, with reference to FIG. 3, an outline is given of the operation of the robot control device 50 according to this embodiment and of the offline teaching host system 60, which supplies the robot control target position to the robot control device 50. On the offline teaching host system 60 side, a target position generation unit 61 generates the position matrix Mm of the robot control target, a target position Mm holding unit 62 holds this target position matrix Mm, and a correction unit 63 corrects the values of this target position matrix Mm using a mathematical model that takes the robot arm lengths, assembly angle errors and the like into account, yielding a corrected target position matrix Mm'. Next, the target position Mm' holding unit 64 writes this matrix, for example to a floppy disk, as robot control information. The operator then removes the floppy disk containing the corrected target position matrix Mm' from the target position matrix Mm' holding unit 64 and sets it in the target position matrix Mm' holding unit 51 on the robot control device 50 side, which puts the robot control device 50 into operation. The target position holding unit 51 of the robot control device 50 reads the corrected target position matrix Mm' written on the floppy disk, and the correction unit 52 corrects the target position matrix Mm' based on the output of the neural network, yielding a target position matrix Mm'' that has been further corrected by the neural network. The target position matrix Mm'' holding unit 53 holds this matrix, and the robot control unit 54 controls the robot 10 based on the target position matrix Mm'' corrected by the neural network.

The six-joint robot 10 controlled by this robot control device 50 is described in more detail with reference to FIG. 1. The robot 10 consists of a column 14 mounted on a pedestal 12 fixed to a base 13, a first arm 15, a second arm 16, a third arm 17, and a finger 19. Its first joint a, second joint b, third joint c, fourth joint d, fifth joint e, and sixth joint f allow the position and orientation of the finger 19 to be controlled freely with six degrees of freedom.

[0009] Next, the configuration of the robot control device 50 that controls the robot 10 is described with reference to FIG. 2. Connected to the CPU 11, which computes and controls the position of the robot 10, are an operating box 27 and an operation panel 26 for entering control commands, an external storage device 29 that holds, among other things, the control information (floppy disk) created by the offline teaching host system 60 shown in FIG. 3, a ROM 20 described in detail later, and a RAM 30. Also connected to the CPU 11 are a 1-axis servo control unit 40a, which controls the first joint a, through a 6-axis servo control unit 40f, which controls the sixth joint f. The CPU 11 performs the processing specified by the control information stored in the external storage device 29 and issues control commands to the 1-axis servo control unit 40a through the 6-axis servo control unit 40f. In response, the servo control units (40a to 40f) rotate the servo motors M1 to M6 and move the first joint a through the sixth joint f, thereby driving the robot 10. The motion of each servo motor M1 to M6 is fed back to its servo control unit by the encoders E1 to E6.

[0010] Before describing the control performed by the robot control device, the position correction of this embodiment is explained. In conventional control of an industrial robot, as described in the prior-art section with reference to FIG. 11, the offline teaching system 700 obtains the target position matrix Mm of the robot arm tip (701), applies position error correction to this target position matrix Mm using a mathematical model d that takes errors into account to obtain a corrected target position matrix Mm' (703), and holds it (704). The robot control device 750 then inversely transforms this corrected target position matrix Mm' with an inverse transformation function fa to obtain the joint angle vector Θa that serves as the control target for each joint of the robot, and controls the robot on that basis (754). The position matrix Ma of the robot arm tip controlled on the basis of the joint angle vector Θa is given by forward-transforming Θa with the actual forward transformation function gt, in which all errors are reflected.

[Equation 1]

$$M_m \xrightarrow{\;d\;} M_m' \xrightarrow{\;f_a\;} \Theta_a \xrightarrow{\;g_t\;} M_a \qquad \text{(Equation 1)}$$

where
Mm: target position matrix
d: mathematical model that takes a limited set of errors into account
Mm': target position matrix corrected by mathematical model d
fa: inverse transformation function
Θa: joint angle vector serving as the control target
gt: actual forward transformation function, in which all errors are reflected
Ma: position matrix of the robot arm tip controlled on the basis of Θa

[0011] Here, the robot is controlled by obtaining the joint angle vector Θa from the target position matrix Mm' corrected by the mathematical model d, which considers only a limited set of error factors. The position matrix Ma of the actually controlled robot, however, contains error components that the mathematical model d could not take into account, and these components became the control error of the robot control device. In other words, the forward transformation function gt that determines the actual position matrix Ma contains error components that could not be considered, and the discrepancy between the correction by mathematical model d and the forward transformation by gt appeared as the position error between the position matrix Ma of the controlled robot arm and the target position matrix Mm.

[0012] In contrast, this embodiment adopts the configuration shown in FIG. 3. So that the position matrix Ma of the controlled robot arm in Equation 1 becomes equal to the target position matrix Mm, the target position matrix Mm' corrected by the mathematical model d is further corrected (52) with a correction matrix ΔMm (the output of the neural network), which represents the error amount (the required correction amount), and the robot 10 is controlled (54) on the basis of this corrected target position matrix Mm''. The idea behind this correction is explained in more detail below.

[0013] First, the error amount produced by the correction by mathematical model d followed by the forward transformation by gt is regarded as corresponding to the required correction amount. If the combined effect of the correction by mathematical model d and the forward transformation by gt is denoted by the correction matrix ΔMm, Equation 1 can be written as follows.

[Equation 2]

$$M_m \cdot \Delta M_m = M_a \qquad \text{(Equation 2)}$$

From Equation 2, the error amount (correction matrix) ΔMm is given by the following equation.

[Equation 3]

$$\Delta M_m = M_m^{-1} \cdot M_a \qquad \text{(Equation 3)}$$

[0014] Here, the inverse matrix ΔMm⁻¹ of the error amount (correction matrix) ΔMm is regarded as the required correction amount. In this embodiment, the target position matrix Mm' corrected by the offline teaching host system is multiplied by this correction amount ΔMm⁻¹, and the product (Mm'·ΔMm⁻¹) is taken as the target position matrix Mm'' corrected by the neural network. The robot is controlled on the basis of Mm''.
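The relations ΔMm = Mm⁻¹·Ma and Mm'' = Mm'·ΔMm⁻¹ can be illustrated with a minimal sketch. It assumes the position matrices are 4x4 homogeneous transforms (rotation plus translation); the text only calls them position matrices. The helper names (`pose`, `invert_rigid`, `correction_matrix`, `corrected_target`) and all numbers are hypothetical.

```python
import math

def matmul(a, b):
    """Multiply two 4x4 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def invert_rigid(m):
    """Invert a homogeneous transform [R t; 0 1] as [R^T, -R^T t; 0 1]."""
    rt = [[m[j][i] for j in range(3)] for i in range(3)]  # transpose of R
    t = [m[i][3] for i in range(3)]
    new_t = [-sum(rt[i][k] * t[k] for k in range(3)) for i in range(3)]
    return [rt[0] + [new_t[0]], rt[1] + [new_t[1]], rt[2] + [new_t[2]],
            [0.0, 0.0, 0.0, 1.0]]

def pose(yaw, x, y, z):
    """Hypothetical helper: rotation about z by `yaw` plus a translation."""
    c, s = math.cos(yaw), math.sin(yaw)
    return [[c, -s, 0.0, x], [s, c, 0.0, y], [0.0, 0.0, 1.0, z],
            [0.0, 0.0, 0.0, 1.0]]

def correction_matrix(target, measured):
    """Equation 3: dMm = Mm^-1 . Ma."""
    return matmul(invert_rigid(target), measured)

def corrected_target(target, delta):
    """Paragraph [0014]: Mm'' = Mm' . dMm^-1."""
    return matmul(target, invert_rigid(delta))

# Made-up numbers: the arm lands slightly off the commanded pose.
target = pose(0.50, 400.0, 100.0, 300.0)
measured = pose(0.51, 401.2, 99.5, 300.3)
delta = correction_matrix(target, measured)
new_target = corrected_target(target, delta)

# If the arm's error acts as the same right-multiplication by delta,
# commanding new_target brings the tip back onto the original target.
reached = matmul(new_target, delta)
```

Under that assumption `reached` equals `target` up to rounding, which is why the inverse of the measured error, rather than the error itself, is applied to the commanded pose.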

[0015] Next, the training of the neural network of the robot control device 50 according to this embodiment is described with reference to FIG. 4, a block diagram outlining the processing performed by the CPU 11. First, the target position matrix Mm' input unit 401 receives the control target position matrix Mm' calculated by the offline teaching host system 60. (As shown in FIG. 3, this is obtained by first generating the target position matrix Mm and then correcting it with the mathematical model d, which takes errors into account.) The coordinate inverse transformation fa computing unit 403 then uses the inverse transformation function fa to compute the joint angle vector Θa of the robot joints corresponding to the target position matrix Mm'.

[0016] Next, the joint angle vector Θa obtained by the coordinate inverse transformation fa computing unit 403 is held in the joint angle vector Θa holding unit 405, and the load, temperature and other quantities that act as error factors in robot control at that moment are held in the error factor item holding unit 407. The robot control unit 411 then operates the robot 10 on the basis of this joint angle vector Θa. After this motion, the position matrix Ma measuring unit 413 measures the actual tip position matrix Ma of the robot arm. The correction matrix ΔMm computing unit 415 obtains the inverse of the target position matrix Mm' and, from this inverse and the tip position matrix Ma, computes the correction matrix ΔMm using Equation 3 (ΔMm = Mm⁻¹·Ma) given above. The resulting correction matrix ΔMm is held in the correction matrix ΔMm holding unit 417.

[0017] Correction data is accumulated by the above processing for a plurality of positions. The neural network 409 is then trained with the accumulated joint angle vectors Θa and error factor items as input data and the correction matrices ΔMm as teacher data, so that it learns the relationship between the joint angle vector Θa and the correction matrix ΔMm. The training of the neural network 409 is described in more detail later.

[0018] Next, control by the robot control device 50 of this embodiment using the neural network 409 after the above training is complete is described with reference to the block diagram of FIG. 5, which outlines the computation performed by the CPU 11. First, the target position matrix Mm' input unit 501 acquires the control target position matrix Mm' of the robot 10 calculated by the offline teaching host system 60 (the value corrected by the mathematical model d). The first coordinate inverse transformation fa computing unit 503 then obtains the joint angle vector Θa of the joints of the robot 10 using the inverse transformation function fa.

[0019] Next, the joint angle vector Θa obtained by the first coordinate inverse transformation fa computing unit 503 is held in the joint angle vector Θa holding unit 505, and the load, temperature and other quantities that act as error factors in robot control at that moment are held in the error factor item holding unit 507. The joint angle vector Θa and the error factor items are input to the neural network 409 to obtain the correction matrix ΔMm for that joint angle vector Θa and those error factors, and this matrix is held in the correction matrix ΔMm holding unit 517. The target position correction unit 519 then obtains the inverse matrix ΔMm⁻¹ of the correction matrix ΔMm held in the correction matrix ΔMm holding unit 517 and multiplies the target position matrix Mm' held in the target position matrix input unit 501 by it, yielding the corrected target position matrix Mm'' (Mm'' = Mm'·ΔMm⁻¹). Finally, the second coordinate inverse transformation fa computing unit 521 uses the inverse transformation function fa on the corrected target position matrix Mm'' to obtain the joint angle vector Θa' for the first joint a through the sixth joint f of the robot 10. On the basis of this joint angle vector Θa', the position control unit 523 issues commands to the 1-axis servo control unit 40a through the 6-axis servo control unit 40f shown in FIG. 2. In response, these servo control units drive the servo motors M1 to M6, which move the first joint a through the sixth joint f and so control the position and orientation of the robot 10.
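The data flow of FIG. 5 (units 501, 503, 409, 519, 521) can be sketched with a deliberately simplified toy. Everything here is an assumption made for illustration: the "robot" is a hypothetical planar gantry with a rotary wrist whose joint vector is simply (x, y, phi), poses are 3x3 planar homogeneous matrices rather than the six-joint arm of the embodiment, and `network` is a stand-in that returns a fixed ΔMm instead of a trained network.

```python
import math

def se2(x, y, phi):
    """Planar pose as a 3x3 homogeneous matrix."""
    c, s = math.cos(phi), math.sin(phi)
    return [[c, -s, x], [s, c, y], [0.0, 0.0, 1.0]]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def invert_se2(m):
    """Inverse of a planar rigid transform: [R^T, -R^T t; 0 1]."""
    c, s, x, y = m[0][0], m[1][0], m[0][2], m[1][2]
    return [[c, s, -(c * x + s * y)], [-s, c, s * x - c * y], [0.0, 0.0, 1.0]]

def inverse_kinematics(m):
    """Toy f_a: for the hypothetical gantry the joints are (x, y, phi)."""
    return (m[0][2], m[1][2], math.atan2(m[1][0], m[0][0]))

def network(joints, error_factors):
    """Stand-in for trained network 409: returns a fixed, made-up dMm."""
    return se2(0.8, -0.3, 0.002)

def control_step(target, error_factors):
    joints = inverse_kinematics(target)             # unit 503: Mm' -> Theta_a
    delta = network(joints, error_factors)          # network 409 -> unit 517
    corrected = matmul(target, invert_se2(delta))   # unit 519: Mm'' = Mm' . dMm^-1
    return inverse_kinematics(corrected)            # unit 521: Mm'' -> Theta_a'

target = se2(400.0, 100.0, 0.5)
error_factors = (5.0, 25.0)  # hypothetical load and temperature readings
command = control_step(target, error_factors)
```

The returned `command` plays the role of Θa', the joint vector handed to the servo control units. The inverse transformation runs twice: once to build the network input and once on the corrected pose.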

[0020] In this embodiment, when the neural network 409 is trained, the joint angle vector Θa is obtained from the target position matrix Mm' and supplied to the neural network 409, which yields the correction amount (correction matrix ΔMm) for the target position matrix Mm' from which Θa was calculated. Compared with a configuration that obtains the correction matrix directly from the target position matrix, this has the advantage that learning is simpler. Supplying the joint angle vector Θa to the neural network to obtain the correction matrix ΔMm also makes it possible to reduce the error factors. The reason is that several joint angle vectors Θa exist for a given target position matrix Mm'. If the target position matrix were supplied directly to the neural network, this multiplicity of joint angle vectors would itself be a source of error, whereas in this embodiment the joint angle vector Θa is supplied to the neural network, and a given joint angle vector Θa determines exactly one tip position of the robot.
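The point that one tip position corresponds to several joint angle vectors, while each joint angle vector fixes a single tip position, can be seen in a minimal sketch with a two-link planar arm. The arm, its link lengths, and the function names are assumptions chosen for illustration; the embodiment uses a six-joint robot. The target is assumed to lie strictly inside the reachable workspace.

```python
import math

def forward(theta1, theta2, l1=0.4, l2=0.3):
    """Tip position of a two-link planar arm for given joint angles."""
    x = l1 * math.cos(theta1) + l2 * math.cos(theta1 + theta2)
    y = l1 * math.sin(theta1) + l2 * math.sin(theta1 + theta2)
    return x, y

def inverse(x, y, l1=0.4, l2=0.3):
    """Both joint solutions that reach (x, y); assumes the point is reachable."""
    c2 = (x * x + y * y - l1 * l1 - l2 * l2) / (2.0 * l1 * l2)
    solutions = []
    for theta2 in (math.acos(c2), -math.acos(c2)):
        theta1 = math.atan2(y, x) - math.atan2(l2 * math.sin(theta2),
                                               l1 + l2 * math.cos(theta2))
        solutions.append((theta1, theta2))
    return solutions

solutions = inverse(0.5, 0.2)
tips = [forward(t1, t2) for t1, t2 in solutions]
```

Both entries of `solutions` map to the same tip, so a network fed only the tip pose could not tell which arm configuration, and hence which deflection and error pattern, applies. Feeding the joint angles removes that ambiguity.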

[0021] Next, the training of the neural network 409 of the robot control device according to this embodiment, and the computation it performs, are described in more detail.

1. Configuration of the neural network

The neural network of this embodiment is implemented by the computer system consisting of the CPU 11, ROM 20 and RAM 30 shown in FIG. 2. The ROM 20 contains a control program area 21 storing a control program that manages the accumulation of input data and teacher data, a neural network area 22 storing the computation program of the neural network, and a learning program area 23 storing the program for training the neural network. The RAM 30 contains an input data storage area 31 that stores, as input data, the joint angle vectors Θa accumulated in the joint angle vector Θa holding unit 405 and the error items such as temperature and load accumulated in the error factor item holding unit 407, both described above with reference to FIG. 4; a teacher data storage area 32 that likewise stores, as teacher data, the correction matrices ΔMm accumulated in the correction matrix ΔMm holding unit 417; and a coupling coefficient storage area 33 that stores the coupling coefficients of the neural network.

[0022] 2. Neural network

As shown in FIG. 7, the neural network 409 of this embodiment has a three-layer structure consisting of an input layer LI, an output layer LO and an intermediate layer LM. The input layer LI has e input elements, the output layer LO has g output elements, and the intermediate layer LM has h output elements.

[0023] A multilayer neural network is generally defined as a device that performs the following computation. The output $O^i_j$ of the j-th element of the i-th layer is computed by the following equations, where i ≥ 2.

[Equation 4]

$$O^i_j = f(I^i_j) \qquad \text{(Equation 4)}$$

[Equation 5]

$$I^i_j = \sum_k W^{i-1,i}_{k,j} \cdot O^{i-1}_k + V^i_j \qquad \text{(Equation 5)}$$

[Equation 6]

$$f(x) = \frac{1}{1 + \exp(-x)} \qquad \text{(Equation 6)}$$

[0024] Here, $V^i_j$ is the bias of the j-th computing element of the i-th layer, $W^{i-1,i}_{k,j}$ is the coupling coefficient between the k-th element of the (i-1)-th layer and the j-th element of the i-th layer, and $O^1_j$ is the output value of the j-th element of the first layer. Because the first layer outputs its input unchanged without performing any computation, $O^1_j$ is also the input value of the j-th element of the input layer (first layer).

[0025] Next, the specific computation procedure of the three-layer neural network 409 shown in FIG. 7 is described with reference to FIG. 8. The computation for each layer is carried out by executing the program stored in the neural network area 22 of the ROM 20 while referring to the coupling coefficients stored in the coupling coefficient storage area 33 of the RAM 30. In step 100, the j-th element of the intermediate layer (second layer) receives the output values $O^1_k$ from the elements of the input layer (first layer), that is, the input data of the first layer, and performs the product-sum operation of the following equation, which is Equation 5 written out for the specific layer number and the number of elements in the first layer.

[Equation 7]

$$I^2_j = \sum_{k=1}^{e} W^{1,2}_{k,j} \cdot O^1_k + V^2_j \qquad \text{(Equation 7)}$$

[0026] Next, in step 102, the output of each element of the intermediate layer (second layer) is computed by applying the sigmoid function to the product-sum value of the inputs given by Equation 7. The output value of the j-th element of the second layer is computed by the following equation.

[Equation 8]

$$O^2_j = f(I^2_j) = \frac{1}{1 + \exp(-I^2_j)} \qquad \text{(Equation 8)}$$

This output value $O^2_j$ becomes the input value of each element of the output layer (third layer).

[0027] Next, in step 104, the product-sum operation on the input values of each element of the output layer (third layer) is performed.

[Equation 9]

$$I^3_j = \sum_{k=1}^{h} W^{2,3}_{k,j} \cdot O^2_k + V^3_j \qquad \text{(Equation 9)}$$

Next, in step 106, the output value $O^3_j$ of each element of the output layer is computed with the sigmoid function, in the same way as in Equation 8.

[Equation 10]

$$O^3_j = f(I^3_j) = \frac{1}{1 + \exp(-I^3_j)} \qquad \text{(Equation 10)}$$
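Steps 100 to 106 amount to two passes of "weighted sum plus bias, then sigmoid". The following is a minimal sketch, assuming the product-sum adds the bias V to the weighted sum of the previous layer's outputs; the function names and the layout `weights[k][j]` (element k of the previous layer to element j of this layer) are choices made for this example, not taken from the source.

```python
import math

def sigmoid(x):
    """Equation 6: f(x) = 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))

def layer(inputs, weights, biases):
    """Product-sum over the previous layer's outputs, then the sigmoid.

    weights[k][j] couples element k of the previous layer to element j
    of this layer; biases[j] is the bias of element j.
    """
    return [sigmoid(sum(inputs[k] * weights[k][j] for k in range(len(inputs)))
                    + biases[j])
            for j in range(len(biases))]

def forward(inputs, w12, v2, w23, v3):
    hidden = layer(inputs, w12, v2)   # steps 100 and 102 (intermediate layer)
    return layer(hidden, w23, v3)     # steps 104 and 106 (output layer)

# Tiny network with e = 2 inputs, h = 2 intermediate elements, g = 1 output.
# With all coefficients and biases at zero every sigmoid sees 0.
zero_output = forward([1.0, 2.0],
                      [[0.0, 0.0], [0.0, 0.0]], [0.0, 0.0],
                      [[0.0], [0.0]], [0.0])
```

Because the output layer also passes through the sigmoid, every output lies strictly between 0 and 1. Any teacher value outside that range would have to be scaled before training, a detail the text does not spell out.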

[0028] 3. Structure of the input data and teacher data

The data used for the update learning of the neural network is organized into a database as shown in FIG. 10. The input data are $D_1, \ldots, D_n$ and the corresponding teacher data are $E_1, \ldots, E_n$. These n sets of input data are the joint angle vectors Θa held in the joint angle vector Θa holding unit 405 and the error factors such as temperature and load held in the error factor item holding unit 407, both described above with reference to FIG. 4. The n sets of teacher data are the correction matrices ΔMm held in the correction matrix ΔMm holding unit 417. These data are stored in the input data storage area 31 and the teacher data storage area 32 of the RAM 30, respectively.

[0029] The input data is defined as follows. The e data items supplied to the e input elements are treated as one set of data. An arbitrary m-th set of input data is denoted $D_m$, and the input data for the j-th input element within that set is denoted $d_{mj}$. $D_m$ is a vector and $d_{mj}$ is a component of that vector. That is, $D_m$ is defined by the following equation.

[Equation 11]

$$D_m = (d_{m1}, d_{m2}, \ldots, d_{m\,e-1}, d_{me}) \qquad \text{(Equation 11)}$$

The n sets of input data are denoted $D_1, D_2, \ldots, D_{n-1}, D_n$. Hereinafter, the full collection of n input data sets is referred to as the input data group D. When Equation 7 is applied to the input data $D_m$, the component $d_{mk}$ is substituted for $O^1_k$ in Equation 7.

[0030] Similarly, $E_1, \ldots, E_n$ are defined as follows. For the output layer LO, the teacher data for the outputs of the g output elements is treated as one set of data. An arbitrary m-th set of teacher data is denoted $E_m$, and the teacher data for the j-th output element within that set is denoted $e_{mj}$. $E_m$ is a vector and $e_{mj}$ is a component of that vector. That is, $E_m$ is defined by the following equation.

[Equation 12]
$$E_m = (e_{m1}, e_{m2}, \ldots, e_{m,g-1}, e_{mg}) \tag{12}$$
The n sets of teacher data are denoted $E_1, E_2, \ldots, E_{n-1}, E_n$. Hereinafter, all n sets of teacher data are collectively referred to as the teacher data group E.

【００３１】4. Learning of the neural network
As initial learning, this neural network is trained by executing the program stored in the learning program area 23 of the ROM 20, whose procedure is shown in FIG. 9. The coupling coefficients are learned by the well-known backpropagation method.

【００３２】This learning is executed repeatedly over a large number of input data covering various events, so that each output approaches its corresponding optimum teacher data. As described above, the input data and the teacher data are stored in the input data storage area 31 and the teacher data storage area 32, respectively.

【００３３】In step 200 of FIG. 9, the data number i is set to its initial value 1, and the output element number j (the component number j of the teacher data) is set to its initial value 1. The process then moves to step 202, where the i-th input data $D_i$ and the i-th teacher data $E_i$ are read from the input data storage area 31 and the teacher data storage area 32.

【００３４】Next, in step 206, the learning signal of the output-layer element corresponding to the j-th component $e_{ij}$ of the i-th teacher data $E_i$ that was read out is calculated by the following equation.

[Equation 13]
$$Y^{3}_{j} = (e_{ij} - O^{3}_{j}) \cdot f'(I^{3}_{j}) \tag{13}$$
Here the data number i is omitted from $Y^{3}_{j}$, $O^{3}_{j}$, and $I^{3}_{j}$, and $f'(x)$ is the derivative of the sigmoid function. $I^{3}_{j}$ is obtained as follows: each component of the input data $D_i$ is substituted for $O^{1}_{k}$ in Equation 7 to obtain $I^{2}_{k}$ for all elements of the intermediate layer; $I^{2}_{k}$ is substituted into Equation 8 to obtain the outputs $O^{2}_{k}$ of all elements of the intermediate layer; and $O^{2}_{k}$ for all k is substituted into Equation 9. $O^{3}_{j}$ is obtained by substituting $I^{3}_{j}$ into Equation 10.
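The output-layer learning signal of Equation 13 can be illustrated with a short sketch. It assumes that f is the logistic sigmoid and that $O^{3}_{j} = f(I^{3}_{j})$ (Equation 10 is not reproduced in this part of the text); the function names are hypothetical and chosen only for illustration.

```python
import math
from typing import List


def sigmoid(x: float) -> float:
    """Logistic sigmoid, assumed here to be the transfer function f."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_prime(x: float) -> float:
    """Derivative f'(x) of the logistic sigmoid: f(x) * (1 - f(x))."""
    s = sigmoid(x)
    return s * (1.0 - s)


def output_learning_signals(teacher: List[float],
                            net_inputs: List[float]) -> List[float]:
    """Compute Y3_j = (e_ij - O3_j) * f'(I3_j) for every output element j.

    teacher    -- components e_ij of the i-th teacher data E_i
    net_inputs -- net inputs I3_j of the output elements (assumed given)
    """
    if len(teacher) != len(net_inputs):
        raise ValueError("teacher and net_inputs must have the same length")
    return [(e - sigmoid(i3)) * sigmoid_prime(i3)
            for e, i3 in zip(teacher, net_inputs)]
```

For a single output element with net input 0 and teacher value 1, the output is 0.5 and the derivative is 0.25, so the learning signal is 0.5 × 0.25 = 0.125. The signal vanishes when the output already equals the teacher value.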

【００３５】Next, in step 210, it is determined whether learning signals have been calculated for all output elements. If the result is NO, the element number j is incremented by 1 in step 212 and the process returns to step 206, where the learning signal for the next output element is calculated. When it is determined in step 210 that the learning signals for all output elements have been calculated, the learning signal Y for an arbitrary r-th element of the intermediate layer is calculated in step 214 by the following equation.

[Equation 14]
This learning calculation is performed for all elements of the intermediate layer.
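Equation 14 itself is not reproduced in this text. For orientation only, the standard backpropagation rule for a hidden element, written in the notation of Equations 13 and 16, has the form below. This is a reconstruction from the well-known method, not the patent's own equation, and the summation over the g output elements is an assumption.

```latex
Y^{2}_{r} = f'(I^{2}_{r}) \cdot \sum_{j=1}^{g} W^{2}_{r},{}^{3}_{j} \, Y^{3}_{j}
```

In this form, each output-layer learning signal $Y^{3}_{j}$ is weighted by the coupling coefficient linking intermediate element r to output element j and then scaled by the derivative of the transfer function at that element's net input.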

【００３６】Next, in step 216, each coupling coefficient of the output layer is corrected. The correction amount is given by the following equation.

[Equation 15]
$$\Delta\omega^{2}_{i},{}^{3}_{j}(t) = P \cdot Y^{3}_{j} \cdot f(I^{2}_{i}) + Q \cdot \Delta\omega^{2}_{i},{}^{3}_{j}(t-1) \tag{15}$$
Here $\Delta\omega^{2}_{i},{}^{3}_{j}(t)$ is the amount of change, in the t-th calculation, of the coupling coefficient between the j-th element of the output layer and the i-th element of the intermediate layer, and $\Delta\omega^{2}_{i},{}^{3}_{j}(t-1)$ is the previous correction amount of that coupling coefficient. P and Q are proportionality constants. The coupling coefficient is therefore updated as follows.

[Equation 16]
$$W^{2}_{i},{}^{3}_{j} + \Delta\omega^{2}_{i},{}^{3}_{j}(t) \rightarrow W^{2}_{i},{}^{3}_{j} \tag{16}$$
The corrected coupling coefficient is obtained by Equation 16.
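Equations 15 and 16 together amount to a gradient step with a momentum term. The sketch below applies them to a single coupling coefficient. The function name and the default values of P and Q are assumptions for illustration; the text does not state the constants actually used.

```python
from typing import Tuple


def update_coupling_coefficient(w: float,
                                y_next: float,
                                o_prev: float,
                                prev_delta: float,
                                p: float = 0.5,
                                q: float = 0.9) -> Tuple[float, float]:
    """Apply Equations 15 and 16 to one coupling coefficient.

    w          -- current coupling coefficient W
    y_next     -- learning signal Y of the downstream element
    o_prev     -- output f(I) of the upstream element
    prev_delta -- previous correction amount, delta_omega(t-1)
    p, q       -- proportionality constants P and Q (values assumed)

    Returns the corrected coefficient and the new correction amount.
    """
    delta = p * y_next * o_prev + q * prev_delta   # Equation 15
    return w + delta, delta                        # Equation 16


# Two successive updates with the same learning signal:
w, d = update_coupling_coefficient(1.0, 0.125, 0.5, 0.0, p=1.0, q=0.5)
w, d = update_coupling_coefficient(w, 0.125, 0.5, d, p=1.0, q=0.5)
```

The second correction is larger than the first because the Q term carries over part of the previous correction, which is the usual purpose of a momentum term.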

【００３７】Next, the process moves to step 218, where each coupling coefficient of each element of the intermediate layer is corrected. As with the output layer, the correction amount of the coupling coefficient is given by the following equation.

[Equation 17]
$$\Delta\omega^{1}_{i},{}^{2}_{j}(t) = P \cdot Y^{2}_{j} \cdot f(I^{1}_{i}) + Q \cdot \Delta\omega^{1}_{i},{}^{2}_{j}(t-1) \tag{17}$$
The coupling coefficient is therefore updated as follows.

[Equation 18]
$$W^{1}_{i},{}^{2}_{j} + \Delta\omega^{1}_{i},{}^{2}_{j}(t) \rightarrow W^{1}_{i},{}^{2}_{j} \tag{18}$$
The corrected coupling coefficient is obtained by Equation 18.

【００３８】Next, in step 220, it is determined whether one round of learning has been completed for the n sets of input data and teacher data to be learned. If learning has not been completed for all input data, the process moves to step 222, where the data number i is incremented by 1 so that the next input data and its corresponding teacher data are read from the input data storage area 31 and the teacher data storage area 32, and the component number j is reset to its initial value 1. The process then returns to step 202, and the learning described above is executed with the next input data and teacher data.

【００３９】When it is determined in step 220 that learning has been completed for all n sets of input data and teacher data, the process moves to step 224. There, whether the coupling coefficients have converged is determined by checking whether the squared difference between the output data and the teacher data has fallen to or below a predetermined value. If the coupling coefficients have not converged, the process returns to step 200, and the learning described above is executed again, starting from the first input data and teacher data, as the second round of learning.

【００４０】In this way, the learning calculation is repeated until, in step 224, the squared difference between the output data and the teacher data falls to or below the predetermined value and the learning converges. The result is a neural network that has undergone initial learning over a wide range of initial events. As a result of this learning, the neural network 409 of this embodiment can, as shown in FIG. 5, calculate the required correction matrix ΔMm when the joint angle vector Θa is input.
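The overall loop of steps 200 to 224 can be sketched as follows. This is a minimal illustration under stated assumptions, not the program stored in the ROM 20. It assumes a sigmoid transfer function, no bias terms, the standard backpropagation rule for the intermediate layer (Equation 14 is not reproduced in the text), and the input components taken directly as the input-layer outputs. All names, the constants p and q, the tolerance, and the weight initialization are hypothetical.

```python
import math
import random
from typing import List, Tuple

Matrix = List[List[float]]


def f(x: float) -> float:
    """Sigmoid transfer function (assumed)."""
    return 1.0 / (1.0 + math.exp(-x))


def init_weights(n_in: int, hidden: int, n_out: int,
                 seed: int = 0) -> Tuple[Matrix, Matrix]:
    """Small random coupling coefficients; the range is an assumption."""
    rng = random.Random(seed)
    w1 = [[rng.uniform(-0.5, 0.5) for _ in range(hidden)] for _ in range(n_in)]
    w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_out)] for _ in range(hidden)]
    return w1, w2


def forward(w1: Matrix, w2: Matrix, d: List[float]):
    """Return intermediate outputs O2 and network outputs O3 for input d."""
    o2 = [f(sum(w1[i][k] * d[i] for i in range(len(d))))
          for k in range(len(w2))]
    o3 = [f(sum(w2[k][j] * o2[k] for k in range(len(o2))))
          for j in range(len(w2[0]))]
    return o2, o3


def squared_error(w1: Matrix, w2: Matrix, D, E) -> float:
    """Sum of squared differences between outputs and teacher data."""
    total = 0.0
    for d, e in zip(D, E):
        _, o3 = forward(w1, w2, d)
        total += sum((ej - oj) ** 2 for ej, oj in zip(e, o3))
    return total


def train(w1: Matrix, w2: Matrix, D, E, p: float = 0.5, q: float = 0.5,
          tol: float = 1e-4, max_epochs: int = 5000) -> int:
    """Train in place; return the number of passes over the data."""
    n_in, hidden, n_out = len(w1), len(w2), len(w2[0])
    dw1 = [[0.0] * hidden for _ in range(n_in)]
    dw2 = [[0.0] * n_out for _ in range(hidden)]
    epoch = 0
    for epoch in range(1, max_epochs + 1):
        for d, e in zip(D, E):                          # steps 202 to 222
            o2, o3 = forward(w1, w2, d)
            # Output-layer signals (Equation 13); f'(I) = O * (1 - O).
            y3 = [(e[j] - o3[j]) * o3[j] * (1.0 - o3[j]) for j in range(n_out)]
            # Intermediate-layer signals (standard rule, assumed).
            y2 = [o2[k] * (1.0 - o2[k]) *
                  sum(w2[k][j] * y3[j] for j in range(n_out))
                  for k in range(hidden)]
            for k in range(hidden):                     # Equations 15, 16
                for j in range(n_out):
                    dw2[k][j] = p * y3[j] * o2[k] + q * dw2[k][j]
                    w2[k][j] += dw2[k][j]
            for i in range(n_in):                       # Equations 17, 18
                for k in range(hidden):
                    dw1[i][k] = p * y2[k] * d[i] + q * dw1[i][k]
                    w1[i][k] += dw1[i][k]
        if squared_error(w1, w2, D, E) <= tol:          # step 224
            break
    return epoch
```

In this sketch the convergence test is evaluated once per pass over all n data sets, mirroring step 224, and the `max_epochs` limit is an added safeguard that the text does not mention.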

【００４１】Next, learning by the neural network of a robot controller according to another embodiment of the present invention is described with reference to FIG. 6, a block diagram outlining the arithmetic processing of the CPU. In the embodiment described above, the neural network 409 learned with the joint angle vector Θa as input data and the correction matrix ΔMm as teacher data. In this embodiment, the neural network 409 learns with the target position matrix Mm' as input data and the correction matrix ΔMm as teacher data. In this embodiment as well, the configuration of the robot controller is substantially the same as in the embodiment described above with reference to FIGS. 2 and 3, so its description is omitted.

【００４２】First, the target position matrix Mm' input unit 601 receives the position matrix Mm' of the robot control target, which the offline teaching host system 60 calculated using the mathematical model d that takes errors into account. The coordinate inverse transformation fa calculation unit 603 then uses the inverse transformation function fa to calculate the joint angle vector Θa of the robot's joints corresponding to the target position matrix Mm. Next, the target position matrix Mm' from the input unit 601 is held in the target position matrix Mm' holding unit 605, and the load, temperature, and other error factors affecting robot control at that time are held in the error factor item holding unit 607. The robot control unit 611 then operates the robot 10 based on the joint angle vector Θa. After this operation, the position matrix Ma measuring unit 613 measures the actual tip position matrix Ma of the robot arm. The correction matrix ΔMm calculation unit 615 obtains the inverse of the target position matrix Mm and, from this inverse and the tip position matrix Ma, obtains the correction matrix ΔMm using Equation 3 described above ($\Delta Mm = Mm^{-1} \cdot Ma$). The obtained correction matrix ΔMm is then held in the correction matrix ΔMm holding unit 617.
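The calculation of Equation 3 can be sketched as follows. The sketch assumes that the position matrices are 4×4 homogeneous transforms with an orthonormal rotation part, which this part of the text does not state explicitly; under that assumption the inverse has a closed form and no general matrix inversion is needed. The function names are hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Product of two 4x4 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def invert_homogeneous(m: Matrix) -> Matrix:
    """Inverse of a rigid transform [R p; 0 1], namely [R^T  -R^T p; 0 1].

    Valid only when the rotation part R is orthonormal (assumed here).
    """
    rt = [[m[j][i] for j in range(3)] for i in range(3)]       # R transposed
    p = [m[i][3] for i in range(3)]                            # translation
    t = [-sum(rt[i][k] * p[k] for k in range(3)) for i in range(3)]
    return [rt[0] + [t[0]],
            rt[1] + [t[1]],
            rt[2] + [t[2]],
            [0.0, 0.0, 0.0, 1.0]]


def correction_matrix(mm: Matrix, ma: Matrix) -> Matrix:
    """Equation 3: delta_Mm = Mm^-1 * Ma."""
    return matmul(invert_homogeneous(mm), ma)


# Example: target at (1, 2, 3), measured tip at (1.5, 2, 2.5), no rotation.
mm = [[1.0, 0.0, 0.0, 1.0],
      [0.0, 1.0, 0.0, 2.0],
      [0.0, 0.0, 1.0, 3.0],
      [0.0, 0.0, 0.0, 1.0]]
ma = [[1.0, 0.0, 0.0, 1.5],
      [0.0, 1.0, 0.0, 2.0],
      [0.0, 0.0, 1.0, 2.5],
      [0.0, 0.0, 0.0, 1.0]]
delta = correction_matrix(mm, ma)
```

In this example the correction matrix is a pure translation of (0.5, 0, -0.5), and multiplying Mm by it reproduces the measured matrix Ma.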

【００４３】Correction data are accumulated by the above processing for a plurality of positions. The neural network 409 is then trained with the accumulated target position matrices Mm' and error factor items as input data and the correction matrices ΔMm as teacher data, so that it learns the relationship between the target position matrix Mm' and the correction matrix ΔMm. Control of the robot with correction by the trained neural network 409 is substantially the same as in the embodiment described above with reference to FIG. 5, so its description is omitted. As described for the embodiment shown in FIG. 6 and for the embodiment described above with reference to FIG. 4, various kinds of data can be used as the learning data for the neural network 409 in the present invention.

【００４４】In this embodiment, a neural network with a three-layer structure consisting of the input layer LI, the intermediate layer LM, and the output layer LO was taken as an example. The neural network of the present invention is not limited to this configuration, however, and the intended object of the present invention can be achieved with a neural network of any configuration capable of the necessary learning. In addition, mathematical expressions were given for convenience in explaining the concept of the correction in this embodiment, but the present invention is not limited to these expressions. Furthermore, in the description given above with reference to FIG. 3, the robot control data calculated by the offline teaching host system 60 were transferred to the robot controller 50 on a floppy disk, but the data can also be transferred over a data line or the like.

【００４５】[0045]

【発明の効果】[Effects of the Invention] The present invention is configured as described above. Because error factors that could not be taken into account by the conventional mathematical model are learned and corrected by the neural network, the correction accuracy can be improved. In addition, because the correction accuracy of the robot is high, teaching correction at the site of actual use becomes unnecessary when the robot controller is set up, so the setup time of the robot controller can be shortened and its setup cost reduced. A further advantage is improved safety.

[Brief description of drawings]

【図１】FIG. 1 is a configuration diagram showing the mechanical configuration of a robot according to an embodiment of the present invention.

【図２】FIG. 2 is a block diagram showing the configuration of the robot controller according to this embodiment, which controls the robot shown in FIG. 1.

【図３】FIG. 3 is a block diagram showing the configurations of the offline teaching host system and the robot controller according to this embodiment.

【図４】FIG. 4 is a block diagram illustrating the processing during learning of the robot controller according to this embodiment.

【図５】FIG. 5 is a block diagram illustrating the control processing after learning of the robot controller according to this embodiment.

【図６】FIG. 6 is a block diagram illustrating the control processing during learning of a robot controller according to another embodiment of the present invention.

【図７】FIG. 7 is a configuration diagram showing the configuration of the neural network of the robot controller according to this embodiment.

【図８】FIG. 8 is a flowchart showing the calculation procedure of the neural network according to the embodiment shown in FIG. 7.

【図９】FIG. 9 is a flowchart showing the learning procedure of the neural network according to the embodiment shown in FIG. 7.

【図１０】FIG. 10 is a configuration diagram showing the data structure of a database holding the input data and teacher data used for learning of the neural network.

【図１１】FIG. 11 is a block diagram showing the configurations of a conventional offline teaching host system and robot controller.

[Explanation of symbols]

10 Robot
11 CPU
20 ROM
30 RAM
50 Robot controller
51 Holding unit for the corrected target position matrix Mm'
52 Correction unit using the neural network
53 Holding unit for the target position Mm'' corrected by the neural network
54 Robot control unit
60 Offline teaching system
61 Target position generation unit
63 Correction unit using the mathematical model
409 Neural network
LI Input layer
LM Intermediate layer
LO Output layer

Claims

[Claims]

1. A robot controller for an offline teaching system, comprising: target position holding means for receiving a target position for robot control calculated by the offline teaching system; a neural network that receives the target position as an input and outputs a correction amount; correction means for correcting the calculated target position based on the output of the neural network; and control means for controlling the robot based on the target position corrected by the neural network.