JP7118249B2

JP7118249B2 - motor controller

Info

Publication number: JP7118249B2
Application number: JP2021514690A
Authority: JP
Inventors: 武之河田; 暁生斎藤; 英俊池田
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2019-04-16
Filing date: 2019-04-16
Publication date: 2022-08-15
Anticipated expiration: 2039-04-16
Also published as: DE112019007222T5; WO2020213062A1; CN113748597A; JPWO2020213062A1

Description

本発明は、制御対象を模したモデルを生成するモータ制御装置に関する。 The present invention relates to a motor control device that generates a model imitating a controlled object.

ロボット、工作機械などをはじめとした制御対象の機械を制御するモータ制御装置に対して、高速かつ高精度な制御手法および故障検知手法が求められている。これらの性能向上には、制御対象を正確に模したモデルが必要不可欠である。しかしながら、実際のモータ制御装置の動作においては、制御対象における配線、エンドエフェクタなどの取り付けによる質量、重心位置、慣性テンソルといった物理モデルパラメータの誤差が無視できず、また、振動、摩擦などモータ速度に対して非線形な成分も多く含まれる。そのため、予めこれらを高精度に再現可能なモデルを与えることは困難であった。 High-speed and high-precision control methods and failure detection methods are required for motor control devices that control machines to be controlled, such as robots and machine tools. For these performance improvements, a model that accurately simulates the controlled object is essential. However, in the actual operation of a motor control device, errors in physical model parameters such as mass, center of gravity position, and inertia tensor due to wiring in the controlled object, attachment of end effectors, etc. cannot be ignored. On the other hand, many nonlinear components are also included. Therefore, it has been difficult to provide a model capable of reproducing these with high accuracy in advance.

近年、このようなモデルにおける力およびトルクの推定誤差を抑制するための制御方式が提案されている。例えば、特許文献１には、電動機の制御装置が、Ｍ系列または正弦波状の指令を取得し、トルク推定誤差が最小となるように、慣性テンソル、摩擦係数、およびばね定数をサンプリング周期毎に修正することにより、物理モデルパラメータの初期値に含まれる誤差を動作中に修正する技術が開示されている。 In recent years, control methods have been proposed for suppressing force and torque estimation errors in such models. For example, in Patent Document 1, an electric motor control device acquires an M-sequence or sinusoidal command, and corrects the inertia tensor, friction coefficient, and spring constant for each sampling period so that the torque estimation error is minimized. A technique is disclosed for correcting the error contained in the initial values of the physical model parameters during operation by doing so.

特開２０１３－１２８３８７号公報JP 2013-128387 A

しかしながら、特許文献１に記載の技術では、電動機の制御装置の運動が、慣性テンソル、摩擦、およびばねからのトルクでモデル化できると仮定されており、モデルとして考慮されていない外乱が表現できない、という問題があった。また、摩擦のモデルについても簡略化されているため、ストライベック効果、ヒステリシスなどといった非線形性の強い複雑な現象に対する表現能力を持たない、という問題があった。 However, in the technique described in Patent Document 1, it is assumed that the motion of the motor control device can be modeled by inertia tensor, friction, and torque from the spring, and disturbances that are not considered as a model cannot be expressed. There was a problem. In addition, since the friction model is also simplified, there is a problem that it does not have the ability to express highly nonlinear and complex phenomena such as the Stribeck effect and hysteresis.

本発明は、上記に鑑みてなされたものであって、トルク推定精度および汎化性能を向上したモデルを学習し、異常を検知する速度および精度を向上可能なモータ制御装置を得ることを目的とする。 SUMMARY OF THE INVENTION It is an object of the present invention to provide a motor control device capable of learning a model with improved torque estimation accuracy and generalization performance and improving the speed and accuracy of abnormality detection. do.

上述した課題を解決し、目的を達成するために、本発明は、対象機械を駆動するモータを制御するモータ制御装置である。モータ制御装置は、モータの状態を観測し観測結果をモータ状態信号として出力するモータ状態観測部と、モータ状態信号および対象機械の物理モデルパラメータに基づき、運動方程式を用いてモータのトルクの推定値である第一の推定トルクと第一の推定トルクの成分である物理状態変数とを出力する第一のトルク推定部と、モータ状態信号に含まれる実トルクと第一の推定トルクとの差分を低減するよう物理モデルパラメータを学習し記憶する物理モデル記憶部と、第一の推定トルク、物理状態変数、およびモータ状態信号のうち少なくとも一つから特徴量を生成し出力する特徴量生成部と、を備える。また、モータ制御装置は、入出力関係に自己回帰項を含む機械学習を用いて特徴量および機械学習モデルパラメータから機械学習補正量を算出し、算出した機械学習補正量を第一の推定トルクに加算して第二の推定トルクを出力する第二のトルク推定部と、機械学習補正量が実トルクと第一の推定トルクとの差分に近づくように機械学習モデルパラメータを教師あり学習により学習し、学習した機械学習モデルパラメータを記憶する機械学習モデル記憶部と、を備えることを特徴とする。 In order to solve the above-described problems and achieve the object, the present invention is a motor control device for controlling a motor that drives a target machine. The motor control device includes a motor state observation unit that observes the state of the motor and outputs the observation result as a motor state signal, and an estimated value of the torque of the motor using the equation of motion based on the motor state signal and the physical model parameters of the target machine. A first torque estimator that outputs a first estimated torque and a physical state variable that is a component of the first estimated torque, and a difference between the actual torque and the first estimated torque included in the motor state signal a physical model storage unit that learns and stores physical model parameters to be reduced; a feature amount generation unit that generates and outputs a feature amount from at least one of the first estimated torque, the physical state variable, and the motor state signal; Prepare. Further, the motor control device calculates a machine learning correction amount from the feature amount and the machine learning model parameters using machine learning including an autoregressive term in the input/output relationship, and applies the calculated machine learning correction amount to the first estimated torque. A second torque estimator that outputs a second estimated torque by addition, and a machine learning model parameter learned by supervised learning so that the machine learning correction amount approaches the difference between the actual torque and the first estimated torque. and a machine learning model storage unit that stores learned machine learning model parameters.

本発明によれば、モータ制御装置は、トルク推定精度および汎化性能を向上したモデルを学習し、異常を検知する速度および精度を向上できる、という効果を奏する。 Advantageous Effects of Invention According to the present invention, a motor control device can learn a model with improved torque estimation accuracy and generalization performance, and can improve the speed and accuracy of abnormality detection.

モータ制御装置の構成例を示すブロック図Block diagram showing a configuration example of a motor control device モータ制御装置が制御対象とする対象機械を示す模式図Schematic diagram showing a target machine to be controlled by the motor control device 物理モデル記憶部で推定される物理モデルパラメータである非線形摩擦モデルの例を示す図A diagram showing an example of a nonlinear friction model, which is a physical model parameter estimated in the physical model storage unit. 特徴量生成部および機械学習モデル計算部の構成例を示すブロック図Block diagram showing a configuration example of the feature generator and the machine learning model calculator 特徴量生成部および機械学習モデル計算部の構成において自己回帰項を示す図Diagram showing autoregression terms in the configuration of the feature generator and machine learning model calculator モータ制御装置における物理モデルパラメータおよび機械学習モデルパラメータの学習の流れを示すフローチャートFlowchart showing the flow of learning physical model parameters and machine learning model parameters in a motor control device モータ制御装置の制御対象の対象機械の動作を表すモータ速度、モータ制御装置で推定される第一の推定トルク、およびモータ制御装置で推定される第二の推定トルクを示す図FIG. 4 is a diagram showing a motor speed representing an operation of a target machine controlled by the motor control device, a first estimated torque estimated by the motor control device, and a second estimated torque estimated by the motor control device; モータ制御装置が備える処理回路をプロセッサおよびメモリで構成する場合の例を示す図FIG. 4 is a diagram showing an example of a case in which a processing circuit included in a motor control device is configured with a processor and a memory; モータ制御装置が備える処理回路を専用のハードウェアで構成する場合の例を示す図FIG. 4 is a diagram showing an example of a case where a processing circuit included in a motor control device is configured with dedicated hardware;

以下に、本発明の実施の形態に係るモータ制御装置を図面に基づいて詳細に説明する。なお、この実施の形態によりこの発明が限定されるものではない。 A motor control device according to an embodiment of the present invention will be described in detail below with reference to the drawings. In addition, this invention is not limited by this embodiment.

実施の形態．
図１は、本発明の実施の形態に係るモータ制御装置１００の構成例を示すブロック図である。モータ制御装置１００は、対象機械１０を駆動するモータを制御する。モータ制御装置１００は、モータ状態観測部１と、物理モデル計算部２と、特徴量生成部３と、機械学習モデル計算部４と、を備える。物理モデル計算部２は、第一のトルク推定部２１と、物理モデル記憶部２２と、を備える。機械学習モデル計算部４は、第二のトルク推定部４１と、機械学習モデル記憶部４２と、を備える。Embodiment.
FIG. 1 is a block diagram showing a configuration example of a motor control device 100 according to an embodiment of the invention. The motor control device 100 controls a motor that drives the target machine 10 . A motor control device 100 includes a motor state observation unit 1 , a physical model calculation unit 2 , a feature quantity generation unit 3 , and a machine learning model calculation unit 4 . The physical model calculator 2 includes a first torque estimator 21 and a physical model storage 22 . The machine learning model calculator 4 includes a second torque estimator 41 and a machine learning model storage 42 .

モータ制御装置１００が制御対象とする対象機械１０について説明する。図２は、本実施の形態に係るモータ制御装置１００が制御対象とする対象機械１０を示す模式図である。ここでは、対象機械１０として、水平多関節ロボットを例にして説明する。対象機械１０である水平多関節ロボットは、第１軸１１ｊと、第２軸１２ｊと、第３軸１３ｊと、第４軸１４ｊと、第１リンク１１ｌと、第２リンク１２ｌと、第３リンク１３ｌと、第４リンク１４ｌと、手先部１５ｅと、から構成されている。モータ制御装置１００は、第１軸１１ｊに搭載されたアクチュエータによって第２リンク１２ｌを駆動し、第２軸１２ｊに搭載されたアクチュエータによって第３リンク１３ｌを駆動し、第３軸１３ｊおよび第４軸１４ｊに搭載されたアクチュエータによって第４リンク１４ｌを駆動することで、水平多関節ロボットの手先部１５ｅの位置および向きを制御することができる。ここで、第３軸１３ｊは第４リンク１４ｌの手先部１５ｅ方向に対する軸回りの回転運動に対応し、第４軸１４ｊは第４リンク１４ｌの手先部１５ｅ方向に対する並進運動に対応している。 A target machine 10 to be controlled by the motor control device 100 will be described. FIG. 2 is a schematic diagram showing the target machine 10 to be controlled by the motor control device 100 according to the present embodiment. Here, as the target machine 10, a horizontal articulated robot will be described as an example. A horizontal articulated robot, which is the target machine 10, has a first axis 11j, a second axis 12j, a third axis 13j, a fourth axis 14j, a first link 11l, a second link 12l, and a third link. 13l, the fourth link 14l, and the end portion 15e. The motor control device 100 drives the second link 12l by the actuator mounted on the first shaft 11j, drives the third link 13l by the actuator mounted on the second shaft 12j, and drives the third shaft 13j and the fourth shaft. By driving the fourth link 14l with an actuator mounted on the 14j, the position and orientation of the hand portion 15e of the horizontal articulated robot can be controlled. Here, the third axis 13j corresponds to the rotational movement of the fourth link 14l about its axis in the direction of the hand portion 15e, and the fourth axis 14j corresponds to the translational movement of the fourth link 14l in the direction of the hand portion 15e.

本実施の形態では、対象機械１０として水平多関節ロボットを例示したがこれに限定されず、モータ制御装置１００は、モータによって駆動される対象機械１０について広く適用することができる。対象機械１０の例としては、さらに、垂直多関節ロボット、ＮＣ（ＮｕｍｅｒｉｃａｌＣｏｎｔｒｏｌ）工作機械、実装機などを挙げることができる。 In the present embodiment, a horizontal articulated robot is exemplified as the target machine 10, but it is not limited to this, and the motor control device 100 can be widely applied to target machines 10 driven by motors. Examples of the target machine 10 further include vertical articulated robots, NC (Numerical Control) machine tools, mounting machines, and the like.

モータ状態観測部１は、対象機械１０を駆動するモータの状態を観測し、観測結果をモータ状態信号として出力する。具体的には、モータ状態観測部１は、第１軸１１ｊと、第２軸１２ｊと、第３軸１３ｊと、第４軸１４ｊと、に対応する各アクチュエータの位置、速度、加速度の少なくともいずれか一つと、前述のモータのトルクである実トルクとをモータ状態信号として出力する。ここで、実トルクについては、直接センサから計測した値であってもよいし、アクチュエータから取得された入力電流値の換算値を用いてもよい。また、モータ状態信号は、各アクチュエータの位置、速度、加速度、および実トルクの信号のみに限定されない。モータ状態信号は、ロボットの各軸または規定された個所に設けられた内界センサもしくは外界センサのセンシングデータ、例えば、各軸のアクチュエータの温度データ、手先の力覚センサの取得データ、赤外線センサ、超音波センサなどのセンサから得られた距離データ、ビジョンセンサから得られた動画像データ、およびそれらを画像処理して得られた特徴量データ、などの値を含んでもよい。 The motor state observation unit 1 observes the state of the motor that drives the target machine 10 and outputs the observation result as a motor state signal. Specifically, the motor state observation unit 1 detects at least one of the position, velocity, and acceleration of each actuator corresponding to the first axis 11j, the second axis 12j, the third axis 13j, and the fourth axis 14j. , and the actual torque, which is the torque of the motor, as a motor state signal. Here, the actual torque may be a value directly measured from a sensor, or may be a converted value of an input current value obtained from an actuator. Also, the motor status signals are not limited to the position, velocity, acceleration, and actual torque signals of each actuator. The motor status signals are sensing data from internal or external sensors provided on each axis of the robot or at specified locations, such as temperature data from actuators on each axis, data obtained from hand force sensors, infrared sensors, Values such as distance data obtained from a sensor such as an ultrasonic sensor, moving image data obtained from a vision sensor, and feature amount data obtained by image processing thereof may be included.

第一のトルク推定部２１は、モータ状態信号と、物理モデル記憶部２２の出力である後述する物理モデルパラメータとを入力として、前述のモータのトルクである実トルクの推定値である第一の推定トルクを運動方程式により算出する。すなわち、第一のトルク推定部２１は、モータ状態信号および対象機械１０の物理モデルパラメータに基づき、運動方程式を用いてモータのトルクの推定値である第一の推定トルクを推定する。第一のトルク推定部２１は、第一の推定トルクと、第一の推定トルクの成分である物理状態変数とを出力する。物理状態変数には、慣性項、遠心力の項、コリオリ力の項、重力項だけでなく、クーロン摩擦の項、粘性摩擦の項、アームの振動の項、弾性たわみの項に加えて、後述する非線形摩擦モデルのような、第一の推定トルクを算出する上で何らかの物理現象を模擬して導入された関数の項が含まれてもよい。物理状態変数に摩擦推定値を含むことにより、モータ制御装置１００は、摩擦モデルの不確かさおよび実トルクの摩擦成分に対する摩擦推定値の誤差を、後述する機械学習モデル計算部４で補正することを可能にする。 The first torque estimating unit 21 receives the motor state signal and the physical model parameters described later, which are the outputs of the physical model storage unit 22, and calculates the first torque, which is the estimated value of the actual torque, which is the torque of the motor. Estimated torque is calculated by the equation of motion. That is, the first torque estimator 21 estimates the first estimated torque, which is the estimated torque of the motor, using the equation of motion based on the motor state signal and the physical model parameters of the target machine 10 . The first torque estimator 21 outputs a first estimated torque and physical state variables that are components of the first estimated torque. The physical state variables include not only inertia terms, centrifugal force terms, Coriolis force terms, and gravitational terms, but also Coulomb friction terms, viscous friction terms, arm vibration terms, and elastic deflection terms. A term of a function introduced by simulating some physical phenomenon in calculating the first estimated torque, such as a non-linear friction model, may be included. By including the estimated friction value in the physical state variables, the motor control device 100 corrects the uncertainty of the friction model and the error of the estimated friction value with respect to the friction component of the actual torque in the machine learning model calculation unit 4, which will be described later. to enable.

運動方程式において用いられるモータ状態信号および物理モデルパラメータは、それぞれ、各アクチュエータの位置、速度、加速度と、各リンクの質量、重心位置、慣性テンソルとに対応している。すなわち、物理モデルパラメータには、対象機械１０のリンク機構における質量、対象機械１０のリンク機構における重心位置、および対象機械１０のリンク機構における慣性テンソルが含まれる。これにより、モータ制御装置１００は、対象機械１０の既知の機構に関する情報を利用する逆動力学において、係数の質量、重心位置、慣性テンソルを物理モデルパラメータとして学習し実トルクに対する第一の推定トルクの推定精度を向上させることができる。また、モータ制御装置１００は、後述する機械学習モデル計算部４における補正値が相対的に小さくなり、第二の推定トルクの汎化性能を向上させることができる。ここで、運動方程式としては、例えば、ニュートンオイラー法、ラグランジュの運動方程式などを用いることができる。 The motor state signals and physical model parameters used in the equations of motion correspond to the position, velocity and acceleration of each actuator and the mass, center of gravity position and inertia tensor of each link, respectively. That is, the physical model parameters include the mass of the link mechanism of the target machine 10 , the center of gravity position of the link mechanism of the target machine 10 , and the inertia tensor of the link mechanism of the target machine 10 . As a result, the motor control device 100 learns the mass of the coefficients, the position of the center of gravity, and the inertia tensor as physical model parameters in the inverse dynamics using the information about the known mechanism of the target machine 10, and the first estimated torque with respect to the actual torque. can improve the estimation accuracy of Further, in the motor control device 100, the correction value in the machine learning model calculation unit 4, which will be described later, becomes relatively small, and the generalization performance of the second estimated torque can be improved. Here, as the equation of motion, for example, the Newton-Euler method, the Lagrangian equation of motion, or the like can be used.

第一のトルク推定部２１は、モータ状態信号により求まるモータの速度を入力として、トルクの摩擦成分の推定値を出力する連続かつ滑らかな摩擦モデルを含む運動方程式を用いる。後述するように機械学習モデル計算部４がニューラルネットワークを用いるが、ニューラルネットワークが計算できるのは入力に対して連続な関数であるため、特徴量および特徴量生成部３への入力は連続であることが望ましい。そのため、モータ制御装置１００は、速度ゼロ近傍も含めて連続となる摩擦モデルを定義することで、物理モデル記憶部２２内で学習しきれない摩擦成分についてもニューラルネットワークを用いて学習および補正することが可能となる。 The first torque estimator 21 uses an equation of motion including a continuous and smooth friction model that outputs an estimated value of the frictional component of the torque with the speed of the motor obtained from the motor state signal as input. As will be described later, the machine learning model calculation unit 4 uses a neural network, but since the neural network can calculate a continuous function with respect to the input, the feature amount and the input to the feature amount generation unit 3 are continuous. is desirable. Therefore, the motor control device 100 defines a continuous friction model including near-zero speed, so that even the friction component that cannot be learned in the physical model storage unit 22 can be learned and corrected using a neural network. becomes possible.

物理モデル記憶部２２は、対象機械１０を一定期間動作させた際のモータ状態信号を教師データとし、モータ状態信号に含まれる実トルクと第一の推定トルクとの差分を低減するよう物理モデルパラメータを学習する。物理モデル記憶部２２は、学習した物理モデルパラメータを記憶する。モータ制御装置１００は、物理モデルパラメータを学習することで、センサ、配線類の取り付け、加工誤差、組み立て誤差などによる物性の変化を補正し、実トルクの推定精度を向上させることができる。また、一般的に、物理モデルパラメータは、諸元表、ＣＡＤ（ＣｏｍｐｕｔｅｒＡｉｄｅｄＤｅｓｉｇｎ）データなどからおおよその値が算出可能であるため、これらの値を学習時の初期値として与えることにより学習の速度、収束性などを向上させることが可能である。ただし、これらの値が未知もしくは算出が困難な場合は、任意の定数で初期化してもよい。 The physical model storage unit 22 uses the motor state signal obtained when the target machine 10 is operated for a certain period of time as teaching data, and stores the physical model parameters so as to reduce the difference between the actual torque included in the motor state signal and the first estimated torque. to learn. The physical model storage unit 22 stores learned physical model parameters. By learning the physical model parameters, the motor control device 100 can correct changes in physical properties due to attachment of sensors and wiring, processing errors, assembly errors, etc., and can improve the accuracy of estimating the actual torque. In general, physical model parameters can be roughly calculated from specification tables, CAD (Computer Aided Design) data, etc. By giving these values as initial values at the time of learning, the speed of learning can be improved. , convergence, etc. can be improved. However, if these values are unknown or difficult to calculate, they may be initialized with arbitrary constants.

物理モデル記憶部２２での物理モデルパラメータの学習には、上述の運動方程式を変形し、物理モデルパラメータのみを分離した優決定系の線形方程式を用いる。ニュートンの運動方程式、オイラーの運動方程式、およびラグランジュの運動方程式は、いずれも位置、速度、加速度に対して非線形な項を考慮しない場合、質量、重心位置、慣性テンソルからなる定数係数と、位置、速度、加速度からなる変数との積の線形結合で表現できる。すなわち、実トルクに対する動力学計算の誤差をΔｆとおくと、各軸の運動方程式は、質量、重心位置、慣性テンソルからなる物理モデルパラメータｗ、および速度ｖと加速度ａからなる係数行列Ａ（ｖ，ａ）を用いて式（１）のように変数分離した形で記述できる。 For the learning of the physical model parameters in the physical model storage unit 22, linear equations of an overdetermined system obtained by transforming the above equation of motion and separating only the physical model parameters are used. Newton's equations of motion, Euler's equations of motion, and Lagrangian's equations of motion, when none of the nonlinear terms for position, velocity, and acceleration are considered, have constant coefficients consisting of mass, center of gravity position, and inertia tensor, position, It can be expressed as a linear combination of products with variables consisting of velocity and acceleration. That is, if the error in the dynamics calculation with respect to the actual torque is Δf, the equation of motion for each axis consists of the physical model parameter w consisting of the mass, the position of the center of gravity, and the inertia tensor, and the coefficient matrix A (v , a) can be used to describe in a variable-separated form as shown in Equation (1).

Δｆ＝Ａ（ｖ，ａ）ｗ …（１） Δf=A(v,a)w (1)

よって、物理モデル記憶部２２は、式（１）を最小二乗法により軸毎にｗについて解くことで、各軸の物理モデルパラメータを算出できる。また、物理モデル記憶部２２は、駆動軸が直列に接続されている産業用ロボットのような軸間干渉の生じうる駆動機械に対しては、各軸のｗに関する式（１）を１つの線形方程式にまとめて同時に解くことにより、物理モデル計算部２全体としてより尤もらしい解を得ることができる。物理モデル記憶部２２は、物理モデルパラメータｗの同定手法に関して、最小二乗法以外にも正則化最小二乗法、カルマンフィルタに加えて、ベイズ推定、遺伝的アルゴリズム、ニューラルネットワークなどを用いてもよい。 Therefore, the physical model storage unit 22 can calculate the physical model parameter of each axis by solving the equation (1) for w for each axis by the least squares method. In addition, the physical model storage unit 22 converts the equation (1) regarding w of each axis into one linear By summarizing the equations and solving them simultaneously, a more plausible solution can be obtained for the physical model calculation unit 2 as a whole. The physical model storage unit 22 may use, in addition to the least squares method, the regularized least squares method, the Kalman filter, Bayesian estimation, a genetic algorithm, a neural network, etc., for the identification method of the physical model parameter w.

なお、物理モデル記憶部２２は、運動方程式において摩擦の影響を考慮する際、次に示すクーロン摩擦および粘性摩擦からなる摩擦モデルである線形摩擦モデルなど、速度の関数と摩擦係数との線形結合で表現される摩擦係数が分離可能な式で摩擦を表現することで、摩擦係数についても式（１）と同様に物理モデルパラメータとして同時に推定が可能となる。物理モデル記憶部２２で推定される摩擦係数を式（２）で示す。なお、イメージで取り込まれる式（２）および後述の式（３）ではｑ_iの上部にドットが付与されているが、これを説明部分で表現できない。そのため、説明部分では、ｑ_iの上部にドットが付与されているものを単にｑ_iと表記する。In addition, when considering the effect of friction in the equation of motion, the physical model storage unit 22 stores a linear combination of a function of velocity and a friction coefficient, such as a linear friction model that is a friction model consisting of Coulomb friction and viscous friction shown below. By expressing the friction with a separable equation, the friction coefficient can be simultaneously estimated as a physical model parameter in the same manner as the equation (1). The coefficient of friction estimated by the physical model storage unit 22 is expressed by Equation (2). In equation (2), which is captured as an image, and equation (3), which will be described later, a dot is added above q _i , but this cannot be expressed in the description. Therefore, in the explanation part, q _i with a dot above it is simply written as q _i .

ここで、速度ｑ_iで動作する軸ｉの摩擦モデルＦ_iに対して、ｒ_FC,iはクーロン摩擦係数、ｒ_FV,iは粘性摩擦係数である。ただし、産業用ロボットをはじめとする多軸機構の動作においては、粘性摩擦が速度に対して完全に線形な関係とならず、特にアクチュエータの高速動作時においては、線形と仮定した場合の値と比べて小さくなることがしばしば確認される。そのような場合は、速度に対する非線形性を考慮したより自由度の高い摩擦モデルを用いて摩擦を推定し、実トルクと第一の推定トルクの差分における摩擦成分を除去した上で、その他の物理モデルパラメータを推定することが望ましい。このような摩擦モデルである非線形摩擦モデルとして、式（３）で示される以下に示すような関数が一例として挙げられる。where r _FC,i is the Coulomb friction coefficient and r _FV,i is the viscous friction coefficient for a friction model F _i of axis i operating at velocity q _i . However, in the operation of multi-axis mechanisms such as industrial robots, the viscous friction does not have a completely linear relationship with the speed. It is often confirmed that it is smaller than the In such a case, the friction is estimated using a friction model with a higher degree of freedom that considers the nonlinearity with respect to speed, and after removing the friction component in the difference between the actual torque and the first estimated torque, other physical It is desirable to estimate the model parameters. An example of such a non-linear friction model is the following function represented by Equation (3).

ここで、ω_lsは低速動作の境界速度であり、ω_hsは高速動作の境界速度であり、ｒ_ST,iは摩擦の飽和係数である。非線形摩擦モデルのグラフを図３に示す。図３は、本実施の形態に係る物理モデル記憶部２２で推定される物理モデルパラメータである非線形摩擦モデルの例を示す図である。図３において、横軸はモータ速度を示し、縦軸は摩擦を示す。非線形摩擦モデルは、クーロン摩擦係数ｒ_FC,iおよび粘性摩擦係数ｒ_FV,iに加えて計４つのパラメータを持ち、速度ｑ_iに対して非線形な関数であるため、これらは実トルクと第一の推定トルクとの差分から非線形最小二乗法であるＬｅｖｅｎｂｅｒｇ－Ｍａｒｑｕａｒｄｔ法を用いて求める。Ｌｅｖｅｎｂｅｒｇ－Ｍａｒｑｕａｒｄｔ法は、初期値依存性を持つため、一度摩擦に線形性を仮定してクーロン摩擦係数ｒ_FC,iおよび粘性摩擦係数ｒ_FV,iを物理モデルパラメータｗに加え、上述の式（１）により各摩擦係数を算出した後、それらを初期値とすることで初期値依存性による推定の失敗を防ぐことができる。すなわち、物理モデル記憶部２２は、物理モデルパラメータの学習において、最小二乗法、正則化最小二乗法、非線形最小二乗法、またはカルマンフィルタなどを用いることができる。モータ制御装置１００は、最小二乗法、非線形最小二乗法、非線形最小二乗法、またはカルマンフィルタを用いることで、対象機械１０の運動方程式および状態方程式における未知の係数を動作データから推定することが可能となる。動作データは、例えば、モータ状態信号に含まれるデータである。where ω _ls is the boundary speed for low speed motion, ω _hs is the boundary speed for high speed motion, and r _ST,i is the saturation coefficient of friction. A graph of the nonlinear friction model is shown in FIG. FIG. 3 is a diagram showing an example of a nonlinear friction model, which is a physical model parameter estimated by the physical model storage unit 22 according to this embodiment. In FIG. 3, the horizontal axis indicates motor speed and the vertical axis indicates friction. The nonlinear friction model has a total of four parameters in addition to the Coulomb friction coefficient rFC _,i and the viscous friction coefficient _rFV,i _. is obtained using the Levenberg-Marquardt method, which is a nonlinear least-squares method, from the difference from the estimated torque. Since the Levenberg-Marquardt method has initial value dependence, once linearity is assumed for friction, the Coulomb friction coefficient r _FC,i and the viscous friction coefficient r _FV,i are added to the physical model parameter w, and the above equation ( After calculating each friction coefficient by 1), by using them as initial values, estimation failure due to initial value dependence can be prevented. That is, the physical model storage unit 22 can use the least-squares method, the regularized least-squares method, the nonlinear least-squares method, the Kalman filter, or the like in learning the physical model parameters. The motor control device 100 can estimate unknown coefficients in the motion equation and the state equation of the target machine 10 from the operation data by using the least squares method, the nonlinear least squares method, the nonlinear least squares method, or the Kalman filter. Become. Operational data is, for example, data contained in a motor status signal.

物理モデル記憶部２２は、ここで得られた非線形摩擦モデルの係数を物理モデルパラメータとして記憶し、各時刻の非線形摩擦モデルの出力を物理状態変数に追加する。モータ制御装置１００は、摩擦モデルの係数を物理モデルパラメータとして物理モデル記憶部２２で記憶しておくことにより、経年劣化、異常などによる粘性摩擦、またクーロン摩擦の変化を検知することができる。物理モデル記憶部２２は、例えば、物理モデルパラメータの学習において、非線形摩擦モデルの係数および他の物理モデルパラメータを交互に推定する。モータ制御装置１００は、摩擦モデルの係数およびその他の物理モデルパラメータを交互に推定することで、摩擦モデルをより複雑かつ詳細に設計することができる。また、モータ制御装置１００は、その他の物理モデルパラメータの推定についても摩擦の影響をほとんど除去した状態で行えるため、物理モデルパラメータをより高精度に学習することが可能となる。 The physical model storage unit 22 stores the coefficients of the nonlinear friction model obtained here as physical model parameters, and adds the output of the nonlinear friction model at each time to the physical state variables. By storing coefficients of the friction model as physical model parameters in the physical model storage unit 22, the motor control device 100 can detect changes in viscous friction and Coulomb friction due to deterioration over time and abnormalities. The physical model storage unit 22, for example, alternately estimates coefficients of the nonlinear friction model and other physical model parameters in learning the physical model parameters. By alternately estimating coefficients of the friction model and other physical model parameters, the motor control device 100 can design the friction model in a more complex and detailed manner. In addition, since the motor control device 100 can also estimate other physical model parameters in a state in which the influence of friction is almost eliminated, the physical model parameters can be learned with higher accuracy.

ただし、非線形摩擦モデルについては、速度の関数に限らず、位置、加速度、モータ温度などにも依存する多変数関数であってもよく、非線形摩擦モデルの各係数はＧａｕｓｓ－Ｎｅｗｔｏｎ法、最急降下法、拡張カルマンフィルタ、無香カルマンフィルタなどによって同定してもよい。また、非線形摩擦モデルとして、サポートベクター回帰モデル、回帰木、ランダムフォレスト、多変量適応型回帰スプラインモデル、ニューラルネットワークなど機械学習アルゴリズムに基づいたモデル生成手法を用いてもよいが、後述の機械学習モデル計算部４において、ニューラルネットワークをはじめとする機械学習部４１１により第一の推定トルクの補正を行うため、非線形摩擦モデルは摩擦の静的な特性に着目したマクロな影響のみを除去できればよく、陽に与えられる速度のみの関数として考慮すれば十分である。 However, the nonlinear friction model is not limited to a function of velocity, and may be a multivariable function that depends on position, acceleration, motor temperature, etc. Each coefficient of the nonlinear friction model is Gauss-Newton method, steepest descent method , an extended Kalman filter, an unscented Kalman filter, or the like. As the nonlinear friction model, a support vector regression model, regression tree, random forest, multivariate adaptive regression spline model, neural network, or other model generation method based on machine learning algorithms may be used. In the calculation unit 4, the first estimated torque is corrected by the machine learning unit 411 including the neural network. It is sufficient to consider it as a function of only the velocity given to .

特徴量生成部３は、第一の推定トルクと、物理状態変数と、モータ状態信号とを入力とし、機械学習部４１１の入力である特徴量を出力する。これらの特徴量生成部３の入力および推定対象である実トルクは、時系列的に変化する値である。機械学習モデル記憶部４２は、それぞれ時間軸方向の相関についても考慮する必要があるため、特徴量も過去の一定期間の値も同時に含むベクトルとして与える。また、本実施の形態で対象とする水平多関節ロボットのような多軸機構においては、軸間の相互干渉の影響が小さくないため、それぞれの軸に対応する入力の値についても同時に含むベクトルとすることが望ましい。ただし、特徴量におけるトルク成分に関しては、実トルクと第一の推定トルクとの差分として与え、推定対象とする実トルクに対して過去の値のみを用いるものとする。 The feature amount generation unit 3 receives the first estimated torque, the physical state variable, and the motor state signal as inputs, and outputs the feature amount that is the input of the machine learning unit 411 . The actual torque, which is the input and the estimation target of these feature quantity generators 3, is a value that changes in time series. Since the machine learning model storage unit 42 also needs to consider the correlation in the direction of the time axis, it is given as a vector containing both the feature amount and the value for a certain period in the past. In addition, in a multi-axis mechanism such as a horizontal articulated robot targeted in this embodiment, since the influence of mutual interference between axes is not small, It is desirable to However, the torque component in the feature quantity is given as the difference between the actual torque and the first estimated torque, and only past values are used for the actual torque to be estimated.

図４は、本実施の形態に係る特徴量生成部３および機械学習モデル計算部４の構成例を示すブロック図である。図４において、ｚは単位遅延演算子であり、Ｎ_dは遅延の単位時刻数を示す０以上の整数である。図４に示す単位遅延演算子は、特徴量生成部３に備えられた記憶装置ともいえる。すなわち、特徴量生成部３は、第一の推定トルク、物理状態変数、およびモータ状態信号を格納可能な記憶装置を備え、記憶装置に格納されている第一の推定トルク、物理状態変数、およびモータ状態信号から抽出したものを特徴量として出力してもよい。モータ制御装置１００は、物理状態変数およびモータ状態信号に関する時系列変化の情報を入力として特徴量に含むことで、機械学習モデル計算部４が対象機械１０の動作をより詳細に予測できるため、実トルクに対する第二の推定トルクの推定精度を向上させることができる。FIG. 4 is a block diagram showing a configuration example of the feature quantity generation unit 3 and the machine learning model calculation unit 4 according to this embodiment. In FIG. 4, z is a unit delay operator, and _Nd is an integer greater than or equal to 0 indicating the number of unit times of delay. The unit delay operator shown in FIG. 4 can also be said to be a storage device provided in the feature generator 3 . That is, the feature quantity generation unit 3 includes a storage device capable of storing the first estimated torque, the physical state variables, and the motor state signal, and the first estimated torque, the physical state variables, and What is extracted from the motor state signal may be output as the feature quantity. The motor control device 100 includes information on time-series changes related to the physical state variables and the motor state signal as input in the feature amount, so that the machine learning model calculation unit 4 can predict the operation of the target machine 10 in more detail. It is possible to improve the estimation accuracy of the second estimated torque with respect to the torque.

特徴量生成部３は、第一の推定トルク、物理状態変数、およびモータ状態信号の全てを使用せず、一部のみを使用して特徴量を生成し、出力してもよい。すなわち、特徴量生成部３は、第一の推定トルク、物理状態変数、およびモータ状態信号のうち少なくとも一つから特徴量を生成し出力してもよい。 The feature amount generation unit 3 may generate and output the feature amount using only some of the first estimated torque, the physical state variable, and the motor state signal without using all of them. That is, the feature quantity generator 3 may generate and output a feature quantity from at least one of the first estimated torque, the physical state variable, and the motor state signal.

第二のトルク推定部４１は、機械学習部４１１を備える。第二のトルク推定部４１は、特徴量生成部３で生成された特徴量と、第一のトルク推定部２１の出力である第一の推定トルクと、機械学習モデル記憶部４２の出力である機械学習モデルパラメータとを入力とし、特徴量および機械学習モデルパラメータを用いた機械学習部４１１の機械学習に基づいて機械学習補正量を算出する。第二のトルク推定部４１は、第一の推定トルクに機械学習補正量を加算したものを第二の推定トルクとして出力する。 The second torque estimating section 41 includes a machine learning section 411 . The second torque estimating unit 41 is the feature quantity generated by the feature quantity generating unit 3, the first estimated torque output from the first torque estimating unit 21, and the output of the machine learning model storage unit 42. Machine learning model parameters are input, and a machine learning correction amount is calculated based on machine learning of the machine learning unit 411 using the feature amount and the machine learning model parameters. The second torque estimator 41 outputs the result obtained by adding the machine learning correction amount to the first estimated torque as the second estimated torque.

機械学習部４１１は、特徴量生成部３で生成された特徴量と機械学習モデル記憶部４２の出力である機械学習モデルパラメータとを入力とし、実トルクに対する第二の推定トルクの誤差が最小となるよう、機械学習補正量を出力するニューラルネットワークから構成される。すなわち、出力の機械学習補正量の理想的な値は実トルクと第一の推定トルクとの差分と一致し、入力である特徴量に遅延した過去の実トルクと第一の推定トルクとの差分を含むため、機械学習部４１１は、図５に示すような入出力関係に自己回帰項を持つ。 The machine learning unit 411 receives the feature quantity generated by the feature quantity generation unit 3 and the machine learning model parameters output from the machine learning model storage unit 42, and determines that the error of the second estimated torque with respect to the actual torque is the minimum. It is composed of a neural network that outputs the machine learning correction amount so that That is, the ideal value of the machine learning correction amount for the output matches the difference between the actual torque and the first estimated torque, and the difference between the past actual torque delayed by the input feature value and the first estimated torque , the machine learning unit 411 has an autoregressive term in the input/output relationship as shown in FIG.

図５は、本実施の形態に係る特徴量生成部３および機械学習モデル計算部４の構成において自己回帰項を示す図である。図５は、図４から自己回帰項に関連する部分を抽出したものである。ここで、機械学習モデルパラメータは、ニューラルネットワークにおける重み行列およびバイアスベクトルを指す。また、ニューラルネットワークの種類として、中間層の入出力関係に再帰的な構造を持つ、リカレントニューラルネットワーク、ＬＳＴＭ（ＬｏｎｇＳｈｏｒｔＴｅｒｍＭｅｍｏｒｙ）、ＧＲＵ（ＧａｔｅｄＲｅｃｕｒｒｅｎｔＵｎｉｔ）といったリカレント型と呼ばれるニューラルネットワークを用いることも、このような時系列データに対する予測精度を向上する上で有効な手段である。また、学習負荷および計算量を削減するため、ニューラルネットワークの代わりに多項式モデル、重回帰モデル、サポートベクター回帰モデル、回帰木、ランダムフォレスト、多変量適応型回帰スプラインモデルなどを用いてもよいし、軸間の相互干渉を無視してよい場合などは、軸毎に独立に機械学習部４１１を適用してもよい。 FIG. 5 is a diagram showing autoregression terms in the configurations of the feature amount generation unit 3 and the machine learning model calculation unit 4 according to this embodiment. FIG. 5 shows a portion related to the autoregressive term extracted from FIG. Here, machine learning model parameters refer to the weight matrix and bias vector in the neural network. Also, as a type of neural network, use a neural network called a recurrent type such as a recurrent neural network, LSTM (Long Short Term Memory), GRU (Gated Recurrent Unit), which has a recursive structure in the input/output relationship of the intermediate layer. is also an effective means for improving the prediction accuracy for such time-series data. Also, in order to reduce the learning load and the amount of calculation, polynomial models, multiple regression models, support vector regression models, regression trees, random forests, multivariate adaptive regression spline models, etc. may be used instead of neural networks, In cases such as when mutual interference between axes can be ignored, the machine learning unit 411 may be applied independently for each axis.

ただし、ニューラルネットワークをはじめとする機械学習部４１１で用いた汎用的な関数近似モデルは、一般的に不連続関数の近似に適していない。そのため、第一の推定トルクにおいて式（２）の線形摩擦モデルによって摩擦分のトルクの近似および補正を行うと、実トルクと第一の推定トルクとの差分が不連続なジャンプを起こし、これらの値を入出力に含み自己回帰項を形成する機械学習部４１１の機械学習モデルパラメータの学習結果および機械学習補正量の算出結果が著しく悪化する可能性がある。さらに、駆動機械の実機においては、粘着と滑りとが交互に起こって発生するスティックスリップ現象、ストライベック曲線で表わされるような潤滑に関する摩擦などが影響し、速度０付近および低速動作時の摩擦は非線形性が強いため、これらのモデルを陽に与えることは非常に困難である。そこで、式（３）で示した非線形摩擦モデルように、速度０近傍を連続かつ滑らかに接続した摩擦モデルを用いることで、第一の推定トルクおよび特徴量の各値がそれぞれ連続となり、上述のような第一の推定トルクおよび非線形摩擦モデルで再現が困難な摩擦成分についても、機械学習補正量を加えることによって高精度に推定が可能となる。さらに、特徴量は、過去の一定期間の速度、非線形摩擦モデルの出力などを同時に含むベクトルである。そのため、機械学習部４１１は、摩擦のヒステリシス特性など直前の状態に依存して変化する推定誤差要因についても学習し、機械学習補正量として補正することが可能となる。 However, general-purpose function approximation models such as neural networks used in the machine learning unit 411 are generally not suitable for approximating discontinuous functions. Therefore, when the frictional torque is approximated and corrected by the linear friction model of formula (2) in the first estimated torque, the difference between the actual torque and the first estimated torque causes a discontinuous jump, and these There is a possibility that the learning result of the machine learning model parameter and the calculation result of the machine learning correction amount of the machine learning unit 411 that includes the values in the input and output and form the autoregressive term will deteriorate significantly. Furthermore, in the actual drive machine, the stick-slip phenomenon, which occurs when adhesion and slip alternate, and the friction related to lubrication, which is represented by the Stribeck curve, are affected. Due to the strong nonlinearity, it is very difficult to give these models explicitly. Therefore, by using a friction model in which the vicinity of the speed 0 is continuously and smoothly connected like the nonlinear friction model shown in Equation (3), each value of the first estimated torque and the feature amount becomes continuous, and the above-mentioned Even for the friction component that is difficult to reproduce with the first estimated torque and the nonlinear friction model, it is possible to estimate with high accuracy by adding the machine learning correction amount. Furthermore, the feature amount is a vector that simultaneously includes the velocity for a certain period of time in the past, the output of the nonlinear friction model, and the like. Therefore, the machine learning unit 411 can also learn factors of estimation errors that change depending on the immediately preceding state, such as the hysteresis characteristics of friction, and correct them as machine learning correction amounts.

このように、第二のトルク推定部４１は、入出力関係に自己回帰項を含む機械学習を用いて特徴量および機械学習モデルパラメータから機械学習補正量を算出する。第二のトルク推定部４１は、算出した機械学習補正量を第一の推定トルクに加算して第二の推定トルクを出力する。また、第二のトルク推定部４１は、入出力関係に自己回帰項を含む機械学習としてニューラルネットワークを用いた機械学習を用いてもよい。モータ制御装置１００は、機械学習の手法としてニューラルネットワークを利用することで、運動方程式上で表現が困難なトルク成分を非線形多自由度モデルとして学習し保存することが可能となる。また、第二のトルク推定部４１は、機械学習補正量を算出するのに用いる入出力関係に自己回帰項を含む機械学習において、自己回帰項として実トルクと第一の推定トルクとの差分を用いてもよい。実トルクと第一の推定トルクとの差分が第一の推定トルクに対する補正量の目標値である。補正量の目標値は対象機械１０のモータの動作に伴い時系列的に変化するため、モータ制御装置１００は、次の時刻の最適な補正量を推定する際、過去の補正量の目標値を入力に含む自己回帰項を備えることで、より高精度に推定することが可能となる。 In this way, the second torque estimator 41 uses machine learning including an autoregressive term in the input/output relationship to calculate the machine learning correction amount from the feature amount and the machine learning model parameters. The second torque estimator 41 adds the calculated machine learning correction amount to the first estimated torque and outputs the second estimated torque. Further, the second torque estimator 41 may use machine learning using a neural network as machine learning including an autoregressive term in the input/output relationship. By using a neural network as a machine learning method, the motor control device 100 can learn and store a torque component, which is difficult to express on the equation of motion, as a nonlinear multi-degree-of-freedom model. In addition, the second torque estimator 41 calculates the difference between the actual torque and the first estimated torque as an autoregressive term in machine learning that includes an autoregressive term in the input/output relationship used to calculate the machine learning correction amount. may be used. A difference between the actual torque and the first estimated torque is the target value of the correction amount for the first estimated torque. Since the target value of the correction amount changes in time series with the operation of the motor of the target machine 10, the motor control device 100 uses the past target value of the correction amount when estimating the optimum correction amount for the next time. By providing an autoregressive term included in the input, it is possible to estimate with higher accuracy.

機械学習モデル記憶部４２は、物理モデル記憶部２２における物理モデルパラメータの学習後に、対象機械１０を一定期間動作させた際の特徴量、および実トルクと第一の推定トルクとの差分をそれぞれ入出力の教師データとして、実トルクと第一の推定トルクとの差分に対する機械学習補正量の誤差が低減するよう機械学習モデルパラメータを学習する。すなわち、機械学習モデル記憶部４２は、機械学習補正量が実トルクと第一の推定トルクとの差分に近づくように機械学習モデルパラメータを教師あり学習により学習する。機械学習モデル記憶部４２は、学習した機械学習モデルパラメータを記憶する。学習開始時の機械学習モデルパラメータは任意の定数で初期化してよいが、特にニューラルネットワークで表現される機械学習部４１１は、機械学習モデルパラメータである重み行列、バイアスなどに対して初期値依存性を持つ。そのため、機械学習部４１１では、重み行列について、一様分布、通常の正規分布に加えて、切断正規分布、Ｈｅの正規分布、Ｇｌｏｒｏｔの正規分布など、正規分布に由来した確率分布で初期化することが一般的である。 The machine learning model storage unit 42 inputs the feature amount and the difference between the actual torque and the first estimated torque when the target machine 10 is operated for a certain period after the physical model parameters are learned in the physical model storage unit 22 . As training data for the output, machine learning model parameters are learned so as to reduce the error of the machine learning correction amount with respect to the difference between the actual torque and the first estimated torque. That is, the machine learning model storage unit 42 learns machine learning model parameters by supervised learning such that the machine learning correction amount approaches the difference between the actual torque and the first estimated torque. The machine learning model storage unit 42 stores learned machine learning model parameters. The machine learning model parameters at the start of learning may be initialized with arbitrary constants. have. Therefore, in the machine learning unit 411, in addition to the uniform distribution and the normal normal distribution, the weight matrix is initialized with a probability distribution derived from the normal distribution such as the truncated normal distribution, the He normal distribution, and the Grorot normal distribution. is common.

また、機械学習モデル記憶部４２は、学習手法としてミニバッチ勾配降下法および誤差逆伝搬法を利用し、重み行列およびバイアスベクトルで表わされる機械学習モデルパラメータをバッチ毎に更新する。すなわち、機械学習モデル記憶部４２は、機械学習モデルパラメータの学習時に用いる教師あり学習手法として誤差逆伝搬法を用いてもよい。ただし、教師データが十分な量だけ用意できない場合などは、機械学習モデル記憶部４２は、ミニバッチ勾配降下法の代わりに通常の勾配降下法、確率的勾配降下法などを用いてもよい。また、機械学習モデル記憶部４２は、機械学習部４１１におけるニューラルネットワークの汎化性能を向上させるため、学習の際にニューロンをランダムに除外するｄｒｏｐｏｕｔ、エポック毎に誤差を監視して学習を早く打ち切るｅａｒｌｙｓｔｏｐｐｉｎｇといった手法を用いるのも有効である。機械学習モデル記憶部４２は、学習時にｄｒｏｐｏｕｔを適用した際は、機械学習補正量の推論時にｄｒｏｐｏｕｔ確率を重み行列に適用する必要があるので、このようなハイパーパラメータについても機械学習モデルパラメータとして追加し保存する。 The machine learning model storage unit 42 also uses the mini-batch gradient descent method and the error backpropagation method as learning methods, and updates the machine learning model parameters represented by the weight matrix and the bias vector for each batch. That is, the machine learning model storage unit 42 may use the error backpropagation method as a supervised learning method used when learning the machine learning model parameters. However, if a sufficient amount of teacher data cannot be prepared, the machine learning model storage unit 42 may use a normal gradient descent method, a stochastic gradient descent method, or the like instead of the mini-batch gradient descent method. In addition, in order to improve the generalization performance of the neural network in the machine learning unit 411, the machine learning model storage unit 42 has a dropout that randomly excludes neurons during learning, and monitors errors for each epoch to quickly terminate learning. It is also effective to use a technique such as early stopping. When the machine learning model storage unit 42 applies dropout during learning, it is necessary to apply the dropout probability to the weight matrix during inference of the machine learning correction amount. Therefore, such hyperparameters are also added as machine learning model parameters. and save.

ここまでの物理モデルパラメータおよび機械学習モデルパラメータの学習の流れを、フローチャートを用いて説明する。図６は、本実施の形態に係るモータ制御装置１００における物理モデルパラメータおよび機械学習モデルパラメータの学習の流れを示すフローチャートである。物理モデル記憶部２２は、諸元表の値から各リンクの物理モデルパラメータである質量、重心位置、および慣性テンソルの初期値を設定する（ステップＳ１）。モータ状態観測部１は、対象機械１０のモータ状態信号である実トルク、位置、速度、および加速度の時系列データを教師データとして出力する（ステップＳ２）。 The flow of learning the physical model parameters and the machine learning model parameters up to this point will be described using flowcharts. FIG. 6 is a flow chart showing the flow of learning of physical model parameters and machine learning model parameters in motor control device 100 according to the present embodiment. The physical model storage unit 22 sets the initial values of the physical model parameters of mass, center of gravity position, and inertia tensor of each link from the values in the specification table (step S1). The motor state observation unit 1 outputs time-series data of actual torque, position, speed, and acceleration, which are motor state signals of the target machine 10, as teacher data (step S2).

第一のトルク推定部２１は、位置、速度、および加速度に対して、動力学計算により第一の推定トルクを算出する（ステップＳ３）。第一のトルク推定部２１は、動力学方程式に線形摩擦モデルの項を追加する（ステップＳ４）。物理モデル記憶部２２は、物理モデルパラメータにクーロン摩擦係数および粘性摩擦係数を追加する（ステップＳ５）。物理モデル記憶部２２は、実トルクと第一の推定トルクとの差分から、線形方程式に変形した動力学方程式および最小二乗法を用いて、物理モデルパラメータを同定する（ステップＳ６）。物理モデル記憶部２２は、同定した物理モデルパラメータにおけるクーロン摩擦係数および粘性摩擦係数を初期値とし、Ｌｅｖｅｎｂｅｒｇ－Ｍａｒｑｕａｒｄｔ法で非線形摩擦モデルを同定し、得られた各係数を物理モデルパラメータに追加する（ステップＳ７）。物理モデル記憶部２２は、実トルクから第一の推定トルクおよび非線形摩擦モデルの値を引いた結果に対し、摩擦モデルの項を除去し線形方程式に変形した動力学方程式および最小二乗法を用いて、摩擦係数以外の物理モデルパラメータを再び同定する（ステップＳ８）。第一のトルク推定部２１は、動力学方程式に非線形摩擦モデルの項を追加し、学習した物理モデルパラメータを用いて、第一の推定トルクを再び算出する（ステップＳ９）。 The first torque estimator 21 calculates a first estimated torque by dynamic calculation with respect to the position, velocity, and acceleration (step S3). The first torque estimator 21 adds a linear friction model term to the dynamics equation (step S4). The physical model storage unit 22 adds the Coulomb friction coefficient and the viscous friction coefficient to the physical model parameters (step S5). The physical model storage unit 22 identifies physical model parameters from the difference between the actual torque and the first estimated torque, using the dynamic equation transformed into a linear equation and the least-squares method (step S6). The physical model storage unit 22 uses the Coulomb friction coefficient and the viscous friction coefficient in the identified physical model parameters as initial values, identifies the nonlinear friction model by the Levenberg-Marquardt method, and adds each obtained coefficient to the physical model parameters ( step S7). The physical model storage unit 22 subtracts the first estimated torque and the value of the nonlinear friction model from the actual torque, and uses the dynamics equation transformed into a linear equation by removing the terms of the friction model and the least squares method. , the physical model parameters other than the friction coefficient are identified again (step S8). The first torque estimator 21 adds a nonlinear friction model term to the dynamics equation, and uses the learned physical model parameters to calculate the first estimated torque again (step S9).

特徴量生成部３は、第一の推定トルク、物理状態変数、およびモータ状態信号を入力として、それらを過去の一定期間の値も同時に含むベクトルにまとめ、特徴量として出力する（ステップＳ１０）。機械学習モデル記憶部４２は、機械学習モデルパラメータである重み行列、およびバイアスに対し、重み行列を正規分布で、バイアスを定数で初期化する（ステップＳ１１）。機械学習モデル記憶部４２は、学習時の条件として、ｅａｒｌｙｓｔｏｐｐｉｎｇ、ｄｒｏｐｏｕｔなどのハイパーパラメータを設定し、それらの値を機械学習モデルパラメータへ追加する（ステップＳ１２）。第二のトルク推定部４１は、入力を特徴量、出力を実トルクと第一の推定トルクとの差分である機械学習補正量の理想値とし、機械学習モデルパラメータにおける学習時の条件に従って、誤差逆伝搬法により機械学習モデルパラメータの重み行列およびバイアスを学習する（ステップＳ１３）。 The feature amount generator 3 receives the first estimated torque, the physical state variable, and the motor state signal as inputs, puts them together into a vector that also includes values for a certain period of time in the past, and outputs the vector as a feature amount (step S10). The machine learning model storage unit 42 initializes the weight matrix and the bias, which are machine learning model parameters, with a normal distribution and the bias with a constant (step S11). The machine learning model storage unit 42 sets hyperparameters such as early stopping and dropout as learning conditions, and adds these values to the machine learning model parameters (step S12). The second torque estimating unit 41 uses the input as a feature value and the output as an ideal value of the machine learning correction amount that is the difference between the actual torque and the first estimated torque, and the error is calculated according to the conditions at the time of learning in the machine learning model parameters. The weight matrix and bias of the machine learning model parameters are learned by the backpropagation method (step S13).

ここで、第二の推定トルクの汎化性能の向上については、上述の通り、事前に物理モデル記憶部２２において物理モデルパラメータの学習を行い、第一の推定トルクに対する機械学習補正量を自己回帰項として、機械学習モデル記憶部４２において機械学習モデルパラメータの学習を行うことによる寄与が大きい。なぜなら、モータ制御装置１００は、物理モデルパラメータと機械学習モデルパラメータとを区別することで、それぞれ運動方程式においてモデル化が容易な既知の物理現象に起因したトルク推定誤差成分と、運動方程式においてモデル化が困難なトルク推定誤差成分とを分けて学習することができ、機械学習部４１１の出力である機械学習補正量の絶対値が小さくなるためである。言い換えれば、陽に与えられるトルク推定誤差成分が機械学習補正量に含まれないため、モータ制御装置１００は、機械学習モデルパラメータに含まれるパラメータ数も少なくすることができ、結果として機械学習モデル記憶部４２における学習の過学習の防止と高速化、さらに学習に用いる教師データの削減も可能となる。 Here, for improving the generalization performance of the second estimated torque, as described above, the physical model parameters are learned in advance in the physical model storage unit 22, and the machine learning correction amount for the first estimated torque is autoregressed. As a term, learning of the machine learning model parameters in the machine learning model storage unit 42 greatly contributes. This is because the motor control device 100 distinguishes between the physical model parameters and the machine learning model parameters. This is because the absolute value of the machine learning correction amount, which is the output of the machine learning unit 411, becomes smaller. In other words, since the explicitly given torque estimation error component is not included in the machine learning correction amount, motor control device 100 can also reduce the number of parameters included in the machine learning model parameters. It is possible to prevent over-learning and increase the speed of learning in the unit 42, and to reduce teacher data used for learning.

機械学習補正量に関する自己回帰項についても同様に、実トルクと物理モデルパラメータに初期値を用いた運動方程式による算出結果の差分を機械学習モデル計算部４で学習し機械学習補正量とする構成、または、物理モデルパラメータに初期値を用いた運動方程式による算出結果を特徴量に含み第二の推定トルクを直接機械学習部４１１の出力として機械学習モデルパラメータを学習する構成に比べ、機械学習補正量の絶対値が小さくなりニューラルネットワークなどによるトルク推定誤差範囲を抑えることができる。さらに、自己回帰項を用いるメリットとして、モータ特性に依存するコギングトルク、トルクリップル、ギア間のトルク伝達におけるバックラッシュの影響など、運動方程式の式中に与えることが困難な時間軸方向に周期的に変化する推定誤差要因を学習しやすい点などが挙げられる。実際に、物理モデルパラメータを学習した後の第一の推定トルク、さらに実トルクと第一の推定トルクとの差分から機械学習モデルパラメータを学習した第二の推定トルクを図７に示す。図７は、本実施の形態に係るモータ制御装置１００の制御対象の対象機械１０の動作を表すモータ速度、モータ制御装置１００で推定される第一の推定トルク、およびモータ制御装置１００で推定される第二の推定トルクを示す図である。各項目において、横軸は時間を示している。図７に示すように、第二の推定トルクの方が、第一の推定トルクよりも実トルクに近似している。 Similarly, for the autoregression term related to the machine learning correction amount, the difference between the actual torque and the calculation result by the equation of motion using the initial values for the physical model parameters is learned by the machine learning model calculation unit 4 and used as the machine learning correction amount. Alternatively, compared to a configuration in which the machine learning model parameters are learned by directly using the second estimated torque as the output of the machine learning unit 411, the machine learning correction amount , the absolute value of becomes smaller, and the torque estimation error range by a neural network can be suppressed. Furthermore, as an advantage of using an autoregressive term, there are effects of cogging torque, torque ripple, and backlash in torque transmission between gears that depend on the motor characteristics. One of the advantages is that it is easy to learn estimation error factors that change over time. FIG. 7 shows the first estimated torque after learning the physical model parameters and the second estimated torque actually learning the machine learning model parameters from the difference between the actual torque and the first estimated torque. FIG. 7 shows the motor speed representing the operation of the target machine 10 controlled by the motor control device 100 according to the present embodiment, the first estimated torque estimated by the motor control device 100, and the FIG. 10 is a diagram showing a second estimated torque; In each item, the horizontal axis indicates time. As shown in FIG. 7, the second estimated torque is closer to the actual torque than the first estimated torque.

また、機械学習部４１１にニューラルネットワークを用いた場合、特徴量に含まれるどのパラメータが機械学習補正量にどの程度影響したかが明らかでなく、機械学習モデルパラメータの変化から対象機械１０の物性の変化を推測することは非常に困難である。しかしながら、物理モデルパラメータの学習ではそれら物性の各値を直接確認できる本実施の形態では、モータ制御装置１００は、物理モデルパラメータと機械学習モデルパラメータとを区別して学習するため、異常検知に適した構成といえる。 In addition, when a neural network is used for the machine learning unit 411, it is not clear which parameter included in the feature amount has an effect on the machine learning correction amount. Predicting change is very difficult. However, in the learning of the physical model parameters, each value of these physical properties can be directly confirmed in the present embodiment. It can be said that it is a composition.

学習を行った物理モデル計算部２および機械学習モデル計算部４を用いた異常検知の例として、主に実トルクと第二の推定トルクとの差分を監視する手法が考えられる。モータ制御装置１００は、実トルクと第二の推定トルクとの差分を監視することで、観測値がほぼ０周りに集中し、絶対値をそのまま実トルクの異常度と捉えられるためである。具体的な異常検知手法としては、実トルクと第二の推定トルクとの差分の絶対値があらかじめ設定された閾値を超えるかによって検知する単純なものでもよいし、その他に、特徴量および実トルクと第二の推定トルクとの差分を入力とした正常、異常の２クラス分類問題として、ラベル付けを行った教師データを用意し、ロジスティック回帰、サポートベクターマシン、決定木、ニューラルネットワークなどを用いて判別してもよい。 As an example of abnormality detection using the physical model calculation unit 2 and the machine learning model calculation unit 4 that have performed learning, a method of mainly monitoring the difference between the actual torque and the second estimated torque can be considered. This is because the motor control device 100 monitors the difference between the actual torque and the second estimated torque, so that the observed values are concentrated around 0, and the absolute value can be regarded as the degree of abnormality of the actual torque. A specific abnormality detection method may be a simple method of detecting whether the absolute value of the difference between the actual torque and the second estimated torque exceeds a preset threshold. As a two-class classification problem of normal and abnormal with the difference between and the second estimated torque as an input, prepare labeled teacher data and use logistic regression, support vector machine, decision tree, neural network, etc. can be determined.

モータ制御装置１００は、教師データである異常時の動作データが集めにくい場合、ＳＭＯＴＥ（ＳｙｎｔｈｅｔｉｃＭｉｎｏｒｉｔｙＯｖｅｒ－ｓａｍｐｌｉｎｇＴＥｃｈｎｉｑｕｅ）、ＡＤＡＳＹＮ（ＡＤＡｐｔｉｖｅＳＹＮｔｈｅｔｉｃ）などを用いて取得された異常時の動作データに対するオーバーサンプリングを行ってもよいし、Ｋ－ｍｅａｎｓ法、自己組織化マップなど教師なし学習により正常、異常で分かれる２クラスタに分類してもよい。また、モータ制御装置１００において、クラス数およびクラスタ数は正常、異常の２つに限定されず、異常度の度合いに合わせてより細かく段階的に設定してもよいし、実トルクと第二の推定トルクとの差分の絶対値以外の異常度を任意に設定してもよい。 Motor control device 100 over-samples abnormal operation data acquired using SMOTE (Synthetic Minority Over-sampling TECHnique), ADASYN (ADAactive SYNthetic), etc., when it is difficult to collect operational data in abnormal conditions, which is teacher data. may be performed, or by unsupervised learning such as the K-means method or self-organizing map, the clusters may be classified into two clusters, normal and abnormal. In addition, in the motor control device 100, the number of classes and the number of clusters are not limited to two, normal and abnormal, and may be set in finer steps according to the degree of abnormality. Any degree of abnormality other than the absolute value of the difference from the estimated torque may be set.

図１の破線で示した、物理モデル記憶部２２における物理モデルパラメータ、および機械学習モデル記憶部４２における機械学習モデルパラメータの学習のタイミングについて、故障検知を目的とした用途を想定すると、出荷段階の試験運転時などモータ制御装置１００が劣化度合いの比較対象として理想的な状態となる環境で実行することが望ましい。ただし、水平多関節ロボットをはじめとする産業用ロボットは、手先に力覚センサ、エンドエフェクタなどを取り付けることが多々あり、その場合も各リンクの質量、重心位置、慣性テンソルといった物性が変化するため、各状態で物理モデルパラメータおよび機械学習モデルパラメータをどちらも学習し直す必要がある。 Regarding the learning timing of the physical model parameters in the physical model storage unit 22 and the machine learning model parameters in the machine learning model storage unit 42, which are indicated by the dashed lines in FIG. It is desirable that the motor control device 100 be executed in an ideal state for comparison of the degree of deterioration, such as during test operation. However, industrial robots such as horizontal articulated robots often have force sensors and end effectors attached to their hands. , we need to relearn both the physical model parameters and the machine learning model parameters at each state.

ただし、一般的にセンサ、エンドエフェクタなどを取り付けた際、これらの物理モデルパラメータへの影響をロボットコントローラなどによって設定する手間が発生していたのに対し、本実施の形態では、物理モデル記憶部２２において、対象機械１０を一定期間動作させた際のモータ状態信号からそれらを学習することで、この設定の過程についても自動化することが可能となっている。また、物理モデルパラメータの値が正しく設定されていることで、サイクルタイム、軌跡ずれ、衝突時の押しつけ力などが減少するといった利点もあり、位置制御、速度制御などに加えて、力制御、インピーダンス制御の高速化、高精度化にも応用可能な技術であり、上述の手法の用途は異常検知に限定されるものではない。 However, in general, when a sensor, an end effector, etc. are attached, it takes time and effort to set the influence on these physical model parameters by a robot controller, etc., but in this embodiment, the physical model storage unit In 22, by learning them from the motor state signals when the target machine 10 is operated for a certain period of time, it is possible to automate this setting process as well. In addition, by setting the values of the physical model parameters correctly, there is also the advantage that the cycle time, trajectory deviation, pressing force at the time of collision, etc. are reduced. This technology can also be applied to increase the speed and accuracy of control, and the application of the above-described method is not limited to abnormality detection.

つづいて、モータ制御装置１００のハードウェア構成について説明する。モータ制御装置１００において、モータ状態観測部１、物理モデル計算部２、特徴量生成部３、および機械学習モデル計算部４は処理回路により実現される。処理回路は、メモリに格納されるプログラムを実行するプロセッサおよびメモリであってもよいし、専用のハードウェアであってもよい。 Next, the hardware configuration of the motor control device 100 will be described. In motor control device 100, motor state observation unit 1, physical model calculation unit 2, feature value generation unit 3, and machine learning model calculation unit 4 are implemented by processing circuits. The processing circuitry may be a processor and memory executing programs stored in the memory, or may be dedicated hardware.

図８は、本実施の形態に係るモータ制御装置１００が備える処理回路をプロセッサおよびメモリで構成する場合の例を示す図である。処理回路がプロセッサ９１およびメモリ９２で構成される場合、モータ制御装置１００の処理回路の各機能は、ソフトウェア、ファームウェア、またはソフトウェアとファームウェアとの組み合わせにより実現される。ソフトウェアまたはファームウェアはプログラムとして記述され、メモリ９２に格納される。処理回路では、メモリ９２に記憶されたプログラムをプロセッサ９１が読み出して実行することにより、各機能を実現する。すなわち、処理回路は、モータ制御装置１００の処理が結果的に実行されることになるプログラムを格納するためのメモリ９２を備える。また、これらのプログラムは、モータ制御装置１００の手順および方法をコンピュータに実行させるものであるともいえる。 FIG. 8 is a diagram showing an example in which a processing circuit included in motor control device 100 according to the present embodiment is configured by a processor and a memory. When the processing circuit is composed of processor 91 and memory 92, each function of the processing circuit of motor control device 100 is implemented by software, firmware, or a combination of software and firmware. Software or firmware is written as a program and stored in memory 92 . In the processing circuit, each function is realized by the processor 91 reading and executing the program stored in the memory 92 . That is, the processing circuitry includes a memory 92 for storing programs that result in the processing of the motor controller 100 being executed. It can also be said that these programs cause a computer to execute the procedures and methods of the motor control device 100 .

ここで、プロセッサ９１は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）、処理装置、演算装置、マイクロプロセッサ、マイクロコンピュータ、またはＤＳＰ（ＤｉｇｉｔａｌＳｉｇｎａｌＰｒｏｃｅｓｓｏｒ）などであってもよい。また、メモリ９２には、例えば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、フラッシュメモリ、ＥＰＲＯＭ（ＥｒａｓａｂｌｅＰｒｏｇｒａｍｍａｂｌｅＲＯＭ）、ＥＥＰＲＯＭ（登録商標）（ＥｌｅｃｔｒｉｃａｌｌｙＥＰＲＯＭ）などの、不揮発性または揮発性の半導体メモリ、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、またはＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｃ）などが該当する。 Here, the processor 91 may be a CPU (Central Processing Unit), a processing device, an arithmetic device, a microprocessor, a microcomputer, or a DSP (Digital Signal Processor). The memory 92 includes non-volatile or volatile memory such as RAM (Random Access Memory), ROM (Read Only Memory), flash memory, EPROM (Erasable Programmable ROM), EEPROM (Electrically EPROM). semiconductor memory, magnetic disk, flexible disk, optical disk, compact disk, mini disk, or DVD (Digital Versatile Disc).

図９は、本実施の形態に係るモータ制御装置１００が備える処理回路を専用のハードウェアで構成する場合の例を示す図である。処理回路が専用のハードウェアで構成される場合、図９に示す処理回路９３は、例えば、単一回路、複合回路、プログラム化したプロセッサ、並列プログラム化したプロセッサ、ＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）、ＦＰＧＡ（ＦｉｅｌｄＰｒｏｇｒａｍｍａｂｌｅＧａｔｅＡｒｒａｙ）、またはこれらを組み合わせたものが該当する。モータ制御装置１００の各機能を機能別に処理回路９３で実現してもよいし、各機能をまとめて処理回路９３で実現してもよい。 FIG. 9 is a diagram showing an example in which a processing circuit included in motor control device 100 according to the present embodiment is configured with dedicated hardware. When the processing circuit is composed of dedicated hardware, the processing circuit 93 shown in FIG. 9 is, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, an ASIC (Application Specific Integrated Circuit) An FPGA (Field Programmable Gate Array) or a combination thereof is applicable. Each function of the motor control device 100 may be realized by the processing circuit 93 for each function, or the functions may be collectively realized by the processing circuit 93 .

なお、モータ制御装置１００の各機能について、一部を専用のハードウェアで実現し、一部をソフトウェアまたはファームウェアで実現するようにしてもよい。このように、処理回路は、専用のハードウェア、ソフトウェア、ファームウェア、またはこれらの組み合わせによって、上述の各機能を実現することができる。 It should be noted that each function of the motor control device 100 may be partly realized by dedicated hardware and partly realized by software or firmware. Thus, the processing circuitry may implement each of the functions described above through dedicated hardware, software, firmware, or a combination thereof.

以上説明したように、本実施の形態では、モータ制御装置１００のモデルを、対象機械１０の力学的物性と運動方程式上で陽に与えられる物理現象の情報を利用してトルクの推定を行う物理モデル計算部２と、物理モデル計算部２の推定トルクに対する補正量を自己回帰項として含み、運動方程式上で表現が困難な非線形性の強いトルク成分の推定を行う機械学習モデル計算部４の組み合わせで表現する。モータ制御装置１００は、これらを順に学習することで、実トルクに対するトルク推定精度および汎化性能が高い対象機械１０のモデルを、少量の動作データから高速に学習することができる。また、モータ制御装置１００は、実トルクとトルク推定値とを比較する、すなわち実機と学習したモデルとを比較することで、軽微な異常についても検知が可能となる。 As described above, in the present embodiment, the model of the motor control device 100 is a physical model for estimating the torque using the information of the physical phenomena explicitly given on the mechanical properties of the target machine 10 and the equation of motion. A combination of a model calculation unit 2 and a machine learning model calculation unit 4 that includes a correction amount for the estimated torque of the physical model calculation unit 2 as an autoregressive term and estimates a highly nonlinear torque component that is difficult to express on the equation of motion. expressed in By sequentially learning these, the motor control device 100 can quickly learn a model of the target machine 10 with high torque estimation accuracy and generalization performance with respect to the actual torque from a small amount of operation data. In addition, the motor control device 100 can detect even minor abnormalities by comparing the actual torque and the estimated torque value, that is, by comparing the actual machine and the learned model.

すなわち、モータ制御装置１００は、物理モデルパラメータと機械学習モデルパラメータとを順に学習することで、それぞれ対象機械１０の力学的物性、運動方程式上で陽に与えられる物理現象によるトルク成分と、運動方程式上で表現が困難な非線形性の強いトルク成分とを区別して学習し、推定することができる。これにより、モータ制御装置１００は、学習全体にかかる時間を短縮し、機械学習補正量の絶対値が物理モデルパラメータの学習により小さくなるので、実トルクに対する第二の推定トルクの汎化性能を向上させることができる。また、モータ制御装置１００は、第二のトルク推定部４１において、入出力関係の自己回帰項により時間軸方向の特徴量間の相関を効率的に学習させることで、第二の推定トルクの推定精度を向上させることができる。 That is, the motor control device 100 sequentially learns the physical model parameters and the machine learning model parameters to obtain the mechanical properties of the target machine 10, the torque component due to the physical phenomenon explicitly given in the equation of motion, and the equation of motion. It can be learned and estimated by distinguishing it from the highly nonlinear torque component, which is difficult to express above. As a result, the motor control device 100 shortens the time required for the entire learning, and since the absolute value of the machine learning correction amount becomes smaller due to the learning of the physical model parameters, the generalization performance of the second estimated torque with respect to the actual torque is improved. can be made In addition, the motor control device 100 allows the second torque estimator 41 to efficiently learn the correlation between the feature quantities in the time axis direction using the autoregressive term of the input/output relationship, thereby estimating the second estimated torque. Accuracy can be improved.

なお、本実施の形態では、水平多関節ロボットを制御対象としていたが、モータ制御装置１００は、機械の軸数、関節の直動、回転の機構によらずに適用することができるため、垂直多関節ロボット、ＮＣ工作機械、実装機などにも適用することが可能である。 In the present embodiment, a horizontal multi-joint robot is controlled. However, the motor control device 100 can be applied regardless of the number of machine axes, the linear motion of the joints, and the rotation mechanism. It can also be applied to articulated robots, NC machine tools, mounters, and the like.

以上の実施の形態に示した構成は、本発明の内容の一例を示すものであり、別の公知の技術と組み合わせることも可能であるし、本発明の要旨を逸脱しない範囲で、構成の一部を省略、変更することも可能である。 The configuration shown in the above embodiment shows an example of the contents of the present invention, and it is possible to combine it with another known technology, and one configuration can be used without departing from the scope of the present invention. It is also possible to omit or change the part.

１モータ状態観測部、２物理モデル計算部、３特徴量生成部、４機械学習モデル計算部、１０対象機械、１１ｌ第１リンク、１１ｊ第１軸、１２ｌ第２リンク、１２ｊ第２軸、１３ｌ第３リンク、１３ｊ第３軸、１４ｌ第４リンク、１４ｊ第４軸、１５ｅ手先部、２１第一のトルク推定部、２２物理モデル記憶部、４１第二のトルク推定部、４２機械学習モデル記憶部、１００モータ制御装置、４１１機械学習部。 1 motor state observation unit 2 physical model calculation unit 3 feature amount generation unit 4 machine learning model calculation unit 10 target machine 11l first link 11j first axis 12l second link 12j second axis 13l Third link 13j Third axis 14l Fourth link 14j Fourth axis 15e Hand part 21 First torque estimation part 22 Physical model storage part 41 Second torque estimation part 42 Machine learning model storage part, 100 motor control device, 411 machine learning part.

Claims

A motor control device for controlling a motor that drives a target machine,
a motor state observation unit that observes the state of the motor and outputs an observation result as a motor state signal;
Based on the motor state signal and the physical model parameters of the target machine, a first estimated torque, which is an estimated value of the torque of the motor, and a physical state variable, which is a component of the first estimated torque, are calculated using an equation of motion. a first torque estimator that outputs;
a physical model storage unit that learns and stores the physical model parameters so as to reduce the difference between the actual torque included in the motor state signal and the first estimated torque;
a feature quantity generator that generates and outputs a feature quantity from at least one of the first estimated torque, the physical state variable, and the motor state signal;
A machine learning correction amount is calculated from the feature amount and the machine learning model parameters using machine learning including an autoregressive term in the input/output relationship, and the calculated machine learning correction amount is added to the first estimated torque to obtain a second a second torque estimator that outputs two estimated torques;
Machine learning model storage for learning the machine learning model parameters by supervised learning so that the machine learning correction amount approaches the difference between the actual torque and the first estimated torque, and storing the learned machine learning model parameters. Department and
A motor control device comprising:

The second torque estimating unit uses machine learning using a neural network as machine learning including an autoregressive term in the input/output relationship,
The machine learning model storage unit uses an error backpropagation method as a supervised learning method used when learning the machine learning model parameters,
2. The motor control device according to claim 1, wherein:

In machine learning including an autoregressive term in the input/output relationship used to calculate the machine learning correction amount, the second torque estimator uses the difference between the actual torque and the first estimated torque as an autoregressive term. using
3. The motor control device according to claim 2, wherein:

The feature amount generating unit includes a storage device capable of storing the first estimated torque, the physical state variable, and the motor state signal, and the first estimated torque stored in the storage device and the physical outputting state variables and features extracted from the motor state signal as features;
4. The motor control device according to any one of claims 1 to 3, characterized in that:

The physical model parameters include a mass in a linkage of the target machine, a center of gravity position in the linkage of the target machine, and an inertia tensor in the linkage of the target machine.
5. The motor control device according to any one of claims 1 to 4, characterized in that:

The first torque estimating unit uses a motion equation including a continuous and smooth friction model that outputs an estimated value of the frictional component of the torque with the speed of the motor determined by the motor state signal as an input,
6. The motor control device according to any one of claims 1 to 5, characterized in that:

the physical model storage unit learns and stores coefficients of the friction model as the physical model parameters;
7. The motor control device according to claim 6, wherein:

wherein the physical state variables include the friction model;
8. The motor control device according to claim 7, characterized in that:

The physical model storage unit alternately estimates coefficients of the friction model and other physical model parameters in learning the physical model parameters.
9. The motor control device according to claim 8, wherein:

The physical model storage unit uses a least-squares method, a regularized least-squares method, a nonlinear least-squares method, or a Kalman filter in learning the physical model parameters.
10. The motor control device according to claim 9, characterized in that: