JP3242950B2

JP3242950B2 - Predictive control method

Info

Publication number: JP3242950B2
Application number: JP20449491A
Authority: JP
Inventors: 誠加納
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1991-08-14
Filing date: 1991-08-14
Publication date: 2001-12-25
Anticipated expiration: 2016-12-25
Also published as: US5428559A; GB2258742A; GB2258742B; GB9216952D0; JPH0546205A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】この発明は、例えばマニピュレー
タのごとき例えば非線形制御対象のサンプル時の制御量
を予測する予測制御方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a predictive control method for predicting a control amount at the time of sampling a nonlinear control object such as a manipulator.

【０００２】[0002]

【従来の技術】従来、マニピュレータのごとき非線形制
御対象の制御方法として、非線形補償とフィードバック
制御を組み合わせた方法や、非線形制御対象の数式モデ
ルから操作量を求める逆プラント手法が考えられてい
る。これらの各方法は、いずれも数式モデル（動特性の
数式モデル）が未知ならば、正確な制御はできない。2. Description of the Related Art Conventionally, as a control method of a non-linear controlled object such as a manipulator, a method combining non-linear compensation and feedback control, and an inverse plant method of obtaining an operation amount from a mathematical model of the non-linear controlled object have been considered. In each of these methods, accurate control cannot be performed unless a mathematical model (a mathematical model of dynamic characteristics) is unknown.

【０００３】また、以上述べた方法以外に、神経回路を
用いて学習により神経回路が制御対象の動特性を獲得
し、これにより、数式モデルが未知であっても、制御が
可能な神経回路を用いた非線形制御対象の制御方法が幾
つか提案されている。[0003] In addition to the method described above, a neural circuit obtains the dynamic characteristics of a control target by learning using the neural circuit, so that a neural circuit that can be controlled even if the mathematical model is unknown. Several control methods of the non-linear control object used have been proposed.

【０００４】[0004]

【発明が解決しようとする課題】ところが、前者の各方
法は、いずれも制御対象の数式モデル（動特性の数式モ
デル）が未知ならば、正確な制御はできない。また、後
者の各方法は、目標制御量から操作量を計算するフィー
ドフォワード制御であるため、外乱に対する補償ができ
ない。However, in each of the former methods, accurate control cannot be performed unless the mathematical model of the controlled object (the mathematical model of dynamic characteristics) is unknown. In addition, since the latter methods are feedforward controls that calculate an operation amount from a target control amount, compensation for disturbance cannot be performed.

【０００５】この発明は、前記の事情に鑑みてなされた
もので、動特性の数式モデルが未知である制御対象につ
いても予測制御が行え、且つ、外乱に対しても補償がで
きる制御対象の予測制御方法を提供することを目的とす
る。SUMMARY OF THE INVENTION The present invention has been made in view of the above circumstances, and it is possible to perform predictive control even for a controlled object whose mathematical model of dynamic characteristics is unknown, and to predict a controlled object capable of compensating for a disturbance. It is an object to provide a control method.

【０００６】[0006]

【課題を解決するための手段】この発明は、前記目的を
達成するために、以下のような工程を含んでいる。すな
わち、請求項１に対応する発明は、制御すべき制御対象
に対して操作量を与えて予測制御を行う予測制御方法に
おいて、多層神経回路を前記制御対象の同定器として用
い、この同定器に前記制御対象に対するサンプル時の制
御量および操作量を入力し、前記サンプリング時から所
定時間後の制御量を予測する第１の工程と、この第１の
工程で得られた予測値と、前記制御対象の目標制御量の
誤差を求める第２の工程と、この第２の工程で得られた
制御量誤差を前記同定器に入力し、誤差逆伝播法により
現時刻の操作量の誤差を求めて、前記制御対象に与えら
れる操作量を修正する第３の工程とを合んでいる。請求
項２に対応する発明は、請求項１における第２および第
３の工程を繰り返し行う第４の工程を含んでいる。The present invention includes the following steps to achieve the above object. That is, the invention corresponding to claim 1 provides a predictive control method for performing predictive control by giving an operation amount to a control target to be controlled, wherein a multilayer neural network is used as an identifier of the control target, and enter the control amount and the operation amount during the sample relative to the control target, where the time of the sampling
A first step of predicting a control amount after a fixed time, a second step of calculating an error between the predicted value obtained in the first step, and a target control amount of the control object, and a second step of obtained in
Enter the control amount error to said identifier, seeking error in the operation amount of the present time by the error back propagation method, by N if a third step of correcting the manipulated variable applied to the controlled object. The invention corresponding to claim 2 includes a fourth step in which the second and third steps in claim 1 are repeatedly performed.

【０００７】[0007]

【作用】この発明によれば、制御対象の操作量および制
御量を多層神経回路に入力して前向き計算により、サン
プル時の制御量の予測ができ、神経回路の誤差逆伝播計
算により、予測された制御量および目標制御量との誤差
から操作量の修正量を計算することができ、これにより
特性が未知の制御対象、あるいは、動特性の非線形性が
強い制御対象等の動特性の数式モデルが未知である制御
対象（特性が未知の制御対象、あるいは、動特性の非線
形性が強い制御対象）であっても制御できる。また、多
層神経回路の前向き計算では、その時刻に実現されてい
る制御量を操作量とともに、入力してサンプリング周期
後の制御量が目標制御量に近付くように操作量が修正さ
れるので、外乱に対しても補償できる。According to the present invention, the manipulated variable and the controlled variable of the controlled object are input to the multilayer neural network, and the control variable at the time of sampling can be predicted by forward calculation, and can be predicted by the error back propagation calculation of the neural circuit. The amount of correction of the manipulated variable can be calculated from the difference between the control amount and the target control amount, and this allows the mathematical model of the dynamic characteristics of a controlled object with unknown characteristics or a highly controlled nonlinear dynamic characteristic. Can be controlled even if the control target is unknown (a control target having unknown characteristics or a control target having strong nonlinearity in dynamic characteristics). In the forward calculation of the multilayer neural network, the control amount realized at that time is input together with the operation amount, and the operation amount is corrected so that the control amount after the sampling cycle approaches the target control amount. Can be compensated for.

【０００８】[0008]

【実施例】以下、本発明の実施例について図面を参照し
て説明する。図１は本発明を実施するための装置の概略
構成を示すブロック図である。制御対象例えばマニピュ
レータ１、多層型神経回路モデルからなる同定器２、操
作量発生器３、積分器４、スイッチ５、フィードバック
ループ６に設けられた時間遅れ要素７からなっている。Embodiments of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing a schematic configuration of an apparatus for carrying out the present invention. The control target includes, for example, a manipulator 1, an identifier 2 composed of a multilayer neural network model, an operation amount generator 3, an integrator 4, a switch 5, and a time delay element 7 provided in a feedback loop 6.

【０００９】ここでは、制御対象の一例としては、図２
に示す第１関節１１，第２関節１２を有する２関節マニ
ピュレータを例に挙げて説明するが、これに限らず、他
の非線形制御対象であってもよい。マニピュレータ１
は、第１リンク１３がｘ軸となす角を第１関節角θ1 、
リンク１３の延長線と第２リンク１４のなす角を第２関
節角θ2 とし、第１関節トルクτ1 は、第１関節１１
に、また第２関節トルクτ2 は、第２関節１２に各々の
関節角の正方向に加えられる場合を示している。Here, as an example of the control object, FIG.
The following describes an example of a two-joint manipulator having a first joint 11 and a second joint 12 shown in FIG. 1, but the present invention is not limited to this, and other non-linear control targets may be used. Manipulator 1
Is the angle that the first link 13 makes with the x-axis is the first joint angle θ1,
The angle between the extension of the link 13 and the second link 14 is defined as a second joint angle θ2, and the first joint torque τ1
The second joint torque .tau.2 is applied to the second joint 12 in the positive direction of each joint angle.

【００１０】マニピュレータ１には、時刻ｎΔｔに操作
量である関節角トルクＵｎ＝（τ1，n,τ2,n ）^T が
入力され、ここで、Ｘn+1 ＝Ｆ（Ｘn ，Ｕn ）、すなわ
ち、サンプリング周期Δｔ後の制御量である状態Ｘ_n+1
＝（θ₁ ，_n+1 ，θ₂ ，_n+1，θ´₁ ，_n+1 ，θ´₂ ，
_n+1 ）^T が計測されるようになつている。At time nΔt, the manipulator 1 receives a joint angle torque Un = (τ1, n, τ2, n) ^T which is an operation amount, where Xn + 1 = F (Xn, Un), that is, State X _{n + 1} which is a control amount after sampling period Δt
_{_{= (Θ 1, n + 1}} , θ 2, n + 1, θ'1, n + 1, θ'2,
_{n + 1} ) ^T is measured.

【００１１】ここで、τ1, nおよびτ2,n は、それぞれ
時刻nΔｔの第１関節トルクおよび第２関節トルクを示
している。Ｕn はトルクベクトルを示している。θ'
_1,nおよびθ' _2,nは時刻 nΔｔの第１関節角速度およ
び第２関節角速度を示し、Ｘ_nはこれらを成分とする状
態ベクトルである。Here, τ1, n and τ2, n indicate the first joint torque and the second joint torque at time nΔt, respectively. Un indicates a torque vector. θ '
_{1, n} and θ ′ _{2, n} indicate the first joint angular velocity and the second joint angular velocity at time nΔt, and X _n is a state vector having these as components.

【００１２】同定器２は、図３（ａ），（ｂ）に示すよ
うに多層型神経回路モデルを有しており、この第１の機
能は、後述する誤差逆伝搬学習法により、図３（ａ）に
示すように操作量Ｕn とマニピュレータ１の制御量Ｘn
を入力すると、サンプル時（サンプリング周期後）の制
御量の予測値Ｚn+1 が出力されることであり、また、第
２の機能は、図３（ｂ）に示すように予測値Ｚn+1 と目
標値ｄn+1 の差（誤差量）を用いて、誤差逆伝播計算に
よりトルクＵn の誤差ΔＵn を計算することである。図
３の神経回路モデルは、右側が第１層で、左側は最終層
になっている。The identifier 2 has a multilayer neural network model as shown in FIGS. 3 (a) and 3 (b). The first function of the identifier 2 is as shown in FIG. As shown in (a), the operation amount Un and the control amount Xn of the manipulator 1 are determined.
Is input, the predicted value Zn + 1 of the control amount at the time of sampling (after the sampling period) is output. The second function is as shown in FIG. Is to calculate the error ΔUn of the torque Un by the error back propagation calculation using the difference between the target value dn + 1 and the target value dn + 1. In the neural circuit model of FIG. 3, the right side is the first layer, and the left side is the final layer.

【００１３】ここで、図４のフローチャートを参照し
て、誤差逆伝播学習方法について説明する。この誤差逆
伝播学習方法は、現在、最も一般的に利用されている方
法である。この方法は、結合荷重空間に定義された誤差
関数曲面の最急降下方向を計算し、その方向に結合荷重
を変化させる方法である。初めに、結合荷重ｗの初期値
を乱数により設定する（Ｓ１）。次に、神経回路モデル
に学習させたいデータ、すなわち、入力信号Ｕi,Ｘi 、
および、その入力に対する望ましい出力信号ｔiを設定
する（Ｓ２）。ただし、i ＝１，２，…，data maxであ
る。Here, the error back propagation learning method will be described with reference to the flowchart of FIG. This backpropagation learning method is currently the most commonly used method. This method is a method of calculating the steepest descent direction of an error function surface defined in a connection load space and changing the connection load in that direction. First, an initial value of the connection weight w is set by a random number (S1). Next, data that the neural network model wants to learn, that is, input signals Ui, Xi,
Then, a desired output signal ti for the input is set (S2). Here, i = 1, 2,..., Data max.

【００１４】次に、全てのデータについて、結合荷重の
変化量を計算する（Ｓ５ーＳ１０）。データの入力信号
Ｕi ，Ｘi を神経回路モデルに入力し、出力信号（神経
回路モデルにＸi ，Ｕi を入力したときの出力信号）Ｚ
i を、前向き計算で求める（Ｓ６）。該出力値Ｚi と望
ましい出力値ｔi から、誤差関数Ｅi を次のように定義
する（Ｓ７）。１／２（ｔi −Ｚi ）^T （ｔi −Ｚi ）Next, the amount of change in the connection load is calculated for all the data (S5-S10). Data input signals Ui and Xi are input to a neural circuit model, and output signals (output signals when Xi and Ui are input to the neural circuit model) Z
i is obtained by forward calculation (S6). From the output value Zi and the desired output value ti, an error function Ei is defined as follows (S7). 1/2 (ti-Zi) ^T (Ti-Zi)

【００１５】さらに、誤差逆伝播計算により、結合荷重
空間における誤差関数Ｅi の最急降下方向Δｗi （i 番
目の学習データにより計算された神経回路の結合荷重の
変化量）を計算する（Ｓ８）。Ｓ８の式において、ｄＥ
i ／ｄｗ、ｄｇ／ｄｗはいずれも偏微分を示している。
Ｓ６からＳ８の計算をすべてのデータについて行う（Ｓ
５，Ｓ９，Ｓ１０）。そして、求められた各々のデータ
最急降下方向を使い、結合荷重ｗを次のように変化させ
る（Ｓ１１）。すなわち、ｗ＋ε（Δｗ1 ＋Δｗ2 ＋…＋Δｗ datamax）→ｗである。ただし、εは学習定数と呼ばれる結合荷重の変
化量のパラメータである。以上述べたステップをitemax
回繰り返すことにより、誤差関数を減少させていく（Ｓ
３，Ｓ４，Ｓ１２，Ｓ１３）。以上のようにして学習を
し終えた同定器２のマニピュレータ１のダイナミクスの
予測値Ｚn+1 を、次のような関数として表わすことがで
きる。すなわち、Ｚn+1 ＝ｇ（Ｘn ，Ｕn ，Ｗ^*）である。この場合、Ｗ^*は学習後の神経回路モデルの結
合荷重を示し、Ｚn+1 は時刻（n+1 ）Δｔの制御量の予
測値である。図１において、操作量発生器３は、運動開
始時だけ初期操作量Ｕo を発生する働きを有している。Further, the steepest descent direction Δwi of the error function Ei in the connection weight space (the amount of change in the connection weight of the neural circuit calculated from the i-th learning data) is calculated by the error back propagation calculation (S8). In the equation of S8, dE
Both i / dw and dg / dw indicate partial differentiation.
The calculations from S6 to S8 are performed for all data (S
5, S9, S10). Then, using the obtained steepest descent direction of each data, the coupling load w is changed as follows (S11). That is, w + ε (Δw1 + Δw2 +... + Δwdatamax) → w. Here, ε is a parameter of the amount of change in the connection weight called a learning constant. Iterate the steps mentioned above
The error function is reduced by repeating the process (S
3, S4, S12, S13). The predicted value Zn + 1 of the dynamics of the manipulator 1 of the identifier 2 having completed learning as described above can be expressed as a function as follows. That is, Zn + 1 = g (Xn, Un, W ^* ). In this case, W ^* indicates the connection weight of the neural network model after learning, and Zn + 1 is the predicted value of the control amount at time (n + 1) Δt. In FIG. 1, an operation amount generator 3 has a function of generating an initial operation amount Uo only at the start of exercise.

【００１６】同定器２からの修正量ΔＵn は積分器４に
より操作量Ｕnに加算される。このサンプリング周期後
の制御量予測と操作量の修正の計算は、サンプリング時
間内に１回、または、複数回行われる。スイッチ５はサ
ンプリング時間（単位時間）毎に動作し、このときマニ
ピュレータ１に操作量Ｕn を出力する働きを有してい
る。The correction amount ΔUn from the identifier 2 is added to the operation amount Un by the integrator 4. The control amount prediction and the correction of the manipulated variable after the sampling period are performed once or a plurality of times within the sampling time. The switch 5 operates every sampling time (unit time), and has a function of outputting the operation amount Un to the manipulator 1 at this time.

【００１７】次に、図５を参照してマニピレータ１の制
御動作について説明する。初めに目標軌道（ｄn ）が設
定される（Ｓ２０）。続いて、マニピュレータ１が初期
姿勢（Ｘ0 ＝ｄ0 ）をとるための操作量Ｕ0 ＝Ｕ（ｄ0
）が、操作量発生器３で設定される（Ｓ２１）、これ
が積分器４およびスイッチ５を介してマニピュレータ１
へ出力され、マニピュレータ１の目標軌道の初期姿勢に
一致する。そして、マニピュレータ１の運動が開始され
ると、サンプリング時間Δｔ後の制御量の予測と誤差逆
伝播法による操作量の修正が繰り返される（Ｓ２５ーＳ
３１）。Next, a control operation of the manipulator 1 will be described with reference to FIG. First, a target trajectory (dn) is set (S20). Then, the operation amount U0 = U (d0) for the manipulator 1 to take the initial posture (X0 = d0).
) Is set by the manipulated variable generator 3 (S 21), and this is set via the integrator 4 and the switch 5.
To the initial position of the target trajectory of the manipulator 1. When the movement of the manipulator 1 is started, the prediction of the control amount after the sampling time Δt and the correction of the operation amount by the error back propagation method are repeated (S25-S).
31).

【００１８】操作量Ｕｎと時間遅れ７を介してフィード
バックされた制御量Ｘｎが神経回路モデルに入力され、
サンプリング時間Δｔ秒後の制御量の予測値Ｚn+1 が求
められる（Ｓ２６）。この予測値Ｚn+1 と目標値ｄn+1
の差ΔＸn+1 ＝目標値ｄn+1−予測値Ｚn+1 から誤差関
数Ｅn+1 が次のように定義される（Ｓ２７）。Ｅn+1 ＝１／２（ΔＸn+1 ）^T Ｋｓ（ΔＸn+1 ）ただし、Ｋｓはゲイン行列である。前述した誤差逆伝播
法により、この誤差関数を減少させるように入力信号の
補正量が求められる（Ｓ２８）。 ΔＵn ＝−（ｄＥn+1)／（ｄＵn ）＝ｄｇ（Ｘn ，Ｕn ，Ｗ^*）／ｄＵn ×（ＫｓΔＸn+1 ） ΔＸn ＝−（ｄＥn+1)／（ｄＸn ）＝ｄｇ（Ｘn ，Ｕn ，Ｗ^*）／ｄＸn ×（ＫｓΔＸn+1 ）そして、入力信号のうち、操作量の値が、（Ｕn+ΔＵn
→Ｕn ）のように修正される（Ｓ２９）。The manipulated variable Un and the control variable Xn fed back via the time delay 7 are input to the neural circuit model,
The predicted value Zn + 1 of the control amount after the sampling time Δt seconds is obtained (S26). The predicted value Zn + 1 and the target value dn + 1
The error function En + 1 is defined as follows from the difference ΔXn + 1 = target value dn + 1−predicted value Zn + 1 (S27). En + 1 = 1/2 (ΔXn + 1) ^T Ks (ΔXn + 1) where Ks is a gain matrix. The correction amount of the input signal is obtained by the above-described error back propagation method so as to reduce the error function (S28). ΔUn = − (dEn + 1) / (dUn) = dg (Xn, Un, W ^* ) / dUn × (KsΔXn + 1) ΔXn = − (dEn + 1) / (dXn) = dg (Xn, Un, W) ^* ) / DXn × (KsΔXn + 1) The value of the manipulated variable in the input signal is (Un + ΔUn)
→ Un) (S29).

【００１９】再び、修正後の入力信号であるサンプリン
グ時間後の制御量の予測値Ｚn+1 が求められ、操作量の
修正が繰り返される。修正を一定回数（ｋ) 繰り返した
後、サンプリング時刻に操作量がマニピュレータ１へ入
力される（Ｓ３２）。以上のステップを最終時刻（time
f ）まで繰り返す（Ｓ２２，Ｓ２３，Ｓ３３，Ｓ３
５）。Again, the predicted value Zn + 1 of the control amount after the sampling time, which is the corrected input signal, is obtained, and the correction of the manipulated variable is repeated. After the correction is repeated a fixed number of times (k), the manipulated variable is input to the manipulator 1 at the sampling time (S32). Repeat the above steps for the final time (time
f) (S22, S23, S33, S3)
5).

【００２０】そして、初期時刻には操作量発生器３から
の操作量Ｕn が与えられるが、その後は、前時刻の操作
量Ｕn-1 の値を現時刻の操作量Ｕn の初期値にする（Ｓ
３４）。神経回路モデルがマニピュレータのダイナミク
スを十分に学習していれば、神経回路モデルの結合荷重
は、正しい値なので、予測値Ｚn+1 と目標値ｄn+1 に誤
差が生じたとすれば、それは入力信号である制御量Ｘn
と操作量Ｕn に原因がある。このうち、制御量Ｘn は計
測値であるから、操作量Ｕn だけ修正して、現在の制御
量Ｘn からサンプリング時刻後に目標値ｄn+1 に近づく
ようにする。サンプリング時間が短く、ある時刻の操作
量と次の時刻の操作量が近い値であれば、前時刻の操作
量の値を初期値として使うことにより、少ない回数で目
標状態に達する操作量の値が求められることが期待でき
る。At the initial time, the manipulated variable Un from the manipulated variable generator 3 is given. Thereafter, the value of the manipulated variable Un-1 at the previous time is set to the initial value of the manipulated variable Un at the current time ( S
34). If the neural network model has sufficiently learned the dynamics of the manipulator, the connection weight of the neural network model is a correct value. Therefore, if an error occurs between the predicted value Zn + 1 and the target value dn + 1, it means that the input signal Control amount Xn
And the operation amount Un. Since the control amount Xn is a measured value, the control amount Xn is corrected by the operation amount Un so as to approach the target value dn + 1 after the sampling time from the current control amount Xn. If the sampling time is short and the manipulated variable at one time is close to the manipulated variable at the next time, the value of the manipulated variable that reaches the target state in a small number of times by using the value of the manipulated variable at the previous time as the initial value Can be expected.

【００２１】図６および図７は、図２のマニピュレータ
を制御対象とし、図５のフローチャートにおいて、サン
プリング時刻に行う繰り返しを３回にした場合のシミュ
レーション結果を示すもので、図６（ａ）は第１関節角
度、図６（ｂ）は第２関節角度であり、図７（ａ）は第
１関節トルクの波形図、図７（ｂ）は第２関節トルクの
波形図を示している。図６、図７の横軸は運動時間（２
秒間）を示し、図６の縦軸は関節角軌道で、その単位は
ラジアン（ｒａｄ）であり、図７の縦軸はトルク波形
で、その単位はニュートンメートル（Ｎｍ）である。さ
らに、図６の破線は目標軌道を示し、実線は実現軌道を
示している。図６から明らかなように、実現軌道（実
線）が目標軌道（破線）にほぼ重なっていることから、
目標軌道に良く追従していることがわかる。この場合の
トルク波形は、図７に示す通りであり、図５のフローチ
ャートにおいて、サンプリング時刻に行う繰り返し回数
を４回以上行った実験では、実現軌道はより目標軌道に
近づき、トルク波形もさらに滑らかになった。FIGS. 6 and 7 show simulation results when the manipulator of FIG. 2 is to be controlled and the repetition performed at the sampling time is three times in the flowchart of FIG. 5, and FIG. FIG. 6B shows the first joint angle, FIG. 6B shows the second joint angle, FIG. 7A shows the waveform diagram of the first joint torque, and FIG. 7B shows the waveform diagram of the second joint torque. The horizontal axis in FIGS. 6 and 7 indicates the exercise time (2
The vertical axis in FIG. 6 is the joint angle trajectory, the unit is radian (rad), and the vertical axis in FIG. 7 is the torque waveform, and the unit is Newton meter (Nm). Further, the broken line in FIG. 6 indicates the target trajectory, and the solid line indicates the realized trajectory. As is apparent from FIG. 6, since the realized trajectory (solid line) almost overlaps the target trajectory (dashed line),
It can be seen that the vehicle follows the target trajectory well. The torque waveform in this case is as shown in FIG. 7. In the flowchart of FIG. 5, in an experiment in which the number of repetitions performed at the sampling time is four or more, the realized trajectory is closer to the target trajectory, and the torque waveform is further smooth. Became.

【００２２】なお、本発明は前記実施例に限定されるこ
となく、本発明の要旨を逸脱しない範囲において種々変
形可能であることは勿論である。例えば、前記実施例で
は、非線形制御対象としてマニピュレータを例にあげた
が、これ以外の線形制御対象であっても同様に適用可能
である。It should be noted that the present invention is not limited to the above-described embodiment, but may be variously modified without departing from the gist of the present invention. For example, in the above-described embodiment, a manipulator has been described as an example of a non-linear control target, but other linear control targets can be similarly applied.

【００２３】前述の実施例では、最も簡単な例として１
サンプリング時間後の制御量だけを予測する例を示した
が、制御対象の動特性を示す微分方程式の次数が１次よ
り高い場合には、１サンプリング時間よりも更に先の時
間の制御量の予測をし、その予測値の誤差から逆伝播法
により、操作量の修正をすることが必要がある。このよ
うな場合でも、１サンプリング時間後の制御量予測だけ
でなく、２サンプリング時間後、３サンプリング時間後
というように、神経回路モデルに先の制御量を予測する
ように学習させておくことにより、動特性を示す微分方
程式の次数が高い制御対象でも制御することができる。
図８、図９に２サンプリング時間後までの制御量を予測
する神経回路モデルを示す。In the above embodiment, the simplest example is 1
Although the example in which only the control amount after the sampling time is predicted has been described, when the order of the differential equation indicating the dynamic characteristic of the control target is higher than the first order, the control amount is predicted at a time further than one sampling time. It is necessary to correct the manipulated variable by the back propagation method from the error of the predicted value. Even in such a case, by learning not only the control amount prediction after one sampling time but also the neural circuit model so as to predict the previous control amount such as after two sampling times and three sampling times. In addition, it is possible to control even a controlled object having a high order of a differential equation showing dynamic characteristics.
8 and 9 show neural circuit models for predicting the control amount up to two sampling times later.

【００２４】図８は３層からなる神経回路モデルで、図
８（ａ）の第１層（最右層）から操作量Ｕn 、制御量Ｘ
n が入力されると、第３層から１サンプリング時間後の
制御量の予測値Ｚn+1 と２サンプリング時間後の制御量
予測値Ｚn+2 が出力される。図８（ｂの誤差逆伝播計算
では、サンプリング時間後の予測値の誤差ΔＸn+1 ＝ｄ
n+1 −Ｚn+1 と２サンプリング時間後の予測値の誤差Δ
Ｘn+2 ＝ｄn+2 −Ｚn+2 が第３層から逆伝播され、操作
量の修正量ΔＵn が求められる。FIG. 8 shows a neural network model composed of three layers. The operation amount Un and the control amount X from the first layer (rightmost layer) in FIG.
When n is input, a predicted value Zn + 1 of the control amount after one sampling time and a predicted value Zn + 2 of the control amount after two sampling times are output from the third layer. In the error backpropagation calculation in FIG. 8B, the error ΔXn + 1 = d of the predicted value after the sampling time
Error Δ between n + 1−Zn + 1 and the predicted value after two sampling times
Xn + 2 = dn + 2−Zn + 2 is back-propagated from the third layer, and the correction amount ΔUn of the manipulated variable is obtained.

【００２５】図９は５層からなる神経回路モデルで、図
９（ａ）の第１層（最右層）から操作量Ｕn 、制御量Ｘ
n が入力されると、第３層から１サンプリング時間後の
制御量の予測値Ｚn+1 、第５層から２サンプリング時間
後の制御量の予測値Ｚn+2 が出力される。そして、図９
（ｂ）の誤差逆伝播計算では、１サンプリング時間後の
予測値の誤差ΔＸn+1 ＝ｄn +1−Ｚn+1 は第３層から、
２サンプリング時間後の予測値の誤差ΔＸn+2 ＝ｄn +2
−Ｚn+2 は第５層から逆伝播され、操作量の修正量ΔＵ
n が求められる。FIG. 9 shows a neural network model composed of five layers. The operation amount Un and the control amount X are calculated from the first layer (rightmost layer) in FIG.
When n is input, a predicted value Zn + 1 of the control amount after one sampling time from the third layer and a predicted value Zn + 2 of the control amount after two sampling times from the fifth layer are output. And FIG.
In the error backpropagation calculation of (b), the error ΔXn + 1 = dn + 1−Zn + 1 of the predicted value after one sampling time is obtained from the third layer.
Error ΔXn + 2 = dn + 2 of the predicted value after two sampling times
−Zn + 2 is back-propagated from the fifth layer, and the correction amount ΔU
n is required.

【００２６】[0026]

【発明の効果】以上述べたこの発明によれば、動特性の
数式モデルが未知である制御対象についても制御が行
え、且つ、外乱に対しても補償ができる制御対象の予測
制御方法を提供することができる。According to the present invention described above, there is provided a predictive control method for a controlled object which can control a controlled object whose mathematical model of dynamic characteristics is unknown and can also compensate for a disturbance. be able to.

[Brief description of the drawings]

【図１】この発明方法を実施する装置の概略構成を示す
ブロック図。FIG. 1 is a block diagram showing a schematic configuration of an apparatus for implementing the method of the present invention.

【図２】図１のマニピュレータを説明するための図。FIG. 2 is a view for explaining the manipulator of FIG. 1;

【図３】図１における同定器の第１の例を説明するため
のフローチャート。FIG. 3 is a flowchart for explaining a first example of the identifier in FIG. 1;

【図４】図１の初期状態のとき学習方法を説明するため
の図。FIG. 4 is a diagram for explaining a learning method in an initial state of FIG. 1;

【図５】図１の制御動作を説明するためのフローチャー
ト。FIG. 5 is a flowchart for explaining the control operation of FIG. 1;

【図６】図１の実施例の作用効果を説明するための図。FIG. 6 is a diagram for explaining the operation and effect of the embodiment of FIG. 1;

【図７】図１の実施例の作用効果を説明するための図。FIG. 7 is a view for explaining the operation and effect of the embodiment of FIG. 1;

【図８】図１における同定器の第２の例を説明するため
の図。FIG. 8 is a view for explaining a second example of the identifier in FIG. 1;

【図９】図１における同定器の第３の例を説明するため
の図。FIG. 9 is a view for explaining a third example of the identifier in FIG. 1;

[Explanation of symbols]

１…マニピュレータ、２…同定器、３…操作量発生器、
４…積分器、５…スイッチ、６…フィードバックルー
プ、７…遅れ要素。DESCRIPTION OF SYMBOLS 1 ... Manipulator, 2 ... Identifier, 3 ... Manipulated variable generator,
4 integrator, 5 switch, 6 feedback loop, 7 delay element.

Claims

(57) [Claims]

1. A predictive control method for performing predictive control by giving an operation amount to a control target to be controlled, wherein a multi-layer neural circuit is used as an identifier of the control target, and the identifier uses the neural network for sampling the control target. A first step of inputting a control amount and an operation amount of the control target and predicting a control amount after a predetermined time from the sampling time; a prediction value obtained in the first step; and a target control amount of the control target. A second step of obtaining an error; and inputting the control amount error obtained in the second step to the identifier, and operating the target control amount by an error back propagation method.
Of the target control amount,
A third step of multiplying the sensitivity to obtain a correction amount of the manipulated variable .

2. A correction amount of the operation amount for one time,
Determination through repetition of the first to third steps
The predictive control method according to claim 1, wherein:

3. The predictive control method according to claim 1, wherein the first step predicts the control amount after an integer multiple of the sampling period from the time of the sampling or in units other than the sampling period.