JPH02299002A

JPH02299002A - Adaptive controller

Info

Publication number: JPH02299002A
Application number: JP1119447A
Authority: JP
Inventors: Naoki Imazaki; 直樹今崎; Toru Yamaguchi; 亨山口
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1989-05-12
Filing date: 1989-05-12
Publication date: 1990-12-11

Abstract

PURPOSE:To easily update and apply a stabilization control gain by using a method which uses a neural network constituted so that outputs of respective units in an intermediate unit layer are inputted even to the intermediate unit layer itself. CONSTITUTION:When the control circuit 4 of a model identification control part 1 outputs a reference signal generation signal S1 to a reference signal generating circuit 5, the circuit 5 inputs the reference signal REF to a controlled system 3 and a model identifying circuit 2. At this time, the system 3 and circuit 2 outputs a response signal RES and a model response signal RESM, whose difference ERR is inputted to the learning/deciding circuit 6 of the circuit 1. The circuit 6 when judging that the signal ERR is sufficiently small outputs a learning end signal S3 to the circuit 4. When it is decided that the signal ERR is still larger, on the other hand, a model correction signal COR is outputted to the circuit 2 and the internal state of the circuit 2 composed of the neural network consisting of plural unit layers is so corrected that the signal ERR becomes small.

Description

【発明の詳細な説明】［発明の目的］（産業上の利用分野）本発明は、未知のダイナミクスを有する制御対象システ
ムに対し、ある評価関数を最小にするような制御ゲイン
を適用して制御するとき、当該制御ゲインは一定時間ご
とに同定される制御対象システムのダイナミクスに基づ
いて随時更新されるようになされた適応制御器に関する
。[Detailed Description of the Invention] [Objective of the Invention] (Industrial Application Field) The present invention provides control for a controlled system having unknown dynamics by applying a control gain that minimizes a certain evaluation function. In this case, the control gain relates to an adaptive controller that is updated at any time based on the dynamics of the controlled system identified at regular intervals.

（従来の技術）２次形式の評価関数を最小にするように制御ゲインを決
定する最適レギュレータの理論は、できあがった最適制
御器を含む閉ループシステムが安定となるばかりでなく
、ゲイン余裕や位相余裕が大きい。即ち、ロバスト性を
もっていることで制御方法として高く評価され、広い範
囲で実用化されている。(Prior art) The theory of an optimal regulator that determines the control gain so as to minimize the quadratic evaluation function not only makes the closed-loop system containing the optimal controller stable, but also reduces the gain margin and phase margin. is large. That is, it has been highly evaluated as a control method due to its robustness, and has been put into practical use in a wide range of areas.

ところで、制御対象システムに対して最適制御器を構成
し、上述のような性質を得るためには、制御対象システ
ムのダイナミクスがある特定の線形微分方程式として厳
密に与えられることが必要である。By the way, in order to configure an optimal controller for a controlled system and obtain the above properties, it is necessary that the dynamics of the controlled system be strictly given as a specific linear differential equation.

このため、実際上制御対象システムのダイナミクスが未
知の場合や当該ダイナミクスが時間に伴って変化する場
合には、例えば制御対象同定方法を用いて制御対象シス
テムのダイナミクスを同定する方法がとられている。こ
のような方法は、制御対象同定方法と呼ばれており、従
来より様々な方法が提案されている。For this reason, when the dynamics of the controlled system are actually unknown or when the dynamics change over time, a method is used to identify the dynamics of the controlled system using a controlled object identification method, for example. . Such a method is called a controlled object identification method, and various methods have been proposed in the past.

（発明が解決しようとする課題）しかしながら、従来よりあるこれらの制御対象同定方法
では、適用できる制御対象システムに限界がある上、手
順が複雑で熟練した技術者に頼らざるを得なかった。ま
た、オンラインで常時同定を繰り返すことは困難であっ
た。(Problems to be Solved by the Invention) However, with these conventional control object identification methods, there are limits to the control object systems to which they can be applied, the procedures are complex, and the method has to rely on skilled engineers. In addition, it was difficult to constantly repeat identification online.

本発明はこのような課題を解決するためになされたもの
で、未知のダイナミクスを有する種々の制御対象システ
ムに対して常にシステム同定を行い、ある評価関数を最
小化する安定化制御ゲインを更新し適用することが容易
に実現し得る適応制御器を提供することを目的としてい
る。The present invention was made to solve such problems, and it constantly performs system identification for various controlled systems with unknown dynamics and updates the stabilization control gain that minimizes a certain evaluation function. The objective is to provide an adaptive controller that can be easily implemented.

［発明の構成］（課題を解決するための手段）本発明の適応制御器は上記した目的を達成するために、
未知のダイナミクスを有する制御対象システムに対し、
一定時間ごとに制御対象同定部によりモデル同定し、得
られたモデルに対し所定の評価関数を最小にするよう制
御ゲインを決定し、該制御ゲインにより前記制御対象シ
ステムを制御するように構成された適応制御器において
、前記制御対象同定部が、複数のユニット層から構成さ
れるニューラルネットワークからなり、中間ユニット層
の各ユニットの出力が下層ユニット層と共に、該中間ユ
ニット層自体にも入力されるように構成されたモデル同
定回路と、前記制御対象システムに対し所定の基準入力
信号を送出し、該基■人力信号に対する前記制御対象シ
ステムからのシステム応答信号を記のすると共に、前記
モデル同定回路に対し前記基準人力信号を送出し、該基
準入力信号に対する前記モデル同定回路からのモデル応
答信号を入力し、かつ該モデル応答信号と前記システム
応答信号との誤差に基づき前記モデル同定回路の内部状
態を前記誤差が減少するよう補正をするモデル同定制御
部とを備え、前記補正を複数種類の前記基準入力信号に
ついて繰返し行うことにより、前記モデル同定回路の内
部状態が前記制御対象システムのダイナミクスを実現す
るようにしたものである。[Configuration of the Invention] (Means for Solving the Problems) In order to achieve the above-mentioned objects, the adaptive controller of the present invention has the following features:
For controlled systems with unknown dynamics,
The controlled object identification unit identifies the model at regular intervals, determines a control gain to minimize a predetermined evaluation function for the obtained model, and controls the controlled object system using the control gain. In the adaptive controller, the control object identification section is configured to include a neural network composed of a plurality of unit layers, and the output of each unit of the intermediate unit layer is input to the intermediate unit layer itself as well as the lower unit layer. A model identification circuit configured to send a predetermined reference input signal to the controlled system, record a system response signal from the controlled system to the basic human input signal, and send the model identification circuit to the model identification circuit. The reference human input signal is sent out, a model response signal from the model identification circuit is input to the reference input signal, and the internal state of the model identification circuit is determined based on the error between the model response signal and the system response signal. and a model identification control unit that performs correction to reduce the error, and by repeatedly performing the correction for a plurality of types of the reference input signals, the internal state of the model identification circuit realizes the dynamics of the controlled system. This is how it was done.

（作　用）本発明では、制御対象同定方法として中間ユニット層に
おける各ユニットの出力か下層のみならず中間ユニット
層自信にも入力されるように構成されたニューラルネッ
トワークを用いた手法を用いているので、未知のグイナ
ミクスを有する種々の制御対象システムに対して常にシ
ステム同定を行いある評価関数を最小化する安定化制御
ゲインを更新し適用する適応制御器を容易に実現し得る
。(Function) In the present invention, as a control object identification method, a method using a neural network configured such that the output of each unit in the intermediate unit layer is input not only to the lower layer but also to the intermediate unit layer itself is used. Therefore, it is possible to easily realize an adaptive controller that constantly performs system identification for various systems to be controlled having unknown guidance dynamics, and updates and applies a stabilizing control gain that minimizes a certain evaluation function.

（実施例）以下、本発明の実施例を図面に基づいて詳細に説明する
。(Example) Hereinafter, an example of the present invention will be described in detail based on the drawings.

第１図は本発明の一実施例に係る適応制御器を採用した
制御システムの構成を説明するためのブロック図である
。FIG. 1 is a block diagram for explaining the configuration of a control system employing an adaptive controller according to an embodiment of the present invention.

同図において、１０は全体として制御システムを示し、
動作として２つの機能を並列的に実行する。これはマル
チＣＰＵ構成により、容易に実現できる。即ち、第１の
機能は制御対象である制御対象システム３を適応制御器
１１により制御する制御機能であり、第２の機能は制御
対象システム３のダイナミクスをモデル同定制御部１に
より同定し、モデル同定回路２に制御対象システム３の
モデルを形成するモデル同定機能である。In the figure, 10 indicates the control system as a whole,
As an operation, two functions are executed in parallel. This can be easily achieved using a multi-CPU configuration. That is, the first function is a control function that controls the controlled system 3, which is the controlled object, by the adaptive controller 11, and the second function is a control function in which the dynamics of the controlled system 3 is identified by the model identification control unit 1, and the model This is a model identification function that forms a model of the controlled system 3 in the identification circuit 2.

制御システム１０において、第１の機能即ち制御機能は
、第２の機能即ちモデル同定機能の動作／非動作にかか
わらずリアルタイムに継続される。In the control system 10, the first function, ie, the control function, continues in real time regardless of whether the second function, ie, the model identification function, is activated or not.

即ち、適応制御器１１から出力された制御人力Ｕは制御
対象システム３に入力されると共に、モデル同定回路２
に入力される。ここで、モデル同定機能動作中は、モデ
ル同定制御部１から出力される基準入力信号ＲＥＰが制
御人力Ｕに対して重畳される。制御対象システム３は入
力された制御人力Ｕに対する応答出力ｙを出力し、当該
応答出力ｙは適応制御器１１にフィードバック人力され
る。That is, the control human power U output from the adaptive controller 11 is input to the controlled object system 3, and the model identification circuit 2
is input. Here, while the model identification function is in operation, the reference input signal REP output from the model identification control section 1 is superimposed on the control human power U. The controlled system 3 outputs a response output y to the input control human power U, and the response output y is fed back to the adaptive controller 11 .

か（して、制御対象システム３及び適応制御器１１によ
り閉ループ系が構成される。(Thus, the controlled system 3 and the adaptive controller 11 constitute a closed loop system.

第２図は上述した適応制御器１１の構成を詳細に示すブ
ロック図である。FIG. 2 is a block diagram showing the configuration of the above-mentioned adaptive controller 11 in detail.

同図に示すように適応制御器１１では、制御対象システ
ム３より入力された応答出力ｙは、状態合成回路１２に
与えられる。状態合成回路１２は、応答出力ｙ及び状態
モデル回路１３から出力されるモデル状態信号ｘＭより
制御対象システム３の状態を推定し、当該推定結果を観
測状態信号マとして最適制御器１４に入力する。As shown in the figure, in the adaptive controller 11, the response output y input from the controlled system 3 is given to the state synthesis circuit 12. The state synthesis circuit 12 estimates the state of the controlled system 3 from the response output y and the model state signal xM output from the state model circuit 13, and inputs the estimation result to the optimal controller 14 as an observed state signal.

ここで、状態合成回路１２は、次のようにして観測状態
信号マを求める。即ち、ｒ次元の応答出力ｙのうちｉ　
　（ｏ≦ｉ≦「）次元分はそのまま制御対象システム３
の状態を表しているという場合、このｉ次元分の応答出
力ｙをそのまま観測状態信号マのｉ次元分として。残る
次元の観測状態信号マについては、状態モデル回路１３
からのモデル状態信号ｘＭを適用する。状態モデル回路
１３は、一定時間ごとに制御システム１０のモデル同定
機能により同定された制御対象システムのダイナミクス
を示すパラメータ（ゲイン構成回路１５から与えられる
。）から構成されており、制御対象システム３と同一の
制御人力Ｕを与えることにより制御対象システム３の状
態を表すと思われるモデル状態信号ｘＭを出力する。Here, the state synthesis circuit 12 obtains the observed state signal M as follows. That is, i out of r-dimensional response output y
(o≦i≦“) dimension is the same as the control target system 3
If it represents the state of , then use this i-dimensional response output y as it is for the i-dimensional part of the observed state signal Ma. Regarding the observed state signal of the remaining dimensions, the state model circuit 13
Apply the model state signal xM from . The state model circuit 13 is composed of parameters (given from the gain configuration circuit 15) that indicate the dynamics of the controlled system identified by the model identification function of the control system 10 at regular intervals, and is connected to the controlled system 3. By applying the same control human power U, a model state signal xM that is considered to represent the state of the controlled system 3 is output.

最適制御器１４に内蔵される制御ゲインにはゲイン構成
回路１５から一定時間ごとに与えられる。A control gain built into the optimum controller 14 is given from a gain configuration circuit 15 at regular intervals.

最適制御器１４はこの制御ゲインＫを介して入力された
観Ａｐ１状態信号マを制御人力Ｕに変換し出力する。The optimum controller 14 converts the input Ap1 state signal M through the control gain K into control human power U and outputs it.

ゲイン構成回路１５は、モデル同定回路２から一定時間
ごとに得られる制御対象システム３のダイナミクスを表
すパラメータ（φ、Ｂ）より、ある２次形式評価関数を
最小にするような安定化最適制御ゲインＫを演算すると
共に、制御対象システム３の状態モデルφＢを演算し、
それぞれにより一定時間ごとに最適制御器１４及び状態
モデル回路１３を更新する。The gain configuration circuit 15 calculates a stabilizing optimal control gain that minimizes a certain quadratic evaluation function from parameters (φ, B) representing the dynamics of the controlled system 3 obtained from the model identification circuit 2 at regular intervals. At the same time as calculating K, the state model φB of the controlled system 3 is calculated,
The optimal controller 14 and the state model circuit 13 are updated at regular intervals.

かくして、このよ、うな適応制御器１１の構成によれば
、制御対象システム３のグイナミクスが変動した場合に
も、継続的に最適制御が可能な適応型最適制御が実現さ
れる。Thus, with the configuration of the adaptive controller 11 as described above, adaptive optimal control that can continuously perform optimal control is realized even when the guineamics of the controlled system 3 change.

ここで、適応型最適制御を実現する理論を以下で述べる
。Here, the theory for realizing adaptive optimal control will be described below.

即ち、モデル同定回路２から得られる制御対象システム
のダイナミクスが次式、）（ｊ＋１ｓ＊φｘｔ＋Ｂｕｔ　　　　　　＝（１）Ｙ
ｔ　−Ｃｘｔ　　　　　　　　　　　−（２）のような
離散形で表されるとする。ここで、ｘ　ｔ＋１（１＝０
．１．２．−　）は、ｔ＝ｔ＋ｉＸ△ｔにおける制御対
象システム３の状態を表すｎ次元ベクトル、ｕｔは制御
対象システム３に対する入力を表すｍ次元ベクトル、ｙ
ｔは制御対象システム３の出力を表すｒ次元ベクトルで
あり、φ、Ｂ、Ｃ。That is, the dynamics of the controlled system obtained from the model identification circuit 2 is expressed as follows: )(j+1s*φxt+But=(1)Y
Suppose that it is expressed in a discrete form such as t −Cxt −(2). Here, x t+1 (1=0
．． 1.2. ) is an n-dimensional vector representing the state of the controlled system 3 at t=t+iXΔt, ut is an m-dimensional vector representing the input to the controlled system 3, and y
t is an r-dimensional vector representing the output of the controlled system 3, and φ, B, C.

はそれぞれｎＸｎ％ｎｘｍ、ｒＸｎ次元の定係数マトリ
クスである。このとき、次式、Ｊ−Σ（ｘ’　ｔ　Ｑｘｔ　＋ｕ’　ｔ　Ｒｕｔ　）−
（３）ＩＩＯで表される２次形式評価関数Ｊを考える。ここで、Ｑは
半止定のｎｘｎ次元マトリクス、Ｒは止定のｍＸｍ次元
マトリクスで、オペレータが適当に選定する。いま（φ
、Ｂ）が可到達、即ち、ｒａｎｋ　［Ｂ　、　　φＢ−
，φＩＩ−ＩＢ］＝ｎ　　　−（４）及び、（Ｑ）、φ
）が可観測、即ち、ｒａｎｋ［（Ｑ’　）　’　、　　φ′（Ｑ’　）　’
　、　−−−−・−・・・、　（φ’）”（Ｑへ　）’
）−ｎ　　　　・・・　（５）であれば、（３）式で表
される２次評価関数Ｊを最小にする制御入力１１　ｔは
唯一に定まり、次式、ｕｔ　−−Ｋｘｔ　　　　　　　
　　　　−（６）Ｋ−（Ｒ＋Ｂ’　ＳＢ）”’　Ｂ’　
ＳΦ　　・・・（７）で与えられる。ここで、ＫはｍＸ
ｎのマトリクスで表される安定化制御ゲインであり、Ｓ
は次式の離散型リカッチ方程式、Ｓ−Ｑ＋Φ′　ＳΦ− Φ’　ＳＢ　（Ｒ＋Ｂ’　ＳＢ）”Ｂ’　ＳΦ・・・（
８）の唯−止定対称解を表すｎＸｎのマトリクスである。即
ち、適応制御器１１は内部の適応制御器１４に（７）式
で表される安定化制御ゲインＫを有し、（６）式を満足
するように、入力される状態ｘｔを制御人力ｕｔに変換
し出力する。are constant coefficient matrices of nXn%nxm and rXn dimensions, respectively. At this time, the following equation, J-Σ(x' t Qxt +u' t Rut )-
(3) Consider a quadratic evaluation function J expressed by IIO. Here, Q is a semi-fixed nxn dimensional matrix, and R is a fixed mxm dimensional matrix, which are appropriately selected by the operator. Now (φ
, B) is reachable, that is, rank [B, φB−
, φII-IB] = n - (4) and (Q), φ
) is observable, i.e. rank[(Q')', φ'(Q')'
, −−−−・−・・(φ')"(to Q)'
)-n... If (5), the control input 11 t that minimizes the quadratic evaluation function J expressed by equation (3) is unique, and the following equation, ut --Kxt
-(6)K-(R+B'SB)"'B'
SΦ ... is given by (7). Here, K is mX
It is a stabilization control gain expressed by a matrix of n, and S
is the following discrete Riccati equation, S-Q+Φ' SΦ- Φ' SB (R+B'SB)"B' SΦ...(
8) is an nXn matrix representing the only constant symmetric solution of . That is, the adaptive controller 11 has a stabilizing control gain K expressed by equation (7) in the internal adaptive controller 14, and manually controls the input state xt so as to satisfy equation (6). Convert and output.

因みに、安定化制御ゲインにとして（７）式で表される
ものを用いた場合、制御システム１０が全体として安定
となるばかりでなく、低感度特性や大きなゲイン余裕を
もつといった、いわゆるロバスト性を有することが知ら
れており、制御系として優れた性質をもつこととなる。Incidentally, when the stabilization control gain expressed by equation (7) is used, the control system 10 not only becomes stable as a whole, but also has so-called robustness such as low sensitivity characteristics and a large gain margin. It is known to have excellent properties as a control system.

さて、制御システム１０の第２の機能であるモデル同定
機能は一定時間ごとに起動され、制御対象システム３の
ダイナミクスをモデルとして有するモデル同定回路２を
更新するようになされている。The model identification function, which is the second function of the control system 10, is activated at regular intervals to update the model identification circuit 2 having the dynamics of the controlled system 3 as a model.

このときモデル同定制御部１は以下のようにしてモデル
同定回路２に制御対象システム３のダイナミクスを実現
させる。At this time, the model identification control unit 1 causes the model identification circuit 2 to realize the dynamics of the controlled system 3 in the following manner.

即ち、モデル同定機能動作時のモデル同定システムの構
成を示す第３図において、一定時間ごとにモデル同定機
能動作を開始させる制御回路４が、基準信号発生信号Ｓ
１を基準信号発生回路５に出力すると、基準信号発生回
路５は、基準信号ＲＥＦを制御対象システム３に入力す
ると共に、モデル同定回路２に入力する。このとき、制
御対象システム３及びモデル同定回路２は、それぞれ基
準信号ＲＥＰに対する応答である、応答信号Ｉ？ＥＳ及
びモデル応答信号１７ＥＳＭを出力し、当該応答信号Ｒ
ＥＳ及びモデル応答信号ＲＥＳＭの差ＥＲＲが学習／判
定回路６に入力される。That is, in FIG. 3 showing the configuration of the model identification system when the model identification function is operating, the control circuit 4 that starts the model identification function operation at regular intervals receives the reference signal generation signal S.
1 to the reference signal generation circuit 5, the reference signal generation circuit 5 inputs the reference signal REF to the controlled system 3 and also to the model identification circuit 2. At this time, the controlled system 3 and the model identification circuit 2 each receive a response signal I? which is a response to the reference signal REP. ES and model response signal 17ESM, and the response signal R
The difference ERR between ES and model response signal RESM is input to the learning/judgment circuit 6.

学習／判定回路６は、制御回路４から学習／判定開始信
号Ｓ２を受けたとき、入力された誤差信号ＥＲＲを評価
する。そして、誤差信号ＥＲＲが十分少さいと判断され
た場合には、モデル同定回路２が学習が完了したとして
学習完了信号Ｓ３を制御回路４に出力する。これに対し
、誤差信号ＥＲＲが未だに大きいと判断された場合には
、モデル補正信号ＣＯＲをモデル同定回路２に出力し、
例えばバックプロパゲーションの手法を用いてモデル同
定回路２の内部状態を補正する。When the learning/judgment circuit 6 receives the learning/judgment start signal S2 from the control circuit 4, it evaluates the input error signal ERR. If it is determined that the error signal ERR is sufficiently small, the model identification circuit 2 determines that learning has been completed and outputs a learning completion signal S3 to the control circuit 4. On the other hand, if it is determined that the error signal ERR is still large, the model correction signal COR is output to the model identification circuit 2,
For example, the internal state of the model identification circuit 2 is corrected using a backpropagation method.

モデル同定回路２は、例えば第４図に示すようなニュー
ラルネットワークにより構成される。The model identification circuit 2 is constituted by a neural network as shown in FIG. 4, for example.

即ち、入力層にｍ個のニューロンジ１１、ν１２゜・・
・、シ１．ｍ、中間層にｎ個のニューロンジ２１．ν２
２、・・・、シ２ｎ、出力層に１個のニューロンジ３１
．ν３２、・・・、ν３ｒｓを有する３層構造のネット
ワークで、ｍ次元の学習基準信号ＲＥＰＸ（ｒｒ　＋　
、　＝−、ｒｆ’ｉ。That is, the input layer has m neurons 11, ν12°...
・、Si1. m, n neurons in the middle layer 21. ν2
2,..., shi2n, one new range 31 in the output layer
．． In a three-layer network with ν32, ..., ν3rs, the m-dimensional learning reference signal REPX(rr +
, =-, rf'i.

・・・、　ｒｆ’ａ）を受けたとき、ｒ次元のモデル応
答信号ＲＥＳＭ（０３１，−＝、　０３に、−、０３ｒ
）を出力する。..., rf'a), the r-dimensional model response signal RESM (031, -=, 03, -, 03r
) is output.

入力層のニューロンジ１１（１≦ｌ≦Ｉ）は人力された
学習基準信号ｒｆ’ｌをそのまま出力０１ｊ０１ｉ＝ｒ
ｆｉ（１≦Ｉ　５ｍ）　　　　　　　　・＝　（９）と
して中間層のニューロンジ２１ν２２・・・、ν２ｎに
送出する。The input layer neuron 11 (1≦l≦I) outputs the manually generated learning reference signal rf'l as it is 01j01i=r
fi (1≦I 5m) .=(9) and sends it to the intermediate layer new engines 21 ν 22 . . . , ν 2 n.

中間層のニューロンジ２ｊ（１≦ｊ≦口）は入力ｎｅｔ
２ｊとして次式、（１≦ｊ≦ｎ）　　　　　・・・（１０）を受は取る。The middle layer neuron 2j (1≦j≦mouth) is the input net
As 2j, the receiver takes the following equation, (1≦j≦n)...(10).

（１０）式において、Ｗ２ｊｌｉ（１≦ｉ≦ｍ）は入力
層のニューロンジ１１の出力０１１に対する重みであり
、Ｗ２ｊ２　（１≦ｊ≦ｎ）は中間層のニューロンジ２
１の出力０２１に対する重みである。In equation (10), W2jli (1≦i≦m) is the weight for the output 011 of the neuron logic 11 in the input layer, and W2j2 (1≦j≦n) is the weight for the output 011 of the neuron logic 11 in the input layer.
This is the weight for the output 021 of 1.

一般にＷ２ｊ２１≠０としているので、中間層のニュー
ロンジ２ｊ　（１≦ｊ≦ｎ）は入力層のみならず、自分
自身を含めて中間層の出力も入力として受は取ることに
なる。また、中間層のニューロンジ２ｊの出力０２ｊ（
１≦ｊ≦ｎ）は次式、０２ｊ−ｆ　２ｊ（ｎｅｔ２Ｎ　　　　　　　　　　　
＝・　（１１）で表される。ここでｆ　２ｊ　（・）は
例えばｓ１ｇｍｏｉｄ関数を表す。Generally, since W2j21≠0, the intermediate layer neuron 2j (1≦j≦n) receives as input not only the input layer but also the output of the intermediate layer including itself. In addition, the output 02j (
1≦j≦n) is the following formula, 02j-f 2j(net2N
=・ It is expressed as (11). Here, f 2j (·) represents, for example, the s1gmoid function.

出力層のニューロンジ３ｋ（１≦に≦「）は入力ｎｅｔ
３にとして次式、ｎｅｔ３に一ΣＷ３に２ｊ０２ｊ　（１≦に≦ｒ）−（
１２）３為Ｉを受は取る。The output layer neuron 3k (1≦to≦“) is the input net
3 as the following formula, net3 is one ΣW3 is 2j02j (1≦≦r)−(
12) 3 for I to receive.

（１２）式において、Ｗ３に２ｊ　（１≦ｊ≦ｎ）は中
間層のニューロンジ２ｊの出力０２ｊに対する重みであ
る。−力出力層のニューロンジ３にの出力０３ｋ（ｌ≦
に≦ｒ）は次式、０３ｋ　−ｆ３ｋ（ｎｅｔ３ｋ）　　　　　　　　−（
１Ｂ　）で表される。ここでｆ３ｋ（・）は例えば５１
ｇ１ｏｉｄ関数である。In Equation (12), 2j (1≦j≦n) in W3 is a weight for the output 02j of the new logic 2j in the intermediate layer. -Output 03k (l≦
≦r) is the following formula, 03k −f3k(net3k) −(
1B). Here, f3k(・) is, for example, 51
This is the g1oid function.

いま、ベクトルｘ、ｘ”　、ｕｓ　ｙＳＦ２　　（・）
、Ｆ３　　（・）及びマトリクスφ、ＢＳＣを以下のよ
うに定義する。Now, vector x, x”, us ySF2 (・)
, F3 (·) and the matrix φ, BSC are defined as follows.

ｘ　＝　［０２１，−１０２ｊ、−・・、０２ｎ）’　
ｅＲ”・・・（１４）（以下、余白）Ｘ　”　　１ｍｌ＋［ｎｅｔ　　２１．−１ｎｅｔ　　２ｊ、−・・、ｎｅ
ｔ　　２ｎ］　　’　　Ｅ　Ｒｎ・・・　（１５）ｕ　＝　　［ｒｆ’Ｌ　　、・・・、ｒｌ’ｉ　　・・
・、ｒ［’ｍ］’−ＲＥＦＩＥＲ”　　　　　　　　　
　　　　　　　　　−（１６）ｙ　−［０３１，−・・
、０３に、−・・、０３ｒ］’−ＲＥＳＭ（Ｒｒ　　　
　　　　　　　　　　　　　・・・　（１７）Ｆ２　　
（・　）　− ［ｆ２１（−）　　　、−ｆ２ｊ（−）　　　−・・　
、　　ｆ　　２ｎ　　（・　）　　コ　　′・・・　（
１８）Ｆ３　　（・　）　− ［ｆｌｌ（−）　　　、−＝ｆ３ｋ（−）　　　・・・
　、　　ｆ　　３ｒ　　（・　）　　　コ　　′・・・
　（１９）（以下、余白）（以下、余白）このとき、次式のような関係が得られる。ただし、Ｘ５
ｕＳＹに付随するＦサフィックスｔは、時刻ｔにおける
値を示し、同様にｔ＋１は時刻を十Δｔ（△ｔは定数）
における値を示す。x = [021, -102j, -..., 02n)'
eR"...(14) (Hereafter, margin) X" 1ml+ [net 21. -1net 2j, -...,ne
t 2n] ' E Rn... (15) u = [rf'L,..., rl'i...
・, r['m]'-REFIER"
-(16)y -[031,-...
,03,-...,03r]'-RESM(Rr
... (17) F2
(・) − [f21(-), −f2j(-) −・・
, f 2n (・) ko ′... (
18) F3 (・) − [fll(−), −=f3k(−) ・・・
, f 3r (・) ko ′...
(19) (hereinafter referred to as margin) (hereinafter referred to as margin) At this time, the following relationship is obtained. However, X5
The F suffix t attached to uSY indicates the value at time t, and similarly, t+1 indicates the time 10 Δt (Δt is a constant)
shows the value in

Ｘ１１ｔ＋１−ｍφＸｉ　＋Ｉ３　ｕｔ　　　　　　　
＋＋＋　（２３）ｘｔ　−Ｆ２　　（ｘ’　ｔ）　　　
　　　　　　　−（２４）ｙ　ｔ　−Ｆ　３　　（Ｃｘ
　ｔ）−（２５）Ｆ２　　（・）、Ｆ３（・）が線形関
数であれば、式（２３）、（２４）、（２５）は離散時
間システムの状態方程式と等価となる。Ｆ２　　（・）
、Ｆ３　　（・）として、例えば５１ｇｆｆ１ｏｉｄ関
数を選んだ場合、バイアス点近接であれば十分線形であ
ると仮定できるので、結局モデル形式回路２は離散時間
システムのモデルとなることが可能である。X11t+1-mφXi +I3 ut
+++ (23)xt −F2 (x' t)
−(24)y t −F 3 (Cx
If t)-(25)F2 (.) and F3 (.) are linear functions, equations (23), (24), and (25) are equivalent to state equations of a discrete-time system. F2 (・)
, F3 (·), for example, if the 51gff1oid function is selected, it can be assumed that it is sufficiently linear as long as it is close to the bias point, so the model form circuit 2 can eventually become a model of a discrete time system.

即ち、モデル同定回路２に対し、基準信号ＲＥＦを時系
列的に与えてやることにより、応答信号ＲＥＳを時系列
的に得られ、例えばパックプロパゲーションの手法を用
いて学習することにより、モデル同定回路２の内部状態
は、基準信号ＲＥＦ及び応答信号ＲＥＳに対応する離散
時間システムのダイナミクスを実現することになる。基
準信号ＲＥＦとしては、システムを表現するのに適して
いるＭ系列関数を用いるのが適当と思われる。That is, by giving the reference signal REF to the model identification circuit 2 in time series, the response signal RES can be obtained in time series, and by learning using the pack propagation method, for example, model identification can be performed. The internal state of the circuit 2 will realize the dynamics of a discrete-time system corresponding to the reference signal REF and the response signal RES. It seems appropriate to use an M-sequence function suitable for expressing the system as the reference signal REF.

ここで、モデル同定回路２の内部状態を法定する要素は
、（１８）〜（２２）式で表されるマトリクスであるが
、これらは学習／判定回路６から送出されるモデル補正
信号ＣＯＩ？に基づいて修正される。代表的な修正方法
であるバックプロパゲーションでは、モデル補正信号Ｃ
ｏｔ？は２乗誤差を表す。Here, the elements that determine the internal state of the model identification circuit 2 are matrices expressed by equations (18) to (22), but these are the model correction signals COI? sent from the learning/judgment circuit 6. amended based on. In backpropagation, which is a typical correction method, the model correction signal C
ot? represents the squared error.

因みに第４図のような構成のニューラルネットワークを
用いた場合、フィードフォワード型ニューラルネットワ
ークよりも広範囲のシステムが表現できることが知られ
ており、またフィードフォワード型ニューラルネットワ
ークで同等のものが実現し得たとしても、構成ユニット
であるニューロンの数が多く必要なことが知られている
。Incidentally, it is known that a neural network with the configuration shown in Figure 4 can represent a wider range of systems than a feedforward neural network, and it is also possible to achieve the same result using a feedforward neural network. However, it is known that a large number of neurons, which are constituent units, are required.

以上のように本発明によれば、制御対象同定方法として
中間ユニット層における各ユニットの出力が下層のみな
らず中間ユニット層自身にも入力されるように構成され
たニューラルネットワークを用いた手法を用いることに
より、未知のダイナミクスを有する種々の制御対象シス
テムに対して、常に制御対象を同定し、その同定モデル
を用いである２次評制関数を最小化する安定化制御ゲイ
ンを適用する適応型制御器を容易に実現し得る。As described above, according to the present invention, as a controlled object identification method, a method using a neural network configured such that the output of each unit in the intermediate unit layer is input not only to the lower layer but also to the intermediate unit layer itself is used. This enables adaptive control that constantly identifies the controlled object and uses the identified model to apply a stabilizing control gain that minimizes a quadratic constraint function to various controlled systems with unknown dynamics. can be easily realized.

なお、上述の実施例においては、最適制御器１４に入力
する制御対象システム３の観測状態信号マとして、応答
出力ｙ及び状態モデル回路１３のモデル状態信号ｘＭの
組合わせを用いたが、これに代え、オブザーバを用いて
応答出力ｙ及び制御人力Ｕに基づいて状態Ｘを推定する
ようにしても良い。In the above embodiment, the combination of the response output y and the model state signal xM of the state model circuit 13 was used as the observed state signal of the controlled system 3 input to the optimal controller 14. Alternatively, the state X may be estimated based on the response output y and the control human power U using an observer.

［発明の効果］上述したように本発明によれば、制御対象同定方法とし
て中間ユニット層における各ユニットの出力が下層のみ
ならず中間ユニット層自信にも人力されるように構成さ
れたニューラルネットワークを用いた手法を用いている
ので、未知のダイナミクスを有する種々の制御対象シス
テムに対して常にシステム同定を行いある評価関数を最
小化する安定化制御ゲインを更新じ適用する適応制御器
を容易に実現し得る。[Effects of the Invention] As described above, according to the present invention, as a control object identification method, a neural network configured such that the output of each unit in the intermediate unit layer is manually input not only to the lower layer but also to the intermediate unit layer itself is used. This method makes it easy to create an adaptive controller that constantly performs system identification for various controlled systems with unknown dynamics and updates and applies stabilizing control gains that minimize a certain evaluation function. It is possible.

[Brief explanation of the drawing]

第１図は本発明による最適制御器を適用した制御システ
ムの全体構成を示すブロック図、第２図はモデル同定モ
ードにおける制御システムの構成を示すブロック図、第
３図はモデル同定機能を説明するためのブロック図、第
４図はモデル同定回路の詳細構造を示す路線的系統図で
ある。１・・・モデル同定制御部２・・・モデル同定回路３・・・制御対象システム１０・・・制御システム１１・・・適応制御器１２・・・状態合成回路出願人　　　　　　株式会社　東芝代理人　弁理士　　須　山　佐　− 第１図Figure 1 is a block diagram showing the overall configuration of a control system to which the optimal controller according to the present invention is applied, Figure 2 is a block diagram showing the configuration of the control system in model identification mode, and Figure 3 explains the model identification function. FIG. 4 is a block diagram showing the detailed structure of the model identification circuit. 1...Model identification control unit 2...Model identification circuit 3...Controlled system 10...Control system 11...Adaptive controller 12...State synthesis circuit Applicant Toshiba Corporation Attorney Patent attorney Satoshi Suyama - Figure 1

Claims

[Claims] For a controlled system having unknown dynamics,
The controlled object identification unit identifies the model at regular intervals, determines a control gain to minimize a predetermined evaluation function for the obtained model, and controls the controlled object system using the control gain. In the adaptive controller, the control object identification section is configured to include a neural network composed of a plurality of unit layers, and the output of each unit of the intermediate unit layer is input to the intermediate unit layer itself as well as the lower unit layer. a model identification circuit configured to send a predetermined reference input signal to the controlled system, store a system response signal from the controlled system to the reference input signal, and send the model identification circuit to the model identification circuit; Sending out a reference input signal, inputting a model response signal from the model identification circuit in response to the reference input signal, and determining the internal state of the model identification circuit based on the error between the model response signal and the system response signal. and a model identification control unit that performs correction so that the reference input signal decreases, and by repeatedly performing the correction on a plurality of types of the reference input signals, the internal state of the model identification circuit realizes the dynamics of the controlled system. An adaptive controller characterized by: