JP2013137628A

JP2013137628A - Model prediction control method and model prediction control program

Info

Publication number: JP2013137628A
Application number: JP2011287905A
Authority: JP
Inventors: Masahiro Doi; 将弘土井
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2011-12-28
Filing date: 2011-12-28
Publication date: 2013-07-11
Anticipated expiration: 2031-12-28
Also published as: JP5765222B2

Abstract

PROBLEM TO BE SOLVED: To provide a model prediction control method in which the stability of a closed loop is high and the calculation cost can be reduced.SOLUTION: A model prediction control method includes the steps of: linearly increasing, in a control section, a sampling period from a first sampling interval; making the first sampling interval and a control period in the sampling period equal to each other; and linearly increasing a weight according to the increase of a sampling interval in the sampling period.

Description

本発明は、モデル予測制御方法及びモデル予測制御プログラムに関し、特に所定のサンプリング周期で制約条件（重み）を満たすように制御対象の未来の入力値を算出して、１番目の入力値を現在の入力値とする工程を、所定の制御周期で繰り返すことによって断続的に現在の入力値を算出するモデル予測制御方法及びモデル予測制御プログラムに関する。 The present invention relates to a model predictive control method and a model predictive control program, and in particular, calculates a future input value of a control target so as to satisfy a constraint condition (weight) at a predetermined sampling period, and sets the first input value as the current input value. The present invention relates to a model predictive control method and a model predictive control program that calculate a current input value intermittently by repeating the step of setting an input value at a predetermined control cycle.

一般的なモデル予測制御を以下に説明する。但し、本明細書においては、各式において小文字の太字はベクトルを表し、大文字の太字は行例を表す。但し、太字ではない通常の文字はスカラー量を表す。 General model predictive control will be described below. However, in the present specification, in each formula, lowercase bold letters represent vectors, and uppercase bold letters represent line examples. However, normal characters that are not bold represent a scalar quantity.

制御対象の連続システムが式１、２で表され、離散システムが式３、４で表される。詳細には、連続システムである式１、２をΔｔ（スカラー量）で離散化し、状態変数ｘ_ｋ（ベクトル）と、入力値ｕ_ｋ（スカラー量）及び出力値ｗ_ｋ（スカラー量）を用いて表した結果が、離散システムである式３、４である。

The continuous system to be controlled is expressed by

Equations

1 and 2, and the discrete system is expressed by

Equations

3 and 4. Specifically,

Equations

1 and 2 that are continuous systems are discretized with Δt (scalar amount), and a state variable x _k (vector), an input value u _k (scalar amount), and an output value w _k (scalar amount) are used. The results expressed by

Equations

3 and 4 are discrete systems.

ここで、ｘ（スカラー量）は状態変数、Ａ（行列）は係数行列、ｂ（ベクトル）は入力ベクトル、ｃ（ベクトル）は出力ベクトル、ｋ（スカラー量）は時刻ステップ、ｔ（スカラー量）は連続時間、Δｔ（スカラー量）はサンプリング間隔を示す。 Here, x (scalar amount) is a state variable, A (matrix) is a coefficient matrix, b (vector) is an input vector, c (vector) is an output vector, k (scalar amount) is a time step, and t (scalar amount). Represents a continuous time, and Δt (scalar amount) represents a sampling interval.

また、ハードウェアの制限や制御対象の運転条件等から、入力値や出力値に対して制御条件が課される場合の制約条件は、 In addition, due to hardware limitations and control target operating conditions, when the control condition is imposed on the input value or output value,

式５、６に表すように、線形不等式の形に定式化される。

As expressed in Equations 5 and 6, it is formulated in the form of a linear inequality.

ここで、Ｓ_１（行列）は入力制約の係数行列、ｄ_１（ベクトル）は入力制約の係数ベクトル、Ｓ_２（行列）は出力制約の係数行列、ｄ_２（ベクトル）は出力制約の係数ベクトルを示す。 Here, S ₁ (matrix) is an input constraint coefficient matrix, d ₁ (vector) is an input constraint coefficient vector, S ₂ (matrix) is an output constraint coefficient matrix, and d ₂ (vector) is an output constraint coefficient vector. Indicates.

但し、ｕ_ｋ（ベクトル）は現在からＮΔｔ（スカラー量）未来までの入力値を並べたベクトルであり、以下の式７で表される。ここで、Ｎは自然数を示す。 However, u _k (vector) is a vector in which input values from the present to NΔt (scalar amount) future are arranged, and is expressed by the following Expression 7. Here, N represents a natural number.

また、ｗ_ｋ＋１（ベクトル）はＮΔｔ（スカラー量）未来までの出力値を並べて、以下の式８に示すように、ベクトルにしたものである。 Further, w _{k + 1} (vector) is a vector obtained by arranging output values up to NΔt (scalar amount) in the future, as shown in Equation 8 below.

モデル予測制御は現在から有限時間（ＮΔｔ（スカラー量））までの間で、式９で表される２次形式の評価関数を最小化するように入力値を決定する。 In the model predictive control, the input value is determined so as to minimize the evaluation function of the quadratic form expressed by Expression 9 from the present to the finite time (NΔt (scalar amount)).

但し、上添え字のｒｅｆは目標値を表す。即ち、ｗ^ｒｅｆ _ｋ＋１は、式１０で示すように目標出力のＮΔｔ（スカラー量）未来までの時系列を意味する。 However, the superscript ref represents the target value. In other words, w ^ref _{k + 1} means a time series up to NΔt (scalar amount) future of the target output as shown in Expression 10.

また、Ｑ（行列）、Ｒ（行列）は重み行列として、式１１、１２で示すように各予測点における重み値を対角に並べた対角行列である。 Further, Q (matrix) and R (matrix) are diagonal matrices in which the weight values at each prediction point are arranged diagonally as weight matrices, as shown in equations 11 and 12.

モデル予測制御は、式５及び式６の制約条件の範囲内で式９の評価関数を最小化する、入力時系列ｕ_ｋ（ベクトル）を求め、ｕ_ｋ(ベクトル)の１番目の値ｕ_ｋ（スカラー量）を現在の入力値として用いる制御方法である。そして、Δｔ後に同様に最適解を求め、入力値時系列の１番目の値を入力値として用いる、という処理を繰り返す。 The model predictive control obtains an input time series u _k (vector) that minimizes the evaluation function of Equation 9 within the range of the constraints of Equation 5 and Equation 6, and first value u _{k of} u _k (vector). This is a control method using (scalar amount) as the current input value. Then, after Δt, the optimum solution is similarly obtained, and the process of using the first value of the input value time series as the input value is repeated.

つまり、図１２に示すように、制御周期毎に未来の予測期間ＮΔｔ（スカラー量）を後退させながら、制御条件付き最適化問題（式５、６及び式９）を解いてその時点の入力値を決定していく制御方法である。ちなみに、図１２では太線部分が１番目の予測点である。 That is, as shown in FIG. 12, while solving the future prediction period NΔt (scalar amount) for each control cycle, the optimization problem with the control condition (Equations 5, 6 and 9) is solved, and the input value at that time It is a control method that decides. Incidentally, the thick line portion in FIG. 12 is the first prediction point.

以下、上記の定式化に従って最適化問題を導く。
式３、４より、

In the following, the optimization problem is guided according to the above formulation.
From

Equations

3 and 4,

式１３が導き出されるので、出力値時系列は、

式１４で表される。 Since Equation 13 is derived, the output value time series is

It is expressed by Equation 14.

従って、式９は式１５のように定式化される。ここで、Ｈ_ｋ（行列）は半正定行列を示す。

Therefore, Expression 9 is formulated as Expression 15. Here, H _k (matrix) indicates a semi-definite matrix.

同様にして、式５、６についても、式１６に示すように、ｕ_ｋ（ベクトル）に関する式で表すことができる。

Similarly, Expressions 5 and 6 can also be expressed by expressions relating to u _k (vector) as shown in Expression 16.

最終的に最適解問題は、式１５、１６より、

式１７に表すように、凸２次計画問題に定式化され、この凸２次計画問題の求解結果ｕ_ｋ（ベクトル）の第１番目要素が現在の入力値として実際に用いられる。 Finally, the optimal solution problem is

As expressed in Expression 17, the formulation is formulated into a convex quadratic programming problem, and the first element of the solution result u _k (vector) of the convex quadratic programming problem is actually used as the current input value.

以上が一般的なモデル予測制御の制御則である。
なお、上記の制御則では予測区間の間、常に入力値を変化させ続けるように構成したが、モデル予測制御では、図１３に示すように、予測区間内で入力値を変化させる区間（制御区間）と入力値を変化させない区間が導入される場合もある。ここで、Ｍは自然数を示し、Ｎ以下であることを満たす。 The above is a control rule of general model predictive control.
In the above control law, the input value is continuously changed during the prediction interval. However, in the model predictive control, as shown in FIG. 13, the input value is changed within the prediction interval (control interval). ) And an interval that does not change the input value may be introduced. Here, M represents a natural number and satisfies N or less.

予測区間内で制御区間外にある予測点については、制御区間の最終値に設定されることが一般的である。即ち、式１８であり、

ｕ_ｋ（ベクトル）を

式１９と設定し直して改めて定式化を行うと、式３、４より、

式２０となる。 In general, prediction points outside the control interval within the prediction interval are set to the final value of the control interval. That is, Equation 18

u _k (vector)

When formula 19 is re-set with formula 19 again, from

formulas

3 and 4,

Equation 20 is obtained.

このため、出力値時系列は、

式２１と表される。後は制御区間と予測区間が等しい場合のモデル予測制御と同様に式１５及び式１６のように定式化され、最終的に式１７のように最適化問題が導かれる。 Therefore, the output value time series is

It is expressed as Equation 21. After that, similarly to the model predictive control in the case where the control interval is equal to the prediction interval, it is formulated as shown in Equation 15 and Equation 16, and finally an optimization problem as shown in Equation 17 is derived.

このように一般的なモデル予測制御は、モデルに基づく未来の予測・制御条件の設定・最適解の求解という３つの特徴を含んでいるため、非常に適用可能な範囲が広く有用な制御法である。 In this way, general model predictive control includes the following three features: predicting the future based on the model, setting the control conditions, and finding the optimal solution. is there.

しかし、一番のネックは計算コストが高いことになる。これは、モデル予測制御が最適化問題（凸２次計画問題）の求解ステップを含んでいることに起因している。それ故に、多くは時定数の遅いプロセス系の制御に用いられることが多かった。 However, the biggest bottleneck is the high calculation cost. This is due to the fact that the model predictive control includes a solution step for an optimization problem (convex quadratic programming problem). Therefore, many are often used to control process systems with slow time constants.

そのため、モデル予測制御の計算コストを低減するために、例えば特許文献１の技術が提案されている。
特許文献１の技術は、未来の制御量（入力値）を演算する区間内のサンプリング周期のサンプリング間隔を変更している。つまり、上述したモデル予測制御では、予測区間内のサンプリング間隔は一定であるが、特許文献１の技術は、制御量を変動させる区間ではサンプリング間隔を短くし、制御量を変動させない区間ではサンプリング間隔を長くしても、制御の精度がそれ程落ちないという着想から、未来の制御量を演算する区間内でサンプリング間隔を切り替えている。 Therefore, in order to reduce the calculation cost of model predictive control, for example, the technique of Patent Document 1 has been proposed.
The technique of Patent Literature 1 changes the sampling interval of the sampling period in the interval in which the future control amount (input value) is calculated. That is, in the model predictive control described above, the sampling interval in the prediction interval is constant, but the technique of Patent Document 1 shortens the sampling interval in the interval in which the control amount is varied, and the sampling interval in the interval in which the control amount is not varied. The sampling interval is switched within the interval for calculating the future control amount from the idea that the control accuracy does not drop so much even if the length is increased.

特開２００６−７２７９１号公報JP 2006-72791 A

例えばロボットの制御のように時定数が短く、かつ、速応性が要求されるような制御では、モデル予測制御の制御区間を長く取る必要がある。一般的に制御区間が短い方が、閉ループの安定性は高いので、制御区間を長くするということは、安定性が低下する替わりに速応性を向上させることを意味する。 For example, in the control where the time constant is short and the quick response is required like the control of the robot, the control section of the model predictive control needs to be long. In general, the shorter the control interval, the higher the stability of the closed loop. Therefore, increasing the control interval means improving the quick response instead of decreasing the stability.

ロボットの制御では、時定数が短いので、制御周期も早くする必要があり、モデル予測制御の計算時間も短くする必要がある。そこで、特許文献１の技術のように、計算コストを下げるために、未来の制御量を演算する区間内のサンプリング間隔を変化させる場合、サンプリング間隔の変化のさせ方を適当に設定しないと、閉ループの安定性が低く、ロボットが振動を起こしたり、不安定になって発散してしまったり、する可能性がある。 In robot control, since the time constant is short, it is necessary to shorten the control cycle, and it is also necessary to shorten the calculation time of model predictive control. Therefore, as in the technique of Patent Document 1, in order to reduce the calculation cost, when changing the sampling interval in the interval in which the future control amount is calculated, if the method of changing the sampling interval is not appropriately set, the closed loop The stability of the robot is low, and the robot may vibrate or become unstable and diverge.

しかし、特許文献１には、サンプリング間隔の変化のさせ方、即ち、どのようなサンプリング間隔にするのか、という内容に関しては開示されていない。 However, Patent Document 1 does not disclose how to change the sampling interval, that is, what kind of sampling interval is used.

本発明の目的は、このような問題を解決するためになされたものであり、閉ループの安定性が高く、計算コストを低減できるモデル予測制御方法及びモデル予測制御プログラムを提供することである。 An object of the present invention is to provide a model predictive control method and a model predictive control program that have been made to solve such a problem and that have high closed-loop stability and can reduce calculation costs.

本発明の一形態に係るモデル予測制御方法は、制御区間において、サンプリング周期を１番目のサンプリング間隔から線形に増加させる工程と、前記サンプリング周期における１番目のサンプリング間隔と制御周期とを等しくする工程と、前記サンプリング周期におけるサンプリング間隔の増加に伴って、重みを線形に増加させる工程と、を備える。 In the model predictive control method according to an aspect of the present invention, the step of linearly increasing the sampling period from the first sampling interval in the control section, and the step of equalizing the first sampling interval and the control period in the sampling period And a step of linearly increasing the weight as the sampling interval in the sampling period increases.

本発明の一形態に係るモデル予測制御プログラムは、コンピュータに、制御区間において、サンプリング周期を１番目のサンプリング間隔から線形に増加させる処理と、前記サンプリング周期における１番目のサンプリング間隔と制御周期とを等しくする処理と、前記サンプリング周期におけるサンプリング間隔の増加に伴って、重みを線形に増加させる処理と、を実行させる。 A model predictive control program according to an aspect of the present invention causes a computer to perform a process of linearly increasing a sampling period from a first sampling interval in a control section, and a first sampling interval and a control period in the sampling period. A process of equalizing and a process of linearly increasing the weight as the sampling interval in the sampling period increases are executed.

以上、説明したように、本発明によると、閉ループの安定性が高く、計算コストを低減できるモデル予測制御方法及びモデル予測制御プログラムを提供することができる。 As described above, according to the present invention, it is possible to provide a model predictive control method and a model predictive control program that have high closed-loop stability and can reduce the calculation cost.

本発明の実施の形態１に係るモデル予測制御方法を説明するための図である。It is a figure for demonstrating the model prediction control method which concerns on Embodiment 1 of this invention. 本発明の実施の形態１に係るモデル予測制御方法において、サンプリング周期のサンプリング間隔を線形に増加させた際の、予測点とサンプリング間隔との関係を示す図である。In the model predictive control method which concerns on Embodiment 1 of this invention, it is a figure which shows the relationship between a prediction point and a sampling interval at the time of increasing the sampling interval of a sampling period linearly. 本発明の実施の形態２に係るモデル予測制御方法を説明するための図である。It is a figure for demonstrating the model prediction control method which concerns on Embodiment 2 of this invention. 本発明の実施例に係るモデル予測制御方法での、２足歩行ロボットの重心モデルを模式的に示す図である。It is a figure which shows typically the gravity center model of the biped walking robot in the model prediction control method which concerns on the Example of this invention. 本発明の実施例に係るモデル予測制御方法での、目標ＺＭＰとＺＭＰ制約との関係を示す図である。It is a figure which shows the relationship between target ZMP and a ZMP restriction | limiting in the model predictive control method which concerns on the Example of this invention. 本発明の実施例に係るモデル予測制御方法での、ＺＭＰ軌道と重心軌道との関係を示す図である。It is a figure which shows the relationship between a ZMP trajectory and a gravity center trajectory in the model predictive control method which concerns on the Example of this invention. 本発明の実施例に係るモデル予測制御方法での、重心速度を示す図である。It is a figure which shows the gravity center speed | velocity in the model prediction control method which concerns on the Example of this invention. 本発明の実施例に係るモデル予測制御方法での、重心加速度を示す図である。It is a figure which shows the gravity center acceleration in the model prediction control method based on the Example of this invention. 本発明の実施例に係るモデル予測制御方法において、２足歩行ロボットに外乱が生じた際のＺＭＰ軌道と重心軌道との関係を示す図である。In the model predictive control method according to the embodiment of the present invention, it is a diagram showing the relationship between the ZMP trajectory and the gravity center trajectory when a disturbance occurs in the biped robot. 本発明の実施例に係るモデル予測制御方法において、２足歩行ロボットに外乱が生じた際の重心速度を示す図である。It is a figure which shows the gravity center speed when the disturbance arises in the biped walking robot in the model predictive control method which concerns on the Example of this invention. 本発明の実施例に係るモデル予測制御方法において、２足歩行ロボットに外乱が生じた際の重心加速度を示す図である。It is a figure which shows the gravity center acceleration when the disturbance arises in the bipedal walking robot in the model predictive control method which concerns on the Example of this invention. 一般的なモデル予測制御を説明するための図である。It is a figure for demonstrating general model prediction control. 制御区間と予測区間とが異なる、一般的なモデル予測制御を説明するための図である。It is a figure for demonstrating general model prediction control from which a control area differs from a prediction area.

以下、本発明を実施するための最良の形態について、添付図面を参照しながら説明する。但し、本発明が以下の実施の形態に限定される訳ではない。また、説明を明確にするため、以下の記載及び図面は、適宜、簡略化されている。 The best mode for carrying out the present invention will be described below with reference to the accompanying drawings. However, the present invention is not limited to the following embodiment. In addition, for clarity of explanation, the following description and drawings are simplified as appropriate.

＜実施の形態１＞
本実施の形態のモデル予測制御方法及びモデル予測制御プログラムは、例えばロボット等の時定数が短い制御対象を制御するために好適に実行される。本実施の形態のモデル予測制御方法は、当該制御対象の制御装置がモデル予測制御プログラムを実行することで実現される。但し、当該モデル予測制御方法は、ハードウェア資源を用いて実現しても良い。ちなみに、本実施の形態のモデル予測制御方法は、制御区間と予測区間とを等しくしている。 <Embodiment 1>
The model predictive control method and model predictive control program according to the present embodiment are preferably executed to control a control target having a short time constant such as a robot. The model predictive control method according to the present embodiment is realized when the control device to be controlled executes a model predictive control program. However, the model predictive control method may be realized using hardware resources. Incidentally, in the model predictive control method of the present embodiment, the control interval and the prediction interval are made equal.

先ず、本実施の形態のモデル予測制御方法の概略を説明する。当該モデル予測制御方法は、図１及び図２に示すように、制御区間において、サンプリング周期を１番目のサンプリング間隔から線形に増加させて、各々の予測点での入力値を算出し、１番目の予測点での入力値を現在の入力値として用いる。ちなみに、図１では太線部分が1番目の予測点である。また、図２の縦軸Δｔはサンプリング間隔を示す、横軸ｊは何番目の予測点かを示し、ｊ番目のサンプリング間隔Δｔ_ｊをΔｔ_１＋α（ｊ−１）として算出している。ここで、Δｔ_１は1番目のサンプリング間隔を示し、αは係数を示す。 First, an outline of the model predictive control method of the present embodiment will be described. As shown in FIGS. 1 and 2, the model predictive control method calculates an input value at each prediction point by linearly increasing the sampling period from the first sampling interval in the control interval. The input value at the predicted point is used as the current input value. Incidentally, the thick line portion in FIG. 1 is the first prediction point. Also, the vertical axis Δt in FIG. 2 indicates the sampling interval, the horizontal axis j indicates the number of prediction points, and the j-th sampling interval Δt _j is calculated as Δt ₁ + α (j−1). Here, Δt ₁ represents the first sampling interval, and α represents a coefficient.

このとき、１番目の予測点での制約条件と、制御周期後の現在状態が対応するように、１番目のサンプリング間隔と制御周期とを等しくする。また、上記のようにサンプリング間隔の増加に伴って、制約条件（重み）を線形に増加させる。 At this time, the first sampling interval and the control cycle are made equal so that the constraint condition at the first prediction point corresponds to the current state after the control cycle. In addition, as described above, the constraint condition (weight) is linearly increased as the sampling interval is increased.

これにより、詳細な作用効果は後述するが、最適化問題のサイズを削減することができ、制御区間を大きく取った場合でも計算コストを大幅に低減できる。 Thereby, although the detailed effect will be described later, the size of the optimization problem can be reduced, and the calculation cost can be greatly reduced even when a large control interval is taken.

次に、本実施の形態のモデル予測制御方法を詳細に説明する。一般的なモデル予測制御方法では、重み行列Ｑ、Ｒの要素を個別に大きく変化させることはせず、各要素の値を概ね同じ値にするのが一般的である。すなわち、以下の式２２、２３に示すように設定する。

Next, the model predictive control method of this embodiment will be described in detail. In a general model predictive control method, the elements of the weight matrices Q and R are not largely changed individually, and the values of the elements are generally set to substantially the same value. That is, it sets as shown in the following formulas 22 and 23.

しかし、上記のように予測時間内のサンプリング間隔を変化させた場合、各要素の重みを一定にすると、各サンプリング点（予測点）の影響度が異なるため、適切に制御することができない。これは、予測点毎に式１、２の連続システムが異なる（Ａ（行列）、ｂ（ベクトル）がΔｔ（スラカー量）の関数になっている）ためである。サンプリング間隔が大きい程、同じ入力値ｕ_ｋ（スカラー量）でもｘ_ｋ（ベクトル）→ｘ_ｋ＋１（ベクトル）の変化量は大きくなるので、本実施の形態では重み行列の各要素を、

式２４、２５のように線形に増加させる。ここで、β及びγは定数を示す。 However, when the sampling interval within the prediction time is changed as described above, if the weight of each element is made constant, the degree of influence of each sampling point (prediction point) is different, and thus cannot be controlled appropriately. This is because the continuous systems of

Formulas

1 and 2 are different for each prediction point (A (matrix), b (vector) is a function of Δt (slacker amount)). As the sampling interval is larger, the amount of change of x _k (vector) → x _{k + 1} (vector) becomes larger even with the same input value u _k (scalar amount). Therefore, in this embodiment, each element of the weight matrix is

It is increased linearly as in equations 24 and 25. Here, β and γ are constants.

即ち、重み行列を

式２６、２７とし、対角要素の値が線形に増加するように設定する。 That is, the weight matrix

Equations 26 and 27 are set so that the value of the diagonal element increases linearly.

以下、上記の構成で定式化を行う。
各サンプリング間隔Δｔ_ｊ（スカラー量）に対応した離散システム行列をＡ_ｊ（行列）、ｂ_ｊ（スカラー量）とすると、式３、４は

式２８、２９となる。従って、

式３０となるので、出力値時系列は、

式３１と表される。後は一般的なモデル予測制御と同様にして、式９は、

式３２のように定式化される。 Hereinafter, formulation is performed with the above configuration.
Assuming that the discrete system matrix corresponding to each sampling interval Δt _j (scalar amount) is A _j (matrix) and b _j (scalar amount),

equations

3 and 4 are

Expressions 28 and 29 are obtained. Therefore,

Since Expression 30 is satisfied, the output value time series is

It is expressed as Formula 31. After that, similar to general model predictive control, Equation 9 is

Formulated as Equation 32.

制約条件も同様にして、式５、６についても、

として、ｕ_ｋ（ベクトル）に関する式３３で表すことができる。 The same applies to the constraints, and for Equations 5 and 6,

Can be expressed by Expression 33 relating to u _k (vector).

最終的に最適化問題は、式３２、３３より、

式３４に表すように、凸２次計画問題に定式化され、この凸２次計画問題の求解結果ｕ_ｋ（ベクトル）の第１番目要素を現在の入力値として実際に用いる。 Finally, the optimization problem is

As expressed in Expression 34, it is formulated into a convex quadratic programming problem, and the first element of the solution result u _k (vector) of this convex quadratic programming problem is actually used as the current input value.

以上が本実施の形態のモデル予測制御の制御則である。本実施の形態のモデル予測制御は、上述のように制御区間においてサンプリング間隔を線形に増加させている。その作用効果について、本出願人は以下のように推測している。 The above is the control law of model predictive control according to the present embodiment. In the model predictive control according to the present embodiment, the sampling interval is linearly increased in the control section as described above. About the effect, the present applicant estimates as follows.

目標の軌道（ｒｅｆ）の変化速度に対して予測区間（制御区間）のサンプリング間隔が長い場合、予測点間に目標軌道の山や谷が埋もれてしまう。その結果、時間が進むにつれて、前の時間の予測では入らなかった目標軌道の山や谷が、後の時間になってから現れてきたり、また逆に消えたりということが起る。この時、制御周期毎の入力値は振動的になり、状態量も目標軌道の周りで振動的な挙動を示す。 When the sampling interval of the prediction section (control section) is long with respect to the change speed of the target trajectory (ref), the peaks and valleys of the target trajectory are buried between the prediction points. As a result, as time progresses, peaks and valleys of the target trajectory that were not entered in the prediction of the previous time may appear at a later time, or may disappear. At this time, the input value for each control cycle becomes oscillating, and the state quantity also shows oscillating behavior around the target trajectory.

一方、モデル予測制御では予測区間のうち現在に近い時点の方が現在の入力値に対する影響が大きい。即ち、遠い未来の目標値が変わっても、現在の入力値はそれ程大きく変化しないが、近い未来の目標値の変化に対しては、現在の入力値を大きく変化させて対応することになる。 On the other hand, in the model predictive control, the influence on the current input value is larger at a point near the present in the prediction interval. That is, even if the target value in the far future changes, the current input value does not change so much, but the change in the target value in the near future is handled by changing the current input value greatly.

また、モデル予測制御では制御区間の予測点で制約条件を満たすように求解が行われるが、予測点間では制約条件を満たすことは保証されていない。制御区間のサンプリング間隔を変化させた場合、今回の予測点がしばしば以前の求解時の予測点間に設定されるが、予測点間では制約条件を満たす保証がないので、前回の軌道と今回の求解結果とが大きく異なる場合があり（前回の求解時には制約が守られていなかった点が、今回の求解では急に満たすように求解されるようになるため）、この時も振動の要因となる。制御区間のサンプリング間隔を急激に変化させるようなパターンを用いた場合、予測点間で制約を破る解が出力されることが特に多く、振動が発生し易くなる。 In model predictive control, a solution is obtained so as to satisfy a constraint condition at a prediction point in a control section, but it is not guaranteed that the constraint condition is satisfied between prediction points. When the sampling interval of the control section is changed, the current prediction point is often set between the prediction points at the previous solution, but there is no guarantee that the constraint condition is satisfied between the prediction points. There may be a large difference from the solution result (because the current solution solves the problem that the restrictions were not observed at the previous solution, the solution will be solved suddenly), and this time also causes vibration . When a pattern that rapidly changes the sampling interval of the control section is used, a solution that breaks the constraint between prediction points is often output, and vibration is likely to occur.

こうした技術的知見から、現在から予測区間（制御区間）の未来に向かって、線形にサンプリング間隔を増加させることにより、振動を発生させることなく、即ち閉ループを安定させて、予測区間の点の数を効果的に減らして計算コストを削減することが可能になる。 From this technical knowledge, the number of points in the prediction interval can be stabilized without increasing the oscillation, that is, by stabilizing the closed loop, by increasing the sampling interval linearly from the present to the future of the prediction interval (control interval). It is possible to effectively reduce the calculation cost.

また、本実施の形態のモデル予測制御は、重みを線形に増加させているが、その作用効果について、本出願人は以下のように推測している。 Further, in the model predictive control according to the present embodiment, the weight is increased linearly, but the applicant estimates the effect as follows.

モデル予測制御では、制御区間の予測点間では入力値が一定という仮定が前提となっている。つまり、予測点Ａから次の予測点Ｂまでの間では、離散システムにずっと予測点Ａにおける入力が作用しているとして定式化が行われる。 In model predictive control, it is assumed that the input value is constant between prediction points in the control section. That is, in the period from the prediction point A to the next prediction point B, the formulation is performed on the assumption that the input at the prediction point A continues to act on the discrete system.

従って、サンプリング間隔を変化させた場合には、同じ入力値が作用し続ける時間が入力毎に異なってしまい、適切に最適化を行うことができない。そこで、サンプリング間隔を線形に増加させたのに合わせて重み行列を線形に増加させることにより、サンプリング間隔の大きな点の入力値ほど評価関数上の影響度が大きくなるように設定することができ、適切に最適化を行うことが可能となる。 Therefore, when the sampling interval is changed, the time during which the same input value continues to operate differs for each input, and optimization cannot be performed appropriately. Therefore, by increasing the weighting matrix linearly in accordance with the linear increase of the sampling interval, the input value at a point with a large sampling interval can be set to have a greater influence on the evaluation function. It becomes possible to optimize appropriately.

＜実施の形態２＞
実施の形態１のモデル予測制御方法は、制御区間と予測区間とを等しくしたが、図３に示すように、一般的なモデル予測制御方法と同様に制御区間と予測区間とが異なるようにしても良い。なお、図３でも太線部分が1番目の予測点である。 <Embodiment 2>
In the model predictive control method of the first embodiment, the control interval and the prediction interval are made equal. However, as shown in FIG. 3, the control interval and the prediction interval are made different as in the general model predictive control method. Also good. In FIG. 3, the bold line portion is the first prediction point.

この場合、ｕ_ｋ（ベクトル）を式１９と設定し直して改めて定式化を行うと、

式３５となるので、出力値時系列は、

式３６と表される。その後の導出は実施の形態１のモデル予測制御方法と同様であるので説明を省略する。これにより、より長い区間の予測区間を確保することができる。 In this case, if u _k (vector) is reset to Equation 19 and formulated again,

Since Expression 35 is obtained, the output value time series is

It is expressed as Expression 36. Subsequent derivation is the same as in the model predictive control method of the first embodiment, and a description thereof is omitted. Thereby, the prediction area of a longer area is securable.

以上、本発明に係るモデル予測制御方法及びモデル予測制御プログラムの実施の形態を説明したが、上記に限らず、本発明の技術的思想を逸脱しない範囲で、変更することが可能である。 The embodiment of the model predictive control method and the model predictive control program according to the present invention has been described above. However, the present invention is not limited to the above, and can be changed without departing from the technical idea of the present invention.

＜実施例＞
本発明のモデル予測制御方法及びモデル予測制御プログラムを２足歩行ロボットの重心軌道生成問題に適用した実施例を説明する。
図４に示すように、重心高さを一定とすると、ＺＭＰ（Zero Moment Point）方程式は良く知られているように、

式３７となる。ここで、ｘ_ｚｍｐは２足歩行ロボットのＺＭＰにおけるｘ座標、ｘ_ｇは当該重心位置のｘ座標、ｈは当該重心位置の高さ、ｇは重力加速度を示す。 <Example>
An embodiment in which the model predictive control method and the model predictive control program of the present invention are applied to the problem of generating the center of gravity trajectory of a biped robot will be described.
As shown in FIG. 4, when the height of the center of gravity is constant, the ZMP (Zero Moment Point) equation is well known,

Expression 37 is obtained. Here, x _zmp is the x coordinate in the ZMP of the biped robot, x _g is the x coordinate of the centroid position, h is the height of the centroid position, and g is the gravitational acceleration.

入力値をｕ（ｔ）（ベクトル）＝ｘ^{（・・・）} _ｇ（スカラー量）、出力値をｗ（ｔ）(スカラー量)＝ｘ_ｚｍｐ（スカラー量）とすると連続システムは、

式３８、３９となる。これをΔｔ_ｊ（スカラー量）で離散化すると、

式４０、４１となる。 If the input value is u (t) (vector) = x ^(...) _g (scalar amount) and the output value is w (t) (scalar amount) = x _zmp (scalar amount), the continuous system is

Expressions 38 and 39 are obtained. When this is discretized by Δt _j (scalar amount),

Expressions 40 and 41 are obtained.

例として、目標のＺＭＰ軌道とＺＭＰの制約条件（出力値の制約条件）を図５のように設定した。 As an example, the target ZMP trajectory and the ZMP constraint conditions (output value constraint conditions) are set as shown in FIG.

本実施例では、予測区間と制御区間は同一にして１．６［ｓ］、制御周期は１［ｍｓ］として、サンプリング周期は図１と同様の設定を用いた（α＝０．００５）。 In this embodiment, the prediction interval and the control interval are the same, 1.6 [s], the control cycle is 1 [ms], and the sampling cycle is set in the same manner as in FIG. 1 (α = 0.005).

また、重みに関しては、Ｑ（スカラー量）＝１、Ｒ（スカラー量）＝１×１０^−６、β(スカラー量)＝０．００５、γ（スカラー量）＝５×１０^−９とした。 Regarding the weights, Q (scalar amount) = 1, R (scalar amount) = 1 × 10 ⁻⁶ , β (scalar amount) = 0.005, and γ (scalar amount) = 5 × 10 ⁻⁹ .

本実施例の結果を図６〜図８に示した。図６は、図５にＺＭＰ軌道と重心軌道とを重ね合わせてプロットした図である。本図より、目標ＺＭＰ軌道に追従しながら滑らかな重心軌道を生成できていることが分かる。また、図７及び図８は、重心速度・重心加速度の結果をプロットしたグラフだが、これらのグラフから、速度・加速度の次元でも振動等を発生することなく軌道生成が行えることが分かる。 The results of this example are shown in FIGS. FIG. 6 is a diagram in which the ZMP trajectory and the gravity center trajectory are overlaid on FIG. From this figure, it can be seen that a smooth center of gravity trajectory can be generated while following the target ZMP trajectory. FIGS. 7 and 8 are graphs plotting the results of the center of gravity speed and the center of gravity acceleration. From these graphs, it can be seen that the trajectory can be generated without generating vibration or the like even in the dimension of the speed and acceleration.

次に制約条件が守られていることを確認するために、上記と同様の条件で、２．６［ｓ］の時に仮想的に突発外力を入力し、速度をステップ上に変化させた。この時のＺＭＰ軌道と重心軌道とを図９に、重心速度と加速度とを図１０及び図１１に示した。図９から分かるように、ＺＭＰは制約条件を満たしながら、目標ＺＭＰ軌道に戻っていることが確認できる。 Next, in order to confirm that the constraint conditions are observed, a sudden external force was virtually input at 2.6 [s] under the same conditions as described above, and the speed was changed stepwise. The ZMP trajectory and the center of gravity trajectory at this time are shown in FIG. 9, and the center of gravity speed and acceleration are shown in FIG. 10 and FIG. As can be seen from FIG. 9, it can be confirmed that the ZMP returns to the target ZMP trajectory while satisfying the constraint conditions.

Claims

In model predictive control method,
Increasing the sampling period linearly from the first sampling interval in the control interval;
Equalizing the first sampling interval and the control period in the sampling period;
Increasing the weight linearly with increasing sampling interval in the sampling period;
A model predictive control method comprising:

In model predictive control program,
On the computer,
In the control period, a process of linearly increasing the sampling period from the first sampling interval;
Processing for equalizing the first sampling interval and the control period in the sampling period;
A process of linearly increasing the weight as the sampling interval increases in the sampling period;
Model predictive control program that executes