
JP2016198873A - Optimum control device, optimum control method, and optimum control program

Info

Publication number: JP2016198873A
Application number: JP2015082637A
Authority: JP
Inventors: 将弘土井; Masahiro Doi
Original assignee: Toyota Motor Corp
Current assignee: Toyota Motor Corp
Priority date: 2015-04-14
Filing date: 2015-04-14
Publication date: 2016-12-01
Anticipated expiration: 2035-04-14
Also published as: JP6421683B2

Abstract

PROBLEM TO BE SOLVED: To quickly determine an optimal solution to an optimization problem in model predictive control. SOLUTION: An optimal control device comprises contact point planning means that sets a contact point plan for the moving means of a mobile robot, and trajectory generation means. The trajectory generation means performs model predictive control in which it calculates a state variable of the center of gravity using an evaluation criterion in a prediction interval and generates a center-of-gravity trajectory based on the calculated state variable. The evaluation criterion minimizes an evaluation function that includes the square of a quantity based on the contact force at each contact point in the prediction interval. An equality-constrained optimization problem that includes the evaluation criterion, a state equation, and an equality constraint expressed as a linear equation of the mobile robot is converted, using an orthogonal complement space, into an unconstrained optimization problem that does not include the equality constraint. The trajectory generation means determines an optimal solution to this unconstrained optimization problem using a recursive calculation method, and calculates the state variable of the center of gravity based on the determined optimal solution. SELECTED DRAWING: Figure 1

Description

The present invention relates to an optimal control device, an optimal control method, and a control program for performing model predictive control of a mobile robot.

For example, an optimal control device that performs model predictive control (receding horizon control) on the mechanical link system of a robot is known (see Patent Document 1).

Patent Document 1: JP 2000-330609 A

In the model predictive control described above, physical constraints of the robot are set. The constrained optimization problem is then solved in every control cycle, and the center-of-gravity trajectory of the robot is generated based on the optimal solution obtained. Conventionally, however, finding this optimal solution has taken a great deal of time.

The present invention has been made to solve this problem. Its main object is to provide an optimal control device, an optimal control method, and a control program that can quickly find the optimal solution of the optimization problem in model predictive control and generate a center-of-gravity trajectory.

One aspect of the present invention for achieving the above object is an optimal control device comprising: contact point planning means for setting a contact point plan, which is time-series data of the positions of the contact points at which the moving means of a mobile robot that moves while alternately grounding two or more moving means make contact, and of the postures of the moving means at the time of contact; and trajectory generation means for generating, based on the contact point plan set by the contact point planning means, a center-of-gravity trajectory along which the mobile robot moves while the moving means make contact at the contact points. The trajectory generation means performs model predictive control in which it constructs a prediction model whose input is a quantity based on the contact force when the moving means is grounded, expresses by the prediction model the state variable of the center of gravity of the mobile robot over a prediction interval of a predetermined time width, calculates the state variable of the center of gravity in the prediction interval using a predetermined evaluation criterion, and generates the center-of-gravity trajectory of the mobile robot based on the calculated state variable. The evaluation criterion minimizes, within the prediction interval, an evaluation function that includes the square of the quantity based on the contact force at each contact point. An equality-constrained optimization problem that includes the evaluation criterion, a linear state equation relating the input based on the contact force to the state variable of the center of gravity, and an equality constraint expressed as a linear equation of the mobile robot is converted, using an orthogonal complement space, into an unconstrained optimization problem that does not include the equality constraint. In the prediction interval, the trajectory generation means solves the converted unconstrained optimization problem for an optimal solution using a recursive calculation method, and calculates the state variable of the center of gravity based on the obtained optimal solution.
In this aspect, the trajectory generation means may construct a prediction model whose input is the derivative of the contact force when the moving means is grounded. The evaluation criterion may include a criterion of distributing the contact force and the derivative of the contact force to each contact point based on a weight set for each contact point, and may minimize, within the prediction interval, an evaluation function that includes the sum of squares of the contact force and of its derivative. The equality-constrained optimization problem, which includes the evaluation criterion, a state equation relating the contact-force-derivative input to the state variable of the center of gravity, and an equality constraint expressing the force-balance constraint of the mobile robot, may be converted, using an orthogonal complement space, into an unconstrained optimization problem that does not include the equality constraint.
In this aspect, a state-variable transformation may be derived by QR decomposition of the equation expressing the equality constraint; an input transformation may be derived by QR decomposition of an equation derived from the state equation relating the contact-force-derivative input to the state variable of the center of gravity; a transformed state equation may be derived from the state equation, the state-variable transformation, and the input transformation; and a transformed evaluation function may be derived from the state-variable transformation, the input transformation, and the evaluation function of the equality-constrained optimization problem.
The unconstrained optimization problem may include the transformed evaluation function and the transformed state equation.
In this aspect, the trajectory generation means may find the optimal solution by applying a recursive calculation method to the optimality condition of the matrix representation of the unconstrained optimization problem, and may calculate time-series data of the state variable of the center of gravity based on the obtained optimal solution and on the state-variable transformation derived by QR decomposition of the equation expressing the equality constraint.
In this aspect, the equality constraint may include an input set so that the contact force remains unchanged only within a predetermined interval.
In this aspect, the trajectory generation means may take an optimization problem with equality and inequality constraints, which includes the equality constraint and an inequality constraint expressing a stability constraint on the contact points, convert it using an orthogonal complement space into an unconstrained optimization problem, solve that problem for an optimal solution using a recursive calculation method, and calculate time-series data of the state variable of the center of gravity based on the obtained optimal solution.
In this aspect, the trajectory generation means may apply Newton's method to the optimality condition of the matrix representation of the unconstrained optimization problem, calculate the Newton direction using the recursive calculation method within the convergence iterations of Newton's method, and calculate the optimal solution based on the calculated Newton direction.
In this aspect, the trajectory generation means may apply an interior point method or an active set method to the optimality condition of the matrix representation of the unconstrained optimization problem.
In this aspect, the inequality constraint condition may include an input set to limit the contact force only within a predetermined interval.
In this aspect, the state equation of the optimization problem may include a linear time-varying control parameter.
In this aspect, the device may further include control means that controls the moving means based on the center-of-gravity trajectory generated by the trajectory generation means.
One aspect of the present invention for achieving the above object may be an optimal control method including: a step of setting a contact point plan, which is time-series data of the positions of the contact points at which the moving means of a mobile robot that moves while alternately grounding two or more moving means make contact, and of the postures of the moving means at the time of contact; and a step of generating, based on the set contact point plan, a center-of-gravity trajectory along which the mobile robot moves while the moving means make contact at the contact points. The method performs model predictive control in which a prediction model whose input is a quantity based on the contact force when the moving means is grounded is constructed, the state variable of the center of gravity of the mobile robot over a prediction interval of a predetermined time width is expressed by the prediction model, the state variable of the center of gravity is calculated in the prediction interval using a predetermined evaluation criterion, and the center-of-gravity trajectory of the mobile robot is generated based on the calculated state variable. The evaluation criterion minimizes, within the prediction interval, an evaluation function that includes the square of the quantity based on the contact force at each contact point. An equality-constrained optimization problem that includes the evaluation criterion, a linear state equation relating the input based on the contact force to the state variable of the center of gravity, and an equality constraint expressed as a linear equation of the mobile robot is converted, using an orthogonal complement space, into an unconstrained optimization problem that does not include the equality constraint. In the prediction interval, the converted unconstrained optimization problem is solved for an optimal solution using a recursive calculation method, and the state variable of the center of gravity is calculated based on the obtained optimal solution.
One aspect of the present invention for achieving the above object may be an optimal control program that causes a computer to execute: a process of setting a contact point plan, which is time-series data of the positions of the contact points at which the moving means of a mobile robot that moves while alternately grounding two or more moving means make contact, and of the postures of the moving means at the time of contact; and a process of generating, based on the set contact point plan, a center-of-gravity trajectory along which the mobile robot moves while the moving means make contact at the contact points. The program performs model predictive control in which a prediction model whose input is a quantity based on the contact force when the moving means is grounded is constructed, the state variable of the center of gravity of the mobile robot over a prediction interval of a predetermined time width is expressed by the prediction model, the state variable of the center of gravity is calculated in the prediction interval using a predetermined evaluation criterion, and the center-of-gravity trajectory of the mobile robot is generated based on the calculated state variable. The evaluation criterion minimizes, within the prediction interval, an evaluation function that includes the square of the quantity based on the contact force at each contact point. An equality-constrained optimization problem that includes the evaluation criterion, a linear state equation relating the input based on the contact force to the state variable of the center of gravity, and an equality constraint expressed as a linear equation of the mobile robot is converted, using an orthogonal complement space, into an unconstrained optimization problem that does not include the equality constraint. In the prediction interval, the converted unconstrained optimization problem is solved for an optimal solution using a recursive calculation method, and the state variable of the center of gravity is calculated based on the obtained optimal solution.
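The conversion of an equality-constrained problem into an unconstrained one through an orthogonal complement space can be illustrated on a small static example. The following is only a sketch under stated assumptions, not the formulation of this publication (which transforms the state and input over the whole prediction interval): it assumes a generic problem of minimizing z^T W z subject to C z = d, and the names `solve_equality_constrained`, `W`, `C`, and `d` are hypothetical. A QR decomposition of C^T yields a particular solution and a basis N for the orthogonal complement of the row space of C, so every feasible z can be written as z_p + N y with y unconstrained.

```python
import numpy as np


def solve_equality_constrained(W, C, d):
    """Minimize z^T W z subject to C z = d with a null-space method.

    Assumes W is symmetric positive definite and C (m x n, m < n)
    has full row rank. All names here are illustrative.
    """
    m, n = C.shape
    # Full QR of C^T: C^T = Q [R; 0], with Q (n x n) orthogonal.
    Q, R_full = np.linalg.qr(C.T, mode="complete")
    R = R_full[:m, :]
    Q1, N = Q[:, :m], Q[:, m:]  # N spans the orthogonal complement of range(C^T)
    # Particular solution: C z_p = R^T Q1^T z_p = d.
    z_p = Q1 @ np.linalg.solve(R.T, d)
    # Substituting z = z_p + N y removes the constraint; y is unconstrained.
    H = N.T @ W @ N
    g = N.T @ W @ z_p
    y = np.linalg.solve(H, -g)
    return z_p + N @ y


# Hypothetical example: split a total of 10 between two contacts,
# penalizing the second contact three times as heavily as the first.
z = solve_equality_constrained(np.diag([1.0, 3.0]),
                               np.array([[1.0, 1.0]]),
                               np.array([10.0]))
```

In this toy case the constraint z_1 + z_2 = 10 is satisfied exactly and the more heavily penalized component receives the smaller share, z = (7.5, 2.5). The reduced problem in y has no constraint left, which is what makes a plain linear solve (or, over a horizon, a recursive solve) applicable.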

According to the present invention, it is possible to provide an optimal control device, an optimal control method, and a control program that can quickly find the optimal solution of the optimization problem in model predictive control and generate a center-of-gravity trajectory.

A diagram showing an example of the operation of the mobile robot.
A diagram showing an example of the mechanical configuration of the mobile robot.
A functional block diagram of the mobile robot.
A diagram showing the six-axis force.
A functional block diagram of the optimal control device.
A diagram showing an example of the outline of a contact point plan.
A diagram showing an example of a contact point plan.
A diagram showing an example of a prediction interval.
A diagram showing the motion in a prediction interval.
A diagram for explaining the shift of the prediction interval.
A diagram showing the model of the mobile robot used for prediction.
A diagram for explaining the discretization of the prediction interval.
A conceptual diagram of an orthogonal complement space.
A diagram showing the flow of conversion into an unconstrained LQ optimization problem.
A flowchart showing the optimal control method.
A diagram showing the coordinate system of a contact point and the contact polygon.
A diagram illustrating a case in which a contact point becomes unstable.
A flowchart showing the procedure for finding the optimal solution of an LQ optimization problem with equality and inequality constraints.

An embodiment of the present invention is illustrated in the drawings and described with reference to the reference numerals attached to the elements in them.
(First embodiment)
This embodiment is characterized by the optimal control device of a mobile robot, and specifically by the trajectory generation used to control the movement of the mobile robot (FIG. 1). Before the specific control (trajectory generation) is described, the hardware configuration of the mobile robot to be controlled is described first.

FIG. 2 is a diagram showing an example of the mechanical configuration of the mobile robot.
The mobile robot 100 has three axes for each hip joint, one axis for each knee joint, two axes for each ankle joint, three axes for each shoulder joint (shoulder pitch, shoulder roll, shoulder yaw), one axis for each elbow joint (elbow pitch), and three axes for each wrist joint (wrist yaw, wrist pitch, wrist roll).

(The mechanical configuration of the mobile robot is not limited to this, but each hand (arm) needs at least six degrees of freedom, and each foot (leg) also needs at least six.)
The mobile robot 100 has motors 1, 2, ..., 28 with encoders at its joints.
The joint motors 1a, 2a, ..., 28a (FIG. 3) can adjust the joint angles θ1, θ2, ..., θ28.
The joint encoders 1b, 2b, ..., 28b, in turn, can measure the joint angles θ1, θ2, ..., θ28.

The mobile robot 100 also has contact force sensors 25 at its foot tips (soles) and hand tips (palms).
Here, the contact force is a six-axis force. As shown in FIG. 4, it consists of the set of forces f in the x-, y-, and z-axis directions, (f_x, f_y, f_z)^T, and the set of moments τ about the x-, y-, and z-axes, (τ_x, τ_y, τ_z)^T.
(The x-axis and the y-axis are mutually orthogonal axes in the plane perpendicular to the z-axis, which is the vertical direction.)

When moving, this mobile robot keeps one or more of its right foot, left foot, right hand, and left hand in contact with a floor, wall, table, or the like.
In the following description, the right foot, left foot, right hand, and left hand are therefore sometimes referred to as contact point candidates. The hand tips and foot tips are a specific example of the moving means.

FIG. 3 is a functional block diagram of the mobile robot 100.
The mobile robot 100 includes the joint motors 1a to 24a and encoders 1b to 24b, the contact force sensor 25, and the optimal control device 210.

Sensor detection values from the joint encoders 1b to 24b and the contact force sensor 25 are input to the optimal control device 210. The optimal control device 210 outputs drive signals to the joint motors 1a to 24a.

As its main hardware configuration, the optimal control device 210 is a microcomputer having a CPU (Central Processing Unit) 210a that performs control processing, arithmetic processing, and the like, a ROM (Read Only Memory) 210b that stores the control programs, arithmetic programs, and the like executed by the CPU 210a, and a RAM (Random Access Memory) 210c that temporarily stores processing data and the like. The CPU 210a, ROM 210b, and RAM 210c are connected to one another by a data bus 210d. The necessary programs may be recorded on a non-volatile recording medium and installed as needed.

FIG. 5 is a functional block diagram of the optimal control device 210 according to an embodiment of the present invention. The optimal control device 210 according to the present embodiment includes a contact point plan setting unit 221 (a specific example of the contact point planning means) that sets a contact point plan for the mobile robot 100, a trajectory generation unit 222 (a specific example of the trajectory generation means) that generates a center-of-gravity trajectory that the mobile robot 100 can execute stably, and a motion control unit 223 (a specific example of the control means) that makes the mobile robot 100 execute whole-body motion according to the generated center-of-gravity trajectory.

The trajectory generation unit 222 generates a center-of-gravity trajectory with which the motion specified by the contact point plan can be executed, and this generation includes changing contact points where necessary.
The specific processing of these functional units is described later.

(Trajectory generation method for multi-point contact movement)
The trajectory generation unit 222 according to the present embodiment (1) generates a center-of-gravity trajectory that can realize multi-point contact movement and (2) changes contact points as necessary. The method for (1), generating a center-of-gravity trajectory that can realize multi-point contact movement, is described here. The present applicant has filed an application for this method as Japanese Patent Application No. 2013-254989 (filed on December 10, 2013).

In the first place, a future target center-of-gravity position cannot be known in advance, so it is unreasonable to expect the user to set, as a control target value, a future center-of-gravity position that should be unknown. Ideally, the user gives the robot only the contact point plan information, and the mobile robot then automatically generates a stable center-of-gravity trajectory based on that information and moves autonomously.

To make a mobile robot perform multi-point contact movement stably, a technique is needed that distributes the contact force smoothly and appropriately across contact points that change from moment to moment while also generating a stable center-of-gravity trajectory.

For this purpose, the trajectory generation unit 222 according to the present embodiment generates the center-of-gravity trajectory using model predictive control (so-called receding horizon control).
An outline of model predictive control is given first.

(Overview of model predictive control)
For example, suppose we want the mobile robot to perform the movement illustrated in FIG. 1.
Here we assume a series of motions in which a humanoid mobile robot with two arms and two legs grasps a bottle at the far side of a table.

In this case, the contact point plan setting unit 221 creates a contact point plan with which this series of motions (task) can be executed, based on the contact point plan information specified by the user.
That is, the contact point plan setting unit 221 creates a plan of in what order, where, and how the hand tips and foot tips are to be placed, for example as shown in FIG. 6.
In FIG. 6, marks indicate the places on the floor, the wall, and the table where the foot tips and hand tips make contact.

Specifically, this contact point plan is as shown in FIG. 7.
The contact point plan is time-series data specifying in what order, where, and how the left hand (LH), right hand (RH), left foot (LF), and right foot (RF) are placed.

The correspondence between FIGS. 1, 6, and 7 is briefly described.
Initially (t_0), the robot stands on its left foot alone, swings its right foot (the free leg) forward, and then lands the right foot (t_1).
To make the mobile robot 100 execute a contact point plan that follows this movement, it is necessary to specify the coordinates P_LF1 of the contact point on the floor where the left foot is initially placed, the posture r_LF1 of the left foot at that time, the coordinates P_RF1 of the contact point on the floor where the right foot lands, and the posture r_RF1 of the right foot at that time.

Here, the coordinates of a contact point are expressed as spatial coordinates P = (P_x, P_y, P_z).
The posture is the orientation of the sole of the foot when it lands on the contact point, and is expressed, for example, as a set of Euler angles r = (r_x, r_y, r_z).
(That is, r_x, r_y, and r_z represent the roll, pitch, and yaw angles, respectively.)
The format for specifying the contact point coordinates and the posture of a foot is the same in the rest of the description, so the explanation is omitted below where appropriate.

After standing on both feet, the robot swings its left foot forward (t_2) and lands it ahead (t_4).
In the meantime, it places its left hand on the wall (t_3).
Here, the coordinates P_LH1 of the contact point on the wall where the left hand is placed, and the posture r_LH1 of the left hand at that time, are specified.

The coordinates of this contact point are expressed as spatial coordinates P = (P_x, P_y, P_z), and the posture, which is the orientation of the palm when it reaches the contact point, is expressed as a set of Euler angles r = (r_x, r_y, r_z).

The rest of the contact point plan should be self-evident from a comparison of FIGS. 1, 6, and 7, so its description is omitted.
In this way, the contact point plan setting unit 221 creates the contact point plan as time-series data.
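As a rough illustration of what such time-series data could look like in software, the following minimal sketch stores each contact as a record holding the limb, the contact interval, the position P, and the posture r. The class and function names (`ContactEvent`, `active_contacts`), the use of an explicit end time, and all numeric values are assumptions made for this example; the actual plan of FIG. 7 is not reproduced here.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Vec3 = Tuple[float, float, float]


@dataclass(frozen=True)
class ContactEvent:
    limb: str                # "LH", "RH", "LF", or "RF"
    t_start: float           # time at which the limb makes contact [s]
    t_end: Optional[float]   # time at which it leaves contact; None if it stays
    position: Vec3           # contact point P = (P_x, P_y, P_z)
    posture: Vec3            # Euler angles r = (r_x, r_y, r_z)


def active_contacts(plan: List[ContactEvent], t: float) -> List[ContactEvent]:
    """Return the contacts that are in effect at time t."""
    return [e for e in plan
            if e.t_start <= t and (e.t_end is None or t < e.t_end)]


# Hypothetical values: the left foot supports the robot first,
# then the right foot lands and the left foot lifts off later.
plan = [
    ContactEvent("LF", 0.0, 2.0, (0.0, 0.1, 0.0), (0.0, 0.0, 0.0)),
    ContactEvent("RF", 1.0, None, (0.3, -0.1, 0.0), (0.0, 0.0, 0.0)),
]
```

Querying `active_contacts(plan, t)` for each time in a prediction interval gives the set of contact points among which the contact force has to be distributed at that time.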

The trajectory generation unit 222 generates a center-of-gravity trajectory that realizes the contact point plan set by the contact point plan setting unit 221 as described above. The motion control unit 223 controls the joint motors 1a to 24a so that the mobile robot 100 performs whole-body motion according to the center-of-gravity trajectory generated by the trajectory generation unit 222. The mobile robot 100 can thus move autonomously along a stable center-of-gravity trajectory based on the set contact point plan.

このとき、軌道生成部２２２は、軌道生成にあたってモデル予測制御を行う。
すなわち、軌道生成部２２２は、ある時間幅を持った予測区間内で移動ロボット１００が安定移動できる軌道を生成し、予測区間を微小時間（Δｔ）ずつシフトさせながら安定動作を行える軌道を順次更新していくようにする。 At this time, the trajectory generation unit 222 performs model prediction control when generating the trajectory.
That is, the trajectory generation unit 222 generates a trajectory along which the mobile robot 100 can move stably within a prediction interval of a certain time width, and successively updates this stable trajectory while shifting the prediction interval by a small time step (Δt).

例えば、図８に予測区間の例を示す。
軌道生成部２２２は、現在から所定時間（例えば１．６秒）先の未来までを予測区間として設定する。 For example, FIG. 8 shows an example of a prediction interval.
The trajectory generation unit 222 sets, as the prediction interval, the span from the present to a predetermined time (for example, 1.6 seconds) in the future.

そして、軌道生成部２２２は、この予測区間の間で発散しないように安定な軌道を生成する。
この予測区間での動きをイメージしたものが図９である。
このように、軌道生成部２２２は、ある時間幅を持つ予測区間で安定な軌道を生成した上で、最初の一点だけを現在の入力値として使用する。 Then, the trajectory generation unit 222 generates a stable trajectory that does not diverge within this prediction interval.
FIG. 9 is an image of the motion in the prediction interval.
As described above, the trajectory generation unit 222 generates a stable trajectory in a prediction interval having a certain time width, and uses only the first point as the current input value.

軌道生成部２２２は、次の軌道更新周期（Δｔ秒後）には予測区間をシフトさせ、新たな予測区間において同様に安定な軌道を生成する（図１０参照）。 The trajectory generation unit 222 shifts the prediction interval in the next trajectory update period (after Δt seconds), and similarly generates a stable trajectory in the new prediction interval (see FIG. 10).

現在だけ、あるいは、現在から次ぎの制御周期（Δｔ秒）まで、だけを見るのではなく、上記のように、ある程度の未来までを予測区間とし、この予測区間内で発散しない軌道が生成されるようにする。
これを繰り返すことで移動ロボットは安定に移動することができる。 Rather than looking only at the present, or only from the present to the next control cycle (Δt seconds later), the prediction interval extends some distance into the future as described above, and a trajectory that does not diverge within this prediction interval is generated.
By repeating this, the mobile robot can move stably.
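The shifting prediction interval described above can be sketched as follows. This is a minimal illustration only, not the trajectory generation of the embodiment: the function names `plan_inputs` and `receding_horizon`, the one-dimensional integrator model, and the toy planner that simply steers the state to zero by the end of the interval are all assumptions introduced here. Only the 1.6-second interval and the use of the first planned point come from the description.

```python
def plan_inputs(x0, n_steps, dt):
    """Toy planner (assumption): a constant input that brings the
    integrator x' = u from x0 to zero at the end of the prediction interval."""
    u = -x0 / (n_steps * dt)
    return [u] * n_steps


def receding_horizon(x0, horizon_s=1.6, dt=0.1, n_updates=50):
    """Re-plan over a sliding prediction interval, applying only the first input."""
    n_steps = round(horizon_s / dt)  # number of samples in one prediction interval
    x, history = x0, [x0]
    for _ in range(n_updates):
        inputs = plan_inputs(x, n_steps, dt)  # plan over [now, now + horizon]
        x = x + dt * inputs[0]                # use only the first planned point
        history.append(x)                     # the interval then shifts by dt
    return history


trajectory = receding_horizon(1.0)
```

Each update discards the rest of the plan and plans again from the new state, which is why the state approaches zero gradually rather than following any single plan to its end.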

さて、ここで問題なのは、ある時間幅を持った予測区間のなかで時々刻々と移り変わっていく接触点に応じて接触力を滑らかに適切に分配し、なおかつ、安定な重心軌道を生成するにはどのようにすればよいか、ということである。 The question here is how to distribute the contact forces smoothly and appropriately according to the contact points, which change from moment to moment within a prediction interval of a certain time width, while still generating a stable center-of-gravity trajectory.

本発明者らは、ある予測区間における安定軌道の生成問題をＬＱ（Linear Quadratic）最適化問題（凸二次計画問題：Quadratic Programming: QP）に帰着させるという着想を得た。
具体的には、軌道生成部２２２は、各接触点における接触力の二乗和と、前記６軸力（接触力）の微分値の二乗和と、を含む評価関数Ｊを最小化するというＬＱ最適化問題を解くことで、多点接触移動の安定軌道を求める。 The present inventors arrived at the idea of reducing the problem of generating a stable trajectory in a given prediction interval to an LQ (Linear Quadratic) optimization problem (a convex quadratic programming (QP) problem).
Specifically, the trajectory generation unit 222 obtains a stable trajectory for multipoint contact movement by solving an LQ optimization problem that minimizes an evaluation function J containing the sum of squares of the contact forces at each contact point and the sum of squares of the differential values of those six-axis forces (contact forces).

そこで、次に、この評価関数Ｊの導出およびその解法（ＬＱ最適化問題への帰着）を説明する。
この解法により、ある予測区間内で安定な多点接触移動を実現するための、重心位置、重心速度、接触力および接触力の微分値の時系列データが得られることを示す。（ここからの説明では、まず、接触点計画で指示された通りの位置（接触点）に手足を着くことだけを考える。なお、必要に応じて、スラック変数などを導入し条件式や評価式を緩和するなどの処置を行って接触点を変更してもよい。 Therefore, next, the derivation of the evaluation function J and its solution (reduction to the LQ optimization problem) will be described.
It is shown that this solution yields time-series data of the center-of-gravity position, the center-of-gravity velocity, the contact forces, and the differential values of the contact forces for realizing stable multipoint contact movement within a given prediction interval. (The following explanation first considers only placing the hands and feet at the positions (contact points) specified by the contact point plan. If necessary, the contact points may be changed by, for example, introducing slack variables to relax the constraint expressions or the evaluation expression.)

予測に用いる移動ロボットのモデルを改めて図１１に示す。
移動ロボット全体の慣性を一つの重心Ｇで表わす。各接触点には６軸力を定義する。 A model of the mobile robot used for the prediction is shown again in FIG.
The inertia of the entire mobile robot is represented by one center of gravity G. A 6-axis force is defined for each contact point.

この時、重心Ｇの並進運動量をＰ、重心回りの回転運動量（角運動量）をＬ、接触点の数をｎとすると、運動方程式は次のように書ける。 At this time, if the translational momentum of the center of gravity G is P, the rotational momentum (angular momentum) around the center of gravity is L, and the number of contact points is n, the equation of motion can be written as follows.
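The equation image for equation (1) is not reproduced in this text. As a hedged sketch only, a standard centroidal form consistent with the definitions above would read as follows. The mass m, the gravitational acceleration g, the vertical unit vector e_z, the contact position p_i, and the split of each six-axis force into a force f_i and a moment τ_i are assumptions introduced here for illustration, and the sign conventions of the original may differ.

```latex
\dot{P} = \sum_{i=1}^{n} f_i \;-\; m g\, e_z ,
\qquad
\dot{L} = \sum_{i=1}^{n} \bigl( (p_i - G) \times f_i + \tau_i \bigr),
\qquad
P = m \dot{G}
```

Decomposed into x, y, and z components, this form gives six scalar equations, which matches the later reference to a first through sixth equation.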

添え字ｉは接触点のインデックスを表す。
例えば接触点の候補が左手、右手、左足、右足の４点であれば、ｎ＝４（左手：ＬＨ＝１、右手：ＲＨ＝２、左足：ＬＦ＝３、右足：ＲＦ＝４）とすればよい。ただし、床や壁に接触していない接触点候補については接触力を０にするように拘束条件を設定しておく。例えば図１１の例であれば次のようにする。 The subscript i represents the index of the contact point.
For example, if the contact point candidates are the four points left hand, right hand, left foot, and right foot, n = 4 is used (left hand: LH = 1, right hand: RH = 2, left foot: LF = 3, right foot: RF = 4). For contact point candidates that are not in contact with the floor or a wall, however, a constraint is set so that the contact force is zero. For the example of FIG. 11, this is done as follows.

（１）式の第１式、第２式を微分すると次の式が得られる。（（１）式はベクトルで表現しているが、これをｘ、ｙ、ｚに分解した上で、上から順に第１式、第２式・・・第６式と称する。） Differentiating the first and second equations of equation (1) yields the following equation. (Equation (1) is written in vector form; after it is decomposed into x, y, and z components, the resulting equations are referred to, from the top, as the first equation, the second equation, ..., and the sixth equation.)

本実施形態では、この２式をシステムとして用いる。そして、（１）式の第３から第５式を拘束条件として定式化する。 In this embodiment, these two equations are used as the system. The third to fifth equations of equation (1) are then formulated as constraint conditions.

さらに、予測区間内を図１２のように、Ｎ個の区間に分割し、（３）式、（４）式を離散化する。（３）式を離散化すると次のようになる。 Further, the prediction interval is divided into N intervals as shown in FIG. 12, and equations (3) and (4) are discretized. The equation (3) is discretized as follows.

また、サンプリング点で常に（４）式の拘束が成り立つとすると、（４）式は次のように離散化される。 Further, if the constraint of equation (4) always holds at the sampling point, equation (4) is discretized as follows.

ここで、パラメータを次ぎのように置く。 Here, the parameters are set as follows.

θ_ｉは、６軸力としての接触力を並べたベクトルである。そして、ｘは、重心Ｇのｘ座標、重心Ｇのｘ軸方向速度、重心Ｇのｙ座標、重心Ｇのｙ軸方向速度、および、各接触点における接触力（６軸力）、を並べたベクトルである。このｘを、重心の状態変数ｘと称する。さらに、ｕは、接触力（６軸力）の微分値を並べたベクトルである。 θ_i is a vector in which the components of the contact force, as a six-axis force, are arranged. x is a vector in which the x coordinate of the center of gravity G, the x-axis velocity of G, the y coordinate of G, the y-axis velocity of G, and the contact force (six-axis force) at each contact point are arranged. This x is referred to as the state variable x of the center of gravity. Furthermore, u is a vector in which the differential values of the contact forces (six-axis forces) are arranged.

このようにパラメータを設定すると、（５）式を次ぎの状態方程式として記述することができる。 With the parameters set in this way, equation (5) can be written as the following state equation.

この（８）式は、（ｊ＋１）のときの状態変数ｘを、その一つ前の状態で記述できる。（８）式を用いて予測区間内の状態変数ｘを順に計算していくと次のようになる。 Equation (8) expresses the state variable x at step (j + 1) in terms of the state one step earlier. Computing the state variables x in the prediction interval in order using equation (8) gives the following.

したがって、時系列的に求められる状態変数ｘを並べて大文字のＸで表わすと、状態変数の時系列データＸを次のように表わすことができる。 Therefore, when the state variables x obtained in time series are arranged and represented by capital letters X, the time series data X of the state variables can be represented as follows.
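The step-by-step use of the state equation can be sketched as follows. This is an illustration under stated assumptions, not the matrices of equation (8): it models a single axis with a single contact force, takes the state as [G, dG/dt, f] and the input as u = df/dt, and uses the exact discretisation of m·d³G/dt³ = df/dt for an input held constant over each step. The names `build_model` and `rollout` are hypothetical.

```python
def matvec(M, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in M]


def build_model(mass, dt):
    """Assumed one-axis, one-contact model with state [G, dG/dt, f], input u = df/dt."""
    A = [[1.0, dt, dt ** 2 / (2.0 * mass)],
         [0.0, 1.0, dt / mass],
         [0.0, 0.0, 1.0]]
    B = [dt ** 3 / (6.0 * mass), dt ** 2 / (2.0 * mass), dt]
    return A, B


def rollout(A, B, x0, inputs):
    """Apply x[j+1] = A x[j] + B u[j] in order and stack the states as the time series X."""
    X, x = [], list(x0)
    for u in inputs:
        x = [ax + b * u for ax, b in zip(matvec(A, x), B)]
        X.append(x)
    return X


A, B = build_model(mass=2.0, dt=0.1)
# Constant force of 4 N on a 2 kg mass, no change in force, for 10 steps (1 second).
X = rollout(A, B, x0=[0.0, 0.0, 4.0], inputs=[0.0] * 10)
```

Because every state in X is a linear function of the initial state and the inputs, the stacked time series can be written as a single linear expression in the inputs, which is what makes the problem an LQ problem.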

この（１０）式は、接触力の微分値（ｕ［ｋ］）を入力として、ある予測区間内における移動ロボットの状態遷移を表わす予測モデルとなる。なお、上記（１０）式において、接触力を入力してもよい。この場合、状態変数ｘは、重心位置と重心速度のみを含むこととなる。また、上記（３）式は、Ｇ（２ドット）（２階微分）とｆとの関係式となり、この関係式と、上記（５）式のｆ（ドット）の項を０にした式とから、上記（８）式のような線形の状態方程式が導出できる。
さて、ここで、本発明者らは、予測区間内において安定な軌道を生成するために次ぎのような評価関数Ｊの評価基準を導入した。 Equation (10) is a prediction model that represents the state transition of the mobile robot within a given prediction interval, with the differential value of the contact force (u[k]) as the input. The contact force itself may instead be used as the input in equation (10). In that case, the state variable x contains only the center-of-gravity position and the center-of-gravity velocity. Equation (3) then becomes a relation between the second derivative of G and f, and a linear state equation like equation (8) can be derived from this relation together with equation (5) with its term in the derivative of f set to zero.
Now, the present inventors have introduced the following evaluation criteria for the evaluation function J in order to generate a stable trajectory within the prediction interval.

なお、Ｑ_ｉ、Ｒ_ｉは、適宜設定した重みである。例えば、接触点候補すべてに力を均等配分した場合、Ｑ_ｉはすべて１となり、Ｒ_ｉはすべて１×１０^−６と設定できる。 Q _i and R _i are weights set as appropriate. For example, when the force is evenly distributed to all the contact point candidates, Q _i is all 1 and R _i can be set to 1 × 10 ⁻⁶ .

ここで、θ_ｉは、６軸力としての接触力の成分を並べたベクトルであった。したがって、上記（１１）式は、「予測区間内で、接触力（６軸力）と接触力の微分値との２乗和を最小化する」という意味の式である。上記（１１）式の第１項は、接触力（６軸力）の２乗和を最小化することを意味する。 Here, θ _i is a vector in which components of contact force as 6-axis force are arranged. Therefore, the above expression (11) is an expression that means “minimize the sum of squares of the contact force (6-axis force) and the differential value of the contact force within the prediction interval”. The first term of the equation (11) means minimizing the sum of squares of the contact force (six-axis force).

この第１項には、次の作用が含まれている。
（１）各接触点への接触力を均等分配すること。これにより、重心をできる限り安定な位置に動かすという効果がある。
（２）不必要な内力を打ち消すこと。
（３）接触点の接地安定性を高めること。すなわち、接触面内の反力中心点を接触面の中心に設定するという効果がある。 This first term includes the following actions.
(1) Distribute the contact force to each contact point evenly. This has the effect of moving the center of gravity to the most stable position possible.
(2) To cancel unnecessary internal forces.
(3) To improve the grounding stability of the contact point. That is, there is an effect that the reaction force center point in the contact surface is set to the center of the contact surface.
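The equal-distribution effect of the first term can be checked with a small numerical example. This is a sketch only: it considers two contact points that must jointly carry a fixed load, with the weight Q_i = 1 as in the example above. The names `force_cost` and `split_load` and the load of 100 are assumptions introduced here.

```python
def force_cost(forces, weight=1.0):
    """Sum of squared contact forces, i.e. the first term of J with Q_i = weight."""
    return sum(weight * f * f for f in forces)


def split_load(total, share):
    """Two contact points carrying `total` between them; `share` goes to the first."""
    return [share * total, (1.0 - share) * total]


total_load = 100.0
costs = {share: force_cost(split_load(total_load, share))
         for share in (0.1, 0.3, 0.5, 0.7, 0.9)}
best_share = min(costs, key=costs.get)  # the share with the smallest sum of squares
```

Among the splits tried, the even split gives the smallest sum of squares, which illustrates why minimizing this term distributes the contact force evenly when the force balance fixes the total.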

また、上記（１１）式の第２項は、接触力（６軸力）微分値（６軸力の時間変化率）の２乗和を最小化することを意味する。 The second term of the above equation (11) means minimizing the sum of squares of the contact force (6-axis force) differential value (time change rate of 6-axis force).

この第２項には次の作用が含まれている。
（１）重心の発散を抑制すること。
（２）滑らかに接触力を切り替えていくこと。 This second term includes the following actions.
(1) To suppress the divergence of the center of gravity.
(2) Switching contact force smoothly.

これらをＱ、Ｒという重みによって適切に足し合わせることによって、この評価関数Ｊを最小化するということは、
「高い接触安定性、滑らかな接触力遷移、最低限の内力、といった条件を満たしながら、安定な重心軌道と各接触点の接触力とを出力する」
ということを意味することとなる。 Minimizing this evaluation function J, in which these terms are combined appropriately through the weights Q and R, therefore means
"outputting a stable center-of-gravity trajectory and the contact force at each contact point while satisfying conditions such as high contact stability, smooth contact force transitions, and minimal internal force."

上記（１１）式を離散化し一般的な形式に書き換えると、次の評価関数（１２）式が得られる。

When the above equation (11) is discretized and rewritten into a general form, the following evaluation function (12) is obtained.

次に、移動ロボットの力の釣合いの拘束を示す等式制約条件（拘束制約条件）について考える。
等式制約条件としては、
（１）移動ロボットの非接触の接触点候補に対して６軸力が０という拘束、
（２）移動ロボットの鉛直方向の力の釣り合いの拘束、および、
（３）移動ロボットのｘｙ軸回りのモーメント力の釣り合いの拘束、
が予測区間の全サンプリング点に渡って成り立つ必要がある。 Next, consider the equality constraints (binding constraints) that express the force balance of the mobile robot.
As equality constraints,
(1) the constraint that the six-axis force is zero for contact point candidates of the mobile robot that are not in contact,
(2) the constraint of force balance in the vertical direction of the mobile robot, and
(3) the constraint of moment balance about the x and y axes of the mobile robot
must hold at every sampling point in the prediction interval.

ここで、例えば、あるサンプリング点ｋにおいて、ｉ番目とｉ＋２番目の接触点が非接触であったとする。
この時、上記等式制約条件（１）乃至（３）は、下記（１３）式のように記述できる。

Here, for example, it is assumed that the i-th and i + 2th contact points are non-contact at a certain sampling point k.
At this time, the equality constraints (1) to (3) can be described as the following equation (13).

なお、係数行列Ｃ_ｋ、ｄ_ｋの成分はサンプリング点によって異なり、接触点候補の接触／非接触といった情報や接触点位置は接触点計画設定部２２１によって設定される。例えば、上記（１３）式のｐ_ｉｘ［ｋ］、ｐ_ｉｙ［ｋ］、ｐ_ｉｚ［ｋ］は、接触点計画設定部２２１によって設定される。 Note that the components of the coefficient matrices C _k and d _k differ depending on the sampling points, and information such as contact / non-contact of contact point candidates and contact point positions are set by the contact point plan setting unit 221. For example, p _ix [k], p _iy [k], and p _iz [k] in the above equation (13) are set by the contact point plan setting unit 221.
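How the three equality constraints could be assembled into C_k and d_k at one sampling point can be sketched as follows. The matrix of equation (13) is not reproduced in this text, so this is an assumed construction, not the original one: it assumes a constant center-of-gravity height, zero rate of change of angular momentum, the state layout [Gx, dGx, Gy, dGy, then (fx, fy, fz, tx, ty, tz) per candidate], and it substitutes the vertical force balance into the moment equations to keep them linear. The function name `build_constraint` and all numerical values are hypothetical.

```python
GRAVITY = 9.8  # m/s^2 (assumed value)


def build_constraint(contacts, in_contact, mass, com_height):
    """Return (C, d) such that C x = d encodes constraints (1) to (3).

    contacts: list of contact positions (px, py, pz), one per candidate.
    in_contact: list of bool, True where the candidate is touching.
    """
    n_x = 4 + 6 * len(contacts)
    rows, rhs = [], []
    # (1) every component of a non-contact candidate's six-axis force is zero
    for i, touching in enumerate(in_contact):
        if not touching:
            for axis in range(6):
                row = [0.0] * n_x
                row[4 + 6 * i + axis] = 1.0
                rows.append(row)
                rhs.append(0.0)
    vertical = [0.0] * n_x  # (2) sum of fz = m g
    moment_x = [0.0] * n_x  # (3) moment about the x axis through G = 0
    moment_y = [0.0] * n_x  # (3) moment about the y axis through G = 0
    moment_x[2] = -mass * GRAVITY  # -m g Gy, after substituting sum of fz = m g
    moment_y[0] = mass * GRAVITY   # +m g Gx
    for i, (px, py, pz) in enumerate(contacts):
        base = 4 + 6 * i
        vertical[base + 2] = 1.0
        moment_x[base + 1] = -(pz - com_height)  # -(pz - Gz) fy
        moment_x[base + 2] = py                  # +py fz
        moment_x[base + 3] = 1.0                 # +tx
        moment_y[base + 0] = pz - com_height     # +(pz - Gz) fx
        moment_y[base + 2] = -px                 # -px fz
        moment_y[base + 4] = 1.0                 # +ty
    rows += [vertical, moment_x, moment_y]
    rhs += [mass * GRAVITY, 0.0, 0.0]
    return rows, rhs


# Candidates in the order LH, RH, LF, RF; only the two feet are in contact.
contacts = [(0.3, 0.4, 1.0), (0.3, -0.4, 1.0), (0.0, 0.1, 0.0), (0.0, -0.1, 0.0)]
C, d = build_constraint(contacts, [False, False, True, True], mass=10.0, com_height=0.8)
```

With both hands out of contact, C has 12 rows that force the hand forces to zero plus three balance rows, so the number of rows of C_k changes as the contact state changes from one sampling point to the next.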

以上から、現在の状態量（状態変数の初期値）をｘ_０とすると、上記（８）式、（１２）式、及び（１３）式より、最適制御装置２１０の軌道生成部２２２は、下記（１４）式に示す等式制約条件付きＬＱ最適化問題を求解し、重心軌道を生成することとなる。

なお、上記（１４）式において、１行目の式（ｍｉｎＪ＝・・）は、上述の如く、予測区間内において、接触力と接触力の微分値との２乗和を最小化するという意味の式である。２行目の式（ｘ［ｋ＋１］＝・・）は、接触力の微分値の入力と重心の状態変数と関係を示す状態方程式である。３行目の式（Ｃ_ｋｘ［ｋ］＝ｄ_ｋ）は、移動ロボットの力の釣合いの拘束を示す等式制約条件である。 From the above, letting x_0 denote the current state quantity (the initial value of the state variable), the trajectory generation unit 222 of the optimal control device 210 solves the LQ optimization problem with equality constraints shown in the following equation (14), which is assembled from equations (8), (12), and (13), and generates the center-of-gravity trajectory.

In the above formula (14), the formula (minJ = ...) in the first row means that the sum of squares of the contact force and the differential value of the contact force is minimized within the prediction interval as described above. It is a formula. The expression (x [k + 1] =...) In the second row is a state equation indicating the relationship between the input of the differential value of the contact force and the state variable of the center of gravity. The expression (C _k x [k] = d _k ) in the third row is an equality constraint condition indicating the constraint of the balance of forces of the mobile robot.

ところで、上述のように移動ロボットの最適制御装置は、多点接触で安定的な動作軌道を生成するためにモデル予測制御を行っている。このモデル予測制御では、移動ロボットの物理的な制約条件（上述の等式制約条件）が設定される。そして、最適制御装置は、制御周期毎にＬＱ最適化問題を求解し、その求解した最適解に基づいて制御を行なっている。しかし、この最適解の求解において、従来、多大な時間を要し、モデル予測制御の周期（軌道更新の周期）に遅延が生じ、制御性能を上げることができないという問題が生じていた。 By the way, as described above, the optimal control device for a mobile robot performs model predictive control in order to generate a stable motion trajectory with multipoint contact. In this model predictive control, a physical constraint condition (the above-described equality constraint condition) of the mobile robot is set. Then, the optimal control device solves the LQ optimization problem for each control cycle, and performs control based on the obtained optimal solution. However, the solution of the optimum solution has conventionally required a lot of time, causing a delay in the cycle of model predictive control (orbit update cycle), resulting in a problem that the control performance cannot be improved.

これに対し本実施形態においては、直交補空間を用いて等式制約条件付きのＬＱ最適化問題を無制約条件のＬＱ最適化問題に変換する。そして、最適制御装置２１０の軌道生成部２２２は、この変換した無制約条件のＬＱ最適化問題をリカッチ型再帰的計算法（Riccati recursion）を用いて解き、最適解を求解する。そして、軌道生成部２２２は、求解した最適解に基づいて重心の状態変数を算出し、該算出した重心の状態変数に基づいて重心軌道を生成する。
直交補空間を用いて無制約条件のＬＱ最適化問題に変換することで、その求解に高速かつ安定的なリカッチ型再帰的計算法を用いることができる。これにより、モデル予測制御においてＬＱ最適化問題の最適解を高速に求解し重心軌道を生成できる。 In contrast, in the present embodiment, an LQ optimization problem with equality constraints is converted into an unconstrained LQ optimization problem using orthogonal complement space. Then, the trajectory generation unit 222 of the optimal controller 210 solves the converted unconstrained LQ optimization problem using the Riccati recursive calculation method (Riccati recursion) to find the optimal solution. Then, the trajectory generation unit 222 calculates the state variable of the center of gravity based on the obtained optimal solution, and generates the center of gravity trajectory based on the calculated state variable of the center of gravity.
Converting the problem to an unconstrained LQ optimization problem using the orthogonal complement makes it possible to solve it with the fast and stable Riccati-type recursive calculation method. As a result, the optimal solution of the LQ optimization problem can be found at high speed in model predictive control, and the center-of-gravity trajectory can be generated.

なお、上記Riccati recursionは、最適化問題を行列表現した式に変換し、その変換した行列表現の式の最適解条件（ＫＫＴ（Karush-Kuhn- Tucker）条件）を示す式に対して再帰的計算を行うことにより、最適化問題の最適解を高速に求解するものである。詳細な計算方法については、既に、非特許文献（Parallel Implementation of Riccati Recursion for Solving Linear-Quadratic, Gianluca Frison John Bagterp Jorgensen）などに開示されており、これを援用できるものとする。 The Riccati recursion converts the optimization problem into matrix form and performs a recursive calculation on the expression giving the optimality conditions (KKT (Karush-Kuhn-Tucker) conditions) of that matrix form, thereby finding the optimal solution of the optimization problem at high speed. The detailed calculation method has already been disclosed in non-patent literature (Parallel Implementation of Riccati Recursion for Solving Linear-Quadratic, Gianluca Frison John Bagterp Jorgensen) and elsewhere, and is incorporated here by reference.

ここで、最初に、上述した直交補空間について詳細に説明する。直交補空間は、以下（１）−（３）のように定義される。
（１）２つの部分空間Ｖ及びＵの基底｛ｖ_ｉ｝^ｋ _ｉ＝１および｛ｕ｝^ｍ _ｉ＝１に含まれるベクトルが線形独立であるとき、基底｛ｖ_ｉ∈Ｒ^ｎ｝^ｋ _ｉ＝１∪で張られる部分空間をＶとＵの直和（direct sum）といい、Ｕ（＋）Ｖと表記する。以下、○の中に＋を（＋）と表記する。特に、Ｒ^ｎ＝Ｒ^ｋ＋ｍ＝Ｖ（＋）Ｕが成立するとき、ＵをＶの補空間（complement）という。
（２）部分空間Ｖ⊂Ｒ^ｎと部分空間Ｕ⊂Ｒ^ｎとが、_ｖＴ_ｕ＝０ for all ｖ ∈ Ｖ、all u ∈ Ｕを満たすとき、２つの部分空間は直交するという。
（３）部分空間Ｖとその補空間Ｕが直交するとき、ＵをＶの直交補空間（orthogonal complement）といい、Ｖ^⊥と表記する。
上記定義に基づいて下記命題（４）−（５）が成立する。
（４）線形独立なｍ（＜ｎ）個のベクトル｛ｙ_ｉ｝^ｍ _ｉ＝１と直交するベクトル集合α＝｛ｘ∈Ｒ^ｎ｜ｙ^Ｔ _１ｘ＝ｙ^Ｔ _２ｘ＝・・・＝ｙ^Ｔ _ｍｘ＝０｝は、ｎ−ｍ次元部分空間である。
（５）非直交基底｛ｕ_ｉ∈Ｒ^ｎ｝^ｎ _ｉ＝１からｍ個選択された基底ベクトルによって張られる部分空間Ｖ＝＜ｕ_１、ｕ_２、・・・、ｕ_ｍ＞の直交補空間は、その双直交基底｛ｖ_ｉ∈Ｒ^ｎ｝^ｎ _ｉ＝１によって、Ｖ^⊥＝＜ｖ_ｍ＋１、ｖ_ｍ＋２、・・・、ｖ_ｎ＞で表される。 Here, first, the above-described orthogonal complement space will be described in detail. The orthogonal complementary space is defined as (1)-(3) below.
(1) When the vectors contained in the bases {v_i}_{i=1}^{k} and {u_i}_{i=1}^{m} of two subspaces V and U are linearly independent, the subspace spanned by {v_i ∈ R^n}_{i=1}^{k} ∪ {u_i ∈ R^n}_{i=1}^{m} is called the direct sum of V and U and is written U (+) V. Hereinafter, a + enclosed in a circle is written (+). In particular, when R^n = R^{k+m} = V (+) U holds, U is called the complement of V.
(2) When a subspace V ⊂ R^n and a subspace U ⊂ R^n satisfy v^T u = 0 for all v ∈ V and all u ∈ U, the two subspaces are said to be orthogonal.
(3) When a subspace V and its complement U are orthogonal, U is called the orthogonal complement of V and is written V^⊥.
The following propositions (4) and (5) hold based on the above definitions.
(4) The set α = {x ∈ R^n | y_1^T x = y_2^T x = ... = y_m^T x = 0} of vectors orthogonal to m (< n) linearly independent vectors {y_i}_{i=1}^{m} is an (n − m)-dimensional subspace.
(5) The orthogonal complement of the subspace V = <u_1, u_2, ..., u_m> spanned by m basis vectors selected from a non-orthogonal basis {u_i ∈ R^n}_{i=1}^{n} is expressed, using its biorthogonal basis {v_i ∈ R^n}_{i=1}^{n}, as V^⊥ = <v_{m+1}, v_{m+2}, ..., v_n>.

上記命題を簡略して説明すると、Ｃ_ｋ∈Ｒ^{ｍｋ×ｎｘ}の直交補空間Ｃ^⊥ _ｋ∈Ｒ^{ｎｘ×（ｎｘ−ｍｋ）}とは、ｎ_ｘ×ｎ_ｘの線形空間のうち、Ｃ_ｋの残りの空間（補空間）でＣ_ｋに直交する空間である。この直交補空間を用いて上記（１４）式のＬＱ最適化問題を変換することで、図１３に示す如く、等式制約条件Ｃ_ｋｘ＝ｄ_ｋ上に存在するｘを、Ｃ_ｋに平行なベクトルζと直交しＣ_ｋに終端する定数ベクトルσで表すことができる。換言すると、直交補空間を用いて、ｘをζに変数変換することで、ζをどのように動かしても必ず等式制約条件Ｃ_ｋｘ＝ｄ_ｋは満たされることとなる。このため、この等式制約条件を考慮することなく無制約条件でＬＱ最適化問題を求解できる。 Put simply, the orthogonal complement C_k^⊥ ∈ R^{n_x × (n_x − m_k)} of C_k ∈ R^{m_k × n_x} is the part of the n_x × n_x linear space that remains outside C_k (the complement) and is orthogonal to C_k. By transforming the LQ optimization problem of equation (14) using this orthogonal complement, any x lying on the equality constraint C_k x = d_k can be expressed, as shown in FIG. 13, by a vector ζ parallel to C_k together with a constant vector σ that is orthogonal to ζ and terminates on C_k. In other words, once the variable x is changed to ζ using the orthogonal complement, the equality constraint C_k x = d_k is always satisfied no matter how ζ is moved. The LQ optimization problem can therefore be solved as an unconstrained problem, without considering this equality constraint.

次に、上述した直交補空間を用いた変換方法（以下、直交補空間変換と称す）について詳細に説明する。
本実施形態において、例えば、下記（１５）式に示すＱＲ分解（直交行列Ｑと上三角形行列Ｒの積に分解）を用いて直交補空間変換を行うことができる。

Next, a conversion method using the above-described orthogonal complementary space (hereinafter referred to as orthogonal complementary space conversion) will be described in detail.
In the present embodiment, for example, orthogonal complementary space transformation can be performed using QR decomposition (decomposition into the product of the orthogonal matrix Q and the upper triangular matrix R) shown in the following equation (15).

以上から、等式制約条件付きＬＱ最適化問題を直交補空間に投影することで、直交補空間変換を行い無制約条件のＬＱ最適化問題を次のように導出する。
まず、等式制約条件を示す上記（１３）式（Ｃ_ｋｘ［ｋ］＝ｄ_ｋ）をＱＲ分解することで、状態変数ｘの変換式である下記（１６）式が導出される。

From the above, by projecting the LQ optimization problem with equality constraints onto the orthogonal complement space, the orthogonal complement space transformation is performed and the unconstrained LQ optimization problem is derived as follows.
First, the following equation (16), which is a conversion equation for the state variable x, is derived by performing QR decomposition on the above equation (13) (C _k x [k] = d _k ) indicating the equality constraint condition.
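The change of variables from x to ζ can be sketched as follows. This is an illustration under stated assumptions rather than the computation of equations (15) and (16), which are not reproduced in this text: the QR factors are obtained here by modified Gram-Schmidt in plain Python instead of a library QR routine, C is assumed to have full row rank, and the names `orthonormalize`, `complement_parametrization`, and `restore_state` are hypothetical. The returned D and e play the roles of D_k and e_k in x[k] = D_k ζ[k] + e_k.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def orthonormalize(vectors, tol=1e-10):
    """Modified Gram-Schmidt; numerically dependent vectors are dropped."""
    basis = []
    for v in vectors:
        w = [float(x) for x in v]
        for q in basis:
            c = dot(q, w)
            w = [wi - c * qi for wi, qi in zip(w, q)]
        norm = dot(w, w) ** 0.5
        if norm > tol:
            basis.append([wi / norm for wi in w])
    return basis


def complement_parametrization(C, d):
    """Return (D, e) with C (D zeta + e) = d for every zeta (C of full row rank).

    D is a list of orthonormal vectors spanning the orthogonal complement
    of the rows of C; e is a particular solution lying in the row space.
    """
    m, n = len(C), len(C[0])
    identity = [[float(i == j) for j in range(n)] for i in range(n)]
    basis = orthonormalize(list(C) + identity)  # rows of C first, then completion
    Q1, D = basis[:m], basis[m:]
    # Forward substitution on R1^T y = d, where R1[j][i] = Q1[j] . C[i]
    y = []
    for i in range(m):
        s = d[i] - sum(dot(Q1[j], C[i]) * y[j] for j in range(i))
        y.append(s / dot(Q1[i], C[i]))
    e = [sum(y[j] * Q1[j][t] for j in range(m)) for t in range(n)]
    return D, e


def restore_state(D, e, zeta):
    """x = D zeta + e."""
    return [e_t + sum(z * col[t] for z, col in zip(zeta, D)) for t, e_t in enumerate(e)]


C = [[1.0, 1.0, 0.0], [0.0, 1.0, 1.0]]
d = [2.0, 3.0]
D, e = complement_parametrization(C, d)
x = restore_state(D, e, zeta=[2.5])  # satisfies C x = d for any choice of zeta
```

With two independent constraints on three variables, one free direction remains, so ζ is a scalar here. The particular solution e is orthogonal to that direction, corresponding to the constant vector σ in the geometric description of FIG. 13.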

次に上記状態方程式（８）式の左からＣ_ｋ＋１を掛けると下記（１７）式が導出される。
Ｃ_ｋ＋１ｘ［ｋ＋１］＝Ｃ_ｋ＋１Ａｘ［ｋ］＋Ｃ_ｋ＋１Ｂｕ［ｋ］・・・（１７）
さらに、上記（１７）式に上記（１３）式を代入して下記（１８）式を導出する。
Ｃ_ｋ＋１Ａｘ［ｋ］＋Ｃ_ｋ＋１Ｂｕ［ｋ］＝ｄ_ｋ＋１・・・（１８）
（ｋ＝０、１、・・・、Ｎ−１） Next, when the state equation (8) is multiplied by C _{k + 1} from the left, the following equation (17) is derived.
C_{k+1} x[k+1] = C_{k+1} A x[k] + C_{k+1} B u[k] ... (17)
Further, the following equation (18) is derived by substituting the above equation (13) into the above equation (17).
C_{k+1} A x[k] + C_{k+1} B u[k] = d_{k+1} ... (18)
(k = 0, 1, ..., N−1)
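Why equation (18) leads to an input of the form u[k] = N_k ζ[k] + M_k v[k] + l_k can be seen from the following sketch. It is not the definition given in equation (21), which is not reproduced in this text: it assumes x[k] = D_k ζ[k] + e_k, assumes that C_{k+1} B has full row rank, and writes the particular solution with a pseudo-inverse (·)^+ in place of the explicit QR factors.

```latex
\begin{aligned}
C_{k+1} B\, u[k] &= d_{k+1} - C_{k+1} A \bigl( D_k \zeta[k] + e_k \bigr) \\
u[k] &= \underbrace{-\,(C_{k+1} B)^{+}\, C_{k+1} A D_k}_{N_k}\, \zeta[k]
        \;+\; M_k\, v[k]
        \;+\; \underbrace{(C_{k+1} B)^{+} \bigl( d_{k+1} - C_{k+1} A e_k \bigr)}_{l_k},
\qquad C_{k+1} B\, M_k = 0
\end{aligned}
```

Under these assumptions the columns of M_k span the orthogonal complement of C_{k+1} B, so the new input v[k] can be chosen freely without violating the equality constraint at step k + 1.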

上記変換と同様に、Ｃ_ｋ＋１Ｂの直交補空間を用いて変数変換を行う。
Ｃ_ｋ＋１Ｂを下記（１９）式に示すようにＱＲ分解する。

Similar to the above transformation, variable transformation is performed using an orthogonal complementary space of C _{k + 1} B.
QR decomposition is performed on C _{k + 1} B as shown in the following equation (19).

上記（１９）式を用いて上記（１８）式を変換し（ＱＲ分解を行い）、入力ｕの変換式である下記（２０）式を導出する。

但し、上記（２０）式における各パラメータを下記（２１）式に示すように設定する。

ｋ＝０のときは、上記（２０）式における各パラメータを下記（２２）式に示すように設定する。

The above equation (18) is converted using the above equation (19) (QR decomposition is performed), and the following equation (20), which is a conversion equation for the input u, is derived.

However, each parameter in the above equation (20) is set as shown in the following equation (21).

When k = 0, each parameter in the above equation (20) is set as shown in the following equation (22).

上記（１６）式の左からＤ^Ｔ _ｋを掛けて下記（２３）式を導出する。

但し、上記（２３）式において、正規直交性から下記（２４）式が成立する。

The following equation (23) is derived by multiplying the above equation (16) from the left by D_k^T.

However, in the above equation (23), the following equation (24) is established from orthonormality.

以上より、上記（８）式を上記（１６）式、（２０）式、及び（２３）式を用いて変形し、状態方程式の変換式である下記（２５）式を導出する。

但し、上記（２５）式における各パラメータを下記（２６）式に示すように設定する。

ｋ＝０のときは、上記（２５）式における各パラメータを下記（２７）式に示すように設定する。

From the above, the above equation (8) is transformed using the above equations (16), (20), and (23), and the following equation (25), which is a conversion equation of the state equation, is derived.

However, each parameter in the above equation (25) is set as shown in the following equation (26).

When k = 0, each parameter in the above equation (25) is set as shown in the following equation (27).

また、上記（１６）式及び（２０）式を用いて、上記（１２）式に示す評価関数ＪのΣの項は、下記（２８）式に示すように変形できる。

但し、ｋ＝０のときは、下記（２９）式が成立する。

また、ｋ＝Ｎのときは、下記（３０）式が成立する。

In addition, using the above equations (16) and (20), the Σ term of the evaluation function J shown in the above equation (12) can be modified as shown in the following equation (28).

However, when k = 0, the following equation (29) is established.

When k = N, the following equation (30) is established.

上記（１６）式及び（２０）式を用いて上記（１２）式に示す評価関数Ｊを変形し、評価関数の変換式である下記（３１）式を導出する。

但し、上記（３１）式における各パラメータを下記（３２）式に示すように設定する。

The evaluation function J shown in the above equation (12) is transformed using the above equations (16) and (20), and the following equation (31), which is a conversion equation of the evaluation function, is derived.

However, each parameter in the above equation (31) is set as shown in the following equation (32).

以上のように、等式制約条件付きＬＱ最適化問題に対して直交補空間変換を行い、下記（３３）式に示す無制約条件のＬＱ最適化問題を導出できる。すなわち、直交補空間変換を行うことで、上記（１４）式に示す等式制約条件付きＬＱ最適化問題を、下記（３３）式に示す無制約条件のＬＱ最適化問題に変換できる。本実施形態に係る軌道生成部２２２は、下記（３３）式に示す無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて最適解を高速に求解できる。

As described above, the orthogonal complementary space transformation is performed on the LQ optimization problem with equality constraints, and the unconstrained LQ optimization problem expressed by the following equation (33) can be derived. That is, by performing orthogonal complementary space transformation, the LQ optimization problem with equality constraints shown in the above equation (14) can be converted into an unconstrained LQ optimization problem shown in the following equation (33). The trajectory generation unit 222 according to the present embodiment can find an optimal solution for the unconstrained LQ optimization problem expressed by the following equation (33) at high speed using the Riccati-type recursive calculation method.

次に、上記直交補空間変換により変換した無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて求解する方法を説明する。
まず、上記（３３）式を行列表現すると、下記（３４）式及び（３５）式のように表現できる。

Next, a method for solving the unconstrained LQ optimization problem converted by the orthogonal complementary space transform using the Riccati-type recursive calculation method will be described.
First, when the above equation (33) is expressed as a matrix, it can be expressed as the following equations (34) and (35).

上記（３４）式及び（３５）式の最適解条件（ＫＫＴ条件）は、下記（３６）式となる。但し、下記（３７）式は、上記（３５）式のラグランジュ乗数である。

The optimum solution condition (KKT condition) of the above equations (34) and (35) is the following equation (36). However, the following equation (37) is a Lagrange multiplier of the above equation (35).

軌道生成部２２２は、上記（３６）式に示す式に対して、次のように、再帰的計算を行うことで、上記無制約条件のＬＱ最適化問題を高速かつ安定的に求解する。
まず、軌道生成部２２２は、上記（３６）式の行列内の各パラメータの順番を入れ替えることで、下記（３８）式のように表現する。

The trajectory generation unit 222 solves the unconstrained LQ optimization problem at high speed and stably by performing recursive calculation on the expression shown in the above expression (36) as follows.
First, the trajectory generation unit 222 reorders the parameters in the matrix of the above equation (36) and rewrites it as the following equation (38).

そして、軌道生成部２２２は、上記（３８）式に対して、下記（３９）式に示す再帰計算を繰り返す。

Then, the trajectory generation unit 222 repeats the recursive calculation shown in the following equation (39) with respect to the above equation (38).

上記再帰計算を繰り返すことで、上記（３８）式は、下記（４０）式のように変形される。

By repeating the recursive calculation, the equation (38) is transformed into the following equation (40).

さらに、軌道生成部２２２は、上記（４０）式に対して、下記（４１）式に示す再帰計算を行うことで、上記（３３）式に示すＬＱ最適化問題の最適解ζを高速で求解する。

Furthermore, the trajectory generation unit 222 performs the recursive calculation shown in the following equation (41) on the above equation (40), thereby finding the optimal solution ζ of the LQ optimization problem shown in the above equation (33) at high speed.
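The two recursive passes can be illustrated with the textbook Riccati recursion for a scalar unconstrained LQ problem. This is a sketch under stated assumptions and not equations (39) and (41), which are not reproduced in this text: it omits the linear and cross terms that the transformed problem of equation (33) may contain, and the function names and parameter values are hypothetical.

```python
def riccati_lq(a, b, q, r, q_final, n_steps):
    """Backward sweep for x[k+1] = a x[k] + b u[k] with stage cost q x^2 + r u^2."""
    P = [0.0] * (n_steps + 1)  # cost-to-go coefficients
    K = [0.0] * n_steps        # feedback gains
    P[n_steps] = q_final
    for k in range(n_steps - 1, -1, -1):
        denom = r + b * P[k + 1] * b
        K[k] = b * P[k + 1] * a / denom
        P[k] = q + a * P[k + 1] * a - a * P[k + 1] * b * K[k]
    return P, K


def simulate(a, b, K, x0):
    """Forward sweep: apply u[k] = -K[k] x[k] and return the states and inputs."""
    xs, us = [x0], []
    for gain in K:
        us.append(-gain * xs[-1])
        xs.append(a * xs[-1] + b * us[-1])
    return xs, us


def total_cost(xs, us, q, r, q_final):
    """Stage costs over the horizon plus the terminal cost."""
    return sum(q * x * x + r * u * u for x, u in zip(xs, us)) + q_final * xs[-1] ** 2


P, K = riccati_lq(a=1.0, b=1.0, q=1.0, r=1.0, q_final=1.0, n_steps=1)
xs, us = simulate(1.0, 1.0, K, x0=1.0)
```

The backward sweep costs one small update per sampling point, so the work grows linearly with the number of sampling points N rather than with the cube of the stacked problem size, which is the source of the speed advantage of a Riccati-type recursion.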

最後に、軌道生成部２２２は、上記求解した最適解ζと、上記（１６）式及び（２０）式（下記２式）と、を用いて、上記（１４）式に示す等式制約条件付きＬＱ最適化問題のパラメータを復元し、ｘ［ｋ］及びｕ［ｋ］を算出する。
ｘ［ｋ］＝Ｄ_ｋζ［ｋ］＋ｅ_ｋ
ｕ［ｋ］＝Ｎ_ｋζ［ｋ］＋Ｍ_ｋｖ［ｋ］＋ｌ_ｋ Finally, the trajectory generation unit 222 uses the calculated optimal solution ζ and the above-described equations (16) and (20) (the following two equations) with the equation constraint condition shown in the above-mentioned equation (14). The parameters of the LQ optimization problem are restored, and x [k] and u [k] are calculated.
x[k] = D_k ζ[k] + e_k
u[k] = N_k ζ[k] + M_k v[k] + l_k

軌道生成部２２２は、算出したｘ［ｋ］（重心Ｇのｘ座標、重心Ｇのｘ軸方向速度、重心Ｇのｙ座標、重心Ｇのｙ軸方向速度、および、各接触点における接触力（６軸力））の時系列データに基づいて、重心軌道を生成する。このようにして、予測区間内において、等式制約条件を満たし、かつ評価関数Ｊを最小化する重心軌道が高速に生成される。すなわち、予測区間内において移動ロボットの安定な移動を実現する重心軌道を高速に生成することができる。 The trajectory generation unit 222 generates a center-of-gravity trajectory based on the time-series data of the calculated x[k] (the x coordinate of the center of gravity G, the x-axis velocity of G, the y coordinate of G, the y-axis velocity of G, and the contact force (six-axis force) at each contact point). In this way, a center-of-gravity trajectory that satisfies the equality constraints and minimizes the evaluation function J within the prediction interval is generated at high speed. That is, a center-of-gravity trajectory that realizes stable movement of the mobile robot within the prediction interval can be generated at high speed.

図１４は、上述した直交補空間変換を行った上記（３３）式に示す無制約条件のＬＱ最適化問題に変換する際のフローを示す図である。
等式制約条件の上記（１３）式に対してＱＲ分解を行って、状態変数の変換式である上記（１６）式が導出される（ステップＳ１０１）。 FIG. 14 is a diagram showing a flow when converting to the unconstrained LQ optimization problem shown in the above equation (33), which is obtained by performing the above-described orthogonal complementary space conversion.
QR decomposition is performed on the above equation (13) of the equation constraint, and the above equation (16), which is a state variable conversion equation, is derived (step S101).

上記入力（接触力（６軸力）の微分値）ｕと重心の状態変数ｘとの関係を示す状態方程式（８）式から導出した（１８）式に対してＱＲ分解を行って、入力ｕの変換式である上記（２０）式が導出される（ステップＳ１０２）。 QR decomposition is performed on the equation (18) derived from the equation of state (8) indicating the relationship between the input (differential value of the contact force (6-axis force)) u and the state variable x of the center of gravity, and the input u The above equation (20) which is a conversion equation is derived (step S102).

状態方程式（８）式を、導出された状態変数ｘの変換式（１６）式、入力ｕの変換式（２０）式、及び、状態変数ｘの変換式（１６）式から導出した（２３）式を用いて変形し、状態方程式の変換式である（２５）式が導出される（ステップＳ１０３）。 The state equation (8) is transformed using the derived conversion equation (16) for the state variable x, the conversion equation (20) for the input u, and equation (23) derived from equation (16), which yields equation (25), the converted state equation (step S103).

上記導出した状態変数ｘの変換式（１６）式と、入力ｕの変換式（２０）式に基づいて、上記（１２）式に示す評価関数を変形し、評価関数の変換式である上記（３１）式が導出される（ステップＳ１０４）。変換後の無制約条件のＬＱ最適化問題は、上述の如く、上記導出された評価関数の変換式（３１）式と、状態方程式の変換式（２５）式と、を含むこととなる。 Based on the derived conversion equation (16) for the state variable x and the conversion equation (20) for the input u, the evaluation function shown in equation (12) is transformed, which yields equation (31), the converted evaluation function (step S104). As described above, the unconstrained LQ optimization problem after conversion consists of the converted evaluation function (31) and the converted state equation (25).

図１５は、本実施形態に係る最適制御装置による最適制御方法を示すフローチャートである。
接触点計画設定部２２１は接触点計画（等式制約条件のＣ_ｋ及びｄ_ｋ）を設定する（ステップＳ２０１）。 FIG. 15 is a flowchart showing an optimal control method by the optimal control apparatus according to the present embodiment.
The contact point plan setting unit 221 sets a contact point plan (C_k and d_k of the equality constraints) (step S201).

軌道生成部２２２は、接触点計画設定部２２１により設定された接触点計画に基づいて、上記（３３）式のＬＱ最適化問題を行列表現し、その最適解条件に対して再帰的計算を行うことで、ＬＱ最適化問題の最適解ζを求解する（ステップＳ２０２）。 Based on the contact point plan set by the contact point plan setting unit 221, the trajectory generation unit 222 represents the LQ optimization problem of the above equation (33) as a matrix and performs recursive calculation on the optimal solution condition. Thus, the optimum solution ζ of the LQ optimization problem is obtained (step S202).

軌道生成部２２２は、求解した最適解ζと、上記（１６）式及び（２０）式と、を用いて、上記（１４）式に示す等式制約条件付きＬＱ最適化問題のパラメータを復元し、重心の状態変数ｘ［ｋ］及び入力ｕ［ｋ］を算出する（ステップＳ２０３）。
軌道生成部２２２は、算出したｘ［ｋ］の時系列データに基づいて、重心軌道を生成する（ステップＳ２０４）。 The trajectory generation unit 222 restores the parameters of the LQ optimization problem with the equation constraint shown in the above equation (14) using the obtained optimal solution ζ and the above equations (16) and (20). The state variable x [k] and the input u [k] of the center of gravity are calculated (step S203).
The trajectory generation unit 222 generates a gravity center trajectory based on the calculated time series data of x [k] (step S204).

動作制御部２２３は、軌道生成部２２２により生成された重心軌道に従って移動ロボット１００の全身動作させるように、各関節のモータ１ａ〜２４ａを制御する（ステップＳ２０５）。 The motion control unit 223 controls the motors 1a to 24a of the respective joints so that the mobile robot 100 performs whole-body motion along the center-of-gravity trajectory generated by the trajectory generation unit 222 (step S205).

以上、本実施形態において、軌道生成部２２２は、等式制約条件付き最適化問題を直交補空間を用いて変換した無制約条件の最適化問題を、再帰的計算法を用いて最適解を求解し、該求解した最適解に基づいて重心の状態変数を算出し、算出した重心の状態変数に基づいて重心軌道を生成する。これにより、モデル予測制御において最適化問題の最適解を高速に求解し重心軌道を生成できる。 As described above, in the present embodiment, the trajectory generation unit 222 solves the unconstrained optimization problem obtained by converting the optimization problem with equality constraints using the orthogonal complement space, and finds the optimal solution using the recursive calculation method. Then, a state variable of the center of gravity is calculated based on the obtained optimum solution, and a center of gravity trajectory is generated based on the calculated state variable of the center of gravity. As a result, the optimal solution of the optimization problem can be obtained at high speed in model predictive control, and the center of gravity trajectory can be generated.

（第２実施形態）
本実施形態において、軌道生成部２２２は、上記等式制約条件に加えて不等式制約条件を加えたＬＱ最適化問題を求解する。ここで、接触点の安定性の拘束を示す不等式制約条件について説明する。
(Second Embodiment)
In this embodiment, the trajectory generation unit 222 solves an LQ optimization problem in which an inequality constraint is added to the above equality constraint. The inequality constraint that expresses the stability of the contact points is described below.

移動ロボットの接触点が安定して接触を保つ為の不等式制約条件を導入する。
図１６に接触点の座標系（上添え字ｌ（エル）がついている）と、接触多角形（接触点の支持多角形）と、を示した。
An inequality constraint is introduced so that the contact points of the mobile robot maintain stable contact.
FIG. 16 shows the coordinate system of a contact point (marked with the superscript l (el)) and the contact polygon (the support polygon of the contact point).

接触点の座標系は、接触点を原点とし、かつ、接触面の姿勢ｒ_ｉに合わせて定義されているとする。
ここで、接触点の座標系で定義される接触力（６軸力）θ_ｉ ^ｌを次のように表わす。
θ_ｉ ^ｌ＝［ｆ_ｉｘ ^ｌ、ｆ_ｉｙ ^ｌ、ｆ_ｉｚ ^ｌ、τ_ｉｘ ^ｌ、τ_ｉｙ ^ｌ、τ_ｉｚ ^ｌ］^Ｔ
The coordinate system of a contact point is assumed to have the contact point as its origin and to be aligned with the orientation r_i of the contact surface.
Here, the contact force (six-axis force) θ_i^l defined in the coordinate system of the contact point is expressed as follows.
θ_i^l = [f_ix^l, f_iy^l, f_iz^l, τ_ix^l, τ_iy^l, τ_iz^l]^T

すると、接触力（６軸力）θ_ｉ ^ｌは、接触面の姿勢行列Φ_ｉ＝ｒｏｔ（ｒ_ｉ）を用いて下記（４２）式のように表現できる。
なお、ｒｏｔは、オイラー角を姿勢行列に変換する関数である。

Then, the contact force (six-axis force) θ _i ^l can be expressed by the following equation (42) using the contact surface posture matrix Φ _i = rot (r _i ).
Note that rot is a function that converts Euler angles into a posture matrix.
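As an illustration of the role of rot and the orientation matrix Φ_i, the sketch below converts Euler angles into a rotation matrix and re-expresses a six-axis force in the contact frame. Equation (42) itself is not reproduced in this text, so everything here is an assumption: the Z-Y-X (yaw-pitch-roll) Euler convention, the helper name `to_contact_frame`, and the premise that both frames share the contact point as origin, so that the force and the moment are each simply rotated by Φ_i^T.

```python
import numpy as np

def rot(r):
    """Euler angles r = (roll, pitch, yaw) -> rotation matrix Rz(yaw) Ry(pitch) Rx(roll).

    Assumption: Z-Y-X convention; the patent does not state which one it uses.
    """
    roll, pitch, yaw = r
    cr, sr = np.cos(roll), np.sin(roll)
    cp, sp = np.cos(pitch), np.sin(pitch)
    cy, sy = np.cos(yaw), np.sin(yaw)
    Rx = np.array([[1.0, 0.0, 0.0], [0.0, cr, -sr], [0.0, sr, cr]])
    Ry = np.array([[cp, 0.0, sp], [0.0, 1.0, 0.0], [-sp, 0.0, cp]])
    Rz = np.array([[cy, -sy, 0.0], [sy, cy, 0.0], [0.0, 0.0, 1.0]])
    return Rz @ Ry @ Rx

def to_contact_frame(theta_world, r_i):
    """Rotate a six-axis force [f; tau] from the world frame into the contact frame."""
    Phi = rot(r_i)
    f, tau = theta_world[:3], theta_world[3:]
    return np.concatenate([Phi.T @ f, Phi.T @ tau])

# A unit force along world x, seen from a contact surface yawed by 90 degrees.
theta_l = to_contact_frame(np.array([1.0, 0.0, 0.0, 0.0, 0.0, 0.0]),
                           (0.0, 0.0, np.pi / 2))
print(np.round(theta_l, 6))
```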

接触点が安定して接触を保つ為には、
（１）接触点が離れないこと、
（２）接触点が滑らないこと、
（３）接触点が剥がれないこと、
という３つの制約条件を満たす必要がある。
上記３つの制約条件が理解しやすいように、図１７に、接触点が不安定化する場合を例示した。
For a contact point to maintain stable contact, the following three constraints must be satisfied:
(1) the contact point does not separate,
(2) the contact point does not slip,
(3) the contact point does not peel off.
To make these three constraints easier to understand, FIG. 17 illustrates cases in which a contact point becomes unstable.

（１）接触点が離れない為には、接触面の鉛直力が正であれば良い。即ち、下記（４３）式を満たす必要がある。

(1) For the contact point not to separate, the vertical force on the contact surface need only be positive. That is, the following equation (43) must be satisfied.

（２）接触点が滑らない為には、接触面に平行な２軸力が摩擦力以下であれば良い。即ち下記（４４）式がその条件である。ただし接触面の摩擦係数をμ_ｉとする。

(2) For the contact point not to slip, the two-axis force parallel to the contact surface need only be no greater than the friction force. The condition is given by the following equation (44), where μ_i is the friction coefficient of the contact surface.

（３）接触点が剥がれない為の条件は、接触多角形のｈ個の頂点座標
（ｘ_ｉ１ ^ｌ，ｙ_ｉ１ ^ｌ），・・・・・（ｘ_ｉｈ ^ｌ，ｙ_ｉｈ ^ｌ）
を用いて下記（４５）式のように表される。
（ただし接触多角形の頂点は反時計回りに順に与えられているとする）。

(3) The condition for the contact point not to peel off is expressed by the following equation (45) using the h vertex coordinates of the contact polygon,
(x_i1^l, y_i1^l), ..., (x_ih^l, y_ih^l)
(where the vertices of the contact polygon are assumed to be given in counterclockwise order).

以上、（４３）、（４４）、（４５）式をまとめると次のようになる。

The expressions (43), (44), and (45) are summarized as follows.
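The three conditions can be stacked into a single linear inequality on the contact-frame force, which is what allows them to enter the LQ problem as a linear inequality constraint. The sketch below is one plausible construction and not the patent's equations (43) to (46), which are not reproduced in this text. It assumes a linearized friction pyramid (|f_x| ≤ μ f_z and |f_y| ≤ μ f_z) for the no-slip condition, and it expresses the no-peel condition by requiring the center of pressure (−τ_y/f_z, τ_x/f_z) to lie on the inner side of every counterclockwise polygon edge. The function name `contact_stability_matrix` is hypothetical.

```python
import numpy as np

def contact_stability_matrix(mu, vertices):
    """Build G such that G @ theta_l <= 0 encodes the three contact conditions.

    theta_l = [fx, fy, fz, tx, ty, tz] in the contact frame.
    vertices: sequence of h (x, y) contact-polygon vertices, counterclockwise.
    """
    rows = [[0.0, 0.0, -1.0, 0.0, 0.0, 0.0]]       # (1) no separation: fz >= 0
    rows += [[1.0, 0.0, -mu, 0.0, 0.0, 0.0],       # (2) no slip: |fx| <= mu*fz
             [-1.0, 0.0, -mu, 0.0, 0.0, 0.0],
             [0.0, 1.0, -mu, 0.0, 0.0, 0.0],       #              |fy| <= mu*fz
             [0.0, -1.0, -mu, 0.0, 0.0, 0.0]]
    h = len(vertices)
    for j in range(h):                             # (3) no peeling
        xj, yj = vertices[j]
        xn, yn = vertices[(j + 1) % h]
        ex, ey = xn - xj, yn - yj                  # edge vector v_j -> v_{j+1}
        # cross(edge, cop - v_j) >= 0, multiplied through by fz > 0:
        #   ex*tx + ey*ty + (ey*xj - ex*yj)*fz >= 0
        rows.append([0.0, 0.0, -(ey * xj - ex * yj), -ex, -ey, 0.0])
    return np.array(rows)

# Hypothetical 0.2 m square sole, friction coefficient 0.5.
square = [(0.1, 0.1), (-0.1, 0.1), (-0.1, -0.1), (0.1, -0.1)]
G = contact_stability_matrix(0.5, square)
standing = np.array([1.0, 0.0, 10.0, 0.0, 0.0, 0.0])
tipping = np.array([0.0, 0.0, 10.0, 2.0, 0.0, 0.0])   # center of pressure at y = 0.2
print(np.all(G @ standing <= 0), np.all(G @ tipping <= 0))
```

Each row is homogeneous in the force, so the right-hand side of the stacked constraint is zero, in line with the remark in the text that the derived constraint corresponds to q_k = O.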

（４６）式に（４２）式を代入し、ｋ番目のサンプリング点としてインデックスを付け加える。すなわち、下記（４７）式は、ｋ番目の接触点が安定な接触を維持するための条件式である。したがって、安定な多点接触動作を実現するためには、全サンプリング点の全接触点において下記（４７）式が成立する必要がある。

Substituting equation (42) into equation (46) and adding an index for the k-th sampling point gives the following equation (47). That is, equation (47) is the condition for the k-th contact point to maintain stable contact. Therefore, to realize stable multi-point contact motion, equation (47) must hold at every contact point at every sampling point.

ｋ番目のサンプリング点において全接触点が上記（４７）式を満足する為の条件は、下記（４８）式のように表現できる。

The condition for all the contact points to satisfy the equation (47) at the k-th sampling point can be expressed as the following equation (48).

（不等式制約条件）
（Ｐ_ｋｘ［ｋ］≦ｑ_ｋ）・・・（４９）
なお、上記不等式制約条件の一般式（４９）式の右辺ｑ_ｋをｑ_ｋ＝Ｏと置けば、上記導出した（４８）式と一致する。上記（１４）式に示す等式制約条件付きＬＱ最適化問題に上記（４８）を加えることで、下記（５０）式に示す等式制約条件及び不等式制約条件付きＬＱ最適化問題が導出される。

(Inequality constraint)
P_k x[k] ≦ q_k ... (49)
Setting the right-hand side q_k of the general inequality constraint (49) to q_k = O makes it coincide with the derived equation (48). Adding (48) to the equality-constrained LQ optimization problem of equation (14) yields the LQ optimization problem with equality and inequality constraints shown in the following equation (50).

本実施形態において、上記実施形態１で行った直交補空間変換に加えて、さらに、上記（４９）式に示す不等式制約条件の変換を行う。具体的には、上記（１６）式を上記（４９）式に代入することで、下記（５１）式を導出する。

In the present embodiment, in addition to the orthogonal complementary space transformation performed in the first embodiment, the inequality constraint condition shown in the above equation (49) is further transformed. Specifically, the following equation (51) is derived by substituting the above equation (16) into the above equation (49).

以上から、本実施形態において、直交補空間変換を行うことで、上記（５０）式に示す等式制約条件及び不等式制約条件付きＬＱ最適化問題を、下記（５２）式に示す無制約条件のＬＱ最適化問題に変換できる。本実施形態に係る軌道生成部２２２は、下記（５２）式に示す無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて最適解を求解する。

From the above, in the present embodiment, performing the orthogonal complement space transformation converts the LQ optimization problem with equality and inequality constraints shown in equation (50) into the unconstrained LQ optimization problem shown in the following equation (52). The trajectory generation unit 222 according to the present embodiment solves the unconstrained LQ optimization problem of equation (52) using the Riccati-type recursive calculation method.

次に、上記直交補空間変換により変換した無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて求解する方法を説明する。
まず、上記実施形態１と同様に、上記（５２）式を行列表現すると、下記（５３）式乃至（５５）式のように表現できる。

Next, a method for solving the unconstrained LQ optimization problem converted by the orthogonal complementary space transform using the Riccati-type recursive calculation method will be described.
First, as in the first embodiment, when the above equation (52) is expressed in a matrix, it can be expressed as the following equations (53) to (55).

下記（５６）式に示す、上記（５４）式及び（５５）式のラグランジュ乗数を導入する。

The Lagrange multipliers of the above equations (54) and (55) shown in the following equation (56) are introduced.

続いて、下記（５７）式に示すように、上記（５５）式にスラック変数を導入する。

Subsequently, as shown in the following formula (57), slack variables are introduced into the formula (55).

上記（５６）式及び（５７）式の導入により、上記（５３）式乃至（５５）式に示すＬＱ最適化問題の最適解条件（ＫＫＴ）は、下記（５８）式で表現できる。

By introducing the above formulas (56) and (57), the optimal solution condition (KKT) for the LQ optimization problem shown in the above formulas (53) to (55) can be expressed by the following formula (58).

上記（５３）式乃至（５５）式は、凸２次計画問題と称されるＬＱ最適化問題であり、内点法やアクティブセット法などの既知の求解法（収束演算）を用いて効率的に求解できる。これら求解法はニュートン法をベースにした求解法であり、ニュートン法の収束演算中でニュートンの方向計算を行い、リカッチ型再帰的計算法による連立一次方程式の求解を行うこととなる。また、凸２次計画問題の計算量の大部分は、この連立一次方程式の計算が占めているため、この計算の高速化が非常に有効となる。 Equations (53) to (55) form an LQ optimization problem known as a convex quadratic programming problem, which can be solved efficiently with known iterative solution methods such as the interior point method or the active set method. These methods are based on Newton's method: within the Newton iterations, the Newton direction is computed by solving simultaneous linear equations, here with the Riccati-type recursive calculation method. Because solving these simultaneous linear equations accounts for most of the computation in a convex quadratic programming problem, speeding up this step is very effective.

上述の如く、直交補空間変換を行うことで、上記（５０）式に示す等式制約条件及び不等式制約条件付きＬＱ最適化問題を、上記（５２）式に示す無制約条件のＬＱ最適化問題に変換する。これにより、このＬＱ最適化問題の連立一次方程式の求解に、安定かつ高速のリカッチ型再帰的計算法を用いることができる。したがって、ＬＱ最適化問題の最適解を高速に求解できる。 As described above, the orthogonal complement space transformation converts the LQ optimization problem with equality and inequality constraints shown in equation (50) into the unconstrained LQ optimization problem shown in equation (52). As a result, the stable and fast Riccati-type recursive calculation method can be used to solve the simultaneous linear equations of this LQ optimization problem. Therefore, the optimal solution of the LQ optimization problem can be obtained at high speed.

本実施形態において、軌道生成部２２２は、リカッチ型再帰的計算法による連立一次方程式の求解を、例えば、内点法やアクティブセット法などの収束演算の中で行う。以下、本実施形態において、内点法を用いた求解法を説明するがこれに限定されない。アクティブセット法を用いた求解法も、内点法と同様の手法で求解できる。 In the present embodiment, the trajectory generation unit 222 performs the solution of the simultaneous linear equations by the Riccati-type recursive calculation method in, for example, a convergence operation such as an interior point method or an active set method. Hereinafter, in the present embodiment, a solution method using the interior point method will be described, but the present invention is not limited to this. The solution method using the active set method can also be solved by the same method as the interior point method.

内点法は、上記（５８）式をニュートン法とラインサーチにより効率的に解くことにより、最適解を求解する手法である。なお、本実施形態においては、内点法の中で、最もスタンダードな主双対内点法を用いる場合について説明するが、これに限定されない。 The interior point method finds the optimal solution by efficiently solving equation (58) with Newton's method and a line search. The present embodiment describes the case of using the primal-dual interior point method, the most standard of the interior point methods, but the invention is not limited to this.

まず、上記（５８）式にニュートン法を適用すると、下記（５９）式及び（６０）式が導出される。但し、下記（６０）式において、（○の中に×）は要素同志の積を意味し、Λ≒ｄｉａｇ（λ）、Ｚ≒ｄｉａｇ（ｚ）とする。

First, applying Newton's method to equation (58) yields the following equations (59) and (60). In equation (60), the symbol ⊗ (a cross inside a circle) denotes the element-wise product, and Λ and Z are defined as Λ = diag(λ) and Z = diag(z).

次に、下記（６１）式及び（６２）式に示すようにcomplementary measure μとステップ幅α_ｐを導入する。

Next, the complementary measure μ and the step width α_p are introduced as shown in the following equations (61) and (62).

ここで、complementary measure μは収束演算の残差の総計、ステップ幅α_ｐはλ≧０、ｚ≧０を満足する範囲でニュートン方向への最大のステップ幅を求めていると理解すると分かり易い。 Here, it is easiest to think of the complementary measure μ as the total residual of the iterative calculation, and of the step width α_p as the largest step along the Newton direction that keeps λ ≧ 0 and z ≧ 0.
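A small sketch of these two quantities follows. Equations (61) and (62) are not reproduced in this text, so the definitions below are the ones commonly used in primal-dual interior point methods and are an assumption: μ is taken as the average of the products λ_i z_i, and α_p as the largest value in (0, 1] for which λ + αΔλ and z + αΔz remain nonnegative. The function names are hypothetical.

```python
import numpy as np

def complementary_measure(lam, z):
    """Average complementarity product; it reaches zero at an optimal solution."""
    return float(lam @ z) / len(lam)

def max_step(lam, z, d_lam, d_z):
    """Largest alpha in (0, 1] with lam + alpha*d_lam >= 0 and z + alpha*d_z >= 0."""
    v = np.concatenate([lam, z])
    d = np.concatenate([d_lam, d_z])
    shrinking = d < 0                 # only decreasing components limit the step
    if not np.any(shrinking):
        return 1.0
    return float(min(1.0, np.min(-v[shrinking] / d[shrinking])))

lam = np.array([1.0, 2.0])
z = np.array([1.0, 1.0])
d_lam = np.array([-2.0, 1.0])         # lam[0] would reach zero at alpha = 0.5
d_z = np.array([0.5, -4.0])           # z[1] would reach zero at alpha = 0.25
print(complementary_measure(lam, z), max_step(lam, z, d_lam, d_z))
```

In the update of step S306 the step is additionally scaled by a factor β, which keeps the iterate strictly inside the region λ > 0, z > 0 rather than on its boundary.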

なお、主双対内点法のアルゴリズムを簡略して記載すると以下のようになる。

The algorithm of the primal-dual interior point method can be summarized as follows.

軌道生成部２２２は、上記主双対内点法を用いて収束演算を行い、最適解ζを求解する。軌道生成部２２２は、上記実施形態１と同様に、上記求解した最適解ζと、上記（１６）式及び（２０）式と、を用いて、上記（５０）式に示す等式制約条件及び不等式条件付きＬＱ最適化問題のパラメータを復元し、その最適解であるｘ［ｋ］及びｕ［ｋ］を得る。 The trajectory generation unit 222 performs the iterative calculation using the primal-dual interior point method and obtains the optimal solution ζ. As in the first embodiment, the trajectory generation unit 222 then uses the obtained optimal solution ζ together with equations (16) and (20) to restore the variables of the LQ optimization problem with equality and inequality constraints shown in equation (50), and obtains its optimal solutions x[k] and u[k].

ここで、主双対内点法のアルゴリズム内に示した上記（５９）式の求解について詳細に説明する。まず、下記（６３）式が成立する。

Here, the solution of equation (59), which appears in the primal-dual interior point algorithm, is described in detail. First, the following equation (63) holds.

上記（６３）式を用いて、上記（５９）式は、下記（６４）式のように表現できる。

Using the above equation (63), the above equation (59) can be expressed as the following equation (64).

ここで、Θ≒Ｚ^−１Λ及びｒ_ｚ′≒ｒ_ｚ−Ｚ^−１ｒ_λと置くと、上記（６４）式は、下記（６５）式のように表現できる。

さらに、上記行列（６５）式の各係数を並べ替えると、下記（６６）式のように表現できる。

Here, defining Θ = Z^−1 Λ and r_z′ = r_z − Z^−1 r_λ, equation (64) can be expressed as the following equation (65).

Further, by rearranging the coefficients of the matrix (65), the coefficients can be expressed as the following (66).

ここで、下記（６７）式が成立する。

Here, the following equation (67) is established.

上記（６７）式を用いて、上記（６６）式は、下記（６８）式のように表現できる。

但し、上記（６８）式において、パラメータを下記（６９）式のように設定している。

Using the above expression (67), the above expression (66) can be expressed as the following expression (68).

However, in the above equation (68), the parameters are set as in the following equation (69).

上記（６８）式は、上述した実施形態１のリカッチ型再帰的計算法で示した（３８）式と同様の形となっている。したがって、軌道生成部２２２は、上記（６８）式に示す連立１次方程式についても、上記実施形態１と同様に、リカッチ型再帰的計算法を用いて高速かつ安定的に求解できる。 The above equation (68) has the same form as the equation (38) shown in the Riccati-type recursive calculation method of the first embodiment described above. Therefore, the trajectory generating unit 222 can solve the simultaneous linear equations shown in the above equation (68) at high speed and stably using the Riccati-type recursive calculation method as in the first embodiment.

すなわち、軌道生成部２２２は、上記（６８）式に対して、下記（７０）式に示す再帰計算を繰り返す。

That is, the trajectory generation unit 222 repeats the recursive calculation shown in the following equation (70) with respect to the above equation (68).

さらに、軌道生成部２２２は、下記（７１）式に示す再帰計算を行うことで、（Δｖ_ｋ、Δζ_ｋ、Δｙ_ｋ）を算出する。

軌道生成部２２２は、算出したΔζ_ｋを上記（６７）式に代入することで、Δｚ_ｋを算出する。軌道生成部２２２は、算出したΔｚ_ｋ＝[Δｚ_１ ^Ｔ、Δｚ_２ ^Ｔ、・・・Δｚ_Ｎ ^Ｔ]^Ｔを上記（６３）式に代入することで、Δλを算出する。以上により、上記（５９）式の求解が完了する。 Further, the trajectory generation unit 222 calculates (Δv _k , Δζ _k , Δy _k ) by performing a recursive calculation shown in the following equation (71).

The trajectory generation unit 222 calculates Δz _k by substituting the calculated Δζ _k into the above equation (67). The trajectory generation unit 222 calculates Δλ by substituting the calculated Δz _k = [Δz ₁ ^T , Δz ₂ ^T ,... Δz _N ^T ] ^T into the equation (63). Thus, the solution of the above equation (59) is completed.
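To show why a Riccati-type recursion is attractive for stage-structured linear systems of this kind, the sketch below gives the textbook backward Riccati recursion for a finite-horizon discrete-time LQ problem. It is not the patent's recursions (70) and (71), which are not reproduced in this text and which operate on the transformed quantities of equation (68). The function names and the scalar example system are hypothetical. The point of the structure is that the work grows linearly with the horizon length N, because each stage only requires solving a system of the size of the input.

```python
import numpy as np

def riccati_lqr(A, B, Q, R, Qf, N):
    """Backward Riccati recursion for
    min sum_k (x_k' Q x_k + u_k' R u_k) + x_N' Qf x_N,  x_{k+1} = A x_k + B u_k.
    Returns the cost-to-go matrix at k = 0 and the feedback gains K_0..K_{N-1}.
    """
    P = Qf
    gains = []
    for _ in range(N):                          # runs from stage N-1 down to 0
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    gains.reverse()                             # gains[k] belongs to stage k
    return P, gains

def rollout(A, B, gains, x0):
    """Forward pass: apply u_k = -K_k x_k and return the state trajectory."""
    xs = [np.asarray(x0, dtype=float)]
    for K in gains:
        xs.append(A @ xs[-1] - B @ (K @ xs[-1]))
    return np.array(xs)

# Hypothetical scalar system, used only to exercise the recursion.
A = B = Q = R = Qf = np.array([[1.0]])
P0, gains = riccati_lqr(A, B, Q, R, Qf, N=50)
xs = rollout(A, B, gains, [1.0])
print(P0[0, 0], xs[-1])
```

For this scalar example the cost-to-go matrix approaches the positive root of P^2 − P − 1 = 0 as the horizon grows, which gives a simple check of the recursion.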

図１８は、上述した等式制約条件及び不等式制約条件付きＬＱ最適化問題の最適解の求解フローを示すフローチャートである。
まず、軌道生成部２２２は、解ベクトルの初期解（η＝η_０、ｙ＝ｙ_０、ｚ＝ｚ_０、λ＝λ_０）を行う（ステップＳ３０１）。
FIG. 18 is a flowchart showing the procedure for obtaining the optimal solution of the LQ optimization problem with equality and inequality constraints described above.
First, the trajectory generation unit 222 sets an initial solution (η = η_0, y = y_0, z = z_0, λ = λ_0) for the solution vector (step S301).

軌道生成部２２２は、繰返パラメータｎ＝０を設定する（ステップＳ３０２）。
軌道生成部２２２は、上記（６０）式を用いて、残差［ｒ_η、ｒ_ｙ、ｒ_ｚ、ｒ_λ］を算出する（ステップＳ３０３）。 The trajectory generation unit 222 sets the repetition parameter n = 0 (step S302).
The trajectory generation unit 222 calculates residuals [r _η , r _y , r _z , r _λ ] using the above equation (60) (step S303).

軌道生成部２２２は、リカッチ型再帰的計算法によるニュートン方向［Δη、Δｙ、Δｚ、Δλ］の計算を行う（ステップＳ３０４）。
軌道生成部２２２は、上記（６２）式を用いて、ステップ幅α_ｐを算出する（ステップＳ３０５）。 The trajectory generation unit 222 calculates the Newton directions [Δη, Δy, Δz, Δλ] by the Riccati-type recursive calculation method (step S304).
The trajectory generation unit 222 calculates the step width α _p using the above equation (62) (step S305).

軌道生成部２２２は、上記算出したニュートン方向［Δη、Δｙ、Δｚ、Δλ］とステップ幅α_ｐとに基づいて、下記式を用いて解ベクトルの更新を行う（ステップＳ３０６）。
［η、ｙ、ｚ、λ］＝［η、ｙ、ｚ、λ］＋βα_ｐ［Δη、Δｙ、Δｚ、Δλ］
Based on the calculated Newton direction [Δη, Δy, Δz, Δλ] and the step width α_p, the trajectory generation unit 222 updates the solution vector using the following equation (step S306).
[η, y, z, λ] = [η, y, z, λ] + β α_p [Δη, Δy, Δz, Δλ]

軌道生成部２２２は、上記（６１）式を用いて、complementary measure μを算出する（ステップＳ３０７）。
軌道生成部２２２は、条件（μ＞μ_min and ｎ＜ｎ_max）を満足するか否かを判定する（ステップＳ３０８）。 The trajectory generation unit 222 calculates complementary measure μ using the above equation (61) (step S307).
The trajectory generation unit 222 determines whether or not the condition (μ> μ _min and n <n _max ) is satisfied (step S308).

軌道生成部２２２は、条件（μ＞μ_min and ｎ＜ｎ_max）を満足すると判定したとき（ステップＳ３０８のＹＥＳ）、ｎ＝ｎ＋１を設定し、上記（ステップＳ３０３）の処理に戻る。 When the trajectory generation unit 222 determines that the conditions (μ> μ _min and n <n _max ) are satisfied (YES in step S308), n = n + 1 is set, and the process returns to the above (step S303).

軌道生成部２２２は、条件（μ＞μ_min and ｎ＜ｎ_max）を満足しないと判定したとき（ステップＳ３０８のＮＯ）、上記収束したときのηに基づいて、上記（５３）乃至（５５）式からの最適解ζを算出する。そして、軌道生成部２２２は、この最適解ζと、上記（１６）式及び（２０）式と、を用いて、上記（５２）式に示す等式制約条件及び不等式制約条件付きＬＱ最適化問題のパラメータを復元し、ｘ［ｋ］及びｕ［ｋ］を算出する（ステップＳ３０９）。 When the trajectory generation unit 222 determines that the conditions (μ> μ _min and n <n _max ) are not satisfied (NO in step S308), the trajectory generation unit 222 performs the above (53) to (55) based on η when converged The optimal solution ζ is calculated from the equation. Then, the trajectory generation unit 222 uses this optimal solution ζ and the above equations (16) and (20) to solve the LQ optimization problem with the equality and inequality constraints shown in the above equation (52). Are restored, and x [k] and u [k] are calculated (step S309).
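The loop of steps S301 to S309 can be sketched for a small inequality-constrained quadratic program as follows. This is a simplified illustration under stated assumptions, not the patent's implementation: the problem has only inequality constraints (no multiplier y), the Newton direction is computed with a dense solve in place of the Riccati-type recursion, the residuals use a centering parameter `sigma`, and the name `solve_qp_ipm`, the default values, and the toy problem are all hypothetical.

```python
import numpy as np

def solve_qp_ipm(H, g, P, q, beta=0.95, sigma=0.1, mu_min=1e-9, n_max=50):
    """Primal-dual interior point sketch for
    min 0.5*eta'H eta + g'eta  subject to  P eta <= q  (H positive definite)."""
    m = P.shape[0]
    eta = np.zeros(H.shape[0])                  # S301: initial solution
    lam, z = np.ones(m), np.ones(m)             # multipliers and slack variables
    n = 0                                       # S302: iteration counter
    mu = lam @ z / m
    while mu > mu_min and n < n_max:            # S308: continue while not converged
        # S303: residuals of the (centered) optimality conditions
        r_eta = H @ eta + g + P.T @ lam
        r_z = P @ eta + z - q
        r_lam = lam * z - sigma * mu
        # S304: Newton direction; a dense solve stands in for the Riccati recursion
        theta = lam / z                         # diagonal of Z^-1 * Lambda
        lhs = H + P.T @ (theta[:, None] * P)
        rhs = -r_eta - P.T @ (theta * r_z - r_lam / z)
        d_eta = np.linalg.solve(lhs, rhs)
        d_z = -r_z - P @ d_eta
        d_lam = -(r_lam + lam * d_z) / z
        # S305: largest step that keeps lam >= 0 and z >= 0
        v = np.concatenate([lam, z])
        d = np.concatenate([d_lam, d_z])
        neg = d < 0
        alpha = min(1.0, np.min(-v[neg] / d[neg])) if np.any(neg) else 1.0
        # S306: damped update of the solution vector
        eta = eta + beta * alpha * d_eta
        z = z + beta * alpha * d_z
        lam = lam + beta * alpha * d_lam
        mu = lam @ z / m                        # S307: complementary measure
        n += 1
    return eta, lam, n

# Toy problem: minimize 0.5*(eta - 2)^2 subject to eta <= 1.
H = np.array([[1.0]])
g = np.array([-2.0])
P = np.array([[1.0]])
q = np.array([1.0])
eta, lam, n = solve_qp_ipm(H, g, P, q)
print(eta, lam, n)
```

In the toy problem the unconstrained minimizer violates the constraint, so the solution sits on the constraint boundary with a positive multiplier. In the patent's setting, the dense solve in the S304 step is where the Riccati-type recursion would be used, since that linear solve dominates the cost of each iteration.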

（第３実施形態）
上記実施形態１に係る軌道生成部２２２は、線形不変な等式制約条件付き最適化問題を求解しているが、本実施形態３に係る軌道生成部２２２は、線形時変な等式制約条件付き最適化問題を求解する。
(Third Embodiment)
The trajectory generation unit 222 according to the first embodiment solves a linear time-invariant optimization problem with equality constraints, whereas the trajectory generation unit 222 according to the third embodiment solves a linear time-varying optimization problem with equality constraints.

例えば、サンプリング間隔が変化するような場合を考えると、Δｔは固定ではなく、Δｔ_ｋのようにサンプリング点毎に変化することとなる。この場合、上記（８）式は、下記（７２）式のように線形時変の制御システムとして表現できる。

For example, if the sampling interval varies, Δt is no longer fixed but changes at each sampling point, as Δt_k. In this case, equation (8) can be expressed as a linear time-varying control system, as in the following equation (72).

ここで、線形時変の最適化問題の状態方程式は、例えば、下記（７３）式に示す関係が成立する。すなわち、（７３）式に示す状態方程式は、線形時変の制御パラメータＡ_ｋ、Ｂ_ｋを含むこととなる。

Here, the state equation of the linear time-varying optimization problem satisfies, for example, the relationship shown in the following equation (73). That is, the state equation (73) contains the linear time-varying control parameters A_k and B_k.
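The effect of a varying sampling interval can be illustrated with a deliberately simple stand-in model. The sketch below uses a one-dimensional double integrator, not the patent's state equation (8) or (73), whose state also contains the contact forces; the function names and numerical values are hypothetical. It shows only that the discretized matrices A_k and B_k must be rebuilt for each Δt_k.

```python
import numpy as np

def double_integrator(dt):
    """Exact discretization of position/velocity dynamics with piecewise-constant
    acceleration over one interval of length dt (stand-in model)."""
    A_k = np.array([[1.0, dt],
                    [0.0, 1.0]])
    B_k = np.array([[0.5 * dt ** 2],
                    [dt]])
    return A_k, B_k

def simulate(x0, inputs, dts):
    """Roll out x[k+1] = A_k x[k] + B_k u[k] with a different dt at every step."""
    x = np.asarray(x0, dtype=float)
    traj = [x]
    for u, dt in zip(inputs, dts):
        A_k, B_k = double_integrator(dt)
        x = A_k @ x + B_k @ np.atleast_1d(u)
        traj.append(x)
    return np.array(traj)

# Unit acceleration over three intervals of increasing length (0.6 s in total).
traj = simulate([0.0, 0.0], inputs=[1.0, 1.0, 1.0], dts=[0.1, 0.2, 0.3])
print(traj[-1])
```

With unit acceleration from rest, the final state agrees with the closed-form values 0.5 t^2 and t at t = 0.6, regardless of how the total time is split into intervals.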

上記（７３）式、（１２）式、及び（１３）式より、軌道生成部２２２は、下記（７４）式に示す等式制約条件付きＬＱ最適化問題を求解し、重心軌道を生成することとなる。

但し、上記（７４）式において、下記（７５）式が成立するものとする。

From equations (73), (12), and (13), the trajectory generation unit 222 solves the LQ optimization problem with equality constraints shown in the following equation (74) and generates the center-of-gravity trajectory.

However, in the above equation (74), the following equation (75) is assumed to hold.

以上から、実施形態１と同様に、等式制約条件付きＬＱ最適化問題を直交補空間に投影することで、直交補空間変換を行い無制約条件のＬＱ最適化問題を導出する。
まず、上記（１３）式（等式制約条件：Ｃ_ｋｘ［ｋ］＝ｄ_ｋ）より、状態変数ｘの変換式である下記（７６）式が上記実施形態１と同様に導出される。

From the above, as in the first embodiment, by projecting the LQ optimization problem with equality constraints onto the orthogonal complement space, the orthogonal complement space transformation is performed to derive the unconstrained LQ optimization problem.
First, from equation (13) (the equality constraint C_k x[k] = d_k), the following equation (76), which is the conversion equation for the state variable x, is derived in the same manner as in the first embodiment.

Ｃ_ｋ＋１Ｂの直交補空間を用いて変数変換を行う。Ｃ_ｋ＋１Ｂを下記（７７）式に示すようにＱＲ分解する。

Variable transformation is performed using the orthogonal complement space of C _{k + 1} B. QR decomposition is performed on C _{k + 1} B as shown in the following equation (77).

上記（７７）式を用いて上記（２０）式と同様に、入力ｕの変換式である下記（７８）式を導出する。

但し、上記（７８）式における各パラメータを下記（７９）式に示すように設定する。

ｋ＝０のときは、上記（７８）式における各パラメータを下記（８０）式に示すように設定する。

The following equation (78), which is a conversion equation for the input u, is derived using the above equation (77) in the same manner as the above equation (20).

However, each parameter in the above equation (78) is set as shown in the following equation (79).

When k = 0, each parameter in the above equation (78) is set as shown in the following equation (80).

以上より、上記（８）式を上記（７３）式、（７６）式、及び（７８）式を用いて変形し、状態方程式の変換式である下記（８１）式を導出する。

但し、上記（８１）式における各パラメータを下記（８２）式に示すように設定する。

ｋ＝０のときは、上記（８１）式における各パラメータを下記（８３）式に示すように設定する。

From the above, the above equation (8) is transformed using the above equations (73), (76), and (78), and the following equation (81), which is a conversion equation of the state equation, is derived.

However, each parameter in the above equation (81) is set as shown in the following equation (82).

When k = 0, each parameter in the above equation (81) is set as shown in the following equation (83).

上記実施形態１と同様に、上記（１２）式に示す評価関数Ｊを変形し、評価関数の変換式である下記（８４）式を導出する。

但し、上記（８４）式における各パラメータを下記（８５）式に示すように設定する。

As in the first embodiment, the evaluation function J shown in the above equation (12) is modified to derive the following equation (84) that is a conversion equation of the evaluation function.

However, each parameter in the above equation (84) is set as shown in the following equation (85).

以上から、本実施形態に係る軌道生成部２２２は、実施形態１と同様に、上記直交補空間を用いて変換した下記（８６）式に示す無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて最適解ζを高速に求解できる。

From the above, as in the first embodiment, the trajectory generation unit 222 according to the present embodiment can obtain the optimal solution ζ at high speed by applying the Riccati-type recursive calculation method to the unconstrained LQ optimization problem shown in the following equation (86), which is obtained by the transformation using the orthogonal complement space.

最後に、軌道生成部２２２は、上記求解した最適解ζと、上記（１６）式及び（２０）式と、を用いて、上記（７４）式に示す等式制約条件付きＬＱ最適化問題のパラメータを復元し、ｘ［ｋ］及びｕ［ｋ］を算出する。 Finally, the trajectory generation unit 222 uses the obtained optimal solution ζ together with equations (16) and (20) to restore the variables of the LQ optimization problem with equality constraints shown in equation (74), and calculates x[k] and u[k].

（第４実施形態）
本実施形態４に係る軌道生成部２２２は、線形時変な、所定の区間内だけ接触力が変化しないように設定した、入力を含む等式制約条件付き最適化問題を求解する。
(Fourth Embodiment)
The trajectory generation unit 222 according to the fourth embodiment solves a linear time-varying optimization problem with an equality constraint that includes the input and is set so that the contact force does not change within a predetermined section.

例えば、未来のサンプリング区間において、移動ロボットが一定の力で物体を押して動かす等の、接触力を変動させたくない区間が存在する場合を想定する。より具体的には、２番目の接触点と最後から２番目の接触点をある区間内だけ接触力を変化しないようにした場合、当該区間における等式制約条件を下記（８７）式に示すように入力Ｅ_ｋ（ｕ）を含むこととなる。

For example, suppose that the future sampling sections include a section in which the contact force should not vary, such as when the mobile robot pushes and moves an object with a constant force. More specifically, when the contact forces at the second contact point and at the second-to-last contact point are to be kept unchanged only within a certain section, the equality constraint in that section includes the input term E_k(u), as shown in the following equation (87).

なお、上記接触力を変動させたくない区間以外の区間においては、Ｃ_ｋは下記（８８）式のように設定できる。

Note that C _k can be set as in the following equation (88) in a section other than the section where the contact force is not desired to be varied.

上記式より、軌道生成部２２２は、下記（８９）式に示す入力を含む等式制約条件付きＬＱ最適化問題を求解し、重心軌道を生成することとなる。

但し、下記（９０）式が成立する。

From the above equations, the trajectory generation unit 222 solves the LQ optimization problem with an equality constraint that includes the input, shown in the following equation (89), and generates the center-of-gravity trajectory.

However, the following equation (90) is established.

まず、等式制約条件内の入力ｕに対する係数Ｅ_ｋの転置行列をＱＲ分解すると下記（９１）式が導出される。

First, when the transposed matrix of the coefficient E _k with respect to the input u within the equality constraint condition is subjected to QR decomposition, the following equation (91) is derived.

上記（９１）式を用いて、上記入力を含む等式制約条件の（８７）式を、下記（９２）式に示すように変換できる。

Using the above equation (91), the equation (87) of the equation constraint including the above input can be converted as shown in the following equation (92).

上記（９２）式を上記状態方程式（８）式に代入することで、下記（９３）式が導出される。

以降の式変換の方法は、上記実施形態３と同一であるため、省略して説明する。以上から、線形時変な、入力を含む等式制約条件付き最適化問題を直交補空間変換を行い、下記（９４）に示す無制約条件のＬＱ最適化問題を導出する。

By substituting the equation (92) into the equation (8), the following equation (93) is derived.

Since the subsequent formula conversion method is the same as that of the third embodiment, the description will be omitted. From the above, the linear time-varying optimization problem with equality constraints including input is subjected to orthogonal complementary space transformation to derive the unconstrained LQ optimization problem shown in (94) below.

軌道生成部２２２は、導出した上記（９４）式に示す無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて最適解ζを高速に求解できる。最後に、軌道生成部２２２は、上記求解した最適解ζと、上記（１６）式及び（２０）式と、を用いて、上記（８９）式に示す入力を含む等式制約条件付きＬＱ最適化問題のパラメータを復元し、ｘ［ｋ］及びｕ［ｋ］を算出する。なお、軌道生成部２２２は、上記同様に、線形不変な、入力を含む等式制約条件付き最適化問題を求解してもよい。 The trajectory generation unit 222 can obtain the optimal solution ζ of the unconstrained LQ optimization problem shown in the derived equation (94) at high speed using the Riccati-type recursive calculation method. Finally, the trajectory generation unit 222 uses the obtained optimal solution ζ together with equations (16) and (20) to restore the variables of the LQ optimization problem with an equality constraint that includes the input, shown in equation (89), and calculates x[k] and u[k]. In the same way, the trajectory generation unit 222 may also solve a linear time-invariant optimization problem with an equality constraint that includes the input.

（第５実施形態）
本実施形態５に係る軌道生成部２２２は、線形時変な、所定の区間内だけ接触力が変化しないように設定した入力を含む等式制約条件、及び、所定の区間内だけ接触力に制限をかけるように設定した入力を含む不等式制約条件付き最適化問題を求解する。
(Fifth Embodiment)
The trajectory generation unit 222 according to the fifth embodiment solves a linear time-varying optimization problem with an equality constraint that includes the input and is set so that the contact force does not change within a predetermined section, and with an inequality constraint that includes the input and is set so that the contact force is limited within a predetermined section.

例えば、移動ロボットの手先や足先の急激な接触力の変動を防ぐように、接触力の変化に制限をかけたい場合を想定する。例えば、２番目の接触点の接触力の増加量と、最後から２番目の接触点の接触力の減少量と、をある区間内だけ、制限したい場合、制限をかけたい区間における不等式制約条件は、下記（９５）式のように入力Γ_ｋｕ[ｋ]を含むこととなる。

For example, suppose that changes in the contact force are to be limited so as to prevent sudden fluctuations in the contact force at the hands or feet of the mobile robot. If, for instance, the increase in the contact force at the second contact point and the decrease in the contact force at the second-to-last contact point are to be limited only within a certain section, the inequality constraint in that section includes the input term Γ_k u[k], as shown in the following equation (95).

但し、上記（９５）式において、Δｆ_ｌｍは各接触点の６軸力の制限値を縦に並べたベクトルである。また、上記制限をかけたい区間以外の区間においては、Ｐ_ｋは下記（９６）式のように設定できる。

However, in the above equation (95), Δf _lm is a vector in which limit values of the six-axis forces at each contact point are arranged vertically. In addition, P _k can be set as in the following equation (96) in a section other than the section where the restriction is desired.

上記式より、軌道生成部２２２は、下記（９７）式に示す入力を含む等式制約条件付きＬＱ最適化問題を求解し、重心軌道を生成することとなる。

但し、下記（９８）式が成立する。

From the above equation, the trajectory generation unit 222 solves the LQ optimization problem with equality constraints including the input shown in the following equation (97), and generates a centroid trajectory.

However, the following equation (98) is established.

不等式制約条件を示す上記（９５）式に上記（９２）式を代入することで、不等式制約条件を下記（９９）式に変換する。

但し、上記（９９）式のパラメータを下記（１００）式のように設定する。

By substituting the above equation (92) into the above equation (95) indicating the inequality constraint condition, the inequality constraint condition is converted into the following equation (99).

However, the parameter of the above equation (99) is set as the following equation (100).

上記（３３）式と上記（９９）式から、軌道生成部２２２は、下記（１０１）式に示す無制約条件のＬＱ最適化問題を、リカッチ型再帰的計算法を用いて最適解ζを求解することとなる。

以降に示す、上記無制約条件のＬＱ最適化問題に対するリカッチ型再帰的計算法による最適解ζの求解方法は、上記実施形態２において説明した求解方法と略同一であるため、相違点のみを説明する。

From equations (33) and (99), the trajectory generation unit 222 solves the unconstrained LQ optimization problem shown in the following equation (101) for the optimal solution ζ using the Riccati-type recursive calculation method.

The method for obtaining the optimal solution ζ of this unconstrained LQ optimization problem by the Riccati-type recursive calculation method is substantially the same as the solution method described in the second embodiment, so only the differences are described below.

上記（５３）式乃至（５５）式の係数行列Ｐが下記（１０２）式に置き換わる。

The coefficient matrix P in the equations (53) to (55) is replaced with the following equation (102).

したがって、上記（６６）式は、下記（１０３）式に置き換わる。

Therefore, the above equation (66) is replaced with the following equation (103).

ここで、下記（１０４）式が成立する。

Here, the following equation (104) is established.

上記（１０４）式を用いて、上記（１０３）式を下記（１０５）式のように変形できる。

Using the above equation (104), the above equation (103) can be transformed into the following equation (105).

但し、上記（１０５）式の各パラメータを下記（１０６）式のように設定する。

実施形態２の上記（６８）式と上記（１０５）式との相違は、Ｓ_ｋ（ハット）がＳ′_ｋ（ハット）となっているだけで、その他のパラメータは同一である。したがって、軌道生成部２２２は、以降の計算について、上記実施形態２と同一の計算を行い、最適解ζを高速に求解できる。最後に、軌道生成部２２２は、上記求解した最適解ζと、上記（１６）式及び（２０）式と、を用いて、入力を含む等式制約条件及び不等式制約条件付きＬＱ最適化問題のパラメータを復元し、ｘ［ｋ］及びｕ［ｋ］を算出する。なお、軌道生成部２２２は、上記同様に、線形不変な、入力を含む等式制約条件及び不等式制約条件付き最適化問題を求解してもよい。また、軌道生成部２２２は、線形不変あるいは線形時変な、等式制約条件及び入力を含む不等式制約条件付き最適化問題を求解してもよい。 However, each parameter of the above equation (105) is set as the following equation (106).

The only difference between equation (68) of the second embodiment and equation (105) is that Ŝ_k is replaced by Ŝ′_k; the other parameters are identical. Therefore, the trajectory generation unit 222 performs the same subsequent calculations as in the second embodiment and can obtain the optimal solution ζ at high speed. Finally, the trajectory generation unit 222 uses the obtained optimal solution ζ together with equations (16) and (20) to restore the variables of the LQ optimization problem with equality and inequality constraints that include the input, and calculates x[k] and u[k]. In the same way, the trajectory generation unit 222 may also solve a linear time-invariant optimization problem with equality and inequality constraints that include the input. The trajectory generation unit 222 may further solve a linear time-invariant or linear time-varying optimization problem with equality constraints and with inequality constraints that include the input.

Note that the present invention is not limited to the above embodiments and can be modified as appropriate without departing from its spirit.
For example, the present invention can also be realized by causing the CPU 210a to execute a computer program that performs the processing shown in FIG. 15 and FIG. 18.

The program can be stored on various types of non-transitory computer readable media and supplied to a computer. Non-transitory computer readable media include various types of tangible storage media. Examples of non-transitory computer readable media include magnetic recording media (for example, flexible disks, magnetic tapes, and hard disk drives), magneto-optical recording media (for example, magneto-optical disks), CD-ROM (Read Only Memory), CD-R, CD-R/W, and semiconductor memories (for example, mask ROM, PROM (Programmable ROM), EPROM (Erasable PROM), flash ROM, and RAM (random access memory)).

The program may also be supplied to a computer by various types of transitory computer readable media. Examples of transitory computer readable media include electrical signals, optical signals, and electromagnetic waves. A transitory computer readable medium can supply the program to a computer via a wired communication path, such as an electric wire or an optical fiber, or via a wireless communication path.

DESCRIPTION OF SYMBOLS: 1a-28a ... motor, 1b-28b ... encoder, 25 ... contact force sensor, 100 ... mobile robot, 210 ... optimal control device, 221 ... contact point plan setting unit, 222 ... trajectory generation unit, 223 ... motion control unit.

Claims

An optimal control device comprising:
contact point plan setting means for setting, for a mobile robot that moves while alternately grounding two or more moving means, a contact point plan that is time-series data of the positions of the contact points at which the moving means touch the ground and the postures of the moving means when grounded; and
trajectory generating means for generating, based on the contact point plan set by the contact point plan setting means, a center-of-gravity trajectory along which the mobile robot moves while the moving means contact the contact points,
wherein the trajectory generating means constructs a prediction model that takes as input a quantity based on the contact force when the moving means is grounded, uses the prediction model to express a state variable of the center of gravity of the mobile robot in a prediction interval having a predetermined time width, calculates the state variable of the center of gravity using a predetermined evaluation criterion in the prediction interval, and performs model predictive control that generates the center-of-gravity trajectory of the mobile robot based on the calculated state variable of the center of gravity,
the evaluation criterion is to minimize an evaluation function including the square of the quantity based on the contact force at each contact point within the prediction interval,
an optimization problem with an equality constraint, comprising the evaluation criterion, a linear state equation indicating the relationship between the input based on the contact force and the state variable of the center of gravity, and an equality constraint expressed by a linear equation of the mobile robot, is converted using an orthogonal complement space into an unconstrained optimization problem that does not include the equality constraint, and
the trajectory generating means finds an optimal solution by solving the converted unconstrained optimization problem in the prediction interval using a recursive calculation method, and calculates the state variable of the center of gravity based on the optimal solution found.

The optimal control device according to claim 1, wherein
the trajectory generating means constructs a prediction model that takes as input the derivative of the contact force when the moving means is grounded,
the evaluation criterion includes a criterion of distributing the contact force and the derivative of the contact force among the contact points based on weights set for the respective contact points, and minimizes, within the prediction interval, an evaluation function including the sums of squares of the contact force and of the derivative of the contact force, and
an optimization problem with an equality constraint, comprising the evaluation criterion, a state equation indicating the relationship between the input, which is the derivative of the contact force, and the state variable of the center of gravity, and an equality constraint indicating the force balance of the mobile robot, is converted using an orthogonal complement space into an unconstrained optimization problem that does not include the equality constraint.

The optimal control device according to claim 1 or 2, wherein
QR decomposition is performed on the equation expressing the equality constraint to derive a conversion equation for the state variable; QR decomposition is performed on an equation derived from the state equation indicating the relationship between the input, which is the derivative of the contact force, and the state variable of the center of gravity, to derive a conversion equation for the input; a conversion equation for the state equation is derived based on the state equation, the conversion equation for the state variable, and the conversion equation for the input; and a conversion equation for the evaluation function is derived based on the derived conversion equation for the state variable, the conversion equation for the input, and the evaluation function of the optimization problem with the equality constraint, and
the unconstrained optimization problem includes the derived conversion equation for the evaluation function and the derived conversion equation for the state equation.

The optimal control device according to any one of claims 1 to 3, wherein
the trajectory generating means
finds the optimal solution by applying a recursive calculation method to the optimality condition of an expression that represents the unconstrained optimization problem in matrix form, and calculates time-series data of the state variable of the center of gravity based on the optimal solution found and on the conversion equation for the state variable derived by QR decomposition of the equation expressing the equality constraint.

The optimal control device according to any one of claims 1 to 4, wherein
the equality constraint includes the input and is set so that the contact force does not change only within a predetermined interval.

The optimal control device according to any one of claims 1 to 5, wherein
the trajectory generating means
finds an optimal solution by applying a recursive calculation method to an unconstrained optimization problem obtained by converting, using an orthogonal complement space, an optimization problem with equality and inequality constraints that includes the equality constraint and an inequality constraint indicating the stability constraint of the contact points, and calculates time-series data of the state variable of the center of gravity based on the optimal solution found.

The optimal control device according to claim 6, wherein
the trajectory generating means
applies Newton's method to the optimality condition of an expression that represents the unconstrained optimization problem in matrix form, calculates the Newton direction using the recursive calculation method in each iteration of Newton's method, and calculates the optimal solution based on the calculated Newton direction.

The optimal control device according to claim 7, wherein
the trajectory generating means applies an interior point method or an active set method to the optimality condition of the expression that represents the unconstrained optimization problem in matrix form.

The optimal control device according to any one of claims 6 to 8, wherein
the inequality constraint includes the input and is set so as to limit the contact force only within a predetermined interval.

The optimal control device according to any one of claims 1 to 9, wherein
the state equation of the optimization problem includes linear time-varying control parameters.

The optimal control device according to any one of claims 1 to 10,
further comprising control means for controlling the moving means based on the center-of-gravity trajectory generated by the trajectory generating means.

An optimal control method comprising:
a step of setting, for a mobile robot that moves while alternately grounding two or more moving means, a contact point plan that is time-series data of the positions of the contact points at which the moving means touch the ground and the postures of the moving means when grounded; and
a step of generating, based on the set contact point plan, a center-of-gravity trajectory along which the mobile robot moves while the moving means contact the contact points,
wherein a prediction model that takes as input a quantity based on the contact force when the moving means is grounded is constructed, a state variable of the center of gravity of the mobile robot in a prediction interval having a predetermined time width is expressed using the prediction model, the state variable of the center of gravity is calculated using a predetermined evaluation criterion in the prediction interval, and model predictive control is performed that generates the center-of-gravity trajectory of the mobile robot based on the calculated state variable of the center of gravity,
the evaluation criterion is to minimize an evaluation function including the square of the quantity based on the contact force at each contact point within the prediction interval,
an optimization problem with an equality constraint, comprising the evaluation criterion, a linear state equation indicating the relationship between the input based on the contact force and the state variable of the center of gravity, and an equality constraint expressed by a linear equation of the mobile robot, is converted using an orthogonal complement space into an unconstrained optimization problem that does not include the equality constraint, and
in the prediction interval, an optimal solution is found by solving the converted unconstrained optimization problem using a recursive calculation method, and the state variable of the center of gravity is calculated based on the optimal solution found.

An optimal control program for causing a computer to execute:
a process of setting, for a mobile robot that moves while alternately grounding two or more moving means, a contact point plan that is time-series data of the positions of the contact points at which the moving means touch the ground and the postures of the moving means when grounded; and
a process of generating, based on the set contact point plan, a center-of-gravity trajectory along which the mobile robot moves while the moving means contact the contact points,
wherein a prediction model that takes as input a quantity based on the contact force when the moving means is grounded is constructed, a state variable of the center of gravity of the mobile robot in a prediction interval having a predetermined time width is expressed using the prediction model, the state variable of the center of gravity is calculated using a predetermined evaluation criterion in the prediction interval, and model predictive control is performed that generates the center-of-gravity trajectory of the mobile robot based on the calculated state variable of the center of gravity,
the evaluation criterion is to minimize an evaluation function including the square of the quantity based on the contact force at each contact point within the prediction interval,
an optimization problem with an equality constraint, comprising the evaluation criterion, a linear state equation indicating the relationship between the input based on the contact force and the state variable of the center of gravity, and an equality constraint expressed by a linear equation of the mobile robot, is converted using an orthogonal complement space into an unconstrained optimization problem that does not include the equality constraint, and
in the prediction interval, an optimal solution is found by solving the converted unconstrained optimization problem using a recursive calculation method, and the state variable of the center of gravity is calculated based on the optimal solution found.