JP7475562B1

JP7475562B1 - Motion planning device

Info

Publication number: JP7475562B1
Application number: JP2023568059A
Authority: JP
Inventors: 隆之助渡辺; 僚太岡本
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2023-08-22
Filing date: 2023-08-22
Publication date: 2024-04-26
Anticipated expiration: 2043-08-22

Abstract

本開示はサンプリング手法による動的システムの動作計画装置に関し、動的システムに達成させる目標に基づいて動的システムの運動を表現する数式モデルと動的システムの事前に考慮すべき状態制約を出力する行動計画部と、数式モデルと状態制約に基づいて入力サンプリング範囲を制限し、数式モデルに基づいて入力サンプリング範囲での状態推定演算により、動的システムの動作計画を生成する動作計画生成部と、を有している。The present disclosure relates to an operation planning device for a dynamic system using a sampling method, the device having an action planning unit that outputs a mathematical model expressing the motion of the dynamic system based on a goal to be achieved by the dynamic system and state constraints of the dynamic system that should be considered in advance, and an operation plan generation unit that limits an input sampling range based on the mathematical model and the state constraints, and generates an operation plan for the dynamic system by performing a state estimation calculation within the input sampling range based on the mathematical model.

Description

本開示は動作計画装置に関し、特に、サンプリング手法による動的システムの動作計画装置に関する。 The present disclosure relates to a motion planning device, and in particular to a motion planning device for dynamic systems using a sampling technique.

動的システムの動作計画問題においていくつかの手法が提案されており、そのうちの１つとしてサンプリング手法によるものが開発されている。動作計画問題においては対象が満たすべき制約が存在するが、サンプリング手法では各パーティクルにおいて制約を満たす有効なパーティクル（有効パーティクル）と制約を満たさない無効なパーティクル（無効パーティクル）を判定した上で動作計画を立てるため、複雑な制約にも対応しやすいという利点がある。 Several methods have been proposed for the motion planning problem of dynamic systems, one of which is a sampling method. In motion planning problems, there are constraints that the target must satisfy, but the sampling method has the advantage of being able to easily deal with complex constraints, as it creates a motion plan after determining for each particle whether it is a valid particle that satisfies the constraints (valid particles) or an invalid particle that does not (invalid particles).

例えば、特許文献１の車両制御システムでは、制御入力のサンプリングを実施した後に物理モデルの時間発展を計算し、状態制約に対する有効パーティクルの判定を行うことで、有効パーティクルのみを用いて所望の運動を満たす確率が最適となるように車両を制御している。For example, in the vehicle control system of Patent Document 1, the control input is sampled, the time evolution of a physical model is calculated, and valid particles are determined for state constraints, thereby controlling the vehicle so as to optimize the probability of achieving the desired motion using only valid particles.

また、特許文献２の自動運転システムでは、路面の滑りの制約の範囲内で制御入力のサンプリングを実施することで路面の滑りに対する有効パーティクルのみを用いて所望の運動を満たす確率が最適となるように車両を制御している。In addition, in the autonomous driving system of Patent Document 2, the control input is sampled within the constraints of road surface slippage, and the vehicle is controlled to optimize the probability of achieving the desired motion using only effective particles for road surface slippage.

特許第６４９４８７２号公報Patent No. 6494872 特許第６５９４５８９号公報Patent No. 6594589

A. D. Ames X. Xu, J. W. Grizzle and P. Tabuada. “Control barrier function based quadratic programs for safety critical systems." IEEE Transactions on Automatic Control, Vol. 62, No. 8, pp.3861-3876, 2016.A. D. Ames, X. Xu, J. W. Grizzle and P. Tabuada. “Control barrier function based quadratic programs for safety critical systems.” IEEE Transactions on Automatic Control, Vol. 62, No. 8, pp.3861-3876, 2016. Q. Nguyen and K. Sreenath. “Exponential control barrier functions for enforcing high relative-degree safety-critical constraints." 2016 American Control Conference. 2016.Q. Nguyen and K. Sreenath. “Exponential control barrier functions for enforcing high relative-degree safety-critical constraints.” 2016 American Control Conference. 2016.

サンプリング手法では、サンプリングに基づいて近似される確率密度分布の近似精度を保証するために大量のサンプリング計算が必要である。特許文献１のように入力サンプリングを実施した後に物理モデルの時間発展を計算し、制約を満たす有効パーティクルを判定する場合、有効パーティクル数が少ないと確率密度分布の近似精度を保証できない。つまり、有効なパーティクルの減少によって手法の機能低下が生じる可能性が高くなる。 In sampling methods, a large amount of sampling calculations is required to ensure the approximation accuracy of the probability density distribution that is approximated based on sampling. When calculating the time evolution of a physical model after performing input sampling as in Patent Document 1 and determining effective particles that satisfy constraints, if the number of effective particles is small, the approximation accuracy of the probability density distribution cannot be guaranteed. In other words, a decrease in effective particles increases the possibility of a decrease in the functionality of the method.

また、特許文献２の手法では、事前に入力サンプリングを制限しているため、有効パーティクル数を確保しやすいが、路面の滑りに限定されているため他の用途に適用することが難しく、他の状態制約を考慮する場合には特許文献１と同様の結果となる。 In addition, the method of Patent Document 2 limits the input sampling in advance, making it easy to ensure a valid number of particles, but since it is limited to road surface slippage, it is difficult to apply it to other applications, and when other state constraints are taken into account, the results are similar to those of Patent Document 1.

本開示は、上記のような問題点を解決するためになされたものであり、有効なパーティクルの減少によって手法の機能低下が生じる可能性が低く、適用先のシステムにおいて事前に考慮すべき状態制約が限定されないサンプリング手法による動作計画装置を提供することを目的とする。 The present disclosure has been made to solve the problems described above, and aims to provide an action planning device using a sampling method that is unlikely to cause a degradation in the functionality of the method due to a decrease in valid particles, and that does not limit the state constraints that must be considered in advance in the system to which it is applied.

本開示に係る動作計画装置は、サンプリング手法による動的システムの動作計画装置であって、前記動的システムに達成させる目標に基づいて前記動的システムの運動を表現する数式モデルと前記動的システムの事前に考慮すべき状態制約を出力する行動計画部と、前記数式モデルと前記状態制約に基づいて事前に入力サンプリング範囲を制限し、制限された前記入力サンプリング範囲での状態推定演算により、前記動的システムの動作計画を生成する動作計画生成部と、を有している。

The motion planning device according to the present disclosure is a motion planning device for a dynamic system using a sampling method, and includes: an action planning unit that outputs a mathematical model expressing the motion of the dynamic system based on a goal to be achieved by the dynamic system and state constraints of the dynamic system that should be considered in advance; and a motion plan generation unit that limits an input sampling range in advance based on the mathematical model and the state constraints, and generates a motion plan for the dynamic system by performing a state estimation calculation within the limited input sampling range.

本開示に係る動作計画装置によれば、事前に考慮すべき状態制約の範囲内で入力サンプリングを行うため、サンプリングに基づいて近似される確率密度分布の近似精度を向上させることができ有効なパーティクルの減少によって手法の機能低下が生じる可能性を低減でき、適用先のシステムにおいて事前に考慮すべき状態制約が限定されない。 According to the motion planning device disclosed herein, input sampling is performed within the range of state constraints that should be considered in advance, thereby improving the approximation accuracy of the probability density distribution approximated based on the sampling, reducing the possibility of the performance of the method deteriorating due to a decrease in valid particles, and there is no limit to the state constraints that should be considered in advance in the system to which the method is applied.

本開示に係る実施の形態１の動作計画装置を搭載した移動体の構成を示す概略図である。1 is a schematic diagram illustrating a configuration of a moving body equipped with a motion planning device according to a first embodiment of the present disclosure. 本開示に係る実施の形態１の動作計画装置を搭載した移動体の構成を示す概略図である。1 is a schematic diagram illustrating a configuration of a moving body equipped with a motion planning device according to a first embodiment of the present disclosure. 本開示に係る実施の形態１の動作計画装置を搭載した移動体が移動を行う状況の一例を模式的に示す図である。FIG. 1 is a diagram illustrating an example of a situation in which a moving object equipped with the motion planning device according to the first embodiment of the present disclosure moves. 本開示に係る実施の形態１の動作計画装置の構成を示すブロック図である。1 is a block diagram showing a configuration of a motion planning device according to a first embodiment of the present disclosure. 本開示に係る実施の形態１で用いる座標系を模式的に示す図である。FIG. 2 is a diagram illustrating a coordinate system used in the first embodiment according to the present disclosure. 本開示に係る実施の形態１の動作計画装置で用いるパーティクルフィルタの演算処理を示すフローチャートである。5 is a flowchart showing a calculation process of a particle filter used in the motion planning device according to the first embodiment of the present disclosure. 本開示に係る実施の形態１の動作計画装置における動作計画生成の結果を模式的に示す図である。1 is a diagram illustrating a schematic diagram of a result of generating a motion plan in the motion planning device according to the first embodiment of the present disclosure. FIG. 本開示に係る実施の形態３の動作計画装置における入力サンプリング値の修正方法を模式的に示す図である。FIG. 13 is a diagram illustrating a method of correcting input sampling values in a motion planning apparatus according to a third embodiment of the present disclosure. 本開示に係る実施の形態４の動作計画装置における入力サンプリング範囲の制限方法を模式的に示す図である。FIG. 13 is a diagram illustrating a method for limiting an input sampling range in a motion planning device according to a fourth embodiment of the present disclosure. 実施の形態１～４の動作計画装置を実現するハードウェア構成を示す図である。FIG. 2 is a diagram illustrating a hardware configuration for implementing the motion planning device according to the first to fourth embodiments. 実施の形態１～４の動作計画装置を実現するハードウェア構成を示す図である。FIG. 2 is a diagram illustrating a hardware configuration for implementing the motion planning device according to the first to fourth embodiments.

＜実施の形態１＞
＜移動体のシステム構成＞
図１および図２は、本開示に係る実施の形態１の動作計画装置を搭載した動的システムである移動体１のシステム構成を示す概略図である。図１は移動体１を上面から見た透視図であり、図２は移動体１の側面図である。図１に示すように移動体１は、駆動装置として、車輪１０１、アクチュエータ１０２およびバッテリ１０３を備えている。アクチュエータ１０２はバッテリ１０３から得た電力を駆動力に変換し、車輪１０１へ入力する。アクチュエータ１０２は制御装置１０によって制御される。制御装置１０は、記憶部と演算部を備えており、設定された制御プログラムに従ってアクチュエータ１０２を制御し、移動体１の移動を制御する。 <First embodiment>
<System configuration of a moving object>
1 and 2 are schematic diagrams showing a system configuration of a moving body 1, which is a dynamic system equipped with a motion planning device according to a first embodiment of the present disclosure. FIG. 1 is a perspective view of the moving body 1 as seen from above, and FIG. 2 is a side view of the moving body 1. As shown in FIG. 1, the moving body 1 includes wheels 101, actuators 102, and a battery 103 as driving devices. The actuators 102 convert power obtained from the battery 103 into driving force and input the power to the wheels 101. The actuators 102 are controlled by a control device 10. The control device 10 includes a storage unit and a calculation unit, and controls the actuators 102 according to a set control program to control the movement of the moving body 1.

本実施の形態では、３つの車輪と３つのアクチュエータを使用し全方向に移動可能な移動体１に動作計画装置を搭載した例を示しているが、これに限定されず、例えば差動二輪移動体または脚式移動ロボットに搭載することもできる。ここで、差動二輪移動体とは、独立に回転可能な車輪を２つ有しており、各車輪の回転速度差を利用して直進および旋回を行う移動体である。In this embodiment, an example is shown in which the motion planning device is mounted on a mobile body 1 that can move in all directions using three wheels and three actuators, but the present invention is not limited to this, and the motion planning device can also be mounted on, for example, a differential two-wheel mobile body or a leg-type mobile robot. Here, a differential two-wheel mobile body is a mobile body that has two wheels that can rotate independently and uses the difference in rotation speed between the wheels to move straight and turn.

また、本開示の適用対象である動的システムは移動体に限らず、例えばロボットアームまたは天井クレーンなどの運動が数式モデル（微分方程式）として表される動的システムであれば適用可能である。 In addition, the dynamic systems to which this disclosure applies are not limited to moving bodies, but can be applied to any dynamic system in which the movement of a robot arm or overhead crane, for example, is expressed as a mathematical model (differential equation).

動的システムを移動体とした場合は、目標地点に向けて複雑な目標軌道を設定することが可能となる。 When the dynamic system is treated as a moving body, it is possible to set a complex target trajectory toward a target location.

移動体１は、観測装置として、図１に示す車輪角度センサ１１１、図２に示す光学式測域センサ１１２および深度カメラ１１３を備えている。当該観測装置は制御装置１０に接続されている。The moving body 1 is equipped with a wheel angle sensor 111 shown in Fig. 1, an optical range sensor 112 shown in Fig. 2, and a depth camera 113 as observation devices. The observation devices are connected to the control device 10.

車輪角度センサ１１１は各車輪１０１に設けられ、車輪１０１の回転量を検出する。車輪１０１の回転量に基づいて制御装置１０により移動体１の移動量を算出する。車輪角度センサ１１１は、例えばロータリーエンコーダによって構成される。The wheel angle sensor 111 is provided on each wheel 101 and detects the amount of rotation of the wheel 101. The control device 10 calculates the amount of movement of the mobile body 1 based on the amount of rotation of the wheel 101. The wheel angle sensor 111 is, for example, configured by a rotary encoder.

光学式測域センサ１１２は、例えばＬｉＤＡＲ（Light Detection and Ranging）であり、移動体１の上面に備えられている。光学式測域センサ１１２は、移動体１の周辺環境における空間の物理的な形状データを走査平面に沿って計測する。計測された形状データに基づいて、制御装置１０は周辺環境の地図を作成する。作成された地図を参照することで、移動体１が地図平面上のどの位置にいるのかを推定する。この際に、地図を予め作成し、その地図情報を参照することで移動体１の位置を推定することもできる。The optical range sensor 112 is, for example, a LiDAR (Light Detection and Ranging) sensor, and is provided on the top surface of the moving body 1. The optical range sensor 112 measures physical shape data of the space in the surrounding environment of the moving body 1 along a scanning plane. Based on the measured shape data, the control device 10 creates a map of the surrounding environment. By referring to the created map, the position of the moving body 1 on the map plane is estimated. At this time, it is also possible to create a map in advance and estimate the position of the moving body 1 by referring to the map information.

図３は、移動体１が移動を行う状況の一例を模式的に示す図である。図３において、光学式測域センサ１１２で得られた空間の物理的な形状データは、移動体１の走行に対して障害となる人５００および障害となる物６００の形状を含む。制御装置１０は、これら形状データを移動体１が避けるべき対象であると認知する。なお、以下の説明では、人、壁、別の移動体などの移動体１の移動を妨げる対象全てを障害物として表記する。 Figure 3 is a diagram showing a schematic example of a situation in which the moving body 1 moves. In Figure 3, the physical shape data of the space obtained by the optical range sensor 112 includes the shapes of a person 500 and an object 600 that are obstacles to the movement of the moving body 1. The control device 10 recognizes these shape data as objects that the moving body 1 should avoid. In the following explanation, all objects that impede the movement of the moving body 1, such as a person, a wall, or another moving body, are referred to as obstacles.

前方深度カメラ１１３は、車両前方の空間の物理的な形状データを画像と共に取得する。前方深度カメラ１１３がカメラ画角内の測域情報を取得することで、光学式測域センサ１１２では走査平面上の形状データ（３次元空間上のある平面を切り取ったデータ）を補う。The forward depth camera 113 acquires physical shape data of the space in front of the vehicle along with an image. The forward depth camera 113 acquires range information within the camera's angle of view, and the optical range sensor 112 supplements the shape data on the scanning plane (data cut from a plane in three-dimensional space).

ただし、上述した観測装置は一例であり、移動体１が移動する環境において障害物の形状データを得る方法は特に限定されない。However, the above-mentioned observation device is just one example, and the method of obtaining shape data of obstacles in the environment in which the mobile body 1 moves is not particularly limited.

＜装置構成＞
図４は、本開示に係る実施の形態１の動作計画装置３００の構成を示すブロック図と、動作計画装置３００に接続される駆動装置１００および観測装置２００の構成を示すブロック図である。 <Device Configuration>
FIG. 4 is a block diagram showing the configuration of a motion planning apparatus 300 according to the first embodiment of the present disclosure, and a block diagram showing the configurations of a drive device 100 and an observation device 200 connected to the motion planning apparatus 300.

図４に示されるように、駆動装置１００は、先に説明した、車輪１０１、アクチュエータ１０２およびバッテリ１０３を備えている。観測装置２００は、先に説明した、車輪角度センサ１１１、光学式測域センサ１１２および深度カメラ１１３を備えている。As shown in Figure 4, the driving device 100 includes the wheels 101, the actuator 102, and the battery 103 described above. The observation device 200 includes the wheel angle sensor 111, the optical range sensor 112, and the depth camera 113 described above.

図４に示される動作計画装置３００は、制御装置１０に含まれ、行動計画部３１０と動作計画生成部３２０とを備えている。制御装置１０は、目標軌道に従って移動体１を制御する機器であり、例えば、組み込み計算機として搭載される。The motion planning device 300 shown in Figure 4 is included in the control device 10 and has an action planning unit 310 and a motion plan generating unit 320. The control device 10 is a device that controls the moving body 1 according to a target trajectory, and is installed, for example, as an embedded computer.

行動計画部３１０は、移動体状態推定部３１１、移動目標演算部３１２および状態制約演算部３１３を有している。The action planning unit 310 has a moving body state estimation unit 311, a moving target calculation unit 312, and a state constraint calculation unit 313.

移動体状態推定部３１１は、例えば、全地球測位センサ（ＧＰＳ）からの情報に基づいて移動体１の自己位置推定を実行する。移動体状態推定部３１１で推定された自己位置情報は、動作計画生成部３２０と移動目標演算部３１２へ出力される。The mobile object state estimation unit 311 performs self-position estimation of the mobile object 1 based on information from, for example, a global positioning sensor (GPS). The self-position information estimated by the mobile object state estimation unit 311 is output to the motion plan generation unit 320 and the movement target calculation unit 312.

移動目標演算部３１２では、移動体状態推定部３１１から出力された移動体１の自己位置および観測装置２００から出力された障害物情報などに基づいて、移動体１に達成させる目標を計算し、状態制約演算部３１３に出力する。当該目標は、例えば、参照軌道および目標地点、回避したい障害物の情報などである。The moving target calculation unit 312 calculates a goal to be achieved by the moving body 1 based on the self-position of the moving body 1 output from the moving body state estimation unit 311 and the obstacle information output from the observation device 200, and outputs the calculated goal to the state constraint calculation unit 313. The goal is, for example, a reference trajectory, a target point, and information on an obstacle to be avoided.

行動計画部３１０内の状態制約演算部３１３では、移動目標演算部３１２から出力された移動体１に達成させる目標に基づいて、移動体１の数式モデルと事前に考慮すべき状態制約情報を演算して出力する。本実施の形態１では、事前に考慮すべき状態制約情報は、移動体１と障害物との相対位置情報であり、移動体１の位置と障害物位置と障害物形状を含む情報に基づいて出力する。The state constraint calculation unit 313 in the action planning unit 310 calculates and outputs a mathematical model of the moving body 1 and state constraint information that should be considered in advance based on the goal to be achieved by the moving body 1 output from the movement target calculation unit 312. In the present embodiment 1, the state constraint information that should be considered in advance is relative position information between the moving body 1 and an obstacle, and is output based on information including the position of the moving body 1, the obstacle position, and the obstacle shape.

状態制約演算部３１３を行動計画部３１０内に有し、動作計画生成部３２０とは独立させることで、動的システムの動作と状態制約とを表す数式モデルおよびサンプリング手法に依存することなく、統一的に制約を扱うことができる。By having the state constraint calculation unit 313 within the action planning unit 310 and making it independent from the action plan generation unit 320, it is possible to handle constraints in a unified manner without relying on the mathematical model and sampling method that represent the behavior and state constraints of the dynamic system.

状態制約演算部３１３から出力された相対位置情報は動作計画生成部３２０へ入力される。例えば、観測装置２００の光学式測域センサ１１２からは移動体１と周辺環境との相対距離が得られる。ただし、相対距離はセンサから直接得られる情報に限らず、１つ以上のセンサからの値に基づいて計算した値とすることができる。例えば、ＧＰＳから、制御対象のシステムと回避対象物の位置を取得し、それぞれの位置から相対距離を計算することができる。また、２つのカメラ映像センサを用いて、それぞれの画像データを取得し、それぞれの画像データにおける視差を用いて相対距離を計算することができる。 The relative position information output from the state constraint calculation unit 313 is input to the operation plan generation unit 320. For example, the relative distance between the moving body 1 and the surrounding environment is obtained from the optical range sensor 112 of the observation device 200. However, the relative distance is not limited to information obtained directly from the sensor, but can be a value calculated based on values from one or more sensors. For example, the positions of the system to be controlled and the object to be avoided can be obtained from a GPS, and the relative distance can be calculated from each position. In addition, two camera image sensors can be used to obtain image data of each, and the relative distance can be calculated using the parallax in the image data of each.

また、センサを使わず現象を数式化した値に基づいたソフトウェアによる計算のみの値とすることもできる。例えば、人またはロボットを検知していない場合でも人またはロボットの飛び出しを数式で常に予測する処理を行う。これは、シミュレーションにより、検知していない人の飛び出しまたは、移動体の進入禁止範囲を示す仮想的な壁などを配置する、あるいはセンサ範囲外から確率的に障害物が現れるものとして予測計算する処理である。 It is also possible to use only software calculations based on values that have been mathematically transformed into phenomena without using sensors. For example, even when no people or robots are detected, a process is performed that constantly predicts, using a mathematical formula, whether a person or robot will suddenly jump out. This is a process that uses simulation to place virtual walls that indicate undetected people or areas where moving objects are prohibited from entering, or to predict and calculate the probabilistic appearance of obstacles from outside the sensor range.

動作計画生成部３２０は、目標軌道生成部３２１および目標軌道記憶部３２２を有している。目標軌道生成部３２１は、行動計画部３１０から出力された移動体１と障害物との相対位置情報、例えば相対距離に基づいて、事前に障害物を回避することを考慮して、目標を達成しながら移動するための目標軌道を生成する。この目標軌道の生成に、サンプリング手法を適用する。生成された目標軌道は、目標軌道記憶部３２２へ出力される。The motion plan generating unit 320 has a target trajectory generating unit 321 and a target trajectory storage unit 322. The target trajectory generating unit 321 generates a target trajectory for moving while achieving a goal, taking into consideration avoiding obstacles in advance, based on relative position information between the moving body 1 and obstacles, such as relative distance, output from the action planning unit 310. A sampling method is applied to generate this target trajectory. The generated target trajectory is output to the target trajectory storage unit 322.

目標軌道記憶部３２２は、目標軌道生成部３２１から得られた目標軌道を記憶し、必要な分の目標軌道情報を選択し動作計画として駆動装置１００へ出力する。駆動装置１００は受け取った動作計画に従って移動体１を動作させる。The target trajectory memory unit 322 stores the target trajectory obtained from the target trajectory generation unit 321, selects the necessary target trajectory information, and outputs it as an operation plan to the driving device 100. The driving device 100 operates the moving body 1 according to the received operation plan.

＜移動体の座標系＞
図５は、実施の形態１で用いる座標系を模式的に示す図である。図５のＸ軸およびＹ軸は慣性座標系とし、ｘ_ｒおよびｙ_ｒは慣性座標系での移動体１の重心位置を表す。ただし、ｘ_ｒおよびｙ_ｒは移動体１の位置を表すことができれば重心位置に限定されず、例えば形状中心点、深度カメラ設置点および測域センサ設置点などとすることができる。ｖ_ｘおよびｖ_ｙは、慣性座標系での移動体１のＸ方向とＹ方向の速度である。また、ｘ_ｏおよびｙ_ｏを障害物６００の代表点位置とし、同じくＸＹ慣性座標系で表す。障害物６００の代表点位置は１点以上とすることができ、例えば移動体１と障害物６００との最短距離となる点、障害物６００の形状中心および障害物６００の重心位置等とすることができる。 <Moving object coordinate system>
FIG. 5 is a diagram showing a schematic diagram of a coordinate system used in the first embodiment. The X-axis and Y-axis in FIG. 5 are inertial coordinate systems, and _xr and _yr represent the center of gravity of the moving body 1 in the inertial coordinate system. However, _xr and _yr are not limited to the center of gravity as long as they can represent the position of the moving body 1, and can be, for example, the shape center point, the depth camera installation point, and the range sensor installation point. _vx and _vy are the X-direction and Y-direction velocities of the moving body 1 in the inertial coordinate system. In addition, _xo and _yo are the representative point positions of the obstacle 600, and are also expressed in the XY inertial coordinate system. The representative point position of the obstacle 600 can be one or more points, and can be, for example, the point that is the shortest distance between the moving body 1 and the obstacle 600, the shape center of the obstacle 600, and the center of gravity of the obstacle 600.

＜サンプリング手法＞
本実施の形態では、目標軌道生成部３２１は、観測装置２００から得られた情報に基づいて、制御装置１０が移動体１を制御するための指標となる目標軌道を、サンプリング手法により動作計画として生成する。実施の形態１の動作計画装置３００は、移動体１の運動を数学的に表した数式モデルｆを用いて状態の時間発展演算を行い、かつ事前に状態制約を考慮した上で、設計された最適化問題を解くことで目標軌道を生成する。本実施の形態では、サンプリング手法の一種であるパーティクルフィルタを用いる。本開示の効果は、動的システムの時間発展に関わる演算がサンプリング手法の定式化内に含まれる全ての場合を対象とするため、サンプリング手法がパーティクルフィルタに限定されない。この効果に関する詳細は、後述する具体的な定式化で説明する。 <Sampling method>
In this embodiment, the target trajectory generating unit 321 generates a target trajectory, which is an index for the control device 10 to control the moving body 1, as a motion plan by a sampling method based on information obtained from the observation device 200. The motion planning device 300 of the first embodiment performs a time evolution calculation of the state using a mathematical model f that mathematically represents the motion of the moving body 1, and generates a target trajectory by solving an optimization problem designed in advance after taking into account state constraints. In this embodiment, a particle filter, which is a type of sampling method, is used. The effect of the present disclosure applies to all cases in which calculations related to the time evolution of a dynamic system are included in the formulation of the sampling method, so the sampling method is not limited to the particle filter. Details of this effect will be described in the specific formulation described later.

＜サンプリング手法による動作計画生成の定式化＞
本実施の形態では、目標軌道生成部３２１で用いる移動体の状態量ｘと入力ｕを以下の数式（１）のように設定する。 <Formulation of motion plan generation using sampling method>
In this embodiment, the state quantity x of the moving body and the input u used in the target trajectory generating unit 321 are set as shown in the following formula (1).

ここで、座標系は図４に示した座標系とし、ｖ_ｘはＸ方向の速度、ｖ_ｙはＹ方向の速度とする。 Here, the coordinate system is that shown in FIG. 4, _{v_x} is the velocity in the X direction, and _{v_y} is the velocity in the Y direction.

移動体の運動を表す数式モデルｆは以下の数式（２）で表され、入力に対して線形である。このため、入力サンプリング範囲の抽出が単純化でき、計算負荷を低減できる。The mathematical model f representing the motion of a moving object is expressed by the following mathematical formula (2) and is linear with respect to the input. This simplifies the extraction of the input sampling range and reduces the computational load.

なお、数式（１）と数式（２）は数式モデルの一例であるため、システムの特性に合わせて状態量ｘ、入力ｕ、数式モデルｆを選択することができる。座標系も直交座標系に限らず、例えば、経路座標系で定義することができる。 Note that formulas (1) and (2) are examples of formula models, so the state quantity x, input u, and formula model f can be selected according to the characteristics of the system. The coordinate system is not limited to the Cartesian coordinate system, and can be defined, for example, in the path coordinate system.

本実施の形態では、事前に考慮すべき状態制約として、移動体１が障害物と衝突しない状態制約ｈ（ｘ）≧０を以下の数式（３）のように表す。In this embodiment, the state constraint to be considered in advance is the state constraint h(x)≧0, which indicates that the moving body 1 does not collide with an obstacle, and is expressed as the following equation (3).

ここで、ｒ_ｍは障害物との一定間隔を表す値である。状態制約ｈ（ｘ）≧０が満たされている間は、障害物とｒ_ｍ以上の距離を保って移動することを意味する。また、この状態制約ｈ（ｘ）≧０は、ｈ（ｘ）≦０として考えることもできる。ただし、本実施の形態では状態制約ｈ（ｘ）≧０を対象とするため、正負の符号に注意する。 Here, r _m is a value that indicates a certain distance from an obstacle. This means that while the state constraint h(x) ≧ 0 is satisfied, the robot moves while maintaining a distance of r _m or more from the obstacle. This state constraint h(x) ≧ 0 can also be considered as h(x) ≦ 0. However, since this embodiment is concerned with the state constraint h(x) ≧ 0, attention should be paid to the positive and negative signs.

なお、状態制約ｈ（ｘ）はスカラ値であり、状態制約ｈ（ｘ）がスカラ値であるので、入力サンプリング範囲の抽出が単純化でき、計算負荷を低減できる。 In addition, the state constraint h(x) is a scalar value, and since the state constraint h(x) is a scalar value, the extraction of the input sampling range can be simplified and the computational load can be reduced.

なお、障害物の位置ｘ_ｏ，ｙ_ｏを数式モデルに含めても良く、障害物の位置ｘ_ｏ，ｙ_ｏが動的システムであるとして定義しても良い。 The positions x _o and y _o of the obstacle may be included in the mathematical model, and the positions x _o and y _o of the obstacle may be defined as a dynamic system.

ここまでをまとめて最適化問題を以下の数式（４）のように設定する。 To summarise all of the above, we set the optimization problem as shown in equation (4) below.

ここで、Ｊ（ｘ，ｕ）は評価関数である。評価関数は所望の評価値に応じて設計することができる。例えば、目標地点までの距離の時間積分値または入力の大きさの時間積分値を用いることができる。 Here, J(x,u) is the evaluation function. The evaluation function can be designed according to the desired evaluation value. For example, the time integral of the distance to the target point or the time integral of the magnitude of the input can be used.

数式（４）では、目標地点までの距離の時間積分値および入力の大きさの時間積分値が最小となるように最適化することを表している。なお、評価関数は数学分野における最適化理論の言葉として用いている。Equation (4) expresses the optimization that minimizes the time integral of the distance to the target point and the time integral of the magnitude of the input. Note that the term "evaluation function" is used as a term in optimization theory in the field of mathematics.

本実施の形態では、全方位に移動可能な移動体を対象に定式化を説明したが、動的システムの数式モデルｆと状態制約ｈ（ｘ）を表現できる対象であればこれに限定されない。例えば、差動二輪モデルまたは四輪車両モデルを対象とすることもできる。In this embodiment, the formulation has been described for a mobile object that can move in all directions, but the formulation is not limited to this as long as the object can express the mathematical model f of the dynamic system and the state constraint h(x). For example, a differential two-wheel model or a four-wheel vehicle model can also be used.

＜サンプリング手法による動作計画生成＞
本実施の形態では、サンプリング手法としてパーティクルフィルタを採用する。パーティクルフィルタとは、確率密度分布による時系列データの予測手法である。このパーティクルフィルタによる状態推定演算を実行することで、数式（４）の最適化問題を逐次的に解く。 <Motion plan generation using sampling method>
In this embodiment, a particle filter is used as a sampling method. The particle filter is a method for predicting time series data using a probability density distribution. By executing a state estimation calculation using this particle filter, the optimization problem of Equation (4) is solved sequentially.

状態推定演算としてのパーティクルフィルタは、複数のパーティクルによって状態の確率密度分布を近似するものであり、例えば、ある状態量を示すパーティクルが多ければ、その状態の確率密度が高いことになる。この場合、確率密度分布の近似精度を保証するためには大量のパーティクルが必要である。すなわち、本開示の対象である手法では、有効なパーティクルの減少によって手法の機能低下が生じる。この機能低下は、アルゴリズム内で計算されたパーティクルが制約を違反して削除された場合に生じる。本開示に係る技術は、この機能低下の発生要因である制約を事前に考慮することで有効パーティクル数を保ち機能低下を抑制する。 A particle filter as a state estimation calculation uses multiple particles to approximate the probability density distribution of a state; for example, if there are many particles indicating a certain state quantity, the probability density of that state is high. In this case, a large number of particles are required to ensure the accuracy of the approximation of the probability density distribution. In other words, in the method that is the subject of this disclosure, a decrease in effective particles causes a degradation of the method's performance. This degradation occurs when a particle calculated within the algorithm violates a constraint and is deleted. The technology disclosed herein maintains the number of effective particles and suppresses degradation of performance by taking into account in advance the constraints that cause this degradation.

＜パーティクルフィルタの演算フロー＞
図６は、本実施の形態の動作計画装置３００における目標軌道生成部３２１で実行されるパーティクルフィルタの演算処理を示すフローチャートである。 <Particle filter calculation flow>
FIG. 6 is a flowchart showing the particle filter calculation process executed by the target trajectory generating unit 321 in the motion planning apparatus 300 according to this embodiment.

演算処理を開始すると、目標軌道生成部３２１は、まず、事前に考慮すべき状態制約の情報を取得する（ステップＳ１０１）。この状態制約をパーティクルごとに満たすように確率密度分布を近似する。When the calculation process starts, the target trajectory generation unit 321 first obtains information on state constraints that should be considered in advance (step S101). The probability density distribution is approximated so that the state constraints are satisfied for each particle.

目標軌道生成部３２１は、Ｎ_ｐ個のパーティクルを初期化する（ステップＳ１０２）。ここで、Ｎ_ｐは２以上の整数である。このとき、Ｎ_ｐ個のパーティクルはそれぞれ異なる状態量を有することができる。また、現在の移動体１の状態量に基づいて初期化することもできる。ここで、パーティクルの初期化とは、Ｎ_ｐ個のパーティクルをソフトウェア上で用意するための処理であり、ステップＳ１０２以降の処理を実行するために必要な事前準備である。 The target trajectory generating unit 321 initializes _Np particles (step S102). Here, _Np is an integer equal to or greater than 2. At this time, the _Np particles can each have a different state quantity. Alternatively, the particles can be initialized based on the current state quantity of the moving body 1. Here, the initialization of the particles is a process for preparing _Np particles on software, and is a preparatory step required for executing the processes in and after step S102.

本実施の形態では、パーティクルの状態量Ｐは数式（１）で定義する。また、ｎ個目のパーティクルの状態量をＰ_ｎと表記する。 In this embodiment, the state quantity P of a particle is defined by the following equation (1): The state quantity of the n-th particle is represented as _Pn .

本実施の形態では、全てのパーティクルについて変数の初期値は同じ値とし、ｘ_ｒ＝０、ｙ_ｒ＝０、ｖ_ｘ＝０、ｖ_ｙ＝０とする。また、各パーティクルに対し重みｗを定義し、初期値は全パーティクルで等しく、以下の数式（５）のように設定する。また、時刻ｔを定義し、初期値０を設定する。 In this embodiment, the initial values of variables are the same for all particles, i.e., _xr = 0, _yr = 0, _vx = 0, and _vy = 0. A weight w is defined for each particle, and the initial value is the same for all particles and is set as shown in the following formula (5). Time t is also defined, and the initial value is set to 0.

次に、目標軌道生成部３２１は、事前に考慮すべき状態制約ｈ（ｘ）≧０に基づいて入力サンプリング範囲を抽出する（ステップＳ１０３）。Next, the target trajectory generation unit 321 extracts the input sampling range based on the state constraint h(x)≧0 that should be considered in advance (step S103).

以下、その抽出方法を説明する。まず、状態制約を表す関数ｈ（ｘ）の時間微分を計算する。本実施の形態では、具体的に以下の数式（６）の計算となる。The extraction method is explained below. First, the time derivative of the function h(x) representing the state constraint is calculated. In this embodiment, this is specifically calculated using the following formula (6).

ただし、状態量ｘと入力ｕは、数式（１）で定義され、数式モデルｆは数式（２）で表される。 However, the state quantity x and input u are defined by equation (1), and the mathematical model f is expressed by equation (2).

そして、以下の数式（７）で表される不等式を満たす入力サンプリング範囲でパーティクルを生成する。 Then, particles are generated in the input sampling range that satisfies the inequality expressed by the following equation (7).

本実施の形態では、具体的に以下の数式（８）で表される不等式となる。 In this embodiment, the inequality is specifically expressed by the following equation (8).

ここで、α（ｈ）は拡張クラスκ関数と呼ばれ、単調増加かつα（０）＝０である。例えば、α（ｈ）＝α^・ｈが用いられる。なお、「α^・」は正の定数である。 Here, α(h) is called an extended class κ function, which is monotonically increasing and α(0) = 0. For example, α(h) = α ^· h is used, where "α ^· " is a positive constant.

ある時刻ｔ≧０で状態制約ｈ（ｘ）≧０を満たしている場合に、数式（８）の不等式を満たす入力サンプリングと数式モデルｆによって時間発展する状態量ｘは、時刻ｔ以降もｈ（ｘ）≧０を満たし続ける。理論的な証明は非特許文献１に開示されている。非特許文献１は、移動体が障害物に衝突しない速度範囲の計算方法を開示しており、より具体的には、非特許文献１のII.Ｂ節およびIII.Ｂ節に開示されている。 When the state constraint h(x) ≥ 0 is satisfied at a certain time t ≥ 0, the state quantity x that evolves over time by the input sampling that satisfies the inequality in equation (8) and the mathematical model f continues to satisfy h(x) ≥ 0 even after time t. The theoretical proof is disclosed in Non-Patent Document 1. Non-Patent Document 1 discloses a method for calculating the speed range in which a moving object will not collide with an obstacle, and more specifically, this is disclosed in Sections II.B and III.B of Non-Patent Document 1.

なお、本実施の形態では数式（６）と数式（８）を連続時間で表現しているが、定義した離散時間幅Δｔを用いて離散時間表現とすることもできる。 In this embodiment, equations (6) and (8) are expressed in continuous time, but they can also be expressed in discrete time using the defined discrete time width Δt.

以上のように、状態制約ｈ（ｘ）≧０を数式（８）の条件で事前に考慮した入力サンプリング範囲で、上述したシステムの数式モデルｆにより離散時間幅Δｔ秒後の状態量ｘ^＋を予測する。これにより、制約を考慮したパーティクルの状態予測が可能となる。 As described above, the state quantity x+ after a discrete time width Δt seconds is predicted by the above-mentioned mathematical model f of the system in the input sampling range in which the state constraint h(x)≧ ⁰ is considered in advance under the condition of mathematical formula (8). This makes it possible to predict the state of the particle taking the constraint into account.

数式（８）を満たす入力サンプリング範囲で生成されたパーティクルの状態量Ｐ_ｎは状態制約ｈ（ｘ）≧０を満たす。従って、生成された全てのパーティクルが有効であり、確率密度分布の近似精度を保証できる。ここで、確率密度分布の近似精度とは有効パーティクルの数で定義される。最初に用意したＮ_ｐ個から減った分だけ確率密度分布の近似精度が低下するが、生成された全てのパーティクルが有効であるので、近似精度を保証できる。確率密度分布の近似精度が保証されるので、事後処理が不要になり効率が良い。 The state quantity _Pn of particles generated in the input sampling range that satisfies the formula (8) satisfies the state constraint h(x)≧0. Therefore, all generated particles are valid, and the approximation accuracy of the probability density distribution can be guaranteed. Here, the approximation accuracy of the probability density distribution is defined as the number of valid particles. Although the approximation accuracy of the probability density distribution decreases by the number of valid particles that is reduced from the initially prepared _Np , since all generated particles are valid, the approximation accuracy can be guaranteed. Since the approximation accuracy of the probability density distribution is guaranteed, post-processing is not required, and efficiency is high.

次に、目標軌道生成部３２１は、状態制約に基づいた離散時間幅Δｔ秒後の状態を予測する（ステップＳ１０４）。パーティクルの状態予測は、本実施の形態では、数式（２）の数式モデルを用いる。ここで、ステップＳ１０３で抽出された入力サンプリング範囲で乱数を用いてパーティクルを生成することで、状態制約ｈ（ｘ）≧０に基づいたパーティクル生成ができる。Next, the target trajectory generation unit 321 predicts the state after a discrete time width Δt seconds based on the state constraint (step S104). In this embodiment, the particle state prediction uses the mathematical model of formula (2). Here, particles are generated using random numbers in the input sampling range extracted in step S103, so that particles can be generated based on the state constraint h(x)≧0.

パーティクルの状態量Ｐは予測状態量ｘ^＋と入力サンプリング値である入力ｕを用いて更新し、以下の数式（９）のように表す。 The state quantity P of the particle is updated using the predicted state quantity x ⁺ and the input u which is the input sampling value, and is expressed as the following formula (9).

ここで、パーティクルの状態量ｘ、予測状態量ｘ^＋、入力サンプリング値である入力ｕは全て列ベクトルであり、簡略化のため転置を用いている。 Here, the particle state quantity x, the predicted state quantity x ⁺ , and the input u which is the input sampling value are all column vectors, and transposition is used for simplification.

パーティクルの状態量ＰをＮ_ｐ回計算した後に、各パーティクルの観測値を求める（ステップＳ１０５）。観測変数は動作計画の目標に基づき設計する。動作計画の目標は、移動体１の周辺環境またはユーザの設定により決定する。本実施の形態では、理想的な移動経路の維持と、移動速度の維持と、障害物との距離の保持とを目標とする。これらの目標に基づき、理想的な移動経路との横偏差ｙ_ｄ、移動速度ｖ、障害物との距離ｄに対する観測変数としてφを以下の数式（１０）で表す。 After calculating the state quantity P of the particle _Np times, the observation value of each particle is obtained (step S105). The observation variables are designed based on the goal of the motion plan. The goal of the motion plan is determined by the surrounding environment of the moving body 1 or the user's setting. In this embodiment, the goals are to maintain an ideal movement path, to maintain a movement speed, and to maintain a distance from an obstacle. Based on these goals, φ is expressed as the following equation (10) as an observation variable for the lateral deviation y _d from the ideal movement path, the movement speed v, and the distance d from an obstacle.

ここで、ｅは自然対数を表している。それぞれの値はパーティクルの状態量ｘを用いて表現できるため、観測変数は状態量ｘの関数φ（ｘ）として考えることもできる。以降の議論では、表記の簡単化のためφとして記載する。なお、観測変数としては、理想的な移動経路との横偏差ｙ_ｄ、移動速度ｖ、障害物との距離ｄの少なくとも１つを含む。 Here, e represents the natural logarithm. Since each value can be expressed using the state quantity x of the particle, the observation variable can also be considered as a function φ(x) of the state quantity x. In the following discussion, it is written as φ for simplicity of notation. The observation variables include at least one of the lateral deviation y _d from the ideal movement path, the movement speed v, and the distance d from the obstacle.

次に、各パーティクルの観測値φ、理想観測値φ_ｉとの差から、各パーティクルの重みｗを更新する。ここで、理想観測値φ_ｉとは、仮想的に設計した理想状態にある移動体１に対する観測値であり、走行計画の目標から決定される。従って、移動体１が動作計画の目標を満たしている場合に、移動体１は理想状態となる。本実施の形態では、理想観測値φ_ｉは以下の数式（１１）で表す。 Next, the weight w of each particle is updated based on the difference between the observed value φ of each particle and the ideal observed value φ _i . Here, the ideal observed value φ _i is an observed value for the moving body 1 in a virtually designed ideal state, and is determined based on the target of the travel plan. Therefore, when the moving body 1 satisfies the target of the operation plan, the moving body 1 is in the ideal state. In this embodiment, the ideal observed value φ _i is expressed by the following formula (11).

ここで、それぞれの値は観測変数φに対応する。従って、本実施の形態では、横偏差は０、理想的な移動速度はｖ_ｉ、障害物との距離ｄは大きく保つ、すなわちｄは無限大（∞）に近付けることで数式（１０）の自然対数ｅで表される値は０に近付けることが理想状態である。 Here, each value corresponds to the observation variable φ. Therefore, in this embodiment, the lateral deviation is 0, the ideal moving speed is v _i , and the distance d from the obstacle is kept large, that is, d approaches infinity (∞) so that the value expressed by the natural logarithm e in formula (10) approaches 0, which is the ideal state.

パーティクルフィルタの理論に基づき、各パーティクルの重みｗを更新する。更新は、以下の数式（１２）で表すように、ｎ個目のパーティクルの重みｗ_ｎと尤度γとに比例し、全パーティクルの重みの積算値が１となるようにする。 The weight w of each particle is updated based on the theory of particle filters. The weight w is proportional to the weight wn of the _n -th particle and the likelihood γ, as shown in the following formula (12), so that the integrated value of the weights of all particles becomes 1.

ここで、各パーティクルの尤度γは、予め設定しておいたパーティクルの状態量ｘに関する共分散行列Ｑと観測値φに関する共分散行列Ｒを用いて、以下の数式（１３）を用いてで計算する。Here, the likelihood γ of each particle is calculated using the covariance matrix Q for the particle's state quantity x and the covariance matrix R for the observation value φ, which have been set in advance, using the following formula (13).

ここで、ｄｅｔ演算子は正方行列の行列式を計算することを表し、行列Ｓは、以下の数式（１４）で表される。 Here, the det operator represents the calculation of the determinant of a square matrix, and the matrix S is expressed by the following equation (14).

ただし、行列Ｈは状態量ｘがある値（バーｘ）の場合の観測変数φを状態量ｘで微分した微分係数であり、以下の数式（１５）で定義する。 where matrix H is the differential coefficient obtained by differentiating the observation variable φ with respect to the state quantity x when the state quantity x has a certain value (bar x), and is defined by the following equation (15).

次に、目標軌道生成部３２１は、各パーティクルの重みｗに基づいて、パーティクルのリサンプリングを行う（ステップＳ１０６）。ただし、本実施の形態では、パーティクルの大幅なばらつきを防ぐため、仮想有効パーティクル数Ｎ_ｅｆｆがしきい値Ｎ_ｔｈ以下となる場合にのみリサンプリングを行い、それ以外の場合はこのステップでは何も行わない。ここで、仮想有効パーティクル数Ｎ_ｅｆｆは、以下の数式（１６）を用いて計算する。なお、リサンプリングは毎回実行することもできる。 Next, the target trajectory generating unit 321 resamples the particles based on the weight w of each particle (step S106). However, in this embodiment, in order to prevent a large variation in the particles, resampling is performed only when the virtual effective particle number _Neff is equal to or less than the threshold value _Nth , and otherwise, nothing is performed in this step. Here, the virtual effective particle number _Neff is calculated using the following formula (16). Note that resampling can also be performed every time.

ここで、「仮想有効パーティクル数」としているのは、各パーティクルの重みを用いて仮想的にパーティクル数を計算しているためであり、各パーティクルの重みが等しい場合に、数式（１６）の計算結果がパーティクル数と一致する。 Here, the term "virtual effective particle number" is used because the number of particles is calculated virtually using the weight of each particle, and when the weights of each particle are equal, the calculation result of formula (16) will match the number of particles.

リサンプリング方法としては、通常のパーティクルフィルタと同様に、経験分布関数から等間隔にサンプリングする。リサンプリングを行った場合、数式（５）に基づいて各パーティクルの重みは同等として初期化する。 The resampling method is the same as in normal particle filters, where we sample at equal intervals from the empirical distribution function. When resampling is performed, the weights of each particle are initialized as equal based on formula (5).

次に、目標軌道生成部３２１は、上述の処理で得られたパーティクルの状態量Ｐについて重みｗに基づく加重平均値を計算し、動作計画として状態量ｘと入力ｕを目標軌道生成部３２１に記憶し（ステップＳ１０７）、時刻をｔ＋Δｔとして更新する。Next, the target trajectory generating unit 321 calculates a weighted average value based on the weight w for the state quantity P of the particle obtained by the above-mentioned process, stores the state quantity x and the input u in the target trajectory generating unit 321 as an operation plan (step S107), and updates the time as t + Δt.

次に、目標軌道生成部３２１は、更新された時刻ｔが動作計画の計画対象期間である計画ホライズン値τ_ｈに達したかどうかを判断する（ステップＳ１０８）。ｔ＜τ_ｈの場合（Ｎｏの場合）、ステップＳ１０３以下の処理を繰り返す。一方、ｔ≧τ_ｈの場合（Ｙｅｓの場合）、動作計画として記憶した状態量ｘおよび入力ｕのデータを、目標軌道および目標入力データとして出力し、動作計画生成の演算を終了する。 Next, the target trajectory generating unit 321 judges whether the updated time t has reached the planning horizon value _τh , which is the planning target period of the motion plan (step S108). If t< _τh (No), the process repeats the process from step S103 onward. On the other hand, if t≧ _τh (Yes), the data of the state quantity x and the input u stored as the motion plan are output as the target trajectory and the target input data, and the calculation for generating the motion plan is terminated.

図７は、上述したサンプリング手法による動作計画生成の結果を模式的に示す図である。ここでは簡単にするため、パーティクルの個数をＮ_ｐ＝１０としているが、実際には推定演算される状態数の１０倍を目安にしてサンプリングを実施している。 7 is a diagram showing a schematic diagram of the result of generating an operation plan by the above-mentioned sampling method. For simplicity, the number of particles is set to N _p =10 here, but in practice, sampling is performed with a rough guideline of 10 times the number of states to be estimated.

図７において、推定演算される全パーティクルの初期値はノード３３０を参照し、本実施の形態では、ノード３３０と同じ値として演算を開始する。これらのパーティクルに対して数式（１）～数式（８）を用いて説明した処理を経てそれぞれの状態遷移を予測し、数式（９）を用いてパーティクルの状態量を更新してパーティクルの更新値を得る。これらは、数式モデルの数式（２）に従いながら数式（３）で表現された状態制約ｈ（ｘ）≧０を満たすように、例えば、図７中のノード３３０を初期値とした場合に、パーティクル３３１として示すようなばらつきを持つ。 In Figure 7, the initial values of all particles to be estimated refer to node 330, and in this embodiment, the calculation begins with the same value as node 330. The state transitions of these particles are predicted through the process described using equations (1) to (8), and the state quantities of the particles are updated using equation (9) to obtain updated values of the particles. These have a variation, for example, as shown as particle 331 when node 330 in Figure 7 is set as the initial value, so as to satisfy the state constraint h(x) ≧ 0 expressed in equation (3) while following equation (2) of the mathematical model.

更新された各パーティクル３３１に対して、数式（１０）～数式（１６）を用いて説明した処理を経て、目標軌道と目標入力が周辺環境との関係に応じて重みが計算され、重みに応じたリサンプリングが実行され、例えば、新しいノード３３２のようになる。ここでは、数式（１０）の観測値を計算するために、理想経路４００と、障害物６００（または人５００）の情報を取得している。 For each updated particle 331, the weights are calculated according to the relationship between the target trajectory and the target input and the surrounding environment through the process described using equations (10) to (16), and resampling according to the weights is performed, resulting in, for example, a new node 332. Here, information on the ideal path 400 and the obstacle 600 (or person 500) is acquired to calculate the observed value of equation (10).

＜効果＞
以上説明した動作計画生成部３２０の構成によれば、パーティクルフィルタに代表されるサンプリング手法による動作計画を動的システムへ適用する場合に、状態制約ｈ（ｘ）≧０を事前に考慮した入力サンプリング範囲でパーティクルを生成することで、確率密度分布の精度を保証できる効率の良い動作計画を実施することができる。＜Effects＞
According to the configuration of the motion plan generating unit 320 described above, when a motion plan based on a sampling method represented by a particle filter is applied to a dynamic system, particles are generated within an input sampling range that takes into consideration in advance the state constraint h(x)≧0, so that an efficient motion plan that can guarantee the accuracy of the probability density distribution can be implemented.

状態制約ｈ（ｘ）≧０を事前に考慮しない場合は障害物と衝突すると判定されたパーティクルは、システムの安全性またはシミュレーションの整合性を保つために後処理が実施される。例えば、衝突すると判定されたパーティクルは削除すると設定すると、確率密度分布の精度を保証できない。例えば、後処理として重みの調整をする場合は、動的システムの数式モデルｆとの整合性を保つ必要があり、調整のための計算負荷が発生する可能性がある。 If the state constraint h(x) ≥ 0 is not considered in advance, particles determined to collide with an obstacle are subjected to post-processing to ensure the safety of the system or the consistency of the simulation. For example, if it is set that particles determined to collide are deleted, the accuracy of the probability density distribution cannot be guaranteed. For example, when adjusting weights as post-processing, it is necessary to maintain consistency with the mathematical model f of the dynamic system, and the calculation load for the adjustment may occur.

なお、本実施の形態では、サンプリング手法としてパーティクルフィルタを例示して説明したが、サンプリング手法は例えばモンテカルロ法を技術背景として有する異なる手法とすることもできる。本開示の特徴は入力サンプリング範囲を数式モデルで表現された状態制約に基づいて事前に制限することであり、サンプリング手法自体の形態とは無関係に導入できる。In this embodiment, a particle filter has been described as an example of a sampling method, but the sampling method can also be a different method that has a technical background in the Monte Carlo method, for example. A feature of the present disclosure is that the input sampling range is limited in advance based on state constraints expressed in a mathematical model, and can be introduced regardless of the form of the sampling method itself.

また、数式（３）に示すように状態制約として、障害物との衝突回避を考慮した制約に限定したが、これに限定されない。状態制約として、車線の逸脱を考慮した制約、危険領域への侵入を考慮した制約、急な操舵による転倒を考慮した制約、ロボットアームであれば可動域を考慮した制約、特異姿勢を考慮した制約が挙げられる。 As shown in formula (3), the state constraints are limited to those that take into account collision avoidance with obstacles, but are not limited to this. State constraints include constraints that take into account lane departure, constraints that take into account entry into dangerous areas, constraints that take into account tipping over due to sudden steering, constraints that take into account the range of motion in the case of a robot arm, and constraints that take into account singular postures.

＜実施の形態２＞
次に、本開示に係る実施の形態２の動作計画装置３００について説明する。なお、実施の形態２の動作計画装置３００の構成は、図４に示した実施の形態１の動作計画装置３００の構成と同じである。 <Embodiment 2>
Next, a description will be given of a motion planning apparatus 300 according to a second embodiment of the present disclosure. Note that the configuration of the motion planning apparatus 300 according to the second embodiment is the same as the configuration of the motion planning apparatus 300 according to the first embodiment shown in FIG.

以上説明した実施の形態１の数式モデル（数式（１）、数式（２））と状態制約（数式（３））の関係においては、状態制約を表す関数ｈ（ｘ）の時間に関する一階微分で入力項が現れているが、二階以上の時間微分をすることもできる。 In the relationship between the mathematical model (formula (1) and formula (2)) and the state constraint (formula (3)) of embodiment 1 described above, the input term appears as the first derivative with respect to time of the function h(x) representing the state constraint, but second or higher order time derivatives can also be performed.

例えば、移動体１の状態量ｘをＸＹ座標系の位置および速度とし、入力ｕをＸＹ座標系における加速度として以下の数式（１７）のように再定義する。For example, the state quantity x of the moving body 1 is redefined as its position and velocity in the XY coordinate system, and the input u is redefined as its acceleration in the XY coordinate system, as shown in the following equation (17).

移動体１の数式モデルｆも以下の数式（１８）のように再定義する。 The mathematical model f of moving body 1 is also redefined as follows: (18).

状態制約ｈ（ｘ）≧０は、例えば実施の形態１と同様に障害物との衝突を防ぐものとし、数式（３）のように表す。The state constraint h(x) ≧ 0 is intended to prevent collision with an obstacle, as in embodiment 1, and is expressed as equation (3).

ここで、再定義された移動体の数式モデルｆに基づいて、数式（３）の一階微分と二階微分の時間微分を計算すると、それぞれ以下の数式（１９）と数式（２０）の計算となる。 Now, based on the redefined mathematical model f of the moving body, if we calculate the time derivatives of the first and second derivatives of equation (3), we get the following equations (19) and (20), respectively.

また、以下の数式（２１）のようにベクトル値関数η（ｘ）を定義する。

Also, a vector-valued function η(x) is defined as shown in the following equation (21).

このベクトル値関数η（ｘ）を用いて、状態制約ｈ（ｘ）≧０を事前に考慮するように入力サンプリング範囲を制限するためには、数式（７）の条件を拡張して以下の数式（２２）の条件とする。 To limit the input sampling range using this vector-valued function η(x) so as to take into account in advance the state constraint h(x) ≥ 0, the condition in equation (7) is extended to the condition in equation (22) below.

ここで、Ｋ_ｂ＝［ｋ_ｂ１，ｋ_ｂ２］は全ての要素が正であるベクトルを表す。 Here, K _b =[k _b1 , k _b2 ] represents a vector in which all elements are positive.

数式（２２）で表された入力サンプリング範囲内で乱数に基づいてパーティクルを生成することで、生成されたパーティクルの状態量ｘは状態制約ｈ（ｘ）を満たし続ける。つまり、実施の形態１の結果と同様に無効パーティクルが生成されずに、効率良くサンプリングを実行できる。状態制約ｈ（ｘ）を満たすことに関する数学的な証明は非特許文献２に開示されている。非特許文献２は、移動体が障害物に衝突しない加速度範囲の計算方法を開示しており、より具体的には、非特許文献２のIII節に開示されている。By generating particles based on random numbers within the input sampling range expressed by formula (22), the state quantity x of the generated particles continues to satisfy the state constraint h(x). In other words, as with the results of embodiment 1, invalid particles are not generated, and sampling can be performed efficiently. A mathematical proof of the satisfaction of the state constraint h(x) is disclosed in Non-Patent Document 2. Non-Patent Document 2 discloses a method of calculating the acceleration range in which a moving body does not collide with an obstacle, and more specifically, this is disclosed in Section III of Non-Patent Document 2.

＜実施の形態３＞
次に、本開示に係る実施の形態３の動作計画装置３００について説明する。なお、実施の形態２の動作計画装置３００の構成は、図４に示した実施の形態１の動作計画装置３００の構成と同じである。 <Third embodiment>
Next, a description will be given of a motion planning apparatus 300 according to a third embodiment of the present disclosure. Note that the configuration of the motion planning apparatus 300 according to the second embodiment is the same as the configuration of the motion planning apparatus 300 according to the first embodiment shown in FIG.

以上説明した実施の形態１における入力サンプリング範囲の制限は、数式（７）で制限された入力サンプリング範囲から乱数に基づいて入力値を取得していた。この制限方法について、最初に制限されない入力サンプリング範囲で乱数に基づいて入力値を取得した後に、数式（７）の制約条件を満たすように入力値を修正することもできる。また、修正方法として入力の修正量を評価関数とする最適化問題に基づいた修正方法を用いることができる。In the above-described first embodiment, the input sampling range is restricted by obtaining an input value based on a random number from the input sampling range restricted by formula (7). Regarding this restriction method, it is also possible to first obtain an input value based on a random number in an unrestricted input sampling range, and then modify the input value so as to satisfy the constraint condition of formula (7). In addition, a modification method based on an optimization problem in which the input modification amount is an evaluation function can be used as a modification method.

図８は、実施の形態３における入力値の修正方法を模式的に示す図である。ここでは、実施の形態１の場合を例として示している。 Figure 8 is a diagram showing a schematic diagram of a method for correcting input values in embodiment 3. Here, embodiment 1 is shown as an example.

図８において、制限されない入力サンプリング範囲である元の入力サンプリング範囲８０４で乱数に基づいて入力値を取得すると、例えば、数式（７）を満たす入力値８０１と、数式（７）を満たさない、すなわち違反する入力値８０２について、数式（７）の左辺の正負が変わる境界８０６に基づいて入力値を分けることができる。 In Figure 8, when input values are obtained based on random numbers in the original input sampling range 804, which is an unrestricted input sampling range, the input values can be divided, for example, into input value 801 that satisfies formula (7) and input value 802 that does not satisfy formula (7), i.e., violates formula (7), based on boundary 806 where the positive and negative signs of the left side of formula (7) change.

違反する入力値８０２について、例えば各値について修正量８０５を導入し、元の入力サンプリング範囲８０４と数式（７）で制限された入力サンプリング範囲を満たすように修正後の入力値８０３とする。そして、入力値８０１と修正された入力値８０３によって図６に示したステップＳ１０４以降の処理を実行する。For example, a correction amount 805 is introduced for each of the violating input values 802, and the corrected input value 803 is obtained so as to satisfy the original input sampling range 804 and the input sampling range restricted by formula (7). Then, the processing from step S104 onward shown in FIG. 6 is executed using the input value 801 and the corrected input value 803.

本実施の形態においては、入力サンプリング範囲を制限することで、例えば数式（７）で制限された範囲から乱数に基づいて入力値を取得することが難しい場合に、別の最適化問題によって入力値を乱数に基づいて取得した後に修正することができる。In this embodiment, by limiting the input sampling range, for example, if it is difficult to obtain an input value based on a random number from the range limited by formula (7), the input value can be obtained based on a random number using a different optimization problem and then corrected.

また、乱数に基づいて取得した入力値を数式（７）を満たすように制限するための入力の修正量を評価関数とすることで、設計者が入力サンプリング範囲を設計者の望ましい範囲に設計することができ、入力値の修正量を直接評価できる。 In addition, by using the amount of input correction for restricting the input value obtained based on the random number to satisfy equation (7) as an evaluation function, the designer can design the input sampling range to the designer's desired range and directly evaluate the amount of correction of the input value.

入力の修正量を評価関数とする場合には、例えば入力の修正量の２乗として評価関数を設定することができる。このとき、例えば数式（７）が入力に対して線形であれば２次計画問題の解析解を適用することができるため、計算効率が良くなる。 When the input correction amount is used as the evaluation function, the evaluation function can be set, for example, as the square of the input correction amount. In this case, if, for example, formula (7) is linear with respect to the input, an analytical solution to the quadratic programming problem can be applied, improving calculation efficiency.

＜実施の形態４＞
次に、本開示に係る実施の形態４の動作計画装置３００について説明する。なお、実施の形態４の動作計画装置３００の構成は、図４に示した実施の形態１の動作計画装置３００の構成と同じである。 <Fourth embodiment>
Next, a description will be given of a motion planning apparatus 300 according to a fourth embodiment of the present disclosure. Note that the configuration of the motion planning apparatus 300 according to the fourth embodiment is the same as the configuration of the motion planning apparatus 300 according to the first embodiment shown in FIG.

実施の形態１または実施の形態３で説明した入力サンプリング範囲の制限は、乱数に基づいて得られた入力値が全て数式（７）を満たすことを要求していたが、実施の形態４では、事前に状態制約ｈ（ｘ）の情報に基づいて入力サンプリング範囲を調整することで、乱数に基づいて取得される入力値の一部または全部が数式（７）を満たさないことも許容する。The input sampling range restriction described in embodiment 1 or embodiment 3 required that all input values obtained based on random numbers satisfy formula (7), but in embodiment 4, by adjusting the input sampling range in advance based on information about the state constraint h(x), it is also acceptable for some or all of the input values obtained based on random numbers to not satisfy formula (7).

すなわち、数式（７）を満たす入力値が一定数以上となるように入力サンプリング範囲を確率密度分布関数によって近似的に制限し、範囲を調整する。パーティクルフィルタの状態推定演算に確率密度分布の情報を適用できることで、計算負荷を低減できる。 In other words, the input sampling range is approximately restricted by the probability density distribution function and the range is adjusted so that the number of input values that satisfy formula (7) is equal to or greater than a certain number. By being able to apply information about the probability density distribution to the state estimation calculation of the particle filter, the computational load can be reduced.

例えば、入力サンプリング範囲を対称的に制限する場合はガウス分布とし、ガウス分布を特徴づける平均値と標準偏差を入力サンプリング範囲を調整するパラメータとする。または、最適化問題に基づいてガウス分布を特徴づける平均値と標準偏差の両方または片方を入力サンプリング範囲を調整するパラメータとする。なお、入力サンプリング範囲を特定の方向に制限したい場合にはガンマ分布とする。 For example, if you want to limit the input sampling range symmetrically, use a Gaussian distribution, and use the mean and standard deviation that characterize the Gaussian distribution as parameters to adjust the input sampling range. Alternatively, based on an optimization problem, use both the mean and/or standard deviation that characterize the Gaussian distribution as parameters to adjust the input sampling range. Note that if you want to limit the input sampling range in a specific direction, use a gamma distribution.

パーティクルフィルタの状態推定演算にガウス分布の情報を適用できることで、計算負荷を低減できる。また、入力サンプリング範囲をガウス分布で規定することで、範囲を調整するパラメータを平均値と標準偏差に絞ることができ、計算負荷を低減できる。 The computational load can be reduced by applying Gaussian distribution information to the state estimation calculation of the particle filter. In addition, by specifying the input sampling range with a Gaussian distribution, the parameters for adjusting the range can be narrowed down to the mean and standard deviation, thereby reducing the computational load.

また、ガウス分布による範囲の調整を最適化問題に基づいて調整することで、機械的に調整することができる。 In addition, the range adjustment using the Gaussian distribution can be adjusted mechanically by adjusting it based on an optimization problem.

以下の説明では、入力サンプリング範囲をガウス分布として制限するものとし、図９に制限方法の例を模式的に示している。ここでは、実施の形態１の場合を例として示している。In the following explanation, the input sampling range is limited to a Gaussian distribution, and an example of the limiting method is shown in FIG. 9. Here, the case of embodiment 1 is shown as an example.

図９において、調整された入力サンプリング範囲である元の入力サンプリング範囲をガウス分布として定め、平均値８１１と標準偏差８１２で特徴づける。このとき、乱数に基づいて入力値を取得すると数式（７）を違反する値が多く抽出される。例えば、平均値８１１が数式（７）を違反する場合は、乱数に基づいて取得される入力値が数式（７）を満たす確率は０．５未満である。 In Figure 9, the original input sampling range, which is the adjusted input sampling range, is defined as a Gaussian distribution and characterized by a mean value 811 and a standard deviation 812. In this case, when an input value is obtained based on a random number, many values that violate formula (7) are extracted. For example, if the mean value 811 violates formula (7), the probability that an input value obtained based on a random number satisfies formula (7) is less than 0.5.

ここで、数式（７）の左辺の正負が変わる境界８０６に基づき、例えば平均値８１１を修正量８１３で修正して修正後平均値８１５とし、標準偏差８１２を修正量８１４で修正して修正後標準偏差８１６とする。なお、修正方法としては、実施の形態３と同様に入力の修正量を評価関数とする最適化問題に基づいた修正方法を用いることができる。Here, based on the boundary 806 where the sign of the left side of formula (7) changes, for example, the mean value 811 is corrected by the correction amount 813 to obtain the corrected mean value 815, and the standard deviation 812 is corrected by the correction amount 814 to obtain the corrected standard deviation 816. As a correction method, a correction method based on an optimization problem in which the input correction amount is the evaluation function can be used, as in the third embodiment.

これらの修正後平均値８１５と修正後標準偏差８１６で特徴づけられたガウス分布から乱数に基づいて取得される入力値は、元の入力サンプリング範囲から取得される入力値と比較して、数式（７）を満たす値が多く抽出される。この入力サンプリング値を用いて、図６のステップＳ１０４以降の処理を実行する。例えば、平均値８１１を修正量８１３で修正して修正後平均値８１５とすると、乱数に基づいて取得される入力値が数式（７）を満たす確率は０．５以上として調整できる。 Compared to input values obtained from the original input sampling range, input values obtained based on random numbers from a Gaussian distribution characterized by these modified mean value 815 and modified standard deviation 816 have more extracted values that satisfy formula (7). Using these input sampling values, the processing from step S104 onward in FIG. 6 is performed. For example, if the mean value 811 is modified by the modification amount 813 to obtain the modified mean value 815, the probability that the input value obtained based on random numbers satisfies formula (7) can be adjusted to 0.5 or more.

入力サンプリング範囲を予め設定した確率密度分布関数で近似することで、確率密度分布を特徴づけるパラメータのみを修正するだけで済む。従って、計算を単純化することができ、計算負荷を低減できる。 By approximating the input sampling range with a preset probability density distribution function, it is only necessary to modify the parameters that characterize the probability density distribution. This simplifies the calculations and reduces the computational load.

＜ハードウェア構成＞
なお、以上説明した実施の形態１～４の動作計画装置３００の各構成要素は、コンピュータを用いて構成することができ、コンピュータがプログラムを実行することで実現される。すなわち、例えば、図１０に示す処理回路１０００により実現される。処理回路１０００には、ＣＰＵ（Central Processing Unit）、ＤＳＰ（Digital Signal Processor）などのプロセッサが適用され、記憶装置に格納されるプログラムを実行することで各部の機能が実現される。なお、目標軌道記憶部３２２はコンピュータに含まれる記憶装置により実現される。 <Hardware Configuration>
Each component of the motion planning apparatus 300 according to the first to fourth embodiments described above can be configured using a computer, and is realized by the computer executing a program. That is, for example, it is realized by a processing circuit 1000 shown in FIG. 10. A processor such as a CPU (Central Processing Unit) or a DSP (Digital Signal Processor) is applied to the processing circuit 1000, and the function of each part is realized by executing a program stored in a storage device. It is noted that the target trajectory storage unit 322 is realized by a storage device included in the computer.

処理回路１０００には、専用のハードウェアが適用されても良い。処理回路１０００が専用のハードウェアである場合、処理回路１０００は、例えば、単一回路、複合回路、プログラム化したプロセッサ、並列プログラム化したプロセッサ、ＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field-Programmable Gate Array）、またはこれらを組み合わせたもの等が該当する。Dedicated hardware may be applied to the processing circuit 1000. When the processing circuit 1000 is dedicated hardware, the processing circuit 1000 may be, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a combination of these.

動作計画装置３００は、構成要素の各々の機能が個別の処理回路で実現することもでき、それらの機能がまとめて１つの処理回路で実現することもできる。The motion planning device 300 may have each of its component functions realized by separate processing circuits, or the functions may be realized together by a single processing circuit.

また、図１１には、処理回路１０００がプロセッサを用いて構成されている場合におけるハードウェア構成を示している。この場合、動作計画装置３００の各部の機能は、ソフトウェア等（ソフトウェア、ファームウェア、またはソフトウェアとファームウェア）との組み合わせにより実現される。ソフトウェア等はプログラムとして記述され、メモリ１００２に格納される。処理回路１０００として機能するプロセッサ１００１は、メモリ１００２（記憶装置）に記憶されたプログラムを読み出して実行することにより、各部の機能を実現する。すなわち、このプログラムは、動作計画装置３００の構成要素の動作の手順および方法をコンピュータに実行させるものであると言える。 Figure 11 also shows the hardware configuration when the processing circuit 1000 is configured using a processor. In this case, the functions of each part of the motion planning device 300 are realized by a combination of software, etc. (software, firmware, or software and firmware). The software, etc. are written as a program and stored in the memory 1002. The processor 1001 functioning as the processing circuit 1000 realizes the functions of each part by reading and executing the program stored in the memory 1002 (storage device). In other words, it can be said that this program causes the computer to execute the procedure and method of operation of the components of the motion planning device 300.

ここで、メモリ１００２は、例えば、ＲＡＭ、ＲＯＭ、フラッシュメモリー、ＥＰＲＯＭ（Erasable Programmable Read Only Memory）、ＥＥＰＲＯＭ（Electrically Erasable Programmable Read Only Memory）等の、不揮発性または揮発性の半導体メモリ、ＨＤＤ（Hard Disk Drive）、磁気ディスク、フレキシブルディスク、光ディスク、コンパクトディスク、ミニディスク、ＤＶＤ（Digital Versatile Disc）およびそのドライブ装置等、または、今後使用されるあらゆる記憶媒体とすることができる。Here, memory 1002 may be, for example, a non-volatile or volatile semiconductor memory such as RAM, ROM, flash memory, EPROM (Erasable Programmable Read Only Memory), EEPROM (Electrically Erasable Programmable Read Only Memory), etc., a HDD (Hard Disk Drive), a magnetic disk, a flexible disk, an optical disk, a compact disk, a mini disk, a DVD (Digital Versatile Disc) and its drive device, etc., or any storage medium to be used in the future.

以上、動作計画装置３００の各構成要素の機能が、ハードウェアおよびソフトウェア等の何れか一方で実現される構成について説明した。しかしこれに限ったものではなく、動作計画装置３００の一部の構成要素を専用のハードウェアで実現し、別の一部の構成要素をソフトウェア等で実現することもできる。例えば、一部の構成要素については専用のハードウェアとしての処理回路１０００でその機能を実現し、他の一部の構成要素についてはプロセッサ１００１としての処理回路１０００がメモリ１００２に格納されたプログラムを読み出して実行することによってその機能を実現することが可能である。 A configuration has been described above in which the functions of each component of the motion planning device 300 are realized by either hardware or software, etc. However, this is not limited to this, and some components of the motion planning device 300 can be realized by dedicated hardware, and other components can be realized by software, etc. For example, it is possible to realize the functions of some components by the processing circuit 1000 as dedicated hardware, and realize the functions of other components by the processing circuit 1000 as the processor 1001 reading and executing a program stored in the memory 1002.

以上のように、動作計画装置３００は、ハードウェア、ソフトウェア等、またはこれらの組み合わせによって、上述の各機能を実現することができる。As described above, the motion planning device 300 can realize each of the above-mentioned functions through hardware, software, etc., or a combination of these.

本開示は詳細に説明されたが、上記した説明は、全ての局面において、例示であって、本開示がそれに限定されるものではない。例示されていない無数の変形例が、本開示の範囲から外れることなく想定され得るものと解される。Although the present disclosure has been described in detail, the above description is illustrative in all respects and does not limit the present disclosure. It is understood that countless variations not illustrated can be envisioned without departing from the scope of the present disclosure.

なお、本開示は、その開示の範囲内において、各実施の形態を自由に組み合わせたり、各実施の形態を適宜、変形、省略することが可能である。 In addition, within the scope of this disclosure, it is possible to freely combine the various embodiments, and to modify or omit each embodiment as appropriate.

Claims

A motion planning device for a dynamic system using a sampling method, comprising:
a behavior planning unit that outputs a mathematical model that expresses a motion of the dynamic system based on a goal to be achieved by the dynamic system and a state constraint that should be considered in advance of the dynamic system;
a motion plan generation unit that limits an input sampling range in advance based on the mathematical model and the state constraint, and generates a motion plan for the dynamic system by a state estimation calculation in the limited input sampling range.

The operation plan generating unit
The motion planning device according to claim 1 , wherein the input sampling range is restricted by obtaining an input value based on a random number in the unrestricted input sampling range and then correcting the input value so as to satisfy the state constraint.

The operation plan generating unit
The motion planning device according to claim 1 , wherein the input sampling range is adjusted so that a certain number of input values satisfy the input sampling range.

The operation plan generating unit
4. The motion planning device according to claim 2, wherein the input value is corrected based on an optimization problem.

The operation plan generating unit
5. The motion planning apparatus according to claim 4, wherein a correction amount when correcting said input value is set as an evaluation function of said optimization problem.

The motion planning device according to claim 1, wherein the mathematical model is linear with respect to the input.

The motion planning device according to claim 1, wherein the state constraint is a scalar-valued constraint for input sampling by first-order differentiation.

The operation plan generating unit
4. The motion planning apparatus according to claim 3, wherein the input sampling range is defined by a probability density distribution such that the input value that satisfies the input sampling range is equal to or greater than the certain number.

The operation plan generating unit
9. The motion planning apparatus according to claim 8, wherein the probability density distribution is a Gaussian distribution, and a mean value and a standard deviation characterizing the Gaussian distribution are used as parameters for adjusting the input sampling range.

The operation plan generating unit
9. The motion planning apparatus according to claim 8, wherein the probability density distribution is a Gaussian distribution, and both or one of a mean value and a standard deviation characterizing the Gaussian distribution based on an optimization problem is used as a parameter for adjusting the input sampling range.

The sampling method includes:
2. The motion planning apparatus according to claim 1, which is a particle filter that approximates a probability density distribution of a state by a plurality of particles.

The motion planning device according to any one of claims 1 to 3, wherein the dynamic system is a moving body.