JP2019102066A

JP2019102066A - Travel route estimation device, travel route estimation method, and program

Info

Publication number: JP2019102066A
Application number: JP2018195803A
Authority: JP
Inventors: 達史松林; Tatsufumi Matsubayashi; 寛清武; Hiroshi Kiyotake; 匡宏幸島; Masahiro Kojima; 浩之戸田; Hiroyuki Toda; 琢馬大塚; Takuma Otsuka; 清水　仁; Hitoshi Shimizu; 清水　　仁; 上田　修功; Shuko Ueda; 修功上田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-12-01
Filing date: 2018-10-17
Publication date: 2019-06-24
Anticipated expiration: 2038-10-17
Also published as: JP7206784B2

Abstract

To efficiently estimate an action route of an agent so as to reduce an observation error even under such a situation that a time delay occurs.SOLUTION: A simulator execution unit 141 calculates, based on a simulation result, an objective function indicating an observation error between a calculated value for an agent regarding each of observation points in a predetermined number of time zones including an estimation object time zone and a time zone subsequent thereto and an observation value in an actual environment. A next input parameter determination unit 142 determines a next input parameter on the basis of an in input parameter and a calculation result of the objective function. Calculation and determination are repeated, so that an optimization control unit 143 obtains an input parameter optimizing the objective function.SELECTED DRAWING: Figure 2

Description

本発明は、移動経路推定装置、移動経路推定方法、およびプログラムに関し、特に、時間帯ごとに得られたある地点での観測人数に基づいて、観測に合うようなエージェントの移動経路の推定を行う移動経路推定装置、移動経路推定方法、およびプログラムに関する。 The present invention relates to a movement path estimation device, a movement path estimation method, and a program, and in particular, performs estimation of a movement path of an agent matching an observation based on the number of people observed at a certain point obtained for each time zone. The present invention relates to a movement path estimation device, a movement path estimation method, and a program.

大規模なイベントなどで起こりうる混雑に対応するため、マルチエージェントシミュレータ（ＭＡＳ）を用いた人流の把握や制御策の策定が行われている。より正確な人流の把握や効果的な制御策の策定のためには、精度の高いシミュレーションが必要となる。そのためには、観測したデータをもとに、その観測データを再現するパラメータを決定する技術が必要不可欠となる。 In order to cope with the congestion that may occur in a large-scale event etc., grasping of the flow of people using a multi agent simulator (MAS) and formulation of control measures are being carried out. Highly accurate simulations are required for more accurate understanding of human flow and development of effective control measures. For that purpose, the technique which determines the parameter which reproduces the observation data based on the observation data becomes indispensable.

観測データとは、時間および空間的にある粒度で計測し、集計化されたデータを指す。具体的には、図９に示すように観測の時間幅を

としたときに、ある時刻

から時刻

までの時間帯

にある観測地点ｋで観測された通行人数

が得られる。 Observational data refers to data that is measured and aggregated at a certain granularity in time and space. Specifically, as shown in FIG. 9, the observation time width is

When the time is

From to time

Time zone

Number of people observed at observation point k at

Is obtained.

また、ＭＡＳとは、図１０に示すように、エージェントに対して、発生時刻、出発地、目的地、そして通る経路を指定すると実行され、各エージェントの通過ノードと通過時間を出力するものとする。パラメータとは、図１１に示すように、ある時間帯Ｔ_ｊに経路Ｒ_ｉを通るエージェント数を並べたベクトル

を、全時間帯分つないだＩ×Ｊ次元ベクトル

である。 In addition, as shown in FIG. 10, MAS is executed when an occurrence time, a departure place, a destination, and a route are specified to the agent, and the transit node and transit time of each agent are output. . The parameter is a vector in which the number of agents passing the route R _i is arranged in a certain time zone T _j as shown in FIG.

The whole time zone is connected I × J dimensional vector

It is.

ＭＡＳの出力を集計することで得られる観測値

と、実際の観測値

との誤差を最小化するという問題は、ブラックボックス関数最適化の問題と捉えることができ、最適化手法の従来技術としてベイズ最適化（非特許文献１）がある。 Observations obtained by aggregating the output of MAS

And the actual observed values

The problem of minimizing the error with the above can be regarded as the problem of black box function optimization, and there is Bayesian optimization (Non-Patent Document 1) as a prior art of the optimization method.

最適化手法は、探索すべきパラメータと目的関数をあらかじめ設定し、パラメータに対する目的関数値を逐次的に得ることで、目的関数を近似し、最小値を与えるパラメータを探索するアプローチである。 The optimization method is an approach of approximating an objective function and searching for a parameter giving a minimum value by setting in advance a parameter to be searched and an objective function and sequentially obtaining an objective function value for the parameter.

今回の問題に対して、入力パラメータと目的関数の決め方により二つのナイーブな設定が考えられる。それぞれのイメージ図を図１２、図１３に示す。 For this problem, two naive settings can be considered depending on how to determine the input parameters and the objective function. The respective image diagrams are shown in FIG. 12 and FIG.

図１２に示す一つ目のナイーブな設定（設定１）は、入力パラメータを“全時間帯の各移動経路を通るエージェント数

を要素とするＩ×Ｊ次元ベクトル

”、目的関数を“全時間帯、各観測地点ｋにおける観測誤差”と設定するものである。ここで、時間帯

から時間帯

までの誤差を下記式（１）であらわすとすると、設定１では下記式（２）の最適化問題を解くことでパラメータ

を求める。 The first naive setting (setting 1) shown in FIG. 12 sets the input parameter to “the number of agents passing each movement route in the entire time zone.

I × J dimensional vector whose element is

The objective function is set as "all time zone, observation error at each observation point k". Here, the time zone

From time zone

Assuming that the error up to the point is expressed by the following equation (1), in setting 1, the parameter is solved by solving the optimization problem of the following equation (2)

Ask for

図１３に示す二つ目のナイーブな設定（以下、設定２）は、入力パラメータを“一つの時間帯Ｔ_ｊに各ルートＲ_ｉを通るエージェント数

を要素とするＩ次元ベクトル

”、目的関数を“一つの時間帯Ｔ_ｊ、各観測地点ｋにおける観測誤差”と設定するものである。 The second naive setting (hereinafter referred to as setting 2) shown in FIG. 13 is that the input parameter is “the number of agents passing each route R _i in one time zone T _j

I-dimensional vector whose element is

The objective function is set as “one time zone T _j , observation error at each observation point k”.

つまり、設定２では、下記式（３）の最適化問題を解くことで、パラメータ

を求める。 That is, in setting 2, the parameters are solved by solving the optimization problem of the following equation (3)

Ask for

J.Snoek, H.Larochelle, and R.P.Adams. Practical Bayesian optimization of machine learning algorithms. In Advances in Neural Information Processing Systems (NIPS), 2012.J.Snoek, H. Larochelle, and R.P. Adams. Practical Bayesian optimization of machine learning algorithms. In Advances in Neural Information Processing Systems (NIPS), 2012. Nello Cristianini, John Shawa-Taylor, 大北剛(訳)：カーネル法によるパターン解析(2010).Nello Cristianini, John Shawa-Taylor, Takeshi Ohkita (Translation): Kernel pattern analysis (2010).

設定１では、全時刻・全ルートを通るエージェントの人数を入力値とすることでベイズ最適化を行う手法である。しかしながら、この定式化では、パラメータの次元数が大きくなりすぎてしまい、目的関数の近似精度が悪化してしまう。そのため、効率的な解の探索が困難となってしまう。 In setting 1, Bayesian optimization is performed by using the number of agents passing all the time and all routes as input values. However, in this formulation, the dimensionality of the parameters becomes too large, and the approximation accuracy of the objective function deteriorates. Therefore, it becomes difficult to search for an efficient solution.

一方、設定２では、入力次元数が移動経路数となるため、次元数の問題は生じない。この設定は、エージェントが発生時間帯内で観測地点に到着する場合は自然である。なぜなら、同時間帯内に到着するということは、各時刻に発生したエージェントは同時間帯の観測誤差にしか影響を与えないからである。このような例として、小規模な会場におけるブース間の移動などが考えられる。 On the other hand, in setting 2, since the number of input dimensions is the number of movement paths, the problem of the number of dimensions does not occur. This setting is natural when the agent arrives at the observation point within the generation time zone. The reason is that arriving in the same time zone means that agents generated at each time affect only the observation error of the same time zone. As such an example, movement between booths in a small scale venue can be considered.

しかしながら、大規模なイベントを考えると、発生時刻と同時間帯内の到着を仮定することが出来るとは限らない。例えば、図１４に示すように８：０２に出発２で発生したエージェントが目的地１２へと向かう場合を考える。この時、観測間隔が５分だとすると、観測地点（３）では、エージェントは同時間内に観測されるものの、観測地点（１０）では発生から観測まで時間がかかってしまい、翌時間帯に観測されてしまう。この例のように、観測地点が発生場所と離れている場合、エージェントの発生時間帯と観測時間帯にはズレ（時間遅れ）が発生する。つまり、エージェントが影響を与える観測誤差はいくつかの時間帯にまたがっていることを考慮する必要がある。このような場合に従来技術では、適応困難であるという問題が存在した。 However, considering a large-scale event, it is not always possible to assume arrival within the same time zone as the occurrence time. For example, as shown in FIG. 14, consider a case where an agent generated at departure 2 at 8:02 goes to the destination 12. At this time, if the observation interval is 5 minutes, although agents are observed within the same time at observation point (3), it takes time from generation to observation at observation point (10), and it is observed in the next time zone It will As in this example, when the observation point is separated from the occurrence point, a gap (time delay) occurs between the occurrence time zone of the agent and the observation time zone. In other words, it is necessary to take into account that the observation error affected by the agent spans several time zones. In such a case, in the prior art, there is a problem that adaptation is difficult.

本発明は上記の点に鑑みてなされたものであり、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を効率的に推定することができる移動経路推定装置、移動経路推定方法、およびプログラムを提供することを目的とする。 The present invention has been made in view of the above points, and it is possible to efficiently estimate an agent's action route that reduces observation error even under a situation where time delay occurs. An object of the present invention is to provide an apparatus, a movement path estimation method, and a program.

本発明に係る移動経路推定装置は、実環境における複数の経路の各々についての、推定対象時間帯での前記経路を通過するエージェントの数を要素とする入力パラメータに基づいて、各時間帯において複数のエージェントが移動するシミュレーションを実行し、前記シミュレーションの結果から、推定対象時間帯及び前記推定対象時間帯の次の時間帯を含む所定数の時間帯において、観測地点の各々についての前記エージェントに関する計算値と、実環境での観測値との観測誤差を表す目的関数を計算するシミュレータ実行部と、前記入力パラメータと、前記目的関数の計算結果とに基づいて、次の入力パラメータを決定する次入力パラメータ決定部と、予め定めた繰り返し条件を満たすまで、前記シミュレータ実行部による実行と、前記次入力パラメータ決定部による決定とを繰り返させ、前記目的関数を最適化する入力パラメータを求める最適化制御部と、を含んで構成される。 The moving path estimation apparatus according to the present invention is configured to calculate a plurality of moving paths in each time zone based on input parameters having the number of agents passing through the paths in the estimation target time zone for each of a plurality of paths in the real environment. The simulation of the movement of the agent is executed, and from the result of the simulation, the calculation regarding the agent for each of the observation points in a predetermined number of time zones including the estimation target time zone and the next time zone of the estimation target time zone A simulator execution unit that calculates an objective function representing an observation error between a value and an observation value in a real environment, a next input that determines a next input parameter based on the input parameter and a calculation result of the objective function Execution by the simulator execution unit until the parameter determination unit and the predetermined repetition condition are satisfied; It was repeated a determination by the force parameter determination unit configured to include a, and optimization controller for determining an input parameter to optimize the objective function.

また、本発明に係る移動経路推定方法は、シミュレータ実行部が、実環境における複数の経路の各々についての、推定対象時間帯での前記経路を通過するエージェントの数を要素とする入力パラメータに基づいて、各時間帯において複数のエージェントが移動するシミュレーションを実行し、前記シミュレーションの結果から、推定対象時間帯及び前記推定対象時間帯の次の時間帯を含む所定数の時間帯において、観測地点の各々についての前記エージェントに関する計算値と、実環境での観測値との観測誤差を表す目的関数を計算するステップと、次入力パラメータ決定部が、前記入力パラメータと、前記目的関数の計算結果とに基づいて、次の入力パラメータを決定するステップと、最適化制御部が、予め定めた繰り返し条件を満たすまで、前記シミュレータ実行部による実行と、前記次入力パラメータ決定部による決定とを繰り返させ、前記目的関数を最適化する入力パラメータを求めるステップと、を含む。 Further, in the movement path estimation method according to the present invention, the simulator execution unit is based on an input parameter whose element is the number of agents passing through the path in the estimation target time zone for each of a plurality of paths in the real environment. Simulation is executed in each time zone, and from the result of the simulation, the observation point is observed in a predetermined number of time zones including the estimation target time zone and the next time zone of the estimation target time zone. Calculating an objective function representing an observation error between the calculated value of each of the agents and the observed value in the real environment, the next input parameter determination unit calculating the input parameter, and the calculation result of the objective function Determining the next input parameter, and the optimization control unit satisfies the predetermined repetitive condition. And execution by the simulator execution unit causes repeated and decision by the next input parameter determination unit includes, a step of determining the input parameters to optimize the objective function.

本発明に係る移動経路推定装置及び移動経路推定方法によれば、シミュレータ実行部が、実環境における複数の経路の各々についての、推定対象時間帯での前記経路を通過するエージェントの数を要素とする入力パラメータに基づいて、各時間帯において複数のエージェントが移動するシミュレーションを実行し、前記シミュレーションの結果から、推定対象時間帯及び前記推定対象時間帯の次の時間帯を含む所定数の時間帯において、観測地点の各々についての前記エージェントに関する計算値と、実環境での観測値との観測誤差を表す目的関数を計算する。 According to the movement path estimation device and the movement path estimation method according to the present invention, the simulator execution unit sets the number of agents passing through the path in the estimation target time zone for each of the plurality of paths in the real environment as an element Based on the input parameters, a simulation in which a plurality of agents move in each time zone is executed, and from the result of the simulation, a predetermined number of time zones including the estimated time zone and the next time zone of the estimated time zone And calculate an objective function representing an observation error between the calculated value of the agent for each of the observation points and the observation value in the real environment.

そして、次入力パラメータ決定部が、前記入力パラメータと、前記目的関数の計算結果とに基づいて、次の入力パラメータを決定し、最適化制御部が、予め定めた繰り返し条件を満たすまで、前記シミュレータ実行部による実行と、前記次入力パラメータ決定部による決定とを繰り返させ、前記目的関数を最適化する入力パラメータを求める。 Then, the next input parameter determination unit determines the next input parameter based on the input parameter and the calculation result of the objective function, and the simulator controls the simulator until the optimization control unit satisfies a predetermined repetition condition. The execution by the execution unit and the determination by the next input parameter determination unit are repeated to obtain an input parameter for optimizing the objective function.

このように、シミュレーションの結果から、推定対象時間帯及び推定対象時間帯の次の時間帯を含む所定数の時間帯において、観測地点の各々についてのエージェントに関する計算値と、実環境での観測値との観測誤差を表す目的関数を計算することと、入力パラメータと、目的関数の計算結果とに基づいて、次の入力パラメータを決定することとを繰り返し、目的関数を最適化する入力パラメータを求めることにより、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を効率的に推定することができる。 Thus, from the simulation results, in the predetermined number of time zones including the estimation target time zone and the next time zone of the estimation target time zone, the calculated values regarding the agent for each of the observation points and the observation values in the real environment Calculating the objective function representing the observation error with the above, determining the next input parameter based on the input parameter and the calculation result of the objective function, and finding the input parameter for optimizing the objective function Thus, even under a situation where time delay occurs, it is possible to efficiently estimate the agent's action path that reduces the observation error.

また、本発明に係る移動経路推定装置は、全ての時間帯の各々を、前記推定対象時間帯として、前記最適化制御部により前記入力パラメータを求めることを繰り返させる反復制御部を更に含み、前記反復制御部は、予め定めた繰り返し条件を満たすまで、全ての時間帯の各々を、前記推定対象時間帯として、前記最適化制御部により前記入力パラメータを繰り返し求めることを、繰り返し実行させることができる。 The movement path estimation device according to the present invention further includes an iterative control unit that causes the optimization control unit to repeatedly obtain the input parameter, with each of all the time zones as the estimation target time zone, The iterative control unit can repeatedly execute repeatedly obtaining the input parameter by the optimization control unit as each of the time zones to be estimated, until the predetermined repetitive condition is satisfied. .

このように、予め定めた繰り返し条件を満たすまで、全ての時間帯の各々を、推定対象時間帯として、入力パラメータを繰り返し求めることを、全ての時間帯の各々を、推定対象時間帯として、繰り返し実行させることにより、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を精度良く推定することができる。 In this way, it is repeatedly determined that the input parameters are repeatedly determined using each of all time zones as an estimation target time zone until each of the all time zones is an estimation target time zone until the predetermined repetition condition is satisfied. By performing the execution, even under a situation where a time delay occurs, it is possible to accurately estimate the agent's action path that reduces the observation error.

また、本発明に係る移動経路推定装置における前記シミュレータ実行部は、全ての時間帯のうち、前記推定対象時間帯の前の所定数の時間帯、前記推定対象時間帯、及び前記推定対象時間帯の次の時間帯を含む所定数の時間帯において複数のエージェントが移動する前記シミュレーションを実行することができる。 Further, the simulator execution unit in the movement path estimation device according to the present invention may include, among all the time zones, a predetermined number of time zones before the estimation target time zone, the estimation target time zone, and the estimation target time zone. The simulation may be performed in which a plurality of agents move in a predetermined number of time zones including the next time zone of.

このように、全ての時間帯のうち、推定対象時間帯の前の所定数の時間帯、推定対象時間帯、及び推定対象時間帯の次の時間帯を含む所定数の時間帯において複数のエージェントが移動するシミュレーションを実行することにより、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を高速に推定することができる。 Thus, among all the time zones, a plurality of agents in a predetermined number of time zones including the predetermined number of time zones before the estimation target time zone, the estimation target time zone, and the next time zone of the estimation target time zone By executing a simulation in which moves, even under a situation where a time delay occurs, it is possible to quickly estimate the agent's action path that reduces the observation error.

本発明に係るプログラムは、上記の移動経路推定装置の各部として機能させるためのプログラムである。 A program according to the present invention is a program for functioning as each part of the above movement path estimation apparatus.

本発明の移動経路推定装置、移動経路推定方法、およびプログラムによれば、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を効率的に推定することができる。 According to the movement path estimation device, the movement path estimation method, and the program of the present invention, it is possible to efficiently estimate the action path of the agent which reduces the observation error even under the situation where time delay occurs. it can.

本発明の第１の実施の形態の概要を示すイメージ図である。It is an image figure showing an outline of a 1st embodiment of the present invention. 本発明の第１の実施の形態に係る移動経路推定装置の構成を示すブロック図である。BRIEF DESCRIPTION OF THE DRAWINGS It is a block diagram which shows the structure of the movement path | route estimation apparatus based on the 1st Embodiment of this invention. 本発明の第１の実施の形態に係る移動経路推定装置の移動経路推定処理ルーチンを示すフローチャートである。It is a flowchart which shows the movement path | route estimation processing routine of the movement path | route estimation apparatus based on the 1st Embodiment of this invention. 本発明の第１の実施の形態に係る移動経路推定装置の最適化処理ルーチンを示すフローチャートである。It is a flowchart which shows the optimization process routine of the movement path | route estimation apparatus based on the 1st Embodiment of this invention. 本発明の第２の実施の形態の概要を示すイメージ図である。It is an image figure which shows the outline | summary of the 2nd Embodiment of this invention. 本発明の第２の実施の形態に係る移動経路推定装置の構成を示すブロック図である。It is a block diagram which shows the structure of the movement path | route estimation apparatus based on the 2nd Embodiment of this invention. 本発明の第２の実施の形態に係る移動経路推定装置の移動経路推定処理ルーチンを示すフローチャートである。It is a flowchart which shows the movement path | route estimation processing routine of the movement path | route estimation apparatus based on the 2nd Embodiment of this invention. 本発明の第３の実施の形態の概要を示すイメージ図である。It is an image figure which shows the outline | summary of the 3rd Embodiment of this invention. 道路ネットワークと観測データの一例である。It is an example of a road network and observation data. ＭＡＳの入出力の一例である。It is an example of input / output of MAS. 入力パラメータの一例である。It is an example of an input parameter. 従来のナイーブな入出力設定の一例である（設定１）。This is an example of a conventional naive input / output setting (setting 1). 従来のナイーブな入出力設定の一例である（設定２）。This is an example of a conventional naive input / output setting (setting 2). 時間遅れのイメージ図である。It is an image figure of time delay.

以下、本発明の実施の形態について図面を用いて説明する。 Hereinafter, embodiments of the present invention will be described using the drawings.

＜本発明の第１の実施の形態に係る移動経路推定装置の概要＞
まず、本発明の第１の実施形態の概要について説明する。 <Overview of Moving Path Estimation Device According to First Embodiment of the Present Invention>
First, an outline of the first embodiment of the present invention will be described.

本実施形態では、入力次元数の増加を防ぐため、入力パラメータを設定２と同じく“一つの時間帯Ｔ_ｊに各ルートＲ_ｉを通るエージェント数

を要素とするＩ次元ベクトル

”とする。 In this embodiment, in order to prevent an increase in the number of input dimensions, the number of agents passing each route R _i in one time zone T _j is the same as in the setting 2

I-dimensional vector whose element is

"

しかし、目的関数は設定２と異なり、同時間帯の観測誤差だけではなく、時間遅れによって影響を与える翌時間帯以降の測定誤差も用いる。図１は、本実施形態の概要を示すイメージ図である。 However, the objective function is different from setting 2 and uses not only the observation error of the same time zone but also the measurement error of the next time zone which is affected by the time delay. FIG. 1 is an image diagram showing an outline of the present embodiment.

つまり、本実施形態では、下記式（４）の最適化問題を解くことにより、各時間帯に各ルートを通るエージェント数を求める。 That is, in the present embodiment, the number of agents passing each route in each time zone is determined by solving the optimization problem of the following equation (4).

ここで、Ｄはエージェントが発生してから観測されるまでに最大でかかる時間帯数とする。Ｄはエージェントがどれだけ遅れるかを表すものであるため、状況に応じて適宜変更することができる。 Here, D is the number of time zones that it takes at most after an agent is generated and observed. Since D represents how much the agent is delayed, it can be appropriately changed according to the situation.

目的関数を

とすることにより、時間帯Ｔ_ｊに発生するエージェントが時間遅れによって影響を与える時間帯の観測誤差まで考慮することができるため、高精度にエージェントの移動経路を推定できるようになる。 Objective function

And by, it is possible to consider to the observation error of the time period that the agent which occurs in the time zone T _j affect the time delay, it becomes possible to estimate the movement path of the agent with high accuracy.

＜本発明の第１の実施の形態に係る移動経路推定装置の構成＞
図２を参照して、本発明の実施の形態に係る移動経路推定装置の構成について説明する。図２は、本発明の実施の形態に係る移動経路推定装置の構成を示すブロック図である。 <Configuration of Moving Path Estimation Device According to First Embodiment of the Present Invention>
The configuration of the moving path estimation apparatus according to the embodiment of the present invention will be described with reference to FIG. FIG. 2 is a block diagram showing the configuration of a moving path estimation apparatus according to an embodiment of the present invention.

移動経路推定装置１００は、ＣＰＵと、ＲＡＭと、後述する移動経路推定処理ルーチンを実行するためのプログラムを記憶したＲＯＭとを備えたコンピュータで構成され、機能的には次に示すように構成されている。 The movement path estimation apparatus 100 is configured by a computer including a CPU, a RAM, and a ROM storing a program for executing a movement path estimation processing routine described later, and is functionally configured as follows: ing.

図２に示すように、本実施形態に係る移動経路推定装置１００は、シミュレータ設定処理部１１０と、最適化処理部１２０と、データ記憶部１３０と、最適化実行部１４０と、最適パラメータ処理部１５０とを備え、入力装置や表示装置などの外部装置２００に接続されて構成される。 As shown in FIG. 2, the movement path estimation apparatus 100 according to the present embodiment includes a simulator setting processing unit 110, an optimization processing unit 120, a data storage unit 130, an optimization execution unit 140, and an optimum parameter processing unit. 150 and is connected to an external device 200 such as an input device or a display device.

シミュレータ設定処理部１１０は、外部装置２００より入力された、シミュレータを実行するに当たり必要なフィールドを、シミュレータ設定テーブル１３１に設定する。 The simulator setting processing unit 110 sets, in the simulator setting table 131, fields necessary for executing the simulator, which are input from the external device 200.

最適化処理部１２０は、外部装置２００より入力された最大最適化実行回数Ｓ、時間分割数Ｊ、及び時間遅れ定数Ｄを、最適化設定テーブル１３２に設定する。 The optimization processing unit 120 sets the maximum number of optimization execution times S, the number of time divisions J, and the time delay constant D input from the external device 200 in the optimization setting table 132.

最大最適化実行回数Ｓは、一つの時間帯に対して最適化を行う最大回数であり、時間分割数Ｊは、データが観測された時間帯の分割個数であり、時間遅れ定数Ｄは、エージェントが発生時間帯から観測されるまでにかかる時間帯の最大数である。 The maximum number of times of optimization execution S is the maximum number of times of optimization for one time zone, the number of time divisions J is the number of divisions of the time zone in which the data was observed, and the time delay constant D is the agent Is the maximum number of time periods it takes to be observed from the occurrence time zone.

データ記憶部１３０は、シミュレータ設定テーブル１３１と、最適化設定テーブル１３２と、最適パラメータテーブル１３３とを備えている。 The data storage unit 130 includes a simulator setting table 131, an optimization setting table 132, and an optimum parameter table 133.

シミュレータ設定テーブル１３１は、人流シミュレータを実行するに当たり必要なフィールドを有する。必要なフィールドとしては、例えば道路ネットワーク、エージェントの歩行速度などである。 The simulator setting table 131 has fields necessary for executing a human flow simulator. The required fields are, for example, the road network, the walking speed of the agent, and the like.

最適化設定テーブル１３２は、最大最適化実行回数フィールド、最大繰り返し回数フィールド、時間帯分割数フィールド、時間遅れ定数フィールドを有する。 The optimization setting table 132 has a maximum optimization execution number field, a maximum repetition number field, a time zone division number field, and a time delay constant field.

最大最適化実行回数フィールドには、最大最適化実行回数Ｓが設定され、時間帯分割数フィールドには、時間分割数Ｊが設定され、時間遅れ定数フィールドには、時間遅れ定数Ｄが設定される。 Maximum optimization execution number S is set in the maximum optimization execution number field, time division number J is set in the time zone division number field, and time delay constant D is set in the time delay constant field .

最適パラメータテーブル１３３は、最適化実行部１４０より入力された最適パラメータが設定される。 In the optimum parameter table 133, the optimum parameters input from the optimization execution unit 140 are set.

最適化実行部１４０は、シミュレータ設定テーブル１３１と、最適化設定テーブル１３２との設定を読み込んで、目的関数

を最適化する入力パラメータ

を求める。 The optimization execution unit 140 reads the settings of the simulator setting table 131 and the optimization setting table 132 to obtain an objective function.

Parameters to optimize

Ask for

具体的には、最適化実行部１４０は、シミュレータ実行部１４１と、次入力パラメータ決定部１４２と、最適化制御部１４３と、最適化結果保管部１４４と、時間帯最適化結果保管部１４５とを備えている。 Specifically, the optimization execution unit 140 includes a simulator execution unit 141, a next input parameter determination unit 142, an optimization control unit 143, an optimization result storage unit 144, and a time zone optimization result storage unit 145. Is equipped.

シミュレータ実行部１４１は、実環境における複数の経路Ｒ_ｉの各々についての、推定対象時間帯Ｔ_ｊでの経路Ｒ_ｉを通過するエージェントの数を要素とする入力パラメータ

に基づいて、各時間帯Ｔ_ｊにおいて複数のエージェントが移動するシミュレーションを実行し、シミュレーションの結果から、推定対象時間帯Ｔ_ｊ及び推定対象時間帯の次の時間帯Ｔ_ｊ＋Ｄを含む所定数の時間帯において、観測地点の各々についてのエージェントに関する計算値

と、実環境での観測値

との観測誤差を表す目的関数

を計算する。 The simulator execution unit 141 uses, as an element, the number of agents passing through the route R _i in the estimation target time zone T _j for each of the plurality of routes R _i in the real environment.

Based on the above, a simulation in which a plurality of agents move in each time zone T _j is executed, and from the simulation results, a predetermined number of times including the estimation target time zone T _j and the next time zone T _{j + D} of the estimation target time zone Calculated values for the agent for each of the observation points in the

And observed values in the real environment

Objective function representing the observation error with

Calculate

具体的には、シミュレータ実行部１４１は、時間帯Ｔ_ｊにおける入力パラメータ

に基づいて、人流シミュレータ（ＭＡＳ）を実行し、目的関数値

を算出し、

を最適化結果保管部１４４に追加する。 Specifically, the simulator execution unit 141 inputs an input parameter in the time zone T _j

Run a human flow simulator (MAS) based on the objective function value

Calculate

Is added to the optimization result storage unit 144.

ここで、人流シミュレータ自体は、全ての時間帯のシミュレーションを行っており、シミュレータ実行部１４１は、その人流シミュレータの実行結果から、目的関数値

を算出する。 Here, the human flow simulator itself simulates all time zones, and the simulator execution unit 141 determines the objective function value from the execution result of the human flow simulator.

Calculate

次入力パラメータ決定部１４２は、入力パラメータ

と、目的関数

の計算結果とに基づいて、次の入力パラメータ

を決定する。 The next input parameter determination unit 142 determines the input parameter

And the objective function

The following input parameters based on the calculation results of

Decide.

例えば、最適化手法としてベイズ最適化を用いた場合では、ガウス過程などの確率モデルを用いることによって、次の入力パラメータを決定する。具体的には、次入力パラメータ決定部１４２は、最適化結果保管部１４４内の全てのデータを用いて、パラメータ

と、目的関数値

との関係式を、例えばガウス過程による推定を行う。

とあらわされるとき、あるパラメータ

における目的関数値

は、下記式（５）であらわされる平均値

と、下記式（６）であらわされる分散値

を持つガウス分布に従う。 For example, in the case of using Bayesian optimization as the optimization method, the following input parameters are determined by using a probability model such as a Gaussian process. Specifically, the next input parameter determination unit 142 uses the all data in the optimization result storage unit 144 to generate parameters.

And the objective function value

For example, estimation with a Gaussian process is performed using

When it is expressed as a certain parameter

Objective function value in

Is the average value represented by the following formula (5)

And the dispersion value represented by the following equation (6)

Follow a Gaussian distribution with.

ただし、上付きの記号Ｔは行列の転置を表し、上付き記号”−１”は逆行列を表す。このカーネル関数は、問題に応じて変更してよい。代表的なカーネルとして、線形カーネル、ガウスカーネルなどがある（非特許文献２参照）。 However, superscript symbol T represents transposition of a matrix, and superscript symbol "-1" represents inverse matrix. This kernel function may change depending on the problem. Typical kernels include linear kernels and Gaussian kernels (see Non-Patent Document 2).

ここで、

であり、ｋ、Ｋは、カーネル関数と呼ばれる、パラメータ

と

との類似度を定義する関数

を用いて、下記式（７）、及び式（８）のように書くことができる。 here,

Where k and K are parameters called kernel functions

When

Functions that define the degree of similarity with

The following equation (7) and equation (8) can be written using

そして、次入力パラメータ決定部１４２は、下記式（９）に従い、次に人流シミュレータに入力するパラメータ

を決定する。 Then, the next input parameter determination unit 142 sets the next parameter to be input to the human flow simulator according to the following equation (9):

Decide.

ここで、

は獲得関数と呼ばれ、パラメータ

が最小値を与える可能性を定量的に評価するための指標である。獲得関数には、確率改善（ＰＩ）や、期待値改善（ＥＩ）などを用いてもよい。 here,

Is called the acquisition function, and the parameters

Is an index to quantitatively evaluate the possibility of giving the minimum value. As the acquisition function, probability improvement (PI) or expectation value improvement (EI) may be used.

最適化制御部１４３は、時間帯Ｔ_ｊを推定対象時間帯として、予め定めた繰り返し条件を満たすまで、シミュレータ実行部１４１による実行と、次入力パラメータ決定部１４２による決定とを繰り返させ、目的関数

を最適化する入力パラメータ

を求める The optimization control unit 143 repeats the execution by the simulator execution unit 141 and the determination by the next input parameter determination unit 142 until the predetermined repetition condition is satisfied, with the time zone T _j as the estimation target time zone, and the objective function

Parameters to optimize

Ask for

具体的には、まず、最適化制御部１４３は、最適化結果保管部１４４を空集合に初期化する。 Specifically, first, the optimization control unit 143 initializes the optimization result storage unit 144 to an empty set.

次に、最適化制御部１４３は、時間帯Ｔ_ｊにおいて、最大最適化実行回数Ｓを超えるまで、シミュレータ実行部１４１と、次入力パラメータ決定部１４２との処理を繰り返させる。 Next, the optimization control unit 143 repeats the processing of the simulator execution unit 141 and the next input parameter determination unit 142 until the maximum optimization execution number S is exceeded in the time zone T _j .

最適化制御部１４３は、シミュレータ実行部１４１に、次入力パラメータ決定部１４２により得られたパラメータ

を入力パラメータとして入力して、人流シミュレータを実行させる。 The optimization control unit 143 causes the simulator execution unit 141 to obtain the parameters obtained by the next input parameter determination unit 142.

Is input as an input parameter to run a human flow simulator.

そして、最適化制御部１４３は、繰り返しが終わると、最適化結果保管部１４４に格納された入力パラメータの中で誤差を最小とする

を算出し、時間帯最適化結果保管部１４５に、

を格納する。ここで、Ｄａｔａは、最適化結果保管部１４４を意味する。 Then, the optimization control unit 143 minimizes the error among the input parameters stored in the optimization result storage unit 144 after the repetition.

Time zone optimization result storage unit 145,

Store Here, Data means the optimization result storage unit 144.

そして、最適化制御部１４３は、最適化結果保管部１４４を空集合に初期化して、次の時間帯Ｔ_ｊ＋１を推定対象時間として同様の処理を行う。 Then, the optimization control unit 143 initializes the optimization result storage unit 144 to an empty set, and performs the same processing with the next time zone T _{j + 1} as the estimation target time.

最適化制御部１４３は、全ての時間帯（Ｔ_１〜Ｔ_Ｊ）について上記処理を行うと、時間帯最適化結果保管部１４５に含まれるデータをまとめて最適パラメータ

を算出し、最適パラメータテーブル１３３に格納する。 When the optimization control unit 143 performs the above process for all the time zones (T _{1 to} T _J ), the optimization control unit 143 puts together the data included in the time zone optimization result storage unit 145 to obtain optimum parameters.

Are calculated and stored in the optimum parameter table 133.

最適化結果保管部１４４は、シミュレータ実行部１４１により得られた時間帯Ｔ_ｊにおける入力パラメータ

と目的関数値

との組である

を、記憶する。 The optimization result storage unit 144 receives input parameters in the time zone T _j obtained by the simulator execution unit 141.

And objective function values

It is a pair with

Remember.

また、最適化結果保管部１４４は、最適化制御部１４３により初期化すべき命令を取得すると、記憶している

を全て削除し、空集合にする。 Further, the optimization result storage unit 144 stores it when the optimization control unit 143 acquires an instruction to be initialized.

Delete all and make it an empty set.

時間帯最適化結果保管部１４５は、最適化制御部１４３により得られた時間帯Ｔ_ｊにおける最適パラメータ

を記憶している。 The time zone optimization result storage unit 145 obtains the optimum parameters in the time zone T _j obtained by the optimization control unit 143.

I remember.

最適パラメータ処理部１５０は、最適パラメータテーブル１３３から取得した最適パラメータ

を、外部装置２００に出力する。出力処理は、例えば、外部装置２００から出力のリクエストが入力された場合等に実行すればよい。 The optimum parameter processing unit 150 acquires the optimum parameters acquired from the optimum parameter table 133.

Are output to the external device 200. The output process may be executed, for example, when an output request is input from the external device 200.

＜本発明の第１の実施の形態に係る移動経路推定装置の作用＞
図３は、本発明の第１の実施の形態に係る移動経路推定処理ルーチンを示すフローチャートである。 <Operation of Moving Path Estimation Device According to First Embodiment of the Present Invention>
FIG. 3 is a flowchart showing a movement path estimation processing routine according to the first embodiment of the present invention.

シミュレータ設定処理部１１０と、最適化処理部１２０に、各設定が入力されると、移動経路推定装置３００において、図３に示す移動経路推定処理ルーチンが実行される。 When each setting is input to the simulator setting processing unit 110 and the optimization processing unit 120, the movement route estimation processing routine shown in FIG. 3 is executed in the movement route estimation apparatus 300.

まず、ステップＳ１００において、シミュレータ設定処理部１１０は、外部装置２００より入力された、シミュレータを実行するに当たり必要なフィールドを、最適化処理部１２０は、外部装置２００より入力された最大最適化実行回数Ｓ、時間分割数Ｊ、及び時間遅れ定数Ｄを、それぞれ取得する。 First, in step S100, the simulator setting processor 110 receives the field necessary for executing the simulator, which is input from the external device 200, and the optimization processor 120 performs the maximum number of optimization executions input from the external device 200. S, the time division number J, and the time delay constant D are acquired respectively.

ステップＳ１１０において、シミュレータ設定処理部１１０は、上記ステップＳ１００により取得したシミュレータを実行するに当たり必要なフィールドを、シミュレータ設定テーブル１３１に設定する。 In step S110, the simulator setting processing unit 110 sets, in the simulator setting table 131, fields necessary for executing the simulator acquired in step S100.

ステップＳ１２０において、最適化処理部１２０は、上記ステップＳ１００により取得した最大最適化実行回数Ｓ、時間分割数Ｊ、及び時間遅れ定数Ｄを、最適化設定テーブル１３２に設定する。 In step S120, the optimization processing unit 120 sets the maximum optimization execution number S, the number of time divisions J, and the time delay constant D acquired in step S100 in the optimization setting table 132.

ステップＳ１３０において、最適化制御部１４３は、最適化結果保管部１４４（Ｄａｔａ）を空集合に初期化する。 In step S130, the optimization control unit 143 initializes the optimization result storage unit 144 (Data) to an empty set.

ステップＳ１４０において、ｊ＝１とする。ｊは推定対象時間帯Ｔ_ｊを数えるためのカウンタである。 In step S140, j = 1. j is a counter for counting the estimation target time zone T _j .

ステップＳ１５０において、最適化実行部１４０は、最適化処理を実行する。 In step S150, the optimization execution unit 140 executes optimization processing.

ステップＳ１６０において、最適化制御部１４３は、最適化結果保管部１４４に格納された入力パラメータの中で誤差を最小とする

を算出し、時間帯最適化結果保管部１４５に、

を格納する。 In step S160, the optimization control unit 143 minimizes the error among the input parameters stored in the optimization result storage unit 144.

Time zone optimization result storage unit 145,

Store

ステップＳ１７０において、ｊ＝ｊ＋１とする。 In step S170, j = j + 1.

ステップＳ１８０において、最適化制御部１４３は、ｊが時間分割数Ｊより大きいか否かを判定する。 In step S180, the optimization control unit 143 determines whether j is larger than the time division number J.

ｊがＪよりも大きくない場合（ステップＳ１８０のＮＯ）、ステップＳ１９０において、最適化制御部１４３は、最適化結果保管部１４４を空集合に初期化して、再度ステップＳ１５０〜Ｓ１７０の処理を繰り返す。 If j is not larger than J (NO in step S180), in step S190, the optimization control unit 143 initializes the optimization result storage unit 144 to an empty set, and repeats the processing in steps S150 to S170 again.

一方、ｊがＪよりも大きい場合（ステップＳ１８０のＹＥＳ）、ステップＳ２００において、最適化制御部１４３は、時間帯最適化結果保管部１４５に含まれるデータをまとめて最適パラメータ

を生成し、最適パラメータテーブル１３３に格納する。 On the other hand, when j is larger than J (YES in step S180), in step S200, the optimization control unit 143 puts together the data included in the time zone optimization result storage unit 145 and generates the optimum parameters.

Are stored in the optimum parameter table 133.

ステップＳ２１０において、最適パラメータ処理部１５０は、最適パラメータテーブル１３３から取得した最適パラメータ

を、外部装置２００に出力する。 In step S210, the optimal parameter processing unit 150 determines the optimal parameter acquired from the optimal parameter table 133.

Are output to the external device 200.

ここで、上記ステップＳ１５０における最適化処理について説明する。図４は、最適化処理ルーチンを示すフローチャートである。 Here, the optimization process in step S150 will be described. FIG. 4 is a flowchart showing the optimization processing routine.

ステップＳ３００において、ｓ＝１とする。ｓは最適化を行った回数を数えるためのカウンタである。 In step S300, s = 1. s is a counter for counting the number of times of optimization.

ステップＳ３１０において、シミュレータ実行部１４１は、実環境における複数の経路Ｒ_ｉの各々についての、推定対象時間帯Ｔ_ｊでの経路Ｒ_ｉを通過するエージェントの数を要素とする入力パラメータ

に基づいて、各時間帯Ｔ_ｊにおいて複数のエージェントが移動するシミュレーションを実行する。 In step S310, the simulator execution unit 141 uses, as an element, the number of agents passing the route R _i in the estimation target time zone T _j for each of the plurality of routes R _i in the real environment.

Based on the simulation in which a plurality of agents move in each time zone T _j .

ステップＳ３２０において、シミュレータ実行部１４１は、上記ステップＳ３１０によるシミュレーションの結果から、推定対象時間帯Ｔ_ｊ及び推定対象時間帯の次の時間帯Ｔ_ｊ＋Ｄを含む所定数の時間帯において、観測地点の各々についてのエージェントに関する計算値

と、実環境での観測値

との観測誤差を表す目的関数

を計算する。 In step S320, the simulator execution unit 141 determines each of the observation points in a predetermined number of time zones including the estimation target time zone T _j and the next time zone T _{j + D} of the estimation target time zone from the simulation result in step S310. Calculated value for agent about

And observed values in the real environment

Objective function representing the observation error with

Calculate

ステップＳ３３０において、シミュレータ実行部１４１は、上記ステップＳ３２０で算出した

を最適化結果保管部１４４に追加する。 In step S330, the simulator execution unit 141 calculates in step S320.

Is added to the optimization result storage unit 144.

ステップＳ３４０において、次入力パラメータ決定部１４２は、最適化結果保管部１４４内の全てのデータを用いて、パラメータ

と、目的関数値

との関係式を推定する。 In step S340, the next input parameter determination unit 142 uses the all data in the optimization result storage unit 144 to generate parameters.

And the objective function value

Estimate the relationship with.

ステップＳ３５０において、次入力パラメータ決定部１４２は、式（９）に従い、次に人流シミュレータに入力するパラメータ

を決定する。 In step S350, the next-input-parameter determining unit 142 sets the next parameter to be input to the human flow simulator according to equation (9).

Decide.

ステップＳ３６０において、ｓ＝ｓ＋１とする。 In step S360, s = s + 1.

ステップＳ３７０において、最適化制御部１４３は、ｓが最大最適化実行回数Ｓより大きいか否かを判定する。 In step S370, the optimization control unit 143 determines whether s is greater than the maximum optimization execution number S or not.

ｓが最大最適化実行回数Ｓより大きくない場合（ステップＳ３７０のＮＯ）、ステップＳ３１０に戻り、ステップＳ３１０〜Ｓ３６０の処理を繰り返す。 If s is not larger than the maximum optimization execution number S (NO in step S370), the process returns to step S310, and the processes in steps S310 to S360 are repeated.

最大最適化実行回数Ｓより大きい場合（ステップＳ３７０のＹＥＳ）、リターンする。 If it is larger than the maximum optimization execution number S (YES in step S370), the process returns.

以上説明したように、本実施形態によれば、シミュレーションの結果から、推定対象時間帯及び推定対象時間帯の次の時間帯を含む所定数の時間帯において、観測地点の各々についてのエージェントに関する計算値と、実環境での観測値との観測誤差を表す目的関数を計算することと、入力パラメータと、目的関数の計算結果とに基づいて、次の入力パラメータを決定することとを繰り返し、目的関数を最適化する入力パラメータを求めることにより、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を効率的に推定することができる。 As described above, according to the present embodiment, from the result of simulation, in the predetermined number of time zones including the estimation target time zone and the next time zone of the estimation target time zone, the calculation regarding the agent for each of the observation points Calculate the objective function that represents the observation error of the value and the observation value in the real environment, and repeatedly determine the next input parameter based on the input parameter and the calculation result of the objective function. By obtaining input parameters that optimize the function, it is possible to efficiently estimate the agent's action path that reduces the observation error, even in a situation where a time delay occurs.

＜本発明の第２の実施の形態に係る移動経路推定装置の概要＞
次に、本発明の第２の実施形態の概要について説明する。 <Overview of Moving Path Estimation Device According to Second Embodiment of the Present Invention>
Next, an outline of the second embodiment of the present invention will be described.

図５は、本実施形態の概要を示すイメージ図である。本実施形態では、第１の実施形態によって、時間帯Ｔ_１、Ｔ_２、Ｔ_３…と逐次的に最適化を行うことで、最終的に全ての時間帯に各ルートを通るエージェント数を求める。その際、全時間帯の最適化を一度行った後、さらに複数回、時間帯Ｔ_１、Ｔ_２、Ｔ_３…というように最適化を行う。 FIG. 5 is an image diagram showing an outline of the present embodiment. In the present embodiment, according to the first embodiment, the number of agents passing through each route in all the time zones is finally determined by sequentially optimizing the time zones T ₁ , T ₂ , T ₃ . . At this time, optimization of the entire time zone is performed once, and then optimization is performed a plurality of times such as time zones T ₁ , T ₂ , T ₃ , and so on.

時間帯Ｔ_ｊの入力パラメータは前後Ｄ個の時間帯に発生したエージェントの影響を受けるため、同じ時間帯であっても前後の時間帯が最適化された後に、もう一度最適化することで、さらにエージェントの移動経路推定の精度を上げることができる。 Since the input parameters of time zone T _j are affected by agents generated in the previous and subsequent D time zones, optimization is performed again after the previous and subsequent time zones are optimized even in the same time zone. It is possible to increase the accuracy of agent movement path estimation.

＜本発明の第２の実施の形態に係る移動経路推定装置の構成＞
図６を参照して、本発明の実施の形態に係る移動経路推定装置の構成について説明する。図６は、本発明の実施の形態に係る移動経路推定装置３００の構成を示すブロック図である。なお、第１の実施の形態に係る移動経路推定装置１００と同様の構成については、同一の符号を付して詳細な説明は省略する。 <Configuration of Moving Path Estimation Device According to Second Embodiment of the Present Invention>
The configuration of the movement path estimation apparatus according to the embodiment of the present invention will be described with reference to FIG. FIG. 6 is a block diagram showing a configuration of the movement path estimation apparatus 300 according to the embodiment of the present invention. In addition, about the structure similar to the movement path | route estimation apparatus 100 based on 1st Embodiment, the same code | symbol is attached | subjected and detailed description is abbreviate | omitted.

最適化処理部３２０は、外部装置２００より入力された最大最適化実行回数Ｓ、最大繰り返し回数Ｒ、時間分割数Ｊ、及び時間遅れ定数Ｄを、最適化設定テーブル３３２に設定する。 The optimization processing unit 320 sets the maximum number of times of optimization execution S, the maximum number of repetitions R, the number of time divisions J, and the time delay constant D input from the external device 200 in the optimization setting table 332.

最大繰り返し回数Ｒは、最大最適化実行回数Ｓを１単位としたときの最大単位数である。 The maximum number of repetitions R is the maximum number of units when the maximum number of optimization executions S is one unit.

最適化設定テーブル３３２は、最大最適化実行回数フィールド、最大繰り返し回数フィールド、時間帯分割数フィールド、時間遅れ定数フィールドを有する。最大繰り返し回数フィールドには、最大繰り返し回数Ｒが設定される。 The optimization setting table 332 has a maximum optimization execution number field, a maximum repetition number field, a time zone division number field, and a time delay constant field. The maximum number of repetitions R is set in the maximum number of repetitions field.

最適化実行部３４０は、シミュレータ実行部１４１と、次入力パラメータ決定部１４２と、最適化制御部１４３と、最適化結果保管部１４４と、時間帯最適化結果保管部１４５と、反復制御部３４６を備えている。 The optimization execution unit 340 includes a simulator execution unit 141, a next input parameter determination unit 142, an optimization control unit 143, an optimization result storage unit 144, a time zone optimization result storage unit 145, and an iteration control unit 346. Is equipped.

反復制御部３４６は、全ての時間帯（Ｔ_１〜Ｔ_Ｊ）の各々を、推定対象時間帯として、最適化制御部１４３により入力パラメータを求めることを繰り返させる。 The iterative control unit 346 causes the optimization control unit 143 to repeatedly obtain input parameters, with each of all time zones (T _{1 to} T _J ) as estimation target time zones.

具体的には、反復制御部３４６は、最適化制御部１４３が、全ての時間帯（Ｔ_１〜Ｔ_Ｊ）についての処理を終えると、シミュレータ実行部１４１による実行と、次入力パラメータ決定部１４２による決定とを繰り返した回数が最大繰り返し回数Ｒを超えるまで、最適化制御部１４３に対して再度時間帯Ｔ_１から処理を行わせる。 Specifically, when the optimization control unit 143 completes the processing for all time zones (T _{1 to} T _J ), the iterative control unit 346 executes the processing by the simulator execution unit 141 and the next input parameter determination unit 142. to greater than the maximum number of repetitions R is the number of times repeated and determination by to perform the process again from the time zone T ₁ with respect to the optimization control unit 143.

この時、最適化制御部１４３は最適化結果保管部１４４を空集合に初期化せず、最適化結果保管部１４４に、

とその目的関数値

を格納する。 At this time, the optimization control unit 143 does not initialize the optimization result storage unit 144 to an empty set, and the optimization result storage unit 144

And its objective function value

Store

また、繰り返した回数が最大繰り返し回数Ｒを超えた場合には、反復制御部３４６は、最適化制御部１４３に、時間帯最適化結果保管部１４５に含まれるデータをまとめて最適パラメータ

を算出させ、最適パラメータテーブル１３３に格納させる。 In addition, when the number of repetitions exceeds the maximum number of repetitions R, the repetition control unit 346 causes the optimization control unit 143 to combine the data included in the time zone optimization result storage unit 145 into an optimum parameter.

Are calculated and stored in the optimum parameter table 133.

＜本発明の第２の実施の形態に係る移動経路推定装置の作用＞
図７は、本発明の実施の形態に係る移動経路推定処理ルーチンを示すフローチャートである。なお、第１の実施の形態に係る移動経路推定処理ルーチンと同様の処理については、同一の符号を付して詳細な説明は省略する。 <Operation of Moving Path Estimation Device According to Second Embodiment of Present Invention>
FIG. 7 is a flowchart showing a movement path estimation processing routine according to the embodiment of the present invention. In addition, about the process similar to the movement path | route estimation process routine which concerns on 1st Embodiment, the same code | symbol is attached | subjected and detailed description is abbreviate | omitted.

ステップＳ４００において、シミュレータ設定処理部１１０は、外部装置２００より入力された、シミュレータを実行するに当たり必要なフィールドを、最適化処理部３２０は、外部装置２００より入力された最大最適化実行回数Ｓ、時間分割数Ｊ、最大繰り返し回数Ｒ、及び時間遅れ定数Ｄを、それぞれ取得する。 In step S400, the simulator setting processing unit 110 receives the field necessary for executing the simulator, which is input from the external device 200, the optimization processing unit 320, the maximum optimization execution number S input from the external device 200, The time division number J, the maximum number of repetitions R, and the time delay constant D are acquired respectively.

ステップＳ４２０において、最適化処理部３２０は、上記ステップＳ４００により取得した最大最適化実行回数Ｓ、時間分割数Ｊ、最大繰り返し回数Ｒ、及び時間遅れ定数Ｄを、最適化設定テーブル３３２に設定する。 In step S420, the optimization processing unit 320 sets the maximum optimization execution number S, the number of time divisions J, the maximum number of repetitions R, and the time delay constant D acquired in step S400 in the optimization setting table 332.

ステップＳ４３５において、ｒ＝１とする。ｒは、最適化処理を繰り返した回数を数えるためのカウンタである。 In step S435, r is set to 1. r is a counter for counting the number of times the optimization process is repeated.

ステップＳ１７０において、ｊ＝ｊ＋１とする。なお、ｊ＋１＞Ｊとなる場合には、ｊ＝１とする。 In step S170, j = j + 1. When j + 1> J, j = 1.

ステップＳ４７５において、ｒ＝ｒ＋１とする。 In step S475, r = r + 1.

ステップＳ４８０において、反復制御部３４６は、ｒが最大繰り返し回数Ｒより大きいか否かを判定する。 In step S480, the iteration control unit 346 determines whether r is greater than the maximum number of iterations R.

ｒが最大繰り返し回数Ｒより大きくない場合（ステップＳ４８０のＮＯ）、ステップＳ４９０において、最適化制御部１４３は、ｒが時間分割数Ｊより大きいか否かを判定する。 If r is not larger than the maximum number of repetitions R (NO in step S480), the optimization control unit 143 determines whether r is larger than the number of time divisions J in step S490.

ｒがＪよりも大きくない場合（ステップＳ４９０のＮＯ）、ステップＳ５００において、最適化結果保管部１４４を空集合に初期化して、再度ステップＳ１５０〜Ｓ４８０の処理を繰り返す。 If r is not larger than J (NO in step S490), in step S500, the optimization result storage unit 144 is initialized to an empty set, and the processing in steps S150 to S480 is repeated again.

一方、ｒがＪよりも大きい場合（ステップＳ４９０のＹＥＳ）、ステップＳ５１０において、最適化制御部１４３は、時間帯最適化結果保管部１４５に格納されている

とその目的関数値

を読み出して、最適化結果保管部１４４に格納して、再度ステップＳ１５０〜Ｓ４８０の処理を繰り返す。 On the other hand, when r is larger than J (YES in step S 490), the optimization control unit 143 stores the time zone optimization result storage unit 145 in step S 510.

And its objective function value

Are stored in the optimization result storage unit 144, and the processing of steps S150 to S480 is repeated again.

一方、ｊがＪよりも大きい場合（ステップＳ４９０のＹＥＳ）、 On the other hand, when j is larger than J (YES in step S 490),

また、一方、ｒが最大繰り返し回数Ｒより大きい場合（ステップＳ４８０のＹＥＳ）、ステップＳ２００に進む。 On the other hand, if r is larger than the maximum number of repetitions R (YES in step S480), the process proceeds to step S200.

以上説明したように、本実施形態によれば、予め定めた繰り返し条件を満たすまで、全ての時間帯の各々を、推定対象時間帯として、入力パラメータを繰り返し求めることを、全ての時間帯の各々を、推定対象時間帯として、繰り返し実行させることにより、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を精度良く推定することができる。 As described above, according to the present embodiment, it is determined that each of all the time zones is repeatedly assumed to be an estimation target time zone until the predetermined repetition condition is satisfied. By repeatedly executing the process as the estimation target time zone, it is possible to accurately estimate the agent's action path which reduces the observation error even under a situation where a time delay occurs.

＜本発明の第３の実施の形態に係る移動経路推定装置＞
次に、本発明の第３の実施形態について説明する。 <Movement Path Estimation Device According to Third Embodiment of the Present Invention>
Next, a third embodiment of the present invention will be described.

図８は、本実施形態の概要を示すイメージ図である。本実施形態では、第１の実施形態、または第２の実施形態において、時間帯Ｔ_ｊの最適化を行う際、前後Ｄ個の時間帯のエージェントのみを発生させることで、全時間帯のエージェントを行動させる場合に比べ、ＭＡＳの実行時間が短縮され、結果として最適化の高速化が可能となる。 FIG. 8 is an image diagram showing an outline of the present embodiment. In the present embodiment, in the first embodiment or the second embodiment, when time zone T _j is optimized, agents in all time zones can be generated by generating only agents in D time zones before and after. The execution time of the MAS can be shortened compared to the case of making the action of H. As a result, the optimization can be speeded up.

本実施形態において、第１の実施形態、または第２の実施形態に係る移動経路推定装置と同様の構成については、同一の符号を付して詳細な説明は省略する。 In the present embodiment, the same components as those of the movement path estimation apparatus according to the first embodiment or the second embodiment are denoted by the same reference numerals, and detailed description will be omitted.

シミュレータ実行部１４１は、全ての時間帯（Ｔ_１〜Ｔ_Ｊ）のうち、推定対象時間帯の前の所定数の時間帯（Ｔ_ｊ−Ｄ）、推定対象時間帯（Ｔ_ｊ）、及び推定対象時間帯の次の時間帯を含む所定数の時間帯（Ｔ_ｊ＋Ｄ）において複数のエージェントが移動するシミュレーションを実行する。 The simulator execution unit 141 determines a predetermined number of time zones (T _{j -D} ) before the estimation target time zone among all the time zones (T _{1 to} T _J ), an estimation target time zone (T _j ), and estimation. A simulation is performed in which a plurality of agents move in a predetermined number of time zones (T _{j + D 2} ) including a time zone next to the target time zone.

具体的には、シミュレータ実行部１４１は、人流シミュレータを実行する場合において、実際のシミュレータ上では、パラメータ

によって発生するエージェントのみを行動させる。 Specifically, in the case of executing a human flow simulator, the simulator execution unit 141 uses parameters on an actual simulator.

Act only the agents generated by.

以上説明したように、本実施形態によれば、全ての時間帯のうち、推定対象時間帯の前の所定数の時間帯、推定対象時間帯、及び推定対象時間帯の次の時間帯を含む所定数の時間帯において複数のエージェントが移動するシミュレーションを実行することにより、時間遅れが発生する状況下であっても、観測誤差を小さくするようなエージェントの行動経路を高速に推定することができる。 As described above, according to the present embodiment, among all the time zones, the predetermined number of time zones before the estimation target time zone, the estimation target time zone, and the next time zone of the estimation target time zone are included. By executing a simulation in which a plurality of agents move in a predetermined number of time zones, it is possible to quickly estimate an agent's action path which reduces observation error even under a situation where time delay occurs. .

なお、本発明は、上述した実施の形態に限定されるものではなく、この発明の要旨を逸脱しない範囲内で様々な変形や応用が可能である。 The present invention is not limited to the above-described embodiment, and various modifications and applications can be made without departing from the scope of the present invention.

上述の実施形態では、最適化手法の例としてベイズ最適化を用いる場合について説明したが、ベイズ最適化に限定されるものではなく、他の最適化手法やアルゴリズムを適用しても良い。例えば、最適化手法として、遺伝的アルゴリズムを適用することができる。 Although the above-mentioned embodiment demonstrated the case where Bayesian optimization was used as an example of the optimization method, it is not limited to Bayesian optimization, You may apply another optimization method and algorithm. For example, a genetic algorithm can be applied as an optimization method.

本実施形態では、複数の観測地ｋにおいてある時間帯Ｔ_ｊに計測された通行人数

から各時間帯に経路Ｒ_ｉを通過した人数

を推定するという場合を考えたが、様々な観測データ（通行人数、単位時間当たりの観測地点の密度、目的地までの平均移動時間等）に対して柔軟に適応させることが出来る。このため、種々のデバイスから得られたデータに対応することが出来る。 In the present embodiment, the number of passing people measured in a certain time zone T _j at a plurality of observation sites k

The number of people who passed route R _i in each time zone from

It is possible to flexibly adapt to various observation data (the number of people passing, density of observation points per unit time, average traveling time to the destination, etc.). Thus, it is possible to cope with data obtained from various devices.

また、本願明細書中において、プログラムが予めインストールされている実施形態として説明したが、移動経路推定装置の各構成要素の動作をプログラムとして構築し、移動経路推定装置として利用されるコンピュータにインストールして実行させる、またはネットワークを介して流通させることが可能である。また、当該プログラムを、コンピュータ読み取り可能な記録媒体に格納して提供することも可能である。 Also, in the present specification, although the program is described as an embodiment installed in advance, the operation of each component of the movement path estimation device is constructed as a program, and installed in a computer used as the movement path estimation device It can be implemented or distributed via a network. Further, the program can be provided by being stored in a computer readable recording medium.

１００移動経路推定装置
１１０シミュレータ設定処理部
１２０最適化処理部
１３０データ記憶部
１３１シミュレータ設定テーブル
１３２最適化設定テーブル
１３３最適パラメータテーブル
１４０最適化実行部
１４１シミュレータ実行部
１４２次入力パラメータ決定部
１４３最適化制御部
１４４最適化結果保管部
１４５時間帯最適化結果保管部
１５０最適パラメータ処理部
２００外部装置
３００移動経路推定装置
３２０最適化処理部
３３２最適化設定テーブル
３４０最適化実行部
３４６反復制御部 100 travel path estimation device 110 simulator setting processing unit 120 optimization processing unit 130 data storage unit 131 simulator setting table 132 optimization setting table 133 optimum parameter table 140 optimization execution unit 141 simulator execution unit 142 next input parameter determination unit 143 optimization Control unit 144 Optimization result storage unit 145 Time zone optimization result storage unit 150 Optimal parameter processing unit 200 External device 300 Movement path estimation device 320 Optimization processing unit 332 Optimization setting table 340 Optimization execution unit 346 Repeating control unit

Claims

Performing a simulation in which a plurality of agents move in each time zone based on input parameters whose elements are the number of agents passing through the paths in the estimated time zone for each of a plurality of paths in the real environment; From the result of the simulation, in the predetermined number of time zones including the estimation target time zone and the next time zone of the estimation target time zone, the calculated value regarding the agent for each of the observation points, and the observation value in the real environment A simulator execution unit that calculates an objective function representing an observation error of
A next input parameter determination unit that determines a next input parameter based on the input parameter and the calculation result of the objective function;
An optimization control unit for obtaining an input parameter for optimizing the objective function, causing the execution by the simulator execution unit and the determination by the next input parameter determination unit to be repeated until a predetermined repetition condition is satisfied;
A movement path estimation device including

And an iterative control unit that causes the optimization control unit to repeatedly obtain the input parameter as each of the all time zones as the estimated time zone.
The iterative control unit causes the optimization control unit to repeatedly and repeatedly determine the input parameter as the estimation target time zone, until the predetermined repetitive condition is satisfied. The movement path estimation device according to 1.

The simulator execution unit is a predetermined number of time periods including a predetermined number of time zones before the estimation target time zone, the estimation target time zone, and a time zone next to the estimation target time zone among all time zones. The movement path estimation apparatus according to claim 1, wherein the simulation in which a plurality of agents move in a band is executed.

The simulator execution unit moves a plurality of agents in each time zone based on input parameters whose elements are the number of agents passing through the paths in the estimation target time zone for each of a plurality of routes in the real environment The simulation is executed, and from the result of the simulation, in the predetermined number of time zones including the estimation target time zone and the next time zone of the estimation target time zone, the calculated values regarding the agent for each of the observation points and the real environment Computing an objective function representing the observation error with the observation at
Determining a next input parameter based on the input parameter and the calculation result of the objective function;
Obtaining an input parameter for optimizing the objective function by causing the optimization control unit to repeat the execution by the simulator execution unit and the determination by the next input parameter determination unit until a predetermined repetition condition is satisfied;
Travel path estimation method including:

The iterative control unit further includes the step of causing the optimization control unit to repeatedly obtain the input parameter by setting each of all the time zones as the estimation target time zone,
The step of repeatedly causing the optimization control unit to repeatedly obtain the input parameter as the estimation target time period until the predetermined repetition condition is satisfied. The movement path estimation method of 4.

The step of executing includes a predetermined number of time periods including a predetermined number of time zones before the estimation target time zone, the estimation target time zone, and a next time zone of the estimation target time zone among all time zones. The movement path estimation method according to claim 4 or 5, wherein the simulation in which a plurality of agents move in a band is executed.

The program for functioning a computer as each part of the movement path | route estimation apparatus in any one of Claims 1 thru | or 3.