JP5560154B2

JP5560154B2 - Model parameter estimation apparatus and program thereof

Info

Publication number: JP5560154B2
Application number: JP2010222017A
Authority: JP
Inventors: 俊枝三須; 善明鹿喰; 慎一境田
Original assignee: Japan Broadcasting Corp
Current assignee: Japan Broadcasting Corp
Priority date: 2010-09-30
Filing date: 2010-09-30
Publication date: 2014-07-23
Anticipated expiration: 2030-09-30
Also published as: JP2012078980A

Description

The present invention relates to a model parameter estimation apparatus that estimates the parameters of a model by a sequential Monte Carlo method, and to a program therefor.

Conventionally, the sequential Monte Carlo method (particle filter) has been used to estimate the current state from past states under a given observation model. The method takes past observation information as sequential input and sequentially computes the current state (the model parameters) that fits the observation model. Specifically, it sets up a plurality of hypotheses (particles) that are candidate solutions for the parameters based on the observation information, and estimates the parameters by using those hypotheses to form a Monte Carlo approximation of the probability distribution function of the model parameters (see Non-Patent Document 1).
Separately, the present applicant has proposed a technique for executing the sequential Monte Carlo method in an autonomous distributed environment (see Patent Document 1). In this technique, a plurality of distributed agents (state estimation apparatuses) each take charge of a different observation and integrate their state estimates through communication.

JP 2007-72709 A

A. Doucet, N. de Freitas and N. Gordon: "Sequential Monte Carlo Methods in Practice," Springer, 2001. ISBN 978-0387951461.

However, in the sequential Monte Carlo method described in Non-Patent Document 1, the accuracy of the Monte Carlo approximation becomes insufficient unless enough hypotheses are used for the dimensionality and spread involved. This is especially so when the model parameters are high-dimensional or when their probability distribution is spread widely over the parameter space.
Thus, when the approximation accuracy is insufficient, the conventional sequential Monte Carlo method falls into a local solution and cannot find the global optimum. Conversely, increasing the number of hypotheses to raise the approximation accuracy increases the computational load correspondingly, so that state estimation can no longer be performed at high speed.

The technique of Patent Document 1, which executes the sequential Monte Carlo method in an autonomous distributed environment, is advantageous in that distributing and integrating the estimates from individual observations enables parallel operation and therefore fast state estimation. However, as noted above, a further improvement has been needed so that each individual state estimation apparatus can perform state estimation at high speed even when there are many hypotheses.

The present invention has been made in view of the above problems, and its object is to provide a model parameter estimation apparatus, and a program therefor, that perform fast and accurate state estimation by a sequential Monte Carlo method.

The present invention was devised to solve the above problems. First, the model parameter estimation apparatus according to claim 1 is a model parameter estimation apparatus that estimates the parameters of a model by a sequential Monte Carlo method, and comprises a hypothesis storage means, a plurality of hypothesis processing means, a control means, and a representative value calculation means.

In this configuration, each of the plurality of hypothesis processing means generates new hypothesis data by sampling hypothesis data based on the proposal distribution of the sequential Monte Carlo method, which is a predetermined distribution of the parameters, and updates the hypothesis data stored in the hypothesis storage means together with the generation time of that hypothesis data. The proposal distribution is a distribution that is known in advance; for example, a probability density function can be used as the proposal distribution. By sampling given hypothesis data based on the proposal distribution in this way, the hypothesis processing means generates hypothesis data whose distribution approximates the proposal distribution.

At this time, the control means holds, for each hypothesis processing means, the state of whether it is currently performing a hypothesis data update. Based on the generation times of the hypothesis data stored in the hypothesis storage means, the control means distributes hypothesis data that have not been updated since the previous update time to the hypothesis processing means that are not performing an update, and thereby operates the plurality of hypothesis processing means. As a result, the plurality of hypothesis processing means are started in parallel, each generates new hypothesis data for an individual item of hypothesis data, and each updates the hypothesis data stored in the hypothesis storage means.
In this way, the control means can assign not-yet-updated hypothesis data, one after another, to whichever hypothesis processing means finishes its update early. Whether a hypothesis processing means is currently performing an update may be determined by any common technique, for example by setting a flag for each hypothesis processing means that is busy.

Then, once all of the hypothesis data stored in the hypothesis storage means have been updated to hypothesis data generated at or after the previous update time, the control means advances the update time to the next update time. As a result, the hypothesis data are updated in synchronization at each time point.
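The dispatch-and-synchronize behaviour described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the store layout (a list of `(value, generation_time)` pairs), the names `propose` and `run_time_step`, the Gaussian random-walk proposal, and the use of a thread pool as the set of hypothesis processing means are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor
import random


def propose(x, rng):
    # Hypothetical proposal distribution: Gaussian random walk around x.
    return x + rng.gauss(0.0, 1.0)


def run_time_step(store, t, n_workers=4, seed=0):
    """Update every hypothesis older than time t; return the next update time.

    store is a list of (value, generation_time) pairs (assumed layout).
    """
    # Hypotheses whose generation time is older than t are still pending.
    pending = [k for k, (_, stamp) in enumerate(store) if stamp < t]

    def work(k):
        # One RNG per (seed, t, k) so workers share no random state.
        rng = random.Random(f"{seed}-{t}-{k}")
        x, _ = store[k]
        # Store the new value together with its generation time.
        store[k] = (propose(x, rng), t)

    # The pool gives the next pending hypothesis to whichever worker is free.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        list(pool.map(work, pending))

    # Advance the update time only when every hypothesis has reached time t.
    if all(stamp >= t for _, stamp in store):
        return t + 1
    return t
```

Calling `run_time_step(store, t)` repeatedly with the returned time reproduces the per-time-step synchronization: no hypothesis is updated to time t + 1 until all of them carry time t.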

The representative value calculation means then calculates a statistical representative value of the plurality of hypothesis data stored in the hypothesis storage means as the estimate of the parameter solution. The representative value may be calculated at any timing; for example, the representative value calculation means may be operated at fixed time intervals or in response to an external instruction signal.

The model parameter estimation apparatus according to claim 2 is the apparatus according to claim 1, wherein the hypothesis storage means stores, in association with each item of hypothesis data, a weight indicating the importance of that hypothesis data, and wherein each hypothesis processing means comprises a Monte Carlo sampling means, an observation simulation means, a likelihood evaluation means, and a weight update means.

In this configuration, the Monte Carlo sampling means of the hypothesis processing means generates hypothesis data, which are candidate solutions for the parameters, by sampling from the proposal distribution. The observation simulation means of the hypothesis processing means then generates, from the hypothesis data produced by the Monte Carlo sampling means, a simulated observation value, that is, the observation value predicted after the update by a predetermined observation model.

The likelihood evaluation means of the hypothesis processing means then receives an observation value from outside and calculates a likelihood indicating how closely the input observation value (the observation value input at the current time) and the simulated observation value generated by the observation simulation means agree. For example, if the observation values are vectors, the smaller the distance between the two vectors, the greater the degree of agreement. The closer the input observation value and the simulated observation value, the more plausible the hypothesis data that produced the simulated observation value.

Accordingly, the weight update means of the hypothesis processing means updates the weight, based on the likelihood calculated by the likelihood evaluation means, so that the weight becomes larger the closer the input observation value is to the simulated observation value.
The representative value calculation means then calculates the representative value as the weighted average of the plurality of hypothesis data stored in the hypothesis storage means, using the weights associated with the hypothesis data.
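As an illustration of the likelihood evaluation, weight update and weighted-average steps, the sketch below uses one possible set of choices. The Gaussian kernel on Euclidean distance, the `sigma` parameter, the multiplicative weight update, and all function names are assumptions for the example; the text only requires that the weight grow as the input and simulated observation values get closer.

```python
import math


def likelihood(observed, simulated, sigma=1.0):
    # Assumed likelihood: Gaussian kernel on the squared Euclidean distance
    # between the input observation and the simulated observation.
    d2 = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    return math.exp(-d2 / (2.0 * sigma ** 2))


def update_weight(prev_weight, observed, simulated, sigma=1.0):
    # Assumed update rule: scale the previous weight by the likelihood,
    # so closer observations yield larger weights.
    return prev_weight * likelihood(observed, simulated, sigma)


def weighted_mean(hypotheses, weights):
    # Representative value: weighted average of the hypothesis vectors.
    total = sum(weights)
    dim = len(hypotheses[0])
    return [
        sum(w * h[i] for h, w in zip(hypotheses, weights)) / total
        for i in range(dim)
    ]
```

Dividing by the sum of the weights inside `weighted_mean` means the weights do not have to be normalized beforehand.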

The model parameter estimation apparatus according to claim 3 is the apparatus according to claim 1 or claim 2, further comprising a resampling means.

In this configuration, the resampling means resamples the plurality of hypothesis data stored in the hypothesis storage means according to a predetermined probability density distribution. Resampling in this way reorganizes the hypothesis data into a distribution close to that probability density distribution, so that unevenly distributed hypothesis data can be eliminated.
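A minimal sketch of one common resampling scheme follows. It assumes that the "predetermined probability density distribution" is the discrete distribution defined by the current weights (multinomial resampling) and that weights are reset to uniform afterwards; neither detail is specified in the text, and the function name `resample` is hypothetical.

```python
import random


def resample(hypotheses, weights, rng=None):
    """Draw len(hypotheses) new hypotheses with probability proportional to weight."""
    rng = rng or random.Random()
    k = len(hypotheses)
    # random.choices samples with replacement according to the given weights.
    drawn = rng.choices(hypotheses, weights=weights, k=k)
    # Copy each drawn hypothesis so duplicates can later evolve independently,
    # and reset all weights to the uniform value 1/k.
    return [list(h) for h in drawn], [1.0 / k] * k
```

Hypotheses with negligible weight are unlikely to be drawn, while heavily weighted ones tend to be duplicated, which is what removes the uneven concentration of weight.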

The model parameter estimation apparatus according to claim 4 is the apparatus according to claim 3, wherein the control means operates the resampling means at fixed time intervals, or when the plurality of hypothesis data stored in the hypothesis storage means are unevenly distributed in the parameter space to a degree that meets or exceeds a predetermined condition.

In this configuration, the control means operates the resampling means at fixed time intervals or when the plurality of hypothesis data are unevenly distributed in the parameter space. The hypothesis data are thereby reorganized into a distribution close to the probability density distribution, and unevenly distributed hypothesis data can be eliminated.

Furthermore, the model parameter estimation apparatus according to claim 5 is the apparatus according to claim 2, further comprising a resampling means, wherein the control means operates the resampling means when, among the plurality of hypothesis data stored in the hypothesis storage means, the number of hypothesis data whose weight is smaller than a predetermined minimum value exceeds a predetermined number.

In this configuration, the control means operates the resampling means when the number of low-weight hypothesis data increases. The hypothesis data are thereby reorganized into a distribution close to the probability density distribution, and low-weight hypothesis data can be eliminated.

The model parameter estimation program according to claim 6 causes a computer to function as a plurality of hypothesis processing means, a control means, and a representative value calculation means in order to estimate the parameters of a model by a sequential Monte Carlo method.

In this configuration, the model parameter estimation program causes each of the plurality of hypothesis processing means to generate new hypothesis data by sampling hypothesis data based on the proposal distribution of the sequential Monte Carlo method, which is a predetermined distribution of the parameters, and to update the hypothesis data stored in the hypothesis storage means together with the generation time of that hypothesis data.
At this time, the control means holds, for each hypothesis processing means, the state of whether it is currently performing a hypothesis data update. Based on the generation times of the hypothesis data stored in the hypothesis storage means, the control means distributes hypothesis data that have not been updated since the previous update time to the hypothesis processing means that are not performing an update, and thereby operates the plurality of hypothesis processing means. As a result, the plurality of hypothesis processing means are started in parallel, each generates new hypothesis data for an individual item of hypothesis data, and each updates the hypothesis data stored in the hypothesis storage means.

Then, once all of the hypothesis data stored in the hypothesis storage means have been updated to hypothesis data generated at or after the previous update time, the control means advances the update time to the next update time.
The representative value calculation means then calculates a statistical representative value of the plurality of hypothesis data stored in the hypothesis storage means as the estimate of the parameter solution.

The present invention provides the following advantageous effects.
According to the inventions of claims 1 and 6, the plurality of hypothesis data of the sequential Monte Carlo method can be shared out, one item of hypothesis data at a time, among the individual hypothesis processing means, so that parallel processing is achieved. The individual hypothesis processing means can therefore be executed on a parallel computer, a computer with a multi-core CPU, or the like, which allows even a large amount of hypothesis data to be processed at high speed and the parameters to be estimated accurately.

Also according to the inventions of claims 1 and 6, idle periods in which a hypothesis processing means is not operating can be eliminated, so that the plurality of hypothesis processing means operate efficiently. Because the load is distributed according to the load state of each hypothesis processing means, with more work going to those able to operate faster, the parameters can be estimated at high speed.

According to the invention of claim 2, weights can be attached to the hypothesis data according to the observation values, so that the validity of each item of hypothesis data is reflected in the parameter estimate. Accurate parameter estimation therefore becomes possible even with a small amount of hypothesis data, and the parameters can be estimated at higher speed.

According to the inventions of claims 3 to 5, the hypothesis data can be reorganized into a distribution close to the probability density distribution. This prevents the hypothesis data from becoming unevenly distributed in the parameter space and allows accurate parameters to be estimated with less hypothesis data.

A block diagram showing the overall configuration of the model parameter estimation apparatus according to the embodiment of the present invention.
A block diagram showing the configuration of the hypothesis processing means of the model parameter estimation apparatus according to the embodiment of the present invention.
A diagram showing the contents of the state management table for the hypothesis processing means, managed by the control means of the model parameter estimation apparatus according to the embodiment of the present invention.
A diagram showing the stored contents of the hypothesis storage means of the model parameter estimation apparatus according to the embodiment of the present invention.
A flowchart showing the overall operation of the model parameter estimation apparatus according to the embodiment of the present invention.
A flowchart showing the hypothesis parallel processing operation of the model parameter estimation apparatus according to the embodiment of the present invention.

Embodiments of the present invention are described below with reference to the drawings.
[Configuration of the model parameter estimation apparatus]
First, the configuration of the model parameter estimation apparatus according to the embodiment of the present invention is described with reference to FIG. 1. The model parameter estimation apparatus 1 sequentially estimates the parameters (model parameters) that fit a predetermined model, based on observation values of an observation target that are input from outside.

Here, an observation target is an object or event whose state changes, for example through motion or fluctuation. Moving persons and vehicles, and fluctuating quantities such as stock prices, are possible targets. For example, with a person as the observation target and the position of the person in a camera image as the observation value, the model parameter estimation apparatus 1 can estimate the person's three-dimensional position, velocity, and so on as model parameters.
Here, the model parameter estimation apparatus 1 estimates the model parameter x_t at a time t (for example, t is an integer). The time t may be a physical time with the dimension of time, or it may simply be an index denoting a processing step. For convenience, the term "time" is used below.

Here, a model is a process whose state (the model parameters), as estimated from the observation values of the observation target, can be approximated by a probability density distribution. The model parameters are one or more data items obtained by the internal processing of the model. Their mathematical representation is arbitrary (scalar, vector, matrix, tensor, function, and so on), but here they are represented as a vector.

Here, the model parameter estimation apparatus 1 generates K_t items of hypothesis data (hereinafter, hypotheses; K_t is a natural number) that are candidate solutions for the model parameter x_t at time t. It operates so that the K_t pairs consisting of the k-th hypothesis x_t^(k) at time t (k = 0, 1, …, K_t - 1) and the weight w_t^(k) for that hypothesis approximate the probability density distribution p(x_t) of the model parameter x_t, as given by equation (1) below. Here δ is the Dirac delta function. Because a hypothesis x_t^(k) is a candidate for the model parameter, it has the same dimension as the model parameter and is represented as a vector.
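For reference, the weighted-hypothesis approximation described in this paragraph has the following standard form in the particle filter literature. It is written here from the definitions in the text (K_t hypotheses x_t^(k), weights w_t^(k), Dirac delta δ), under the assumption that the weights are normalized so that they sum to 1; the exact normalization used in the patent's equation (1) is not confirmed by this text.

```latex
p(x_t) \approx \sum_{k=0}^{K_t-1} w_t^{(k)} \, \delta\!\left(x_t - x_t^{(k)}\right),
\qquad \sum_{k=0}^{K_t-1} w_t^{(k)} = 1
```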

Each component of the model parameter estimation apparatus 1 is described below with reference to FIG. 1.
As shown in FIG. 1, the model parameter estimation apparatus 1 here comprises a control means 10, a hypothesis storage means 20, a plurality (N) of hypothesis processing means 30 (30_1, 30_2, …, 30_N), a resampling means 40, and a representative value calculation means 50.

The control means 10 controls access to the data (hypotheses and weights) in the hypothesis storage means 20. It also controls the operation of, and exchanges data with, the plurality of hypothesis processing means 30, the resampling means 40, and the representative value calculation means 50 according to their progress and completion status.

Specifically, the control means 10 searches the N hypothesis processing means 30_1, 30_2, …, 30_N for a hypothesis processing means 30 that is not operating (is free), and performs scheduling that distributes the K_t hypotheses (the hypothesis group) stored in the hypothesis storage means 20, one hypothesis at a time, to the free hypothesis processing means 30. The control means 10 also operates the resampling means 40 so as to resample the hypotheses updated by the hypothesis processing means 30_1, 30_2, …, 30_N, and operates the representative value calculation means 50 so as to calculate a representative value of the model parameters.
Here, the control means 10 comprises a hypothesis access control means 11, a hypothesis parallel processing control means 12, a resampling control means 13, and a representative value calculation control means 14.

The hypothesis access control means 11 controls access (reading and writing) to the hypotheses and weights stored in the hypothesis storage means 20. Here, the hypothesis processing means 30, the resampling means 40, the representative value calculation means 50, and each means inside the control means 10 read and write hypotheses and weights in the hypothesis storage means 20 through this hypothesis access control means 11.

The hypothesis parallel processing control means 12 searches for free hypothesis processing means 30 and performs scheduling that allocates (distributes) the plurality (K_t) of hypotheses stored in the hypothesis storage means 20 to them, one hypothesis at a time.
At a given time t, the hypothesis parallel processing control means 12 operates a free hypothesis processing means 30, one that is not performing hypothesis update processing, so as to update a hypothesis and weight of time (t-1) that have not yet been updated among the hypotheses stored in the hypothesis storage means 20, and notifies it of an identification number (for example, k) that specifies the not-yet-updated hypothesis. The hypothesis processing means 30 so instructed generates the k-th hypothesis and weight at time t from the k-th hypothesis and weight at time (t-1) and writes them to the hypothesis storage means 20, thereby updating the hypothesis and weight.

Whether each of the hypothesis processing means 30_1, 30_2, …, 30_N is free can be determined by a flag. For example, the hypothesis parallel processing control means 12 sets the flag corresponding to a hypothesis processing means 30 when it instructs that means to update a hypothesis and weight, and resets the flag when it receives notification that the update is complete. For example, the hypothesis parallel processing control means 12 holds and manages these states in a state management table such as that shown in FIG. 3, which associates with each of the hypothesis processing means 30_1, 30_2, …, 30_N a value indicating whether it is operating (for example, "1") or free (for example, "0"). The state management table may be placed in a memory or the like (not shown). As shown in FIG. 3, the hypothesis parallel processing control means 12 also records, for each means in the operating state, which hypothesis it is processing, as a hypothesis number k.
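The state management table described above can be sketched as a small data structure. The class name `StateTable`, its method names, and the use of a dictionary keyed by unit number are illustrative assumptions; only the flag values ("1" operating, "0" free) and the recorded hypothesis number k come from the text.

```python
class StateTable:
    """Per-unit busy flag and hypothesis number (hypothetical layout)."""

    def __init__(self, n_units):
        # unit number -> (flag, hypothesis number); 1 = operating, 0 = free.
        self.rows = {n: (0, None) for n in range(1, n_units + 1)}

    def find_free(self):
        # Return the lowest-numbered free unit, or None if all are operating.
        for n, (flag, _) in self.rows.items():
            if flag == 0:
                return n
        return None

    def assign(self, n, k):
        # Set the flag when unit n is instructed to update hypothesis k.
        self.rows[n] = (1, k)

    def complete(self, n):
        # Reset the flag when unit n reports that its update is complete.
        self.rows[n] = (0, None)
```

A scheduler would call `find_free()`, then `assign(n, k)` with the next not-yet-updated hypothesis number, and `complete(n)` on each completion notice.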

In this way, by having free hypothesis processing means 30 update hypotheses and weights one after another, the hypothesis parallel processing control means 12 enables the plurality of hypothesis processing means 30 to process the hypotheses in parallel, one hypothesis each.
The hypothesis parallel processing control means 12 then advances the time once all hypotheses stored in the hypothesis storage means 20 have been updated to hypotheses of the next time, time t.
Here, when all hypotheses stored in the hypothesis storage means 20 have been updated to hypotheses of time t, the hypothesis parallel processing control means 12 notifies the resampling control means 13 to that effect; when resampling of the hypotheses is then complete, it notifies the representative value calculation control means 14 to that effect.

再標本化制御手段１３は、仮説記憶手段２０に記憶されている複数の仮説（仮説群）を再標本化する再標本化手段４０に対して、起動制御を行うものである。
この再標本化制御手段１３は、仮説並列処理制御手段１２から、仮説記憶手段２０に記憶されている仮説が時刻ｔの仮説にすべて更新された旨が通知された段階、すなわち、一定時間間隔（ここでは時刻ｔ）で再標本化手段４０を起動する。 The resampling control means 13 performs activation control on the resampling means 40 for resampling a plurality of hypotheses (hypotheses group) stored in the hypothesis storage means 20.
The resampling control means 13 activates the resampling means 40 when the hypothesis parallel processing control means 12 notifies it that all the hypotheses stored in the hypothesis storage means 20 have been updated to the hypotheses at time t, that is, at a fixed time interval (here, at each time t).

なお、再標本化手段４０は、必ずしも定期的に起動する必要はなく、例えば、再標本化制御手段１３は、仮説記憶手段２０に記憶されている複数の仮説（仮説群）の状態を検出し、その状態が予め定めた条件を満たした場合に起動することとしてもよい。
例えば、この条件として、再標本化制御手段１３は、仮説記憶手段２０に記憶されている仮説群における重み配分が極端に偏在したことを条件として再標本化手段４０を起動する。なお、重み配分が偏在しているか否かの判定は、仮説記憶手段２０に記憶されている重み値集合の統計量（例えば、重み値の分散、重み値の標準偏差、重み値の最大値と最小値の差など）や、重み値集合を大津の二値化手法などにより２クラスに分けたときのクラス内分散とクラス間分散に基づき計算される数量（例えば、クラス間分散とクラス内分散の比によって定義される分離度）等によって判定することができる。
また、前記条件として、予め定めた最小値よりも重みの小さい仮説が、予め定めた数より多くなったことを条件としてもよい。 Note that the resampling means 40 does not necessarily have to be activated periodically. For example, the resampling control means 13 detects the states of a plurality of hypotheses (hypotheses group) stored in the hypothesis storage means 20. It may be activated when the state satisfies a predetermined condition.
For example, the resampling control means 13 may activate the resampling means 40 on the condition that the weight distribution in the hypothesis group stored in the hypothesis storage means 20 has become extremely skewed. Whether the weight distribution is skewed can be judged from a statistic of the set of weight values stored in the hypothesis storage means 20 (for example, the variance of the weights, the standard deviation of the weights, or the difference between the maximum and minimum weights), or from a quantity computed from the within-class variance and between-class variance obtained when the set of weight values is split into two classes by, for example, Otsu's binarization method (for example, the degree of separation defined as the ratio of the between-class variance to the within-class variance).
The condition may be that the number of hypotheses having a weight smaller than a predetermined minimum value is greater than a predetermined number.
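The two kinds of trigger conditions above can be sketched as follows. This is a rough illustration under stated assumptions: the function names and the threshold parameters (`var_threshold`, `w_min`, `max_count`) are hypothetical, the text does not say how such thresholds are chosen, and the Otsu-based separability measure is omitted.

```python
from statistics import pvariance
from typing import Sequence


def weights_are_skewed(weights: Sequence[float], var_threshold: float) -> bool:
    """Statistic-based test: True when the weight variance exceeds a threshold."""
    return pvariance(weights) > var_threshold


def too_many_small_weights(weights: Sequence[float], w_min: float, max_count: int) -> bool:
    """Count-based test: True when more than max_count weights fall below w_min."""
    return sum(1 for w in weights if w < w_min) > max_count


def should_resample(weights: Sequence[float], var_threshold: float,
                    w_min: float, max_count: int) -> bool:
    """Trigger resampling when either condition holds (combining them is an assumption)."""
    return (weights_are_skewed(weights, var_threshold)
            or too_many_small_weights(weights, w_min, max_count))
```

The standard deviation or the max-min spread named in the text could be substituted for the variance without changing the structure.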

代表値演算制御手段１４は、仮説記憶手段２０に記憶されている複数の仮説から、代表値を生成する代表値演算手段５０に対して、起動制御を行うものである。
この代表値演算制御手段１４は、仮説並列処理制御手段１２から、仮説記憶手段２０に記憶されている仮説が再標本化された旨を通知された段階、すなわち、一定時間間隔（ここでは時刻ｔ）で代表値演算手段５０を起動する。 The representative value calculation control means 14 performs activation control on the representative value calculation means 50 that generates a representative value from a plurality of hypotheses stored in the hypothesis storage means 20.
The representative value calculation control means 14 activates the representative value calculation means 50 when the hypothesis parallel processing control means 12 notifies it that the hypotheses stored in the hypothesis storage means 20 have been resampled, that is, at a fixed time interval (here, at each time t).

なお、代表値演算手段５０は、必ずしも定期的に起動する必要はなく、例えば、モデルパラメータ推定装置１を操作するユーザから要求があった場合、すなわち、外部から図示を省略した入力手段を介して代表値（推定値）を出力する旨の指示信号が入力されたときのみに起動することとしてもよい。 The representative value calculation means 50 does not necessarily have to be started periodically. For example, when there is a request from a user who operates the model parameter estimation device 1, that is, through an input means not shown from the outside. It may be activated only when an instruction signal for outputting a representative value (estimated value) is input.

仮説記憶手段２０は、仮説処理手段３０によって演算されるモデルパラメータの仮説と、その仮説の重要度を示す重みとを、時刻に対応付けて複数記憶するものである。この仮説記憶手段２０は、ハードディスク、半導体メモリ等、一般的な記憶装置で構成することができる。ここでは、時刻ｔにおける仮説の総数をＫ_ｔ個（Ｋ_ｔは自然数）とし、そのｋ番目（ｋ＝０，１，…，Ｋ_ｔ−１）の仮説をｘ_ｔ ^（ｋ）とし、ｋ番目の重みをｗ_ｔ ^（ｋ）とする。この仮説記憶手段２０には、図４に示すように、ｋ番目を示す仮説番号ごとに、仮説と、重みと、当該仮説が生成された時刻とを対応付けて記憶しておくこととする。
なお、初期状態（時刻ｔ＝０）においては、仮説記憶手段２０に、事前にＫ_ｔ個の仮説と重みが予め記憶されているものとする。
この仮説記憶手段２０に記憶されている仮説および重みは、制御手段１０によって読み出され、仮説処理手段３０によって新たな仮説および重みが生成された段階で、制御手段１０によって更新される。 The hypothesis storage means 20 stores a plurality of model parameter hypotheses calculated by the hypothesis processing means 30 and a weight indicating the importance of the hypothesis in association with the time. The hypothesis storage means 20 can be composed of a general storage device such as a hard disk or a semiconductor memory. Here, the total number of hypotheses at time t is K _t (K _t is a natural number), the k th (k = 0, 1,..., K _t −1) hypothesis is x _t ^(k) , and the k th Let w _t ^(k) be the weight of. As shown in FIG. 4, the hypothesis storage means 20 stores a hypothesis, a weight, and a time when the hypothesis is generated in association with each hypothesis number indicating the kth.
In the initial state (time t = 0), it is assumed that K _t hypotheses and weights are stored in advance in the hypothesis storage unit 20.
The hypothesis and weight stored in the hypothesis storage unit 20 are read by the control unit 10 and updated by the control unit 10 when a new hypothesis and weight are generated by the hypothesis processing unit 30.

仮説処理手段３０は、過去の仮説あるいは観測値から、予め定めた逐次モンテカルロ法の提案分布に基づく標本化（標本抽出）を行うことで、現時点における仮説および重みを生成するものである。なお、この仮説処理手段３０は、複数（３０_１，３０_２，…，３０_Ｎ）備えられ、それぞれ並列に動作する。
また、それぞれの仮説処理手段３０_１，３０_２，…，３０_Ｎは、制御手段１０から、時刻（ｔ−１）において推定した仮説ｘ_ｔ−１ ^（ｋ）を特定する識別番号（ここでは、仮説番号ｋ）を通知されることで、時刻ｔにおけるｋ番目の仮説ｘ_ｔ ^（ｋ）および重みｗ_ｔ ^（ｋ）を生成する。 The hypothesis processing means 30 generates hypotheses and weights at the present time by performing sampling (sampling) based on a proposed distribution of a predetermined sequential Monte Carlo method from past hypotheses or observation values. The hypothesis processing means 30 is provided in a plurality (30 ₁ , 30 ₂ ,..., 30 _N ) and operates in parallel.
Each of the hypothesis processing means 30_1, 30_2, ..., 30_N is notified by the control means 10 of an identification number (here, the hypothesis number k) that identifies the hypothesis x_{t−1}^(k) estimated at time (t−1), and in response generates the k-th hypothesis x_t^(k) and weight w_t^(k) at time t.

ここで、図２を参照（適宜図１参照）して、仮説処理手段３０の詳細な構成について説明する。
図２に示すように、仮説処理手段３０は、モンテカルロ標本化手段３１と、観測模擬手段３２と、尤度評価手段３３と、重み更新手段３４と、を備えている。 Here, the detailed configuration of the hypothesis processing means 30 will be described with reference to FIG. 2 (and FIG. 1 as appropriate).
As shown in FIG. 2, the hypothesis processing means 30 includes a Monte Carlo sampling means 31, an observation simulation means 32, a likelihood evaluation means 33, and a weight update means 34.

モンテカルロ標本化手段３１は、制御手段１０から通知される時刻（ｔ−１）におけるｋ番目の仮説ｘ_ｔ−１ ^（ｋ）から、時刻ｔにおける仮説ｘ_ｔ ^（ｋ）を生成するものである。なお、モンテカルロ標本化手段３１は、制御手段１０から通知された時刻（ｔ−１）以前の仮説ｘ_０ ^（ｋ），ｘ_１ ^（ｋ），…，ｘ_ｔ−１ ^（ｋ）および時刻ｔ以前における観測値ｙ_０，ｙ_１，…，ｙ_ｔの一つ以上から時刻ｔにおける仮説ｘ_ｔ ^（ｋ）を生成することとしてもよい。 The Monte Carlo sampling means 31 generates the hypothesis x_t^(k) at time t from the k-th hypothesis x_{t−1}^(k) at time (t−1) notified from the control means 10. The Monte Carlo sampling means 31 may instead generate the hypothesis x_t^(k) at time t from one or more of the hypotheses x_0^(k), x_1^(k), ..., x_{t−1}^(k) up to time (t−1) notified from the control means 10 and the observation values y_0, y_1, ..., y_t up to time t.

ここでは、モンテカルロ標本化手段３１は、例えば、観測値の履歴（ｙ_０，ｙ_１，…，ｙ_ｔ）と仮説の履歴（ｘ_０ ^（ｋ），ｘ_１ ^（ｋ），…，ｘ_ｔ−１ ^（ｋ））とに基づき、以下の（２）式に示す予め定めた提案分布πから、（３）式に示すように標本抽出を行うことで、時刻ｔの仮説ｘ_ｔ ^（ｋ）を生成する。 Here, for example, based on the history of observation values (y_0, y_1, ..., y_t) and the history of hypotheses (x_0^(k), x_1^(k), ..., x_{t−1}^(k)), the Monte Carlo sampling means 31 generates the hypothesis x_t^(k) at time t by sampling, as shown in Equation (3), from the predetermined proposal distribution π shown in Equation (2) below.

ここで、提案分布とは、対応するモデルから直接標本（サンプル）を抽出する代わりに、別の分布関数（より容易に標本抽出可能な分布関数を用いることが好ましい）を用いて近似を行った分布関数（Ｐｒｏｐｏｓａｌｄｉｓｔｒｉｂｕｔｉｏｎ）である。
また、ここで、標本抽出とは、提案分布πと分布が近似するように、母集団（仮説や観測値の履歴）から、標本をランダムに選択することである。
なお、ここでは、モンテカルロ標本化手段３１は、提案分布πの一例として、以下の（４）式に示すように、確率密度関数ｐ（事前遷移確率）を用いることとする。 Here, the proposed distribution is approximated using another distribution function (preferably a distribution function that can be sampled more easily) instead of extracting a sample (sample) directly from the corresponding model. It is a distribution function (Proposal distribution).
Here, sampling means to randomly select a sample from a population (hypothesis or history of observation values) so that the distribution is approximate to the proposed distribution π.
Here, the Monte Carlo sampling means 31 uses a probability density function p (prior transition probability) as shown in the following equation (4) as an example of the proposed distribution π.

例えば、時刻ｔの仮説ｘ_ｔ ^（ｋ）を生成する際に、当該仮説ｘ_ｔ ^（ｋ）が確率的に無作為（ランダム）に決定される分散共分散行列Σ_ｖのランダムウォークにより状態が遷移する場合、モンテカルロ標本化手段３１は、（４）式の確率密度関数ｐとして、以下の（５）式の確率密度関数ｐ（ｘ_ｔ｜ｘ_ｔ−１）を用いることができる。 For example, when the state transitions by a random walk with variance-covariance matrix Σ_v, so that the hypothesis x_t^(k) at time t is determined stochastically at random, the Monte Carlo sampling means 31 can use the probability density function p(x_t | x_{t−1}) of Equation (5) below as the probability density function p of Equation (4).

ここで、「ｄｉｍ」は、仮説ｘ_ｔの情報の次元数を示す。
この確率密度関数ｐを使用する場合、モンテカルロ標本化手段３１は、制御手段１０から通知される仮説ｘ_ｔ−１ ^（ｋ）から、時刻ｔにおける仮説ｘ_ｔ ^（ｋ）を生成することになる。 Here, "dim" denotes the number of dimensions of the hypothesis x_t.
When this probability density function p is used, the Monte Carlo sampling unit 31 generates a hypothesis x _t ^(k) at time t from the hypothesis x _t-1 ^(k) notified from the control unit 10.
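The random-walk transition can be illustrated with a short sketch. Equation (5) is not reproduced in this text, so the code assumes the usual multivariate Gaussian density with normalizer (2π)^(dim/2)·|Σ_v|^(1/2), and further assumes a diagonal Σ_v so that only the standard library is needed. The function names and the `sigmas` parameter (per-dimension standard deviations) are hypothetical.

```python
import math
import random
from typing import List, Sequence


def sample_random_walk(x_prev: Sequence[float], sigmas: Sequence[float],
                       rng: random.Random) -> List[float]:
    """Draw x_t from a Gaussian centred on x_prev (diagonal covariance assumed)."""
    return [rng.gauss(x, s) for x, s in zip(x_prev, sigmas)]


def random_walk_density(x_t: Sequence[float], x_prev: Sequence[float],
                        sigmas: Sequence[float]) -> float:
    """Evaluate the assumed Gaussian transition density p(x_t | x_prev)."""
    dim = len(x_t)
    # Squared Mahalanobis distance for a diagonal covariance matrix.
    quad = sum(((a - b) / s) ** 2 for a, b, s in zip(x_t, x_prev, sigmas))
    det = math.prod(s * s for s in sigmas)  # determinant of diag(sigmas^2)
    norm = (2.0 * math.pi) ** (dim / 2.0) * math.sqrt(det)
    return math.exp(-0.5 * quad) / norm
```

A full covariance matrix would require a matrix inverse and determinant, for which a linear algebra library is the more natural choice.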

このように、モンテカルロ標本化手段３１は、予め定めた提案分布（あるいは、確率密度関数）から、標本抽出を行うことで、時刻ｔの仮説ｘ_ｔ ^（ｋ）を生成する。そして、モンテカルロ標本化手段３１は、生成した時刻ｔの仮説ｘ_ｔ ^（ｋ）を、制御手段１０を介して仮説記憶手段２０に書き込み、仮説を更新する。また、ここでは、モンテカルロ標本化手段３１は、生成した時刻ｔの仮説ｘ_ｔ ^（ｋ）を観測模擬手段３２に出力する。 In this way, the Monte Carlo sampling means 31 generates a hypothesis x _t ^(k) at time t by sampling from a predetermined proposal distribution (or probability density function). Then, the Monte Carlo sampling unit 31 writes the generated hypothesis x _t ^(k) at time t to the hypothesis storage unit 20 via the control unit 10 and updates the hypothesis. Here, the Monte Carlo sampling means 31 outputs the generated hypothesis x _t ^(k) at time t to the observation simulation means 32.

観測模擬手段３２は、モンテカルロ標本化手段３１で標本化され生成された仮説が、予めモデル化した観測モデルによりどのような観測値として観測されるかを模擬（演算）するものである。
ここでは、この観測模擬手段３２は、モンテカルロ標本化手段３１で標本化され生成された時刻ｔにおけるモデルパラメータ（仮説ｘ_ｔ ^（ｋ））が、観測モデルｐ（ｙ_ｔ｜ｘ_ｔ）により、以下の（６）式に示すように、ｋ番目の観測値ｙ_ｔ ^（ｋ）を模擬結果（模擬観測値）として生成する。 The observation simulation unit 32 simulates (calculates) what observation value the hypothesis sampled and generated by the Monte Carlo sampling unit 31 is observed by the observation model modeled in advance.
Here, the observation simulation means 32 applies the observation model p(y_t | x_t) to the model parameter at time t sampled and generated by the Monte Carlo sampling means 31 (the hypothesis x_t^(k)) and, as shown in Equation (6) below, generates the k-th observation value y_t^(k) as the simulation result (simulated observation value).

より具体的には、この観測モデルｐは、以下の（７）式に示すように、模擬観測値ｙ_ｔが、モデルパラメータｘ_ｔを関数ｈ_ｔにより算出された結果として生成されるモデルとして定義してもよい。 More specifically, as shown in Equation (7) below, the observation model p may be defined as a model in which the simulated observation value y_t is generated as the result of applying a function h_t to the model parameter x_t.

この関数ｈ_ｔは、例えば、観測モデルが、等加速度運動モデル、等速度運動モデルである場合、時刻（ｔ−１）の位置から、等加速度、等速度によって時刻ｔの位置を算出する関数である。すなわち、関数ｈ_ｔは、観測モデルに応じて、予め定めた関数によって定義しておけばよい。 For example, when the observation model is a constant-acceleration or constant-velocity motion model, the function h_t calculates the position at time t from the position at time (t−1) using the constant acceleration or constant velocity. In other words, the function h_t may be defined in advance as a function suited to the observation model.

このように、観測模擬手段３２は、モンテカルロ標本化手段３１で標本化され生成された仮説を、観測モデルにより模擬することで、模擬観測値ｙ_ｔ ^（ｋ）を生成する。そして、観測模擬手段３２は、模擬（演算）した結果の観測値（模擬観測値ｙ_ｔ ^（ｋ））を尤度評価手段３３に出力する。 In this way, the observation simulation unit 32 generates the simulated observation value y _t ^(k) by simulating the hypothesis sampled and generated by the Monte Carlo sampling unit 31 using the observation model. Then, the observation simulation means 32 outputs an observation value (simulation observation value y _t ^(k) ) as a result of simulation (calculation) to the likelihood evaluation means 33.

尤度評価手段３３は、外部から入力される観測値（入力観測値）に基づいて、観測模擬手段３２で生成された模擬結果（模擬観測値）がどれだけ尤もらしい結果であるか（どれだけ近似した値であるか）を示す尤度を算出するものである。
ここでは、尤度評価手段３３は、観測模擬手段３２で生成された模擬結果である時刻ｔにおける模擬観測値ｙ_ｔ ^（ｋ）と、外部から入力される時刻ｔにおける入力観測値ｙ_ｔとを比較し、尤度Ｌ_ｔ ^（ｋ）を算出する。この尤度Ｌ_ｔ ^（ｋ）は、好ましくは、模擬観測値ｙ_ｔ ^（ｋ）と入力観測値ｙ_ｔとが近いほど大きな値をとることとする。 Based on the observation value input from the outside (input observation value), the likelihood evaluation means 33 calculates a likelihood indicating how plausible the simulation result (simulated observation value) generated by the observation simulation means 32 is, that is, how closely it approximates the input observation value.
Here, the likelihood evaluation means 33 compares the simulated observation value y_t^(k) at time t, which is the simulation result generated by the observation simulation means 32, with the input observation value y_t at time t input from the outside, and calculates the likelihood L_t^(k). Preferably, the likelihood L_t^(k) takes a larger value the closer the simulated observation value y_t^(k) is to the input observation value y_t.

例えば、尤度評価手段３３は、以下の（８）式に示すマハラノビス（Ｍａｈａｌａｎｏｂｉｓ）距離に基づくガウス関数によって尤度Ｌ_ｔ ^（ｋ）を定義することができる。 For example, the likelihood evaluation means 33 can define the likelihood L _t ^(k) by a Gaussian function based on the Mahalanobis distance shown in the following equation (8).

ここで、Σ_ｙは、観測空間を正規化してマハラノビス距離を定義するための分散共分散行列であり、Ｔは転置を示す。これによって、尤度Ｌ_ｔ ^（ｋ）は、模擬観測値ｙ_ｔ ^（ｋ）と入力観測値ｙ_ｔとが近いほど大きな値となる。 Here, Σ _y is a variance covariance matrix for defining the Mahalanobis distance by normalizing the observation space, and T indicates transposition. As a result, the likelihood L _t ^(k) increases as the simulated observation value y _t ^(k) and the input observation value y _t are closer.
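A Gaussian likelihood built on the Mahalanobis distance can be sketched as below. Equation (8) is not reproduced in this text, so the exact form is an assumption: the code uses exp(−d²/2) without a normalizing constant, and it assumes a diagonal Σ_y (passed as per-dimension `variances`). The function names are hypothetical.

```python
import math
from typing import Sequence


def mahalanobis_sq(y_sim: Sequence[float], y_obs: Sequence[float],
                   variances: Sequence[float]) -> float:
    """Squared Mahalanobis distance (y_sim - y_obs)^T Sigma_y^-1 (y_sim - y_obs), diagonal Sigma_y."""
    return sum((a - b) ** 2 / v for a, b, v in zip(y_sim, y_obs, variances))


def gaussian_likelihood(y_sim: Sequence[float], y_obs: Sequence[float],
                        variances: Sequence[float]) -> float:
    """Unnormalized Gaussian likelihood: 1.0 at a perfect match, decaying with distance."""
    return math.exp(-0.5 * mahalanobis_sq(y_sim, y_obs, variances))
```

With this form the likelihood is largest when the simulated and input observation values coincide, which matches the property stated in the text.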

また、例えば、尤度評価手段３３は、以下の（９）式に示すように、マハラノビス距離に基づく指数関数によって、尤度Ｌ_ｔ ^（ｋ）を定義することとしてもよい。 Further, for example, the likelihood evaluating means 33 may define the likelihood L _t ^(k) by an exponential function based on the Mahalanobis distance as shown in the following equation (9).

このように、尤度評価手段３３は、入力観測値から、観測模擬手段３２で生成された模擬観測値の尤度を算出する。そして、尤度評価手段３３は、算出した尤度を重み更新手段３４に出力する。 In this way, the likelihood evaluation means 33 calculates the likelihood of the simulated observation value generated by the observation simulation means 32 from the input observation value. Then, the likelihood evaluation unit 33 outputs the calculated likelihood to the weight update unit 34.

重み更新手段３４は、尤度評価手段３３で算出された尤度Ｌ_ｔ ^（ｋ）に基づいて、制御手段１０から通知される時刻（ｔ−１）におけるｋ番目の重みｗ_ｔ−１ ^（ｋ）から、時刻ｔにおける重みｗ_ｔ ^（ｋ）を生成し、重みを更新するものである。
すなわち、重み更新手段３４は、尤度評価手段３３で算出された尤度Ｌ_ｔ ^（ｋ）が大きければ、時刻ｔの仮説ｘ_ｔ ^（ｋ）がより尤もらしいものとして、その仮説に対する重みを増加させ、尤度Ｌ_ｔ ^（ｋ）が小さければ、その仮説に対する重みを減少させる。 Based on the likelihood L_t^(k) calculated by the likelihood evaluation means 33, the weight update means 34 generates the weight w_t^(k) at time t from the k-th weight w_{t−1}^(k) at time (t−1) notified from the control means 10, and updates the weight.
That is, if the likelihood L_t^(k) calculated by the likelihood evaluation means 33 is large, the weight update means 34 treats the hypothesis x_t^(k) at time t as more plausible and increases its weight; if the likelihood L_t^(k) is small, it decreases the weight of that hypothesis.

例えば、モンテカルロ標本化手段３１が、前記（３）式により、提案分布πによって時刻ｔの仮説ｘ_ｔ ^（ｋ）を生成した場合であれば、重み更新手段３４は、以下の（１０）式により、時刻（ｔ−１）における重みｗ_ｔ−１ ^（ｋ）から、時刻ｔにおける重みｗ_ｔ ^（ｋ）を生成する。 For example, if the Monte Carlo sampling means 31 generated the hypothesis x_t^(k) at time t from the proposal distribution π according to Equation (3), the weight update means 34 generates the weight w_t^(k) at time t from the weight w_{t−1}^(k) at time (t−1) by Equation (10) below.

また、例えば、モンテカルロ標本化手段３１が、前記（５）式により、提案分布πとして確率密度関数ｐを用いて時刻ｔの仮説ｘ_ｔ ^（ｋ）を生成した場合であれば、重み更新手段３４は、以下の（１１）式により、時刻（ｔ−１）における重みｗ_ｔ−１ ^（ｋ）から、時刻ｔにおける重みｗ_ｔ ^（ｋ）を生成する。 Likewise, if the Monte Carlo sampling means 31 generated the hypothesis x_t^(k) at time t using the probability density function p of Equation (5) as the proposal distribution π, the weight update means 34 generates the weight w_t^(k) at time t from the weight w_{t−1}^(k) at time (t−1) by Equation (11) below.
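Equations (10) and (11) are not reproduced in this text. For orientation only, the standard sequential importance sampling updates that fit the surrounding description are written out below. That the patent's equations take exactly this shape (for instance, proportionality rather than equality, or the exact conditioning of π) is an assumption; the second form agrees with the later statement that the new weight is obtained by multiplying w_{t−1}^(k) by the likelihood L_t^(k).

```latex
% General proposal \pi (assumed form corresponding to Equation (10))
w_t^{(k)} \propto w_{t-1}^{(k)}\,
  \frac{p\!\left(y_t \mid x_t^{(k)}\right)\, p\!\left(x_t^{(k)} \mid x_{t-1}^{(k)}\right)}
       {\pi\!\left(x_t^{(k)} \mid x_{0:t-1}^{(k)},\, y_{0:t}\right)}

% Proposal equal to the prior transition density, \pi = p(x_t \mid x_{t-1})
% (assumed form corresponding to Equation (11); L_t^{(k)} plays the role of p(y_t \mid x_t^{(k)}))
w_t^{(k)} \propto w_{t-1}^{(k)}\, L_t^{(k)}
```

In the second case the transition density cancels between numerator and denominator, which is why only the likelihood remains.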

このように、重み更新手段３４は、時刻（ｔ−１）における重みｗ_ｔ−１ ^（ｋ）を、尤度評価手段３３で算出された尤度Ｌ_ｔ ^（ｋ）に基づいて更新し、時刻ｔにおける重みｗ_ｔ ^（ｋ）を生成する。そして、重み更新手段３４は、更新した重みｗ_ｔ ^（ｋ）を、制御手段１０を介して仮説記憶手段２０に書き込み、重みを更新する。 In this way, the weight update means 34 updates the weight w_{t−1}^(k) at time (t−1) based on the likelihood L_t^(k) calculated by the likelihood evaluation means 33 and generates the weight w_t^(k) at time t. The weight update means 34 then writes the updated weight w_t^(k) to the hypothesis storage means 20 via the control means 10, updating the stored weight.

以上説明したように、仮説処理手段３０は、制御手段１０から通知されるｋ番目の時刻（ｔ−１）における仮説ｘ_ｔ−１ ^（ｋ）およびその重みｗ_ｔ−１ ^（ｋ）を仮説記憶手段２０から読み出し、時刻ｔにおける新たな仮説ｘ_ｔ ^（ｋ）およびその重みｗ_ｔ ^（ｋ）を生成し、制御手段１０を介して仮説記憶手段２０に書き込む。 As described above, the hypothesis processing means 30 reads from the hypothesis storage means 20 the k-th hypothesis x_{t−1}^(k) at time (t−1) and its weight w_{t−1}^(k), as notified from the control means 10, generates a new hypothesis x_t^(k) and its weight w_t^(k) at time t, and writes them to the hypothesis storage means 20 via the control means 10.
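The four stages of one hypothesis update (sampling, observation simulation, likelihood evaluation, weight update) can be strung together as in the sketch below. It is illustrative only and rests on assumptions: a random-walk proposal with diagonal covariance, an unnormalized Gaussian likelihood with diagonal Σ_y, and a weight update by multiplication with the likelihood. The function name `update_hypothesis`, its parameters, and the observation function `h` passed in by the caller are all hypothetical.

```python
import math
import random
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def update_hypothesis(x_prev: Sequence[float], w_prev: float, y_obs: Sequence[float],
                      h: Callable[[Vector], Vector], step_sigmas: Sequence[float],
                      obs_variances: Sequence[float],
                      rng: random.Random) -> Tuple[Vector, float]:
    """Produce (x_t, w_t) for one hypothesis from (x_{t-1}, w_{t-1}) and y_t."""
    # 1. Monte Carlo sampling (means 31): random-walk proposal, assumed diagonal.
    x_new = [rng.gauss(x, s) for x, s in zip(x_prev, step_sigmas)]
    # 2. Observation simulation (means 32): what this hypothesis would be observed as.
    y_sim = h(x_new)
    # 3. Likelihood evaluation (means 33): Gaussian of the squared Mahalanobis distance.
    d2 = sum((a - b) ** 2 / v for a, b, v in zip(y_sim, y_obs, obs_variances))
    likelihood = math.exp(-0.5 * d2)
    # 4. Weight update (means 34): previous weight multiplied by the likelihood.
    return x_new, w_prev * likelihood
```

Because the function reads only its own hypothesis and the shared observation, many such calls can run side by side, which is what makes per-hypothesis parallelism possible.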

なお、制御手段１０において、時刻（ｔ−１）におけるすべての仮説が処理されるまで、仮説処理手段３０には、制御手段１０から時刻（ｔ−１）における他の仮説および重みが通知される場合がある。そこで、仮説処理手段３０は、新たな時刻ｔの仮説が入力されるまでは、観測値ｙ_ｔを図示を省略した記憶手段（メモリ）に保持しておくものとする。
図１に戻って、モデルパラメータ推定装置１の全体構成について説明を続ける。 Until all hypotheses at time (t−1) have been processed under the control means 10, the hypothesis processing means 30 may be notified by the control means 10 of other hypotheses and weights at time (t−1). The hypothesis processing means 30 therefore holds the observation value y_t in a storage means (memory, not shown) until a hypothesis for a new time t is input.
Returning to FIG. 1, the description of the overall configuration of the model parameter estimation apparatus 1 will be continued.

再標本化手段４０は、仮説記憶手段２０に記憶されている仮説を再編成するものである。ここで、仮説を再編成するとは、仮説（モデルパラメータ）が予め定めた分布に近似するように仮説を再度抽出し直すことをいう。この再標本化手段４０は、制御手段１０によって、一定時間間隔（ここでは時刻ｔ）で起動されるものとする。
ここでは、再標本化手段４０は、仮説記憶手段２０に記憶されている以下の（１２）式に示した時刻（ｔ−１）における仮説および重みのＫ_ｔ−１個の対のデータを読み出し、以下の（１３）式に示した時刻ｔにおける仮説および重みのＫ_ｔ個の対のデータに再編成し、仮説記憶手段２０に書き込む。例えば、Ｋ_ｔ＝Ｋ_ｔ−１＝Ｋ_ｔ−２＝…＝Ｋ_０＝定数〔Ｋ〕としても構わない。 The resampling means 40 reorganizes the hypotheses stored in the hypothesis storage means 20. Here, reorganizing a hypothesis means extracting the hypothesis again so that the hypothesis (model parameter) approximates a predetermined distribution. The re-sampling means 40 is assumed to be activated by the control means 10 at regular time intervals (here, time t).
Here, the resampling means 40 reads the data of K _t−1 pairs of hypotheses and weights at time (t−1) shown in the following equation (12) stored in the hypothesis storage means 20. The data is rearranged into K _t pairs of hypotheses and weights at time t shown in the following equation (13), and written in the hypothesis storage means 20. For example, K _t = K _t−1 = K _t−2 =... = K ₀ = constant [K] may be used.

このとき、再編成される（１３）式が近似する分布（確率密度分布）と、（１２）式が近似する分布（確率密度分布）とは、互いに類似することが好ましい。
例えば、再標本化手段４０は、ｋ番目の新しい仮説ｘ_ｔ ^（ｋ）を、以下の（１４）式に示すｘに関する確率密度分布Ｄ（ｘ）から標本を抽出することで生成する。 At this time, it is preferable that the distribution (probability density distribution) approximated by Equation (13) to be reorganized and the distribution (probability density distribution) approximated by Equation (12) are similar to each other.
For example, the resampling means 40 generates the k-th new hypothesis x _t ^(k) by extracting a sample from the probability density distribution D (x) regarding x shown in the following equation (14).

また、δは、ディラック（Ｄｉｒａｃ）のデルタ関数である。
ここで、この確率密度分布Ｄ（ｘ）から標本を抽出する手法について説明する。
まず、再標本化手段４０は、時刻（ｔ−１）の重みｗ_ｔ−１ ^（ｋ）（ｋ＝０，１，…，Ｋ_ｔ−１−１）を正規化し、かつ、ｋに関して累積した数列（（１５）式）を、以下の（１６）式により算出する。 Further, δ is a Dirac delta function.
Here, a method for extracting a sample from the probability density distribution D (x) will be described.
First, the resampling means 40 normalizes the weight w _t−1 ^(k) (k = 0, 1,..., K _t−1 −1) at time (t−1) and accumulates it with respect to k. The numerical sequence (Equation (15)) is calculated by the following Equation (16).

次に、再標本化手段４０は、ｋ番目（ｋ＝０，１，…，Ｋ_ｔ−１）〔Ｋ_ｔは自然数，例えば、Ｋ_ｔ＝Ｋ_ｔ−１＝Ｋ_ｔ−２＝…＝Ｋ_０＝定数〔Ｋ〕〕の標本ｕ^（ｋ）を、以下の（１７）式に示すように、０以上１未満の連続一様分布Ｕから標本抽出する。 Next, the re-sampling means 40 uses the k-th (k = 0, 1,..., K _t −1) [K _t is a natural number, eg, K _t = K _t−1 = K _t−2 =. A sample u ^(k) of ₀ = constant [K]] is sampled from a continuous uniform distribution U of 0 or more and less than 1 as shown in the following equation (17).

なお、この標本抽出は、擬似乱数によってランダムに抽出するが、当該擬似乱数の取り得る値域を０以上１未満に正規化することで、分布Ｕに近似した標本を抽出することができる。
そして、再標本化手段４０は、以下の（１８）式により、ｋ番目の新しい仮説ｘ_ｔ ^（ｋ）を生成する。 This sample extraction is performed randomly using pseudo-random numbers, but a sample approximated to the distribution U can be extracted by normalizing the range that the pseudo-random numbers can take to be 0 or more and less than 1.
Then, the resampling means 40 generates the k-th new hypothesis x _t ^(k) by the following equation (18).

また、再標本化手段４０は、新しい重みｗ_ｔ ^（ｋ）として、以下の（１９）式に示すように、一様な値を設定する。 Further, the resampling means 40 sets a uniform value as shown in the following equation (19) as the new weight w _t ^(k) .
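The resampling procedure described above (normalize and accumulate the weights, draw uniform samples in [0, 1), select hypotheses, assign uniform weights) can be sketched as follows. Equations (14) to (19) are not reproduced in this text, so the selection rule used here, picking the first hypothesis whose cumulative weight exceeds the drawn value, is an assumption about Equation (18), as is the uniform weight 1/K_t for Equation (19). The function name `resample` and its parameters are hypothetical.

```python
import bisect
import itertools
import random
from typing import List, Sequence, Tuple, TypeVar

H = TypeVar("H")


def resample(hypotheses: Sequence[H], weights: Sequence[float], k_new: int,
             rng: random.Random) -> Tuple[List[H], List[float]]:
    """Draw k_new hypotheses with probability proportional to their weights."""
    total = sum(weights)
    # Normalized weights accumulated over k (the cumulative sequence).
    cumulative = list(itertools.accumulate(w / total for w in weights))
    cumulative[-1] = 1.0  # guard against floating-point rounding in the last entry
    new_hypotheses = []
    for _ in range(k_new):
        u = rng.random()  # uniform sample, 0 <= u < 1
        j = bisect.bisect_right(cumulative, u)  # first index with cumulative[j] > u
        new_hypotheses.append(hypotheses[j])
    new_weights = [1.0 / k_new] * k_new  # uniform weights after resampling
    return new_hypotheses, new_weights
```

Hypotheses with large weights tend to be copied several times and those with zero weight are never selected, which is the intended effect of the reorganization.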

このように、再標本化手段４０は、仮説記憶手段２０に記憶されている仮説を再標本化することで、仮説がパラメータ空間内において極端に偏在したり、重みの小さい仮説が多くなることを防止することができる。 In this way, by resampling the hypotheses stored in the hypothesis storage means 20, the resampling means 40 can prevent the hypotheses from becoming extremely unevenly distributed in the parameter space and prevent hypotheses with small weights from becoming numerous.

代表値演算手段５０は、仮説記憶手段２０に記憶されている仮説から代表値を求め、モデルパラメータの推定値として出力するものである。この代表値演算手段５０は、制御手段１０によって、一定時間間隔（ここでは時刻ｔ）で起動されるものとする。
ここでは、代表値演算手段５０は、仮説記憶手段２０に記憶されている以下の（２０）式に示した時刻ｔにおける仮説および重みのＫ_ｔ個の対のデータを読み出し、代表値を求める。 The representative value calculation means 50 obtains a representative value from the hypothesis stored in the hypothesis storage means 20 and outputs it as an estimated value of the model parameter. The representative value calculation means 50 is started by the control means 10 at a constant time interval (here, time t).
Here, the representative value calculation means 50 reads K _t pairs of hypotheses and weights at time t shown in the following equation (20) stored in the hypothesis storage means 20 and obtains representative values.

ここでは、代表値演算手段５０は、（２０）式の仮説および重みを用いて、以下の（２１）式に示す演算により、仮説の加重平均（重み付き平均）を求め、代表値ｘ_ｒｅｐとする。 Here, using the hypotheses and weights of Equation (20), the representative value calculation means 50 obtains the weighted average of the hypotheses by the calculation shown in Equation (21) below and takes it as the representative value x_rep.

なお、代表値演算手段５０は、加重平均に限定されず、統計的処理により種々の値を代表値とすることができる。例えば、代表値演算手段５０は、すべての仮説の相加平均を代表値としたり、最も重みが大きい仮説を代表値としたり等、他の統計的処理によって代表値を求めることとしてもよい。 The representative value calculation means 50 is not limited to the weighted average, and various values can be used as representative values by statistical processing. For example, the representative value calculating means 50 may obtain the representative value by other statistical processing such as using an arithmetic average of all hypotheses as a representative value, or using a hypothesis having the largest weight as a representative value.
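Two of the representative values mentioned above can be sketched in a few lines. Equation (21) is not reproduced in this text; the code assumes the weighted average is divided by the sum of the weights, which changes nothing when the weights are already normalized. The function names are hypothetical.

```python
from typing import List, Sequence


def weighted_mean(hypotheses: Sequence[Sequence[float]],
                  weights: Sequence[float]) -> List[float]:
    """Weighted average of the hypotheses, computed per dimension."""
    total = sum(weights)
    dim = len(hypotheses[0])
    return [sum(w * x[d] for x, w in zip(hypotheses, weights)) / total
            for d in range(dim)]


def max_weight_hypothesis(hypotheses: Sequence[Sequence[float]],
                          weights: Sequence[float]) -> Sequence[float]:
    """Alternative representative value: the hypothesis with the largest weight."""
    k = max(range(len(weights)), key=weights.__getitem__)
    return hypotheses[k]
```

The plain arithmetic mean named in the text is the special case of `weighted_mean` with all weights equal.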

以上説明したようにモデルパラメータ推定装置１を構成することで、逐次モンテカルロ法により、モデルパラメータを推定する際に、確率分布を近似する十分な仮説を用いたとしても、仮説ごとに、複数の仮説処理手段３０によって、並列して演算を行うことができるため、高速に状態推定を行うことができる。
なお、モデルパラメータ推定装置１は、一般的なコンピュータを前記した各手段として機能させるプログラム（モデルパラメータ推定プログラム）により動作させることができる。 By configuring the model parameter estimation device 1 as described above, even when enough hypotheses are used to approximate the probability distribution in estimating model parameters by the sequential Monte Carlo method, the plurality of hypothesis processing means 30 can perform the computation for the individual hypotheses in parallel, so state estimation can be performed at high speed.
Note that the model parameter estimation device 1 can be operated by a program (model parameter estimation program) that causes a general computer to function as each of the above-described means.

［モデルパラメータ推定装置の動作］
次に、図５を参照（構成については、適宜図１参照）して、本発明の実施形態に係るモデルパラメータ推定装置の動作について説明する。
まず、モデルパラメータ推定装置１は、制御手段１０の仮説並列処理制御手段１２によって、時刻ｔを初期化する（ステップＳ１）。例えば、初期値として変数ｔに値“０”を設定する。 [Operation of model parameter estimation device]
Next, the operation of the model parameter estimation device according to the embodiment of the present invention will be described with reference to FIG. 5 (and FIG. 1 as appropriate for the configuration).
First, the model parameter estimation device 1 initializes the time t by the hypothesis parallel processing control means 12 of the control means 10 (step S1). For example, the value “0” is set to the variable t as an initial value.

そして、モデルパラメータ推定装置１は、制御手段１０の仮説並列処理制御手段１２によって、複数の仮説処理手段３０_１，３０_２，…，３０_Ｎに対して、仮説記憶手段２０に記憶されている複数の仮説および重み（仮説群）を分担して処理させる仮説並列処理を行う（ステップＳ２）。この仮説並列処理は、仮説並列処理制御手段１２が、空き状態の仮説処理手段３０を検索し、仮説記憶手段２０に記憶される複数（Ｋ_ｔ個）の仮説を、空き状態の仮説処理手段３０に分配することで、複数の仮説処理手段３０_１，３０_２，…，３０_Ｎで並列に複数の仮説更新処理を行う。 Then, using the hypothesis parallel processing control means 12 of the control means 10, the model parameter estimation device 1 performs hypothesis parallel processing, in which the plurality of hypothesis processing means 30_1, 30_2, ..., 30_N share the processing of the plurality of hypotheses and weights (hypothesis group) stored in the hypothesis storage means 20 (step S2). In this hypothesis parallel processing, the hypothesis parallel processing control means 12 searches for idle (empty) hypothesis processing means 30 and distributes the plurality of (K_t) hypotheses stored in the hypothesis storage means 20 to them, so that the hypothesis processing means 30_1, 30_2, ..., 30_N carry out multiple hypothesis update processes in parallel.

ここで、図６を参照（構成については適宜図１，２参照）して、このステップＳ２の仮説並列処理について、詳細に説明を行う。
このステップＳ２において、まず、モデルパラメータ推定装置１は、制御手段１０の仮説並列処理制御手段１２によって、仮説処理手段３０_１，３０_２，…，３０_Ｎの中で、処理を行っていない空き状態の仮説処理手段３０が存在するか否かを判定する（ステップＳ２１）。なお、ここでは、仮説処理手段３０_１，３０_２，…，３０_Ｎが空き状態であるか否かはフラグによって判定することとし、仮説並列処理制御手段１２が、初期段階で予めすべての仮説処理手段３０_１，３０_２，…，３０_Ｎが空き状態である旨のフラグを初期化（例えば、“０”）しておくこととする。 Here, referring to FIG. 6 (refer to FIGS. 1 and 2 as appropriate for the configuration), the hypothesis parallel processing in step S2 will be described in detail.
In step S2, the model parameter estimation device 1 first uses the hypothesis parallel processing control means 12 of the control means 10 to determine whether any of the hypothesis processing means 30_1, 30_2, ..., 30_N is in the idle (empty) state, that is, not currently processing (step S21). Here, whether each of the hypothesis processing means 30_1, 30_2, ..., 30_N is idle is determined by a flag, and the hypothesis parallel processing control means 12 initializes all the flags in advance to the value indicating the idle state (for example, “0”).

ここで、空き状態の仮説処理手段３０が存在しないと判定した場合（ステップＳ２１でＮｏ）、仮説並列処理制御手段１２は、ステップＳ２８に動作を移す。
一方、空き状態の仮説処理手段３０が存在すると判定した場合（ステップＳ２１でＹｅｓ）、仮説並列処理制御手段１２は、仮説処理手段３０に対して、以下の仮説更新処理を実行させる。 Here, if it is determined that there is no empty hypothesis processing means 30 (No in step S21), the hypothesis parallel processing control means 12 moves the operation to step S28.
On the other hand, if it is determined that there is an empty hypothesis processing means 30 (Yes in step S21), the hypothesis parallel processing control means 12 causes the hypothesis processing means 30 to execute the following hypothesis update processing.

まず、仮説並列処理制御手段１２は、空き状態の仮説処理手段３０を起動し、時刻（ｔ−１）のｋ番目の仮説ｘ_ｔ−１ ^（ｋ）および重みｗ_ｔ−１ ^（ｋ）を更新する旨を指示する。なお、このとき、仮説並列処理制御手段１２は、当該空き状態の仮説処理手段３０に対応したフラグを、処理中となったことを示す値（例えば、“１”）にセットすることとする。 First, the hypothesis parallel processing control means 12 activates an idle hypothesis processing means 30 and instructs it to update the k-th hypothesis x_{t−1}^(k) and weight w_{t−1}^(k) at time (t−1). At this time, the hypothesis parallel processing control means 12 sets the flag corresponding to that hypothesis processing means 30 to the value indicating that processing is in progress (for example, “1”).

そして、仮説処理手段３０は、制御手段１０から通知される時刻（ｔ−１）におけるｋ番目の仮説ｘ_ｔ−１ ^（ｋ）および重みｗ_ｔ−１ ^（ｋ）を、仮説記憶手段２０から読み出す（ステップＳ２２）。
そして、仮説処理手段３０は、ステップＳ２２で仮説記憶手段２０から読み出した時刻（ｔ−１）におけるｋ番目の仮説ｘ_ｔ−１ ^（ｋ）および重みｗ_ｔ−１ ^（ｋ）から、時刻ｔにおけるｋ番目の仮説ｘ_ｔ ^（ｋ）および重みｗ_ｔ ^（ｋ）を生成する仮説更新処理を実行する。 Then, the hypothesis processing means 30 reads from the hypothesis storage means 20 the k-th hypothesis x_{t−1}^(k) and weight w_{t−1}^(k) at time (t−1) notified from the control means 10 (step S22).
The hypothesis processing means 30 then executes the hypothesis update process, which generates the k-th hypothesis x_t^(k) and weight w_t^(k) at time t from the k-th hypothesis x_{t−1}^(k) and weight w_{t−1}^(k) at time (t−1) read from the hypothesis storage means 20 in step S22.

すなわち、仮説処理手段３０は、モンテカルロ標本化手段３１によって、予め定めた提案分布（あるいは、確率密度関数）により、標本抽出を行うことで、時刻ｔにおける新たな仮説ｘ_ｔ ^（ｋ）を生成する（ステップＳ２３）。例えば、モンテカルロ標本化手段３１は、前記（５）式に示した確率密度関数ｐ（ｘ_ｔ｜ｘ_ｔ−１）により、仮説ｘ_ｔ ^（ｋ）を生成する。 In other words, the hypothesis processing means 30 generates a new hypothesis x _t ^(k) at time t by sampling by the Monte Carlo sampling means 31 using a predetermined proposal distribution (or probability density function). (Step S23). For example, the Monte Carlo sampling means 31 generates a hypothesis x _t ^(k) by the probability density function p (x _t | x _t−1 ) shown in the above equation (5).

そして、仮説処理手段３０は、観測模擬手段３２によって、ステップＳ２２で標本化され生成された仮説ｘ_ｔ ^（ｋ）が、観測モデルによりどのような観測値（模擬観測値）として観測されるかを模擬（演算）する（ステップＳ２４）。例えば、観測模擬手段３２は、前記（７）式に示した予め観測モデルを定義した関数ｈ_ｔの演算により、模擬観測値ｙ_ｔ ^（ｋ）を算出する。 Then, the hypothesis processing means 30 determines what observation value (simulated observation value) the hypothesis x _t ^(k) sampled and generated in step S22 by the observation simulation means 32 is observed by the observation model. Simulation (calculation) is performed (step S24). For example, the observation simulation unit 32 calculates the simulated observation value y _t ^(k) by the calculation of the function h _t that defines the observation model in advance as shown in the equation (7).

さらに、仮説処理手段３０は、尤度評価手段３３によって、時刻ｔにおいて外部から入力される観測値（入力観測値ｙ_ｔ）に基づいて、ステップＳ２４で演算された模擬観測値ｙ_ｔ ^（ｋ）がどれだけ尤もらしい結果であるかを評価する（ステップＳ２５）。例えば、尤度評価手段３３は、前記（８）式に示したマハラノビス距離に基づくガウス関数によって仮説を評価する尤度Ｌ_ｔ ^（ｋ）を算出する。 Further, using the likelihood evaluation means 33, the hypothesis processing means 30 evaluates how plausible the simulated observation value y_t^(k) calculated in step S24 is, based on the observation value input from the outside at time t (input observation value y_t) (step S25). For example, the likelihood evaluation means 33 calculates the likelihood L_t^(k) for evaluating the hypothesis using the Gaussian function based on the Mahalanobis distance shown in Equation (8).

そして、仮説処理手段３０は、重み更新手段３４によって、ステップＳ２５で算出された尤度に基づいて、時刻（ｔ−１）における重みｗ_ｔ−１ ^（ｋ）から、時刻ｔにおける新たな重みｗ_ｔ ^（ｋ）を生成する（ステップＳ２６）。例えば、重み更新手段３４は、前記（１１）式に示すように、ｗ_ｔ−１ ^（ｋ）に尤度Ｌ_ｔ ^（ｋ）を乗算することで新たな重みｗ_ｔ ^（ｋ）を生成する。 Then, using the weight update means 34, the hypothesis processing means 30 generates a new weight w_t^(k) at time t from the weight w_{t−1}^(k) at time (t−1), based on the likelihood calculated in step S25 (step S26). For example, as shown in Equation (11), the weight update means 34 generates the new weight w_t^(k) by multiplying w_{t−1}^(k) by the likelihood L_t^(k).

その後、仮説処理手段３０は、ステップＳ２３で生成された新たな仮説ｘ_ｔ ^（ｋ）と、ステップＳ２６で生成された新たな重みｗ_ｔ ^（ｋ）とを、制御手段１０を介して、仮説記憶手段２０に書き込む（ステップＳ２７）。 Thereafter, the hypothesis processing means 30 writes the new hypothesis x_t^(k) generated in step S23 and the new weight w_t^(k) generated in step S26 to the hypothesis storage means 20 via the control means 10 (step S27).

Through the hypothesis update processing from step S22 to step S27, the hypothesis processing means 30 updates the k-th hypothesis x_{t-1}^(k) and weight w_{t-1}^(k) at time (t-1), as notified by the control means 10, to the k-th hypothesis x_t^(k) and weight w_t^(k) at time t. At this point, the hypothesis processing means 30 notifies the control means 10 that the hypothesis processing is complete. In response, the hypothesis parallel processing control means 12 of the control means 10 sets the flag corresponding to that hypothesis processing means 30 to a value indicating the idle state (for example, "0").
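The per-hypothesis update of steps S22 to S26 can be sketched as follows. This is only an illustration under stated assumptions: equations (7), (8) and (11) are not reproduced in this passage, so the sketch assumes a scalar state, a Gaussian random-walk proposal, an identity observation function h_t, and a one-dimensional Mahalanobis distance. The function name `update_hypothesis` and the parameters `step_sd` and `obs_sd` are hypothetical.

```python
import math
import random


def update_hypothesis(x_prev, w_prev, y_obs, rng, step_sd=0.1, obs_sd=0.5):
    """Update one (hypothesis, weight) pair from time t-1 to time t."""
    # S22-S23: sample a new hypothesis from the proposal distribution
    # (assumed here: Gaussian random walk around the previous hypothesis).
    x_new = rng.gauss(x_prev, step_sd)
    # S24: simulated observation; h_t is assumed to be the identity.
    y_sim = x_new
    # S25: Gaussian likelihood of the squared (1-D) Mahalanobis distance.
    d2 = ((y_obs - y_sim) / obs_sd) ** 2
    likelihood = math.exp(-0.5 * d2)
    # S26: new weight = previous weight multiplied by the likelihood.
    w_new = w_prev * likelihood
    return x_new, w_new
```

Because the likelihood never exceeds 1 in this unnormalized form, a weight can only shrink or stay the same in one update; hypotheses whose simulated observation is close to the input observation keep most of their weight.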

Then, the hypothesis parallel processing control means 12 refers to the hypothesis storage means 20 and determines whether all hypotheses x_{t-1}^(k) and weights w_{t-1}^(k) at time (t-1) have been updated to hypotheses x_t^(k) and weights w_t^(k) at time t (step S28).

If it determines that not all hypotheses and weights have been updated yet (No in step S28), the hypothesis parallel processing control means 12 returns to step S21, updates k (here, increments it), and continues the operation. If it determines that all hypotheses and weights have been updated (Yes in step S28), the hypothesis parallel processing control means 12 ends the hypothesis parallel processing operation for time t.

The procedure has been described here as performing steps S22 to S27 in sequence after the determination in step S21 and then making the determination in step S28. However, the operations of the hypothesis processing means 30 from step S22 to step S27 may be carried out by the individual hypothesis processing means 30_1, 30_2, ..., 30_N, each operating separately and in parallel.

That is, after the hypothesis parallel processing control means 12 activates an idle hypothesis processing means 30 in step S21 and instructs it to execute the hypothesis update processing (steps S22 to S27), it proceeds to step S28 without waiting for that processing to complete. As a result, the hypothesis update processing (steps S22 to S27) runs in parallel.

Note that, even when it has not been determined in step S28 that all hypotheses and weights have been updated, if the hypothesis parallel processing control means 12 has already instructed the hypothesis processing means 30 to perform the hypothesis update processing for all K_t hypotheses, it does not return to step S21. Instead, it waits until all hypothesis processing has finished and ends the hypothesis parallel processing operation once all hypotheses and weights have been updated.
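The dispatch-and-wait behavior of steps S21 to S28 can be approximated in software with a worker pool. The following is a minimal sketch, not the control logic of the apparatus itself: it assumes Python's standard `concurrent.futures.ThreadPoolExecutor`, whose internal work queue stands in for the busy/idle flags held by the hypothesis parallel processing control means 12. The names `update_all`, `update_one` and `n_workers` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def update_all(hypotheses, weights, update_one, n_workers=4):
    """Update every (hypothesis, weight) pair in parallel, then wait for all."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # map() hands the next pair to whichever worker becomes idle and
        # yields results in input order, so the index k is preserved.
        results = list(pool.map(update_one, hypotheses, weights))
    # Leaving the "with" block guarantees that every update has finished,
    # which corresponds to the "Yes" branch of step S28.
    new_hypotheses = [x for x, _ in results]
    new_weights = [w for _, w in results]
    return new_hypotheses, new_weights
```

Here `n_workers` plays the role of N, the number of hypothesis processing means, and `len(hypotheses)` plays the role of K_t.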

Through the operation of step S2 described above, all hypotheses and weights stored in the hypothesis storage means 20 are updated to correspond to the new time.
Returning to FIG. 5, the description of the overall operation of the model parameter estimation apparatus 1 continues.

After step S2, the model parameter estimation apparatus 1 uses the resampling control means 13 of the control means 10 to activate the resampling means 40, which resamples the hypotheses stored in the hypothesis storage means 20 (step S3). The activated resampling means 40 performs the resampling by, for example, drawing hypotheses from the hypotheses and weights stored in the hypothesis storage means 20 according to the probability density distribution shown in equation (12), thereby generating new hypotheses and new weights.
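A common way to realize this kind of resampling is to draw hypotheses with probability proportional to their weights. The sketch below makes that assumption, since equation (12) is not reproduced in this passage; it further assumes that the new weights are reset to a uniform value. The function name `resample` is hypothetical.

```python
import random


def resample(hypotheses, weights, rng):
    """Draw len(hypotheses) hypotheses with probability proportional to weight."""
    k = len(hypotheses)
    # random.Random.choices samples with replacement using relative weights.
    drawn = rng.choices(hypotheses, weights=weights, k=k)
    # Assumption: after resampling, every surviving hypothesis is equally weighted.
    return drawn, [1.0 / k] * k
```

Hypotheses with negligible weight are unlikely to be drawn, while heavily weighted hypotheses tend to be duplicated.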

Thereafter, the model parameter estimation apparatus 1 uses the representative value calculation control means 14 of the control means 10 to activate the representative value calculation means 50, which calculates a representative value from the hypotheses stored in the hypothesis storage means 20 and outputs it as the estimated value of the model parameters (step S4). The representative value calculation means 50 calculates the representative value from the hypotheses and weights stored in the hypothesis storage means 20 by, for example, the weighted average shown in equation (21).
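As an illustration of the weighted average, the following sketch treats each hypothesis as a list of parameter values and normalizes by the sum of the weights. Equation (21) is not reproduced in this passage, so the normalization and the name `weighted_mean` are assumptions.

```python
def weighted_mean(hypotheses, weights):
    """Return the weight-normalized average of parameter vectors."""
    total = sum(weights)
    dim = len(hypotheses[0])
    return [
        sum(w * x[i] for x, w in zip(hypotheses, weights)) / total
        for i in range(dim)
    ]
```

For example, `weighted_mean([[0.0, 0.0], [2.0, 4.0]], [1.0, 3.0])` gives `[1.5, 3.0]`, because the second hypothesis carries three quarters of the total weight.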

Then, the model parameter estimation apparatus 1 uses the control means 10 to determine whether processing has been completed up to a predetermined final time (step S5), and ends the operation if it has (Yes in step S5). If processing has not been completed up to the predetermined final time (No in step S5), the model parameter estimation apparatus 1 updates (increments) the time t (step S6), returns to step S2, and continues the hypothesis update processing.
Through the above operation, the model parameter estimation apparatus 1 can perform the computation for each hypothesis in parallel using the plurality of hypothesis processing means 30, and can therefore perform state estimation at high speed.
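The overall flow of steps S2 to S6 amounts to a loop over time in which each stage is invoked in turn. The sketch below shows only that control flow; the three stage functions are passed in as parameters, and their names (`update_all`, `resample`, `representative`) and signatures are assumptions made for illustration.

```python
def run_filter(observations, hypotheses, weights,
               update_all, resample, representative):
    """Run the S2 -> S3 -> S4 cycle once per observation and collect estimates."""
    estimates = []
    for y_t in observations:  # S5/S6: advance t until the final time
        hypotheses, weights = update_all(hypotheses, weights, y_t)  # S2
        hypotheses, weights = resample(hypotheses, weights)         # S3
        estimates.append(representative(hypotheses, weights))       # S4
    return estimates
```

Only the call in step S2 needs to run in parallel; resampling and the representative value calculation operate on the complete set of hypotheses after all updates have finished.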

The configuration and operation of the model parameter estimation apparatus 1 according to the embodiment of the present invention have been described above, but the present invention is not limited to this embodiment.
For example, the function of resampling hypotheses in the model parameter estimation apparatus 1 is advantageous in that it avoids wasting computation on hypotheses with excessively small weights and increases the number of effective (not small-weight) hypotheses, thereby reducing the risk that the set of hypotheses falls into a local solution. However, it is not essential. When this resampling function is omitted, the resampling control means 13 and the resampling means 40 may be omitted from the configuration.
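When resampling is retained, it need not run at every time step. One trigger, in line with the condition recited in the claims (the number of hypotheses whose weight is below a predetermined minimum exceeds a predetermined number), could be sketched as follows. The function name `needs_resampling` and the parameter names `min_weight` and `max_small` are hypothetical.

```python
def needs_resampling(weights, min_weight, max_small):
    """Return True when more than max_small weights fall below min_weight."""
    small = sum(1 for w in weights if w < min_weight)
    return small > max_small
```

The thresholds are design parameters; suitable values depend on the number of hypotheses and on whether the weights are normalized.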

As described above, the model parameter estimation apparatus 1 according to the present invention can execute sequential Monte Carlo processing at high speed through parallel computation by the plurality of hypothesis processing means 30. Because parallel processing by the plurality of hypothesis processing means 30 is realized in this way, the model parameter estimation apparatus 1 can also be implemented in parallel hardware. This parallel processing is also suited to multi-threaded processing on computers equipped with multiple CPUs or with multi-core or hyper-threading CPUs, and to parallel computation on cluster computers, enabling high-speed computation that makes effective use of hardware resources.

In addition, because the sequential Monte Carlo method that the model parameter estimation apparatus 1 can compute is general-purpose, it can be applied widely to estimating model parameters of, for example, physical phenomena such as weather, financial systems, and biological systems. The model parameter estimation apparatus 1 can also be applied in fields where the sequential Monte Carlo method is applicable, such as online learning in machine learning, inverse problem solving such as video and audio restoration, observers in control systems, and data fusion in measurement systems such as sensor networks.

DESCRIPTION OF SYMBOLS
1 Model parameter estimation apparatus
10 Control means
11 Hypothesis access control means
12 Hypothesis parallel processing control means
13 Resampling control means
14 Representative value calculation control means
20 Hypothesis storage means
30 Hypothesis processing means
31 Monte Carlo sampling means
32 Observation simulation means
33 Likelihood evaluation means
34 Weight update means
40 Resampling means
50 Representative value calculation means

Claims

A model parameter estimation apparatus that estimates parameters of a model by a sequential Monte Carlo method, comprising:
hypothesis storage means for storing a plurality of hypothesis data, which are candidate solutions of the parameters, in association with the generation time of the hypothesis data;
a plurality of hypothesis processing means for generating new hypothesis data by sampling the hypothesis data based on a proposal distribution of the sequential Monte Carlo method, which is a predetermined distribution of the parameters, and updating the hypothesis data stored in the hypothesis storage means together with the generation time;
control means for distributing the plurality of hypothesis data stored in the hypothesis storage means to the plurality of hypothesis processing means and operating the plurality of hypothesis processing means; and
representative value calculation means for calculating a representative value of the plurality of hypothesis data stored in the hypothesis storage means as an estimated value of the solution of the parameters,
wherein the control means holds a state indicating whether each hypothesis processing means is in the process of updating hypothesis data, distributes, based on the generation time, hypothesis data that have not been updated since the last update time to hypothesis processing means that are not performing an update operation, and, at the stage where all hypothesis data stored in the hypothesis storage means have been updated since the last update time, updates the update time to the next update time.

The model parameter estimation apparatus according to claim 1, wherein
the hypothesis storage means stores a weight indicating the importance of the hypothesis data in association with the hypothesis data,
the hypothesis processing means includes:
Monte Carlo sampling means for sampling and generating, according to the proposal distribution, hypothesis data that are candidates for the solution of the parameters;
observation simulation means for generating, from the hypothesis data generated by the Monte Carlo sampling means, a simulated observation value, which is an observation value obtained by applying a predetermined observation model;
likelihood evaluation means for receiving an observation value from outside and calculating a likelihood indicating the degree of approximation between the input observation value, which is the observation value input at the current time, and the simulated observation value generated by the observation simulation means; and
weight update means for updating the weight, based on the likelihood calculated by the likelihood evaluation means, so that its value becomes larger as the input observation value and the simulated observation value are closer, and
the representative value calculation means calculates the representative value by taking a weighted average of the plurality of hypothesis data stored in the hypothesis storage means based on the weights corresponding to the hypothesis data.

The model parameter estimation apparatus according to claim 1 or claim 2, further comprising resampling means for resampling the plurality of hypothesis data stored in the hypothesis storage means according to a predetermined probability density distribution.

The model parameter estimation apparatus according to claim 3, wherein the control means operates the resampling means when the plurality of hypothesis data stored in the hypothesis storage means are unevenly distributed in the parameter space beyond a predetermined condition at a certain time interval.

The model parameter estimation apparatus according to claim 2, further comprising resampling means for resampling the plurality of hypothesis data stored in the hypothesis storage means according to a predetermined probability density distribution,
wherein the control means operates the resampling means when, among the plurality of hypothesis data stored in the hypothesis storage means, the number of hypothesis data having a weight smaller than a predetermined minimum value exceeds a predetermined number.

A model parameter estimation program for estimating parameters of a model by a sequential Monte Carlo method, the program causing a computer to function as:
a plurality of hypothesis processing means that, with respect to hypothesis storage means storing a plurality of hypothesis data, which are candidate solutions of the parameters, in association with the generation time of the hypothesis data, generate new hypothesis data by sampling the hypothesis data based on a proposal distribution of the sequential Monte Carlo method, which is a predetermined distribution of the parameters, and update the hypothesis data stored in the hypothesis storage means together with the generation time;
control means for distributing the plurality of hypothesis data stored in the hypothesis storage means to the plurality of hypothesis processing means and operating the plurality of hypothesis processing means; and
representative value calculation means for calculating a representative value of the plurality of hypothesis data stored in the hypothesis storage means as an estimated value of the solution of the parameters,
wherein the control means holds a state indicating whether each hypothesis processing means is in the process of updating hypothesis data, distributes, based on the generation time, hypothesis data that have not been updated since the last update time to hypothesis processing means that are not performing an update operation, and, at the stage where all hypothesis data stored in the hypothesis storage means have been updated since the last update time, updates the update time to the next update time.