JP5175515B2

JP5175515B2 - Model construction apparatus, model construction method and program

Info

Publication number: JP5175515B2
Application number: JP2007258918A
Authority: JP
Inventors: 藤誠佐
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2007-10-02
Filing date: 2007-10-02
Publication date: 2013-04-03
Anticipated expiration: 2027-10-02
Also published as: JP2009087235A

Description

本発明は、たとえばモデルパラメータ数の多い場合に用いて好適なモデル構築装置、モデル構築方法およびモデル構築プログラムに関する。 The present invention relates to a model construction apparatus, a model construction method, and a model construction program suitable for use when, for example, the number of model parameters is large.

分析対象物の何らかの性質が数値として表すことができ、その性質が確率的な振る舞いをするとき、数値化された性質は確率変数と呼ばれる。いくつかのモデルパラメータθを用いて１つ以上の確率変数の振る舞いを簡潔に表すためにさまざまな確率モデルが提案されている。 When a certain property of an analysis object can be expressed as a numerical value and the property behaves stochastically, the numerical property is called a random variable. Various stochastic models have been proposed to concisely represent the behavior of one or more random variables using several model parameters θ.

例えば、下記の式(1)の正規分布モデルは平均(μ)と標準偏差(σ)という２つのモデルパラメータ(θ={μ, σ})を用いて確率変数Xが値xをとりうる確率を記述することが可能な確率モデルである。

For example, the normal distribution model of the following equation (1) is the probability that the random variable X can take the value x using two model parameters (θ = {μ, σ}), the mean (μ) and standard deviation (σ). Is a probabilistic model that can describe

確率モデルには大きく分けて２つの種類が存在する。１つは生成モデルと呼ばれ、もう一つは予測モデルと呼ばれる。前者の生成モデルは、対象の性質に関する１つ以上の確率変数である属性変数Xがあるとき、Xが特定の値xをとる確率(Pr(X=x))を記述するための確率モデルである。一方、後者の予測モデルは、属性変数Xの他に対象の性質に関する１つ以上の確率変数である目的変数Yを用意し、Xが特定の値xをとるときにYが特定の値yをとる確率(Pr(Y=y|X=x))を記述するための確率モデルである。例えば、式(1)の一次元正規分布モデルや多次元正規分布モデルは生成モデルであり、線形重回帰モデルや一般化線形モデルなどは予測モデルに分類される。 There are two types of probability models. One is called a generation model and the other is called a prediction model. The former generation model is a probability model for describing the probability (Pr (X = x)) that X takes a specific value x when there is an attribute variable X that is one or more random variables related to the properties of the object. is there. On the other hand, in the latter prediction model, in addition to the attribute variable X, an objective variable Y that is one or more random variables related to the target property is prepared, and when X takes a specific value x, Y has a specific value y. This is a probability model for describing the probability (Pr (Y = y | X = x)). For example, the one-dimensional normal distribution model and multi-dimensional normal distribution model of Equation (1) are generation models, and linear multiple regression models, generalized linear models, and the like are classified as prediction models.

確率モデルのパラメータは、モデル化したい性質について数値化した情報を複数の対象物について収集した学習データから決定(学習)することができる。確率モデル構築装置は読み込んだ学習データを用いて最適な確率モデルのパラメータを決定するための装置である。 The parameters of the probabilistic model can be determined (learned) from learning data collected from a plurality of objects of information obtained by quantifying the property to be modeled. The probabilistic model construction device is a device for determining optimal probabilistic model parameters using read learning data.

正規分布やポアソン分布などの単純な確率分布を用いて複雑な確率的現象をモデル化するために、単純な確率分布を２つ以上組み合わせた混合モデルと呼ばれる確率モデルが用いられる。混合モデルにおいて、各事例の振る舞いは複数存在する単純な確率分布のいずれか（あるいは組み合わせ）によって説明できればよい。混合モデルのモデルパラメータは学習データを用いてEMアルゴリズムなどによって決定することができる（非特許文献１）。 In order to model a complex probabilistic phenomenon using a simple probability distribution such as a normal distribution or a Poisson distribution, a probability model called a mixed model in which two or more simple probability distributions are combined is used. In the mixed model, the behavior of each case may be explained by any one (or combination) of a plurality of simple probability distributions. The model parameters of the mixed model can be determined by EM algorithm or the like using learning data (Non-patent Document 1).

店舗の売り上げや地価、人口動態などの地表上で生じる現象に関する確率変数について確率モデルによって振る舞いを正確に表現できれば、対象地域における店舗の売り上げや地価の予測、人口動態の構造解析など様々な応用を行うことができる。そのためには、そのような地理空間的な現象のための確率モデルである地理空間モデルを、地理空間情報を含む学習データである地理空間データから決定することが必要になる。 If the behavior of the random variables related to phenomena on the ground surface such as store sales, land prices, and demographics can be accurately expressed by a probabilistic model, various applications such as store sales, land price prediction, and demographic structural analysis in the target area will be possible. It can be carried out. For that purpose, it is necessary to determine a geospatial model that is a probabilistic model for such a geospatial phenomenon from geospatial data that is learning data including geospatial information.

複雑な地理空間的現象をモデル化するためには混合モデルを用いることが有効であるが、地理空間的現象には空間依存性が存在する場合がある。例えば、対象エリアの各地点における機器の故障発生について数値化した故障指数という確率変数が(単純な正規分布では表現しきれないといった)複雑な振る舞いを示すため、混合モデルを用いてモデル化する場合を考える。故障指数の分布が単純な正規分布にならないのは、塩害という隠れた空間的要因が存在するためであり、海に近いエリアと海から遠いエリアとでは故障指数の確率分布が異なるからだとする。そのような場合、ある地点が塩害エリアに含まれるとき隣の地点も塩害エリアに含まれる可能性は高いという点を考慮してモデルパラメータθの学習を行わなければならない。空間依存性が存在する地理空間的現象を扱うためには、このような地点と地点との位置的な関係を考慮した混合モデルである地理空間混合モデルを構築することが必要になるが、通常のEMアルゴリズムなどでは空間依存性を考慮したパラメータ学習を行うことができない。 In order to model complex geospatial phenomena, it is effective to use a mixed model, but geospatial phenomena may have spatial dependence. For example, when using a mixed model to model a failure variable that is a numerical value of the failure index of equipment at each point in the target area, indicating a complex behavior (such as cannot be expressed with a simple normal distribution) think of. The reason why the failure index distribution does not become a simple normal distribution is that there is a hidden spatial factor called salt damage, and that the probability distribution of the failure index differs between an area close to the sea and an area far from the sea. In such a case, it is necessary to learn the model parameter θ in consideration of the fact that when a certain point is included in the salt damage area, there is a high possibility that the adjacent point is also included in the salt damage area. In order to handle geospatial phenomena with spatial dependence, it is necessary to build a geospatial mixed model that is a mixed model that considers the positional relationship between such points. The EM algorithm cannot perform parameter learning considering spatial dependence.

空間依存性を考慮した確率モデルとしては、画像処理などに応用されているマルコフ確率場(Markov Random Fields, 以下MRF)が存在する（非特許文献２）。MRFでは空間依存性パラメータλ(以下MRFパラメータ)を用いることによって隣接する地点間の依存関係を考慮している。MRFパラメータは画像サンプルデータから決定することができる。一般に画像処理の学習データにはどのピクセルとどのピクセルが異なるラベルになるかに関する情報が含まれているが、地理空間データを用いて混合モデルを構築する際にはそのような領域の境界情報が得られない点が地理空間混合モデル構築の困難さのひとつである。 As a probabilistic model that takes into account spatial dependence, there is a Markov Random Field (hereinafter referred to as MRF) that is applied to image processing and the like (Non-Patent Document 2). In MRF, the dependence between adjacent points is taken into account by using a spatial dependence parameter λ (hereinafter referred to as MRF parameter). MRF parameters can be determined from image sample data. In general, the learning data for image processing includes information on which pixels and which pixels have different labels, but when building a mixed model using geospatial data, boundary information of such regions is included. This is one of the difficulties in building a geospatial mixed model.

MRFによって空間依存性を考慮した地理空間混合モデル構築方法として非特許文献３に提案された方法がある。非特許文献３の方法ではn次元連続値ベクトルであるモデルパラメータθのすべての組み合わせについてMRFパラメータλを導入し、準ニュートン法とマルコフ連鎖モンテカルロ法(以下MCMC法)によってθとλの推定を行っている。
Finite Mixture Models, Wiley-Interscience, ISBN: 0471006262 Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction, Springer, ISBN: 3540442138 Mark S. Kaiser, Noel Cressie, Jaehyung Lee, Spatial Mixture Models based on Exponential Family Conditional Distributions, Statistica Sinica 12, pages 449-474, 2002. There is a method proposed in Non-Patent Document 3 as a method for constructing a geospatial mixed model in which spatial dependence is considered by MRF. In the method of Non-Patent Document 3, MRF parameter λ is introduced for all combinations of model parameters θ which are n-dimensional continuous value vectors, and θ and λ are estimated by quasi-Newton method and Markov chain Monte Carlo method (hereinafter MCMC method). ing.
Finite Mixture Models, Wiley-Interscience, ISBN: 0471006262 Image Analysis, Random Fields and Markov Chain Monte Carlo Methods: A Mathematical Introduction, Springer, ISBN: 3540442138 Mark S. Kaiser, Noel Cressie, Jaehyung Lee, Spatial Mixture Models based on Exponential Family Conditional Distributions, Statistica Sinica 12, pages 449-474, 2002.

MRFを用いた地理空間混合モデル構築によって地理的な空間依存性を考慮しつつ混合モデルを構築することが可能になるが、既存手法ではモデルパラメータθの個数nに対してn×n個のMRFパラメータλが必要であった。そして、準ニュートン法とMCMC法によって厳密な最適パラメータを求めているため、個数nが多い確率モデルを用いた混合モデルでは多くの学習データと計算時間が必要になってしまうという問題点があった。 Although it is possible to build a mixed model by considering geospatial dependence by building a mixed geospatial model using MRF, the existing method uses n × n MRFs for the number n of model parameters θ. The parameter λ was required. In addition, since the exact optimal parameters are obtained by the quasi-Newton method and the MCMC method, there is a problem that a mixed model using a stochastic model with a large number n requires a lot of learning data and calculation time. .

本発明は以上のような問題を解決するためになされたものであり、その目的は、モデルパラメータの多い場合にも効率的に地理空間混合モデルを構築可能なモデル構築装置、モデル構築方法およびモデル構築プログラムを提供することにある。 The present invention has been made to solve the above-described problems, and an object of the present invention is to provide a model construction apparatus, a model construction method, and a model that can efficiently construct a geospatial mixed model even when there are many model parameters. To provide a construction program.

本発明の一態様としてのモデル構築装置は、
評価対象の性質を数値によって表した少なくとも１つの変数と、地理空間における位置を示す位置データとを含む複数の事例を有する地理空間データを記憶する地理空間データ記憶手段と、
前記変数の確率分布をモデル化した複数の各確率モデルのパラメータを表すパラメータ情報を記憶するパラメータ記憶手段と、
前記地理空間における前記位置毎に適用するべき前記確率モデルを表した適用モデル情報を記憶する適用モデル情報記憶手段と、
前記地理空間内の各前記位置に適用されるべき確率モデルと、前記地理空間内の各前記位置に対してあらかじめ定義した近傍範囲に含まれる１つ以上の近傍位置に適用される確率モデルとの関係に基づいて、同一または異なる２つの前記確率モデルからなる各組について前記２つの確率モデル間の依存性を数値によって表したモデル依存性情報を算出するモデル依存性算出手段と、
前記モデル依存性算出手段によって算出された前記モデル依存性情報を記憶するモデル依存性情報記憶手段と、
前記パラメータ情報と前記モデル依存性情報との組に対する前記地理空間データの尤度が高くなるように、前記地理空間における位置毎に適用するべき前記確率モデルを前記複数の確率モデルの中から選択し、前記位置毎に選択した前記確率モデルを示すように前記適用モデル情報を更新する確率モデル選択手段と、
前記更新された適用モデル情報に基づき、前記地理空間データを、同一の確率モデルが適用される複数のグループに分割し、あらかじめ与えられたモデル規範を最大化するように、前記複数のグループの各々に対応する前記確率モデルのパラメータを学習し、各前記確率モデルの学習されたパラメータを示すように前記パラメータ情報を更新するパラメータ学習手段と、
を備える。 The model construction apparatus as one aspect of the present invention is:
Geospatial data storage means for storing geospatial data having a plurality of cases including at least one variable representing the property of the evaluation object by numerical value and position data indicating a position in geospatial space;
Parameter storage means for storing parameter information representing parameters of a plurality of probability models obtained by modeling the probability distribution of the variables;
Application model information storage means for storing application model information representing the probability model to be applied for each position in the geographic space;
A probability model to be applied to each position in the geospace, and a probability model applied to one or more neighboring positions included in a predefined neighborhood range for each position in the geospace. Model dependence calculation means for calculating model dependence information in which a dependence between the two probability models is numerically expressed for each set of the same or different two probability models based on a relationship;
Model dependence information storage means for storing the model dependence information calculated by the model dependence calculation means;
The probability model to be applied for each position in the geospatial is selected from the plurality of probability models so that the likelihood of the geospatial data with respect to the set of the parameter information and the model dependency information is high. A probability model selection means for updating the applied model information to indicate the probability model selected for each position;
Based on the updated application model information, each of the plurality of groups is configured to divide the geospatial data into a plurality of groups to which the same probability model is applied, and to maximize a predetermined model criterion. Parameter learning means for learning parameters of the probability model corresponding to and updating the parameter information to indicate the learned parameters of each probability model;
Is provided.

本発明の一態様としてのモデル構築方法は、
評価対象の性質を数値によって表した少なくとも１つの変数と、地理空間における位置を示す位置データとを含む複数の事例を有する地理空間データを記憶する地理空間データ記憶手段と、
前記変数の確率分布をモデル化した複数の各確率モデルのパラメータを表すパラメータ情報を記憶するパラメータ記憶手段と、
前記地理空間における前記位置毎に適用するべき前記確率モデルを表した適用モデル情報を記憶する適用モデル情報記憶手段と、
を準備する準備ステップと、
前記地理空間内の各前記位置に適用されるべき確率モデルと、前記地理空間内の各前記位置に対してあらかじめ定義した近傍範囲に含まれる１つ以上の近傍位置に適用される確率モデルとの関係に基づいて、同一または異なる２つの前記確率モデルからなる各組について前記２つの確率モデル間の依存性を数値によって表したモデル依存性情報を算出するモデル依存性情報算出ステップと、
前記モデル依存性情報をモデル依存性情報記憶手段に記憶するステップと、
前記パラメータ情報と前記モデル依存性情報との組に対する前記地理空間データの尤度が高くなるように、前記地理空間における位置毎に適用するべき前記確率モデルを前記複数の確率モデルの中から選択し、前記位置毎に選択した前記確率モデルを示すように前記適用モデル情報を更新する確率モデル選択ステップと、
前記更新された適用モデル情報に基づき、前記地理空間データを、同一の確率モデルが適用される複数のグループに分割し、あらかじめ与えられたモデル規範を最大化するように、前記複数のグループの各々に対応する前記確率モデルのパラメータを学習し、各前記確率モデルの学習されたパラメータを示すように前記パラメータ情報を更新するパラメータ学習ステップと、
を備える。 A model construction method as one aspect of the present invention includes:
Geospatial data storage means for storing geospatial data having a plurality of cases including at least one variable representing the property of the evaluation object by numerical value and position data indicating a position in geospatial space;
Parameter storage means for storing parameter information representing parameters of a plurality of probability models obtained by modeling the probability distribution of the variables;
Application model information storage means for storing application model information representing the probability model to be applied for each position in the geographic space;
Preparation steps, and
A probability model to be applied to each position in the geospace, and a probability model applied to one or more neighboring positions included in a predefined neighborhood range for each position in the geospace. A model dependency information calculating step for calculating model dependency information that represents numerically the dependency between the two probability models for each set of the same or different two probability models based on the relationship;
Storing the model dependency information in a model dependency information storage means;
The probability model to be applied for each position in the geospatial is selected from the plurality of probability models so that the likelihood of the geospatial data with respect to the set of the parameter information and the model dependency information is high. A probability model selection step of updating the applied model information to indicate the probability model selected for each position;
Based on the updated application model information, each of the plurality of groups is configured to divide the geospatial data into a plurality of groups to which the same probability model is applied, and to maximize a predetermined model criterion. Learning a parameter of the probability model corresponding to and updating the parameter information to indicate the learned parameter of each probability model; and
Is provided.

本発明の一態様としてのモデル構築プログラムは、
評価対象の性質を数値によって表した少なくとも１つの変数と、地理空間における位置を示す位置データとを含む複数の事例を有する地理空間データを記憶する地理空間データ記憶手段にアクセスするステップと、
前記変数の確率分布をモデル化した複数の各確率モデルのパラメータを表すパラメータ情報を記憶するパラメータ記憶手段にアクセスするステップと、
前記地理空間における前記位置毎に適用するべき前記確率モデルを表した適用モデル情報を記憶する適用モデル情報記憶手段にアクセスするステップと、
前記地理空間内の各前記位置に適用されるべき確率モデルと、前記地理空間内の各前記位置に対してあらかじめ定義した近傍範囲に含まれる１つ以上の近傍位置に適用される確率モデルとの関係に基づいて、同一または異なる２つの前記確率モデルからなる各組について前記２つの確率モデル間の依存性を数値によって表したモデル依存性情報を算出するモデル依存性算出ステップと、
前記モデル依存性情報をモデル依存性情報記憶手段に記憶するステップと、
前記パラメータ情報と前記モデル依存性情報との組に対する前記地理空間データの尤度が高くなるように、前記地理空間における位置毎に適用するべき前記確率モデルを前記複数の確率モデルの中から選択し、前記位置毎に選択した前記確率モデルを示すように前記適用モデル情報を更新する確率モデル選択ステップと、
前記更新された適用モデル情報に基づき、前記地理空間データを、同一の確率モデルが適用される複数のグループに分割し、あらかじめ与えられたモデル規範を最大化するように、前記複数のグループの各々に対応する前記確率モデルのパラメータを学習し、各前記確率モデルの学習されたパラメータを示すように前記パラメータ情報を更新するパラメータ学習ステップと、
を備える。 The model construction program as one aspect of the present invention is:
Accessing geospatial data storage means for storing geospatial data having a plurality of cases including at least one variable representing the property to be evaluated numerically and position data indicating a position in geospatial;
Accessing parameter storage means for storing parameter information representing a parameter of each of a plurality of probability models modeling the probability distribution of the variable;
Accessing application model information storage means for storing application model information representing the probability model to be applied for each position in the geographic space;
A probability model to be applied to each position in the geospace, and a probability model applied to one or more neighboring positions included in a predefined neighborhood range for each position in the geospace. A model dependency calculating step for calculating model dependency information that represents numerically the dependency between the two probability models for each set of the same or different two probability models based on the relationship;
Storing the model dependency information in a model dependency information storage means;
The probability model to be applied for each position in the geospatial is selected from the plurality of probability models so that the likelihood of the geospatial data with respect to the set of the parameter information and the model dependency information is high. A probability model selection step of updating the applied model information to indicate the probability model selected for each position;
Based on the updated application model information, each of the plurality of groups is configured to divide the geospatial data into a plurality of groups to which the same probability model is applied, and to maximize a predetermined model criterion. Learning a parameter of the probability model corresponding to and updating the parameter information to indicate the learned parameter of each probability model; and
Is provided.

本発明により、モデルパラメータの多い場合にも効率的に混合モデルを構築できる。 According to the present invention, a mixed model can be efficiently constructed even when there are many model parameters.

以下、図面に基づいて、本発明の実施の形態について説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

図２は、本発明に関わる地理空間混合モデル構築装置の一実施の形態を示した構成図である。図２に示されるように、この本発明に関わる地理空間混合モデル構築装置は、地理空間データ記憶手段（地理空間データ記憶手段）２０１、混合モデル学習手段（パラメータ学習手段）２０２、混合モデルパラメータ記憶手段（パラメータ記憶手段）２０３、地点状態最尤推定手段（確率モデル選択手段）２０４、地点状態記憶手段（適用モデル情報記憶手段）２０５、地点状態MRFパラメータ推定手段（モデル依存性算出手段）２０６、地点状態MRFパラメータ記憶手段（モデル依存性情報記憶手段）２０７を備えている。 FIG. 2 is a configuration diagram showing an embodiment of a mixed geospatial model construction apparatus according to the present invention. As shown in FIG. 2, the geospatial mixed model construction apparatus according to the present invention includes a geospatial data storage unit (geospatial data storage unit) 201, a mixed model learning unit (parameter learning unit) 202, and a mixed model parameter storage. Means (parameter storage means) 203, point state maximum likelihood estimation means (probability model selection means) 204, point state storage means (applied model information storage means) 205, point state MRF parameter estimation means (model dependence calculation means) 206, A point state MRF parameter storage means (model dependence information storage means) 207 is provided.

各手段はたとえばプログラムモジュールとして実現することができ、この場合、各プログラムモジュールを含むプログラムを図１に示すコンピュータシステムおいて実行することで各手段による機能を実現することができる。このコンピュータシステムには、プログラム命令を実行するＣＰＵ１０２、メモリ等の主記憶装置１０３、ハードディスク、磁気ディスク装置または光磁気ディスク装置等の外部記憶装置１０４、ユーザによるデータ入力を行う入力装置１０５、ユーザにデータ表示を行う表示装置１０６およびこれらを互いに接続するバス１０１が備わっている。 Each means can be realized, for example, as a program module. In this case, the function of each means can be realized by executing a program including each program module in the computer system shown in FIG. This computer system includes a CPU 102 for executing program instructions, a main storage device 103 such as a memory, an external storage device 104 such as a hard disk, a magnetic disk device or a magneto-optical disk device, an input device 105 for inputting data by a user, A display device 106 for displaying data and a bus 101 for connecting them to each other are provided.

図２において、地理空間データ記憶手段２０１には、モデル化対象の地理空間的な現象に関連する様々な性質を数値化した確率変数である対象変数データと、地理上の位置を表す位置データとを含む地理空間データが記憶される。地理空間データには複数の事例が格納されている。 In FIG. 2, the geospatial data storage unit 201 includes target variable data that is a random variable obtained by quantifying various properties related to a geospatial phenomenon to be modeled, and position data that represents a geographical position. Geospatial data including is stored. A plurality of cases are stored in the geospatial data.

生成モデルを構築したい場合には対象変数データにはひとつ以上の属性変数Xが含まれ、予測モデルを構築したい場合には対象変数データにはひとつ以上の属性変数Xとひとつ以上の目的変数Yが含まれなければならない。 If you want to build a generation model, the target variable data contains one or more attribute variables X. If you want to build a prediction model, the target variable data contains one or more attribute variables X and one or more target variables Y. Must be included.

背景技術の欄で説明したように、生成モデルは、対象の性質に関する１つ以上の確率変数である属性変数Xがあるとき、Xが特定の値xをとる確率(Pr(X=x))を記述するための確率モデルである。一方、予測モデルは、属性変数Xの他に対象の性質に関する１つ以上の確率変数である目的変数Yを用意し、Xが特定の値xをとるときにYが特定の値yをとる確率(Pr(Y=y|X=x))を記述するための確率モデルである。例えば、一次元正規分布モデルや多次元正規分布モデルは生成モデルであり、線形重回帰モデルや一般化線形モデルなどは予測モデルに分類される。ここで予測モデルにおいて属性変数Ｘと目的変数Ｙとの集合は、たとえばＬ個の変数に相当し、予測モデル（確率モデル）は、Ｌ−Ｓ（Ｓは１以上の整数）個の変数が与えられたときの残りのＳ個の変数の確率分布をモデル化したものといえる。 As explained in the background art section, when there is an attribute variable X that is one or more random variables related to the target property, the generation model has a probability that X takes a specific value x (Pr (X = x)) Is a stochastic model for describing On the other hand, in addition to the attribute variable X, the prediction model prepares an objective variable Y that is one or more random variables related to the target property, and the probability that Y takes a specific value y when X takes a specific value x This is a probabilistic model for describing (Pr (Y = y | X = x)). For example, a one-dimensional normal distribution model or a multidimensional normal distribution model is a generation model, and a linear multiple regression model, a generalized linear model, or the like is classified as a prediction model. Here, the set of the attribute variable X and the objective variable Y in the prediction model corresponds to, for example, L variables, and the prediction model (probability model) is given by LS (S is an integer of 1 or more) variables. It can be said that the probability distribution of the remaining S variables is modeled.

位置データは対象変数データの各事例の位置に関する情報を集めたものであり、例えば、空間を格子状に区切っていた場合には格子位置のインデックスなどが含まれる。その他、対象物の緯度経度やポリゴン情報などが含まれていてもよい。 The position data is a collection of information on the positions of the respective cases of the target variable data. For example, when the space is divided into a grid, an index of the grid position is included. In addition, the latitude and longitude of the object, polygon information, and the like may be included.

図４（Ａ）は地理空間データの例を示し、格子位置のインデックスという位置データであるPos属性（位置属性）、各格子における機器の故障発生指数（たとえば故障発生件数）を表すX属性、および、事例番号に相当するID属性が含まれている。図４（Ｂ）はPos（位置）に従ってIDを格子状に並べたものを表し、図４（Ｃ）はPos（位置）に従ってXを格子状に並べたものを表す。この例では、５×５＝２５個の格子が存在する。 FIG. 4 (A) shows an example of geospatial data, Pos attribute (position attribute) which is position data called a grid position index, an X attribute representing a failure occurrence index (for example, the number of failure occurrences) of equipment in each lattice, and The ID attribute corresponding to the case number is included. FIG. 4B shows an ID arranged in a grid according to Pos (position), and FIG. 4C shows an X arranged in a grid according to Pos (position). In this example, there are 5 × 5 = 25 lattices.

混合モデルパラメータ記憶手段２０３には、混合モデルを構成するあらかじめ定められた確率モデルの個数k（kは２以上の整数）に従って、各確率モデルのパラメータ{θ1, …, θk}が混合モデルパラメータθとして記憶される。 In the mixed model parameter storage means 203, the parameters {θ1,..., Θk} of each probability model are stored in the mixed model parameter θ according to a predetermined number k of probability models constituting the mixed model (k is an integer of 2 or more). Is remembered as

図５はk=3の混合モデルパラメータの例を示す。モデルパラメータθは、{a,b,c}へラベル付けされた、３つの確率モデルのモデルパラメータ{θa, θb, θc}を含む。各確率モデルのパラメータは下記の式(１)の１次元正規分布の平均μと標準偏差σから構成されており、モデルパラメータ数n=２ということになる。よって、この例では混合モデルパラメータの総数n×k=６となる。

FIG. 5 shows an example of a mixed model parameter for k = 3. The model parameter θ includes model parameters {θa, θb, θc} of three probability models labeled {a, b, c}. The parameters of each probability model are composed of the average μ and standard deviation σ of the one-dimensional normal distribution of the following equation (1), and the number of model parameters is n = 2. Therefore, in this example, the total number of mixed model parameters is n × k = 6.

地点状態記憶手段２０５には、各地点（Pos）において混合モデルに含まれる確率モデルのうちどれを用いるかを識別するための離散値情報が地点状態S（適用モデル情報）として格納される。図６（Ｂ）は図４の地理空間データと図５の混合モデルパラメータとを用いたときの地点状態の例であり、２５個の各格子に対して{a,b,c}のいずれかの識別ラベルが付与されている。 In the point state storage means 205, discrete value information for identifying which of the probability models included in the mixed model is used at each point (Pos) is stored as a point state S (applied model information). FIG. 6B is an example of a point state when the geospatial data of FIG. 4 and the mixed model parameters of FIG. 5 are used, and one of {a, b, c} for each of 25 grids. The identification label is given.

地点状態MRFパラメータ記憶手段２０７には、地点状態Sに関するマルコフ確率場の空間依存性パラメータλ（モデル依存性情報）が格納される。地点状態Sの空間依存関係をマルコフ確率場でモデル化すると、例えば、ある地点Siがラベルaとなる確率は式(2)のように表すことができる（非特許文献２）。

ここで、N(Si)はSiの近傍である。また、λはN(Si)(Siの近傍)のラベル値がSiのラベル値に与える影響をモデル化するためのパラメータであり、λ=0の場合には近傍のラベル値によるSiのラベル値への影響はないとみなすことができる。 The spot state MRF parameter storage means 207 stores the Markov random field spatial dependence parameter λ (model dependence information) regarding the spot state S. When the spatial dependency of the point state S is modeled by a Markov random field, for example, the probability that a certain point Si becomes the label a can be expressed as in Expression (2) (Non-patent Document 2).

Here, N (Si) is in the vicinity of Si. Λ is a parameter for modeling the effect of the label value of N (Si) (in the vicinity of Si) on the label value of Si. When λ = 0, the label value of Si is determined by the label value in the vicinity. It can be assumed that there is no impact on

以降の説明では２次元空間（平面）内において上下左右に隣り合う位置群を“近傍”（近傍範囲）と定義する。ただし、近傍の定義は、モデル構築の目的に応じて、異なってもよい。たとえばある位置からみて左隣の位置のみを近傍とし、上隣、右隣、上隣の各位置は、近傍に含めない場合も考えられる。またある位置からみて上下左右にそれぞれ２つ先の位置を近傍と定義する場合も考えられる。また、空間データが３次元の位置情報を有する場合は、３次元空間で近傍が定義されてもよい。 In the following description, a group of positions adjacent to each other vertically and horizontally in a two-dimensional space (plane) is defined as “neighbor” (neighbor range). However, the definition of the neighborhood may be different depending on the purpose of model construction. For example, it can be considered that only the position on the left side when viewed from a certain position is set as the vicinity, and the positions on the upper side, the right side, and the upper side are not included in the vicinity. In addition, there may be a case where two positions ahead, down, left, and right as viewed from a certain position are defined as neighborhoods. In addition, when the spatial data has three-dimensional position information, the neighborhood may be defined in the three-dimensional space.

図１０（Ｂ）は２次の近傍を採用した場合の地点状態MRFパラメータの例を示している。ここで、(a,a)=1は、地点Siがaとなる確率は近傍にラベルaが存在すると高くなるという依存関係を表し、(a,c)=-1は、地点Siがaとなる確率は近傍にラベルcが存在すると低くなるという依存関係を表し、(a,b)=1は、地点Siがaとなる確率は近傍にラベルbが存在しても影響ないことを表す（非特許文献２参照）。 FIG. 10B shows an example of the point state MRF parameter when the second-order neighborhood is adopted. Here, (a, a) = 1 represents a dependency relationship that the probability that the point Si is a becomes higher when the label a exists in the vicinity, and (a, c) =-1 indicates that the point Si is a (A, b) = 1 indicates that the probability that the point Si is a has no effect even if there is a label b in the vicinity ((a, b) = 1) Non-patent document 2).

混合モデル学習手段２０２では、地理空間データ記憶手段２０１に記憶された地理空間データと地点状態記憶手段２０５に記憶された地点状態を用いて混合モデルパラメータを学習し、学習した混合モデルパラメータを混合モデルパラメータ記憶手段２０３に格納する。 The mixed model learning unit 202 learns the mixed model parameter using the geospatial data stored in the geospatial data storage unit 201 and the point state stored in the point state storage unit 205, and uses the learned mixed model parameter as a mixed model. Store in the parameter storage means 203.

図３は、本発明に関わる混合モデル学習手段の一実施の形態を示した構成図である。図３に示されるように、この本発明に関わる混合モデルパラメータ学習手段は、初期混合モデル学習手段３０１、地理空間データ分割手段３０２、分割地理空間データ記憶手段３０３、モデル学習手段３０４、を備えている。 FIG. 3 is a block diagram showing an embodiment of the mixed model learning means according to the present invention. As shown in FIG. 3, the mixed model parameter learning unit according to the present invention includes an initial mixed model learning unit 301, a geospatial data dividing unit 302, a divided geospatial data storing unit 303, and a model learning unit 304. Yes.

初期混合モデル学習手段３０１では、各地点に対し地点状態の値（図６（Ｂ））が定まっていない場合に各事例に空間的依存性が存在しないと仮定して、各事例に基づき混合モデルパラメータを算出し、算出した混合モデルパラメータを混合モデルパラメータ記憶手段２０３に格納する。この際、確率モデルの個数と、確率モデルの型とはあらかじめユーザにより指定しておく。空間的依存性を無視する場合、EMアルゴリズムなどの一般的な方法によって混合モデルのモデルパラメータを得ることが可能である（非特許文献１）。なおユーザにより混合モデルのモデルパラメータを指定してもよい。 The initial mixed model learning means 301 assumes that there is no spatial dependence in each case when the value of the point state (FIG. 6B) is not fixed for each point, and the mixed model based on each case. The parameter is calculated, and the calculated mixed model parameter is stored in the mixed model parameter storage unit 203. At this time, the number of probability models and the type of probability model are specified in advance by the user. When ignoring the spatial dependence, it is possible to obtain model parameters of the mixed model by a general method such as an EM algorithm (Non-Patent Document 1). Note that the model parameters of the mixed model may be specified by the user.

地理空間データ分割手段３０２では、各地点に対する地点状態の値が定まっている場合に、地理空間データ（図４（Ａ））の各事例を地点状態値によって排他的に分割する。すなわち、地理空間データDを地点状態Sの値({1,..k})に従って{D1,…,Dk}に分割する。{D1,…,Dk}はそれぞれ分割地理空間データ（グループ）に相当する。地理空間データ分割手段３０２は、各分割地理空間データを分割地理空間データ記憶手段３０３に格納する。 The geospatial data dividing unit 302 exclusively divides each case of the geospatial data (FIG. 4A) by the point state value when the value of the point state for each point is determined. That is, the geospatial data D is divided into {D1,..., Dk} according to the value ({1, .. k}) of the point state S. {D1,..., Dk} each correspond to divided geospatial data (group). The geospatial data dividing unit 302 stores each divided geospatial data in the divided geospatial data storage unit 303.

モデル学習手段３０４では、分割地理空間データのそれぞれ(Di)を用いてモデル学習を行うことによりモデルパラメータ(θi)を決定し、各分割地理空間データから得られたモデルパラメータの集合を混合モデルパラメータとして混合モデルパラメータ記憶手段２０３に格納する。つまり、モデル学習手段３０４は、各分割地理空間データ（グループ）に対して、モデル学習アルゴリズムに応じた規範（あらかじめ与えられたモデル規範）を最大化するように、各分割地理空間データに対応する確率モデルのモデルパラメータを学習（最適化）する。このモデル学習では空間依存性を考慮する必要はない。 The model learning means 304 determines model parameters (θi) by performing model learning using each (Di) of the divided geospatial data, and sets a set of model parameters obtained from each divided geospatial data as mixed model parameters. Is stored in the mixed model parameter storage means 203. That is, the model learning unit 304 corresponds to each divided geospatial data so as to maximize a standard (a model standard given in advance) according to the model learning algorithm for each divided geospatial data (group). Learn (optimize) model parameters of a probabilistic model. In this model learning, it is not necessary to consider spatial dependence.

本実施形態では確率モデルとして正規分布モデルを用いているため、モデル学習アルゴリズムとしてはたとえば最尤推定またはベイズ推定などを用いることができる。最尤推定の場合は、モデル規範を最大化することは、学習データ（分割地理空間データ）に対して、正規分布モデル（正規分布関数）に基づいた尤度関数の値（尤度）を最大化することに相当する。 In this embodiment, since a normal distribution model is used as the probability model, for example, maximum likelihood estimation or Bayesian estimation can be used as the model learning algorithm. In the case of maximum likelihood estimation, maximizing the model criterion is to maximize the value (likelihood) of the likelihood function based on the normal distribution model (normal distribution function) for the training data (divided geospatial data). This is equivalent to

確率モデルとしては、正規分布モデルの他にも、線形回帰分析、決定木、ベイジアンネットを利用した確率モデルも可能である。線形回帰分析では、モデル規範を最大化することは、学習データ（分割地理空間データ）と、線形回帰モデルの出力との自乗誤差を最小にすることに相当する。決定木では、モデル規範を最大化することは、学習データに対して情報量またはGini値などの値を最大にすることに相当する。ベイジアンネットでは、モデル規範を最大化することは、学習データに対して事後分布を最大化すること（尤度の最大化）に相当する。 As the probability model, in addition to the normal distribution model, a probability model using linear regression analysis, a decision tree, and a Bayesian network is also possible. In linear regression analysis, maximizing the model criterion corresponds to minimizing the square error between the learning data (divided geospatial data) and the output of the linear regression model. In a decision tree, maximizing a model criterion is equivalent to maximizing a value such as an information amount or a Gini value for learning data. In the Bayesian network, maximizing the model criterion corresponds to maximizing the posterior distribution (maximizing likelihood) for the learning data.

地点状態最尤推定手段２０４では、混合モデルパラメータθと地点状態MRFパラメータλ、および、地理空間データDを用いて、混合モデルパラメータと地点状態MRFパラメータが与えられたときの地理空間データの尤度がなるべく高くなるような地点状態S*を推定する。すなわち、

となるS*を求める。式(3)において地点状態Sは地理空間データDにおける各事例の地点状態の集合であり、地点状態の全ての組み合わせ中から、Prが最大になるSを求める。ただし、Sの取りうる値は非常に多いため局所最適解を求めることしかできないことが多い。式(3)の局所最適解の探索方法としては、MCMC法、ICM法など様々な手法が提案されている（非特許文献２）。地点状態最尤推定手段２０４は、各事例について推定した地点状態の集合を地点状態Sとして地点状態記憶手段２０５に格納する。 The point state maximum likelihood estimation means 204 uses the mixed model parameter θ, the point state MRF parameter λ, and the geospatial data D, and the likelihood of the geospatial data when the mixed model parameter and the point state MRF parameter are given. Estimate the point state S * such that is as high as possible. That is,

Find S * that becomes In Equation (3), the point state S is a set of point states of each case in the geospatial data D, and S that maximizes Pr is obtained from all combinations of the point states. However, since there are many possible values of S, it is often only possible to obtain a local optimal solution. Various methods such as the MCMC method and the ICM method have been proposed as a search method for the local optimum solution of Equation (3) (Non-Patent Document 2). The point state maximum likelihood estimating unit 204 stores the set of point states estimated for each case in the point state storage unit 205 as the point state S.

地点状態MRFパラメータ推定手段２０６では、地点状態記憶手段２０５内の地点状態Sを用いて地点状態MRFパラメータλを推定し、推定した地点状態MRFパラメータλを地点状態MRFパラメータ記憶手段２０７に格納する。 The point state MRF parameter estimation unit 206 estimates the point state MRF parameter λ using the point state S in the point state storage unit 205, and stores the estimated point state MRF parameter λ in the point state MRF parameter storage unit 207.

図７は、本発明に関わる地点状態MRFパラメータ推定手段の一実施の形態を示した構成図である。図７に示されるように、この本発明に関わる地点状態MRFパラメータ推定手段は、１次頻度算出手段７０１、２次頻度算出手段７０２、および、空間依存パラメータ算出手段７０３を備えている。 FIG. 7 is a block diagram showing an embodiment of the point state MRF parameter estimation means according to the present invention. As shown in FIG. 7, the spot state MRF parameter estimation unit according to the present invention includes a primary frequency calculation unit 701, a secondary frequency calculation unit 702, and a space-dependent parameter calculation unit 703.

１次頻度算出手段７０１は地点状態から各離散値（a, b, c）の頻度を算出し、２次頻度算出手段７０２は、同一または異なる離散値の組の頻度を算出する。そして、空間依存パラメータ算出手段７０３は、算出された１次頻度と２次頻度から地点状態MRFパラメータ（依存性情報）を算出し、地点状態MRFパラメータ記憶手段２０７に格納する。地点状態MRFパラメータ推定手段の詳細な動作説明は後述する。 The primary frequency calculation means 701 calculates the frequency of each discrete value (a, b, c) from the point state, and the secondary frequency calculation means 702 calculates the frequency of the same or different set of discrete values. Then, the space-dependent parameter calculation means 703 calculates a spot state MRF parameter (dependency information) from the calculated primary frequency and secondary frequency, and stores it in the spot state MRF parameter storage means 207. Detailed operation description of the point state MRF parameter estimation means will be described later.

図８は、図２の地理空間混合モデル構築装置により行われる処理の実行手順を示したフローチャートである。図８に示されるように、この地理空間混合モデル構築装置の実行手順は、初期混合モデル学習ステップ８０１、地点状態最尤推定ステップ８０２、混合モデル学習ステップ８０３、地点状態MRFパラメータ推定ステップ８０４、終了判定ステップ８０５を備えている。以下では、図４の地理空間データを用いて、図８のフローチャートの実行過程を詳しく述べる。 FIG. 8 is a flowchart showing an execution procedure of processing performed by the geospatial mixed model construction device of FIG. As shown in FIG. 8, the execution procedure of this geospatial mixed model construction apparatus includes an initial mixed model learning step 801, a point state maximum likelihood estimation step 802, a mixed model learning step 803, a point state MRF parameter estimation step 804, and an end. A determination step 805 is provided. Hereinafter, the execution process of the flowchart of FIG. 8 will be described in detail using the geospatial data of FIG.

ステップ８０１では、混合モデル学習手段２０２における初期混合モデル学習手段３０１によって、各事例に空間的依存性が存在しないと仮定して混合モデルパラメータを算出する。図５はステップ８０１によって算出された混合モデルパラメータの一例を示す。ラベル{a,b,c}が付けられた３つの正規分布モデル({Ma,Mb,Mc}と表す)のモデルパラメータ{θa, θb, θc}が示される。 In step 801, the mixed model learning unit 301 in the mixed model learning unit 202 calculates mixed model parameters on the assumption that there is no spatial dependency in each case. FIG. 5 shows an example of the mixed model parameter calculated in step 801. Model parameters {θa, θb, θc} of three normal distribution models (represented as {Ma, Mb, Mc}) labeled {a, b, c} are shown.

ステップ８０２では、地点状態最尤推定手段２０４によって、上述した式(3)によって表される地点状態Sの最尤推定値(現実的には近似最適値)が算出される。 In step 802, the point state maximum likelihood estimating means 204 calculates the maximum likelihood estimated value (practically approximate optimum value) of the point state S represented by the above-described equation (3).

より詳しくは、まず、すべての地点について、属性変数値xと{Ma,Mb,Mc}との乖離値を計算する。例えば、正規分布モデルにおける乖離値としては、式(1)のlogをとったものに-1を掛けた式(4)などを用いることができる。なお、属性変数値xが平均値のとき、乖離値は最小である。

式(4)を用いると、例えば、事例(ID=)1とモデルMaとの乖離値は、

と算出できる。 More specifically, first, the divergence value between the attribute variable value x and {Ma, Mb, Mc} is calculated for all points. For example, as the deviation value in the normal distribution model, equation (4) obtained by multiplying the log of equation (1) by -1 can be used. When the attribute variable value x is an average value, the divergence value is minimum.

Using equation (4), for example, the deviation value between case (ID =) 1 and model Ma is

And can be calculated.

すべての事例について{Ma,Mb,Mc}とX（各事例のxの集合）との乖離値を計算した結果を図６（Ａ）の表における{Ma,Mb,Mc}にそれぞれ示す。また、最も乖離値が小さいモデルの識別値を図６（Ａ）の表におけるBestに示す。 The result of calculating the divergence value between {Ma, Mb, Mc} and X (the set of x in each case) is shown in {Ma, Mb, Mc} in the table of FIG. Further, the identification value of the model having the smallest deviation value is shown as “Best” in the table of FIG.

次に、得られた乖離値と地点状態MRFパラメータλとを用いて最適な地点状態推定値を決定する。現時点では、ステップ８０２の１回目であり、λは定まっていないので、λ=0とみなされる。その場合、各地点についてBestの値が、最適な地点状態値として推定される。図６（Ｂ）はそのようにして得られた地点状態を示している。また、図６（Ｃ）は、図６（Ｂ）の地点状態値を空間的にプロットしたものを示し、図６（Ｄ）は、図６（Ｂ）の各地点状態値を、ラベルa, b, cごとに塗りつぶしパターンを変えてプロットしたものを示す。 Next, an optimum point state estimated value is determined using the obtained divergence value and the point state MRF parameter λ. At this time, it is the first time in step 802, and λ is not determined, so that λ = 0 is assumed. In that case, the value of Best for each point is estimated as the optimum point state value. FIG. 6B shows the point state obtained in this way. FIG. 6C shows a spatial plot of the point state values of FIG. 6B, and FIG. 6D shows the point state values of FIG. The plot is shown with different fill patterns for b and c.

ステップ８０３では、得られた地点状態Sと地理空間データDとを用いてモデルパラメータθの学習を行う。上記ステップ８０２の一回目における地点状態Sの算出ではλ=0とみなしたため、ステップ８０３の一回目では、ステップ８０１で得られた初期混合モデルと同じモデルパラメータθが得られる(よって、ステップ８０３の１回目はスキップしてよい。ここではスキップしたと仮定する)。 In step 803, the model parameter θ is learned using the obtained point state S and geospatial data D. Since the calculation of the point state S in the first step 802 is considered as λ = 0, the same model parameter θ as that in the initial mixed model obtained in step 801 is obtained in the first step 803 (thus, in step 803). The first time may be skipped (assuming that it was skipped here).

ステップ８０４では、ステップ８０２で得られた地点状態Sから地点状態MRFパラメータ推定手段２０６によって、地点状態に関する空間依存性のパラメータλ（依存性情報）が推定される。例えば、図６（Ｂ）〜図６（Ｄ）の地点状態が得られた場合、まず、地点状態MRFパラメータ推定手段２０６における１次頻度算出手段７０１によって、各格子における各ラベル（a〜c）の頻度π1（１次頻度）が算出される。次に、２次頻度算出手段７０２によって、各格子と隣接する格子とのペアについて、重複を避けてラベルペアの頻度π2（２次頻度）が算出される。図９（Ａ）と図９（Ｂ）に、１次頻度π１の例と２次頻度π２の例をそれぞれ示す。この例では図９（Ｂ）から、４０個のラベルペアが存在するこがわかる。 In step 804, the space state parameter λ (dependency information) regarding the point state is estimated from the point state S obtained in step 802 by the point state MRF parameter estimating means 206. For example, when the point states in FIGS. 6B to 6D are obtained, first, each label (a to c) in each lattice is obtained by the primary frequency calculating unit 701 in the point state MRF parameter estimating unit 206. The frequency π1 (primary frequency) is calculated. Next, the secondary frequency calculation means 702 calculates the frequency π2 (secondary frequency) of the label pair while avoiding duplication with respect to the pair of each lattice and the adjacent lattice. FIGS. 9A and 9B show an example of the primary frequency π1 and an example of the secondary frequency π2, respectively. In this example, it can be seen from FIG. 9B that there are 40 label pairs.

得られた１次頻度π1と２次頻度π2を用いて、地点状態MRFパラメータ推定手段２０６における空間依存パラメータ算出手段７０３によって、例えば以下のような手順に従って地点状態に関する空間依存性のパラメータλを推定する。 By using the obtained primary frequency π 1 and secondary frequency π 2, the spatial dependence parameter calculation means 703 in the spot state MRF parameter estimation means 206 estimates the spatial dependence parameter λ related to the spot state, for example, according to the following procedure. To do.

まず、ラベルペアの頻度π2と、ラベルの頻度π1から計算されるラベルペアの期待値との差をλ’として算出する。 First, the difference between the label pair frequency π 2 and the expected value of the label pair calculated from the label frequency π 1 is calculated as λ ′.

例えば、図９（Ａ）から、ラベルaの生じる確率は12/25であり、したがってラベルペアa-aが生じる確率は(12/25)²である。一方、実際のラベルペアa-aの発生確率は図９（Ｂ）から10/40である。そこで、これらの確率の比のlogをとると、λ’(a,a)=log((10/40)/(12/25)²)≒0.035となる。λ’が正の値をとるということは１次頻度π１から算出された２次頻度の期待値よりも、実際の２次頻度π２のほうが大きいということなので、aのとなりはaになりやすいという正の自己相関が働いていると推定できる。 For example, from FIG. 9A, the probability of occurrence of label a is 12/25, and therefore the probability of occurrence of label pair aa is (12/25) ² . On the other hand, the actual occurrence probability of the label pair aa is 10/40 from FIG. 9B. Therefore, taking the log of the ratio of these probabilities, λ ′ (a, a) = log ((10/40) / (12/25) ² ) ≈0.035. The fact that λ ′ takes a positive value means that the actual secondary frequency π2 is larger than the expected value of the secondary frequency calculated from the primary frequency π1, so that the next to a is likely to be a. It can be estimated that positive autocorrelation is working.

また、ラベルペアa-cが生じる確率は図９（Ａ）から2*(12/25)*(4/25)となるので、λ’(a,c)≒-0.311となる。λ’が負の値をとるということは１次頻度π１から算出された２次頻度の期待値よりも、実際の２次頻度π２のほうが小さいということなので、aのとなりはcになりにくいという負の相互相関が働いていると推定できる。 Further, since the probability that the label pair a-c occurs is 2 * (12/25) * (4/25) from FIG. 9A, λ ′ (a, c) ≈−0.311. The fact that λ ′ takes a negative value means that the actual secondary frequency π2 is smaller than the expected value of the secondary frequency calculated from the primary frequency π1, so that the next to a is less likely to be c. It can be estimated that negative cross-correlation is working.

他のラベルペアについても同様にしてλ’を算出し、算出した全てのλ’をまとめたものを図１０（Ａ）に示す。 Λ ′ is calculated in the same manner for other label pairs, and all the calculated λ ′ are summarized in FIG.

ここで、自己相関に関してはλ’>0となるものは+1、そうでないものは0とし、相互相関に関してはλ’<0となるものは-1、そうでないものは0とする。すなわち自己相関に関しては負の相関は考慮せず、正の相関が働くか否かのみを考慮し、相互相関に関しては正の相関は考慮せず、負の相関が働くか否かのみを考慮する。このようにしてλ’の値を変更すると、図１０（Ｂ）に示すように各ラベルペアについて空間依存性のパラメータλが得られる。 Here, regarding the autocorrelation, λ ′> 0 is +1, otherwise is 0, and λ ′ <0 is −1, and otherwise is 0. In other words, autocorrelation does not consider negative correlations, only considers whether positive correlations work, and does not consider positive correlations for cross-correlation, only considers whether negative correlations work . When the value of λ ′ is changed in this way, a space-dependent parameter λ is obtained for each label pair as shown in FIG.

λ’およびλの算出方法は様々なバリエーションが考えられる。例えば、λ’=λとしたり、λが{0, +1, -1}以外の値をとり得るようにしたり、ユーザーパラメータαなどを導入して{0, +α, -α}の値をとるようにしたりすることができる。 Various variations of the calculation method of λ ′ and λ can be considered. For example, λ '= λ, λ can take a value other than {0, +1, -1}, or the value of {0, + α, -α} Or take it.

ステップ８０５では、終了条件が満たされるか否かの判定が行われ終了条件が満たされる場合は処理を終了し、満たされない場合はステップ８０２に戻る。終了条件としては、図８のフローのループ回数が所定回数に達したことや、θまたはλの変化がなくなったことなどが考えられる。今回の場合（１度目のループの場合）、全てのラベルペアについてλ=0ならば終了するが、１つのラベルペアでもλ≠0であれば継続する（終了条件を満たさない）と仮定する。したがって、図１０（Ｂ）に示すようにλ≠0のペアが存在するため、ステップ８０２に戻ることにする。 In step 805, it is determined whether or not the end condition is satisfied. If the end condition is satisfied, the process ends. If not, the process returns to step 802. The termination condition may be that the number of loops in the flow in FIG. 8 has reached a predetermined number or that the change in θ or λ has been eliminated. In this case (in the case of the first loop), it is assumed that the process ends if λ = 0 for all label pairs, but continues even if one label pair has λ ≠ 0 (the end condition is not satisfied). Therefore, as shown in FIG. 10B, since there exists a pair of λ ≠ 0, the process returns to step 802.

ステップ８０２の２回目では、まず、最新のモデルパラメータθを用いて各モデルとデータＸとの乖離値が算出される。ステップ８０３の１回目はスキップされているので、算出される乖離値は、ステップ８０２の１回目に算出した(図６（Ａ）に示す)乖離値と同じである。ただし、今回は、１回目のステップ８０４でλが求まっているため、空間依存性も考慮して最適な地点状態Sを探索しなければならない。ここでは、近似探索手法として知られるICM（非特許文献２）を用いた例を示す。ICMではランダムに選択した地点においてある状態値（ラベル）をとった場合のペナルティを計算し、最もペナルティの低いラベルに置き換えていくという処理を繰り返す。ペナルティとしては、モデルとデータとの乖離値、または、λに負の符号を掛け合わせたものなどが考えられる。 In the second time of step 802, first, a deviation value between each model and data X is calculated using the latest model parameter θ. Since the first time in step 803 is skipped, the calculated divergence value is the same as the divergence value calculated in the first time in step 802 (shown in FIG. 6A). However, since λ is obtained in the first step 804 this time, the optimum point state S must be searched in consideration of spatial dependence. Here, an example using ICM (Non-Patent Document 2) known as an approximate search method is shown. ICM repeats the process of calculating a penalty when a certain state value (label) is taken at a randomly selected point and replacing it with the label with the lowest penalty. As the penalty, a deviation value between the model and the data, or a value obtained by multiplying λ by a negative sign can be considered.

図６（Ａ）の乖離値と図１０（Ｂ）のλとを用いたとき、例えば、地点２０（ID=20）のラベル値をaにすることを考える。このとき乖離値は(2.9-0.4=)2.5増加する。またa-bのペアは３つ減りa-aのペアが３つ増えるので、空間依存性に関するペナルティ（空間全体における依存性の変化量）は-3だけ減少する。従って合計で-0.5のペナルティ減少になる。そこで、地点２０のラベルはbからaに変更になる。すなわち合計値（演算値）が閾値（ここではゼロ）より小さいため、ラベルは変更になる。なお、地点２０（ID=20）のラベル値をラベルcに変更する場合についても同様にしてペナルティの減少を計算し、ペナルティの減少がより大きい方のラベルへ変更するようにしてもよい。 When the divergence value in FIG. 6A and λ in FIG. 10B are used, for example, consider that the label value of the point 20 (ID = 20) is a. At this time, the deviation value increases by (2.9-0.4 =) 2.5. Also, since the number of a-b pairs is reduced by three and the number of a-a pairs is increased by three, the penalty related to spatial dependence (change in dependence in the whole space) is reduced by -3. Therefore, the penalty is reduced by -0.5. Therefore, the label of the point 20 is changed from b to a. That is, the label is changed because the total value (calculated value) is smaller than the threshold value (here, zero). In the case where the label value at the point 20 (ID = 20) is changed to the label c, the penalty reduction may be calculated in the same manner, and the label may be changed to the label with the larger penalty reduction.

以上のような処理を他の地点（事例）についてもいくつか選択して行う。すなわち、処理効率の観点から全ての事例でなくいくつかの事例について行う。このようにして、地点状態Sを探索した結果を図１１（Ａ）〜図１１（Ｃ）に一例として示す。 The above processing is performed by selecting some other points (examples). That is, some cases are performed instead of all cases from the viewpoint of processing efficiency. The results of searching for the spot state S in this way are shown as an example in FIGS. 11 (A) to 11 (C).

ステップ８０３の２回目では、得られた地点状態Sと地理空間データDを用いてモデルパラメータθの学習を行う。まず、混合モデル学習手段２０２における地理空間データ分割手段３０２によって、地点状態の離散値（ラベル値）に応じて地理学習データDを分割する。図１１（Ａ）の地点状態を用いたときにおける、モデルMa用の学習データDaを図１２に示す。この学習データを用いて混合モデル学習手段２０２におけるモデル学習手段３０４によってモデルパラメータθaが学習される。具体的には図１２の学習データDaからXの平均と標準偏差とを計算する。なおこの計算は、最尤推定法において、尤度関数におけるパラメータの最尤推定値（尤度関数を最大にするパラメータの値）を求めていることと等化である。モデルMa, Mb用の学習データDb, Dcについても同様にしてＸの平均と標準偏差とを計算する。そのようにして得られたモデルパラメータθを図１３に示す。 In step 803, the model parameter θ is learned using the obtained point state S and geospatial data D. First, the geospatial data dividing unit 302 in the mixed model learning unit 202 divides the geographic learning data D according to the discrete values (label values) of the point states. FIG. 12 shows learning data Da for the model Ma when the point state of FIG. 11A is used. The model parameter θa is learned by the model learning means 304 in the mixed model learning means 202 using this learning data. Specifically, the average and standard deviation of X are calculated from the learning data Da in FIG. This calculation is equivalent to obtaining the maximum likelihood estimation value of the parameter in the likelihood function (the value of the parameter that maximizes the likelihood function) in the maximum likelihood estimation method. For the learning data Db and Dc for the models Ma and Mb, the average and standard deviation of X are calculated in the same manner. The model parameter θ thus obtained is shown in FIG.

ステップ８０４の２回目では、図１１（Ｂ）の地点状態を用いて、１回目と同様に地点状態MRFパラメータλの推定を行う。図１４に２ループ目の地点状態MRFパラメータλの算出結果を示す。 In the second time of step 804, the point state MRF parameter λ is estimated using the point state of FIG. 11B as in the first time. FIG. 14 shows the calculation result of the point state MRF parameter λ in the second loop.

ステップ８０５の２回目では、終了条件が満たされずに、ステップ８０２に戻ったとする。 In the second time of step 805, it is assumed that the end condition is not satisfied and the process returns to step 802.

ステップ８０２の３回目では、図１３のモデルパラメータと図１４の地点状態MRFパラメータとを用いて最適な地点状態Sの算出を行う。まず、最新のモデルパラメータθを用いてモデルとデータＸとの乖離値を各地点について算出する。各地点について算出した乖離値のうち、地点３と地点２２のみに関する乖離値を図１５に示す。次に、最適な地点状態Sを探索する。たとえば地点３のラベル値をbにすることを考える。このとき乖離値は2.4増加する。またa-bのペアが３つ減りb-bのペアが３つ増えるので空間依存性に関するペナルティは-3減少する。従って合計で-0.6のペナルティ減少になる。そこで、地点３のラベルがaからbに変更になる。同様に地点２２もラベル値をaにすることで空間依存性が３減少するので、ラベルがbからaに変更になる。３ループ目で得られた地点状態Sを図１６に示す。 In the third time of step 802, the optimal spot state S is calculated using the model parameters of FIG. 13 and the spot state MRF parameters of FIG. First, the deviation value between the model and the data X is calculated for each point using the latest model parameter θ. Of the divergence values calculated for each point, the divergence values relating only to point 3 and point 22 are shown in FIG. Next, the optimum point state S is searched. For example, consider that the label value of point 3 is b. At this time, the deviation value increases by 2.4. Also, since the number of a-b pairs is reduced by 3 and the number of b-b pairs is increased by 3, the penalty for spatial dependence is reduced by -3. Therefore, the penalty is reduced by -0.6. Therefore, the label of point 3 is changed from a to b. Similarly, since the spatial dependency of the point 22 is reduced by setting the label value to a, the label is changed from b to a. FIG. 16 shows the point state S obtained in the third loop.

ステップ８０３の３回目の計算結果を図１７（Ａ）、ステップ８０４の３回目の計算結果を図１７（Ｂ）に示す。３ループ目でループが終了する終了条件を用いると仮定すると、図１７（Ａ）および図１７（Ｂ）が最終的に得られた地理空間混合モデルのパラメータθとλに相当する。 FIG. 17A shows the third calculation result in step 803, and FIG. 17B shows the third calculation result in step 804. Assuming that an end condition for ending the loop at the third loop is used, FIGS. 17A and 17B correspond to the parameters θ and λ of the finally obtained geospatial mixed model.

最終的に、本例では、図１６からも分かるように、大きく、１つのaエリア、２つのbエリア、２つのcエリアに空間が分かれ、故障指数Xの確率分布は、エリアごとに同じモデルパラメータをとる混合モデルによって表される。そこで、同一のエリアについて注意深く調べることにより、故障指数に影響を与えている隠れた要因を発見することが可能になると期待できる。 Finally, in this example, as can be seen from FIG. 16, the space is divided into one a area, two b areas, and two c areas, and the probability distribution of the failure index X is the same model for each area. Represented by a mixed model that takes parameters. Therefore, it can be expected that by investigating carefully the same area, it will be possible to discover hidden factors affecting the failure index.

本実施形態では、モデルパラメータθに関するMRFではなく、離散値の地点状態Sに関するMRFを用いた地理空間混合モデルを採用しており、上記のような手順に従うことで、準ニュートン法などの計算コストの必要な手法を使うことなく地理空間混合モデルを構築できる。 In this embodiment, a geospatial mixed model using MRF related to the discrete point state S instead of MRF related to the model parameter θ is adopted, and the calculation cost of the quasi-Newton method or the like is obtained by following the above procedure. It is possible to construct a mixed geospatial model without using the necessary methods.

また、本実施形態ではXが１次元であるため正規分布パラメータはn=2（μとσの２つ）であったが、例えばXが4次元連続値ベクトルの場合、多次元正規分布パラメータは最大n=4+4*4=20必要である。このとき、本実施形態では、確率モデルの個数kの増加に対して、パラメータλの個数は2乗で増加するもののモデルパラメータ数nは線形にしか増加しないので、本実施形態は、モデルパラメータ数の多い場合に用いて効率的である。 In this embodiment, since X is one-dimensional, the normal distribution parameter is n = 2 (two of μ and σ). However, when X is a four-dimensional continuous value vector, for example, the multi-dimensional normal distribution parameter is Maximum n = 4 + 4 * 4 = 20 is required. At this time, in the present embodiment, the number of parameters λ increases in a square while the number of model parameters n increases only in a linear manner, while the number of parameters λ increases in a square. It is efficient when used in many cases.

本発明のハードウェア構成を表すブロック図。The block diagram showing the hardware constitutions of this invention. 本発明の一実施形態に関わる地理空間混合モデル構築装置の構成図。The block diagram of the geospatial mixed model construction apparatus in connection with one Embodiment of this invention. 本発明の一実施形態に関わる混合モデル学習手段の構成図。The block diagram of the mixed model learning means in connection with one Embodiment of this invention. 地理空間データの例を示す図。The figure which shows the example of geospatial data. 混合モデルパラメータの例（その１）を示す図。The figure which shows the example (the 1) of a mixed model parameter. 地点状態の推定結果の例（その１）を示す図。The figure which shows the example (the 1) of the estimation result of a point state. 本発明の一実施形態に関わる地点状態MRFパラメータ推定手段の構成図。The block diagram of the point state MRF parameter estimation means in connection with one Embodiment of this invention. 本発明の一実施形態に関わる地理空間混合モデル構築装置のフローチャートを示す図。The figure which shows the flowchart of the geospatial mixed model construction apparatus in connection with one Embodiment of this invention. １次頻度と２次頻度の例を示す図。The figure which shows the example of a primary frequency and a secondary frequency. 空間依存パラメータ算出手段７０３の計算結果の例（その１）を示す図。The figure which shows the example (the 1) of the calculation result of the space dependence parameter calculation means 703. 地点状態の推定結果の例（その２）を示す図。The figure which shows the example (the 2) of the estimation result of a point state. 分割地理空間データの例を示す図。The figure which shows the example of division | segmentation geospatial data. 混合モデルパラメータの例（その２）を示す図。The figure which shows the example (the 2) of a mixed model parameter. 空間依存パラメータ算出手段７０３の計算結果の例（その２）を示す図。The figure which shows the example (the 2) of the calculation result of the space dependence parameter calculation means 703. モデルとデータの乖離値の算出結果の例を示す図。The figure which shows the example of the calculation result of the deviation value of a model and data. 地点状態の推定結果の例（その３）を示す図。The figure which shows the example (the 3) of the estimation result of a point state. モデルパラメータθと地理状態MRFパラメータλの例を示す図。The figure which shows the example of model parameter (theta) and geographic state MRF parameter (lambda).

Claims

Geospatial data storage means for storing geospatial data having a plurality of cases including at least one variable representing the property of the evaluation object by numerical value and position data indicating a position in geospatial space;
Parameter storage means for storing parameter information representing parameters of a plurality of probability models obtained by modeling the probability distribution of the variables;
Application model information storage means for storing application model information representing the probability model to be applied for each position in the geographic space;
A probability model to be applied to each position in the geospace, and a probability model applied to one or more neighboring positions included in a predefined neighborhood range for each position in the geospace. Model dependence calculation means for calculating model dependence information in which a dependence between the two probability models is numerically expressed for each set of the same or different two probability models based on a relationship;
Model dependence information storage means for storing the model dependence information calculated by the model dependence calculation means;
The probability model to be applied for each position in the geospatial is selected from the plurality of probability models so that the likelihood of the geospatial data with respect to the set of the parameter information and the model dependency information is high. A probability model selection means for updating the applied model information to indicate the probability model selected for each position;
Based on the updated application model information, each of the plurality of groups is configured to divide the geospatial data into a plurality of groups to which the same probability model is applied, and to maximize a predetermined model criterion. Parameter learning means for learning parameters of the probability model corresponding to and updating the parameter information to indicate the learned parameters of each probability model;
Model building device with

The model dependence calculation means further calculates the model dependence information based on the updated applied model information, and indicates the model dependence information in the model dependence information storage means to indicate the calculated model dependence information. Update dependency information,
The probability model selection unit is configured to apply the probability to be applied for each position in the geospatial so that the likelihood of the geospatial data with respect to a set of updated parameter information and updated model dependency information is high. Select a model,
The model construction apparatus according to claim 1.

The at least one variable includes L (L is an integer of 2 or more) variables,
The probability model models a probability distribution of the remaining S variables when LS (S is an integer of 1 or more) number of the variables is given.
The model construction apparatus according to claim 1, wherein the model construction apparatus is a model construction apparatus.

The model dependence calculation means includes
Calculating the frequency of each probability model from the applied model information as primary frequency information;
By obtaining a set of a probability model of each position in the geospace and the probability model of the vicinity position included in the vicinity range of each position, each set of the two probability models of the same or different Calculate the frequency as secondary frequency information,
The model construction apparatus according to any one of claims 1 to 3, wherein the model dependence information is calculated using the primary frequency information and the secondary frequency information.

The model dependence calculation means includes
From the primary frequency information, calculate an expected value of the frequency of each set of the two probability models that are the same or different,
Calculating the model dependency information based on the difference between the expected value of the frequency of each set and the frequency of each set indicated in the secondary frequency information;
The model construction device according to claim 4 characterized by things.

Geospatial data storage means for storing geospatial data having a plurality of cases including at least one variable representing the property of the evaluation object by numerical value and position data indicating a position in geospatial space;
Parameter storage means for storing parameter information representing parameters of a plurality of probability models obtained by modeling the probability distribution of the variables;
Application model information storage means for storing application model information representing the probability model to be applied for each position in the geographic space;
Preparation steps, and
A probability model to be applied to each position in the geospace, and a probability model applied to one or more neighboring positions included in a predefined neighborhood range for each position in the geospace. A model dependency information calculating step for calculating model dependency information that represents numerically the dependency between the two probability models for each set of the same or different two probability models based on the relationship;
Storing the model dependency information in a model dependency information storage means;
The probability model to be applied for each position in the geospatial is selected from the plurality of probability models so that the likelihood of the geospatial data with respect to the set of the parameter information and the model dependency information is high. A probability model selection step of updating the applied model information to indicate the probability model selected for each position;
Based on the updated application model information, each of the plurality of groups is configured to divide the geospatial data into a plurality of groups to which the same probability model is applied, and to maximize a predetermined model criterion. Learning a parameter of the probability model corresponding to and updating the parameter information to indicate the learned parameter of each probability model; and
Model building method with

Accessing geospatial data storage means for storing geospatial data having a plurality of cases including at least one variable representing the property to be evaluated numerically and position data indicating a position in geospatial;
Accessing parameter storage means for storing parameter information representing a parameter of each of a plurality of probability models modeling the probability distribution of the variable;
Accessing application model information storage means for storing application model information representing the probability model to be applied for each position in the geographic space;
A probability model to be applied to each position in the geospace, and a probability model applied to one or more neighboring positions included in a predefined neighborhood range for each position in the geospace. A model dependency calculating step for calculating model dependency information that represents numerically the dependency between the two probability models for each set of the same or different two probability models based on the relationship;
Storing the model dependency information in a model dependency information storage means;
The probability model to be applied for each position in the geospatial is selected from the plurality of probability models so that the likelihood of the geospatial data with respect to the set of the parameter information and the model dependency information is high. A probability model selection step of updating the applied model information to indicate the probability model selected for each position;
Based on the updated application model information, each of the plurality of groups is configured to divide the geospatial data into a plurality of groups to which the same probability model is applied, and to maximize a predetermined model criterion. Learning a parameter of the probability model corresponding to and updating the parameter information to indicate the learned parameter of each probability model; and
Model building program with