JP6365064B2

JP6365064B2 - Population generation program, population generation method, and population generation apparatus

Info

Publication number: JP6365064B2
Application number: JP2014152015A
Authority: JP
Inventors: 直希濱田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2014-07-25
Filing date: 2014-07-25
Publication date: 2018-08-01
Anticipated expiration: 2034-07-25
Also published as: JP2016031543A

Description

本発明は、母集団生成プログラム等に関する。 The present invention relates to a population generation program and the like.

近年、計算機の性能向上により、エージェントシミュレーションが都市計画、交通、エネルギー、防災等の各種の公共分野で利用されている。例えば、エージェントシミュレーションを利用することで、人口推移や都市の維持に最適な人口内訳を知ることができる。 In recent years, agent simulation has been used in various public fields such as city planning, transportation, energy, disaster prevention, etc., due to improved computer performance. For example, by using agent simulation, it is possible to know the population breakdown that is optimal for population transition and city maintenance.

ここで、上記のような分野のエージェントシミュレーションを利用する場合には、現実に即した初期条件の実測データをシミュレータに与えることが望ましい。しかし、係る実測データは、調査コストやプライバシーの観点から、入手することが難しい。 Here, when using the agent simulation in the field as described above, it is desirable to provide the simulator with actually measured data of initial conditions that are realistic. However, it is difficult to obtain the actual measurement data from the viewpoint of survey cost and privacy.

このため、実測データを入手する代わりに、対象の母集団から少数のサンプルを入手し、カーネル密度推定を利用し、少数のサンプルから母集団全体を近似的に再構築する従来技術がある。このカーネル密度推定では、バンド幅行列とよばれるパラメータの最適値を探索し、探索したパラメータを用いて、複雑な分布を再現する。 For this reason, there is a conventional technique in which instead of obtaining actual measurement data, a small number of samples are obtained from a target population, and kernel density estimation is used to approximately reconstruct the entire population from a small number of samples. In this kernel density estimation, an optimum value of a parameter called a bandwidth matrix is searched, and a complex distribution is reproduced using the searched parameter.

Dorin Comaniciu「An Algorithm for Data-Driven Bandwidth Selection」IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL.25, NO.2, FEBRUARY 2003Dorin Comaniciu `` An Algorithm for Data-Driven Bandwidth Selection '' IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL.25, NO.2, FEBRUARY 2003

しかしながら、上述した従来技術では、母集団を計算するための計算量が大きくなるという問題がある。 However, the above-described conventional technique has a problem that the amount of calculation for calculating the population increases.

上記のカーネル密度推定は、取り扱う次元が低次元の場合には複雑な分布を表現できるが、高次元の場合には、バンド幅行列の最適値を計算するまでの計算量が大きくなってしまう。 The above kernel density estimation can express a complex distribution when the dimension to be handled is low, but in the case of a high dimension, the amount of calculation until the optimum value of the bandwidth matrix is calculated becomes large.

１つの側面では、計算量が大きくなることを防いで、少数のサンプルから母集団を再構築することができる母集団生成プログラム、母集団生成方法および母集団生成装置を提供することを目的とする。 In one aspect, an object of the present invention is to provide a population generation program, a population generation method, and a population generation device capable of preventing a large amount of calculation and reconstructing a population from a small number of samples. .

第１の案では、コンピュータに下記の処理を実行させる。コンピュータに、サンプル集合に含まれる複数の点について、前記点の近傍に存在する近傍点をそれぞれ特定する処理を実行させる。コンピュータに、前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の分布に基づいて新たな点を生成する処理を実行させる。コンピュータに、生成した新たな点を母集団に追加する処理を実行させる。 In the first plan, the computer executes the following processing. A computer is caused to execute a process of specifying neighboring points existing in the vicinity of the points for a plurality of points included in the sample set. The computer is caused to select a point included in the sample set and a neighboring point corresponding to the point, and execute a process of generating a new point based on the selected point and a distribution of neighboring points. The computer is caused to execute a process of adding the generated new point to the population.

本発明の１実施態様によれば、計算量が大きくなることを防いで、少数のサンプルから母集団を再構築することができるという効果を奏する。 According to one embodiment of the present invention, it is possible to prevent a calculation amount from increasing and to reconstruct a population from a small number of samples.

図１は、本実施例に係る母集団生成装置の構成を示す機能ブロック図である。FIG. 1 is a functional block diagram illustrating the configuration of the population generation apparatus according to the present embodiment. 図２は、サンプル集合情報のデータ構造の一例を示す図である。FIG. 2 is a diagram illustrating an example of a data structure of the sample set information. 図３は、ｋ近傍情報のデータ構造の一例を示す図である。FIG. 3 is a diagram illustrating an example of a data structure of k neighborhood information. 図４は、白色化サンプル集合のデータ構造の一例を示す図である。FIG. 4 is a diagram illustrating an example of the data structure of the whitened sample set. 図５は、近傍点を特定する処理を説明するための図である。FIG. 5 is a diagram for explaining the process of specifying the neighboring points. 図６は、生成部の処理を説明するための図である。FIG. 6 is a diagram for explaining the processing of the generation unit. 図７は、母集団集合情報のデータ構造の一例を示す図である。FIG. 7 is a diagram illustrating an example of a data structure of the population set information. 図８は、本実施例に係る母集団生成装置の処理手順を示すフローチャートである。FIG. 8 is a flowchart illustrating the processing procedure of the population generation apparatus according to the present embodiment. 図９は、生成部が、新しい点を生成するその他の処理の一例を示す図である。FIG. 9 is a diagram illustrating an example of another process in which the generation unit generates a new point. 図１０は、母集団生成プログラムを実行するコンピュータの一例を示す図である。FIG. 10 is a diagram illustrating an example of a computer that executes a population generation program.

以下に、本願の開示する母集団生成プログラム、母集団生成方法および母集団生成装置の実施例を図面に基づいて詳細に説明する。なお、この実施例によりこの発明が限定されるものではない。 Hereinafter, embodiments of a population generation program, a population generation method, and a population generation device disclosed in the present application will be described in detail with reference to the drawings. Note that the present invention is not limited to the embodiments.

本実施例に係る母集団生成装置の一例について説明する。図１は、本実施例に係る母集団生成装置の構成を示す機能ブロック図である。図１に示すように、この母集団生成装置１００は、通信部１１０、入力部１２０、表示部１３０、記憶部１４０、制御部１５０を有する。 An example of the population generation apparatus according to the present embodiment will be described. FIG. 1 is a functional block diagram illustrating the configuration of the population generation apparatus according to the present embodiment. As illustrated in FIG. 1, the population generation apparatus 100 includes a communication unit 110, an input unit 120, a display unit 130, a storage unit 140, and a control unit 150.

通信部１１０は、ネットワークを介して他の装置とデータ通信を実行する処理部である。例えば、通信部１１０は、通信装置等に対応する。例えば、通信部１１０は、後述するパラメータ情報１４１、サンプル集合情報１４２を、他の装置から受信した場合には、受信したパラメータ情報１４１、サンプル集合情報１４２を、制御部１５０に出力する。 The communication unit 110 is a processing unit that performs data communication with other devices via a network. For example, the communication unit 110 corresponds to a communication device or the like. For example, the communication unit 110 outputs the received parameter information 141 and sample set information 142 to the control unit 150 when receiving parameter information 141 and sample set information 142 described later from another apparatus.

入力部１２０は、各種の情報を母集団生成装置１００に入力する入力装置である。例えば、入力部１２０は、キーボードやマウス、タッチパネル等に対応する。例えば、利用者は、入力部１２０を操作して、パラメータ情報１４１、サンプル集合情報１４２を入力してもよい。 The input unit 120 is an input device that inputs various types of information to the population generation device 100. For example, the input unit 120 corresponds to a keyboard, a mouse, a touch panel, or the like. For example, the user may input the parameter information 141 and the sample set information 142 by operating the input unit 120.

表示部１３０は、制御部１５０から出力される各種データを表示する表示装置である。例えば、表示部１３０は、液晶ディスプレイやタッチパネル等に対応する。 The display unit 130 is a display device that displays various data output from the control unit 150. For example, the display unit 130 corresponds to a liquid crystal display, a touch panel, or the like.

記憶部１４０は、パラメータ情報１４１、サンプル集合情報１４２、ｋ近傍情報１４３、母集団情報１４４を有する。記憶部１４０は、例えば、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、フラッシュメモリ（Flash Memory）などの半導体メモリ素子などの記憶装置に対応する。 The storage unit 140 includes parameter information 141, sample set information 142, k neighborhood information 143, and population information 144. The storage unit 140 corresponds to a storage device such as a semiconductor memory element such as a random access memory (RAM), a read only memory (ROM), and a flash memory.

パラメータ情報１４１は、「近傍サイズｋ」と、「出力サイズｌ」と、「抽出点数ｍ」とを有する。近傍サイズｋは、サンプル集合情報１４２のある点のｋ近傍を求める場合のサイズを指定するパラメータである。出力サイズｌは、母集団情報１４４に含ませる点の数を指定するパラメータである。「抽出点数ｍ」は、サンプル集合情報１４２に含まれる点と、この点の近傍点を抽出して、後述する交叉カーネルに与える場合に、抽出する点の数を指定するパラメータである。 The parameter information 141 includes “neighbor size k”, “output size l”, and “number of extraction points m”. The neighborhood size k is a parameter for designating the size when the k neighborhood of a certain point in the sample set information 142 is obtained. The output size l is a parameter that specifies the number of points to be included in the population information 144. The “number of extraction points m” is a parameter that specifies the number of points to be extracted when the points included in the sample set information 142 and the points near the points are extracted and given to the crossing kernel described later.

サンプル集合情報１４２は、母集団に含まれる点の情報のうち、一部の点の情報に対応するサンプルである。図２は、サンプル集合情報のデータ構造の一例を示す図である。 The sample set information 142 is a sample corresponding to information on some points in the information on points included in the population. FIG. 2 is a diagram illustrating an example of a data structure of the sample set information.

図２に示す例において、横軸は属性「世帯主年齢」に対応する軸であり、縦軸は属性「世帯年収」に対応する軸である。また、各属性の値に対応して、各点ｘ_１〜ｘ_１５が配置される。図２に示す例では、説明の便宜上、２種類の属性に対応する２つの軸を示すが、このサンプル集合情報１４２は、２種類以上の属性に対応する軸を有する。サンプル集合情報１４２の各点ｘ_１〜ｘ_１５は、属性がｎ種類の場合には、ｎ次元のベクトルで表される。 In the example shown in FIG. 2, the horizontal axis is an axis corresponding to the attribute “household age”, and the vertical axis is an axis corresponding to the attribute “household annual income”. Further, the points x _{1 to} x ₁₅ are arranged corresponding to the values of the attributes. In the example shown in FIG. 2, for convenience of explanation, two axes corresponding to two types of attributes are shown, but this sample set information 142 has axes corresponding to two or more types of attributes. Each point x _{1 to} x ₁₅ of the sample set information 142 is represented by an n-dimensional vector when there are n types of attributes.

ｋ近傍情報１４３は、サンプル集合情報１４２の点と、この点の近傍に存在する近傍点との関係を示す情報である。図３は、ｋ近傍情報のデータ構造の一例を示す図である。図３に示すように、このｋ近傍情報１４３は、点と、近傍点とを対応付ける。近傍点の数は、近傍サイズｋにより指定される数となる。 The k-neighbor information 143 is information indicating a relationship between a point of the sample set information 142 and a neighboring point existing in the vicinity of this point. FIG. 3 is a diagram illustrating an example of a data structure of k neighborhood information. As shown in FIG. 3, the k neighborhood information 143 associates points with neighborhood points. The number of neighboring points is the number specified by the neighborhood size k.

母集団情報１４４は、サンプル集合情報１４２を基に生成される母集団の情報である。母集団情報１４４を生成する処理は後述する。 The population information 144 is population information generated based on the sample set information 142. The process for generating the population information 144 will be described later.

制御部１５０は、取得部１５１と、白色化処理部１５２と、特定部１５３と、生成部１５４と、逆白色化処理部１５５とを有する。制御部１５０は、例えば、ＡＳＩＣ（Application Specific Integrated Circuit）や、ＦＰＧＡ（Field Programmable Gate Array）などの集積装置に対応する。また、制御部１５０は、例えば、ＣＰＵ（Central Processing Unit）やＭＰＵ（Micro Processing Unit）等の電子回路に対応する。 The control unit 150 includes an acquisition unit 151, a whitening processing unit 152, a specifying unit 153, a generation unit 154, and an inverse whitening processing unit 155. The control unit 150 corresponds to an integrated device such as an application specific integrated circuit (ASIC) or a field programmable gate array (FPGA). The control unit 150 corresponds to an electronic circuit such as a CPU (Central Processing Unit) or an MPU (Micro Processing Unit).

取得部１５１は、通信部１１０または入力部１２０から、パラメータ情報１４１およびサンプル集合情報１４２を取得する。取得部１５１は、パラメータ情報１４１およびサンプル集合情報１４２を、記憶部１４０に登録する。 The acquisition unit 151 acquires parameter information 141 and sample set information 142 from the communication unit 110 or the input unit 120. The acquisition unit 151 registers the parameter information 141 and the sample set information 142 in the storage unit 140.

白色化処理部１５２は、サンプル集合情報１４２に対して白色化処理を実行する処理部である。白色化処理部１５２は、白色化処理を行ったサンプル集合情報１４２を、特定部１５３に出力する。以下の説明では、白色化処理を行ったサンプル集合情報１４２を、白色化サンプル集合と表記する。 The whitening processing unit 152 is a processing unit that performs whitening processing on the sample set information 142. The whitening processing unit 152 outputs the sample set information 142 that has been subjected to whitening processing to the specifying unit 153. In the following description, the sample set information 142 that has been subjected to the whitening process is referred to as a whitened sample set.

ここで、白色化処理の一例について説明する。白色化処理部１５２は、サンプル集合情報１４２に含まれる点全体の平均ベクトルが０となり、かつ、点全体の共分散行列が単位行列となるように、サンプル集合情報１４２の各点の位置を調整する。 Here, an example of the whitening process will be described. The whitening processing unit 152 adjusts the position of each point in the sample set information 142 so that the average vector of the entire points included in the sample set information 142 becomes 0 and the covariance matrix of the entire points becomes a unit matrix. To do.

白色化処理部１５２が、サンプル集合情報１４２に含まれる点の平均ベクトルを０にする処理の一例について説明する。白色化処理部１５２は、サンプル集合情報１４２に含まれる点の平均ベクトルを計算し、各点のベクトルの値から平均ベクトルの値を減算することにより、サンプル集合情報１４２に含まれる点の平均ベクトルを０にする。 An example of processing in which the whitening processing unit 152 sets the average vector of points included in the sample set information 142 to 0 will be described. The whitening processing unit 152 calculates the average vector of the points included in the sample set information 142, and subtracts the average vector value from the vector value of each point, thereby calculating the average vector of the points included in the sample set information 142. Set to 0.

白色化処理部１５２が、サンプル集合情報１４２に含まれる点の共分散行列が単位行列となるように調整する処理の一例について説明する。白色化処理部１５２は、サンプル集合情報１４２の各点に対して、各点の間隔を拡大、縮小する処理、原点を中心に各点の位置を回転する処理を実行することで、各点のベクトルの値を更新し、更新後の各点のベクトルの値によって、共分散行列を算出する。白色化処理部１５２は、共分散行列が単位行列となる、各点の間隔、回転量を算出する。 An example of processing in which the whitening processing unit 152 performs adjustment so that the covariance matrix of points included in the sample set information 142 becomes a unit matrix will be described. The whitening processing unit 152 performs, for each point of the sample set information 142, processing for enlarging or reducing the interval between the points, and processing for rotating the position of each point around the origin. The vector value is updated, and the covariance matrix is calculated based on the updated vector value of each point. The whitening processing unit 152 calculates an interval between each point and a rotation amount in which the covariance matrix is a unit matrix.

白色化処理部１５２は、上述したサンプル集合情報１４２に含まれる点の平均ベクトルを０にする処理と、共分散行列を単位行列とする処理とを実行することで、式（１）を満たす白色化行列「Ｔ」を特定する。式（１）において、「Ｓ」は、サンプル集合情報１４２に含まれる点のベクトル値を並べたサンプル行列を示すものであり、「Ｅ［ＴＳ］」は、白色化サンプル行列ＴＳの平均ベクトルを示すものであり、「Ｃｏｖ[ＴＳ]」は、白色化サンプル行列ＴＳの共分散行列を示すものであり、「Ｉ」は、単位行列を示すものである。 The whitening processing unit 152 performs processing that sets the average vector of points included in the sample set information 142 described above to 0 and processing that uses a covariance matrix as a unit matrix, thereby satisfying Equation (1). Specify the quantization matrix “T”. In Expression (1), “S” indicates a sample matrix in which vector values of points included in the sample set information 142 are arranged, and “E [TS]” indicates an average vector of the whitened sample matrix TS. “Cov [TS]” indicates a covariance matrix of the whitened sample matrix TS, and “I” indicates a unit matrix.

Ｅ[ＴＳ]＝０、Ｃｏｖ[ＴＳ]＝Ｉ・・・（１） E [TS] = 0, Cov [TS] = I (1)

例えば、白色化処理部１５２は、サンプル集合情報１４２に含まれる各点のベクトルに、白色化行列Ｔを乗算することで、各点のベクトルの値を更新し、白色化サンプル集合を生成する。白色化処理部１５２は、生成した白色化サンプル集合の情報を、特定部１５３および生成部１５４に出力する。また、白色化処理部１５２は、白色化行列Ｔの情報を逆白色化処理部１５５に出力する。 For example, the whitening processing unit 152 multiplies the vector of each point included in the sample set information 142 by the whitening matrix T, thereby updating the value of the vector of each point to generate a whitened sample set. The whitening processing unit 152 outputs information on the generated whitened sample set to the specifying unit 153 and the generating unit 154. Further, the whitening processing unit 152 outputs information on the whitening matrix T to the inverse whitening processing unit 155.

図４は、白色化サンプル集合のデータ構造の一例を示す図である。図４に示す例において、横軸は白色化処理部１５２が出力した第１主成分Ａ_１（一般に、前記世帯主年齢とは異なる）に対応する軸であり、縦軸は第２主成分Ａ_２（一般に、前記世帯年収とは異なる）に対応する軸である。白色化処理部１５２により白色化処理されることで、図２に示す各点ｘ_１〜ｘ_１５の位置は、図４に示す各点ｘ_１〜ｘ_１５の位置に移動している。 FIG. 4 is a diagram illustrating an example of the data structure of the whitened sample set. In the example illustrated in FIG. 4, the horizontal axis is an axis corresponding to the first principal component A ₁ (generally different from the household head age) output from the whitening processing unit 152, and the vertical axis is the second principal component A. ₂ (generally different from the household annual income). The whitening unit 152 by being processed whitened, the position of each point _x 1 _{~x 15} shown in FIG. 2 is moved to the position of each point _x 1 _{~x 15} shown in FIG.

特定部１５３は、白色化サンプル集合の各点について、点の近傍に存在する近傍点を特定する処理部である。特定部１５３は、点と、この点に対応する近傍点との関係を、ｋ近傍情報１４３に登録する。特定部１５３は、近傍点の数を、パラメータ情報１４１の近傍サイズｋに対応させる。 The specifying unit 153 is a processing unit that specifies, for each point in the whitened sample set, neighboring points that exist in the vicinity of the point. The specifying unit 153 registers the relationship between the point and the neighboring point corresponding to this point in the k neighborhood information 143. The identifying unit 153 associates the number of neighboring points with the neighborhood size k of the parameter information 141.

特定部１５３の処理の一例について説明する。図５は、近傍点を特定する処理を説明するための図である。特定部１５３は、各点について、他の点とのユーグリッド距離をそれぞれ算出する。図５の各矢印が、ユーグリッド距離を示す。特定部１５３は、基準となる点とのユーグリッド距離が最も近いものからｋ番目までの点を、基準となる点の近傍点とする。ｋ番目は、近傍サイズｋに対応する。 An example of processing of the specifying unit 153 will be described. FIG. 5 is a diagram for explaining the process of specifying the neighboring points. The specifying unit 153 calculates the Eugrid distance from each point for each point. Each arrow in FIG. 5 indicates the Eugrid distance. The identifying unit 153 sets the points from the closest Eugrid distance to the reference point to the k-th point as the vicinity points of the reference point. The kth corresponds to the neighborhood size k.

例えば、基準点を点ｘ_１とし、この点ｘ_１とのユーグリッド距離が近い点を近い順に、ｘ_２、ｘ_３、ｘ_４、ｘ_５、・・・とする。また、近傍サイズｋを「３」とする。この場合には、特定部１５３は、点ｘ_１の近傍点を「ｘ_２、ｘ_３、ｘ_４」とし、点と近傍点との関係を、ｋ近傍情報１４３に登録する。特定部１５３は、他の点ｘ_２〜ｘ_１５についても、同様にして、近傍点を特定し、ｋ近傍情報１４３に登録する。 For example, the reference point and the point _{x 1,} sequentially point near the Euclidean distance is short between the point _{_{_{_{x 1, x 2, x 3}}}} , x 4, x 5, and,. The neighborhood size k is set to “3”. In this case, the specifying unit 153 sets the neighborhood points of the point x ₁ to “x ₂ , x ₃ , x ₄ ”, and registers the relationship between the points and the neighborhood points in the k neighborhood information 143. The identifying unit 153 identifies neighboring points for the other points x _{2 to} x _{15 in} the same manner and registers them in the k neighboring information 143.

図１の説明に戻る。生成部１５４は、白色化サンプル集合に含まれる点とこの点に対応する近傍点とを選択し、選択した点および近傍点の分布に基づいて、新たな点を生成する処理部である。生成部１５４は、新たに生成した点を母集団情報１４４に追加し、母集団情報１４４に含まれる点の数が、出力サイズｌとなるまで、上記処理を繰り返し実行する。生成部１５４は、新たに生成した点を加えた母集団情報１４４を、逆白色化処理部１５５に出力する。 Returning to the description of FIG. The generation unit 154 is a processing unit that selects a point included in the whitened sample set and a neighboring point corresponding to this point, and generates a new point based on the selected point and the distribution of the neighboring point. The generation unit 154 adds the newly generated points to the population information 144 and repeats the above processing until the number of points included in the population information 144 reaches the output size l. The generation unit 154 outputs the population information 144 to which the newly generated points are added to the inverse whitening processing unit 155.

ここで、生成部１５４が新たな点を生成する処理の一例について説明する。生成部１５４は、白色化サンプル集合に含まれる複数の点から、点ｘを一様ランダムに選択する。また、生成部１５４は、一様にランダム選択した点ｘに対応する近傍点から、一様ランダムに非復元抽出する。生成部１５４は、点と近傍点との数を、抽出点数ｍと同数とする。 Here, an example of a process in which the generation unit 154 generates a new point will be described. The generation unit 154 uniformly selects a point x from a plurality of points included in the whitened sample set. Further, the generation unit 154 performs non-restoration extraction uniformly and randomly from neighboring points corresponding to the point x that is uniformly and randomly selected. The generation unit 154 sets the number of points and neighboring points to be the same as the number of extraction points m.

例えば、生成部１５４は、白色化サンプル集合に含まれる複数の点ｘ_１〜ｘ_１５から、１点を一様ランダムに選択して、点ｘ_１を選択した場合について説明する。生成部１５４は、ｋ近傍情報１４３を参照し、点ｘ_１に対応する近傍点を特定する。生成部１５４は、特定した近傍点から、「抽出点数ｍ−１」個の点を一様ランダムに非復元抽出する。 For example, a case where the generation unit 154 selects one point uniformly and randomly from a plurality of points x _{1 to} x ₁₅ included in the whitened sample set and selects the point x ₁ will be described. Generator 154, refers to the k-nearest neighbor information 143, identifies a neighboring point corresponding to the point _{x 1.} The generation unit 154 uniformly and non-restoratively extracts “number of extracted points m−1” points from the specified neighboring points.

以下の説明では、白色化サンプル集合から一様ランダムに抽出した点と、この点の近傍点から一様ランダムに非復元抽出した点とを適宜、抽出点と表記する。生成部１５４は、抽出点の数が、抽出点数ｍと同数となるように点を抽出する。 In the following description, a point extracted uniformly and randomly from the whitened sample set and a non-restored and extracted point uniformly from the vicinity of this point will be referred to as extracted points as appropriate. The generation unit 154 extracts points so that the number of extraction points is the same as the extraction point number m.

生成部１５４は、抽出点を交叉カーネルに与え、新たな点ｙを生成する。図６は、生成部の処理を説明するための図である。図６に示す例では、白色化サンプル集合から一様ランダムに抽出した点をｘ_１３とし、この点ｘ_１３に基づいて特定した近傍点を点ｘ_４、ｘ_６、ｘ_１２、ｘ_１４とする。この近傍点から一様ランダムに非復元抽出された点を点ｘ_１２、ｘ_１４とする。生成部１５４は、抽出点ｘ_１２、ｘ_１３、ｘ_１４を、交叉カーネルに与え、新たな点ｙ_１を生成する。生成部１５４は、点ｙ_１を、母集団情報１４４に加える。生成部１５４は、母集団情報１４４のサイズが、出力サイズｌとなるまで、新たな点を生成する処理を繰り返し実行する。 The generation unit 154 gives the extracted point to the crossing kernel and generates a new point y. FIG. 6 is a diagram for explaining the processing of the generation unit. In the example shown in FIG. 6, the points extracted uniformly randomly from whitened sample set as _{x 13,} to the neighboring point identified on the basis of this point _{x 13} a point _{_{_{x 4, x 6, x 12}}} , x 14 . Points x ₁₂ and x ₁₄ are points that have been non-restored and extracted uniformly and randomly from the neighboring points. The generation unit 154 gives the extraction points x ₁₂ , x ₁₃ , and x ₁₄ to the crossing kernel, and generates a new point y ₁ . The generation unit 154 adds the point y ₁ to the population information 144. The generation unit 154 repeatedly executes a process of generating new points until the size of the population information 144 reaches the output size l.

生成部１５４が利用する交叉カーネルの一例について説明する。交叉カーネルは、式（２）によって定義される。式（２）において式（５）は、抽出点の平均ベクトルを示し、式（３）によって定義される。式（１）において、ε_ｉは式（４）によって定義される。ε_ｉは、式（４）に示すように、平均「０」、標準偏差「１／ｍ」のガウス分布「Ｎ（０、１／ｍ）」に従う乱数である。式（２）から式（４）に示す「ｍ」は、抽出点数ｍに対応する。 An example of the crossing kernel used by the generation unit 154 will be described. The crossover kernel is defined by equation (2). In Expression (2), Expression (5) represents an average vector of extraction points and is defined by Expression (3). In equation (1), ε _i is defined by equation (4). ε _i is a random number according to a Gaussian distribution “N (0, 1 / m)” having an average “0” and a standard deviation “1 / m”, as shown in Expression (4). “M” shown in the equations (2) to (4) corresponds to the number of extraction points m.

生成部１５４は、出力サイズｌとなった母集団情報１４４を、逆白色化処理部１５５に出力する。 The generation unit 154 outputs the population information 144 having the output size l to the inverse whitening processing unit 155.

逆白色化処理部１５５は、母集団情報１４４に含まれる各点のベクトルに、白色化行列Ｔの逆行列「Ｔ^−１」を乗算することで、逆白色化処理を実行する処理部である。逆白色化処理部１５５は、白色化されている母集団情報１４４に対して逆白色化処理を行うことで、本来の属性値をもつ母集団集合情報１４４を生成する。逆白色化処理部１５５は、生成した母集団集合情報１４４を、記憶部１４０に登録する。 The inverse whitening processing unit 155 is a processing unit that performs the inverse whitening processing by multiplying the vector of each point included in the population information 144 by the inverse matrix “T ⁻¹ ” of the whitening matrix T. . The reverse whitening processing unit 155 performs reverse whitening on the whitened population information 144 to generate population set information 144 having original attribute values. The inverse whitening processing unit 155 registers the generated population set information 144 in the storage unit 140.

図７は、母集団集合情報のデータ構造の一例を示す図である。図７に示す例において、横軸は属性「世帯種年齢」に対応する軸であり、縦軸は属性「世帯年収」に対応する軸である。上記のように、生成部１５４によって、母集団情報１４４の点が、出力サイズｌまで増やされており、図２に示した少数のサンプルｘ_１〜ｘ_１５から、母集団全体が再構築される。 FIG. 7 is a diagram illustrating an example of a data structure of the population set information. In the example illustrated in FIG. 7, the horizontal axis is an axis corresponding to the attribute “household type age”, and the vertical axis is an axis corresponding to the attribute “household annual income”. As described above, the generator 154, the point of the population information 144 has been increased to output size l, from a small number of samples x ₁ ~x ₁₅ shown in FIG. 2, the reconstructed entire population .

次に、本実施例に係る母集団生成装置１００の処理手順について説明する。図８は、本実施例に係る母集団生成装置の処理手順を示すフローチャートである。図８に示すように、母集団生成装置１００の取得部１５１は、近傍サイズｋを取得する（ステップＳ１０１）。取得部１５１は、出力サイズｌを取得する（ステップＳ１０２）。取得部１５１は、抽出点数ｍを取得する（ステップＳ１０３）。取得部１５１は、取得した各パラメータを、パラメータ情報１４１に登録する。 Next, a processing procedure of the population generation apparatus 100 according to the present embodiment will be described. FIG. 8 is a flowchart illustrating the processing procedure of the population generation apparatus according to the present embodiment. As illustrated in FIG. 8, the acquisition unit 151 of the population generation device 100 acquires the neighborhood size k (step S101). The acquisition unit 151 acquires the output size l (step S102). The acquisition unit 151 acquires the extraction point m (step S103). The acquisition unit 151 registers each acquired parameter in the parameter information 141.

母集団生成装置１００の白色化処理部１５２は、サンプル集合情報１４２を取得し（ステップＳ１０４）、サンプル集合情報１４２に対して白色化処理を実行する（ステップＳ１０５）。 The whitening processing unit 152 of the population generation device 100 acquires the sample set information 142 (Step S104), and executes a whitening process on the sample set information 142 (Step S105).

母集団生成装置１００の特定部１５３は、白色化サンプル集合の各点のｋ近傍を計算する（ステップＳ１０６）。ステップＳ１０６において、特定部１５３は、白色化サンプル集合の各点について、点の近傍に存在する近傍点を特定し、特定結果をｋ近傍情報１４３に登録する。 The specifying unit 153 of the population generation apparatus 100 calculates the k neighborhood of each point of the whitened sample set (step S106). In step S <b> 106, the specifying unit 153 specifies a neighboring point that exists in the vicinity of the point for each point of the whitened sample set, and registers the specifying result in the k-neighbor information 143.

母集団生成装置１００の生成部１５４は、白色化サンプル集合から点ｘ_１を、一様ランダムに選択する（ステップＳ１０７）。ステップＳ１０７では一例として、一様ランダムに選択した点を、点ｘ_１とする。 Generator 154 of the population generator 100, the point _{x 1} from whitening sample set is selected uniformly randomly (step S107). As an example in step S107, the selected point on the uniform random, and the point x _1.

母集団生成装置１００の生成部１５４は、点ｘ_１のｋ近傍から点ｘ_２、・・・、ｘ_ｍを、一様ランダムに非復元抽出する（ステップＳ１０８）。生成部１５４は、抽出点ｘ_１、・・・ｘ_ｍを、交叉カーネルに与え、１点ｙを生成する（ステップＳ１０９）。ステップＳ１０９の抽出点ｘ_１、・・・ｘ_ｍは一例である。 The generation unit 154 of the population generation apparatus 100 performs non-restoration extraction of points x ₂ ,..., X _m uniformly and randomly from the vicinity of k of the point x ₁ (step S108). The generation unit 154 gives the extraction points x ₁ ,... X _m to the crossing kernel, and generates one point y (step S109). The extraction points x ₁ ,..., X _{m in} step S109 are an example.

生成部１５４は、ｙを母集団情報１４４に追加する（ステップＳ１１０）。生成部１５４は、母集団情報１４４のサイズがｌであるか否かを判定する（ステップＳ１１１）。生成部１５４は、白色化母集団のサイズがｌでない場合には（ステップＳ１１１，Ｎｏ）、ステップＳ１０７に移行する。 The generation unit 154 adds y to the population information 144 (Step S110). The generation unit 154 determines whether the size of the population information 144 is l (step S111). If the size of the whitening population is not l (No at Step S111), the generation unit 154 proceeds to Step S107.

一方、母集団情報１４４のサイズがｌの場合には（ステップＳ１１１，Ｙｅｓ）、母集団生成装置１００の逆白色化処理部１５５は、母集団情報１４４を逆白色化処理して母集団情報を生成する（ステップＳ１１２）。逆白色化処理部１５５は、母集団情報１４４を出力する（ステップＳ１１３）。 On the other hand, when the size of the population information 144 is 1 (Yes at Step S111), the inverse whitening processing unit 155 of the population generation apparatus 100 performs the whitening process on the population information 144 to obtain the population information. Generate (step S112). The inverse whitening processing unit 155 outputs the population information 144 (step S113).

次に、本実施例に係る母集団生成装置１００の効果について説明する。母集団生成装置１００の特定部１５３が、白色化処理を行ったサンプル集合に含まれる複数の点について、点の近傍に存在する近傍点をそれぞれ特定する。母集団生成装置１００の生成部１５４は、白色化処理を行ったサンプル集合に含まれる点とこの点に対応する近傍点とを選択し、選択した点および近傍点の分布に基づいて新たな点を生成し、生成した点を母集団に追加する。このため、本実施例に係る母集団生成装置１００によれば、計算量が大きくなることを防いで、少数のサンプルから母集団を構築することができる。 Next, the effect of the population generation apparatus 100 according to the present embodiment will be described. The specifying unit 153 of the population generating apparatus 100 specifies neighboring points that exist in the vicinity of the points for a plurality of points included in the sample set subjected to the whitening process. The generation unit 154 of the population generation apparatus 100 selects a point included in the sample set subjected to whitening processing and a neighboring point corresponding to this point, and creates a new point based on the selected point and the distribution of the neighboring point. And add the generated points to the population. For this reason, according to the population generation apparatus 100 according to the present embodiment, it is possible to prevent a calculation amount from increasing and to construct a population from a small number of samples.

例えば、本実施例にかかる母集団生成装置１００の生成部１４５は、サンプル集合に含まれる点とこの点に対応する近傍点とを、式（１）に示す交叉カーネルに与え、新たな点を生成する。この交叉カーネルは、例えば、式（１）に基づいて、新たな点を生成するものであり、従来のカーネル密度推定のように最適なバンド幅行列を算出するステップが存在しない。このため、計算量を大きくすることなく、高次元の複雑な分布をもつ母集団集合を生成することができる。 For example, the generation unit 145 of the population generation apparatus 100 according to the present embodiment gives a point included in the sample set and a neighboring point corresponding to this point to the crossing kernel shown in Expression (1), and adds a new point. Generate. This crossing kernel is for generating a new point based on, for example, Expression (1), and there is no step of calculating an optimum bandwidth matrix as in the conventional kernel density estimation. Therefore, a population set having a high-dimensional complex distribution can be generated without increasing the amount of calculation.

また、本実施例に係る母集団生成装置１００は、少数の点を有するサンプル集合情報１４２を基にして、出力サイズｌの母集団情報１４４を生成する。このため、調査コストやプライバシーの観点から、入手することが難しい実測データに近い情報を取得することができ、各種のエージェントシミュレーションに利用することができる。 Further, the population generation apparatus 100 according to the present embodiment generates the population information 144 of the output size l based on the sample set information 142 having a small number of points. For this reason, it is possible to acquire information close to actual measurement data that is difficult to obtain from the viewpoint of survey cost and privacy, and it can be used for various agent simulations.

ところで、本実施例に係る母集団生成装置１００の生成部１５４は、サンプル集合に含まれる点とこの点に対応する近傍点とを交叉カーネルに与えて新しい点を生成していたが、新しい点を生成する処理はこれに限定されない。生成部１５４は、サンプル集合の点について、該点の近傍点のｎ次までの統計的モーメントを持つ確率密度関数を割り当て、割り当てた確率密度関数に従って、新しい点を生成してもよい。 Incidentally, the generation unit 154 of the population generation apparatus 100 according to the present embodiment generates a new point by giving a point included in the sample set and a neighboring point corresponding to this point to the crossing kernel. The process of generating is not limited to this. The generation unit 154 may assign a probability density function having statistical moments up to the n-th order of neighboring points of the sample set point to generate a new point according to the assigned probability density function.

図９は、生成部１５４が、新しい点を生成するその他の処理の一例を示す図である。ここでは説明の便宜上、サンプル集合に含まれる点を１次元にし、サンプル集合の点をｘ_１〜ｘ_４とする。生成部１５４は、サンプル集合に含まれる点ｘ_１，ｘ_２，ｘ_３、ｘ_４のガウス分布を特定する。ｘ_１のガウス分布をガウス分布１０ａとし、ｘ_２のガウス分布をガウス分布１０ｂとし、ｘ_３のガウス分布をガウス分布１０ｃとし、ｘ_４のガウス分布をガウス分布１０ｄとする。生成部１５４は、ガウス分布１０ａ〜１０ｄを平均した確率分布２０を特定する。生成部１５４は、係る確率分布２０に従って、新たな点ｙを生成する。このように、生成部１５４は、交叉カーネル以外の確率密度関数を利用して、新たな点を生成してもよい。 FIG. 9 is a diagram illustrating an example of other processing in which the generation unit 154 generates a new point. For convenience of explanation, a point included in the sample set into a one-dimensional, the points of the sample collection and x ₁ ~x _4. The generation unit 154 specifies a Gaussian distribution of points x ₁ , x ₂ , x ₃ , x ₄ included in the sample set. The Gaussian distribution of x ₁ where the Gaussian distribution 10a, the Gaussian distribution of the _{x 2} where the Gaussian distribution 10b, and Gaussian distribution of _{x 3} where the Gaussian distribution 10c, and Gaussian 10d Gaussian distribution of _{x 4.} The generation unit 154 specifies the probability distribution 20 obtained by averaging the Gaussian distributions 10a to 10d. The generation unit 154 generates a new point y according to the probability distribution 20. In this way, the generation unit 154 may generate a new point using a probability density function other than the crossover kernel.

次に、上記実施例に示した母集団生成装置１００と同様の機能を実現する母集団生成プログラムを実行するコンピュータの一例について説明する。図１０は、母集団生成プログラムを実行するコンピュータの一例を示す図である。 Next, an example of a computer that executes a population generation program that realizes the same function as the population generation apparatus 100 shown in the above embodiment will be described. FIG. 10 is a diagram illustrating an example of a computer that executes a population generation program.

図１０に示すように、コンピュータ２００は、各種演算処理を実行するＣＰＵ２０１と、ユーザからのデータの入力を受け付ける入力装置２０２と、ディスプレイ２０３とを有する。また、コンピュータ２００は、記憶媒体からプログラム等を読取る読み取り装置２０４と、ネットワークを介して他のコンピュータとの間でデータの授受を行うインターフェース装置２０５とを有する。また、コンピュータ２００は、各種情報を一時記憶するＲＡＭ２０６と、ハードディスク装置２０７とを有する。そして、各装置２０１〜２０７は、バス２０８に接続される。 As illustrated in FIG. 10, the computer 200 includes a CPU 201 that executes various arithmetic processes, an input device 202 that receives data input from a user, and a display 203. The computer 200 includes a reading device 204 that reads a program and the like from a storage medium, and an interface device 205 that exchanges data with other computers via a network. The computer 200 also includes a RAM 206 that temporarily stores various information and a hard disk device 207. The devices 201 to 207 are connected to the bus 208.

ハードディスク装置２０７は、特定プログラム２０７ａ、生成プログラム２０７ｂを有する。ＣＰＵ２０１は、特定プログラム２０７ａ、生成プログラム２０７ｂを読み出してＲＡＭ２０６に展開する。特定プログラム２０７ａは、特定プロセス２０６ａとして機能する。生成プログラム２０７ｂは、生成プロセス２０６ｂとして機能する。 The hard disk device 207 has a specific program 207a and a generation program 207b. The CPU 201 reads the specific program 207 a and the generation program 207 b and develops them in the RAM 206. The specific program 207a functions as a specific process 206a. The generation program 207b functions as a generation process 206b.

例えば、特定プロセス２０６ａは、特定部１５３に対応する。生成プロセス２０６ｂは、生成部１５４に対応する。 For example, the specifying process 206 a corresponds to the specifying unit 153. The generation process 206 b corresponds to the generation unit 154.

なお、特定プログラム２０７ａ、生成プログラム２０７ｂについては、必ずしも最初からハードディスク装置２０７に記憶させておかなくても良い。例えば、コンピュータ２００に挿入されるフレキシブルディスク（ＦＤ）、ＣＤ−ＲＯＭ、ＤＶＤディスク、光磁気ディスク、ＩＣカードなどの「可搬用の物理媒体」に各プログラムを記憶させておく。そして、コンピュータ２００が特定プログラム２０７ａ、生成プログラム２０７ｂを読み出して実行するようにしてもよい。 Note that the specific program 207a and the generation program 207b are not necessarily stored in the hard disk device 207 from the beginning. For example, each program is stored in a “portable physical medium” such as a flexible disk (FD), a CD-ROM, a DVD disk, a magneto-optical disk, and an IC card inserted into the computer 200. Then, the computer 200 may read and execute the specific program 207a and the generation program 207b.

以上の各実施例を含む実施形態に関し、さらに以下の付記を開示する。 The following supplementary notes are further disclosed with respect to the embodiments including the above examples.

（付記１）コンピュータに、
サンプル集合に含まれる主成分毎の値に対応付けられた複数の点について、各点の近傍に存在する近傍点をそれぞれ特定し、
前記サンプル集合に含まれる点と該点に対応する近傍点とを選択し、選択した点および近傍点を所定の確率密度関数に割り当てることで、新たな点が取り得る主成分毎の値と確率とを対応付けた分布を生成し、生成した分布に基づいて新たな点を生成し、
生成した新たな点を母集団に追加する
処理を実行させることを特徴とする母集団生成プログラム。 (Supplementary note 1)
For a plurality of points associated with the values for each principal component included in the sample set, identify neighboring points that exist in the vicinity of each point,
By selecting a point included in the sample set and a neighboring point corresponding to the point, and assigning the selected point and the neighboring point to a predetermined probability density function, the value and probability of each principal component that can be taken by the new point Is generated, a new point is generated based on the generated distribution,
A population generation program characterized by executing a process of adding a generated new point to a population.

（付記２）前記新たな点を生成する処理は、前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の平均ベクトルを算出し、選択した点および近傍点の各ベクトルと、前記平均ベクトルと、所定のガウス分布に従う値に基づいて、前記新たな点を生成することを特徴とする付記１に記載の母集団生成プログラム。 (Additional remark 2) The process which produces | generates the said new point selects the point contained in the said sample set, and the neighboring point corresponding to the said point, calculates the average vector of the selected point and a neighboring point, and selected point The population generation program according to appendix 1, wherein the new point is generated based on each vector of neighboring points, the average vector, and a value according to a predetermined Gaussian distribution.

（付記３）前記新たな点を生成する処理は、所定数未満の点を有する前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の分布に基づいて、新たな点を生成する処理を繰り返し実行することで、所定数以上の点を有する母集団を生成することを特徴とする付記１または２に記載の母集団生成プログラム。 (Additional remark 3) The process which produces | generates the said new point selects the point contained in the said sample set which has less than a predetermined number of points, and the neighboring point corresponding to the said point, and selects the distribution of the selected point and the neighboring point. The population generation program according to appendix 1 or 2, wherein a population having a predetermined number or more points is generated by repeatedly executing a process of generating new points based on the above.

（付記４）コンピュータが実行する点生成方法であって、
サンプル集合に含まれる主成分毎の値に対応付けられた複数の点について、各点の近傍に存在する近傍点をそれぞれ特定し、
前記サンプル集合に含まれる点と該点に対応する近傍点とを選択し、選択した点および近傍点を所定の確率密度関数に割り当てることで、新たな点が取り得る主成分毎の値と確率とを対応付けた分布を生成し、生成した分布に基づいて新たな点を生成し、
生成した新たな点を母集団に追加する
処理を実行することを特徴とする母集団生成方法。 (Appendix 4) A point generation method executed by a computer,
For a plurality of points associated with the values for each principal component included in the sample set, identify neighboring points that exist in the vicinity of each point,
By selecting a point included in the sample set and a neighboring point corresponding to the point, and assigning the selected point and the neighboring point to a predetermined probability density function, the value and probability of each principal component that can be taken by the new point Is generated, a new point is generated based on the generated distribution,
A method of generating a population, characterized by executing a process of adding a generated new point to the population.

（付記５）前記新たな点を生成する処理は、前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の平均ベクトルを算出し、選択した点および近傍点の各ベクトルと、前記平均ベクトルと、所定のガウス分布に従う値に基づいて、前記新たな点を生成することを特徴とする付記４に記載の母集団生成方法。 (Additional remark 5) The process which produces | generates the said new point selects the point contained in the said sample set, and the neighboring point corresponding to the said point, calculates the average vector of the selected point and a neighboring point, and selected point The population generation method according to appendix 4, wherein the new point is generated based on each vector of neighboring points, the average vector, and a value according to a predetermined Gaussian distribution.

（付記６）前記新たな点を生成する処理は、所定数未満の点を有する前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の分布に基づいて、新たな点を生成する処理を繰り返し実行することで、所定数以上の点を有する母集団を生成することを特徴とする付記４または５に記載の母集団生成方法。 (Additional remark 6) The process which produces | generates the said new point selects the point contained in the said sample set which has less than a predetermined number of points, and the neighboring point corresponding to the said point, and selects the distribution of the selected point and the neighboring point. 6. The population generation method according to appendix 4 or 5, wherein a population having a predetermined number or more points is generated by repeatedly executing a process of generating new points.

（付記７）サンプル集合に含まれる主成分毎の値に対応付けられた複数の点について、各点の近傍に存在する近傍点をそれぞれ特定する特定部と、
前記サンプル集合に含まれる点と該点に対応する近傍点とを選択し、選択した点および近傍点を所定の確率密度関数に割り当てることで、新たな点が取り得る主成分毎の値と確率とを対応付けた分布を生成し、生成した分布に基づいて新たな点を生成し、生成した新たな点を母集団に追加する生成部と
を有することを特徴とする母集団生成装置。 (Additional remark 7) About the some point matched with the value for every main component contained in a sample set, the specific part which each specifies the nearby point which exists in the vicinity of each point,
By selecting a point included in the sample set and a neighboring point corresponding to the point, and assigning the selected point and the neighboring point to a predetermined probability density function, the value and probability of each principal component that can be taken by the new point And a generation unit for generating a new point based on the generated distribution, and adding the generated new point to the population.

（付記８）前記生成部は、前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の平均ベクトルを算出し、選択した点および近傍点の各ベクトルと、前記平均ベクトルと、所定のガウス分布に従う値に基づいて、前記新たな点を生成することを特徴とする付記７に記載の母集団生成装置。 (Supplementary Note 8) The generation unit selects a point included in the sample set and a neighboring point corresponding to the point, calculates an average vector of the selected point and the neighboring point, and selects each of the selected point and the neighboring point. The population generation apparatus according to appendix 7, wherein the new point is generated based on a vector, the average vector, and a value according to a predetermined Gaussian distribution.

（付記９）前記生成部は、所定数未満の点を有する前記サンプル集合に含まれる点と前記点に対応する近傍点とを選択し、選択した点および近傍点の分布に基づいて、新たな点を生成する処理を繰り返し実行することで、所定数以上の点を有する母集団を生成することを特徴とする付記７または８に記載の母集団生成装置。 (Supplementary Note 9) The generation unit selects a point included in the sample set having less than a predetermined number of points and a neighboring point corresponding to the point, and generates a new point based on the selected point and the distribution of neighboring points. The population generation apparatus according to appendix 7 or 8, wherein a population having a predetermined number or more points is generated by repeatedly executing a process of generating points.

１００母集団生成装置
１５３特定部
１５４生成部 100 population generation device 153 identification unit 154 generation unit

Claims

On the computer,
For a plurality of points associated with the values for each principal component included in the sample set, identify neighboring points that exist in the vicinity of each point,
By selecting a point included in the sample set and a neighboring point corresponding to the point, and assigning the selected point and the neighboring point to a predetermined probability density function, the value and probability of each principal component that can be taken by the new point Is generated, a new point is generated based on the generated distribution,
A population generation program characterized by executing a process of adding a generated new point to a population.

The process of generating the new point selects a point included in the sample set and a neighboring point corresponding to the point, calculates an average vector of the selected point and the neighboring point, and selects the selected point and the neighboring point. The population generation program according to claim 1, wherein the new point is generated based on each vector, the average vector, and a value according to a predetermined Gaussian distribution.

The process of generating the new point selects a point included in the sample set having less than a predetermined number of points and a neighboring point corresponding to the point, and based on the selected point and the distribution of neighboring points, a new point is created. The population generation program according to claim 1 or 2, wherein a population having a predetermined number or more points is generated by repeatedly executing a process for generating such points.

A point generation method executed by a computer,
For a plurality of points associated with the values for each principal component included in the sample set, identify neighboring points that exist in the vicinity of each point,
By selecting a point included in the sample set and a neighboring point corresponding to the point, and assigning the selected point and the neighboring point to a predetermined probability density function, the value and probability of each principal component that can be taken by the new point Is generated, a new point is generated based on the generated distribution,
A method of generating a population, characterized by executing a process of adding a generated new point to the population.

For a plurality of points associated with the values for each principal component included in the sample set, a specifying unit that specifies each neighboring point existing in the vicinity of each point;
By selecting a point included in the sample set and a neighboring point corresponding to the point, and assigning the selected point and the neighboring point to a predetermined probability density function, the value and probability of each principal component that can be taken by the new point And a generation unit for generating a new point based on the generated distribution, and adding the generated new point to the population.