JP6721535B2

JP6721535B2 - LLE calculation device, LLE calculation method, and LLE calculation program

Info

Publication number: JP6721535B2
Application number: JP2017093347A
Authority: JP
Inventors: 靖宏藤原
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2017-05-09
Filing date: 2017-05-09
Publication date: 2020-07-15
Anticipated expiration: 2037-05-09
Also published as: JP2018190251A

Description

本発明は、ＬＬＥ計算装置、ＬＬＥ計算方法及びＬＬＥ計算プログラムに関する。 The present invention relates to an LLE calculation device, an LLE calculation method, and an LLE calculation program.

ＬＬＥ（Locally Linear Embedding）は非線形次元削減の代表的な手法の一つである。ＬＬＥの基本的アイデアはそれぞれのデータポイントをそれらの近傍のデータポイントによる回帰によって近似し、各データポイントの低次元への埋め込みを計算するというものである。ＬＬＥでは再構築コストと埋め込みコストを最小化する二つの最適化問題を解く。これらの最適化において乱数による初期値の生成や学習率の設定等は必要でないため、ＬＬＥは効果的に次元削減を行うことができる。 LLE (Locally Linear Embedding) is one of the typical methods for reducing nonlinear dimensions. The basic idea of LLE is to approximate each data point by regression with its neighboring data points and calculate the embedding of each data point into the low dimension. LLE solves two optimization problems that minimize reconstruction cost and embedding cost. In these optimizations, generation of initial values by random numbers, setting of learning rate, etc. are not necessary, so that LLE can effectively reduce the dimension.

Sam T. Roweis and Lawrence K. Saul, Nonlinear Dimensionality Reduction by Locally Linear Embedding, Science, 290(5500):2323-2326, 2000.Sam T. Roweis and Lawrence K. Saul, Nonlinear Dimensionality Reduction by Locally Linear Embedding, Science, 290(5500):2323-2326, 2000.

しかしながら、従来のＬＬＥには計算に要するコストが高いという問題があった。例えば、Ｋを各データポイントへの近傍のノードからのエッジの数とすると、従来のＬＬＥのアルゴリズムは各データポイントに対してＫ×Ｋの大きさのグラム行列を計算し、ラグランジュの未定乗数法を用いてグラム行列の逆行列からエッジの重みを計算する（例えば、非特許文献１を参照）。逆行列を計算するコストは各データポイントへのエッジの数の３乗であるため、Ｎをデータポイントの数としたときにエッジの重みを計算するコストはＯ（ＮＫ^３）となる。 However, the conventional LLE has a problem that the calculation cost is high. For example, where K is the number of edges from neighboring nodes to each data point, the conventional LLE algorithm calculates a Gram matrix of size K×K for each data point, and the Lagrange undetermined multiplier method. Is used to calculate the edge weight from the inverse matrix of the Gram matrix (for example, see Non-Patent Document 1). Since the cost of calculating the inverse matrix is the cube of the number of edges to each data point, the cost of calculating the edge weight is O(NK ³ ) when N is the number of data points.

また各データポイントの低次元への埋め込みを計算するために従来のＬＬＥのアルゴリズムはＬＬＥカーネルの固有値ベクトルを計算する。ＬＬＥカーネルの大きさはＮ×Ｎであるため、固有ベクトルを計算するために必要なコストはＯ（Ｎ^３）となる。さらに固有値ベクトルを保持するためにＯ（Ｎ^２）のメモリコストが必要となる。そのため大規模なデータに対してＬＬＥを適用することは多くの計算コストとメモリコストが必要となるため非現実的であった。 Also, the conventional LLE algorithm calculates the eigenvalue vector of the LLE kernel in order to calculate the embedding of each data point in the lower dimension. Since the size of the LLE kernel is N×N, the cost required to calculate the eigenvector is O(N ³ ). Furthermore, a memory cost of O(N ² ) is required to hold the eigenvalue vector. Therefore, applying LLE to large-scale data is unrealistic because it requires a large amount of calculation cost and memory cost.

本発明のＬＬＥ計算装置は、複数のデータポイントを有する多次元行列の特異値分解を計算する特異値分解計算部と、前記複数のデータポイントから計算対象のデータポイントを選択するデータポイント選択部と、前記複数のデータポイントのそれぞれについて、前記計算対象のデータポイントとの間の近似の距離を、前記特異値分解に基づいて計算する近似距離計算部と、前記近似の距離が所定値以下であるデータポイントのそれぞれと、前記計算対象のデータポイントとの間のユークリッド距離の推定値を計算する距離推定部と、前記ユークリッド距離の推定値が所定値以下であるデータポイントのそれぞれと、前記計算対象のデータポイントとの間のユークリッド距離を計算し、当該計算したユークリッド距離が所定値以下であるデータポイントを、前記計算対象のデータポイントの近傍のデータポイントに確定する近傍確定部と、前記計算対象のデータポイントと前記計算対象のデータポイントの近傍のデータポイントのそれぞれとの間のエッジの重みを計算するエッジ重み計算部と、前記エッジの重みに基づいて、前記複数のデータポイントについてのＬＬＥカーネルの固有ベクトルを計算するベクトル計算部と、を有することを特徴とする。 The LLE calculation device of the present invention includes a singular value decomposition calculation unit that calculates a singular value decomposition of a multidimensional matrix having a plurality of data points, and a data point selection unit that selects a data point to be calculated from the plurality of data points. For each of the plurality of data points, an approximate distance between the data point to be calculated and an approximate distance calculation unit that calculates based on the singular value decomposition, and the approximate distance is less than or equal to a predetermined value. A distance estimation unit that calculates an estimated value of the Euclidean distance between each of the data points and the data point of the calculation target, each of the data points for which the estimated value of the Euclidean distance is a predetermined value or less, and the calculation target A proximity determining unit that determines the Euclidean distance between the data point and the data point whose calculated Euclidean distance is less than or equal to a predetermined value as a data point near the data point to be calculated, and the calculation target. Edge weight calculator for calculating the edge weight between each data point and each of the data points in the vicinity of the data point to be calculated, and an LLE kernel for the plurality of data points based on the edge weight. And a vector calculation unit that calculates an eigenvector of

本発明によれば、ＬＬＥの計算に要するコストを低減させることができる。 According to the present invention, the cost required for calculating LLE can be reduced.

図１は、第１の実施形態に係るＬＬＥ計算装置の構成の一例を示す図である。FIG. 1 is a diagram illustrating an example of the configuration of the LLE calculation device according to the first embodiment. 図２は、補題１を示す図である。FIG. 2 is a diagram showing Lemma 1. 図３は、補題２を示す図である。FIG. 3 is a diagram showing Lemma 2. 図４は、補題３を示す図である。FIG. 4 is a diagram showing Lemma 3. 図５は、第１の実施形態に係るＬＬＥ計算処理のアルゴリズムの一例を示す図である。FIG. 5 is a diagram illustrating an example of an algorithm of the LLE calculation process according to the first embodiment. 図６は、定理１、定理２及び定理３を示す図である。FIG. 6 is a diagram showing Theorem 1, Theorem 2 and Theorem 3. 図７は、第１の実施形態に係るＬＬＥ計算装置の処理の流れを示すフローチャートである。FIG. 7 is a flowchart showing a processing flow of the LLE calculation device according to the first embodiment. 図８は、ＬＬＥ計算プログラムを実行するコンピュータの一例を示す図である。FIG. 8 is a diagram illustrating an example of a computer that executes the LLE calculation program.

以下に、本願に係るＬＬＥ計算装置、ＬＬＥ計算方法及びＬＬＥ計算プログラムの実施形態を図面に基づいて詳細に説明する。なお、本発明は、以下に説明する実施形態により限定されるものではない。 Hereinafter, embodiments of an LLE calculation device, an LLE calculation method, and an LLE calculation program according to the present application will be described in detail with reference to the drawings. The present invention is not limited to the embodiments described below.

［従来のＬＬＥ］
まず従来のＬＬＥの説明を行う。Ｘ∈Ｒ^Ｎ×ＭをＭ次元からなるＮ個のデータポイントの行列とする。ｘ_ｉ＝（ｘ_ｉ［１］，ｘ_ｉ［２］，．．．，ｘ_ｉ［Ｍ］）を行列Ｘのｉ番目の行ベクトルとすると、ｘ_ｉはｉ番目のデータポイントに対応する。 [Conventional LLE]
First, the conventional LLE will be described. Let XεR ^N×M be a matrix of N data points of M dimensions. Let x _i =(x _i [1], x _i [2],..., x _i [M]) be the i th row vector of matrix X, then x _i corresponds to the i th data point.

ＬＬＥは高次元のデータセットＸから低次元のデータセットＹ∈Ｒ^Ｎ×ｍを計算する。ｙ_ｉ＝（ｙ_ｉ［１］，ｙ_ｉ［２］，．．．，ｙ_ｉ［ｍ］）を長さがｍの行ベクトルとすると、ｙ_ｉはデータポイントｘ_ｉの低次元へのマッピングに対応する。ＬＬＥにおける基本的考えはデータポイントは大域的に非線形な構造を持っていたとしても、各データポイントとそれらの近傍のデータポイントは局所的に線形に近似できるというものである。近傍から各データポイントを表現することでＬＬＥは高次元のデータを低次元で表現することができる。 LLE computes a low-dimensional dataset YεR ^N×m from a high-dimensional dataset X. Let y _i =(y _i [1], y _i [2],..., y _i [m]) be a row vector of length m, and y _i is the mapping of data point x _{i to} the low dimension. Corresponding to. The basic idea in LLE is that even if a data point has a globally nonlinear structure, each data point and its neighboring data points can be locally linearly approximated. By expressing each data point from the neighborhood, LLE can express high-dimensional data in low dimension.

従来のＬＬＥのアルゴリズムは再構築コストと埋め込みコストを最小化する二つの最適化問題を解くパートから構成される。はじめのパートでは各データポイントに対してユークリッド距離から近傍のデータポイントを計算し、それらの回帰分析によりエッジの重みを計算する。ｘ_ｐを回帰分析によって表現するデータポイントとすると、従来のＬＬＥのアルゴリズムは式（１）の再構築コストを最小化する。 The conventional LLE algorithm is composed of two optimization problem solving parts that minimize the reconstruction cost and the embedding cost. In the first part, for each data point, the neighboring data points are calculated from the Euclidean distance, and the edge weight is calculated by their regression analysis. The conventional LLE algorithm minimizes the reconstruction cost of equation (1), where x _p is the data point represented by the regression analysis.

式（１）において｜｜・｜｜はベクトルのＬ２ノルムであり、^〜ｘ_ｐは式（２）のように計算されるｘ_ｐの回帰分析の結果である。ここで^〜・は・の直上に^〜が記された記号を示すものとする。 || · || in formula (1) is the L2 norm of the ^vector, the ~ _{x p} is the result of regression analysis of the calculated _{x p} by the equation (2). Here, ^~ -indicates a symbol in which ^~ is written immediately above.

式（２）においてＮ［ｘ_ｐ］はデータポイントｘ_ｐのＫ個の近傍のデータポイントの集合であり、Ｗ［ｉ，ｊ］はデータポイントｘ_ｊからｘ_ｉのエッジの重みである。重みを計算するために、式（１）においてデータポイントｘ_ｐに対するエッジの重みの和が１になるように再構築コストを最小化する。再構築コストを最小化するためにグラム行列にラグランジュの未定乗数法を適用する。具体的にはｘ_ｉとｘ_ｊをデータポイントｘ_ｐに対するＫ近傍とすると、データポイントｘ_ｐに対するＫ×Ｋのグラム行列Ｇ_ｐの要素は、Ｇ_ｐ［ｉ，ｊ］をグラム行列Ｇ_ｐの［ｉ，ｊ］成分としたときに式（３）のように計算される。 In Equation (2), N[x _p ] is a set of K neighboring data points of the data point x _p , and W[i,j] is an edge weight of the data points x _j to x _i . To calculate the weights, the reconstruction cost is minimized so that the sum of the edge weights for the data points x _p in equation (1) is 1. Apply Lagrange's undetermined multiplier method to the Gram matrix to minimize the reconstruction cost. And in particular with the K near the _{x i} and _{x j} for the data points _{x p,} elements of the Gram matrix _{G p} of K × K for the data point _{x p} _is, G p _[i, j] the Gram matrix _{G p} When the [i,j] component is used, it is calculated as in Expression (3).

ここで＜・，・＞は二つのベクトルの内積とする。Ｇ_ｐ ^−１を行列Ｇ_ｐの逆行列としたときに各データポイントのエッジの重みは式（４）のように計算される。 Here, ···· is the inner product of two vectors. When G _p ⁻¹ is an inverse matrix of the matrix G _p , the edge weight of each data point is calculated as in Expression (4).

式（４）のようにエッジの重みを計算するには逆行列Ｇ_ｐ ^−１を求める必要がある。ＬＬＥのアルゴリズムの二つ目のパートでは式（５）の埋め込みコストを最小化することでデータポイントｘ_ｐに対するベクトルｙ_ｐを求める。 In order to calculate the edge weight as in Expression (4), it is necessary to obtain the inverse matrix G _p ⁻¹ . The second part of the LLE algorithm determining the vector y _p for the data points x _p by minimizing the embedded cost of formula (5).

Ｉを単位行列としたときにＫ＝（Ｉ−Ｗ）^Ｔ（Ｉ−Ｗ）をＮ×Ｎの大きさのＬＬＥカーネルとし、ＷをＮ×Ｎのエッジの重みの行列とすると、埋め込みコストはＹ^ＴＫＹと書き換えることができる。ここで・^Ｔは行列・の転置行列とする。 When I is an identity matrix and K=(I−W) ^T (I−W) is an LLE kernel of size N×N, and W is a matrix of N×N edge weights, the embedding cost is It can be rewritten as Y ^T KY. Here, · ^T is a transposed matrix of matrix.

埋め込みコストには二つの制約がある。はじめの制約はＹ^ＴＹ＝Ｉというものである。これは解がランクｍということである。また０を零ベクトルとしたときにΣ_ｐｙ_ｐ＝０であるというものである。よってＬＬＥのアルゴリズムの二つ目のパートは式（６）の最適化問題を解くこととなる。 There are two restrictions on the embedding cost. The first constraint is that Y ^T Y=I. This means that the solution is rank m. Further, when 0 is a zero vector, Σ _p y _p =0. Therefore, the second part of the LLE algorithm is to solve the optimization problem of equation (6).

式（６）はカーネルＫの底のｍ＋１個の固有ベクトルが解となる固有値問題である。なおここで最小の固有値の固有ベクトルは全ての要素が同一になるため破棄される。 Expression (6) is an eigenvalue problem in which the m+1 eigenvectors at the bottom of the kernel K are solutions. Here, the eigenvector having the smallest eigenvalue is discarded because all the elements are the same.

従来のＬＬＥのアルゴリズムは高い計算コストが必要になる。はじめのパートでは行列Ｇ_ｐを計算し、その逆行列を計算する。行列Ｇ_ｐの大きさがＫ×Ｋであり、その要素が長さＭのベクトルの内積から計算されるため、行列Ｇ_ｐを全てのデータポイントに対して求める計算コストはＯ（ＮＫ^２Ｍ）となる。また行列Ｇ_ｐの逆行列を計算する必要があるため、エッジの重みを求めるためにＯ（ＮＫ^３）の計算コストが必要になる。さらにすべてのノードに対して近傍を求めるのにＯ（Ｎ^２Ｍ）の計算コストが必要になる。また二つ目のパートではＯ（Ｎ^３）の計算コストが必要になる。これはＮ×ＮのカーネルＫの固有値分解を行う必要があるからである。そのため従来のＬＬＥの計算コストはＯ（Ｎ（Ｋ^２Ｍ＋Ｋ^３）＋Ｎ^２Ｍ＋Ｎ^３）となる。またメモリコストはカーネルＫの固有値分解を行うためＯ（Ｎ^２）となる。結果として大規模なデータに対してＬＬＥを適用するのは現実的ではない。 The conventional LLE algorithm requires high calculation cost. In the first part, the matrix G _p is calculated and its inverse matrix is calculated. Since the size of the matrix G _p is K×K and its elements are calculated from the inner product of the vectors of length M, the calculation cost for obtaining the matrix G _p for all data points is O(NK ² M). Becomes Further, since it is necessary to calculate the inverse matrix of the matrix G _p , the calculation cost of O(NK ³ ) is required to obtain the edge weight. Further, the calculation cost of O(N ² M) is required to obtain the neighbors for all the nodes. In addition, the second part requires a calculation cost of O(N ³ ). This is because it is necessary to perform the eigenvalue decomposition of the N×N kernel K. Therefore, the calculation cost of the conventional LLE is O(N(K ² M+K ³ )+N ² M+N ³ ). The memory cost is O(N ² ) because the eigenvalue decomposition of the kernel K is performed. As a result, it is not realistic to apply LLE to large-scale data.

［第１の実施形態の構成］
図１を用いて、第１の実施形態の構成について説明する。図１は、第１の実施形態に係るＬＬＥ計算装置の構成の一例を示す図である。図１に示すように、ＬＬＥ計算装置１０は、特異値分解計算部１０１、データポイント選択部１０２、近似距離計算部１０３、距離推定部１０４、近傍確定部１０５、エッジ重み計算部１０６、並び替え部１０７、ＬＵ分解計算部１０８及びベクトル計算部１０９を有する。 [Configuration of First Embodiment]
The configuration of the first embodiment will be described with reference to FIG. FIG. 1 is a diagram illustrating an example of the configuration of the LLE calculation device according to the first embodiment. As shown in FIG. 1, the LLE calculation device 10 includes a singular value decomposition calculation unit 101, a data point selection unit 102, an approximate distance calculation unit 103, a distance estimation unit 104, a neighborhood determination unit 105, an edge weight calculation unit 106, and rearrangement. It has a unit 107, an LU decomposition calculation unit 108, and a vector calculation unit 109.

［第１の実施形態のＬＬＥ］
まず本実施形態におけるＬＬＥの計算方法及び数理的な背景について説明する。その後、ＬＬＥ計算装置１０の各機能部によって行われる処理について説明する。 [LLE of the first embodiment]
First, the calculation method of LLE and the mathematical background in this embodiment will be described. Then, the processing performed by each functional unit of the LLE calculation device 10 will be described.

本実施形態では多次元空間において二つのデータポイントが近くにあればそれらの近傍の大部分は共通するという知見に基づいてＬＬＥを高速に計算する。ここで、式（３）のグラム行列の定義に見られるように、従来のＬＬＥにおいては、データポイントｘ_ｉとｘ_ｊをデータポイントｘ_ｐのＫ近傍とすると、データポイントｘ_ｐのグラム行列の［ｉ，ｊ］はＧ_ｐ［ｉ，ｊ］＝＜ｘ_ｐ−ｘ_ｉ，ｘ_ｐ−ｘ_ｊ＞となる。そのため、ｘ_ｐ＝ｘ_ｑであるようなデータポイントｘ_ｐとｘ_ｑに対して、ｘ_ｐとｘ_ｑがデータポイントｘ_ｉとｘ_ｊを近傍として共有したとしてもそれらのグラム行列の［ｉ，ｊ］成分は同じにならない。 In the present embodiment, if two data points are close to each other in a multidimensional space, the LLE is calculated at high speed based on the knowledge that most of their neighborhoods are common. Here, equation (3) as seen in the definition of Gram matrix of, in the conventional LLE, when the data points x _i and x _j and K near the data points x _p, the data point x _p of the Gram matrix [I,j] becomes G _p [i,j]=<x _p −x _i, x _p −x _j >. _Therefore, x p = _x to the data points _{x p} and _{x q} such that _q, _{x p} and _{x q} are [i their Gram matrix even share data points _{x i} and _{x j} as a neighboring, j] components are not the same.

そこで、本実施形態では、高速に処理を行うため、オリジナルのアルゴリズムとは異なる方法でラグランジュの未定乗数法をデータポイントｘ_ｐに対して適用し、行列Ｃ_ｐを計算する。本実施形態では、行列Ｃ_ｐの［ｉ，ｊ］成分はＣ_ｐ［ｉ，ｊ］＝＜ｘ_ｉ，ｘ_ｊ＞となるため、もし同じデータポイントを近傍として共有していれば、異なるデータポイントに対して行列Ｃ_ｐは同じ成分を保持することとなる。そのため高速に行列Ｃ_ｐを計算することが可能になる。 Therefore, in the present embodiment, in order to perform the processing at high speed, the Lagrange's undetermined multiplier method is applied to the data points x _p by a method different from the original algorithm, and the matrix C _p is calculated. In the present embodiment, since the [i,j] component of the matrix C _p is C _p [i,j]=<x _i, x _j >, different data can be obtained if the same data point is shared as a neighborhood. The matrix C _p will hold the same components for the points. Therefore, the matrix C _p can be calculated at high speed.

さらにこの手法を用いることで逆行列を高速に計算することが可能になる。具体的には、この手法により行列Ｃ_ｐは行列Ｃ_ｑと同じ成分を有するため、Ｗｏｏｄｂｕｒｙの公式を用いることにより、行列Ｃ_ｑ ^−１から漸進的に行列Ｃ_ｐ ^−１を計算することができる。 Furthermore, by using this method, the inverse matrix can be calculated at high speed. Specifically, since the matrix C _p has the same components as the matrix C _q by this method, the matrix C _p ⁻¹ can be gradually calculated from the matrix C _q ⁻¹ by using the Woodbury formula. ..

埋め込みコストを下げるために、本発明では逆ベキ乗法を用いてカーネルＫの固有値ベクトルを高速に計算する（例えば、参考文献１（Brian Bradie, A Friendly Introduction to Numerical Analysis, 2007, Pearson.）を参照）。しかし本実施形態ではナイーブにこの手法は用いない。それはこの手法を用いるのにカーネルＫの逆行列を計算する必要があり、それによりＯ（Ｎ^３）の計算コストが発生するからである。またＯ（Ｎ^２）のメモリコストが発生するという問題もある。先に述べたとおりカーネルＫはＫ＝（Ｉ−Ｗ）^Ｔ（Ｉ−Ｗ）と計算される。ここでＷは近傍のデータポイントからのエッジの重みから計算されるため、カーネルＫは疎なデータ構造となる。しかしカーネルＫが疎であってもその逆行列は密な構造となり、高い計算コストとメモリコストが必要になる。 In order to reduce the embedding cost, the present invention calculates the eigenvalue vector of the kernel K at high speed using the inverse power method (see, for example, Reference 1 (Brian Bradie, A Friendly Introduction to Numerical Analysis, 2007, Pearson.)). ). However, in this embodiment, this method is not used for naiveness. This is because it is necessary to calculate the inverse matrix of the kernel K in order to use this method, which causes a calculation cost of O(N ³ ). There is also a problem that a memory cost of O(N ² ) occurs. As described above, the kernel K is calculated as K=(I−W) ^T (I−W). Here, since W is calculated from edge weights from neighboring data points, the kernel K has a sparse data structure. However, even if the kernel K is sparse, its inverse matrix has a dense structure, which requires high calculation cost and memory cost.

これらの問題を解決するために、本実施形態では行列Ｉ−ＷのＬＵ分解を計算し、カーネルＫの固有ベクトルを計算する。ここでＬＵ分解とは行列を下三角行列と上三角行列に分解する手法である（例えば、参考文献２（William H. Press, Saul A. Teukolsky, William T Vetterling, Brian P. Flannery, Numerical Recipes 3rd Edition, 2007, Cambridge University Press.）を参照）。下三角行列と上三角行列から固有ベクトルを計算できるため、カーネルＫの計算を避けることができる。また下三角行列と上三角行列は疎なデータ構造を持つため、計算コストとメモリコストを抑えることができる。 In order to solve these problems, in this embodiment, the LU decomposition of the matrix I-W is calculated, and the eigenvector of the kernel K is calculated. Here, the LU decomposition is a method of decomposing a matrix into a lower triangular matrix and an upper triangular matrix (for example, reference 2 (William H. Press, Saul A. Teukolsky, William T Vetterling, Brian P. Flannery, Numerical Recipes 3rd. Edition, 2007, Cambridge University Press.)). Since the eigenvector can be calculated from the lower triangular matrix and the upper triangular matrix, the calculation of the kernel K can be avoided. Further, since the lower triangular matrix and the upper triangular matrix have a sparse data structure, the calculation cost and the memory cost can be suppressed.

本実施形態は従来のＬＬＥのアルゴリズムと同じ計算結果となる。これは本実施形態が再構築コストと埋め込みコストを最小化するような計算を行うからである。すなわち本実施形態はオリジナルのアルゴリズムの計算結果を変えることなく、少ない計算コストとメモリコストで次元削減を行うことができる。 The present embodiment has the same calculation result as the conventional LLE algorithm. This is because the present embodiment performs calculations that minimize the reconstruction cost and the embedding cost. That is, this embodiment can reduce the dimension with a small calculation cost and memory cost without changing the calculation result of the original algorithm.

［第１の実施形態のエッジの重みの計算］
次にエッジの重みを高速計算するための手法について述べる。本発明ではラグランジュの未定乗数法を用いてデータポイントｘ_ｐの行列Ｃ_ｐを計算する。ラグランジュの未定乗数法は、複数の変数を持つ関数の一定の制約下における定常状態を求めることができる（例えば、参考文献３（Christopher M. Bishop, Pattern Recognition and Machine Learning, 2010, Springer.）を参照）。ＬＬＥの場合、変数は再構築コストを最小化するエッジの重みであり、制約は各データポイントへのエッジの重み和が１になるというものである。γをラグランジュの未定乗数とすると、各データポイントに対してラグランジュ関数Ｌは以下のようになる。 [Calculation of Edge Weight in First Embodiment]
Next, a method for calculating edge weights at high speed is described. In the present invention, the Lagrange undetermined multiplier method is used to calculate the matrix C _p of the data points x _p . The Lagrangian undetermined multiplier method can obtain a steady state of a function having a plurality of variables under constant constraints (for example, see Reference 3 (Christopher M. Bishop, Pattern Recognition and Machine Learning, 2010, Springer.). reference). In LLE, the variable is the edge weight that minimizes the reconstruction cost, and the constraint is that the sum of the edge weights for each data point is one. Letting γ be a Lagrange undetermined multiplier, the Lagrange function L for each data point is:

よってｘ_ｉ∈Ｎ［ｘ_ｐ］であるようなデータポイントｘ_ｉのエッジの重みＷ［ｐ，ｉ］に対して関数ＬがＷ［ｐ，ｉ］に対して定常状態になる条件は∂Ｌ／∂Ｗ［ｐ，ｉ］＝０となる。同様にγに対して関数Ｌの条件は∂Ｌ／∂γ＝０となる。そのため関数Ｌが定常状態になる条件は式（１２）のようになる。 Therefore, for the edge weight W[p,i] of the data point x _i such that x _i εN[x _p ], the condition for the function L to be in a steady state for W[p,i] is ∂L /∂W[p,i]=0. Similarly, the condition of the function L for γ is ∂L/∂γ=0. Therefore, the condition that the function L is in the steady state is as shown in Expression (12).

式（１２）においてＣ_ｐは（Ｋ＋１）×（Ｋ＋１）の行列であり、ｗとｐは以下で与えられる長さがＫ＋１の列ベクトルである。 In Equation (12), C _p is a (K+1)×(K+1) matrix, and w and p are column vectors of length K+1 given below.

ここでｘ_ｉ，ｘ_ｊ∈Ｎ［ｘ_ｐ］である。式（１３）から行列Ｃ_ｐの要素はＣ_ｐ［ｉ，ｊ］＝＜ｘ_ｉ，ｘ_ｊ＞と計算することができ、行列Ｃ_ｐの要素はｘ_ｐとは独立に計算することができることがわかる。このためもしデータポイントｘ_ｐとｘ_ｑが同じ近傍を共有するとき、データポイントｘ_ｐの行列Ｃ_ｐの要素はデータポイントｘ_ｑの行列Ｃ_ｑを参照することで高速に計算することができる。もしｄがデータポイントｘ_ｐとｘ_ｑで異なる近傍の数とすると（すなわちｄ＝Ｋ−｜Ｎ［ｘ_ｐ］∩Ｎ［ｘ_ｑ］｜のとき）、本実施形態では行列Ｃ_ｐを求めるためにＯ（ｄＫＭ）の計算コストが必要となる。 Here, x _i , x _j εN[x _p ]. The elements of the matrix _{C p} from equation _{(13) C p [i,} j] = <x i, x j> can be calculated with the elements of the matrix _{C p} be can be calculated independently of _{x p} I understand. Therefore, if the data points x _p and x _q share the same neighborhood, the elements of the matrix C _p of the data points x _p can be calculated at high speed by referring to the matrix C _q of the data points x _q . If d is the number of different neighbors at the data points x _p and x _q (that is, d=K−|N[x _p ]∩N[x _q ]|), the matrix C _p is obtained in this embodiment. Requires a calculation cost of O(dKM).

［第１の実施形態の逆行列の計算］
次にＷｏｏｄｂｕｒｙの公式を用いて行列Ｃ_ｐの逆行列Ｃ_ｐ ^−１を高速に求める手法について述べる。式（１４）の通り、ベクトルｗのｉの要素はエッジの重みＷ［ｐ，ｉ］に対応する。そのためＣ_ｐ ^−１が行列Ｃ_ｐの逆行列である式（１２）からエッジの重みをｗ＝Ｃ_ｐ ^−１と計算することができる。しかし直接行列Ｃ_ｐから行列Ｃ_ｐ ^−１を求めるにはＯ（Ｋ^３）の計算コストがかかるため、この手法の計算コストは高くなる。 [Calculation of Inverse Matrix of First Embodiment]
Then we describe a method for obtaining the inverse matrix _C ^{p -1} of the matrix _{C p} at high speed using the formula Woodbury. As shown in Expression (14), the element of i in the vector w corresponds to the edge weight W[p,i]. Therefore, the edge weight can be calculated as w=C _p ⁻¹ from the equation (12) in which C _p ⁻¹ is the inverse matrix of the matrix C _p . But since the directly from the matrix C _p Request matrix C _p ^-1 consuming computational cost O (K ^3), calculate the cost of this approach is high.

Ｗｏｏｄｂｕｒｙの公式を用いてエッジの重みを高速に計算する手法を述べるが、まずｄ＝１である場合、すなわちデータポイントｘ_ｐがデータポイントｘ_ｑに対して一つだけ異なる近傍を持つ場合（Ｋ−｜Ｎ［ｘ_ｐ］∩Ｎ［ｘ_ｑ］｜＝１）について述べる。ｘ_ｋをデータポイントｘ_ｐにおけるただ一つの異なる近傍のデータポイントとし、ｘ_ｋをデータポイントｘ_ｑのｘ_ｋに対する近傍のデータポイントとする。そのためｘ_ｋ＝Ｎ［ｘ_ｐ］−Ｎ［ｘ_ｐ］∩Ｎ［ｘ_ｑ］でありｘ_ｋ＝Ｎ［ｘ_ｑ］−Ｎ［ｘ_ｐ］∩Ｎ［ｘ_ｑ］となる。 A method for calculating edge weights at high speed using the Woodbury formula will be described. First, when d=1, that is, when the data point x _p has only one different neighborhood with respect to the data point x _q (K _{- | N [x p] ∩N} [x q] | = 1) is described. Let x _{k be} the only different neighbor data point at data point x _p , and let x _k be the data point neighbor of data point x _q with respect to x _k . Therefore _{_{x k = N [x p]}} -N [x p] is _{_{_{∩N [x q] x k =}}} N [x q] -N [x p] ∩N [x q] become.

ここで、データポイントに対応する各ベクトルの分散は１であること、すなわち＜ｘ_ｉ，ｘ_ｉ＞＝１であることを仮定する。この仮定は後に外す。行列Ｃ_ｐとＣ_ｑのｋ番目の行及び列にデータポイントｘ_ｋとｘ_ｋ´が対応する逆行列Ｃ_ｐ ^−１を更新するときに、以下の行列ΔＣを用いる。 Here, it is assumed that the variance of each vector corresponding to a data point is 1, that is <x _i, x _i >=1. This assumption will be removed later. The following matrix ΔC is used when updating the inverse matrix C _p ⁻¹ corresponding to the data points x _k and x _{k′ in} the k th row and column of the matrices C _p and C _q .

行列ΔＣに対して図２に示す補題１（補助定理１）が成り立つ。図２は、補題１を示す図である。また、補題１は、行列ΔＣは非対称行列Ｃ_ｐとＣ_ｑから計算されるが対称行列であり、行列ΔＣのｋの行及び列のみが非ゼロ要素を持つことを示している。この補助定理より行列ΔＣについての図３に示す補題２（補助定理２）が成り立つ。図３は、補題２を示す図である。 The lemma 1 (lemma 1) shown in FIG. 2 holds for the matrix ΔC. FIG. 2 is a diagram showing Lemma 1. Further, Lemma 1 shows that the matrix ΔC is a symmetric matrix calculated from the asymmetric matrices C _p and C _q , and only k rows and columns of the matrix ΔC have nonzero elements. From this lemma, Lemma 2 (lemma 2) for the matrix ΔC shown in FIG. 3 is established. FIG. 3 is a diagram showing Lemma 2.

Ｖ^ＴＶ＝Ｉであり、ΔＣｖ_１＝｜｜ｆ｜｜ｖ_１かつΔＣｖ_２＝−｜｜ｆ｜｜ｖ_２であるため、補助定理２は補助定理１とともに、式（１９）は行列ΔＣのランク２の固有値分解に対応していることがわかる。この式を用いることにより行列ＶとＦをＯ（Ｋ）の計算コストで求めることができる。これはベクトルｆの長さがＫ＋１だからである。そのため補助定理２から行列ΔＣの固有値分解をＯ（Ｋ）の計算コストで求めることができる。 Since V ^T V=I and ΔCv ₁ =||f||v ₁ and ΔCv ₂ =−||f||v ₂ , the lemma 2 and the formula (19) are the matrix ΔC. It can be seen that it corresponds to the eigenvalue decomposition of rank 2. By using this formula, the matrices V and F can be obtained at a calculation cost of O(K). This is because the length of the vector f is K+1. Therefore, the eigenvalue decomposition of the matrix ΔC can be obtained from the lemma 2 at a calculation cost of O(K).

Ｗｏｏｄｂｕｒｙの公式を用いることでデータポイントｘ_ｑの逆行列Ｃ_ｑ ^−１からデータポイントｘ_ｐの逆行列Ｃ_ｐ ^−１を漸進的に更新することができる。式（１６）からＣ_ｐ＝Ｃ_ｑ＋ΔＣであるため、補助定理２からＣ_ｐ＝Ｃ_ｑ＋ΔＣとなる。そのためＷｏｏｄｂｕｒｙの公式を用いることで逆行列Ｃ_ｐ ^−１を以下のように計算できる。 It can be progressively updated inverse matrix _C ^{p -1} data points _{x p} from the inverse matrix _C ^{q -1} of the data points _{x q} by using the official Woodbury. Since a _{_C} p = _C _q + _ΔC from equation (16), consisting of Lemma 2 and _{_C} p = _C _q + _ΔC. Therefore, the inverse matrix C _p ⁻¹ can be calculated as follows by using the Woodbury formula.

式（２０）においてＦ^−１とＶ^ＴＣ_ｑ ^−１Ｖは２×２の行列であるため、逆行列（Ｆ^−１＋Ｖ^ＴＣ_ｑ ^−１Ｖ）^−１を求める計算コストは一定になる。また行列Ｃ_ｑ ^−１とＶのサイズはそれぞれ（Ｋ＋１）×（Ｋ＋１）と（Ｋ＋１）×２になる。そのため式（２０）からＣ_ｐ ^−１を求めるにはＯ（Ｋ^２）の計算コストが必要となる。 In Formula (20), since F ⁻¹ and V ^T C _q ⁻¹ V are 2×2 matrices, the calculation cost for _obtaining the inverse matrix (F ⁻¹ +V ^T C _q ⁻¹ V) ⁻¹ is constant. .. The sizes of the matrices C _q ⁻¹ and V are (K+1)×(K+1) and (K+1)×2, respectively. Therefore, the calculation cost of O(K ² ) is required to obtain C _p ⁻¹ from the equation (20).

式（２０）から逆行列Ｃ_ｐ ^−１を高速に求めることができるが、この式は行列Ｃ_ｐにおいて＜ｘ_ｉ，ｘ_ｉ＞＝１が成り立つことを仮定している。ここではこの仮定を用いてまず行列Ｃ_ｐ ^−１を更新する。そしてＣ´_ｐを行列Ｃ_ｐに対応する行列としたとき、行列Ｃ´_ｐのために（Ｋ＋１）×（Ｋ＋１）の行列ΔＣを式（２１）のように計算する。 Although the inverse matrix C _p ^-1 can be obtained at high speed from the equation (20), this equation assumes that <x _i, x _i >=1 holds in the matrix C _p . Here, the matrix C _p ^-1 is updated using this assumption. And when the corresponding matrix _C'p matrix _{C p,} the matrix ΔC of (K + 1) × (K + 1) for the matrix _C'p is calculated as Equation (21).

行列ΔＣ´から行列Ｃ´_ｐとＣ_ｐの違いは［ｋ，ｋ］要素に限られることがわかり、また行列ΔＣ´は＜ｘ_ｉ，ｘ_ｉ＞＝１ｉｎｍａｔｒｉｘＣ_ｐという仮定を落とすことに対応することがわかる。そのため、もしその仮定を落とせばＣ_ｐ＝Ｃ_ｑ＋ΔＣ＋ΔＣ´が成り立つことがわかる。もしｅ_ｋをｋ番目の要素以外は０である単位ベクトルだとすると、行列ΔＣ´を式（２２）のように求めることができる。 It can be seen from the matrix ΔC′ that the differences between the matrices C′ _p and C _p are limited to [k, k] elements, and the matrix ΔC′ drops the assumption that <x _i , x _i >=1 in matrix C _p. It turns out that it corresponds to. Therefore, if the assumption is dropped, it can be seen that C _p =C _q +ΔC+ΔC′ holds. If e _k is a unit vector that is 0 except for the k-th element, the matrix ΔC′ can be obtained as in Expression (22).

そのためＷｏｏｄｂｕｒｙの公式を用いることで式（２０）の後で行列Ｃ´_ｐの逆行列（Ｃ´_ｐ）^−１を式（２３）のように計算することができる。 Therefore the inverse matrix of the matrix _C'p after formula (20) by using the official Woodbury the ^{_(C'p) -1} can be computed as in Equation (23).

式（２０）と同様、式（２３）から逆行列をＯ（Ｋ^２）の計算コストで高速に計算することができる。式（２０）と（２３）において、ｄ＝１が成り立つこと、つまりデータポイントｘ_ｑに対してデータポイントｘ_ｐは一つだけ異なる近傍のデータポイントを有することを仮定した。しかしｄ＞１の場合であっても、繰り返し上記処理をｘ_ｉ∈｛ｘ_ｊ｜ｘ_ｊ∈Ｎ［ｘ_ｐ］ａｎｄｘ_ｊ／∈Ｎ［ｘ_ｑ］｝であるようなデータポイント一つ一つに繰り返し適用することによって逆行列を高速に求めることができる。なお、ここでは、ｘ／∈Ｎは、ｘが集合Ｎに含まれないことを示すものとする。 Similar to Expression (20), the inverse matrix can be calculated at high speed from Expression (23) at a calculation cost of O(K ² ). In equations (20) and (23), it was assumed that d=1 holds, that is, for data point x _q , data point x _p has one different neighboring data point. However, even if d>1, iteratively repeats the above process for each data point such that x _i ε{x _j |x _j εN[x _p ] and x x _j /εN[x _q ]}. The inverse matrix can be obtained at high speed by repeatedly applying the two. Note that, here, x/εN indicates that x is not included in the set N.

［第１の実施形態の近傍のデータポイントの計算］
次に各データポイントに対して近傍のデータポイントを高速に計算する手法について述べる。本発明ではＳＶＤを用いて近似の距離を計算し、近傍を計算する。ＳＶＤは距離の下限値を求めることができるため、正確に近傍を求めることができる。もし^〜ｘ_ｐ＝（^〜ｘ_ｐ［１］，^〜ｘ_ｐ［２］，．．．，^〜ｘ_ｐ［ｓ］）がデータポイントｘ_ｐ＝（ｘ_ｐ［１］，ｘ_ｐ［２］，．．．，ｘ_ｐ［Ｍ］）に対するランクがｓのＳＶＤによる次元削減とし、Ｅ［ｘ_ｐ，ｘ_ｑ］をデータポイントｘ_ｐとｘ_ｑのユークリッド距離とすると、ＳＶＤが距離の下限値を与えるとはＥ［ｘ_ｐ，ｘ_ｑ］≧Ｅ［^〜ｘ_ｐ，^〜ｘ_ｑ］が成り立つと言うことである。 [Calculation of data points in the vicinity of the first embodiment]
Next, we describe a method to calculate the neighboring data points at high speed for each data point. In the present invention, the approximate distance is calculated using SVD, and the neighborhood is calculated. Since the SVD can find the lower limit value of the distance, it can accurately find the neighborhood. If ^~ _xp =( ^~ _xp [1], ^~ _xp [2], ..., ^~ _xp [s]) is a data point _xp =( _xp [1], _xp [2], ..., x _p [M]) is the dimension reduction by SVD of rank s, and E[x _p , x _q ] is the Euclidean distance between data points x _p and x _q , then SVD defines the lower bound of the distance. Giving means that E[ _xp , _xq ] ≥ E[ ^~ _xp , ^~ _xq ] holds.

近傍を求めるのに既存の研究ではまず近似の距離を計算してから、もしデータポイントが近似の距離により近傍になる可能性があると判定されたらユークリッド距離を計算する（例えば、参考文献４（Dennis Shasha, High Performance Discovery in Time Series: Techniques and Case Studies, 2004, Springer-Verlag New York Inc..）を参照）。なお各データポイントのＳＶＤは既存の手法を用いてＯ（ＮＭｌｏｇｓ）の計算コストで求めることができる（例えば、参考文献５（Nathan Halko, Per-Gunnar Martinsson, Joel A. Tropp, Finding Structure with Randomness:Probabilistic Algorithms for Constructing Approｘimate Matriｘ Decompositions, 2011, SIAM Review.）を参照）。 In the existing research to calculate the neighborhood, first, the approximate distance is calculated, and then, if it is determined that the data point may be the neighborhood due to the approximate distance, the Euclidean distance is calculated (for example, reference 4 ( Dennis Shasha, High Performance Discovery in Time Series: Techniques and Case Studies, 2004, Springer-Verlag New York Inc..)). Note that the SVD of each data point can be obtained at a calculation cost of O(NMlogs) using an existing method (for example, Reference 5 (Nathan Halko, Per-Gunnar Martinsson, Joel A. Tropp, Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions, 2011, SIAM Review.)).

近似の距離の精度を向上するために、ユークリッド距離を近傍グラフの構造を用いて推定する。データポイントｘ_ｑはデータポイントｘ_ｒの近傍とする。このときＳＶＤを用いることによってＥ［^〜ｘ_ｑ， ^〜ｘ_ｒ］とＥ［ｘ_ｑ，ｘ_ｒ］が成り立つ。またもしデータポイントｘ_ｐの近傍を求める過程でデータポイントｘ_ｒのユークリッド距離を計算すれば、Ｅ［^〜ｘ_ｐ，^〜ｘ_ｒ］ａｎｄＥ［ｘ_ｐ，ｘ_ｒ］が成り立つ。Ｅ＿［ｘ_ｐ，ｘ_ｑ］をデータポイントｘ_ｐからｘ_ｑの推定距離とすると、本実施形態では推定距離を式（２４）のように計算する。ここで・＿は・の直下に＿が記された記号を示すものとする。 In order to improve the accuracy of the approximate distance, the Euclidean distance is estimated using the structure of the neighborhood graph. The data point x _q is near the data point x _r . In this case ^{_{^{_{E [~ x q, ~ x}}}} r] by using SVD and _{E _[x} q, x _r] holds. Further, if if calculating the Euclidean distance of the data point _{x p} data points _{x r} in the process of obtaining the vicinity ^{_{^{_{of, E [~ x p, ~}}}} x r] and E [x p, x r] holds. Assuming that E_[x _p , x _q ] is the estimated distance from the data point x _p to x _q , the estimated distance is calculated as in Expression (24) in this embodiment. Here, ·_ indicates a symbol with _ written directly under ·.

ここでｕ_ｒ［ｘ_ｐ］はｕ_ｒ［ｘ_ｐ］＝（^〜ｘ_ｐ［ｓ＋１］−^〜ｘ_ｒ［ｓ＋１］，^〜ｘ_ｐ［ｓ＋２］−^〜ｘｒ［ｓ＋２］，．．．，^〜ｘ_ｐ［Ｍ］−^〜ｘ_ｒ［Ｍ］）であるような長さがＭ−ｓのベクトルである。推定距離について図４に示す補題３（補助定理３）が成り立つ。図４は、補題３を示す図である。 Here _u r _{[x p]} is _{_{^{_{u r [x p] = (}}}} ~ x p [s + 1] - ~ x r [s + 1], ~ x p [s + 2] - ~ xr [s + 2], ..., ~ x a vector of length Ms such that _p [M]- ^to _xr [M]). Lemma 3 (Lemma 3) shown in FIG. 4 holds for the estimated distance. FIG. 4 is a diagram showing Lemma 3.

補助定理３は推定距離Ｅ［ｘ_ｐ，ｘ_ｑ］のＥ［ｘ_ｐ，ｘ_ｑ］に対する誤差はＥ［^〜ｘ_ｐ， ^〜ｘ_ｑ］より小さいため、Ｅ［ｘ_ｐ，ｘ_ｑ］も用いることでＥ［^〜ｘ_ｐ， ^〜ｘ_ｑ］の近似の精度を向上できることを示している。また補助定理３のＥ［ｘ_ｐ，ｘ_ｑ］≧Ｅ［ｘ_ｐ，ｘ_ｑ］は下限値の性質を満たす、すなわち正確に近傍を求めることができることを示している。 Used for Lemma 3 error for _{E _[x} p, x _q] of the estimated distance _{E _[x} p, x _q] is less than ^{_{^{_{E [~ x p, ~ x}}}} q], E [x p, x q] also shows that can improve the accuracy of the approximation of ^{_{^{_{E [~ x p, ~ x}}}} q] by. Further, E[x _p, x _q ]≧E[x _p, x _q ] of Lemma 3 shows that the property of the lower limit value is satisfied, that is, the neighborhood can be accurately obtained.

この手法において式（２４）で用いられるベクトルｕ_ｒ［ｘ_ｐ］とｕ_ｒ［ｘ_ｑ］のＬ_２ノルムを式（２５）及び式（２６）のように計算する。 In this method, the L ₂ norm of the vectors u _r [x _p ] and u _r [x _q ] used in equation (24) is calculated as in equations (25) and (26).

ここでもしデータポイントｘ_ｐの近傍としてｘ_ｒをチェックするするためにデータポイントｘ_ｐとｘ_ｒのユークリッド距離を計算し、データポイントｘ_ｑがｘ_ｒの近傍であれば、式（２５）と（２６）を用いることによりＥ［ｘ_ｐ，ｘ_ｑ］を求めるための計算コストはＯ（１）となる。これはこの場合、Ｅ［ｘ_ｐ，ｘｒ］とＥ［^〜ｘ_ｐ，^〜ｘ_ｒ］とＥ［ｘ_ｑ，ｘ_ｒ］とＥ［^〜ｘ_ｑ，^〜ｘ_ｒ］をＯ（１）の計算コストで求めることができるからである。 Here if calculates the Euclidean distance of the data point x _p and x _r to check the x _r as the vicinity of the data point x _p, if the data point x _q is in the vicinity of x _r, in the formula (25) By using (26), the calculation cost for _obtaining E[x _p, x _q ] is O(1). In this case this, _{E [x} p, xr] and ^{_{^{_{E [~ x p, ~ x}}}} r] and _{E _[x} q, x _r] and ^{_{^{_{E [~ x q, ~ x}}}} r] the calculation of O (1) This is because it can be calculated at cost.

次に固有ベクトルを高速に計算する方法について述べる。従来のＬＬＥのアルゴリズムはデータセットＸの低次元への埋め込みＹを埋め込みコストへの最適解により計算する。エッジの重み行列Ｗに対してＬＬＥカーネルはＫ＝（Ｉ−Ｗ）^Ｔ（Ｉ−Ｗ）と計算されるが、この最適解はカーネルＫの底の固有ベクトルに対応する。そのためもしλ_ｉが｜λ_１｜≧｜λ_２｜≧．．．≧｜λ_Ｎ｜であるようなカーネルＫのｉ番目の固有値であり、またｚ_ｉをそれに対応する固有ベクトルとすると、データセットＸに対するｍ次元の埋め込みＹはｍ＋１個の底の固有ベクトルからＹ＝［ｚ_Ｎ−ｍ，ｚ_{Ｎ−ｍ＋１}，．．．，ｚ_Ｎ−１］として計算することができる。 Next, a method for calculating the eigenvector at high speed will be described. The conventional LLE algorithm calculates the embedding Y in the low dimension of the data set X by the optimal solution to the embedding cost. For the edge weight matrix W, the LLE kernel is calculated as K=(I−W) ^T (I−W), but this optimal solution corresponds to the bottom eigenvector of the kernel K. Therefore, if λ _i is |λ ₁ |≧|λ ₂ |≧. ．． If the i-th eigenvalue of the kernel K such that ≧|λ _N |, and z _i is the corresponding eigenvector, then the m-dimensional embedding Y for the dataset X is Y=[ from the m+1 bottom eigenvectors. z _N-m, z _N-m+1 ,. ．． , Z _N−1 ].

高速に固有ベクトルを計算するために、本実施形態では逆ベキ乗法を用いる。逆ベキ乗法は最も底の固有ベクトルを、固有値を計算する行列の逆行列から計算する方法で、具体的には逆行列Ｋ−１から固有ベクトルｚ_Ｎを計算することができる。そのためａ_０を長さがＮの任意の列ベクトルとすると、底の固有値λ_Ｎａとその固有ベクトルｚ_Ｎはベキ乗法をａ_τ＝Ｋ^−１ａ_τ−１と適用して計算することができる。しかし逆行列Ｋ−１は密な構造となるため、この手法に必要な計算コストはＯ（Ｎ^３）となりメモリコストはＯ（Ｎ^２）となる。 In order to calculate the eigenvector at high speed, the inverse power method is used in this embodiment. The inverse power method is a method of calculating the bottom eigenvector from the inverse matrix of the matrix for calculating the eigenvalue, and specifically, the eigenvector z _N can be calculated from the inverse matrix K−1. Therefore, if a ₀ is an arbitrary column vector of length N, the base eigenvalue λ _Na and its eigenvector z _N can be calculated by applying the power method to a _τ =K ⁻¹ a _τ ⁻¹ . However, since the inverse matrix K-1 has a dense structure, the calculation cost required for this method is O(N ³ ) and the memory cost is O(N ² ).

直接カーネルＫの逆行列を避けるために、本発明ではＩ−ＷのＬＵ分解を計算する。具体的にはＬとＵを下三角行列及び上三角行列としたとき、行列ＬとＵをＬＵ＝Ｉ−Ｗとして計算する。その結果式（２７）のようになる。 In order to avoid the inverse matrix of the direct kernel K, we compute the LU decomposition of I-W. Specifically, when L and U are a lower triangular matrix and an upper triangular matrix, the matrix L and U are calculated as LU=I-W. As a result, equation (27) is obtained.

本実施形態では行列ＬとＵにベキ乗法を用いてａ_τ−１からａ_τを式（２８）のように計算する。 In the present embodiment, the powers of the matrices L and U are used to calculate a _τ-1 to a _τ as in Expression (28).

式（２８）においてｂとｂ´とｂ´´とａ_τを計算するために下三角行列と上三角行列に前進代入と後進代入を適用する（例えば、参考文献２を参照）。式（２８）は下三角行列と上三角行列からベクトルａ_τをベクトルａ_τ−１からａ_τ−１＝Ｕ^ＴＬ^ＴＬＵ_ａτという形で求められることを示している。式（２７）からＫ＝Ｕ^ＴＬ^ＴＬＵが成り立つため、カーネルＫの逆行列を直接用いることなくａ_τ＝Ｋ^−１ａ_τ−１を計算することができる。式（２８）から式（２９）のようにベクトルａ_τが収束するまでベキ乗法を適用することにより、底の固有値λ_Ｎと固有ベクトルｚ_Ｎを計算することができる。 In equation (28), forward substitution and backward substitution are applied to the lower triangular matrix and the upper triangular matrix in order to calculate b, b′, b″, and a _τ (for example, see Reference 2). Equation (28) shows that obtained in the form of _{^{^{a τ-1 = U T L}}} T LU aτ vector a _tau from the vector a _tau-1 from the lower and upper triangular matrices. Since the equation ^{^{(27) K = U T L}} T LU holds, it is possible to calculate the _{^{_{a τ = K -1 a τ-}}} 1 without using the inverse matrix of kernel K directly. The base eigenvalue λ _N and the eigenvector z _N can be calculated by applying the power method until the vector a _τ converges as in Expressions (28) to (29).

カーネルＫの下三角行列と上三角行列は疎な構造を持つため、本発明は高速にベクトルａ_τを式（２８）から計算することができる。次にその他の底の固有値λ_Ｎ−ｉと固有ベクトルｚ_Ｎ−ｉ（ｉ＝１，２，．．．，ｍ）を計算する方法を述べる。このような固有値と固有ベクトルを求める方法としてホテリング法が有名である（例えば、参考文献１を参照）。ホテリング法の基本的アイデアは固有値を計算する行列を最大固有値が０としその他の固有値が変わらないようにシフトするというものである。その結果２番目に大きな固有値がシフトした行列において最大の固有値になる。具体的にはベキ乗法を用いて式（３０）の行列Ｈ_ｉの固有値と固有ベクトルを計算することでλ_Ｎ−ｉとｚ_Ｎ−ｉを計算する。 Since the lower triangular matrix and the upper triangular matrix of the kernel K have a sparse structure, the present invention can quickly calculate the vector a _τ from the equation (28). Next, a method for calculating the other base eigenvalues λ _N-i and eigenvectors z _N-i (i=1, 2,..., _M ) will be described. The Hotelling method is famous as a method for obtaining such an eigenvalue and an eigenvector (for example, see Reference 1). The basic idea of the Hotelling method is to shift the matrix for calculating the eigenvalues so that the maximum eigenvalue is 0 and other eigenvalues remain unchanged. As a result, the second largest eigenvalue becomes the largest eigenvalue in the shifted matrix. Specifically, λ _N-i and z _N-i are calculated by calculating the eigenvalues and eigenvectors of the matrix H _i of Expression (30) using the power method.

ここでｈ_ｉ，τを長さがＮの列ベクトルとしたとき、固有値λ_Ｎ−ｉと固有ベクトルｚ_Ｎ−ｉはｈ_ｉ，τ＝Ｈ_ｉｈ_{ｉ，τ−１}とベキ乗法を適することにより求めることができる。しかしこの方法は行列Ｈ_ｉが密な構造を持つため、Ｏ（Ｎ^３）の計算コストとＯ（Ｎ^２）のメモリコストが必要になるという問題がある。 Here, when h _i,τ is a column vector of length N, the eigenvalue λ _N-i and the eigenvector z _N-i are hi _i,τ =H _i hi _i,τ-1 You can ask. However, this method has a problem that a matrix for H _i has a dense structure, memory cost of O computational cost (N ³⁾ and O (N ²⁾ is required.

行列Ｈ_ｉを直接計算することを避けるために、本発明ではベクトルｈ_ｉ，τをベクトルｈ_{ｉ，τ−１}から式（３１）のように計算する。 In order to avoid directly calculating the matrix H _i , in the present invention, the vector h _i,τ is calculated from the vector h _i,τ−1 as shown in Expression (31).

ここで式（２８）と同様に以下のようにベクトルａ_τは前進代入と後進代入を用いてベクトルｈ_{ｉ，τ−１}から式（３２）のように計算する。 Here, similarly to the equation (28), the vector a _τ is calculated from the vector h _{i, τ-1} as shown in the equation (32) using forward substitution and backward substitution as follows.

式（３１）において長さがＮのベクトルｚ_Ｎ−ｊｚ_Ｎ−ｊ ^Ｔｈ_{ｉ，τ−１}／λ_Ｎ−ｊはベクトルを右から左へ計算することでＯ（Ｎ）の計算コストで求める。Ｋ^−１＝（Ｕ^ＴＬ^ＴＬＵ）^−１となるため、式（３０）、（３１）、（３２）から式（３３）が成り立つ。 In equation (31), the vector of length N z _N−j z _N−j ^T hi _{, τ−1} /λ _N−j is calculated from the right to the left at the calculation cost of O(N). Ask. Since K ⁻¹ =(U ^T L ^T LU) ⁻¹ , the formula (33) is established from the formulas (30), (31), and (32).

式（３３）は式（３１）と（３２）からベクトルｈ_ｉ，τをホテリング法の行列Ｈ_ｉを用いて計算できることを示している。本実施形態ではカーネルＫの固有値λ_Ｎ−ｉと固有ベクトルｚ_Ｎ−ｉ（１≦ｉ≦ｍ）を式（３４）に基づきベキ乗法によりベクトルｈ_ｉ，τが収束するまで計算することにより求める。 Expression (33) shows that the vector h _i,τ can be calculated from the expressions (31) and (32) by using the matrix H _{i of the} Hotelling method. In the present embodiment, the eigenvalue λ _N−i of the kernel K and the eigenvector z _N−i (1≦i≦m) are calculated by the power method based on the equation (34) until the vector h _i,τ converges.

直接行列Ｈ_ｉを計算しないため、本発明は高速に固有値と固有ベクトルを計算することができる。 Since the matrix H _i is not calculated directly, the present invention can calculate eigenvalues and eigenvectors at high speed.

次に逆ベキ乗法において用いる行列Ｉ−ＷのＬＵ分解を高速に計算する方法について述べる。行列ＬとＵの要素はもし行列Ｗ´がＷ´＝Ｉ−Ｗと与えられるときクラウトのアルゴリズム（例えば、参考文献２を参照）により式（３５）及び式（３６）のように計算できる。 Next, a method for calculating the LU decomposition of the matrix I-W used in the inverse power method at high speed will be described. The elements of the matrices L and U can be calculated as in equations (35) and (36) by the Kraut's algorithm (see eg reference 2) if the matrix W′ is given as W′=I−W.

式（３５）と（３６）は行列ＬとＵの列は左から右へ、また各列において要素は上から下へ計算できることがわかる。そのため行列ＬとＵの要素は対応する行列Ｗ´、Ｌ、Ｕの上と左の要素から計算することができる。 It can be seen that in equations (35) and (36), the columns of the matrices L and U can be calculated from left to right, and the elements in each column from top to bottom. Therefore, the elements of the matrices L and U can be calculated from the elements above and to the left of the corresponding matrices W', L, U.

本実施形態では行列ＬとＵの非零要素の数を減らしてＬＵ分解を高速に計算する。これはもしＬ［ｉ，ｋ］＝０又はＵ［ｋ，ｊ］＝０となれば式（３５）と（３６）から効果的にＬ［ｉ，ｋ］Ｕ［ｋ，ｊ］の計算を省くことができるからである。本実施形態では行列ＬとＵの上と左の要素はもし対応する行列Ｗ´の要素が零であれば零になることと、行列Ｗ´の上と左の要素はＷ´＝Ｉ−Ｗであるためもし対応する行列Ｗの左と上の要素が零になれば零になることを利用する。行列Ｗはエッジの重みから計算されるため、行列Ｗのデータポイントを並び変えることで上と左の要素が疎になればＬＵ分解を高速に行うことができる。 In this embodiment, the number of non-zero elements in the matrices L and U is reduced to calculate the LU decomposition at high speed. This means that if L[i,k]=0 or U[k,j]=0, then the calculation of L[i,k]U[k,j] is effectively performed from equations (35) and (36). This is because it can be omitted. In this embodiment, the upper and left elements of the matrices L and U become zero if the corresponding elements of the matrix W′ are zero, and the upper and left elements of the matrix W′ are W′=I−W. Therefore, if the left and upper elements of the corresponding matrix W become zero, the fact that they become zero is used. Since the matrix W is calculated from edge weights, LU decomposition can be performed at high speed if the upper and left elements become sparse by rearranging the data points of the matrix W.

本実施形態ではデータポイントを次数の昇順で並び変えることでＬＵ分解を高速に行う。ここで次数とはデータポイントから出ているエッジの本数である。並び変えることで行列Ｗの上と左の要素を疎にできるため、行列Ｉ−ＷのＬＵ分解を高速行うことができる。データポイントを並び変えるのには分布数え上げソートを用いることでＯ（Ｎ）の計算コストで並び替えを行うことができる（例えば、参考文献６（Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Clifford Stein, Introduction to Algorithms, 2009, The MIT Press.）を参照）。ｎを三角行列における平均的な各行ベクトルの非零要素の数とすると、行列ＬとＵを保持するメモリコストはＯ（Ｎｎ）となる。また式（３５）と（３６）から行列Ｉ−ＷのＬＵ分解はＯ（Ｎｎ^２）の計算コストで行うことができる。 In this embodiment, the LU decomposition is performed at high speed by rearranging the data points in ascending order of the order. Here, the order is the number of edges generated from the data point. By rearranging the elements, the elements above and to the left of the matrix W can be made sparse, so that the LU decomposition of the matrix IW can be performed at high speed. Sorting the data points can be done at a computational cost of O(N) by using a distributed counting sort (see eg Reference 6 (Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, Clifford Stein, Introduction to Algorithms, 2009, The MIT Press.)). If n is the number of non-zero elements of each average row vector in the triangular matrix, the memory cost for holding the matrices L and U is O(Nn). Further, from the equations (35) and (36), the LU decomposition of the matrix I-W can be performed at a calculation cost of O(Nn ² ).

［第１の実施形態のアルゴリズム］
本実施形態のアルゴリズムを、ＬＬＥ計算装置１０の各機能部の処理とともに、図５を用いて説明する。図５は、第１の実施形態に係るＬＬＥ計算処理のアルゴリズムの一例を示す図である。 [Algorithm of First Embodiment]
The algorithm of this embodiment will be described together with the processing of each functional unit of the LLE calculation device 10 with reference to FIG. FIG. 5 is a diagram illustrating an example of an algorithm of the LLE calculation process according to the first embodiment.

アルゴリズム１５において、Ｓはエッジの重みを計算するために選択したデータポイントの集合であり、Ｄは選択されたデータポイントの近傍のデータポイントの集合であり、Ｃ［ｘ_ｐ］は近傍グラフにおいてデータポイントｘ_ｐに直接つながっているデータポイントの集合である。アルゴリズム１５において近傍を高速に計算するためにＳＶＤを計算するが、もしｓをｓ＝２^ｌであるようなＳＶＤのランクとしたとき、本実施形態ではランクを２^１，２^２，．．．，２^ｌと徐々に増やしていきながら近傍を求めていく。計算コストの観点では粗い近似であれば高速に近似距離を計算することができるが近似の精度は高くない。一方、細かい精度であれば高い精度で近似距離を計算できるが、高い計算コストがかかってしまう。そのため、本実施形態では近傍にならないデータポイントは粗い近似で枝刈りし、近傍になる可能性が高いデータポイントは細かい近似で近似距離を計算する。結果として本発明は各データポイントの距離に応じて適切に近似の精度を設定して近傍を求めることができる。 In Algorithm 15, S is the set of data points selected to calculate the edge weights, D is the set of data points in the neighborhood of the selected data points, and C[x _p ] is the data in the neighborhood graph. A set of data points directly connected to the point x _p . In the algorithm 15, the SVD is calculated in order to calculate the neighborhood at a high speed. However, if s is a rank of the SVD such that s=2 ^l , in the present embodiment, the ranks are 2 ¹ , 2 ² ,. ．． , 2 ^l, and gradually increase to find the neighborhood. From the viewpoint of calculation cost, if the approximation is rough, the approximate distance can be calculated at high speed, but the accuracy of the approximation is not high. On the other hand, if the accuracy is fine, the approximate distance can be calculated with high accuracy, but the calculation cost will be high. Therefore, in the present embodiment, data points that are not close to each other are pruned by coarse approximation, and data points that are likely to be close to each other are calculated by fine approximation to calculate an approximate distance. As a result, the present invention can determine the neighborhood by appropriately setting the approximation accuracy according to the distance of each data point.

アルゴリズム１５に示すように、まず、行列Ｘ、低次元数ｍ、ＳＶＤのランクｓ（＝２^ｌ）が入力される。そして、特異値分解計算部１０１は、複数のデータポイントを有する多次元行列の特異値分解（ＳＶＤ）を計算する。具体的には、特異値分解計算部１０１はＳを初期化し、行列ＸのＳＶＤを計算する（１−２行目）。 As shown in Algorithm 15, first, the matrix X, the low-dimensional number m, and the rank s (=2 ^l ) of the SVD are input. Then, the singular value decomposition calculation unit 101 calculates the singular value decomposition (SVD) of the multidimensional matrix having a plurality of data points. Specifically, the singular value decomposition calculation unit 101 initializes S and calculates the SVD of the matrix X (1-2nd line).

データポイント選択部１０２は、複数のデータポイントから計算対象のデータポイントを選択する。具体的には、データポイント選択部１０２は、他のデータポイントへ多くのエッジを持つデータポイントを選択する（４行目）。これは本実施形態では近傍グラフを用いて他のデータポイントへの距離を推定するからである。そして、データポイント選択部１０２は、Ｄとθを初期化し（５−６行目）、Ｋ個の距離が∞になるデータポイントを追加することでＮ［ｘ_ｐ］を初期化する（７行目）。 The data point selection unit 102 selects a data point to be calculated from a plurality of data points. Specifically, the data point selection unit 102 selects a data point having many edges to other data points (fourth row). This is because in the present embodiment, the distance to other data points is estimated using the neighborhood graph. Then, the data point selection unit 102 initializes D and θ (5th to 6th lines), and initializes N[x _p ] by adding K data points at which the distance becomes ∞ (7th line). Eye).

近似距離計算部１０３は、複数のデータポイントのそれぞれについて、計算対象のデータポイントとの間の近似の距離を、特異値分解に基づいて計算する。このとき、近似距離計算部１０３は、近傍を計算するためにランクを２^１から２^ｌに徐々に増やしていくが、もし近似の距離がθより大きくなればデータポイントを枝刈りする（９−１３行目）。 The approximate distance calculation unit 103 calculates the approximate distance between each of the plurality of data points and the data point to be calculated based on the singular value decomposition. At this time, the approximate distance calculation unit 103 gradually increases the rank from 2 ¹ to 2 ¹ in order to calculate the neighborhood, but if the approximate distance becomes larger than θ, the data point is pruned (9- (Line 13).

また、距離推定部１０４は、近似の距離が所定値以下であるデータポイントのそれぞれと、計算対象のデータポイントとの間のユークリッド距離の推定値を計算する。つまり、距離推定部１０４は、ＳＶＤを用いてもデータポイントを枝刈りできなかった場合はグラフ構造を用いて距離を推定する（１４−１８行目）。 The distance estimation unit 104 also calculates an estimated value of the Euclidean distance between each data point whose approximate distance is equal to or less than a predetermined value and the data point to be calculated. That is, the distance estimation unit 104 estimates the distance using the graph structure when the data point cannot be pruned using the SVD (14th to 18th lines).

近傍確定部１０５は、ユークリッド距離の推定値が所定値以下であるデータポイントのそれぞれと、計算対象のデータポイントとの間のユークリッド距離を計算し、当該計算したユークリッド距離が所定値以下であるデータポイントを、計算対象のデータポイントの近傍のデータポイントに確定する。近傍確定部１０５は、データポイントを、選択されたデータポイントの近傍に確定した場合は、Ｎ［ｘ_ｐ］をそのデータポイントの距離を用いて更新する（１９−２５行目）。 The neighborhood determining unit 105 calculates the Euclidean distance between each data point whose estimated value of the Euclidean distance is less than or equal to a predetermined value and the data point to be calculated, and the calculated Euclidean distance is less than or equal to the predetermined value. Establish the point to a data point near the data point to be calculated. When the proximity determining unit 105 determines the data point to be in the vicinity of the selected data point, the proximity determining unit 105 updates N[x _p ] using the distance of the data point (lines 19-25).

そして、近傍確定部１０５は、最も多くの近傍を持つデータポイントを見つける（２６行目）。これはエッジ重み計算部１０６が、エッジの重みを他のデータポイントの近傍を用いて計算するためである。 Then, the neighborhood determination unit 105 finds a data point having the largest number of neighborhoods (26th line). This is because the edge weight calculation unit 106 calculates the edge weight using the neighborhood of another data point.

エッジ重み計算部１０６は、計算対象のデータポイントと計算対象のデータポイントの近傍のデータポイントのそれぞれとの間のエッジの重みを計算する。エッジ重み計算部１０６は、もし共有する近傍がある場合はＷｏｏｄｂｕｒｙの公式を用いてエッジの重みを計算し（２７−２８行目）、そうでない場合はグラム行列を用いてエッジの重みを計算する（２９−３０行目）。このように、エッジ重み計算部１０６は、Ｗｏｏｄｂｕｒｙの公式を用いて、計算済みデータポイントの近傍データポイントに関する逆行列から、計算対象のデータポイントの近傍のデータポイントに関する逆行列を計算することができる。 The edge weight calculation unit 106 calculates edge weights between a data point to be calculated and each data point near the data point to be calculated. The edge weight calculation unit 106 calculates the edge weight using the Woodbury formula when there is a shared neighborhood (lines 27-28), and otherwise uses the Gram matrix to calculate the edge weight. (Lines 29-30). In this way, the edge weight calculation unit 106 can use the Woodbury formula to calculate the inverse matrix of the data points in the vicinity of the data point to be calculated from the inverse matrix of the data points in the vicinity of the calculated data point. ..

つまり、エッジ重み計算部１０６は、エッジの重みが計算済みである計算済みデータポイントの近傍データポイントに、計算対象のデータポイントの近傍のデータポイントのうちの少なくとも一部が含まれる場合、計算済みデータポイントについてのエッジの重みの計算結果に基づいて、計算対象のデータポイントと計算対象のデータポイントの近傍のデータポイントのそれぞれとの間のエッジの重みを計算することができる。 In other words, the edge weight calculation unit 106 calculates if the data points in the vicinity of the calculated data points for which the edge weights have been calculated include at least some of the data points in the vicinity of the data point to be calculated. Based on the calculation result of the edge weight for the data point, the edge weight between the data point to be calculated and each data point in the vicinity of the data point to be calculated can be calculated.

並び替え部１０７は、複数のデータポイントについてのエッジの重みを表す行列を、次数の昇順に並び替える（３２−３３行目）。また、ＬＵ分解計算部１０８は、複数のデータポイントの隣接行列のＬＵ分解を計算する。 The rearrangement unit 107 rearranges the matrix representing the edge weights of a plurality of data points in ascending order of the order (lines 32-33). The LU decomposition calculation unit 108 also calculates the LU decomposition of the adjacency matrix of a plurality of data points.

ベクトル計算部１０９は、エッジの重みに基づいて、複数のデータポイントについてのＬＬＥカーネルの固有ベクトルを計算する。例えば、ベクトル計算部１０９は、ベクトル計算部１０９は、ＬＵ分解計算部１０８によって計算されたＬＵ分解に基づいて、逆ベキ乗法を用いてＬＬＥカーネルの固有ベクトル及び固有値を計算する（３４−３６行目）。 The vector calculation unit 109 calculates the eigenvector of the LLE kernel for a plurality of data points based on the edge weight. For example, the vector calculation unit 109 calculates the eigenvector and eigenvalue of the LLE kernel using the inverse power multiplication method based on the LU decomposition calculated by the LU decomposition calculation unit 108 (lines 34-36). ).

なお、２８−３０行目で、エッジ重み計算部１０６は、常に２９行目を実行するようにしてもよい。この場合、従来のＬＬＥと同様に、エッジ重み計算部１０６は、グラム行列を用いてエッジの重みの計算を行う。また、並び替え部１０７及びＬＵ分解計算部１０８による処理が実行されないようにしてもよい。この場合、ベクトル計算部１０９は、従来のＬＬＥと同様の方法で固有ベクトル及び固有値を計算する。また、また、図５のアルゴリズム１５に対しては、図６に示す性質（定理１〜３）が成り立つ。図６は、定理１、定理２及び定理３を示す図である。 The edge weight calculation unit 106 may always execute the 29th line in the 28th to 30th lines. In this case, similarly to the conventional LLE, the edge weight calculation unit 106 calculates the edge weight using the Gram matrix. Further, the processing by the rearrangement unit 107 and the LU decomposition calculation unit 108 may not be executed. In this case, the vector calculation unit 109 calculates the eigenvector and the eigenvalue by the same method as the conventional LLE. Further, for the algorithm 15 shown in FIG. 5, the properties shown in FIG. 6 (Theorems 1 to 3) hold. FIG. 6 is a diagram showing Theorem 1, Theorem 2 and Theorem 3.

［第１の実施形態の処理］
図７を用いて、ＬＬＥ計算装置１０の処理の流れについて説明する。図７は、第１の実施形態に係るＬＬＥ計算装置の処理の流れを示すフローチャートである。まず、ＬＬＥ計算装置１０には、行列Ｘ、次元数ｍ及びランクｓが入力される（ステップＳ１０１）。次に、特異値分解計算部１０１はＳを空集合に初期化する（ステップＳ１０２）。そして、特異値分解計算部１０１は行列ＸのランクｓのＳＶＤを計算する（ステップＳ１０３）。 [Processing of First Embodiment]
The processing flow of the LLE calculation device 10 will be described with reference to FIG. 7. FIG. 7 is a flowchart showing a processing flow of the LLE calculation device according to the first embodiment. First, the matrix X, the number of dimensions m, and the rank s are input to the LLE calculation device 10 (step S101). Next, the singular value decomposition calculation unit 101 initializes S to an empty set (step S102). Then, the singular value decomposition calculation unit 101 calculates the SVD of the rank s of the matrix X (step S103).

データポイント選択部１０２は、ｉを１からＮまで増やしながら複数のデータポイントから計算対象のデータポイントを選択する（ステップＳ１０４、Ｓ１０５、Ｓ１３２）。そして、データポイント選択部１０２は、Ｄとθを初期化し（ステップＳ１０６、Ｓ１０７）、さらに、Ｋ個の距離が∞になるデータポイントを追加することでＮ［ｘ_ｐ］を初期化する（ステップＳ１０８）。 The data point selection unit 102 selects a data point to be calculated from a plurality of data points while increasing i from 1 to N (steps S104, S105, S132). Then, the data point selection unit 102 initializes D and θ (steps S106 and S107), and further initializes N[x _p ] by adding K data points at which the distance is ∞ (step S106). S108).

近似距離計算部１０３は、複数のデータポイントのそれぞれについて、計算対象のデータポイントとの間の近似の距離を、特異値分解に基づいて計算する（ステップＳ１０９、Ｓ１１０、Ｓ１１１）。このとき、近似距離計算部１０３は、近傍を計算するためにランクを２^１から２^ｌに徐々に増やしていくが、もし近似の距離がθより大きくなれば（ステップＳ１１２、Ｙｅｓ）データポイントを枝刈りする（ステップＳ１１３）。また、近似の距離がθより大きくない場合（ステップＳ１１２、Ｎｏ）近似距離計算部１０３は次の処理に進む（ステップＳ１１４）。 The approximate distance calculation unit 103 calculates an approximate distance between each of the plurality of data points and the data point to be calculated based on the singular value decomposition (steps S109, S110, S111). At this time, the approximate distance calculation unit 103 gradually increases the rank from 2 ¹ to 2 ¹ in order to calculate the neighborhood, but if the approximate distance becomes larger than θ (step S112, Yes), the data points are calculated. Pruning is performed (step S113). If the approximate distance is not larger than θ (No in step S112), the approximate distance calculation unit 103 proceeds to the next process (step S114).

ここで、枝刈りされていないデータポイントが残っていない場合（ステップＳ１１５、Ｎｏ）、ＬＬＥ計算装置１０は、ステップＳ１２８へ進む。一方、枝刈りされていないデータポイントが残っている場合（ステップＳ１１５、Ｙｅｓ）、距離推定部１０４は、近似の距離が所定値以下であるデータポイントのそれぞれと、計算対象のデータポイントとの間のユークリッド距離の推定値を計算する（ステップＳ１１６、Ｓ１１７）。距離の推定値がθより大きくなれば（ステップＳ１１８、Ｙｅｓ）、距離推定部１０４はデータポイントを枝刈りする（ステップＳ１１９）。また、近似の距離がθより大きくない場合（ステップＳ１１８、Ｎｏ）距離推定部１０４は次の処理に進む（ステップＳ１２０）。 Here, when there is no data point that has not been pruned (No in step S115), the LLE calculation device 10 proceeds to step S128. On the other hand, when data points that have not been pruned remain (step S115, Yes), the distance estimation unit 104 sets a distance between each data point whose approximate distance is equal to or less than a predetermined value and the data point to be calculated. The estimated value of the Euclidean distance is calculated (steps S116 and S117). If the estimated distance value is larger than θ (step S118, Yes), the distance estimation unit 104 prunes the data point (step S119). If the approximate distance is not larger than θ (No in step S118), the distance estimation unit 104 proceeds to the next process (step S120).

ここで、枝刈りされていないデータポイントが残っていない場合（ステップＳ１２１、Ｎｏ）、ＬＬＥ計算装置１０は、ステップＳ１２８へ進む。一方、枝刈りされていないデータポイントが残っている場合（ステップＳ１２１、Ｙｅｓ）、近傍確定部１０５は、ユークリッド距離の推定値が所定値以下であるデータポイントのそれぞれと、計算対象のデータポイントとの間のユークリッド距離を計算し（ステップＳ１２２）、当該計算したユークリッド距離が所定値以下であるデータポイント（ステップＳ１２３、Ｙｅｓ）を、計算対象のデータポイントの近傍のデータポイントに確定し、Ｎ［ｘ_ｐ］をそのデータポイントの距離を用いて更新する（ステップＳ１２４、Ｓ１２５、Ｓ１２６）。また、近傍確定部１０５は、θを更新する（ステップＳ１２７）。そして、近傍確定部１０５は、最も多くの近傍を持つデータポイントを見つける（ステップＳ１２８）。 Here, when there is no data point that has not been pruned (No in step S121), the LLE calculation device 10 proceeds to step S128. On the other hand, when data points that have not been pruned remain (step S121, Yes), the neighborhood determining unit 105 determines each of the data points for which the Euclidean distance estimated value is equal to or less than a predetermined value and the data point to be calculated. The Euclidean distance between the calculated data points is calculated (step S122), and the data points whose calculated Euclidean distance is less than or equal to a predetermined value (step S123, Yes) are determined as data points near the data point to be calculated, and N[ x _p ] is updated using the distance of the data point (steps S124, S125, S126). Further, the proximity determining unit 105 updates θ (step S127). Then, the neighborhood determination unit 105 finds a data point having the largest number of neighbors (step S128).

エッジ重み計算部１０６は、共有する近傍がある場合（ステップＳ１２９、Ｙｅｓ）はＷｏｏｄｂｕｒｙの公式を用いてエッジの重みを計算し（ステップＳ１３０）、そうでない場合（ステップＳ１２９、Ｎｏ）はグラム行列を用いてエッジの重みを計算する（ステップＳ１３１）。 The edge weight calculation unit 106 calculates the edge weight using the Woodbury formula when there is a shared neighborhood (step S129, Yes) (step S130), and when not (step S129, No), the gram matrix is calculated. The edge weight is calculated by using (step S131).

並び替え部１０７は、複数のデータポイントについてのエッジの重みを表す行列を、次数の昇順に並び替える（ステップＳ１３３）。また、ＬＵ分解計算部１０８は、複数のデータポイントの隣接行列のＬＵ分解を計算する（ステップＳ１３４）。 The rearrangement unit 107 rearranges the matrix representing the edge weights of a plurality of data points in ascending order of the order (step S133). Also, the LU decomposition calculation unit 108 calculates the LU decomposition of the adjacency matrix of the plurality of data points (step S134).

ベクトル計算部１０９は、エッジの重みに基づいて、複数のデータポイントについてのＬＬＥカーネルの固有ベクトルを計算する。例えば、ベクトル計算部１０９は、ＬＵ分解計算部１０８によって計算されたＬＵ分解に基づいて、逆ベキ乗法を用いてＬＬＥカーネルの固有ベクトル及び固有値を計算する（ステップＳ１３５、Ｓ１３６、Ｓ１３７、Ｓ１３８）。そして、ＬＬＥ計算装置１０は、次元削減された行列Ｙを出力する（ステップＳ１３９）。 The vector calculation unit 109 calculates the eigenvector of the LLE kernel for a plurality of data points based on the edge weight. For example, the vector calculation unit 109 calculates the eigenvector and eigenvalue of the LLE kernel using the inverse power method based on the LU decomposition calculated by the LU decomposition calculation unit 108 (steps S135, S136, S137, S138). Then, the LLE calculation device 10 outputs the dimension-reduced matrix Y (step S139).

［第１の実施形態の効果］
特異値分解計算部１０１は、複数のデータポイントを有する多次元行列の特異値分解を計算する。また、データポイント選択部１０２は、複数のデータポイントから計算対象のデータポイントを選択する。また、近似距離計算部１０３は、複数のデータポイントのそれぞれについて、計算対象のデータポイントとの間の近似の距離を、特異値分解に基づいて計算する。また、距離推定部１０４は、近似の距離が所定値以下であるデータポイントのそれぞれと、計算対象のデータポイントとの間のユークリッド距離の推定値を計算する。また、近傍確定部１０５は、ユークリッド距離の推定値が所定値以下であるデータポイントのそれぞれと、計算対象のデータポイントとの間のユークリッド距離を計算し、当該計算したユークリッド距離が所定値以下であるデータポイントを、計算対象のデータポイントの近傍のデータポイントに確定する。また、エッジ重み計算部１０６は、計算対象のデータポイントと計算対象のデータポイントの近傍のデータポイントのそれぞれとの間のエッジの重みを計算する。また、ベクトル計算部１０９は、エッジの重みに基づいて、複数のデータポイントについてのＬＬＥカーネルの固有ベクトルを計算する。このように、本実施形態では、近傍確定部１０５によってユークリッドの距離が実際に計算される対象を、近似の距離及びユークリッド距離の推定値を用いてあらかじめ削減している。このため、本実施形態によれば、ＬＬＥによる次元削減の際の計算コスト及びメモリコストを低減させることができる。 [Effects of First Embodiment]
The singular value decomposition calculation unit 101 calculates the singular value decomposition of a multidimensional matrix having a plurality of data points. Further, the data point selection unit 102 selects a data point to be calculated from the plurality of data points. Further, the approximate distance calculation unit 103 calculates an approximate distance between each of the plurality of data points and the data point to be calculated based on the singular value decomposition. The distance estimation unit 104 also calculates an estimated value of the Euclidean distance between each data point whose approximate distance is equal to or less than a predetermined value and the data point to be calculated. Also, the neighborhood determining unit 105 calculates the Euclidean distance between each data point whose estimated value of the Euclidean distance is less than or equal to a predetermined value and the data point to be calculated, and the calculated Euclidean distance is less than or equal to the predetermined value. Establish a data point as a data point near the data point to be calculated. Further, the edge weight calculation unit 106 calculates the weight of the edge between the data point to be calculated and each data point in the vicinity of the data point to be calculated. Further, the vector calculation unit 109 calculates the eigenvectors of the LLE kernel for the plurality of data points based on the edge weights. As described above, in the present embodiment, the target for which the Euclidean distance is actually calculated by the neighborhood determining unit 105 is reduced in advance by using the approximate distance and the estimated value of the Euclidean distance. Therefore, according to this embodiment, it is possible to reduce the calculation cost and the memory cost when the dimension is reduced by LLE.

エッジ重み計算部１０６は、エッジの重みが計算済みである計算済みデータポイントの近傍データポイントに、計算対象のデータポイントの近傍のデータポイントのうちの少なくとも一部が含まれる場合、計算済みデータポイントについてのエッジの重みの計算結果に基づいて、計算対象のデータポイントと計算対象のデータポイントの近傍のデータポイントのそれぞれとの間のエッジの重みを計算することができる。このように、同じ近傍を持つデータポイントについては、計算済みの結果を利用してすることで、エッジの重みの計算に要するコストを低減させることができる。 The edge weight calculation unit 106 calculates the calculated data points when the data points in the vicinity of the calculated data points for which the edge weights have been calculated include at least some of the data points in the vicinity of the data point to be calculated. The edge weight between the data point to be calculated and each of the data points in the vicinity of the data point to be calculated can be calculated based on the calculation result of the edge weight with respect to. In this way, for data points having the same neighborhood, by using the already calculated result, it is possible to reduce the cost required for calculating the edge weight.

エッジ重み計算部１０６は、Ｗｏｏｄｂｕｒｙの公式を用いて、計算済みデータポイントの近傍データポイントに関する逆行列から、計算対象のデータポイントの近傍のデータポイントに関する逆行列を計算することができる。このように、Ｗｏｏｄｂｕｒｙの公式を用いることで、特に逆行列の計算に要するコストを低減させることができる。 The edge weight calculation unit 106 can use the Woodbury formula to calculate the inverse matrix of the data points in the vicinity of the data point to be calculated from the inverse matrix of the data points in the vicinity of the calculated data point. As described above, by using the Woodbury formula, it is possible to reduce the cost particularly required for the calculation of the inverse matrix.

並び替え部１０７は、複数のデータポイントについてのエッジの重みを表す行列を、次数の昇順に並び替える。また、ＬＵ分解計算部１０８は、複数のデータポイントの隣接行列のＬＵ分解を計算する。このとき、ベクトル計算部１０９は、ＬＵ分解計算部１０８によって計算されたＬＵ分解に基づいてＬＬＥカーネルの固有ベクトルを計算する。このように、ＬＵ分解を行うことで、固有ベクトルの計算に要するコストを低減させることができる。 The sorting unit 107 sorts the matrix representing the edge weights of a plurality of data points in ascending order of the order. The LU decomposition calculation unit 108 also calculates the LU decomposition of the adjacency matrix of a plurality of data points. At this time, the vector calculation unit 109 calculates the eigenvector of the LLE kernel based on the LU decomposition calculated by the LU decomposition calculation unit 108. By performing LU decomposition in this way, it is possible to reduce the cost required to calculate the eigenvector.

［システム構成等］
また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示のように構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部又は一部を、各種の負荷や使用状況等に応じて、任意の単位で機能的又は物理的に分散・統合して構成することができる。さらに、各装置にて行われる各処理機能は、その全部又は任意の一部が、ＣＰＵ（Central Processing Unit）及び当該ＣＰＵにて解析実行されるプログラムにて実現され、あるいは、ワイヤードロジックによるハードウェアとして実現され得る。 [System configuration, etc.]
Further, each constituent element of each illustrated device is functionally conceptual, and does not necessarily have to be physically configured as illustrated. That is, the specific form of distribution/integration of each device is not limited to the one shown in the figure, and all or part of the device may be functionally or physically distributed/arranged in arbitrary units according to various loads and usage conditions. It can be integrated and configured. Furthermore, each processing function performed in each device is realized in whole or in part by a CPU (Central Processing Unit) and a program that is analyzed and executed by the CPU, or a hardware by a wired logic. Can be realized as.

また、本実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部又は一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部又は一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、制御手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。 Further, of the processes described in the present embodiment, all or part of the processes described as being automatically performed may be manually performed, or the processes described as manually performed may be performed. All or part of the process can be automatically performed by a known method. In addition, the processing procedures, control procedures, specific names, and information including various data and parameters shown in the above-mentioned documents and drawings can be arbitrarily changed unless otherwise specified.

［プログラム］
一実施形態として、ＬＬＥ計算装置１０は、パッケージソフトウェアやオンラインソフトウェアとして上記のＬＬＥの計算を実行するＬＬＥ計算プログラムを所望のコンピュータにインストールさせることによって実装できる。例えば、上記のＬＬＥ計算プログラムを情報処理装置に実行させることにより、情報処理装置をＬＬＥ計算装置１０として機能させることができる。ここで言う情報処理装置には、デスクトップ型又はノート型のパーソナルコンピュータが含まれる。また、その他にも、情報処理装置にはスマートフォン、携帯電話機やＰＨＳ（Personal Handyphone System）等の移動体通信端末、さらには、ＰＤＡ（Personal Digital Assistant）等のスレート端末等がその範疇に含まれる。 [program]
As one embodiment, the LLE calculation device 10 can be implemented by installing an LLE calculation program that executes the above LLE calculation as package software or online software in a desired computer. For example, by causing the information processing apparatus to execute the above LLE calculation program, the information processing apparatus can be caused to function as the LLE calculation apparatus 10. The information processing device mentioned here includes a desktop or notebook personal computer. In addition, the information processing apparatus also includes a mobile communication terminal such as a smartphone, a mobile phone, a PHS (Personal Handyphone System), and a slate terminal such as a PDA (Personal Digital Assistant) in its category.

また、ＬＬＥ計算装置１０は、ユーザが使用する端末装置をクライアントとし、当該クライアントに上記のＬＬＥの計算に関するサービスを提供するＬＬＥ計算サーバ装置として実装することもできる。例えば、ＬＬＥ計算サーバ装置は、多次元行列を入力とし、次元削減した行列を出力とするＬＬＥ計算サービスを提供するサーバ装置として実装される。この場合、ＬＬＥ計算サーバ装置は、Ｗｅｂサーバとして実装することとしてもよいし、アウトソーシングによって上記のＬＬＥの計算に関するサービスを提供するクラウドとして実装することとしてもかまわない。 The LLE calculation device 10 can also be implemented as a LLE calculation server device that uses a terminal device used by a user as a client and provides the client with a service related to the above LLE calculation. For example, the LLE calculation server device is implemented as a server device that provides an LLE calculation service in which a multidimensional matrix is input and a dimension-reduced matrix is output. In this case, the LLE calculation server device may be implemented as a Web server, or may be implemented as a cloud that provides the above-mentioned LLE calculation service by outsourcing.

図８は、ＬＬＥ計算プログラムを実行するコンピュータの一例を示す図である。コンピュータ１０００は、例えば、メモリ１０１０、ＣＰＵ１０２０を有する。また、コンピュータ１０００は、ハードディスクドライブインタフェース１０３０、ディスクドライブインタフェース１０４０、シリアルポートインタフェース１０５０、ビデオアダプタ１０６０、ネットワークインタフェース１０７０を有する。これらの各部は、バス１０８０によって接続される。 FIG. 8 is a diagram illustrating an example of a computer that executes the LLE calculation program. The computer 1000 has, for example, a memory 1010 and a CPU 1020. The computer 1000 also has a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. These units are connected by a bus 1080.

メモリ１０１０は、ＲＯＭ（Read Only Memory）１０１１及びＲＡＭ（Random access memory）１０１２を含む。ＲＯＭ１０１１は、例えば、ＢＩＯＳ（Basic Input Output System）等のブートプログラムを記憶する。ハードディスクドライブインタフェース１０３０は、ハードディスクドライブ１０９０に接続される。ディスクドライブインタフェース１０４０は、ディスクドライブ１１００に接続される。例えば磁気ディスクや光ディスク等の着脱可能な記憶媒体が、ディスクドライブ１１００に挿入される。シリアルポートインタフェース１０５０は、例えばマウス１１１０、キーボード１１２０に接続される。ビデオアダプタ１０６０は、例えばディスプレイ１１３０に接続される。 The memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM (Random access memory) 1012. The ROM 1011 stores, for example, a boot program such as a BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to the hard disk drive 1090. The disk drive interface 1040 is connected to the disk drive 1100. For example, a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100. The serial port interface 1050 is connected to, for example, a mouse 1110 and a keyboard 1120. The video adapter 1060 is connected to the display 1130, for example.

ハードディスクドライブ１０９０は、例えば、ＯＳ（Operating System）１０９１、アプリケーションプログラム１０９２、プログラムモジュール１０９３、プログラムデータ１０９４を記憶する。すなわち、ＬＬＥ計算装置１０の各処理を規定するプログラムは、コンピュータにより実行可能なコードが記述されたプログラムモジュール１０９３として実装される。プログラムモジュール１０９３は、例えばハードディスクドライブ１０９０に記憶される。例えば、ＬＬＥ計算装置１０における機能構成と同様の処理を実行するためのプログラムモジュール１０９３が、ハードディスクドライブ１０９０に記憶される。なお、ハードディスクドライブ１０９０は、ＳＳＤにより代替されてもよい。 The hard disk drive 1090 stores, for example, an OS (Operating System) 1091, an application program 1092, a program module 1093, and program data 1094. That is, the program that defines each process of the LLE computing device 10 is implemented as a program module 1093 in which code executable by a computer is described. The program module 1093 is stored in the hard disk drive 1090, for example. For example, the hard disk drive 1090 stores a program module 1093 for executing the same processing as the functional configuration of the LLE computing device 10. The hard disk drive 1090 may be replaced by SSD.

また、上述した実施形態の処理で用いられる設定データは、プログラムデータ１０９４として、例えばメモリ１０１０やハードディスクドライブ１０９０に記憶される。そして、ＣＰＵ１０２０が、メモリ１０１０やハードディスクドライブ１０９０に記憶されたプログラムモジュール１０９３やプログラムデータ１０９４を必要に応じてＲＡＭ１０１２に読み出して実行する。 Further, the setting data used in the processing of the above-described embodiment is stored as the program data 1094 in the memory 1010 or the hard disk drive 1090, for example. Then, the CPU 1020 reads out the program module 1093 and the program data 1094 stored in the memory 1010 or the hard disk drive 1090 to the RAM 1012 as necessary and executes them.

なお、プログラムモジュール１０９３やプログラムデータ１０９４は、ハードディスクドライブ１０９０に記憶される場合に限らず、例えば着脱可能な記憶媒体に記憶され、ディスクドライブ１１００等を介してＣＰＵ１０２０によって読み出されてもよい。あるいは、プログラムモジュール１０９３及びプログラムデータ１０９４は、ネットワーク（ＬＡＮ（Local Area Network）、ＷＡＮ（Wide Area Network）等）を介して接続された他のコンピュータに記憶されてもよい。そして、プログラムモジュール１０９３及びプログラムデータ１０９４は、他のコンピュータから、ネットワークインタフェース１０７０を介してＣＰＵ１０２０によって読み出されてもよい。 The program module 1093 and the program data 1094 are not limited to being stored in the hard disk drive 1090, but may be stored in, for example, a removable storage medium and read by the CPU 1020 via the disk drive 1100 or the like. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (LAN (Local Area Network), WAN (Wide Area Network), etc.). Then, the program module 1093 and the program data 1094 may be read by the CPU 1020 from another computer via the network interface 1070.

１０ＬＬＥ計算装置
１０１特異値分解計算部
１０２データポイント選択部
１０３近似距離計算部
１０４距離推定部
１０５近傍確定部
１０６エッジ重み計算部
１０７並び替え部
１０８ＬＵ分解計算部
１０９ベクトル計算部
10 LLE Calculator 101 Singular Value Decomposition Calculator 102 Data Point Selector 103 Approximate Distance Calculator 104 Distance Estimator 105 Neighbor Determinator 106 Edge Weight Calculator 107 Sorting Unit 108 LU Decomposition Calculator 109 Vector Calculator

Claims

A singular value decomposition calculator that calculates the singular value decomposition of a multidimensional matrix having multiple data points,
A data point selection unit that selects a data point to be calculated from the plurality of data points,
For each of the plurality of data points, an approximate distance between the data point to be calculated, an approximate distance calculation unit that calculates based on the singular value decomposition,
A distance estimation unit that calculates an estimated value of the Euclidean distance between each of the data points whose approximate distance is less than or equal to a predetermined value and the data point to be calculated,
The Euclidean distance between each of the data points whose estimated value of the Euclidean distance is less than or equal to a predetermined value and the data point to be calculated is calculated, and the calculated Euclidean distance is equal to or less than a predetermined value. A neighborhood determining unit that determines the data points near the data point to be calculated,
An edge weight calculation unit that calculates a weight of an edge between the data point to be calculated and each of the data points in the vicinity of the data point to be calculated,
A vector calculator that calculates an eigenvector of the LLE kernel for the plurality of data points based on the edge weights;
An LLE calculation device comprising:

The edge weight calculator calculates the edge weights when the data points in the vicinity of the calculated data points for which the edge weights have been calculated include at least a part of the data points in the vicinity of the data point to be calculated. Calculating an edge weight between the data point to be calculated and each of the data points in the vicinity of the data point to be calculated based on the calculation result of the weight of the edge for the processed data point. The LLE calculation device according to claim 1.

The edge weight calculator may use a Woodbury formula to calculate an inverse matrix of data points in the vicinity of the data point to be calculated from an inverse matrix of data points in the vicinity of the calculated data point. The LLE calculation device according to claim 2.

A sorting unit that sorts the matrix representing the edge weights of the plurality of data points in ascending order of order,
An LU decomposition calculator for calculating an LU decomposition of an adjacency matrix of the plurality of data points,
Further has
The LLE calculation device according to claim 1, wherein the vector calculation unit calculates the eigenvector of the LLE kernel based on the LU decomposition calculated by the LU decomposition calculation unit.

An LLE calculation method executed by an LLE calculation device, comprising:
A singular value decomposition calculation step for calculating the singular value decomposition of a multidimensional matrix having a plurality of data points,
A data point selecting step of selecting a data point to be calculated from the plurality of data points,
For each of the plurality of data points, an approximate distance between the data points to be calculated, an approximate distance calculation step of calculating based on the singular value decomposition,
A distance estimating step of calculating an estimated value of the Euclidean distance between each of the data points whose approximate distance is equal to or less than a predetermined value and the data point to be calculated,
The Euclidean distance between each of the data points whose estimated value of the Euclidean distance is less than or equal to a predetermined value and the data point to be calculated is calculated, and the calculated Euclidean distance is equal to or less than a predetermined value. A neighborhood confirmation step of confirming data points in the vicinity of the data point to be calculated,
An edge weight calculation step of calculating a weight of an edge between the data point to be calculated and each of the data points in the vicinity of the data point to be calculated,
A vector calculation step of calculating an eigenvector of the LLE kernel for the plurality of data points based on the edge weights;
An LLE calculation method comprising:

An LLE calculation program for causing a computer to function as the LLE calculation device according to claim 1.