JP2953929B2

JP2953929B2 - Modified Cholesky decomposition calculator

Info

Publication number: JP2953929B2
Application number: JP29791393A
Authority: JP
Inventors: 淳一萬; 健一高田
Original assignee: NIPPON DENKI GIJUTSU JOHO SHISUTEMU KAIHATSU KK; Nippon Electric Co Ltd
Current assignee: NIPPON DENKI GIJUTSU JOHO SHISUTEMU KAIHATSU KK; NEC Corp
Priority date: 1993-11-29
Filing date: 1993-11-29
Publication date: 1999-09-27
Anticipated expiration: 2014-09-27
Also published as: JPH07152726A

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【産業上の利用分野】本発明は、電子計算機を用いて数
値計算を行なう数値計算装置に関し、特に、修正コレス
キー分解を計算する修正コレスキー分解計算装置に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a numerical calculation apparatus for performing a numerical calculation using an electronic computer, and more particularly to a modified Cholesky decomposition calculation apparatus for calculating a modified Cholesky decomposition.

【０００２】[0002]

【従来の技術】与えられた対称行列の修正コレスキー分
解の計算をその行列を構成する小行列、すなわちブロッ
ク行列の計算に帰着させる方法をブロック化処理と呼
び、これにより修正コレスキー分解の計算の大部分がブ
ロック化行列の積と差に帰着される。2. Description of the Related Art A method of reducing the calculation of a modified Cholesky factorization of a given symmetric matrix into the calculation of a small matrix constituting the matrix, that is, a block matrix, is called a blocking process. Is reduced to the product and difference of the blocking matrices.

【０００３】与えられた行列を修正コレスキー分解する
方法として修正コレスキー法というのが知られている
が、これを電子計算機で高速に計算するために次のよう
な方法が行われている。なお、詳細はジェイ・ドンゲエ
ラ「ア・セット・オブ・レベル３・ベーシック・リニア
・アルジブラ・サブプログラムズ」( 文献１：Ｊ．ＤＯ
ＮＧＡＲＲＡ「A Set of Level 3 Basic Linear Algebra
Subprograms」 ACMTransaction on Mathematical Softw
are,Vol.16,No.1,March,1990)に記述されている。[0003] The modified Cholesky method is known as a method of modifying a given matrix by modified Cholesky decomposition, and the following method is used to calculate the modified Cholesky method at high speed by an electronic computer. For details, see Jay Dongeera, "A Set of Level 3 Basic Linear Algibra Subprograms" (Reference 1: J.DO)
NGARRA `` A Set of Level 3 Basic Linear Algebra
Subprograms '' ACMTransaction on Mathematical Softw
are, Vol. 16, No. 1, March, 1990).

【０００４】まず行列Ａ，Ｌ，ＤおよびＵをFirst, the matrices A, L, D and U are

【０００５】 [0005]

【０００６】のようにブロック行列に分割する。Ａ11，
Ｌ11およびＵ11は同じ次数Ｍの正方行列で、このＭを以
降ブロックサイズと呼ぶ。すると、Ａ＝ＬＤＵは、[0006] As shown in FIG. A11,
L11 and U11 are square matrices of the same order M, and M is hereinafter referred to as a block size. Then, A = LDU is

【０００７】 [0007]

【０００８】という３つの式に書き換えられる。Ｌ11，
Ｌ21，Ｕ11およびＵ12は修正コレスキー法で計算でき
る。この結果を用いれば式（３）の左辺が計算できるの
で、それを新たに行列Ａと見なして再びブロックに分割
して上記の計算を行なう。この過程を繰り返して修正コ
レスキー分解の計算を行なうのがブロック化処理による
修正コレスキー分解である。この方法では、上記の式
（３）の左辺の計算が、計算量の大部分を占める。した
がって修正コレスキー分解を高速に計算するという問題
は、Ａ22−Ｌ21Ｄ11Ｕ12 （４）と
いうブロック化行列の計算を高速に行なうという問題に
帰着する。ここでＡをＫ×Ｋ行列、Ｌ21をＫ×Ｍ行列そ
してＵ12をＭ×Ｋ行列とすると、Ｋは計算の過程で変化
するが、ブロックサイズＭは計算の過程で変化しない。
修正コレスキー分解の計算速度は、中央処理装置のレジ
スタの構成や、キャッシュメモリの容量といった電子計
算機のバードウェアの構成ならびに、与えられた行列Ａ
の次数とブロックサイズＭに依存する。このうち人為的
に設定可能なのは、ブロックサイズだけであるが、文献
１によると、従来は様々な行列の次数に対して、平均し
て最も大きな計算速度を示したブロックサイズに固定し
て計算を行なっているため、使用している電子計算機で
実現しうる最大の計算速度が、行列の任意の次数に対し
て得られるとは限らない。Can be rewritten into the following three equations. L11,
L21, U11 and U12 can be calculated by the modified Cholesky method. Since the use of the results of this is the left-hand side of equation (3) can be calculated, which was divided again regarded as new matrix A block performing the above calculation. The calculation of the modified Cholesky decomposition by repeating this process is the modified Cholesky decomposition by block processing. In this method, the calculation on the left side of Equation (3) occupies most of the calculation amount. Therefore, the problem of calculating the modified Cholesky decomposition at high speed results in the problem of calculating the blocking matrix A22-L21D11U12 (4) at high speed. Here the A K × K matrix and the K × M matrix and U 12 of L21 and M × K matrix, K is varies in the course of calculation, the block size M does not change in the course of calculations.
The calculation speed of the modified Cholesky decomposition depends on the configuration of the central processing unit registers, the configuration of the hardware of the computer such as the capacity of the cache memory, and the given matrix A.
And the block size M. Of these, only the block size can be set artificially, but according to Literature 1, the calculation is conventionally fixed to the block size showing the highest calculation speed on average for various matrix orders. Therefore, the maximum calculation speed that can be realized by the computer used is not always obtained for any order of the matrix.

【０００９】[0009]

【発明が解決しようとする課題】本発明の目的は、与え
られた行列に対して、最適なブロックサイズを自動的に
選択して、修正コレスキー分解を計算できるようにした
修正コレスキー分解計算装置を提供することにある。SUMMARY OF THE INVENTION An object of the present invention is to provide a modified Cholesky decomposition calculation in which an optimum block size is automatically selected for a given matrix so that a modified Cholesky decomposition can be calculated. It is to provide a device.

【００１０】[0010]

【課題を解決するための手段】本発明の修正コレスキー
分解計算装置は、対称行列を下三角行列Ｌ，対角行列
Ｄ、およびＬの転置行列Ｕを用いて、ＬＤＵ形式の行列
に表わす修正コレスキー分解を計算するための計算装置
であって、前記計算装置は入力装置と出力装置とデータ
処理部と記憶装置から構成され、前記入力装置は前記対
称行列を入力し、前記出力装置は前記記憶装置内の前記
ＬＤＵ形式の行列を出力し、前記記憶装置は、ブロック
サイズと行列の次数Ｎの関係を記録した関係データを記
憶するブロックサイズ決定用データ記憶部と、前記対称
行列と前記ＬＤＵ形式の行列を記憶する行列記憶部から
構成され、前記データ処理部は、行列記憶部内の前記対
称行列と前記関係データを参照し、前記対称行列の次数
Ｎと一致する前記関係データ内の前記ブロックサイズを
選択するブロックサイズ決定手段と、前記ブロックサイ
ズ決定手段により決定された前記ブロックサイズに前記
対称行列を分割し修正コレスキー演算を実行し前記ＬＤ
Ｕ形式の行列を求め、前記記憶部内の前記行列記憶部へ
格納する修正コレスキー分解計算手段とを有する。 SUMMARY OF THE INVENTION The modified Cholesky of the present invention.
The decomposition calculator calculates the symmetric matrix as a lower triangular matrix L, a diagonal matrix
Using a transpose matrix U of D and L, a matrix in LDU format
For calculating the modified Cholesky factorization represented by
Wherein the computing device comprises an input device, an output device and data
The input device comprises a processing unit and a storage device.
Input device, and the output device outputs the
Output a matrix in LDU format, wherein the storage device
Record the relationship data that records the relationship between the size and the degree N of the matrix.
The block size determination data storage unit and the symmetric
From the matrix storage unit that stores the matrix and the matrix in the LDU format
The data processing unit is configured to store the pair in a matrix storage unit.
The order of the symmetric matrix with reference to the
The block size in the relational data that matches N
Means for determining a block size to be selected;
To the block size determined by the size determination means.
The symmetric matrix is divided, a modified Cholesky operation is performed, and the LD
Calculate a matrix in U format and store it in the matrix storage section in the storage section.
And a modified Cholesky decomposition calculations means for storing.

【００１１】[0011]

【００１２】[0012]

【実施例】次に、本発明について図面を参照して説明す
る。Next, the present invention will be described with reference to the drawings.

【００１３】図１は本発明の一実施例を示すブロック図
である。この実施例は、キーボードなどの入力装置１
と、データ処理装置２と、データを記憶する記憶装置３
と、ディスプレイなどの出力装置４とから構成される。 FIG . 1 is a block diagram showing an embodiment of the present invention.
It is. This embodiment uses an input device 1 such as a keyboard.
, A data processing device 2 and a storage device 3 for storing data
And an output device 4 such as a display.

【００１４】データ処理部２はブロックサイズ決定手段
２１と、修正コレスキー分解計算手段２２とを備える。
記憶装置３はブロックサイズ決定用データ記憶部３１と
行列記憶部３２とを備える。The data processing unit 2 includes a block size determining means 21 and a modified Cholesky decomposition calculating means 22.
The storage device 3 includes a block size determination data storage unit 31 and a matrix storage unit 32.

【００１５】ブロックサイズ決定用データ記憶部３１
は、どんなブロックサイズを取れば最も速く修正コレス
キー分解ができるかというデータをいくつかの行列の数
次Ｎに対してあらかじめ記憶している。行列記憶部３２
は動作開始直後には、分解すべき行列Ａが記憶されてい
るが、データ処理装置２の動作に伴って、その内容は更
新される。Block size determination data storage unit 31
Stores in advance data on what block size is to be used for the fastest modified Cholesky decomposition for several orders N of several matrices. Matrix storage unit 32
Immediately after the start of the operation, the matrix A to be decomposed is stored, but its contents are updated with the operation of the data processing device 2.

【００１６】次に図１と図２を参照して、この実施例の
動作について説明する。Next, the operation of this embodiment will be described with reference to FIGS.

【００１７】入力装置１から与えられた行列Ａが行列記
憶部３２に格納され、行列Ａの次数Ｎがブロックサイズ
決定手段２１に与えられる（ステップＡ１）。ブロック
サイズ決定手段２１はあらかじめ定められたブロック化
処理を行う行列の次数の下限ＬとＮを比較する（ステッ
プＡ２）。もしＮ＜Ｌならば、修正コレスキー分解計算
手段２２はブロック化処理を用いずに修正コレスキー分
解を計算する（ステップＡ６）。もし、Ｎ＜Ｌでないな
らばブロックサイズ決定手段２１はＮを行列記憶部３２
にある最適ブロックサイズのデータと比較する（ステッ
プＡ３）。そして、該当する最適なブロックサイズが見
つかればその値を修正コレスキー分解計算手段２２に与
える。もし見つからなければ、既定値のブロックサイズ
を修正コレスキー分解計算手段２２に与える（ステップ
Ａ５）。そして修正コレスキー分解計算手段２２は行列
Ａを，従来の技術の欄で述べたように、文献１で示され
たブロック化処理により与えられたブロックサイズで修
正コレスキー分解し、結果を行列記憶部３２に格納する
（ステップＡ４）。記憶領域を節約するため、修正コレ
スキー分解の計算結果は入力された行列Ａのデータの上
に上書される。その格納のされ方であるが、行列Ａの下
三角部分に行列Ｌの下三角部分が、行列Ａの対角要素を
除く上三角部分に行列Ｕの上三角部分が格納される。最
後に出力装置４から行列記憶部３２に格納された修正コ
レスキー分解の計算結果が出力される（ステップＡ
７）。The matrix A given from the input device 1 is stored in the matrix storage unit 32, and the order N of the matrix A is given to the block size determining means 21 (step A1). The block size determining means 21 compares the lower limits L and N of the order of the matrix for performing the predetermined blocking process (step A2). If N <L, the modified Cholesky decomposition calculating means 22 calculates the modified Cholesky decomposition without using the blocking process (step A6). If N <L, the block size determination means 21 stores N in the matrix storage unit 32
(Step A3). Then, when a suitable optimal block size is found, the value is given to the modified Cholesky decomposition calculation means 22. If not found, a default block size is given to the modified Cholesky decomposition calculating means 22 (step A5). Then, the modified Cholesky decomposition calculating means 22 performs the modified Cholesky decomposition on the matrix A with the block size given by the blocking process shown in Document 1 as described in the section of the related art, and stores the result in a matrix. It is stored in the section 32 (step A4). To save storage space, the result of the modified Cholesky decomposition is overwritten on the input matrix A data. Regarding the storage method, the lower triangular part of the matrix L is stored in the lower triangular part of the matrix A, and the upper triangular part of the matrix U is stored in the upper triangular part of the matrix A excluding the diagonal elements. Finally, the output device 4 outputs the calculation result of the modified Cholesky decomposition stored in the matrix storage unit 32 (step A).
7).

【００１８】[0018]

【発明の効果】以上説明したように、従来のブロック化
処理による修正コレスキー分解の計算で固定されたブロ
ックサイズを用いているのに対し、本発明ではブロック
サイズ決定手段とブロックサイズ決定用データ記憶部を
設けることにより、最適なブロックサイズが自動的に選
択されて、修正コレスキー分解の計算速度が向上する。
また、本発明はブロック化処理を用いている修正コレス
キー分解以外の計算のための装置にも適用可能である。As described above, while the fixed block size is used in the calculation of the modified Cholesky decomposition by the conventional block processing, the present invention uses the block size determining means and the block size determining data. By providing the storage unit, the optimum block size is automatically selected, and the calculation speed of the modified Cholesky decomposition is improved.
Further, the present invention can be applied to an apparatus for calculation other than the modified Cholesky decomposition using the blocking process.

[Brief description of the drawings]

【図１】本発明の一実施例のブロック図である。FIG. 1 is a block diagram of one embodiment of the present invention.

【図２】図１の実施例の動作を示す流れ図である。FIG. 2 is a flowchart showing the operation of the embodiment of FIG.

[Explanation of symbols]

１入力装置２データ処理装置３記憶装置４出力装置２１ブロックサイズ決定手段２２修正コレスキー分解計算手段３１ブロックサイズ決定用データ記憶部３２行列記憶部 DESCRIPTION OF SYMBOLS 1 Input device 2 Data processing device 3 Storage device 4 Output device 21 Block size determination means 22 Modified Cholesky decomposition calculation means 31 Block size determination data storage unit 32 Matrix storage unit

───────────────────────────────────────────────────── フロントページの続き (56)参考文献特開平３−229363（ＪＰ，Ａ) 特開平６−75988（ＪＰ，Ａ) (58)調査した分野(Int.Cl.⁶，ＤＢ名) G06F 17/12 G06F 17/16 ＪＩＣＳＴファイル（ＪＯＩＳ)────────────────────────────────────────────────── ─── Continuation of the front page (56) References JP-A-3-229363 (JP, A) JP-A-6-75988 (JP, A) (58) Fields investigated (Int.Cl. ⁶ , DB name) G06F 17/12 G06F 17/16 JICST file (JOIS)

Claims

(57) [Claims]

1. A symmetric matrix is defined as a lower triangular matrix L, a diagonal matrix D,
And L are transposed into a matrix in LDU format
Calculation device for calculating the modified Cholesky decomposition
Therefore , the computing device is referred to as an input device, an output device, and a data processing unit.
The input device inputs the symmetric matrix.
The output device is a matrix of the LDU format in the storage device
And the storage device stores the relationship between the block size and the matrix order N.
For determining the block size for storing the relation data that records
A data storage unit, the symmetric matrix and the LDU matrix
The data processing unit is configured to store the symmetric matrix in the matrix storage unit and
Refer to the relational data and agree with the degree N of the symmetric matrix.
Block for selecting the block size in the relational data
Lock size determining means, and block size determining means
The symmetric matrix to the block size determined by
Divide and perform modified Cholesky operation to obtain the line in LDU format
Fixing a column and storing it in the matrix storage unit in the storage unit
Positive Cholesky decomposition calculation means
Modified Cholesky decomposition calculator.