JPH06202856A

JPH06202856A - Random sequence generation processing system for parallel computer system

Info

Publication number: JPH06202856A
Application number: JP4211131A
Authority: JP
Inventors: Masahide Fujisaki; 正英藤崎; Motoi Okuda; 基奥田
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1991-08-23
Filing date: 1992-08-07
Publication date: 1994-07-22

Abstract

PURPOSE:To generate a long-cycle random sequence having different arrangement at high speed by generating the random number sequence at each processor element according to M series of random sequence generating methods. CONSTITUTION:S random number initial value generating means 10 of a master processor element 1 generates (pxvxk) pieces of initial values of random numbers, and a random number initial value distributing means 11 distributes the (pxv) pieces of random number initial values to a slave processor element 2-i so as not to overlap them. On the other hand, a random number initial value receiving means 20-i of the slave processor element 2-i receives the random number initial values addressed to that element itself, and a random number generating means 21-i generates a random number value An (n>= pxv+1) by using the received random number initial values preferably according to the logic arithmetic of bit correspondence between a random number value An-pv and a random umber value An-qv when a parameter (v) satisfying the condition of 'qxv>alpha' is used or according to the logic arithmetic of bit correspondence between the random number value An-pv and a random number value An-pv+qv when a parameter (v) satisfying the condition of ' (p-q) xv>alpha' is used.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、並列計算機システムに
おける乱数列生成処理方式に関し、特に、並列計算機シ
ステムを構成する各プロセッサエレメントが、異なる並
びを持つ長い周期の乱数列を高速に生成できるようにす
る並列計算機システムにおける乱数列生成処理方式に関
する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a random number sequence generation processing method in a parallel computer system, and more particularly, to enable each processor element constituting the parallel computer system to generate a long period random number sequence having different sequences at high speed. The present invention relates to a random number sequence generation processing method in a parallel computer system.

【０００２】データ処理装置では、モンテカルロ法によ
るコンピュータ・シミュレーションを中心にして、長い
周期の乱数列を高速に生成していくことが要求されてい
る。一方、近年、データ処理能力の拡大を図るために、
並列計算機システムにより構成されるデータ処理システ
ムが普及しつつある。これから、並列計算機システムの
各プロセッサエレメントが、異なる並びを持つ長い周期
の乱数列を高速に生成していく必要性がでてきている。A data processing device is required to generate a random number sequence having a long period at high speed, centering on a computer simulation by a Monte Carlo method. On the other hand, in recent years, in order to expand the data processing capacity,
Data processing systems composed of parallel computer systems are becoming popular. From this, it has become necessary for each processor element of the parallel computer system to generate a random number sequence having a different sequence and a long period at high speed.

【０００３】[0003]

【従来の技術】これまでは、単一のベクトルプロセッサ
上で高速に乱数列を生成していく改良はなされてきたも
のの、並列計算機システムにおける乱数列の発生方法に
ついては殆ど提案されていないというのが実情である。2. Description of the Related Art Up to now, although improvements have been made in which a random number sequence is generated at high speed on a single vector processor, almost no proposal has been made regarding a method for generating a random number sequence in a parallel computer system. Is the reality.

【０００４】このようなことを背景にして、最近、P.Fr
edrickson et al.は、並列計算機システムにおける乱数
列の発生方法について１つの考え方を提案（Fredrickso
n,P., et al., "Pseudo-random trees in Monte Carl
o," Parallel Computing,Vol.1,No.2,1984,175-180.)し
た。この提案は、Pseudo-random trees という概念を導
入して、親プロセッサが、このPseudo-random trees を
用いて混合乗算法に従いつつ乱数生成の種を生成して子
プロセッサに分配し、各子プロセッサが、この分配され
た種を元にして混合乗算法に従って乱数列を生成してい
くことで、乱数列の生成を実現するという方法である。Against this background, recently P.Fr.
edrickson et al. proposed one way of thinking about the method of generating a random number sequence in a parallel computer system (Fredrickso
n, P., et al., "Pseudo-random trees in Monte Carl
o, "Parallel Computing, Vol.1, No.2, 1984, 175-180.). This proposal introduces the concept of Pseudo-random trees, and the parent processor uses these Pseudo-random trees. The seeds of random number generation are generated according to the mixed multiplication method and distributed to the child processors, and each child processor generates a random number sequence according to the mixed multiplication method based on the distributed seeds. It is a method of realizing generation.

【０００５】すなわち、ＸをPseudo-random trees の任
意の要素とすると、それから２つの要素Ｌ（Ｘ），Ｒ
（Ｘ）をＬ（Ｘ）＝（ａ_LＸ＋ｃ_L） mod ｍＲ（Ｘ）＝（ａ_RＸ＋ｃ_R） mod ｍで定義する。ここで、「ｘ mod ｙ」は、整数ｘを整数
ｙで割ったときの剰余を表している。この定義に従い、
初期値Ｘ₀が与えられると、図１３（ａ）のようにtree
が生成できる。このtreeの任意のノードから出発して右
側のsuccessor ばかりを取り出したものを、そのtreeの
なかの右系列と呼ぶ。図１３（ｂ）に示すように、特定
のノードの左分岐を取りあげるのは、その左分岐を出発
点として新しい右系列を作るときである。That is, if X is an arbitrary element of Pseudo-random trees, then two elements L (X), R
(X) is defined by L (X) = (a _L X + c _L ) mod m R (X) = (a _R X + c _R ) mod m. Here, “x mod y” represents the remainder when the integer x is divided by the integer y. According to this definition
Given an initial value X ₀ , tree as shown in FIG.
Can be generated. The successor on the right side taken from any node in this tree is called the right series in that tree. As shown in FIG. 13B, the left branch of a particular node is taken up when a new right series is created with the left branch as a starting point.

【０００６】P.Fredrickson et al.は、このPseudo-ran
dom trees に従い、親プロセッサが左分岐となる要素を
生成して、この生成した要素を子プロセッサに分配し、
子プロセッサがこの分配した要素から右系列を生成して
いくという方法により、各プロセッサで乱数列を発生し
ていく方法を提案した。[0006] P. Fredrickson et al.
According to the dom trees, the parent processor creates an element that becomes a left branch, and distributes this created element to the child processors,
We have proposed a method in which each processor generates a random number sequence by generating a right sequence from the distributed elements.

【０００７】[0007]

【発明が解決しようとする課題】しかしながら、P.Fred
rickson et al.の提案する乱数列の生成方法は、確か
に、並列計算機システムの各プロセッサにおいて乱数列
を生成できるようになるものの、混合乗算法に従ってい
るために、周期の短い乱数列しか生成できないという問
題点がある。例えば、３２ビットの計算機では、高々
（２³²−１）の周期の乱数列しか得られないのである。[Problems to be Solved by the Invention] However, P. Fred
Although the random number sequence generation method proposed by rickson et al. can certainly generate a random number sequence in each processor of a parallel computer system, it can generate only a random number sequence with a short period because it follows the mixed multiplication method. There is a problem. For example, a 32-bit computer can only obtain a random number sequence with a period of (2 ³² -1) at most.

【０００８】また、各プロセッサで生成される乱数列が
衝突しないようにすることは乱数列生成にとって極めて
重要なことである。しかしながら、P.Fredrickson et a
l.の提案する乱数列の生成方法に従うと、この衝突を防
ぐために、Pseudo-random trees の生成に用いる係数値
に複雑な制限が加わることから、その係数値の決定に困
難を伴うという問題点があった。It is extremely important for random number sequence generation to prevent the random number sequences generated by the respective processors from colliding. However, P. Fredrickson et a
According to the method of generating random number sequence proposed by l., in order to prevent this collision, there is a complicated limitation on the coefficient value used for generating pseudo-random trees, which makes it difficult to determine the coefficient value. was there.

【０００９】本発明はかかる事情に鑑みてなされたもの
であって、並列計算機システムを構成する各プロセッサ
エレメントが、異なる並びを持つ長い周期の乱数列を高
速に生成できるようにする新たな並列計算機システムに
おける乱数列生成処理方式の提供を目的とする。The present invention has been made in view of the above circumstances, and is a new parallel computer which enables each processor element constituting the parallel computer system to generate a random number sequence having a long sequence and a different sequence at a high speed. The purpose is to provide a random number sequence generation processing method in the system.

【００１０】[0010]

【課題を解決するための手段】図１ないし図３に本発明
の原理構成を図示する。図１に原理構成を示す本発明
は、分散メモリ型並列計算機システムに適用した場合の
原理構成である。1 to 3 show the principle configuration of the present invention. The present invention whose principle configuration is shown in FIG. 1 is a principle configuration when applied to a distributed memory type parallel computer system.

【００１１】図中、１は親プロセッサエレメント、２-i
（１≦ｉ≦ｋ）は子プロセッサエレメント、４は通信網
である。ここで、いずれかの子プロセッサエレメント２
-iが、親プロセッサエレメント１として機能する構成を
採ることも可能である。In the figure, 1 is a parent processor element, 2-i
(1 ≦ i ≦ k) is a child processor element, and 4 is a communication network. Here, one of the child processor elements 2
It is also possible to adopt a configuration in which -i functions as the parent processor element 1.

【００１２】この親プロセッサエレメント１は、乱数初
期値生成手段１０と、乱数初期値分配手段１１とを備え
る。この乱数初期値生成手段１０は、ｐ×ｖ×ｋ個の乱
数の初期値を生成する。この乱数初期値の生成方法は、
混合乗算法によることも可能であるし、これから説明す
るＭ系列の生成方法によることも可能であるし、この２
つの方法を合わせて用いることも可能である。生成され
る乱数初期値は、すべて０でなければ基本的には何でも
よい。乱数初期値分配手段１１は、乱数初期値生成手段
１０の生成する乱数初期値を、通信網４を介して子プロ
セッサエレメント２-iに対して重複しないようにｐ×ｖ
個ずつ分配する。The parent processor element 1 comprises a random number initial value generating means 10 and a random number initial value distributing means 11. The random number initial value generating means 10 generates initial values of p × v × k random numbers. This random number initial value generation method is
It is possible to use the mixed multiplication method or the M sequence generation method described below.
It is also possible to use the two methods in combination. The generated random number initial value may be basically any value as long as it is not all zero. The random number initial value distribution unit 11 is p × v so that the random number initial value generated by the random number initial value generation unit 10 does not overlap with the child processor element 2-i via the communication network 4.
Distribute each one.

【００１３】ここで、ｐはＭ系列（Tausworthe数列）の
乱数列の生成を規定する原始既約多項式「Ｘ^p＋Ｘ^q＋
１」のパラメータ、ｖは２の巾乗の数値として定義され
て、子プロセッサエレメント２-iのベクトル演算機構の
ベクトル長をα、ｑを上記の原始既約多項式のパラメー
タで表すならば、好ましくは、「ｑ×ｖ＞α」か「（ｐ
−ｑ）×ｖ＞α」の条件を充足する数値、ｋは乱数を生
成する子プロセッサエレメント２-iの個数である。子プ
ロセッサエレメント２-iがベクトル演算機構を持たない
場合には、ｖの値は１に設定されることになる。また、
ｖは、乱数初期値の生成効率を高めるために、上述の条
件の充足の内の最も小さな値であることが好ましい。Here, p is a primitive irreducible polynomial "X ^p + X ^q +" that defines the generation of a random number sequence of an M sequence (Tausworthe number sequence).
1 ", v is defined as a power of 2, and the vector length of the vector operation mechanism of the child processor element 2-i is represented by α and q by the above primitive irreducible polynomial parameters. Is “q × v> α” or “(p
−q) × v> α ”, and k is the number of child processor elements 2-i that generate random numbers. If the child processor element 2-i does not have a vector operation mechanism, the value of v will be set to 1. Also,
It is preferable that v is the smallest value satisfying the above conditions in order to increase the generation efficiency of the random number initial value.

【００１４】一方、子プロセッサエレメント２-iは、乱
数初期値受信手段２０-iと、乱数生成手段２１-iとを備
える。この乱数初期値受信手段２０-iは、親プロセッサ
エレメント１より分配されてくる自エレメント宛の乱数
初期値を受信する。乱数生成手段２１-iは、乱数初期値
受信手段２０-iの受信した乱数初期値を用いて、好まし
くは、「ｑ×ｖ＞α」の条件充足のパラメータｖが用い
られるときには、乱数値Ａ_n-pvと乱数値Ａ_n-qvとのビッ
ト対応の論理演算、「（ｐ−ｑ）×ｖ＞α」の条件充足
のパラメータｖが用いられるときには、乱数値Ａ_n-pvと
乱数値Ａ_n-pv+q _vとのビット対応の論理演算に従って、
新たな乱数値Ａ_n（ｎ≧ｐ×ｖ＋１）を生成する。On the other hand, the child processor element 2-i comprises a random number initial value receiving means 20-i and a random number generating means 21-i. The random number initial value receiving means 20-i receives the random number initial value addressed to its own element distributed from the parent processor element 1. The random number generating means 21-i uses the random number initial value received by the random number initial value receiving means 20-i, preferably when the parameter v satisfying the condition of “q × v> α” is used, the random number value A _When a parameter v that satisfies the condition of “(p−q) × v> α”, which is a logical operation corresponding to a bit between _n-pv and the random number A _n-qv , is used, the random number A _n-pv and the random number A _According to the bitwise logical operation with _{n-pv + q} _v ,
A new random number value A _n (n ≧ p × v + 1) is generated.

【００１５】ここで、好ましくは、「ｑ＞（ｐ−ｑ）」
が成立するときに、「ｑ×ｖ＞α」の条件充足が要求さ
れ、「ｑ＜（ｐ−ｑ）」が成立するときに、「（ｐ−
ｑ）×ｖ＞α」の条件充足が要求される。そして、乱数
生成手段２１-iは、好ましくは、ビット対応の論理演算
として排他的論理和演算を用いる。Here, it is preferable that "q>(p-q)".
Is satisfied, the condition satisfaction of “q × v> α” is required, and when “q <(p−q)” is satisfied, “(p−
q) × v> α ”is satisfied. Then, the random number generation means 21-i preferably uses an exclusive OR operation as the bit-based logical operation.

【００１６】図２に原理構成を示す本発明は、図１の原
理構成で用いていた通信網４の代わりに、親プロセッサ
エレメント１及び子プロセッサエレメント２-iが共にア
クセスできる共有メモリ５を通信機構として用いてい
る。In the present invention whose principle configuration is shown in FIG. 2, instead of the communication network 4 used in the principle configuration of FIG. 1, a shared memory 5 that can be accessed by both the parent processor element 1 and the child processor element 2-i is communicated. It is used as a mechanism.

【００１７】この原理構成に従う場合には、親プロセッ
サエレメント１は、図１で説明した乱数初期値生成手段
１０と、新たな乱数初期値書込手段１２とを備え、一
方、子プロセッサエレメント２-iは、図１で説明した乱
数生成手段２１-iと、新たな乱数初期値読込手段２２-i
とを備える。In accordance with this principle configuration, the parent processor element 1 comprises the random number initial value generating means 10 and the new random number initial value writing means 12 described in FIG. 1, while the child processor element 2- i is the random number generating means 21-i described in FIG. 1 and the new random number initial value reading means 22-i.
With.

【００１８】図１及び図２に原理構成を示す本発明で
は、親プロセッサエレメント１が乱数初期値を生成する
という構成を採るのに対して、図３に原理構成を示す本
発明は、乱数生成を要求される各プロセッサエレメント
自身が乱数初期値を生成していくという構成を採るもの
である。図中、３-i（１≦ｉ≦ｋ）は、乱数生成を要求
されるプロセッサエレメントである。In the present invention whose principle configuration is shown in FIGS. 1 and 2, the parent processor element 1 generates a random number initial value, whereas the present invention whose principle configuration is shown in FIG. Each processor element required to generate a random number initial value is configured to generate a random number initial value. In the figure, 3-i (1≤i≤k) is a processor element for which random number generation is required.

【００１９】この構成を採るときには、各プロセッサエ
レメント３-iは、図１で説明した乱数初期値生成手段１
０と同一の機能を発揮する乱数初期値生成手段３０-i
と、新たな乱数初期値抽出手段３１-iと、図１で説明し
た乱数生成手段２１-iと同一の機能を発揮する乱数生成
手段３２-iとを備える。When adopting this configuration, each processor element 3-i has the random number initial value generating means 1 described in FIG.
Random number initial value generation means 30-i exhibiting the same function as 0
And a new random number initial value extraction means 31-i and a random number generation means 32-i that exhibits the same function as the random number generation means 21-i described in FIG.

【００２０】[0020]

【作用】最初に、図１に原理構成を示す本発明の作用に
ついて説明する。図１に原理構成を示す本発明にあっ
て、親プロセッサエレメント１の乱数初期値生成手段１
０は、ｐ×ｖ×ｋ個の乱数の初期値を生成し、乱数初期
値分配手段１１は、各子プロセッサエレメント２-iに対
して、この生成された乱数初期値をｐ×ｖ個ずつ重複し
ないように分配する。すなわち、順番通りとかいったよ
うな所定の規則性に従って、重複しないように分配して
いくのである。そして、各子プロセッサエレメント２-i
の乱数初期値受信手段２０-iは、親プロセッサエレメン
ト１より分配されてくる自エレメント宛の乱数初期値を
受信し、乱数生成手段２１-iは、この受信した乱数初期
値を用いて乱数列を生成する。First, the operation of the present invention whose principle configuration is shown in FIG. 1 will be described. In the present invention whose principle configuration is shown in FIG. 1, random number initial value generating means 1 of the parent processor element 1
0 generates p × v × k random number initial values, and the random number initial value distribution unit 11 generates p × v random number initial values for each child processor element 2-i. Distribute so that they do not overlap. That is, according to a predetermined regularity such as the order, the distribution is performed so as not to overlap. Then, each child processor element 2-i
The random number initial value receiving means 20-i receives the random number initial value addressed to the self element distributed from the parent processor element 1, and the random number generating means 21-i uses the received random number initial value to generate a random number sequence. To generate.

【００２１】この乱数生成手段２１-iの乱数生成処理
は、Ｍ系列の乱数列の生成方法に従って実行されること
になる。Ｍ系列の乱数列生成方法では、Ａ₁〜Ａ_pのｐ
個の乱数初期値が与えられるときに、ビット対応の論理
演算として排他的論理和演算を用いるならば、２を法と
する原始既約多項式「Ｘ^p＋Ｘ^q＋１」から導出される
下記の漸化式Ａ_n＝ＥＯＲ（Ａ_n-p，Ａ_n-q）・・・・・式に従って、（ｐ＋１）項以降の乱数列を生成していく。
そして、この原始既約多項式「Ｘ^p＋Ｘ^q＋１」は「Ｘ
^p＋Ｘ^(p-q)＋１」と等価であることが証明されている
ことから、あるいは、この原始既約多項式「Ｘ^p＋Ｘ
^(p-q)＋１」から導出される下記の漸化式Ａ_n＝ＥＯＲ（Ａ_n-p，Ａ_n-p+q）・・・・式に従って、（ｐ＋１）項以降の乱数列を生成していく。The random number generation processing of the random number generation means 21-i is executed according to the method of generating the M-sequence random number sequence. In the M-sequence random number sequence generation method, _p of A _{1 to} A _p is used.
If exclusive OR operations are used as the bit-corresponding logical operations when the random number initial values are given, the following graduation derived from the primitive irreducible polynomial “X ^p + X ^q +1” modulo 2 is used. Formula A _n = EOR (A _np , A _nq ) ... The random number sequence after the (p + 1) term is generated according to the formula.
Then, this primitive irreducible polynomial "X ^p + X ^q +1" becomes "X
^p + X ^(pq) +1 ”, or because this primitive irreducible polynomial“ X ^p + X
^(pq) +1 ”, the following recurrence formula A _n = EOR (A _np , A _{n-p + q} ) ... The random number sequence after the (p + 1) term is generated according to the formula.

【００２２】ここで、上述の式に従う場合、Ａ_kとＡ
_p+k-qとからＡ_p+kを生成することから、Ａ₁〜Ａ_pの
ｐ個の乱数初期値を使って一度に生成できる乱数の個数
は、「ｐ＋ｋ−ｑ＝ｐ」の成立する「ｋ＝ｑ」個とな
る。一方、上述の式に従う場合、Ａ_kとＡ_k+qとから
Ａ_p+kを生成することから、Ａ₁〜Ａ_pのｐ個の乱数初
期値を使って一度に生成できる乱数の個数は、「ｋ＋ｑ
＝ｐ」の成立する「ｋ＝（ｐ−ｑ）」個となる。すなわ
ち、乱数生成効率の観点からみて、「ｑ＞（ｐ−ｑ）」
が成立するときには、式に従って乱数を生成していく
ことが好ましく、「（ｐ−ｑ）＜ｑ」が成立するときに
は、式に従って乱数を生成していくことが好ましいこ
とになる。Here, if the above equation is followed, A _k and A
_Since A _{p + k} is generated from _{p + kq} , the number of random numbers that can be generated at one time using p random number initial values of A _{1 to} A _p is “p + k−q = p”. k = q ”. On the other hand, when following the above formula, since A _{p + k} is generated from A _k and A _{k + q} , the number of random numbers that can be generated at one time using p random number initial values of A _{1 to} A _p is , "K + q
= P ”holds, there are“ k = (p−q) ”pieces. That is, “q> (p−q)” from the viewpoint of random number generation efficiency.
When is satisfied, it is preferable to generate random numbers according to the formula, and when “(p−q) <q” is satisfied, it is preferable to generate random numbers according to the formula.

【００２３】このように、乱数生成手段２１-iは、式
の漸化式に従う場合には、一度にｑ個の乱数しか生成で
きない。また、式の漸化式に従う場合には、一度に
（ｐ−ｑ）個の乱数しか生成できない。これでは、ｑや
（ｐ−ｑ）の値が、子プロセッサエレメント２-iのベク
トル演算機構のベクトル長αよりも小さいときには、折
角、ベクトル演算機構がα個の乱数を一度に生成できる
能力を有しているにもかかわらず、それを使用していな
いという問題点がある。従って、高速に乱数を生成でき
ないという問題点がある。As described above, the random number generation means 21-i can generate only q random numbers at a time when the recurrence formula is followed. Further, when the recurrence formula of the formula is followed, only (p−q) random numbers can be generated at one time. With this, when the value of q or (p−q) is smaller than the vector length α of the vector operation mechanism of the child processor element 2-i, the vector operation mechanism has the ability to generate α random numbers at a time. There is a problem in that it does not use it even though it has it. Therefore, there is a problem that random numbers cannot be generated at high speed.

【００２４】そこで、乱数生成手段２１-iは、２を法と
する原始既約多項式「Ｘ^p＋Ｘ^q＋１」が「（Ｘ^p＋Ｘ
^q＋１）^v」や「（Ｘ^p＋Ｘ^(p-q)＋１）^v」と等価で
あることが証明されていることに対応して、Ａ₁〜Ａ_pv
のｐ×ｖ個の乱数初期値が与えられるときに、これらの
原始既約多項式から導出される下記の漸化式Ａ_n＝ＥＯＲ（Ａ_n-pv，Ａ_n-rv）・・・・式但し、ｒ＝ｑあるいは（ｐ−ｑ）ＥＯＲ演算は他の論理演算でも可に従って、（ｐ×ｖ＋１）項以降の乱数列を生成してい
くことが可能であるという点に着目して、Ｍ系列の乱数
列生成方法に従いつつ高速に乱数を発生していく構成を
採る。Therefore, the random number generating means 21-i uses the primitive irreducible polynomial "X ^p + X ^q +1" ^modulo 2 as "(X ^p + X
Corresponding to the fact that it is proved to be equivalent to “ ^q +1) ^v ” and “(X ^p + X ^(pq) +1) ^v ”, A _{1 to} A _pv
The following recurrence formula A _n = EOR (A _n-pv , A _n-rv ) derived from these primitive irreducible polynomials when p × v random initial values of However, it should be noted that r = q or (p−q) EOR operation can generate random number sequence after (p × v + 1) terms according to other logical operations. The configuration is such that a random number is generated at high speed while following the sequence random number sequence generation method.

【００２５】すなわち、乱数生成手段２１-iは、ｒの値
としてｑを使用してこの式の漸化式を用いることで、
上述の式に関係する説明部分から分かるように、Ａ₁
〜Ａ _pvのｐ×ｖ個の乱数初期値から一度にｑ×ｖ個の乱
数を生成でき、以後このｑ×ｖ個を単位として効率的に
乱数を生成できる。一方、ｒの値として（ｐ−ｑ）を使
用してこの式の漸化式を用いることで、上述の式に
関係する説明部分から分かるように、Ａ₁〜Ａ_pvのｐ×
ｖ個の乱数初期値から一度に（ｐ−ｑ）×ｖ個の乱数を
生成でき、以後この（ｐ−ｑ）×ｖ個を単位として効率
的に乱数を生成できる。That is, the random number generation means 21-i uses the value of r.
Using the recurrence of this equation using q as
As can be seen from the description related to the above equation, A₁
~ A _pvQ × v random numbers at a time from p × v random initial values of
A number can be generated, and thereafter, q × v units can be efficiently used as a unit.
Can generate random numbers. On the other hand, (p-q) is used as the value of r.
Using the recurrence formula of this formula,
As you can see from the related explanation, A₁~ A_pvP ×
(p−q) × v random numbers at once from v random initial values
Can be generated, and thereafter, the efficiency in units of (p−q) × v
Can randomly generate random numbers.

【００２６】これから、乱数生成手段２１-iは、好まし
くは、ｑ×ｖがベクトル長αよりも大きくなるようなｖ
を選択して、「ｒ＝ｑ」とする式の漸化式に従い、ベ
クトル長αを生成単位として、（ｐ×ｖ＋１）項以降の
乱数列を生成していくか、（ｐ−ｑ）×ｖがベクトル長
αよりも大きくなるようなｖを選択して、「ｒ＝（ｐ−
ｑ）」とする式の漸化式に従い、ベクトル長αを生成
単位として、（ｐ×ｖ＋１）項以降の乱数列を生成して
いくことで、Ｍ系列の乱数列生成方法に従いつつ高速に
乱数を発生していくよう処理する。From this, the random number generating means 21-i preferably uses v such that q × v is larger than the vector length α.
Is selected, and a random number sequence of (p × v + 1) terms or later is generated with the vector length α as a generation unit according to the recurrence formula of “r = q”, or (p−q) × By selecting v such that v is larger than the vector length α, “r = (p−
q) ”according to a recurrence formula of a vector length α and a random number sequence of (p × v + 1) and subsequent terms is generated, thereby generating a high-speed random number sequence according to the M-sequence random number sequence generation method. Is processed.

【００２７】ここで、ｒとして、ｑを選択するか、（ｐ
−ｑ）を選択するかは、基本的には、いずれでもよいの
であるが、親プロセッサエレメント１の乱数初期値生成
手段１０がｐ×ｖ×ｋ個の乱数の初期値を生成する必要
があることから、ｖの値としては小さい方が適切であ
る。これから、「ｑ＞（ｐ−ｑ）」が成立するときに
は、ｒとしてｑを選択することが好ましく、「（ｐ−
ｑ）＞ｑ」が成立するときには、ｒとして（ｐ−ｑ）を
選択することが好ましい。Here, q is selected as r or (p
Basically, it is possible to select -q), but it is necessary that the random number initial value generating means 10 of the parent processor element 1 generates the initial values of p × v × k random numbers. Therefore, a smaller value of v is more suitable. From this, when "q>(p-q)" holds, it is preferable to select q as r, and "(p-
When “q)> q” holds, it is preferable to select (p−q) as r.

【００２８】また、「ｒ×ｖ＞α」の条件を充足する場
合、ｖとして、いかなる２の巾乗の数値を用いてもよい
のであるが、親プロセッサエレメント１の乱数初期値生
成手段１０がｐ×ｖ×ｋ個の乱数の初期値を生成する必
要があることから、ｖの値としては、同様に、この条件
を充足する数値の内の最も小さな値を示すものを用いる
ことが適切である。When the condition of "r × v>α" is satisfied, any power of 2 may be used as v, but the random number initial value generating means 10 of the parent processor element 1 uses Since it is necessary to generate initial values of p × v × k random numbers, it is appropriate to use the value of v that shows the smallest value among the numerical values that satisfy this condition. is there.

【００２９】そして、更に言うならば、「ｒ×ｖ＞α」
の条件を充足しなくても、ｖが１より大きな値を示すと
きには、ｖが１を示すときよりも、子プロセッサエレメ
ント２-iのベクトル演算機構を有効利用していることに
なるので、高速な乱数発生を実現できる。なお、当初か
ら「ｒ＞α」の条件が充足されるときには、ｖの値とし
ては１が設定されることになる。Further, to put it further, "r × v>α"
Even if the condition of is not satisfied, when v shows a value larger than 1, it means that the vector operation mechanism of the child processor element 2-i is effectively used as compared with the case where v shows 1. Random number generation. When the condition of “r> α” is satisfied from the beginning, 1 is set as the value of v.

【００３０】このようにして、本発明では、並列計算機
システムの各子プロセッサエレメント２-iは、Ｍ系列の
乱数列生成方法に従って乱数列を生成していく構成を採
るものである。In this way, according to the present invention, each child processor element 2-i of the parallel computer system has a configuration for generating a random number sequence in accordance with the M-sequence random number sequence generation method.

【００３１】次に、図２に原理構成を示す本発明の作用
について説明する。図２に原理構成を示す本発明にあっ
て、親プロセッサエレメント１の乱数初期値生成手段１
０は、ｐ×ｖ×ｋ個の乱数の初期値を生成し、乱数初期
値書込手段１２は、この生成されたｐ×ｖ×ｋ個の乱数
初期値を共有メモリ５に書き込む。そして、子プロセッ
サエレメント２-iの乱数初期値読込手段２２-iは、共有
メモリ５に書き込まれたｐ×ｖ×ｋ個の乱数初期値をｐ
×ｖ個ずつ他子プロセッサエレメント２-iと重複しない
ように読み込み、乱数生成手段２１-iは、この読み込ま
れたｐ×ｖ個の乱数初期値を用いてＭ系列の乱数列生成
方法に従って乱数列を生成する。Next, the operation of the present invention whose principle configuration is shown in FIG. 2 will be described. In the present invention whose principle configuration is shown in FIG. 2, the random number initial value generating means 1 of the parent processor element 1
0 generates p × v × k random number initial values, and the random number initial value writing means 12 writes the generated p × v × k random number initial values in the shared memory 5. Then, the random number initial value reading means 22-i of the child processor element 2-i sets p × v × k random number initial values written in the shared memory 5 to p.
The random number generation means 21-i reads x × each so that it does not overlap with the other child processor element 2-i, and the random number generation means 21-i uses the read p × v random number initial values according to the M-sequence random number sequence generation method. Generate a column.

【００３２】このようにして、並列計算機システムの各
子プロセッサエレメント２-iは、Ｍ系列の乱数列生成方
法に従って乱数列を生成していくのである。次に図３に
原理構成を示す本発明の作用について説明する。In this way, each child processor element 2-i of the parallel computer system generates a random number sequence in accordance with the M-sequence random number sequence generation method. Next, the operation of the present invention whose principle configuration is shown in FIG. 3 will be described.

【００３３】図３に原理構成を示す本発明にあって、各
プロセッサエレメント３-iの乱数初期値生成手段３０-i
は、図１の乱数初期値生成手段１０と同様にｐ×ｖ×ｋ
個の乱数の初期値を生成し、乱数初期値抽出手段３１-i
は、生成されたｐ×ｖ×ｋ個の乱数初期値の中から、自
エレメントに割り付けられるｐ×ｖ個の乱数初期値を他
プロセッサエレメント３-iと重複しないように抽出し、
乱数生成手段３２-iは、この抽出された乱数初期値を用
いてＭ系列の乱数列生成方法に従って乱数列を生成す
る。In the present invention whose principle configuration is shown in FIG. 3, random number initial value generating means 30-i of each processor element 3-i is provided.
Is p × v × k similarly to the random number initial value generating means 10 of FIG.
Initial value of each random number is generated, and the random number initial value extraction means 31-i
Extracts p × v random number initial values assigned to its own element from the generated p × v × k random number initial values so as not to overlap with other processor elements 3-i,
The random number generation means 32-i uses the extracted random number initial value to generate a random number sequence according to the M-sequence random number sequence generation method.

【００３４】このようにして、並列計算機システムの各
プロセッサエレメント３-iは、Ｍ系列の乱数列生成方法
に従って乱数列を生成していくのである。この構成に従
うと、プロセッサエレメント３-i間で乱数生成の処理の
ための通信処理を実行しなくても済むことになる。した
がって本発明のプロセッサは，共有メモリでも，通信網
でもどちらでも良い。In this way, each processor element 3-i of the parallel computer system generates a random number sequence in accordance with the M-sequence random number sequence generation method. According to this configuration, it is not necessary to execute the communication process for the random number generation process between the processor elements 3-i. Therefore, the processor of the present invention may be either a shared memory or a communication network.

【００３５】[0035]

【実施例】以下、実施例に従って本発明を詳細に説明す
る。次に、図１に示した実施例のより具体的な実施例に
従って、本発明を詳細に説明する。EXAMPLES The present invention will be described in detail below with reference to examples. Next, the present invention will be described in detail according to a more specific embodiment of the embodiment shown in FIG.

【００３６】本発明は、図１ないし図３で説明したよう
に、あらゆるタイプの並列計算機システムに対して適用
可能である。例えば、図４（ａ）に示すような通信網を
介して接続される構成を採る分散メモリ型の並列計算機
システムに対しても適用可能であり、また、図４（ｂ）
に示すようなメモリを共有する構成を採る共有メモリ型
の並列計算機システムに対しても可能であり、また、図
４（ｃ）に示すような共有メモリを保有しつつ通信網を
介して接続される構成を採るハイブリッド型の並列計算
機システムに対しても適用可能であるのである。ここ
で、図中のＰＥはプロセッサエレメントを表している。The present invention can be applied to all types of parallel computer systems as described with reference to FIGS. For example, the present invention is also applicable to a distributed memory type parallel computer system having a configuration connected through a communication network as shown in FIG. 4 (a), and FIG. 4 (b).
It is also possible for a shared memory type parallel computer system to adopt a configuration of sharing a memory as shown in FIG. 4 and is connected through a communication network while having a shared memory as shown in FIG. It can also be applied to a hybrid parallel computer system that adopts the above configuration. Here, PE in the figure represents a processor element.

【００３７】図５に、図１で説明した親プロセッサエレ
メント１及び子プロセッサエレメント２-iの実行する処
理の全体の流れを図示する。ここで、図中の左側部分が
親プロセッサエレメント１の実行する処理であり、右側
部分が子プロセッサエレメント２-iの実行する処理であ
る。FIG. 5 shows an overall flow of processing executed by the parent processor element 1 and the child processor element 2-i described in FIG. Here, the left part in the figure is the process executed by the parent processor element 1, and the right part is the process executed by the child processor element 2-i.

【００３８】この図に示すように、親プロセッサエレメ
ント１は、先ず最初に、ステップ１で、子プロセッサエ
レメント２-iで発生する乱数列の種となる乱数初期値を
発生するための前処理を実行し、次に、ステップ２で、
この乱数初期値を発生する。そして、続くステップ３
で、この発生した乱数初期値を通信網等を介して子プロ
セッサエレメント２-iに転送し、最後に、ステップ４
で、後処理を実行して処理を終了する。As shown in this figure, the parent processor element 1 first performs in step 1 a preprocessing for generating a random number initial value which is a seed of a random number sequence generated by the child processor element 2-i. Run, then in step 2,
This random number initial value is generated. And the following step 3
Then, the generated random number initial value is transferred to the child processor element 2-i via a communication network or the like, and finally, in step 4
Then, the post-processing is executed and the processing ends.

【００３９】一方、子プロセッサエレメント２-iは、先
ず最初に、ステップ５で、オペレーティングシステムか
ら、自エレメントのプロセッサ番号情報と、子プロセッ
サエレメント２-iの総台数情報とを入手する。次に、ス
テップ６で、乱数を使用しない処理部分の処理を実行し
てから、ステップ７で、親プロセッサエレメント１から
転送されてくる乱数初期値の中から自エレメント宛に分
配されてくる乱数初期値を受信する。On the other hand, the child processor element 2-i first obtains the processor number information of its own element and the total number information of the child processor elements 2-i from the operating system in step 5. Next, in step 6, the process of the processing part that does not use random numbers is executed, and then in step 7, the random number initial value distributed to the own element from the random number initial value transferred from the parent processor element 1 is executed. Receive a value.

【００４０】そして、続くステップ８で、この受信した
乱数初期値を用いて自エレメントで使用する乱数列を生
成し、ステップ９で、生成した乱数列を使用する処理部
分の処理を実行してから、ステップ１０で、乱数を使用
する要求があるか否かを判断する。このステップ１０の
判断で、乱数の使用要求があるときにはステップ８に戻
り、乱数の使用要求がないときには、ステップ１１に進
んで、乱数を使用しない処理部分の処理を実行してから
処理を終了する。Then, in the following step 8, a random number sequence to be used in its own element is generated using this received random number initial value, and in step 9, the processing of the processing part using the generated random number sequence is executed. In step 10, it is determined whether or not there is a request to use a random number. If it is determined in step 10 that there is a request for using a random number, the process returns to step 8, and if there is no request for using a random number, the process proceeds to step 11 to execute the process of the process part that does not use the random number, and then the process ends. .

【００４１】図６に、親プロセッサエレメント１の実行
する乱数初期値の生成処理の詳細な処理フローを図示す
る。親プロセッサエレメント１は、乱数初期値を生成す
るときには、図６の処理フローに示すように、先ず最初
に、ステップ２０で、変数ｌ_pに“ｐ×ｖ×ｋ”を設定
するとともに、ｑと（ｐ−ｑ）の内の大きい方を求めて
それを新たなｑ（これまではｒと記述してきたもの）と
して設定する。更に、この新たなｑに従って変数ｌ_qに
“ｑ×ｖ×ｋ”を設定するとともに、変数Ａ₁に“４９
９９”を設定する。ここで、上述したように、ｐとｑは
Ｍ系列の乱数値の生成を規定する原始既約多項式「Ｘ^p
＋Ｘ^q＋１」のパラメータ、ｖは２の巾乗の数値として
定義されて、子プロセッサエレメント２-iのベクトル演
算機構のベクトル長をαで表すならば、「ｑ×ｖ＞α」
の条件を充足する数値、ｋは乱数を生成する子プロセッ
サエレメント２-iの個数である。FIG. 6 shows a detailed processing flow of the random number initial value generation processing executed by the parent processor element 1. When generating the random number initial value, the parent processor element 1 first sets “p × v × k” in the variable l _p in step 20 as shown in the processing flow of FIG. The larger one of (p−q) is obtained and set as a new q (what has been described as r so far). Furthermore, according to this new _q , “q × v × k” is set to the variable l _q , and “49” is set to the variable A _1.
99 ". Here, as described above, p and q are primitive irreducible polynomials" X ^p "that define the generation of M-sequence random number values.
+ X ^q +1 ”, v is defined as a power of 2, and if the vector length of the vector operation mechanism of the child processor element 2-i is represented by α, then“ q × v> α ”
Is a numerical value that satisfies the condition of, and k is the number of child processor elements 2-i that generate random numbers.

【００４２】次に、ステップ２１で、Ａ_i＝Ａ_i-1×１３７３７１に従って、Ａ_i（ｉ＝２〜２×ｌ_p）の値を算出する。
この算出処理と“４９９９”に設定されているＡ₁とか
ら、２×ｌ_p個のＡ_i（ｉ＝１〜２×ｌ_p）の値が決定
されることになる。Next, at step 21, the value of A _i (i = 2 to 2 × 1 _p ) is calculated according to A _i = A _i-1 × 137371.
From this calculation process and A ₁ set to “4999”, 2 × l _p values of A _i (i = 1 to 2 × l _p ) are determined.

【００４３】続いて、ステップ２２で、ｉ＝１〜ｌ_pの
範囲にあるＡ_iを１つ選択するとともに、これに対応付
けられるＡ_i+lpを選択する。そして、この選択したＡ_i
の上位１６ビットと、この選択したＡ_i+lpの下位１６ビ
ットとを結合することで、選択したＡ_iを新たなものに
置き換える処理を実行する。すなわち、Ａ_iとＡ_i+lpと
をミックスして、新たにｌ_p個のＡ_i（ｉ＝１〜ｌ_p）
を生成するのである。ここまでの乱数生成方法は混合乗
算法に従っている。Then, in step 22, one A _{i in} the range of i = 1 to l _p is selected and A _{i + lp} associated with this is selected. And this selected A _i
The process of replacing the selected A _i with a new one is performed by combining the upper 16 bits of A _{i + lp with} the lower 16 bits of the selected A _{i + lp} . In other words, to mix the A _i and A _{i + lp,} newly l _p pieces of A _{i (i} = 1~l _p)
Is generated. The random number generation method so far follows the mixed multiplication method.

【００４４】続いて、ステップ２３で、Ｍ系列の乱数生
成方法に従って、Ａ_i＝ＥＯＲ（Ａ_i-lp，Ａ_i-lq）但し、ＥＯＲはビット対応の排他的論理和演算に従って、ステップ２２で生成されたＡ_i（ｉ＝１〜ｌ
_p）を用いて、Ａ₁とＡ _lp+1-lqとのビット対応の排他
的論理和演算に従ってＡ_lp+1を算出し、Ａ₂とＡ
_lp+2-lqとのビット対応の排他的論理和演算に従ってＡ
_lp+2を算出するというように、Ａ_kとＡ_lp+k-lqとのビ
ット対応の排他的論理和演算に従ってＡ_lp+kを算出して
いくことを繰り返すことで、新たに定義されるＡ_i（ｉ
＝ｌ_p＋１〜ｌ_p＋２×ｌ_q）の値を算出する。Then, in step 23, a random number of M series is generated.
According to the method_i= EOR (A_i-lp, A_i-lq) However, EOR is the A generated in step 22 according to the bitwise exclusive OR operation._i(I = 1 to l
_p), A₁And A _{lp + 1-lq}Bitwise exclusion with
A according to logical OR operation_{lp + 1}And calculate A₂And A
_{lp + 2-lq}A according to the exclusive OR operation corresponding to
_{lp + 2}To calculate A_kAnd A_{lp + k-lq}With
According to the exclusive OR operation corresponding to_{lp + k}Calculate
A is newly defined by repeating the process_i(I
= L_p+1 to 1_p＋ 2 × l_q) Value is calculated.

【００４５】この繰り返しの算出処理は、ステップ２２
で生成されたＡ_i（ｉ＝１〜ｌ_p）を用いることで算出
する第１段階としては、ステップ２２で生成されたＡ_i
の最後の数がＡ_lpであることから、ｌ_p＋ｋ−ｌ_q＝ｌ_p の成立する「ｋ＝ｌ_q」まで算出可能である。すなわ
ち、第１段階としては、ステップ２２で生成されたＡ_i
（ｉ＝１〜ｌ_p）を用いて、新たにＡ_i（ｉ＝ｌ_p＋１
〜ｌ_p＋ｌ_q）を生成する。This iterative calculation process is performed in step 22.
As the first stage of calculation by using A _i (i = 1 to l _p ) generated in step 22, A _i generated in step 22
Last number because it is A _lp of can be calculated from "k = l _q" which satisfies the _{_{l p + k-l q =}} l p. That is, in the first stage, A _i generated in step 22
(I = 1 to l _p ) is used to newly create A _i (i = 1 _p +1
~ L _p + l _q ).

【００４６】次に、この新たに生成されたＡ_i（ｉ＝ｌ
_p＋１〜ｌ_p＋ｌ_q）も含めることで算出する第２段階
としては、この第２段階で生成されたＡ_iの最後の数が
Ａ_lp _+lqであることから、ｌ_p＋ｋ−ｌ_q＝ｌ_p＋ｌ_q の成立する「ｋ＝２ｌ_q」まで算出可能である。すなわ
ち、第２段階としては、第１段階で生成されたＡ_i（ｉ
＝ｌ_p＋１〜ｌ_p＋ｌ_q）も含めて、第１段階で生成し
たものに続く、新たなＡ_i（ｉ＝ｌ_p＋ｌ_q＋１〜ｌ_p
＋２×ｌ_q）を生成する。Next, this newly generated A _i (i = 1
_p +1 to l _p + l _q ) is also included in the second stage, and since the last number of A _i generated in this second stage is A _lp _{+ lq} , l _p + k−l _q It is possible to calculate up to “k = 2l _q ” at which = l _p + l _q holds. That is, as the second stage, A _i (i
_{_{= L p + 1~l p + l}} q) be included, followed by those produced in the first stage, the new _{_{A i (i = l p +}} l q + 1~l p
+ 2 × l _q ) is generated.

【００４７】このようにして、このステップ２３では、
ステップ２２で生成されたＡ_i（ｉ＝１〜ｌ_p）から、
新たに定義されるＡ_i（ｉ＝ｌ_p＋１〜ｌ_p＋２×
ｌ_q）の値を算出することで、更にミックスされたＡ_i
が決定されることになる。Thus, in this step 23,
From A _i (i = 1 to l _p ) generated in step 22,
Newly defined as _{_{A i (i = l p +}} 1~l p + 2 ×
By calculating the value of l _q ), the further mixed A _i
Will be decided.

【００４８】後述するように、実際に乱数初期値として
使用されるものは、この決定されたＡ_i（ｉ＝ｌ_p＋１
〜ｌ_p＋２×ｌ_q）の内の後ろからｌ_p個のＡ_i（ｉ＝
２×ｌ_q＋１〜ｌ_p＋２×ｌ_q）である。As will be described later, what is actually used as the random number initial value is the determined A _i (i = l _p +1).
~ L _p + 2 × l _q ) of l _p A _i (i =
2 × l _q +1 to l _p + 2 × l _q ).

【００４９】このようにして、親プロセッサエレメント
１は、図６の処理フローを実行することで乱数初期値を
生成していく。この生成される乱数初期値は、ｋ個の子
プロセッサエレメント２-iに対して、ｐ×ｖ個ずつ一定
の規則に従って重複しないように分配されることにな
る。この分配方法は様々な方法に従うことが可能であっ
て、例えば、生成したｌ_p（＝ｐ×ｖ×ｋ）個の乱数初
期値の先頭から乱数初期値をｐ×ｖ個ずつ取り出してき
て、各子プロセッサエレメント２-iに順番に分配してい
く方法とか、図７に示すように、生成したｌ_p個の乱数
初期値の先頭から乱数初期値を１ずつ取り出してきて、
各子プロセッサエレメント２-iに順番に振り分けていく
方法といったように、様々な分配方法に従うことが可能
である。In this way, the parent processor element 1 generates a random number initial value by executing the processing flow of FIG. The random number initial values thus generated are distributed to the k child processor elements 2-i by p × v so as not to overlap each other according to a certain rule. This distribution method can follow various methods. For example, p × v random number initial values are extracted from the beginning of the generated l _p (= p × v × k) random number initial values, As a method of sequentially distributing to each child processor element 2-i, or as shown in FIG. 7, the random number initial values are taken out one by one from the head of the generated l _p random number initial values,
It is possible to follow various distribution methods such as a method of sequentially distributing to each child processor element 2-i.

【００５０】このようにして、親プロセッサエレメント
１は、子プロセッサエレメント２-iに対して、重複する
ことのないｐ×ｖ個の乱数初期値の組み合わせを分配し
ていく。In this way, the parent processor element 1 distributes to the child processor element 2-i a combination of p × v random number initial values that does not overlap.

【００５１】各子プロセッサエレメント２-iは、図８に
示す処理フローに従って、この親プロセッサエレメント
１から通知される自エレメント宛のｐ×ｖ個の乱数初期
値を用いて、データ処理に必要となる乱数列を生成して
いくことになる。Each child processor element 2-i uses the p * v random number initial value addressed to this element notified from the parent processor element 1 according to the processing flow shown in FIG. Will generate a random number sequence.

【００５２】すなわち、子プロセッサエレメント２-i
は、先ず最初に、ステップ３０で、ｐ，ｑ，ｖ，ｍａの
値をセットする。ここで、図６の処理フローで説明した
ように、ｐは親プロセッサエレメント１が想定した原始
既約多項式「Ｘ^p＋Ｘ^q＋１」のパラメータ、ｑはこの
原始既約多項式のｑと（ｐ−ｑ）の内の大きい方のパラ
メータ、ｖは２の巾乗の数値として定義されて、子プロ
セッサエレメント２-iのベクトル演算機構のベクトル長
をαで表すならば、「ｑ×ｖ＞α」の条件を充足する数
値、ｋは乱数を生成する子プロセッサエレメント２-iの
個数である。また、ｍａは生成する乱数量を規定する自
然数である。That is, the child processor element 2-i
First, in step 30, the values of p, q, v, and ma are set. Here, as described in the processing flow of FIG. 6, p is a parameter of the primitive irreducible polynomial “X ^p + X ^q +1” assumed by the parent processor element 1, and q is q of this primitive irreducible polynomial and (p− The larger parameter of q), v, is defined as a power of 2, and if the vector length of the vector operation mechanism of the child processor element 2-i is represented by α, then “q × v> α”. Is a numerical value that satisfies the condition of, and k is the number of child processor elements 2-i that generate random numbers. Further, ma is a natural number that defines the amount of random numbers to be generated.

【００５３】次に、ステップ３１で、親プロセッサエレ
メント１から通知されるｐ×ｖ個の乱数初期値を受信す
る。以下、説明の便宜上、この受信するｐ×ｖ個の乱数
初期値をＢ_i（ｉ＝１〜ｐ×ｖ）と記述することにす
る。Next, at step 31, p × v random number initial values notified from the parent processor element 1 are received. Hereinafter, for the sake of convenience of explanation, the received p × v random number initial values will be described as B _i (i = 1 to p × v).

【００５４】続いて、ステップ３２で、Ｍ系列の乱数列
生成方法を拡張することで導出される上述の式と同一
の漸化式であるＢ_i＝ＥＯＲ（Ｂ_i-pv，Ｂ_i-qv）・・・・・式に従って、ステップ３１で受信された乱数初期値Ｂ
_i（ｉ＝１〜ｐ×ｖ）を用いて、Ｂ₁とＢ_pv+1-qvとの
ビット対応の排他的論理和演算に従ってＢ_pv+1を算出
し、Ｂ₂とＢ_pV+2-qvとのビット対応の排他的論理和演
算に従ってＢ_pv+2を算出し、この算出処理をＢ_pv+maqv
が求まるまで繰り返していくことで、新たな乱数列Ｂ_i
（ｉ＝ｐ×ｖ＋１〜ｐ×ｖ＋ｍａ×ｑ×ｖ）を生成す
る。Then, in step 32, the recurrence formula is the same as the above formula derived by expanding the method for generating a random number sequence of M sequence. B _i = EOR (B _i-pv , B _i-qv ) ... The random number initial value B received in step 31 according to the equation
_i (i = 1 to p × v) is used to calculate B _{pv + 1} according to the bitwise exclusive OR operation of B ₁ and B _{pv + 1-qv,} and B ₂ and B _{pV + 2-} B _{pv + 2} is calculated according to the bitwise exclusive OR operation with _qv, and this calculation process is performed by B _{pv + maqv}
By repeating the above until a new random number sequence B _i
(I = p × v + 1 to p × v + ma × q × v) is generated.

【００５５】ここで、この乱数列Ｂ_i（ｉ＝ｐ×ｖ＋１
〜ｐ×ｖ＋ｍａ×ｑ×ｖ）の算出処理は、具体的には、
子プロセッサエレメント２-iのベクトル演算機構に合わ
せて、第１段階で、Ｂ_i（ｉ＝ｐ×ｖ＋１〜ｐ×ｖ＋ｑ
×ｖ）を算出し、第２段階で、Ｂ_i（ｉ＝ｐ×ｖ＋１＋
ｑ×ｖ〜ｐ×ｖ＋２×ｑ×ｖ）を算出し、第３段階で、
Ｂ_i（ｉ＝ｐ×ｖ＋１＋２×ｑ×ｖ〜ｐ×ｖ＋３×ｑ×
ｖ）を算出していって、これを第ｍａ段階まで実行して
いくことで行う。Here, this random number sequence B _i (i = p × v + 1
Specifically, the calculation process of (p × v + ma × q × v) is
In accordance with the vector operation mechanism of the child processor element 2-i, at the first stage, B _i (i = p × v + 1 to p × v + q
Xv) is calculated, and in the second step, B _i (i = p × v + 1 +
q × v to p × v + 2 × q × v), and in the third stage,
B _i (i = p × v + 1 + 2 × q × v to p × v + 3 × q ×
This is performed by calculating v) and executing it up to the ma-th stage.

【００５６】このように、式の漸化式を用いること
で、子プロセッサエレメント２-iは、自エレメントのベ
クトル演算機構をフルに利用して乱数列を高速に生成で
きるのである。これが、本発明で、Ｍ系列の乱数列生成
方法を規定する一般的な式や式の漸化式を用いるの
ではなくて、拡張した式の漸化式を用いて乱数列を生
成する理由である。この点については、式の漸化式導
入理由の説明箇所で詳述した。As described above, by using the recurrence formula of the formula, the child processor element 2-i can generate a random number sequence at high speed by fully utilizing the vector operation mechanism of its own element. This is the reason why, in the present invention, the random number sequence is generated using the recurrence formula of the expanded formula, not using the general formula or the recurrence formula of the formula that defines the method for generating the M-sequence random number sequence. is there. This point was explained in detail in the explanation of the reason for introducing the recurrence formula.

【００５７】更に正確に説明するならば、子プロセッサ
エレメント２-iは、図８の処理フローのステップ３２で
は、各段階で、ｑ×ｖ個を単位として乱数列を生成して
いくのではなくて、後述するように、ベクトル演算機構
のベクトル長αを単位として乱数列を生成していくよう
処理することになる。More precisely, in step 32 of the process flow of FIG. 8, the child processor element 2-i does not generate a random number sequence in q × v units at each stage in step 32. Then, as will be described later, the random number sequence is processed in units of the vector length α of the vector operation mechanism.

【００５８】このようにして、本発明では、並列計算機
システムの各子プロセッサエレメント２-iにおいて、ベ
クトル演算機構をフルに利用しつつ、拡張されたＭ系列
の乱数列生成方法に従って乱数列を生成する構成を採る
ことで、長い周期の衝突することのない乱数列を高速に
生成できるようになるのである。As described above, in the present invention, in each child processor element 2-i of the parallel computer system, a random number sequence is generated according to the extended M-sequence random number sequence generation method while fully utilizing the vector operation mechanism. By adopting such a configuration, it becomes possible to generate a random number sequence with a long cycle without collision at high speed.

【００５９】図９ないし図１２に、以上に説明した処理
フローを実現するための詳細なプログラムの一実施例を
図示する。ここで、図９は、親プロセッサエレメント１
の実行する乱数初期値生成処理を実現するためのプログ
ラム例、図１０は、親プロセッサエレメント１の実行す
る乱数初期値転送処理を実現するためのプログラム例、
図１１は、子プロセッサエレメント２-iの実行する乱数
初期値受信処理を実現するためのプログラム例、図１２
は、子プロセッサエレメント２-iの実行する乱数列生成
処理を実現するためのプログラム例である。9 to 12 show an example of a detailed program for realizing the processing flow described above. Here, FIG. 9 shows the parent processor element 1.
10 is a program example for realizing the random number initial value generation process executed by the above, FIG. 10 is a program example for realizing the random number initial value transfer process executed by the parent processor element 1,
FIG. 11 is a program example for realizing the random number initial value receiving process executed by the child processor element 2-i, FIG.
Is an example program for realizing the random number sequence generation process executed by the child processor element 2-i.

【００６０】次に、これらのプログラム内容について説
明する。図６に示した処理フローを実現することになる
図９のプログラムにおいて、部分は、図６の処理フロ
ーのステップ２０に対応するプログラム部分であって、
各種のパラメータ情報の設定を指示するプログラム部分
である。このプログラム部分で、ｋの値が“２¹⁰”、す
なわち、子プロセッサエレメント２-iの個数が１０２４
台であることが設定されるとともに、ｐの値が“２８
４”、ｑの値が“１４３”であることが設定される。Next, the contents of these programs will be described. In the program of FIG. 9 that realizes the processing flow shown in FIG. 6, the portion is the program portion corresponding to step 20 of the processing flow of FIG.
This is a program part for instructing the setting of various parameter information. In this program part, the value of k is “2 ¹⁰ ”, that is, the number of child processor elements 2-i is 1024.
Is set to be a stand, and the value of p is "28
It is set that the values of 4 ”and q are“ 143 ”.

【００６１】そして、このｑの値に従って、ｒ＝ＭＡＸ（ｑ，ｐ−ｑ）＝ｑ＝１４３であることから、このｑの値が新たなｑの値として用い
られることになる。更に、子プロセッサエレメント２-i
の持つベクトル演算機構のベクトル長αが“５１２”で
あることを想定するならば、新たに定義されるｑが、ｑ×ｖ＞５１２を充足する必要があることから、ｖの値として“２²＝
４”が設定されることになる。Since r = MAX (q, p-q) = q = 143 according to the value of q, this value of q is used as a new value of q. Furthermore, the child processor element 2-i
Assuming that the vector length α of the vector operation mechanism of is 512, the newly defined q needs to satisfy q × v> 512. ² =
4 "will be set.

【００６２】更に、この部分では、ｌ_p及びｌ_qの定
義と、「ＩＲＡＮＳＵ」という配列が、４バイトのデー
タ項目からなって、図６の処理フローのステップ２３で
説明したように、最大（ｌ_p＋２×ｌ_q）個のデータ値
を格納するものであることが設定される。そして、Ａ₁
の値を規定する「ＴＡＮＥ」という変数の値が“４９９
９”であることが設定されるとともに、Ａ_iの上位１６
ビットを抽出するために用いられる“ＦＦ００”という
値を持つデータＩＸと、Ａ_i+lpの下位１６ビットを抽出
するために用いられる“００ＦＦ”という値を持つデー
タＩＹとが設定される。Further, in this part, the definitions of l _p and l _q and the array “IRANSU” consist of 4-byte data items, and as described in step 23 of the processing flow of FIG. It is set to store l _p + 2 × l _q ) data values. And A ₁
The value of the variable "TANE" that defines the value of
Is set to 9 "and the top 16 of A _i
Data IX having a value of "FF00" used to extract bits and data IY having a value of "00FF" used to extract lower 16 bits of A _{i + lp} are set.

【００６３】また、図９のプログラムにおいて、部分
は、図６の処理フローのステップ２１に対応するプログ
ラム部分であって、Ａ_i＝Ａ_i-1×１３７３７１に従って、２×ｌ_p個のＡ_i（ｉ＝１〜２×ｌ_p）の値
を算出して、配列ＩＲＡＮＳＵの先頭から順番に格納す
るプログラム部分である。Further, in the program of FIG. 9, the portion is the program portion corresponding to step 21 of the processing flow of FIG. 6, and 2 × l _p A _i are obtained according to A _i = A _i-1 × 137371. calculates the value of the _{(i = 1~2 × l p)} , a program part to be stored in order from the beginning of the array IRANSU.

【００６４】また、部分は、図６の処理フローのステ
ップ２２に対応するプログラム部分であって、データＩ
Ｘと配列ＩＲＡＮＳＵに格納されるＡ_i（ｉ＝１〜
ｌ_p）とのビット対応の論理積演算でもってＡ_iの上位
１６ビットを抽出するとともに、データＩＹと配列ＩＲ
ＡＮＳＵに格納されるＡ_i+lp（ｉ＝１〜ｌ_p）とのビッ
ト対応の論理積演算でもってＡ_i+lpの下位１６ビットを
抽出し、更に、この抽出したＡ_iの上位１６ビットとＡ
_i+lpの下位１６ビットとのビット対応の論理和演算に従
って、新たなｌ_p個のＡ_i（ｉ＝１〜ｌ_p）の値を算出
して、配列ＩＲＡＮＳＵの先頭から順番に格納するプロ
グラム部分である。The portion is the program portion corresponding to step 22 of the processing flow of FIG.
X and A _i (i = 1 to 1) stored in the array IRANSU
The upper 16 bits of A _i are extracted by a logical AND operation corresponding to 1 _p ) and the data IY and array IR.
The lower 16 bits of A _{i + lp} are extracted by a logical AND operation corresponding to A _{i + lp} (i = 1 to l _p ) stored in ANSU, and the upper 16 bits of the extracted A _i are further extracted. And A
A program for calculating new l _p A _i (i = 1 to l _p ) values according to a logical OR operation corresponding to the lower 16 bits of _{i + lp} , and storing the values in order from the beginning of the array IRANSU It is a part.

【００６５】また、部分は、図６の処理フローのステ
ップ２３に対応するプログラム部分であって、配列ＩＲ
ＡＮＳＵに格納されるＡ₁とＡ_lp+1-lqとのビット対応
の排他的論理和演算に従ってＡ_lp+1を算出して、配列Ｉ
ＲＡＮＳＵの“ｌ_p＋１”番目に格納し、配列ＩＲＡＮ
ＳＵに格納されるＡ₂とＡ_lp+2-lqとのビット対応の排
他的論理和演算に従ってＡ_lp+2を算出して、配列ＩＲＡ
ＮＳＵの“ｌ_p＋２”番目に格納していくという処理を
繰り返していって、最後に、配列ＩＲＡＮＳＵに格納さ
れるＡ_2lqとＡ_lp+lqとのビット対応の排他的論理和演
算に従ってＡ_lp _+2lqを算出して、配列ＩＲＡＮＳＵの
“ｌ_p＋２×ｌ_q”番目に格納していくことで、配列Ｉ
ＲＡＮＳＵの“ｌ_p＋１”番目から“ｌ_p＋２×ｌ_q”
番目に、新たな２×ｌ_q個のＡ_i（ｉ＝ｌ_p＋１〜ｌ_p
＋２×ｌ_q）を格納するプログラム部分である。The portion is a program portion corresponding to step 23 of the processing flow of FIG.
An array I is calculated by calculating A _{lp + 1} according to the bitwise exclusive OR operation of A ₁ and A _{lp + 1-} lq stored in ANSU.
Stored in the "l _p +1" th RANSU, array IRAN
The array IRA is calculated by calculating A _{lp + 2} according to the bitwise exclusive OR operation of A ₂ and A _{lp + 2-lq} stored in SU.
The process of storing "l _p +2" th of NSU is repeated, and finally, according to the bitwise exclusive OR operation of A _2lq and A _{lp +} _lq stored in the array IRANSU, A _lp _{+ 2LQ} calculates the, by going to store the _{_{"l p + 2 × l q}} " th sequence IRANSU, SEQ I
"L _p +1" to "l _p + 2 × l _q " of RANSU
Th, the new 2 × l _q pieces of _{_{A i (i = l p +}} 1~l p
+ 2 × l _q ) is the program part that stores it.

【００６６】以上に説明した図９のプログラム内容から
分かるように、親プロセッサエレメント１は、この図９
のプログラムを実行することで、混合乗算法とＭ系列と
による生成方法に従いつつ乱数初期値を生成することに
なる。この親プロセッサエレメント１により生成された
乱数初期値は、図１０及び図１１に示すプログラムに従
って、子プロセッサエレメント２-iにｐ×ｖ個ずつ規則
的に分配される。ここで、この図１０及び図１１の分配
プログラムは、親プロセッサエレメント１が、全子プロ
セッサエレメント２-iに対して生成した乱数初期値を整
列させて通知する構成を採って、子プロセッサエレメン
ト２-iの側で、自エレメント宛に分配される乱数初期値
を一定の規則に従って選択的に受信していく方法を採っ
ている。As can be seen from the program contents of FIG. 9 explained above, the parent processor element 1 is
By executing the program (1), the random number initial value is generated while following the generation method based on the mixed multiplication method and the M series. The random number initial value generated by the parent processor element 1 is regularly distributed to the child processor elements 2-i by p × v in accordance with the programs shown in FIGS. Here, the distribution program of FIGS. 10 and 11 has a configuration in which the parent processor element 1 arranges and notifies the generated random number initial values to all the child processor elements 2-i. On the -i side, a method of selectively receiving the random number initial value distributed to its own element according to a certain rule is adopted.

【００６７】図１０のプログラムは、親プロセッサエレ
メント１により実行されるものであって、生成された配
列ＩＲＡＮＳＵの中の指定の乱数初期値を、全子プロセ
ッサエレメント２-iに転送していくことを指示するプロ
グラム例である。転送処理を実行するサブルーチンのＳ
ＥＮＤ命令の第１引数は、転送する配列の先頭アドレス
を表示し、第２引数は、この先頭アドレスから転送する
バイト数を表示する。The program of FIG. 10 is executed by the parent processor element 1, and transfers the designated random number initial value in the generated array IRANSU to all the child processor elements 2-i. It is an example of a program for instructing. S of the subroutine that executes the transfer process
The first argument of the END instruction displays the start address of the array to be transferred, and the second argument displays the number of bytes to transfer from this start address.

【００６８】このプログラムでは、配列ＩＲＡＮＳＵの
“２×ｌ_q＋１”番目の乱数初期値を先頭として、その
先頭の乱数初期値からｌ_p個の乱数初期値（１個の乱数
初期値のデータ長は４バイト）を全子プロセッサエレメ
ント２-iに転送していく例を開示してある。すなわち、
親プロセッサエレメント１は、このプログラムに従っ
て、図９のプログラムにより生成された乱数初期値Ａ_i
（ｉ＝ｌ_p＋１〜ｌ_p＋２×ｌ_q）の内の、後ろからｌ
_p個の乱数初期値をＡ_iを全子プロセッサエレメント２
-iに転送していくのである。In this program, the "2 × l _q +1" th random number initial value of the array IRANSU is set as the head, and l _p random number initial values (data length of one random number initial value are 4 bytes) to all the child processor elements 2-i. That is,
According to this program, the parent processor element 1 generates the random number initial value A _i generated by the program of FIG.
_{_{(I = l p + 1~l p}} + 2 × l q) of the, l from behind
Initialize _p random numbers to A _i for all child processor elements 2
-Transfer to i.

【００６９】一方、図８に示した処理フローのステップ
３１の処理を実現することになる図１１のプログラム
は、子プロセッサエレメント２-iにより実行されるもの
であって、親プロセッサエレメント１から転送されてく
る乱数初期値の中から、自エレメント宛に分配される乱
数初期値を選択的に受信していくことを指示するプログ
ラム例である。On the other hand, the program of FIG. 11 which realizes the processing of step 31 of the processing flow shown in FIG. 8 is executed by the child processor element 2-i and transferred from the parent processor element 1. This is an example of a program for instructing to selectively receive a random number initial value distributed to its own element from among the random number initial values that are received.

【００７０】ここで、受信処理を実行する部分のサブ
ルーチンのＲＥＣＶ命令の第１引数は、受信する乱数初
期値の格納先となる配列の先頭アドレスを表示する。第
２引数は、Ｘ方向（図７のプロセッサ番号方向）におけ
る受信開始部分までのバイト数を表示する。第３引数
は、Ｘ方向における前の受信部分から次の受信部分まで
のバイト数を表示する。第４引数は、Ｘ方向における受
信部分単位の持つバイト数を表示する。例えば、４バイ
トが受信部分の一単位であることを表示する。第５引数
は、Ｘ方向における受信部分の総バイト数を表示する。
例えば、受信部分の総単位数が一単位であるときには、
４バイトを表示する。Here, the first argument of the RECV instruction of the subroutine of the part that executes the receiving process displays the starting address of the array where the received random number initial value is stored. The second argument indicates the number of bytes up to the reception start portion in the X direction (processor number direction in FIG. 7). The third argument displays the number of bytes from the previous received portion to the next received portion in the X direction. The fourth argument indicates the number of bytes of the reception partial unit in the X direction. For example, it indicates that 4 bytes are one unit of the reception part. The fifth argument indicates the total number of bytes of the reception part in the X direction.
For example, when the total number of units of the receiving part is one unit,
Display 4 bytes.

【００７１】また、第６引数は、Ｙ方向（図７のもう一
方の方向）における受信開始部分のライン数を表示す
る。第７引数は、Ｙ方向における前の受信部分から次の
受信部分までのライン数を表示する。０であるときに
は、連続するラインであることを表示する。第８引数
は、Ｙ方向における受信部分単位の持つライン数を表示
する。１であるときには、受信部分が１つのラインに存
在することを表示する。第９変数は、Ｙ方向における受
信部分の総ライン数を表示する。例えば、ｐ×ｖ個を受
信するときにはｐ×ｖを表示する。The sixth argument displays the number of lines of the reception start portion in the Y direction (the other direction in FIG. 7). The seventh argument displays the number of lines from the previous reception portion to the next reception portion in the Y direction. When it is 0, it indicates that the lines are continuous. The eighth argument displays the number of lines that the receiving partial unit has in the Y direction. When it is 1, it indicates that the received portion is on one line. The ninth variable indicates the total number of lines in the reception part in the Y direction. For example, when receiving p × v, p × v is displayed.

【００７２】ここで、この図１１のプログラムは、汎用
的な構成を採っているので複雑なものになっているが、
専用的な構成を採る場合には、更に簡単なもので実現で
きることになる。Here, the program of FIG. 11 is complicated because it has a general-purpose configuration.
If a dedicated configuration is adopted, it can be realized with a simpler one.

【００７３】このプログラムでは、部分で、パラメー
タ情報と、受信するｐ×ｖ個の乱数初期値を格納するこ
とになる配列ＩＲとを定義した後、部分で自プロセッ
サエレメント番号ｎｃｉｄを入手し、部分で総子プロ
セッサエレメント個数ｋを入手する。そして、部分の
ＲＥＣＶ命令で、自プロセッサエレメント番号ｎｃｉｄ
の指定する先頭の乱数初期値から、ｋ個飛ばしでもって
乱数初期値を順次ｐ×ｖ個まで受信していく例を開示し
てある。このプログラムにより、各子プロセッサエレメ
ント２-iは、図７に示す態様に従って、親プロセッサエ
レメント１から転送されてくる乱数初期値を受信してい
くことになる。In this program, after defining the parameter information and the array IR that will store p × v random number initial values to be received in the part, the part obtains its own processor element number ncid, Then, the total number k of child processor elements is obtained. Then, in the RECV instruction of the part, the own processor element number ncid
An example is disclosed in which, from the initial random number initial value specified by, the random number initial values are sequentially received up to p × v by skipping k. By this program, each child processor element 2-i receives the random number initial value transferred from the parent processor element 1 according to the aspect shown in FIG.

【００７４】図８に示した処理フローのステップ３２の
処理を実現することになる図１２のプログラムにおい
て、部分は、各種のパラメータ情報の設定を指示する
プログラム部分である。このプログラム部分では、ｍａ
の値として“３”が設定されるとともに、図９のプログ
ラム例に対応して、ｐの値が“２８４”、ｑの値が“１
４３”、ｖの値が“４”であることを設定される。更
に、受信したｐ×ｖ個の乱数初期値を格納している配列
ＩＲと、生成する５１２×ｍａ個の乱数列を格納するこ
とになる配列ＪＲとが定義される。ここで、ベクトル演
算の可能になる最大ベクトル長は、この場合ｑ×ｖ個、
すなわち５７２個となるが、ベクトル演算機構のベクト
ル長αが５１２に最適化されていることに対応して、配
列ＪＲの最大格納個数は５１２×ｍａ個で定義されるこ
とになる。In the program of FIG. 12 that realizes the process of step 32 of the process flow shown in FIG. 8, the part is a program part for instructing the setting of various parameter information. In this program part, ma
"3" is set as the value of p, and the value of p is "284" and the value of q is "1" corresponding to the program example of FIG.
It is set that the values of 43 "and v are" 4 ". Furthermore, the array IR storing the received p * v random number initial values and the 512 * ma random number sequence to be generated are stored. An array JR is defined as follows, where the maximum vector length that allows vector operation is q × v in this case,
That is, the number is 572, but the maximum storage number of the array JR is defined by 512 × ma, corresponding to the fact that the vector length α of the vector operation mechanism is optimized to 512.

【００７５】図８に示した処理フローのステップ３２の
処理を実現することになるプログラム部分は、部分で
ある。このプログラム部分により、乱数列が生成される
ことになる。The part of the program that realizes the process of step 32 of the process flow shown in FIG. 8 is the part. A random number sequence is generated by this program part.

【００７６】ここで、この乱数列の算出処理は、「Ｂ_i
＝ＥＯＲ（Ｂ_i-pv，Ｂ_i-qv）」を使って、具体的には、
子プロセッサエレメント２-iのベクトル演算機構に合わ
せて、第１段階で、「ｉ＝ｐ×ｖ＋１〜ｐ×ｖ＋５１
２」個を算出し、第２段階で、「ｉ＝ｐ×ｖ＋１＋５１
２〜ｐ×ｖ＋２×５１２」を算出し、第３段階で、「ｉ
＝ｐ×ｖ＋１＋２×５１２〜ｐ×ｖ＋３×５１２」を算
出していって、これを第ｍａ段階まで実行していくとい
うように、ベクトル演算機構のベクトル長α（この場合
は、５１２）を単位として実行していくことになる。Here, the calculation process of this random number sequence is "B _i
= EOR (B _i-pv , B _i-qv ) ”, specifically,
In accordance with the vector operation mechanism of the child processor element 2-i, at the first stage, “i = p × v + 1 to p × v + 51
2 ”are calculated, and in the second stage,“ i = p × v + 1 + 51
2−p × v + 2 × 512 ”is calculated, and“ i
= P × v + 1 + 2 × 5122−p × v + 3 × 512 ”, and this is executed up to the ma-th stage, and the vector length α (512 in this case) of the vector operation mechanism is used as a unit. Will be executed as.

【００７７】なお、部分のプログラムは、生成された
乱数値を右方向に１ビットシフトしていくことで、符号
を表示することになる先頭ビットの値を常に“０”、す
なわち正を表示するように設定していくために設けられ
るものである。また、“０”から“１”の乱数を使用す
る場合には、生成された乱数列を格納する配列ＪＲの要
素を２³¹で割っていく処理が入ることになる。Note that the program of the part always shifts the generated random number value to the right by 1 bit so that the value of the leading bit for displaying the code is always "0", that is, positive. It is provided to set like this. Further, when using a random number from "0" to "1", a process of dividing the element of the array JR that stores the generated random number sequence by 2 ³¹ is required.

【００７８】このようにして、図９ないし図１２に示し
たプログラム例に従って、本発明の乱数列生成処理が実
現できるのである。In this way, the random number sequence generation processing of the present invention can be realized according to the program examples shown in FIGS.

【００７９】[0079]

【発明の効果】以上説明したように、本発明によれば、
並列計算機システムの各プロセッサエレメントが、Ｍ系
列の乱数列生成方法に従って乱数列を生成していく構成
を採ることから、長い周期の乱数列を生成できるように
なるとともに、混合乗算法のような困難性を伴うことな
く、衝突することのない乱数列を生成できるようにな
る。そして、このＭ系列の乱数列生成方法を適用する
にあたって、ベクトル演算機構を最大に利用する構成を
採ることから、各プロセッサエレメントは、乱数列を高
速に生成できるようになる。しかも、プロセッサエレメ
ント間の通信処理が乱数初期値のデータ転送以外に要求
されることがないし、この乱数初期値のデータ転送も必
要としない構成も採れることから、この高速性を確実な
ものとできるのである。As described above, according to the present invention,
Since each processor element of the parallel computer system is configured to generate a random number sequence in accordance with the M-sequence random number sequence generation method, it becomes possible to generate a random number sequence with a long period, and it is difficult to perform a mixed multiplication method. It becomes possible to generate a random number sequence that does not collide with the character. Then, in applying this M-sequence random number sequence generation method, since the configuration that makes maximum use of the vector operation mechanism is adopted, each processor element can generate a random number sequence at high speed. Moreover, since the communication process between the processor elements is not required other than the data transfer of the random number initial value, and the configuration that does not require the data transfer of the random number initial value is adopted, the high speed can be ensured. Of.

[Brief description of drawings]

【図１】本発明の原理構成図である。FIG. 1 is a principle configuration diagram of the present invention.

【図２】本発明の原理構成図である。FIG. 2 is a principle configuration diagram of the present invention.

【図３】本発明の原理構成図である。FIG. 3 is a principle configuration diagram of the present invention.

【図４】本発明の適用可能な並列計算機システムの説明
図である。FIG. 4 is an explanatory diagram of a parallel computer system to which the present invention is applicable.

【図５】本発明の処理の全体を説明する処理フローであ
る。FIG. 5 is a processing flow for explaining the overall processing of the present invention.

【図６】乱数初期値の生成処理の処理フローの一実施例
である。FIG. 6 is an example of a processing flow of random number initial value generation processing.

【図７】乱数初期値の分配方法の一実施例である。FIG. 7 is an example of a method of distributing a random number initial value.

【図８】乱数列生成の処理フローの一実施例である。FIG. 8 is an example of a processing flow of random number sequence generation.

【図９】親プロセッサエレメントの実行する乱数初期値
生成のためのコーディング例である。FIG. 9 is a coding example for generating a random number initial value executed by a parent processor element.

【図１０】親プロセッサエレメントの実行する乱数初期
値転送のためのコーディング例である。FIG. 10 is a coding example for random number initial value transfer executed by a parent processor element.

【図１１】子プロセッサエレメントの実行する乱数初期
値受信のためのコーディング例である。FIG. 11 is a coding example for receiving a random number initial value executed by a child processor element.

【図１２】子プロセッサエレメントの実行する乱数列生
成のためのコーディング例である。FIG. 12 is a coding example for generating a random number sequence executed by a child processor element.

【図１３】従来技術の説明図である。FIG. 13 is an explanatory diagram of a conventional technique.

[Explanation of symbols]

１親プロセッサエレメント２子プロセッサエレメント３プロセッサエレメント１０乱数初期値生成手段１１乱数初期値分配手段１２乱数初期値書込手段２０乱数初期値受信手段２１乱数生成手段２２乱数初期値読込手段３０乱数初期値生成手段３１乱数初期値抽出手段３２乱数生成手段 DESCRIPTION OF SYMBOLS 1 parent processor element 2 child processor element 3 processor element 10 random number initial value generating means 11 random number initial value distributing means 12 random number initial value writing means 20 random number initial value receiving means 21 random number generating means 22 random number initial value reading means 30 random number initial value Generating means 31 Random number initial value extracting means 32 Random number generating means

Claims

[Claims]

1. In a parallel computer system composed of a plurality of processor elements, any one of the processor elements generates an initial value of a random number, and a processor element required to generate a random number is an initial value of the generated random number. A random number sequence generation processing method in a parallel computer system, characterized in that a random number sequence is generated according to an M-sequence random number generation method using a random number initial value addressed to its own element in the above.

2. In a parallel computer system composed of a plurality of processor elements, each processor element that is required to generate a random number generates an initial value of a random number according to the same algorithm, and among the generated random number initial values, A random number initial value for its own element is extracted from the element, and the extracted random number initial value is used to generate a new random number sequence according to the M-sequence random number generation method. A random number sequence generation processing method in a computer system.

3. A parallel computer system comprising a plurality of processor elements, wherein any one of the processor elements is p × v × k (p
Is a primitive irreducible polynomial parameter that defines random number generation, v
Is a predetermined numerical value indicating a value of 1 or more, k is the initial value of a random number of the number of processor elements generating a random number), and the processor element required to generate the random number is q Approximate polynomial parameter, r to q
And (p−q), one of the generated p × v × k random number initial values is assigned to its own element by p × v random number initial values, A parallel computer characterized by processing to generate a new random number value A _n (n ≧ p × v + 1) by a logical operation corresponding to a bit of the random number value A _n-pv and the random number value A _n-rv. A random number sequence generation processing method in the system.

4. In a parallel computer system composed of a plurality of processor elements, each processor element has p × v × k pieces (p is a parameter of a primitive irreducible polynomial that defines random number generation, v Is a predetermined number indicating a value of 1 or more,
(k is the number of processor elements that generate the random number), generates an initial value of the random number, extracts p × v of the own element from the generated initial value of the random number, and defines q as the random number generation. If r, which is a parameter of the primitive irreducible polynomial to be defined, is defined as one of q and (p−q), using the extracted p × v random number initial values,
A parallel computer characterized by processing to generate a new random number value A _n (n ≧ p × v + 1) by a logical operation corresponding to a bit of the random number value A _n-pv and the random number value A _n-rv. A random number sequence generation processing method in the system.

5. The random number sequence generation processing method in the parallel computer system according to claim 3 or 4, wherein r is defined as a value showing a large value of q and (p−q), Random Number Sequence Generation Method for Parallel Computer Systems.

6. A random number sequence generation processing method in a parallel computer system according to claim 3, 4 or 5, wherein v represents a vector length of a vector operation mechanism of a processor element required to generate a random number by α, r × v
A random number sequence generation processing method in a parallel computer system characterized in that is selected as showing the smallest value among those showing a value larger than α.