JPS63229522A

JPS63229522A - Parallel sorting process method

Info

Publication number: JPS63229522A
Application number: JP6472587A
Authority: JP
Inventors: Yasuo Yamane; 康男山根
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1987-03-19
Filing date: 1987-03-19
Publication date: 1988-09-26
Anticipated expiration: 2011-06-26
Also published as: JP2509929B2

Abstract

PURPOSE:To extremely decrease the data transfer frequency between processors and to improve the processing efficiency by sending the elements of data allocated evenly and previously by each processor to other corresponding processors with each other excluding the allocation amount of its own when a partial charge range is designated. CONSTITUTION:The processors 1-1-1-m receive information from a control processor 4 to divide the range of the data value into (m) partial ranges in the number equal to the number of processors. Then a partial range to which each allocated element belongs is discriminated and sent to the processor to which said partial range is allocated with only the element allocated to its own left as it is. At the same time, the element sent to its own processor via the similar processes carried out by other processors in parallel with each other is received. When the mutual allocation is ended among elements, each processor sorts its own allocated data. Thus a sorting process of whole data is completed. In such a way, the data sorting process is completed just with a single time of transfer of data at most for each element.

Description

【発明の詳細な説明】〔概　要〕複数の処理装置からなる計算機システムにおける、並列
ソートの処理効率を改善する処理方法である。DETAILED DESCRIPTION OF THE INVENTION [Summary] This is a processing method for improving the processing efficiency of parallel sorting in a computer system consisting of a plurality of processing devices.

ｍ個の処理装置に、データの値の範囲をｍ個に分割した
分担範囲を割り当て、各処理装置が手持ちのソートする
データの要素を、分担範囲に従って相互に分配し、分配
後のデータについて、各個別にソートを実行することに
より、全データのソートを完了する。Allocating a sharing range that divides the data value range into m pieces to m processing devices, distributing the elements of the data to be sorted by each processing device to each other according to the sharing range, and regarding the distributed data, Complete the sorting of all data by performing each sort individually.

この方法により、ソートのための処理装置間のデータの
授受が少なくなり、効率良く処理を実行することができ
る。With this method, data exchange between processing devices for sorting is reduced, and processing can be executed efficiently.

[Industrial application field]

本発明は、複数の処理装置からなる計算機システムにお
ける並列ソートの処理方法に関する。The present invention relates to a parallel sort processing method in a computer system including a plurality of processing devices.

データをその大きさの順に配列するソートは、公知のよ
うに情報処理の各分野で多用される。Sorting, which arranges data in order of size, is widely used in various fields of information processing, as is well known.

[Conventional technology]

第２図は計算機システムの構成例を示すブロック図であ
る。FIG. 2 is a block diagram showing an example of the configuration of a computer system.

処理装置１−１〜１−ｍは、それぞれ記憶装置２−１〜
２−ｍに保持されるデータを読み出して、独立にソート
処理等を実行することのできる装置であり、又ネットワ
ーク３で構成される通信路によって相互に情報を交換す
ることができる。Processing devices 1-1 to 1-m are storage devices 2-1 to 1-m, respectively.
It is a device that can read out data held in 2-m and independently perform sorting processing, etc., and can also exchange information with each other through a communication path constituted by network 3.

制御処理装置４は、ネットワーク３によって処理装置１
−１〜１−ｍ等と通信することにより、システム全般に
関わる所要の制御を行う。The control processing device 4 is connected to the processing device 1 via the network 3.
-1 to 1-m, etc., performs necessary control related to the overall system.

大量のデータのソートを、このような計算機システムの
各処理装置１−１〜１−ｍにデータを分配して並列に処
理すれば、処理を高速化できる可能性があり、そのよう
な並列処理の代表的な方法として、公知のパイトニック
マージ法がある。If a large amount of data is sorted by distributing the data to each processing unit 1-1 to 1-m of such a computer system and processing it in parallel, it is possible to speed up the processing, and such parallel processing A typical method is the known pitonic merge method.

この方法によれば複数の処理装置に分配されているソー
ト対象のデータについて、先ず各２個の処理装置の組ご
とに次のような処理をする。According to this method, data to be sorted that is distributed to a plurality of processing devices is first subjected to the following processing for each pair of two processing devices.

即ち第３図に示すように、２個の処理装置Ａ、Ｂの組で
各処理装置に分配されているデータ（図は各４要素が分
配されている例を示す）を並列に個別処理して、例えば
Ａは昇順、Ｂは降順にソートした後（図の■）、画処理
装置のデータの対応する要素を比較して、例えば小さい
方の要素がＡ、大きい方がＢになるように移動する（図
の■）。That is, as shown in Fig. 3, a set of two processing devices A and B processes the data distributed to each processing device (the figure shows an example in which four elements are distributed each) in parallel and individually. For example, after sorting A in ascending order and B in descending order (■ in the figure), compare the corresponding elements of the image processing device data, and sort the data so that, for example, the smaller element becomes A and the larger element becomes B. Move (■ in the diagram).

そのデータについて、各処理装置で例えば昇順にソート
を行うことにより、処理装置Ａ、Ｂにわたるソートがで
きる（図の■）。By sorting the data in, for example, ascending order in each processing device, it is possible to sort across processing devices A and B (■ in the figure).

以上の処理を、他のすべての処理装置の組でもそれぞれ
の持つデータについて並列に実行していて、次に処理装
置Ａ、Ｂの結果と、別の処理装置Ｃ，Ｄで別のデータに
ついて処理した結果との、各２倍の長さのデータ　（図
の■）について前記と同様の要素間の比較と移動処理を
行い（図の■）、その結果を各２処理装置の組で前記の
ようにソートすることにより、処理装置Ａ−Ｄにわたる
ソートができる。The above processing is executed in parallel on all the other sets of processing devices for their own data, and then the results of processing devices A and B are processed, and the other processing devices C and D process other data. Compare and move the data (■ in the figure) between elements in the same way as above (■ in the figure), and use the results as described above in each pair of two processing devices. By sorting in this way, it is possible to sort across processing devices A to D.

このようにして、次は４処理装置×２、その次は８処理
装置×２というように逐次データ長を延ばして同様の処
理を進めることにより、全データにわたるソートを完了
する。In this manner, the data length is sequentially extended and the same processing is carried out, such as 4 processing units x 2, then 8 processing units x 2, and so on, thereby completing the sorting over all data.

[Problem that the invention seeks to solve]

前記の方式によってソートの並列処理ができるが、前記
の説明から容易に考えられるように、処理装置間でデー
タを比較するための全データ要素の１／２の移動と、比
較結果により必要な場合のデータ移動とが各部分のソー
ト処理ごとに繰り返して必要である。そのために処理装
置間で授受しなければならないデータ量が比較的多（な
り、処理速度を上げられないという問題がある。The above method allows for parallel processing of sorting, but as can be easily considered from the above explanation, it is possible to move 1/2 of all data elements to compare data between processing devices, and if necessary depending on the comparison result. data movement is required repeatedly for each part sorting process. For this reason, the amount of data that must be exchanged between processing devices is relatively large (therefore, there is a problem that the processing speed cannot be increased).

[Means for solving problems]

第１図は、本発明の構成を示す処理の流れ図である。 FIG. 1 is a process flowchart showing the configuration of the present invention.

図は各処理装置の処理の流れを示し、３０〜３２は処理
ステップである。The figure shows the processing flow of each processing device, and 30 to 32 are processing steps.

[For production]

各処理装置は、処理ステップ３０においてデータの値の
範囲を処理装置数に等しいｍ個の分担範囲に分割するた
めの情報を受は取る。Each processing device receives information for dividing the data value range into m sharing ranges equal to the number of processing devices in processing step 30.

処理ステップ３１において、各処理装置に分配されてい
るデータの各要素の属する分担範囲を識別して、その分
担範囲を割り当てられている処理装置へその要素を送り
、自身へ割り当てられている要素のみを残す。又、他の
処理装置で並行して行われる同様の処理によって、自身
宛に送られてくる要素を受は取る。In processing step 31, the allocation range to which each element of data distributed to each processing device belongs is identified, and the element is sent to the processing device to which that allocation range is allocated, and only the elements allocated to itself are identified. leave. It also receives elements sent to itself through similar processing performed in parallel by other processing devices.

要素の相互分配を終わると、処理ステップ３２において
、各処理装置が処理ステップ３１において割り当てられ
たデータをそれぞれソート処理することにより、データ
全体のソートを完了する。After the mutual distribution of the elements is completed, in processing step 32, each processing device sorts the data allocated in processing step 31, thereby completing the sorting of the entire data.

この処理方法により、データの転送は各要素について高
々１回行われるのみでソートを完了することができる。This processing method allows data transfer to be performed at most once for each element to complete the sorting.

〔Example〕

第２図の計算機システムにおいて、ソート対象のデータ
はほぼ均等な要素個数に分割してｍ個の記憶装置２−１
〜２−ｍに予め分配されているものとする。In the computer system shown in FIG. 2, data to be sorted is divided into approximately equal number of elements and stored in m storage devices 2-1.
~2-m shall be distributed in advance.

ｍ個の処理装置１−１〜１−ｍ（以下においてＰＥ、〜
ＰＥ、という）は、例えば制御処理装置（以下において
ＣＰという）４からネットワーク３によって全処理装置
に送られる指示により、それぞれに接続する記憶装置２
−１〜２−ｍからデータを入力してソート処理を開始す
る。m processing devices 1-1 to 1-m (hereinafter referred to as PE, ~
For example, the PE (hereinafter referred to as CP) receives instructions sent from a control processing device (hereinafter referred to as CP) 4 to all the processing devices via the network 3, and the storage device 2 connected to each processing device.
Input data from -1 to 2-m and start sorting processing.

こ＼で、例えばＣＰ４が、分担範囲の境界を示す情報と
して、値の小さい方からｍ−１個の各分担範囲の各上限
値（ａｔ、ａ２、〜ａ＋＊−１）を全処理装置ＰＥ、〜
ＰＥ、へ通知する。Here, for example, CP4 transmits each upper limit value (at, a2, ~a++-1) of m-1 assignment ranges from the smallest value to all processing units PE as information indicating the boundaries of assignment ranges. ,~
Notify PE.

ＰＥ、〜ＰＥ、の各々には１〜ｍの番号を付しておいて
、値の小さい分担範囲から番号順に各処理装置に割り当
てるものとし、各処理装置は上記の上限値から、ｍ個の
分担範囲を、Ｘをデータ要素の値として、次のように認
識する。Each of PE, ~PE, is given a number from 1 to m, and assigned to each processing device in numerical order starting from the smallest value, and each processing device receives m pieces from the above upper limit. The assignment range is recognized as follows, where X is the value of the data element.

ＰＲ，の分担範囲　　Ｘ≦ａ。PR, sharing range X≦a.

ＰＩＥ、の分担範囲　　ａ、＜ｘ≦ａ２ＰＥ、　、の分
担範囲　ａＭ−ｚ＜Ｘ≦ａ＋＋＋−１Ｐ［！ｆｉ　　の
分担範囲　ａ、−、＜ｘこのようにして決定する分担範
囲は、各範囲にほぼ同じ個数の要素が属するように定め
ることが並列処理の効率上から望ましい。PIE, sharing range a,<x≦a2PE, , sharing range aM-z<X≦a+++-1P[! Fi's allocation range a, -, <x It is desirable from the viewpoint of parallel processing efficiency that the allocation ranges determined in this manner be determined so that approximately the same number of elements belong to each range.

従って、例えば各要素のとる値がほぼ均等に分布してい
ることが分かっているような場合には、最初にＰＥｌ−
ＰＥ、からＣＰ４へ、冬毛持ちのデータの要素の最大値
と最小値とを通知し、ＣＰ４はその中から最大値と最小
値とをとることにより、現に処理するデータの値の範囲
を知り、この範囲をほぼｍ等分するように前記の上限値
を算出して、それらの値をＰＲ，〜ＰＨ，へ通知すれば
よい。Therefore, for example, if it is known that the values of each element are almost evenly distributed, first PEl-
PE notifies CP4 of the maximum and minimum values of the elements of the winter hair retention data, and CP4 knows the value range of the data currently being processed by taking the maximum and minimum values from among them. , the upper limit value may be calculated so as to divide this range into approximately m equal parts, and these values may be notified to PR, ~PH,.

しかし要素の値の分布に偏りがある場合を含めて、常に
各分担範囲にほぼ均等に要素が属するようにするために
、例えば以下に述べるようにして、データ中の小さい値
の要素からｒｎｉ／ｍ１番目の各要素を、ｉを１からｍ
−１まで変化して、ｍ〜１個求め、それらを分担範囲の
上限値とすることが望ましい。こ＼で、［ｘ］はＸより
大きい最小の整数（Ｘの小数部を切り上げた整数）を示
し、ｎはデータの要素の総個数である。However, in order to ensure that elements always belong to each sharing range almost equally, even when the distribution of element values is biased, for example, as described below, rni/ For each m1th element, i from 1 to m
It is desirable to change the value to -1, find m~1 pieces, and set them as the upper limit of the sharing range. Here, [x] indicates the smallest integer greater than X (an integer obtained by rounding up the decimal part of X), and n is the total number of data elements.

即ちＰＥ、　〜ＰＥ、は、記憶装置２−１〜２−ｍから
入力したソート対象のデータについて、以下に述べるよ
うにしてＣＰ４と情報を授受することにより、ＣＰ４で
各上限値を順次決定して通知する。That is, the PEs, ~PE, sequentially determine each upper limit value at the CP4 by exchanging information with the CP4 as described below for the data to be sorted inputted from the storage devices 2-1 to 2-m. We will notify you.

第４図は上限値決定処理の流れであり、左側にＣＰ４の
処理、右側にＰＥＩ−ＰＲ，で並行して実行される処理
を示す。FIG. 4 shows the flow of the upper limit value determination process, and the left side shows the process of CP4, and the right side shows the process executed in parallel in PEI-PR.

ＣＰ４は、処理ステップ１０で変数ｊを１に初期設定し
、変数ｋをｒ　ｎ　／　ｍ　１に設定して、ＰＥ、〜Ｐ
Ｅ、の処理を開始させる。CP4 initializes variable j to 1 in processing step 10, sets variable k to r n / m 1, and performs PE, ~P
Start processing E.

ＰＥ、〜ＰＥ、の各処理装置ＰＥ、は、処理ステップ２
０で変数ｊを１に初期設定し、記憶装置２−ｉからそれ
ぞれ入力したデータをソートして、現処理のＳｔとする
。Each processing device PE of PE, ~PE, performs processing step 2.
The variable j is initialized to 1 with 0, and the data inputted from the storage device 2-i are sorted and set as St for the current process.

処理ステップ２１でその中間値旧を求めて、ＣＰ４に通
知する。In processing step 21, the intermediate value old is determined and notified to the CP4.

ＣＰ４は処理ステップ１１で、ＰＥ、〜ＰＥ、から通知
されたｍ個のＭｉから、その中間値Ｍを求めて、Ｐ　Ｅ
　ｌ”　Ｐ　Ｒ、へ通知する。In processing step 11, CP4 calculates the intermediate value M from m Mi notified from PE, ~PE, and calculates the intermediate value M.
l” P R, to be notified.

各ＰＥｉ　は処理ステップ２２で、ＭによってＳｉの要
素を分類して、Ｍより小さい要素の集合を　５１１Ｍに等しい要素の集合を　　ＳｉｚＭより大きい要素の集合を　Ｓｉ３とし、Ｓｉ＋　、Ｓｉｚの要素の各個数ｂＬ、ｂｉｔを
ＣＰ４に通知する。In processing step 22, each PEi classifies the elements of Si according to M, and the set of elements smaller than M is 511, the set of elements equal to M is Siz, the set of elements larger than M is Si3, and Si+, the elements of Siz are Each number bL and bit are notified to CP4.

ＣＰ４は処理ステップ１２で、全処理装置から通知され
たｂｉ、　、　ｂｉｇを集計して合計ｂ１、ｂ２を求め
る。In processing step 12, CP4 totals bi, , big notified from all processing devices to obtain totals b1 and b2.

処理ステップ１３において、ｂｌ、ｂ２とｋを比較し、
例えばに≦ｂ１であれば１、ｂ、＜ｋ≦ｂｌ＋ｂ２であ
れば２　、’　ｂ、　十す、＜　ｋであれば３を全処理
装置に通知する。なお、２の場合には、次の上限値の処
理があれば、変数ｊを＋１し、又、新しいｊの値による
ｒｎｊ／ｍ１−　ｋ　−ｂｚ＋　１を新しいｋの値とし
て設定する。又、３の場合には、変数にの値からｂＩ＋
ｂ２を減じる。In processing step 13, compare bl, b2 and k,
For example, if ≦b1, 1, b; if <k≦bl+b2, 2, 'b, 10s; if <k, 3 is notified to all processing devices. In the case of 2, if there is a next upper limit value process, the variable j is incremented by 1, and rnj/m1-k-bz+1 based on the new value of j is set as the new value of k. In addition, in the case of 3, bI + from the value of the variable
Decrease b2.

各ＰＥｉ　は処理ステップ２３において、１〜３の通知
に従って、１ならＳｉ＋　、３ならＳＩ３を新しいＳｉ
に設定して、処理ステップ２１に戻り、このＳｉにより
中間値を求める処理からやり直す。In processing step 23, each PEi follows the notifications 1 to 3, and if it is 1, then Si+, if it is 3, then SI3 is the new Si
is set, the process returns to step 21, and the process is restarted from the process of calculating the intermediate value using this Si.

２を通知された場合には、そのときのＭの値をｊ番目の
上限値ａ、として記憶し、ｊ＝ｍ−１であれば、上限値
決定の処理終了とする。2, the value of M at that time is stored as the j-th upper limit value a, and if j=m-1, the upper limit value determination process is terminated.

ｊ＜ｍ−１であれば、次の上限値決定のために処理ステ
ップ２４に進んで、元のデータのソート結果のうち現に
ｊ番目の上限値としたＭの値より大きな要素のみを以後
の処理対象として、それらを最初のＳｉに設定し、ｊを
＋１して処理ステップ２１に戻り、新たなｊ番目の上限
値、即ち全データ要素について下からｒｎｊ／ｍ］番目
で、今回の処理対象についてはＣＦ２の処理ステップ１
３で新しいに値に設定したように、下からｒｎｊ／ｌ１
ｌｌ　−ｋ−ｂｚ＋１番目になる値の探索処理に入る。If j<m-1, the process proceeds to step 24 to determine the next upper limit value, and only the elements larger than the value of M, which is currently set as the jth upper limit value, out of the sorted results of the original data are used for subsequent processing. Set them as the first Si as processing targets, add 1 to j, return to processing step 21, and set the new jth upper limit value, that is, rnj/m]th from the bottom for all data elements, as the current processing target. CF2 processing step 1 for
As you set the new value in step 3, rnj/l1 from the bottom.
ll-k-bz+1 search processing begins.

このようにしてｍ−１個の上限値ａ、〜ａ□、が決定す
ると、各ＰＥｉが前記のようにして各分担範囲を認識し
て、１番目の分担範囲を自身の分担範囲とし、データ中
の自身の分担範囲の要素のみ残し、他の要素はそれぞれ
の属する分担範囲の他の処理装置ＰＥ、〜ＰＥ、へ、ネ
ットワーク３によって転送し、又他の処理装置から自身
へ転送される要素を受は取る。When m-1 upper limit values a, ~a□, are determined in this way, each PEi recognizes each assignment range as described above, sets the first assignment range as its own assignment range, and data Only the elements within its own sharing range are left, and the other elements are transferred via the network 3 to other processing devices PE, ~PE, in their respective sharing ranges, and elements transferred from other processing devices to itself. I will take it.

その結果各ＰＥｉに集まったデータを個別に並列ソート
すれば、ＰＥ、からＰＥ、までにわたって全データのソ
ートが完成する。As a result, if the data collected in each PEi is individually sorted in parallel, the sorting of all data from PE to PE is completed.

第５図は、３個の処理装置ＰＥＩ、ＰＨ２，ＰＥ３によ
って、１〜１５の内容の１５要素のデータを、前記説明
の処理方法によって並列ソートする例である。FIG. 5 shows an example in which data of 15 elements of contents 1 to 15 are sorted in parallel by the processing method described above by three processing devices PEI, PH2, and PE3.

図は、ＰＥ、、ＰＥ、、　ＰＥ３と示子欄に各処理装置
のデータの状況等を示し、左側の欄にＣＦ２の処理を示
しである。In the figure, the data status of each processing device is shown in the indicator columns PE, PE, PE3, and the processing of CF2 is shown in the left column.

第５図Ｔａ）は上限値の決定処理であり、ステップ■で
ＣＦ２はｋをｒ１５Ｘ　１／３１　＝　５にする。各Ｐ
Ｅｉにはステップ■に示すＤ１〜Ｄ、のようにデータが
入力されるものとする。Fig. 5 Ta) shows the upper limit value determination process, and in step (2), CF2 sets k to r15X 1/31 = 5. Each P
It is assumed that data such as D1 to D shown in step (2) are input to Ei.

ステップ■で各ＰＥｉのソート結果がＳＩ　ＮＳ３に示
すようになり、ステップ■に示す各中間値台、〜−１が
ＣＦ２へ送られるので、ステップ■のように、それらの
中間値Ｍ＝８が決定して各ＰＥｉへ通知される。In step ■, the sorting result of each PEi becomes as shown in SI NS3, and each intermediate value range, ~-1, shown in step ■ is sent to CF2, so as in step ■, their intermediate value M=8 is It is determined and notified to each PEi.

ステップ■のように、各ＰＥｉでは中間値Ｍ＝８による
分類が行われ、ステップ■に示す各集合の要素数がＣＦ
２へ通知される。As in step ■, each PEi is classified using the intermediate value M=8, and the number of elements in each set shown in step ■ is CF
2 will be notified.

ＣＦ２では、ステップ■のように要素数の集計を得、ｋ
＝５と比較して、ｋ＜ｂｔが得られるので、全ＰＥｉに
１が通知される。In CF2, obtain the total number of elements as in step ■, and k
Since k<bt is obtained compared to =5, 1 is notified to all PEi.

従って、ステップ■のように、各ＰＥｉではＳｉ１を新
たなＳｉとし、ステップ［相］のようにその中間値をＣ
Ｆ２へ通知する。こ＼で、要素数が偶数の場合には、取
り決めによって例えば小さい値を中間値とする。Therefore, as in step ■, in each PEi, Si1 is set as new Si, and as in step [phase], the intermediate value
Notify F2. Here, if the number of elements is even, a small value is set as the intermediate value by agreement.

ステップ■で、ＣＦ２においてＭ＝３が決定し、その中
間値による処理が、ステップ＠〜０で前と同様に行われ
て、ｂ、＋ｂ２＜ｋになるので、ｋの値が５−ｂ、−ｂ
、＝　２に更新され、全ＰＥｉに３が通知される。In step ■, M=3 is determined in CF2, and the process using the intermediate value is performed in the same way as before in step @~0, so that b,+b2<k, so the value of k is 5-b, -b
, = 2, and 3 is notified to all PEi.

従ってステップ［相］のように、各ＰＥｉではＳｔ、を
新たなＳｉとし、ステップ［相］のようにその中間値を
ＣＦ２へ通知する。　　　　　　　　　　、。Therefore, as in step [phase], in each PEi, St is set as a new Si, and as in step [phase], the intermediate value is notified to CF2. ,.

ステップ０で、ＣＦ２においてＭ・５が決定し、その中
間値による処理が、ステップ［相］〜［相］で前と同様
に行われて、ｂ、＋ｂ、＝ｋになるので、２が全ＰＥｉ
に通知されて、現在のＭ＝５が５番目の大きさの要素と
決定し、第１の上限値になる。In step 0, M・5 is determined in CF2, and the process using its intermediate value is performed in the same way as before in steps [phase] to [phase], resulting in b, +b, = k, so 2 is the total PEi
The current M=5 is determined to be the fifth largest element and becomes the first upper limit.

次にはに工ｒ１５Ｘ　２　／　３１　＝１０にして、図
示しない前記と同様の処理を行うことにより、第２の上
限値として１０が決定する。Next, by setting r15X 2 / 31 =10 and performing the same process as described above (not shown), 10 is determined as the second upper limit value.

第５図（ｂｌは、以上で決定した上限値５及び１０によ
って、各ＰＥｉが相互にデータを分配し、その結果のデ
ータを並列にソートする状況を示す。FIG. 5 (bl) shows a situation in which each PEi mutually distributes data according to the upper limit values 5 and 10 determined above, and the resulting data is sorted in parallel.

上限値が５と１０であるので、５以下が第１の処理装置
ＰＥ、　、６から１０までがＰＥ２．１０より大きいも
のがＰＥ３の分担範囲となり、ステップ０に示す手持ち
データから各地の処理装置の分担範囲の要素を、ステッ
プ＠のように転送する。Since the upper limit values are 5 and 10, 5 or less is the first processing device PE, 6 to 10 is PE2, and anything larger than 10 is the scope of PE3, and from the data on hand shown in step 0, the processing device of each place Transfer the elements in the assigned range like step @.

その結果各処理装置には、ステップ０のようにデータが
集まり、それらを各処理装置で並列にソートして、ステ
ップ０に示すように、３処理装置にわたって昇順に配列
されたソート結果を得ることができる。As a result, each processing device collects data as shown in step 0, and each processing device sorts the data in parallel to obtain sorted results arranged in ascending order across the three processing devices, as shown in step 0. Can be done.

〔Effect of the invention〕

以上の説明から明らかなように、本発明によれば、複数
の処理装置を有する計算機システムの並列ソート処理に
おいて、処理装置間のデータ転送回数を著しく減少する
ことにより、処理効率を向上するという著しい工業的効
果がある。As is clear from the above description, according to the present invention, processing efficiency is significantly improved by significantly reducing the number of data transfers between processing units in parallel sort processing of a computer system having a plurality of processing units. It has industrial effects.

[Brief explanation of drawings]

第１図は本発明の構成を示す処理の流れ図、第２図は計
算機システムの構成例ブロック図、第３図はパイトニッ
クマージ法の説明図、第４図は上限値決定処理の流れ図
、第５図は本発明のソート処理例の説明図である。図において、１−１〜１−ｍは処理装置、２−１〜２−ｍは記憶装置
、３はネットワーク、　　４は制御処理装置、１０〜１
３．２０〜２４．３０〜３２は処理ステップを示す。Fig. 1 is a flowchart of processing showing the configuration of the present invention, Fig. 2 is a block diagram of an example configuration of a computer system, Fig. 3 is an explanatory diagram of the pitonic merge method, Fig. 4 is a flowchart of upper limit determination processing, FIG. 5 is an explanatory diagram of an example of sorting processing according to the present invention. In the figure, 1-1 to 1-m are processing devices, 2-1 to 2-m are storage devices, 3 is a network, 4 is a control processing device, and 10 to 1
3.20-24.30-32 indicate processing steps.

Claims

[Claims]

(1) In a computer system in which m processing devices are connected via a communication path, each processing device executes in parallel the sorting of data consisting of a plurality of elements distributed to each processing device. At , a predetermined notification indicating a sharing range obtained by dividing the value range of the data element into m pieces is received (30), and the data element held by the processing device is divided into m pieces of sharing range. The elements are mutually distributed to the respective processing devices assigned to handle the respective tasks (31), and a sorting process is performed on the distributed elements (3
2) A parallel sort processing method characterized by the following.

(2) The predetermined notification indicating the allocation range has a predetermined magnitude relationship with ni/m, where the number of elements of the data is n and i is an integer taking a value from 1 to m-1, ni/
The value of each j_i-th size element from the smallest of the data elements, where j_i is the integer closest to m,
2. The parallel sort processing method according to claim 1, wherein said value is used as a value indicating a predetermined boundary between said m assignment ranges to identify said assignment range.