JPWO2008155852A1

JPWO2008155852A1 - Method and apparatus for aggregating tabular data in a shared memory parallel processing system

Info

Publication number: JPWO2008155852A1
Application number: JP2009520206A
Authority: JP
Inventors: 古庄　晋二; 晋二古庄; ジャヒルフセインジャスディーン
Original assignee: Turbo Data Laboratories Inc
Current assignee: Turbo Data Laboratories Inc
Priority date: 2007-06-21
Filing date: 2007-06-21
Publication date: 2010-08-26
Anticipated expiration: 2027-06-21
Also published as: WO2008155852A1; JP4881435B2

Abstract

メモリ共有型マルチプロセッサシステムにおいて、所定の項目の組に関する一意の項目値の組を抽出するために、所定の項目の組に関する項目値の組に対応する項目値番号の組の出現回数を並列的に数え上げ、出現回数の累積度数分布を表現する累積度数分布配列を共有メモリに生成し、レコード番号配列の要素の値を添字として使って項目値番号の組を並列的に読み出し、項目値番号の組を添字として使って累積度数分布配列の要素を並列的に読み出し、累積度数分布配列の要素をインデックスポインタとして利用してレコード番号配列の要素の値をソートされたレコード番号配列に並列的に格納し、インデックスポインタとして利用された累積度数分布配列の要素の値に従ってソートされたレコード番号配列を一意の項目値の組を有するレコードのレコード番号のグループに区分し、区分されたレコード番号の各グループから１個のレコード番号を抽出する。In a shared memory multiprocessor system, in order to extract a unique set of item values related to a set of predetermined items, the number of occurrences of a set of item value numbers corresponding to the set of item values related to the set of predetermined items is calculated in parallel. Generate a cumulative frequency distribution array that expresses the cumulative frequency distribution of the number of occurrences in the shared memory, read the item value number pairs in parallel using the element value of the record number array as a subscript, Reads the elements of the cumulative frequency distribution array in parallel using the tuple as a subscript, and stores the values of the elements of the record number array in parallel in the sorted record number array using the elements of the cumulative frequency distribution array as the index pointer And the record number array sorted according to the value of the element of the cumulative frequency distribution array used as the index pointer has a unique set of field values Divided into groups of record number code, and extracts one record number from the group of segmented record number.

Description

本発明は、メモリ共有型並列処理システム、特に、対称型マルチプロセッサ（ＳｙｍｍｅｔｒｉｃＭｕｌｔｉｐｌｅＰｒｏｃｅｓｓｏｒ）システムにおいて、表形式データから一意の項目値の組を抽出する方法及び装置に関係する。また、本発明は、共有メモリにアクセスする複数台のプロセッサを備えるコンピュータに、表形式データから一意の項目値の組を抽出するコードを実行させるプログラムに関係する。また、本発明は、上記方法を共有メモリにアクセスする複数台のプロセッサを備えるコンピュータに実行させるためのコンピュータプログラムプロダクト、及び、コンピュータプログラムが記録された記録媒体に関係する。 The present invention relates to a method and apparatus for extracting a unique set of item values from tabular data in a memory-sharing parallel processing system, particularly, a symmetric multiprocessor system. The present invention also relates to a program for causing a computer including a plurality of processors accessing a shared memory to execute a code for extracting a unique set of item values from tabular data. The present invention also relates to a computer program product for causing a computer including a plurality of processors that access a shared memory to execute the above method, and a recording medium on which the computer program is recorded.

本発明は、さらに、メモリ共有型並列処理システム、特に、対称型マルチプロセッサシステムにおいて、表形式データを多次元で集計する方法及び装置に関係する。また、本発明は、共有メモリにアクセスする複数台のプロセッサを備えるコンピュータに、表形式データを多次元で集計するコードを実行させるプログラムに関係する。また、本発明は、上記方法を共有メモリにアクセスする複数台のプロセッサを備えるコンピュータに実行させるためのコンピュータプログラムプロダクト、及び、コンピュータプログラムが記録された記録媒体に関係する。 The present invention further relates to a method and apparatus for tabulating tabular data in a multi-dimensional manner in a memory-sharing parallel processing system, particularly a symmetric multiprocessor system. The present invention also relates to a program for causing a computer including a plurality of processors that access a shared memory to execute a code for tabulating tabular data in a multidimensional manner. The present invention also relates to a computer program product for causing a computer including a plurality of processors that access a shared memory to execute the above method, and a recording medium on which the computer program is recorded.

従来、産業上の様々な分野において、大規模データを高速に処理することが求められている。大規模データの処理は、キャッシュやプリフェッチなどによるメモリアクセスの高速化、メモリ自体の高速化、及び、プロセッサの並列化のような演算処理の高速化、といったハードウェア技術の開発、ならびに、データ処理アルゴリズムの開発によって、高速化され続けている。たとえば、本発明者は、大規模データを高速に処理するため、特許文献１に記載されているような、「オンメモリデータ処理アルゴリズム」を提案している。 Conventionally, it is required to process large-scale data at high speed in various industrial fields. For large-scale data processing, development of hardware technologies such as high-speed memory access by cache and prefetch, high-speed memory, and high-speed arithmetic processing such as parallel processing of processors, and data processing The development of algorithms continues to speed up. For example, the present inventor has proposed an “on-memory data processing algorithm” as described in Patent Document 1 in order to process large-scale data at high speed.

また、大規模データを処理するために、効率の良いアルゴリズムが開発されている。大規模データ、特に、大規模な表形式データを処理する際に頻出する処理はソートである。効率的なソートアルゴリズムとして、基数（ＲＡＤＩＸ）ソートとカウンティング（ＣＯＵＮＴＩＮＧ）ソート（計数ソート、分布数え上げソートとも称される）が知られている。カウンティングソートは基数ソートの各桁のソートに利用されることがあり、効率の良いアルゴリズムである。 In addition, efficient algorithms have been developed for processing large-scale data. Sorting is a process that occurs frequently when processing large-scale data, particularly large-scale tabular data. As an efficient sorting algorithm, a radix (RADIX) sort and a counting (COUNTING) sort (also called a count sort or a distribution counting sort) are known. The counting sort is sometimes used for sorting each digit of the radix sort and is an efficient algorithm.

近年、複数個のプロセッサが並列に動作するマルチプロセッサや、１チップ上に複数個のＣＰＵコアが搭載されたマルチコアプロセッサ、特に、複数個のＣＰＵが同等に処理を分担する対称型マルチプロセッサのようなメモリ共有型並列処理システムが設計されている。大規模データの検索・集計・ソート等の処理は、多くの場合に大規模データに同じ演算を適用するので、特に、対称型マルチプロセッサは大規模データの高速処理に適していると考えられる。これに関して、本発明者は、特許文献２に記載されているように、大規模な表形式データを処理するために、効率的なソートアルゴリズムと組み合わされたメモリ共有型マルチプロセッサシステムを提案している。 In recent years, a multiprocessor in which a plurality of processors operate in parallel, a multicore processor in which a plurality of CPU cores are mounted on one chip, in particular, a symmetric multiprocessor in which a plurality of CPUs share processing equally. A shared memory parallel processing system has been designed. Since processing such as retrieval, aggregation, and sorting of large-scale data often applies the same operation to large-scale data, it is considered that a symmetric multiprocessor is particularly suitable for high-speed processing of large-scale data. In this regard, the inventor has proposed a memory-shared multiprocessor system combined with an efficient sorting algorithm to process large-scale tabular data as described in Patent Document 2. Yes.

特許文献２に記載されたメモリ共有型マルチプロセッサシステムでは、最初に、処理対象のレコードが分割されて複数台のプロセッサへ割り当てられる。次に、各プロセッサが処理対象のレコードに関連付けられた項目値番号のローカルな出現回数をカウントする。次に、各プロセッサでカウントされた項目値番号のローカルな出現回数を、項目値番号のグローバルな累計数、すなわち、複数台のプロセッサ間で共通に用いられる累計数に変換する。最後に、各プロセッサは、このグローバルな累計数をポインタとして利用することにより、割り当てられたレコードの順序を入れ替える。処理対象のレコードの複数台のプロセッサへの割り当て、ローカルな出現回数のカウント、及び、割り当てられたレコードの順序の入れ替えは、複数台のプロセッサが並列に処理可能である。また、グローバルな累計数の算出は、複数台のプロセッサの並列処理を利用してもよいが、メモリにシーケンシャルにアクセスできるためキャッシュへのヒット率が高いので、１台又は一部のプロセッサだけが担当して高速性を維持できる。 In the memory shared multiprocessor system described in Patent Document 2, first, a record to be processed is divided and assigned to a plurality of processors. Next, each processor counts the number of local occurrences of the item value number associated with the record to be processed. Next, the local appearance count of the item value number counted by each processor is converted into a global cumulative number of item value numbers, that is, a cumulative number commonly used among a plurality of processors. Finally, each processor changes the order of the allocated records by using the global cumulative number as a pointer. Allocation of records to be processed to a plurality of processors, counting the number of local appearances, and changing the order of allocated records can be processed in parallel by a plurality of processors. In addition, the global cumulative number may be calculated by using parallel processing of multiple processors, but because the cache hit rate is high because the memory can be accessed sequentially, only one or a part of the processors can be calculated. Responsible for maintaining high speed.

このように、特許文献２には、メモリ共有型並列ソート技術が開示されているが、メモリ共有型並列集計技術は記載されていない。また、本発明者は、特許文献３に記載されているように、メモリ分散型並列集計技術を既に提案している。メモリ共有型とメモリ分散型とを問わず、集計とは、例えば、情報の項目に対応した項目値を含むレコードの配列として表される表形式データから、ある項目（次元）のある項目値（次元値）毎に、別の項目の項目値に基づく数量（測度）を算出することである。測度の算出とは、測度の個数をカウントしたり、測度の総和を算出したり、測度の平均値を算出したりすることである。また、次元数は、２次元以上でも構わない。例えば、年齢という項目と、身長という項目と、体重という項目を含む表形式データである場合に、年齢・身長別に体重の平均値を求める処理は、年齢と身長を次元とし、体重の平均値を測度とする集計処理である。 Thus, Patent Document 2 discloses a memory shared parallel sort technique, but does not describe a memory shared parallel tabulation technique. The inventor has already proposed a memory distributed parallel tabulation technique as described in Patent Document 3. Regardless of the memory sharing type or the memory distributed type, the aggregation is, for example, an item value (item) having a certain item (dimension) from tabular data represented as an array of records including item values corresponding to information items. For each dimension value), a quantity (measure) based on the item value of another item is calculated. Calculation of a measure means counting the number of measures, calculating the sum of measures, or calculating an average value of measures. The number of dimensions may be two or more. For example, in the case of tabular data including an item of age, an item of height, and an item of weight, the process of calculating the average value of weight by age and height is based on age and height as the dimensions, and the average value of weight This is a tabulation process as a measure.

このような集計処理では、最初に、次元を設定し、次元値毎にレコードを分類することが必要である。次元値毎にレコードを分類するために、次元値に関してレコードをソートする処理を利用することが可能である。特に、多次元集計においては、レコードを次元値の組毎にグループ化し、同一のグループに属するレコードに含まれる項目値に基づく測度を算出することによって、表形式データ全体の集計が達成される。よって、メモリ共有型マルチプロセッサシステムにおいて、大規模な表形式データを高速集計するためには、集計のための、特に、多次元集計のための次元を高速に作成する技術が必要である。
国際公開ＷＯ００／１０１０３号公報国際公開ＷＯ２００６／１２６４６７号公報国際公開ＷＯ２００５／０４１０６７号公報 In such a tabulation process, it is necessary to first set a dimension and classify records for each dimension value. In order to classify records for each dimension value, it is possible to use a process for sorting records with respect to dimension values. In particular, in multidimensional tabulation, tabulation of entire tabular data is achieved by grouping records into sets of dimension values and calculating measures based on item values included in records belonging to the same group. Therefore, in a memory shared multiprocessor system, in order to tabulate large-scale tabular data at high speed, a technique for creating a dimension for tabulation, particularly for multi-dimensional tabulation, at high speed is required.
International Publication WO00 / 10103 International Publication WO2006 / 126467 International Publication WO2005 / 041067

本発明の目的は、上記の従来技術の問題点に鑑みて、メモリ共有型並列処理システムにおいて、表形式データから、多次元集計のための次元を表現する一意の項目値の組を効率的に抽出する方法を提供することである。 In view of the above-described problems of the prior art, an object of the present invention is to efficiently generate a set of unique item values representing a dimension for multidimensional aggregation from tabular data in a memory shared parallel processing system. It is to provide a method of extraction.

本発明のさらなる目的は、上記の従来技術の問題点に鑑みて、メモリ共有型並列処理システムにおいて、表形式データから、多次元集計のための次元を表現する一意の項目値の組を効率的に抽出する装置を提供することである。 In view of the above-described problems of the conventional technology, a further object of the present invention is to efficiently generate a set of unique item values representing a dimension for multidimensional aggregation from tabular data in a memory shared parallel processing system. It is to provide a device for extraction.

本発明は、さらに、メモリ共有型並列処理システムにおいて、表形式データの多次元集計を効率的に実行する方法及び装置の提供を目的とする。 It is another object of the present invention to provide a method and apparatus for efficiently executing multidimensional aggregation of tabular data in a memory shared parallel processing system.

上記本発明の目的を達成するため、本発明の一態様によれば、表形式データのレコードのレコード番号が所定のレコード順に従って格納されたレコード番号配列、上記表形式データの上記レコードの所定の項目の項目値に対応する項目値番号がレコード番号に従って格納された項目値番号配列、及び、上記表形式データの項目値が当該項目値に対応する項目値番号の順序に従って格納された項目値配列を記憶する共有メモリと、
上記共有メモリにアクセス可能である複数台のプロセッサと、
を備えたメモリ共有型マルチプロセッサシステムにおいて、
所定の項目の組に関する一意の項目値の組を抽出する方法であって、
（ａ）上記複数台のプロセッサを用いて、上記所定の項目の組に関する項目値の組に対応する項目値番号の組の出現回数を並列的に数え上げ、上記出現回数の累積度数分布を表現する累積度数分布配列を上記共有メモリに生成するステップと、
（ｂ）上記複数台のプロセッサを用いて、上記レコード番号配列の要素の値を添字として使って上記項目値番号の組を並列的に読み出し、上記項目値番号の組より計算された値を添字として使って上記累積度数分布配列の要素を並列的に読み出し、上記累積度数分布配列の要素をインデックスポインタとして利用して上記レコード番号配列の要素の値をソートバージョンのレコード番号配列（すなわち、ソートされたレコード番号配列）に並列的に格納するステップと、
（ｃ）上記インデックスポインタとして利用された上記累積度数分布配列の要素の値に従って上記ソートされたレコード番号配列を上記一意の項目値の組を有するレコードのレコード番号のグループに区分するステップと、
（ｄ）上記区分されたレコード番号の各グループから１個のレコード番号を抽出することにより、上記一意の項目値の組を特定するステップと、
を備える方法が提供される。In order to achieve the object of the present invention, according to one aspect of the present invention, a record number array in which record numbers of records of tabular data are stored in a predetermined record order, a predetermined number of records of the tabular data An item value number array in which item value numbers corresponding to item values of items are stored according to record numbers, and an item value array in which the item values of the above-mentioned tabular data are stored in the order of item value numbers corresponding to the item values A shared memory for storing
A plurality of processors capable of accessing the shared memory;
In a shared memory multiprocessor system with
A method for extracting a unique set of item values for a given set of items, comprising:
(A) Using the plurality of processors, the number of occurrences of item value number pairs corresponding to the item value pairs relating to the predetermined item set is counted in parallel to express the cumulative frequency distribution of the appearance times. Generating a cumulative frequency distribution array in the shared memory;
(B) Using the plurality of processors, using the values of the elements of the record number array as subscripts, the item value number pairs are read in parallel, and the values calculated from the item value number pairs are subscripted. Are used to read the elements of the cumulative frequency distribution array in parallel, and use the elements of the cumulative frequency distribution array as index pointers to convert the values of the elements of the record number array into the record number arrays of sorted versions (ie, sorted Stored in parallel in the record number array),
(C) partitioning the sorted record number array according to the value of the element of the cumulative frequency distribution array used as the index pointer into a group of record numbers of records having the set of unique item values;
(D) identifying one set of unique item values by extracting one record number from each group of the segmented record numbers;
A method comprising:

さらに、本発明の方法は、上記累積度数分布配列を上記共有メモリに並列的に生成するステップ（ａ）が、上記複数台のプロセッサを用いて、上記レコード番号配列に格納された上記レコード番号毎に、上記所定の項目の組に関する項目値の組に対応する項目値番号の組に含まれる上記項目値番号の範囲に応じて基数を設定することにより、上記項目値番号の組を、上記項目値番号の組に含まれる各項目値番号が各桁を表現する基数形式の仮想項目値番号に変換するステップを備えることを特徴とする。 Further, in the method of the present invention, the step (a) of generating the cumulative frequency distribution array in parallel in the shared memory is performed for each record number stored in the record number array using the plurality of processors. In addition, by setting the radix according to the range of the item value numbers included in the set of item value numbers corresponding to the set of item values related to the set of predetermined items, the set of item value numbers is A step of converting each item value number included in the set of value numbers into a virtual item value number in a radix format expressing each digit is provided.

さらに、本発明の方法は、上記所定の項目の組に含まれる項目毎に上記累積度数分布配列を上記共有メモリに並列的に生成するステップ（ｉ）と、上記所定の項目の組に含まれる項目毎に上記レコード番号配列の要素の値を上記ソートされたレコード番号配列に並列的に格納するステップ（ｉｉ）と、上記所定の項目の組に含まれる項目毎に上記ソートされたレコード番号配列を上記一意の項目値の組を有するレコードのレコード番号のグループに区分するステップ（ｉｉｉ）とを備え、
上記累積度数分布配列を上記共有メモリに並列的に生成するステップ（ｉ）が、上記複数台のプロセッサを用いて、上記項目に関する項目値に対応する項目値番号の出現回数を並列的に数え上げ、上記出現回数の累積度数分布を表現する累積度数分布配列を上記共有メモリに並列的に生成するステップを備え、
上記レコード番号配列の要素を上記ソートされたレコード番号配列に並列的に格納するステップ（ｉｉ）が、上記複数台のプロセッサを用いて、上記項目値番号配列の要素の値を添字として使って上記項目値番号を並列的に読み出し、上記項目値番号を添字として使って上記累積度数分布配列の要素を並列的に読み出し、上記累積度数分布配列の要素をインデックスポインタとして利用して上記レコード番号配列の要素の値を上記ソートされたレコード番号配列に並列的に格納するステップを備えることを特徴とする。Further, the method of the present invention includes the step (i) of generating the cumulative frequency distribution array in parallel in the shared memory for each item included in the predetermined item set, and the predetermined item set. (Ii) storing the values of the elements of the record number array in parallel in the sorted record number array for each item, and the sorted record number array for each item included in the predetermined set of items And (iii) dividing the record into groups of record numbers of records having the unique field value pairs,
The step (i) of generating the cumulative frequency distribution array in the shared memory in parallel uses the plurality of processors to count in parallel the number of occurrences of the item value number corresponding to the item value related to the item, Generating a cumulative frequency distribution array representing the cumulative frequency distribution of the occurrence counts in parallel in the shared memory,
The step (ii) of storing the elements of the record number array in parallel in the sorted record number array is performed using the plurality of processors and the values of the element of the item value number array as subscripts. Read the item value number in parallel, read the element of the cumulative frequency distribution array in parallel using the item value number as a subscript, and use the element of the cumulative frequency distribution array as an index pointer in the record number array A step of storing the element values in parallel in the sorted record number array is provided.

さらに、本発明の方法は、（ｅ）上記複数台のプロセッサを用いて、上記所定の項目値の組毎に別の項目に関する項目値を並列的に集計するステップをさらに備えることを特徴とする。 Furthermore, the method of the present invention further includes the step of (e) totalizing item values related to different items for each set of the predetermined item values in parallel using the plurality of processors. .

さらに、本発明の方法は、（ｆ）上記複数台のプロセッサを用いて、上記ソートされたレコード番号配列中のレコード番号及び前記レコード番号に対応するグループのグループ番号を、前記レコード番号を添字として使って、補助的なレコード番号配列及び補助的なグループ番号配列に並列的に格納するステップと、
（ｇ）上記複数台のプロセッサを用いて、上記補助的なレコード番号配列の要素を添字として使って上記別の項目に関する項目値番号配列から並列的に要素を読み出し、前記項目値番号配列の要素を添字として使って上記別の項目に関する項目値配列から並列的に項目値を読み出し、上記補助的なレコード番号配列の要素を添字として使って上記補助的なグループ番号配列の要素を読み出し、上記補助的なグループ番号配列の要素を集計配列の添字として使って上記読み出された項目値の集計結果を上記集計配列に格納するステップと、
をさらに備えることを特徴とする。Further, according to the method of the present invention, (f) using the plurality of processors, the record number in the sorted record number array and the group number of the group corresponding to the record number are used as the subscript. And storing in parallel in an auxiliary record number array and an auxiliary group number array;
(G) Using the plurality of processors, elements in the item value number array are read out in parallel from the item value number array relating to the other item using the elements in the auxiliary record number array as subscripts. Is used as a subscript to read the item value in parallel from the item value array for the other item, and the auxiliary record number array element is used as a subscript to read the auxiliary group number array element. Storing the aggregation result of the read item value in the aggregation array using an element of a typical group number array as a subscript of the aggregation array;
Is further provided.

さらに、本発明の方法は、（ｈ）上記複数台のプロセッサを用いて、上記ソートされたレコード番号配列の要素を添字として使って上記別の項目に関する項目値番号配列から並列的に要素を読み出し、前記項目値番号配列の要素を添字として使って上記別の項目に関する項目値配列から並列的に項目値を読み出し、上記ソートされたレコード番号配列の要素に対応するグループのグループ番号を集計配列の添字として使って上記読み出された項目値の集計結果を上記集計配列に格納するステップをさらに備えることを特徴とする。 Furthermore, the method of the present invention includes (h) using the plurality of processors to read elements in parallel from the item value number array relating to the other item using the elements of the sorted record number array as subscripts. , Reading the item value in parallel from the item value array relating to the other item using the element of the item value number array as a subscript, and the group number of the group corresponding to the element of the sorted record number array is The method further comprises the step of storing the totaled result of the read item values in the totaling array using the subscript as a subscript.

上記本発明のさらなる目的を達成するため、本発明の一態様によれば、表形式データのレコードのレコード番号が所定のレコード順に従って格納されたレコード番号配列、上記表形式データの上記レコードの所定の項目の項目値に対応する項目値番号がレコード番号に従って格納された項目値番号配列、及び、上記表形式データの項目値が当該項目値に対応する項目値番号の順序に従って格納された項目値配列を記憶する共有メモリと、
上記共有メモリにアクセス可能である複数台のプロセッサと、
を備えたメモリ共有型マルチプロセッサシステムにおいて、
所定の項目の組に関する一意の項目値の組を抽出する装置であって、
（ａ）上記複数台のプロセッサを用いて、上記所定の項目の組に関する項目値の組に対応する項目値番号の組の出現回数を並列的に数え上げ、上記出現回数の累積度数分布を表現する累積度数分布配列を上記共有メモリに生成する手段と、
（ｂ）上記複数台のプロセッサを用いて、上記レコード番号配列の要素の値を添字として使って上記項目値番号の組を並列的に読み出し、上記項目値番号の組より計算された値を添字として使って上記累積度数分布配列の要素を並列的に読み出し、上記累積度数分布配列の要素をインデックスポインタとして利用して上記レコード番号配列の要素の値をソートバージョンのレコード番号配列（すなわち、ソートされたレコード番号配列）に並列的に格納する手段と、
（ｃ）上記インデックスポインタとして利用された上記累積度数分布配列の要素の値に従って上記ソートされたレコード番号配列を上記一意の項目値の組を有するレコードのレコード番号のグループに区分する手段と、
（ｄ）上記区分されたレコード番号の各グループから１個のレコード番号を抽出することにより、上記一意の項目値の組を特定する手段と、
を備える装置が提供される。In order to achieve the further object of the present invention, according to one aspect of the present invention, a record number array in which record numbers of records in tabular data are stored in a predetermined record order, a predetermined number of records in the tabular data, An item value number array in which item value numbers corresponding to item values of items in the table are stored according to the record number, and an item value in which the item values of the tabular data are stored in the order of the item value numbers corresponding to the item values A shared memory for storing the array;
A plurality of processors capable of accessing the shared memory;
In a shared memory multiprocessor system with
A device that extracts a unique set of item values for a given set of items,
(A) Using the plurality of processors, the number of occurrences of a set of item value numbers corresponding to the set of item values relating to the set of predetermined items is counted in parallel, and the cumulative frequency distribution of the number of appearances is expressed. Means for generating a cumulative frequency distribution array in the shared memory;
(B) Using the plurality of processors, using the values of the elements of the record number array as subscripts, the item value number pairs are read in parallel, and the values calculated from the item value number pairs are subscripted. Are used to read the elements of the cumulative frequency distribution array in parallel, and use the elements of the cumulative frequency distribution array as index pointers to change the values of the elements of the record number array to the sorted version of the record number array (i.e., sorted A record number array) in parallel,
(C) means for dividing the sorted record number array into record number groups of records having the unique item value pairs according to the values of the elements of the cumulative frequency distribution array used as the index pointer;
(D) means for identifying the set of unique item values by extracting one record number from each group of the divided record numbers;
An apparatus comprising:

さらに本発明の一態様によれば、表形式データのレコードのレコード番号が所定のレコード順に従って格納されたレコード番号配列、上記表形式データの上記レコードの所定の項目の項目値に対応する項目値番号がレコード番号に従って格納された項目値番号配列、及び、上記表形式データの項目値が当該項目値に対応する項目値番号の順序に従って格納された項目値配列を記憶する共有メモリと、
上記共有メモリにアクセス可能である複数台のプロセッサと、
を備えたコンピュータにロードされ、所定の項目の組に関する一意の項目値の組を抽出するコードを上記コンピュータに実行させるコンピュータ読み取り可能なプログラムであって、
（ａ）上記複数台のプロセッサを用いて、上記所定の項目の組に関する項目値の組に対応する項目値番号の組の出現回数を並列的に数え上げ、上記出現回数の累積度数分布を表現する累積度数分布配列を上記共有メモリに生成するコードと、
（ｂ）上記複数台のプロセッサを用いて、上記レコード番号配列の要素の値を添字として使って上記項目値番号の組を並列的に読み出し、上記項目値番号の組より計算された値を添字として使って上記累積度数分布配列の要素を並列的に読み出し、上記累積度数分布配列の要素をインデックスポインタとして利用して上記レコード番号配列の要素の値をソートバージョンのレコード番号配列（すなわち、ソートされたレコード番号配列）に並列的に格納するコードと、
（ｃ）上記インデックスポインタとして利用された上記累積度数分布配列の要素の値に従って上記ソートされたレコード番号配列を上記一意の項目値の組を有するレコードのレコード番号のグループに区分するコードと、
（ｄ）上記区分されたレコード番号の各グループから１個のレコード番号を抽出することにより、上記一意の項目値の組を特定するコードと、
を含むプログラムが提供される。Furthermore, according to one aspect of the present invention, a record number array in which the record numbers of the records in the tabular data are stored according to a predetermined record order, and the item values corresponding to the item values of the predetermined items in the records of the tabular data An item value number array in which numbers are stored according to record numbers, and a shared memory for storing item value arrays in which the item values of the tabular data are stored in accordance with the order of the item value numbers corresponding to the item values
A plurality of processors capable of accessing the shared memory;
A computer-readable program that causes a computer to execute code for extracting a unique set of item values for a predetermined set of items, loaded on a computer comprising:
(A) Using the plurality of processors, the number of occurrences of item value number pairs corresponding to the item value pairs relating to the predetermined item set is counted in parallel to express the cumulative frequency distribution of the appearance times. Code for generating a cumulative frequency distribution array in the shared memory;
(B) Using the plurality of processors, using the values of the elements of the record number array as subscripts, the item value number pairs are read in parallel, and the values calculated from the item value number pairs are subscripted. Are used to read the elements of the cumulative frequency distribution array in parallel, and use the elements of the cumulative frequency distribution array as index pointers to convert the values of the elements of the record number array into the record number arrays of sorted versions (ie, sorted Code stored in parallel in the record number array)
(C) a code that divides the sorted record number array according to the value of the element of the cumulative frequency distribution array used as the index pointer into a group of record numbers of records having the unique item value pairs;
(D) a code for identifying a set of unique item values by extracting one record number from each group of the record numbers that are classified;
Is provided.

さらに本発明の一態様によれば、表形式データのレコードのレコード番号が所定のレコード順に従って格納されたレコード番号配列、上記表形式データの上記レコードの所定の項目の項目値に対応する項目値番号がレコード番号に従って格納された項目値番号配列、及び、上記表形式データの項目値が当該項目値に対応する項目値番号の順序に従って格納された項目値配列を記憶する共有メモリと、上記共有メモリにアクセス可能である複数台のプロセッサと、を備えたコンピュータにロードされ、所定の項目の組に関する一意の項目値の組を抽出する本発明の方法を上記コンピュータに実行させるためのコンピュータプログラムプロダクトが提供される。 Furthermore, according to one aspect of the present invention, a record number array in which the record numbers of the records in the tabular data are stored according to a predetermined record order, and the item values corresponding to the item values of the predetermined items in the records of the tabular data A shared memory for storing an item value number array in which numbers are stored according to record numbers, and an item value array in which the item values of the tabular data are stored in accordance with the order of the item value numbers corresponding to the item values; A computer program product for causing a computer to execute the method of the present invention for extracting a set of unique item values relating to a predetermined set of items, loaded on a computer comprising a plurality of processors capable of accessing a memory Is provided.

さらに本発明の一態様によれば、表形式データのレコードのレコード番号が所定のレコード順に従って格納されたレコード番号配列、上記表形式データの上記レコードの所定の項目の項目値に対応する項目値番号がレコード番号に従って格納された項目値番号配列、及び、上記表形式データの項目値が当該項目値に対応する項目値番号の順序に従って格納された項目値配列を記憶する共有メモリと、上記共有メモリにアクセス可能である複数台のプロセッサと、を備えたコンピュータにロードされ、所定の項目の組に関する一意の項目値の組を抽出する本発明の方法を上記コンピュータに実行させるためのコンピュータプログラムが記録された記憶媒体が提供される。 Furthermore, according to one aspect of the present invention, a record number array in which the record numbers of the records in the tabular data are stored according to a predetermined record order, and the item values corresponding to the item values of the predetermined items in the records of the tabular data A shared memory for storing an item value number array in which numbers are stored according to record numbers, and an item value array in which the item values of the tabular data are stored in accordance with the order of the item value numbers corresponding to the item values; A computer program for causing a computer to execute the method of the present invention, which is loaded into a computer having a plurality of processors capable of accessing a memory and extracts a set of unique item values relating to a predetermined set of items. A recorded storage medium is provided.

本発明によれば、大規模な表形式データの多次元集計処理を高速に実現することが可能になる。 According to the present invention, it is possible to realize multidimensional tabulation processing of large-scale tabular data at high speed.

図１は、本発明の一実施形態によるコンピュータシステムの概要図である。FIG. 1 is a schematic diagram of a computer system according to an embodiment of the present invention. 図２は、データ管理機構を説明するための表形式データの一例を表す図である。FIG. 2 is a diagram illustrating an example of tabular data for explaining the data management mechanism. 図３は、本発明の一実施形態によるデータ管理機構の説明図である。FIG. 3 is an explanatory diagram of a data management mechanism according to an embodiment of the present invention. 図４は、本発明の一実施形態による多次元集計処理のフローチャートである。FIG. 4 is a flowchart of multidimensional tabulation processing according to an embodiment of the present invention. 図５は、本発明の一実施形態による多項目一括ソート処理の説明図である。FIG. 5 is an explanatory diagram of multi-item batch sort processing according to an embodiment of the present invention. 図６は、本発明の一実施形態による仮想項目値番号を計算する処理の説明図である。FIG. 6 is an explanatory diagram of processing for calculating a virtual item value number according to an embodiment of the present invention. 図７は、本発明の一実施形態による仮想項目値番号を数え上げる処理の説明図である。FIG. 7 is an explanatory diagram of processing for counting up virtual item value numbers according to an embodiment of the present invention. 図８は、本発明の一実施形態による累積度数分布配列を生成する処理の説明図である。FIG. 8 is an explanatory diagram of processing for generating a cumulative frequency distribution array according to an embodiment of the present invention. 図９は、本発明の一実施例によるレコード番号配列の要素を並べ替える処理の説明図である。FIG. 9 is an explanatory diagram of processing for rearranging the elements of the record number array according to one embodiment of the present invention. 図１０は、本発明の一実施例によるレコード番号配列の要素を並べ替える処理の結果の説明図である。FIG. 10 is an explanatory diagram of the result of the process of rearranging the elements of the record number array according to one embodiment of the present invention. 図１１は、本発明の一実施例によるユニーク行抽出処理の説明図である。FIG. 11 is an explanatory diagram of unique row extraction processing according to an embodiment of the present invention. 図１２は、本発明の一実施形態によるユニーク行抽出処理によって抽出された一意の項目値の組を元のレコード番号配列に出現した順序に並べ替える処理の説明図である。FIG. 12 is an explanatory diagram of a process of rearranging a set of unique item values extracted by the unique row extraction process according to an embodiment of the present invention in the order of appearance in the original record number array. 図１３は、本発明の一実施形態によるユニーク行抽出処理によって抽出された一意の項目値の組を元のレコード番号配列に出現した順序に並べ替える処理の説明図である。FIG. 13 is an explanatory diagram of processing for rearranging a set of unique item values extracted by the unique row extraction processing according to an embodiment of the present invention in the order in which they appear in the original record number array. 図１４は、本発明の一実施形態による多項目逐次ソート処理の概要説明図である。FIG. 14 is a schematic explanatory diagram of multi-item sequential sorting processing according to an embodiment of the present invention. 図１５は、本発明の一実施形態による多項目逐次ソート処理の下位次元ソート処理における並列カウントアップ処理の説明図である。FIG. 15 is an explanatory diagram of the parallel count-up process in the low-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. 図１６は、本発明の一実施形態による多項目逐次ソート処理の下位次元ソート処理における累積度数分布配列生成処理の説明図である。FIG. 16 is an explanatory diagram of the cumulative frequency distribution array generation process in the low-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. 図１７は、本発明の一実施形態による多項目逐次ソート処理の下位次元ソート処理におけるレコード番号配列の要素を並べ替える処理の説明図である。FIG. 17 is an explanatory diagram of the process of rearranging the elements of the record number array in the lower-order sort process of the multi-item sequential sort process according to one embodiment of the present invention. 図１８は、本発明の一実施形態による多項目逐次ソート処理の上位次元ソート処理における並列カウントアップ処理の説明図である。FIG. 18 is an explanatory diagram of the parallel count-up process in the higher-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. 図１９は、本発明の一実施形態による多項目逐次ソート処理の上位次元ソート処理における累積度数分布配列生成処理の説明図である。FIG. 19 is an explanatory diagram of the cumulative frequency distribution array generation process in the higher-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. 図２０は、本発明の一実施形態による多項目逐次ソート処理の上位次元ソート処理におけるレコード番号配列の要素を並べ替える処理の説明図である。FIG. 20 is an explanatory diagram of the process of rearranging the elements of the record number array in the higher-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. 図２１は、本発明の一実施形態によるレコード番号昇順タイプの測度並列算出処理における並べ替え処理の説明図である。FIG. 21 is an explanatory diagram of the rearrangement process in the measure number ascending order type measure parallel calculation process according to an embodiment of the present invention. 図２２は、本発明の一実施形態によるレコード番号昇順タイプの測度並列算出処理における集計処理の説明図である。FIG. 22 is an explanatory diagram of the aggregation process in the measure parallel calculation process of the record number ascending order type according to one embodiment of the present invention. 図２３は、本発明の一実施形態によるグループ番号昇順タイプの測度並列算出処理における集計処理の説明図である。FIG. 23 is an explanatory diagram of a counting process in the group number ascending type measure parallel calculation process according to an embodiment of the present invention. 図２４Ａ、Ｂは、実施例１による多段階並列ソートのソート対象のデータ構造の説明図である。24A and 24B are explanatory diagrams of the data structure to be sorted in the multi-stage parallel sort according to the first embodiment. 図２５は、実施例１による多段階並列ソート方法のフローチャートである。FIG. 25 is a flowchart of the multistage parallel sorting method according to the first embodiment. 図２６Ａ、Ｂは、実施例１による多段階並列ソート方法の第１段階のカウントアップステップの説明図（その１）である。26A and 26B are explanatory diagrams (part 1) of the count-up step in the first stage of the multi-stage parallel sorting method according to the first embodiment. 図２７Ａ、Ｂは、実施例１による多段階並列ソート方法の第１段階のカウントアップステップの説明図（その２）である。27A and 27B are explanatory diagrams (part 2) of the count-up step in the first stage of the multi-stage parallel sorting method according to the first embodiment. 図２８Ａ、Ｂは、実施例１による多段階並列ソート方法の第１段階のカウントアップステップの説明図（その３）である。28A and 28B are explanatory diagrams (part 3) of the count-up step in the first stage of the multi-stage parallel sorting method according to the first embodiment. 図２９Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第１段階の累計数化ステップの説明図である。FIGS. 29A and 29B are explanatory diagrams of the first-stage cumulative numbering step of the ascending order multi-stage parallel sorting method according to the first embodiment. 図３０Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第１段階の転送ステップの説明図（その１）である。30A and 30B are explanatory diagrams (part 1) of the transfer step in the first stage of the ascending multi-stage parallel sort method according to the first embodiment. 図３１Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第１段階の転送ステップの説明図（その２）である。FIGS. 31A and 31B are explanatory diagrams (part 2) of the transfer step in the first stage of the ascending order multi-stage parallel sort method according to the first embodiment. 図３２Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第１段階の転送ステップの説明図（その３）である。FIGS. 32A and 32B are explanatory diagrams (part 3) of the transfer step of the first stage of the ascending order multi-stage parallel sort method according to the first embodiment. 図３３は、実施例１による多段階並列ソート方法の第２段階の初期化ステップの説明図である。FIG. 33 is an explanatory diagram of the initialization step of the second stage of the multi-stage parallel sorting method according to the first embodiment. 図３４Ａ、Ｂは、実施例１による多段階並列ソート方法の第２段階のカウントアップステップの説明図（その１）である。FIGS. 34A and 34B are explanatory diagrams (part 1) of the second-stage count-up step of the multi-stage parallel sorting method according to the first embodiment. 図３５Ａ、Ｂは、実施例１による多段階並列ソート方法の第２段階のカウントアップステップの説明図（その２）である。FIGS. 35A and 35B are explanatory diagrams (part 2) of the second count-up step of the multi-stage parallel sorting method according to the first embodiment. 図３６Ａ、Ｂは、実施例１による多段階並列ソート方法の第２段階のカウントアップステップの説明図（その３）である。36A and 36B are explanatory diagrams (part 3) of the count-up step in the second stage of the multi-stage parallel sorting method according to the first embodiment. 図３７は、実施例１による昇順の多段階並列ソート方法の第２段階の累計数化ステップの説明図である。FIG. 37 is an explanatory diagram of the second-stage accumulation step in the ascending multi-stage parallel sorting method according to the first embodiment. 図３８Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第２段階の転送ステップの説明図（その１）である。38A and 38B are explanatory diagrams (part 1) of the second-stage transfer step of the ascending multi-stage parallel sort method according to the first embodiment. 図３９Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第２段階の転送ステップの説明図（その２）である。FIGS. 39A and 39B are explanatory diagrams (part 2) of the second-stage transfer step of the ascending multi-stage parallel sort method according to the first embodiment. 図４０Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法の第２段階の転送ステップの説明図（その３）である。40A and 40B are explanatory diagrams (part 3) of the second-stage transfer step of the ascending multi-stage parallel sort method according to the first embodiment. 図４１Ａ−Ｃは、実施例１による昇順の多段階並列ソート方法を適用した結果を示す図（その１）である。FIGS. 41A to 41C are diagrams (part 1) illustrating a result of applying the ascending multi-stage parallel sorting method according to the first embodiment. 図４２Ａ、Ｂは、実施例１による昇順の多段階並列ソート方法を適用した結果を示す図（その２）である。42A and 42B are diagrams (part 2) illustrating a result of applying the ascending multi-stage parallel sorting method according to the first embodiment. 図４３、実施例２による多段階ソートを説明するためのデータ構造図である。FIG. 43 is a data structure diagram for explaining multi-stage sorting according to the second embodiment. 図４４Ａ−Ｄは、実施例２による多段階ソートの第１段階のカウントアップ及び累計数化の説明図である。FIGS. 44A to 44D are explanatory diagrams of count-up and cumulative numbering in the first stage of multi-stage sorting according to the second embodiment. 図４５Ａ、Ｂは、実施例２による多段階ソートの第１段階のレコード番号転送の説明図である。45A and 45B are explanatory diagrams of record number transfer in the first stage of multistage sorting according to the second embodiment. 図４６は、実施例２による多段階ソートの第１段階のレコード番号転送の結果の説明図である。FIG. 46 is an explanatory diagram of the result of record number transfer in the first stage of multi-stage sorting according to the second embodiment. 図４７は、実施例２による多段階ソートの第２段階の初期状態を表す図である。FIG. 47 is a diagram illustrating an initial state of the second stage of the multistage sorting according to the second embodiment. 図４８Ａ−Ｃは、実施例２による多段階ソートの第２段階のカウントアップ及び累計数化の説明図である。48A to 48C are explanatory diagrams of the second stage count-up and cumulative numbering in the multi-stage sort according to the second embodiment. 図４９Ａ、Ｂは、実施例２による多段階ソートの第２段階のレコード番号転送の説明図である。49A and 49B are explanatory diagrams of record number transfer in the second stage of multistage sorting according to the second embodiment. 図５０Ａ−Ｃは、実施例３における集計処理の内容を説明する図であり、図５０Ａは出力される集計結果テーブルを表し、図５０Bは１００万行からなる集計元テーブルを表し、図５０Ｃは６３２３３４個のレコードからなる集計結果テーブルを表している。50A to 50C are diagrams for explaining the contents of the tabulation processing in the third embodiment. FIG. 50A shows an output tabulation result table, FIG. 50B shows a tabulation source table composed of 1 million rows, and FIG. This represents a total result table composed of 632334 records. 図５１Ａ−Ｃは、実施例３の測定結果を示す図であり、図５１Ａは実施例３の５方式による処理の所要時間（ミリ秒）を表し、図５１Ｂは本発明による方式が旧方式に対して高速化された倍率を表し、図５１Ｃは図５１Ｂと同じ高速化倍率をグラフ化したものである。51A-C are diagrams showing the measurement results of Example 3, FIG. 51A shows the required time (milliseconds) of processing by the five methods of Example 3, and FIG. 51B shows the method according to the present invention in the old method. FIG. 51C is a graph showing the same high speed magnification as FIG. 51B. 図５２Ａ−Ｃは、実施例４における集計処理の内容を説明する図であり、図５２Ａは出力される集計結果テーブルを表し、図５２Bは１００万行からなる集計元テーブルを表し、図５２Ｃは６３２３３４個のレコードからなる集計結果テーブルを表している。52A-C are diagrams for explaining the contents of the tabulation process in the fourth embodiment. FIG. 52A shows an output tabulation result table, FIG. 52B shows a tabulation source table consisting of 1 million rows, and FIG. This represents a total result table composed of 632334 records. 図５３Ａ−Ｃは、実施例４の測定結果を示す図であり、図５３Ａは実施例４の５方式による処理の所要時間（ミリ秒）を表し、図５３Ｂは本発明による方式が旧方式に対して高速化された倍率を表し、図５３Ｃは図５３Ｂと同じ高速化倍率をグラフ化したものである。53A-C are diagrams showing the measurement results of Example 4, FIG. 53A shows the required time (milliseconds) of processing by the five methods of Example 4, and FIG. 53B shows that the method according to the present invention is changed to the old method. FIG. 53C is a graph showing the same high speed magnification as FIG. 53B.

Explanation of symbols

１０コンピュータシステム
１２−１，１２−２，・・・，１２−ｐＣＰＵ
１４共有メモリ
１６ＲＯＭ
１８固定記憶装置
２０ＣＤ−ＲＯＭドライバ
２２Ｉ／Ｆ
２４入力装置
２６表示装置10 Computer system 12-1, 12-2, ..., 12-p CPU
14 Shared memory 16 ROM
18 Fixed storage device 20 CD-ROM driver 22 I / F
24 input device 26 display device

以下、本発明を実施するための種々の形態を図面と共に詳細に説明する。 Hereinafter, various embodiments for carrying out the present invention will be described in detail with reference to the drawings.

［コンピュータシステム構成］
図１は、本発明の一実施形態による表形式データの集計処理を実施するコンピュータシステム１０の概略図である。図１に示すように、このコンピュータシステム１０は、プログラムを実行することによりシステム全体および個々の構成部分を制御するｐ台のプロセッサ（ＣＰＵ）１２−１、１２−２、．．．１２−ｐと、ワークデータ等を記憶する、たとえば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）のような共有メモリ１４と、プログラム等を記憶するＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）１６と、ハードディスク等の固定記憶媒体１８と、ＣＤ−ＲＯＭ１９をアクセスするためのＣＤ−ＲＯＭドライバ２０と、ＣＤ−ＲＯＭドライバ２０及び外部ネットワーク等（図示せず）へ繋がれた外部端子に接続されているインタフェース（Ｉ／Ｆ）２２と、キーボード及びマウス等のような入力装置２４と、コンピュータモニターのような表示装置２６とを備えている。ＣＰＵ１２、ＲＡＭ１４、ＲＯＭ１６、外部記憶媒体１８、Ｉ／Ｆ２２、入力装置２４及び表示装置２６は、バス２８を介して相互に接続されている。図示されていないが、各ＣＰＵは固有のローカルメモリを備え、ローカルメモリにはキャッシュメモリが搭載されている。[Computer system configuration]
FIG. 1 is a schematic diagram of a computer system 10 that performs tabular data tabulation processing according to an embodiment of the present invention. As shown in FIG. 1, the computer system 10 includes p processors (CPU) 12-1, 12-2,... That control the entire system and individual components by executing a program. . . 12-p, for storing work data and the like, for example, a shared memory 14 such as a RAM (Random Access Memory), a ROM (Read Only Memory) 16 for storing a program and the like, a fixed storage medium 18 such as a hard disk, and the like A CD-ROM driver 20 for accessing the CD-ROM 19; an interface (I / F) 22 connected to an external terminal connected to the CD-ROM driver 20 and an external network (not shown); An input device 24 such as a keyboard and a mouse, and a display device 26 such as a computer monitor are provided. The CPU 12, RAM 14, ROM 16, external storage medium 18, I / F 22, input device 24, and display device 26 are connected to each other via a bus 28. Although not shown, each CPU has a unique local memory, and a cache memory is mounted on the local memory.

本実施形態による方法をコンピュータシステム１０に実行させるプログラムは、ＣＤ−ＲＯＭ１９に収容され、ＣＤ−ＲＯＭドライバ２０に読取られても良いし、ＲＯＭ１６に予め記憶されていても良い。また、いったんＣＤ−ＲＯＭ１９から読み出したものを、外部記憶媒体１８の所定の領域に記憶しておいても良い。或いは、上記プログラムは、ネットワーク（図示せず）、外部端子、及び、Ｉ／Ｆ２２を介して、外部から供給されるものであっても良い。 A program for causing the computer system 10 to execute the method according to the present embodiment may be stored in the CD-ROM 19 and read by the CD-ROM driver 20 or may be stored in the ROM 16 in advance. Further, what is once read from the CD-ROM 19 may be stored in a predetermined area of the external storage medium 18. Alternatively, the program may be supplied from the outside via a network (not shown), an external terminal, and the I / F 22.

また、本発明の一実施形態によるメモリ共有型マルチプロセッサシステムは、コンピュータシステム１０に表形式データを集計するプログラムを実行させることにより実現される。 In addition, the memory sharing multiprocessor system according to the embodiment of the present invention is realized by causing the computer system 10 to execute a program that tabulates tabular data.

［情報ブロックに基づくデータ管理機構］
図２はデータ管理機構を説明するための表形式データの一例を表す図である。この表形式データは、上述の国際公開第ＷＯ００／１０１０３号に提案したデータ管理機構を用いることにより、コンピュータ内では図３に示されるようなデータ構造として記憶される。このデータ構造は、市販されているコンピュータ、たとえば、パーソナルコンピュータのハードウェア資源、特に、プロセッサ及びメモリを使用して大規模な表形式データの検索、ソート、集計等を実現するために提案された、コンピュータのメモリ上に置かれる表形式データのデータ構造であることに注意すべきである。[Data management mechanism based on information blocks]
FIG. 2 is a diagram illustrating an example of tabular data for explaining the data management mechanism. This tabular data is stored in the computer as a data structure as shown in FIG. 3 by using the data management mechanism proposed in the above-mentioned International Publication No. WO00 / 10103. This data structure has been proposed to realize retrieval, sorting, aggregation, etc. of large-scale tabular data using hardware resources of commercially available computers, for example, personal computers, in particular, processors and memories. It should be noted that the data structure of tabular data placed on the computer memory.

図３に示すように、表形式データの各レコードの並び順の番号と、内部データの並び順の番号を対応付けるレコード番号配列３０１（以下、この配列を「ＯｒｄＳｅｔ」のように略記する。）には、表形式のレコード毎に内部データの並び順番号が値として配置される。この例では、すべての表形式データが内部データとして表されるため、表形式データのレコード番号と内部データの並び順番号とは一致する。 As shown in FIG. 3, a record number array 301 (hereinafter, this array is abbreviated as “OrdSet”) that associates the order number of the records in the tabular data with the order number of the internal data. In the table format, the order number of the internal data is arranged as a value for each record in the table format. In this example, since all tabular data is represented as internal data, the record number of the tabular data matches the arrangement order number of the internal data.

ここで、本明細書中での配列の記法について説明する。一般に、配列Ａは、添字をｉとすると、配列の要素がＡ［ｉ］のように表記できるが、図面中では、配列は、配列の要素Ａ［ｉ］は、実線で囲まれた領域内に示され、要素Ａ［ｉ］と要素Ａ［ｉ＋１］の境界は点線で示されている。また、要素Ａ［ｉ］の添字ｉが要素Ａ［ｉ］の左側に示されている。また、配列の添字ｉは０から始まる整数で表されている。 Here, the notation of the arrangement | sequence in this specification is demonstrated. In general, an array element A can be expressed as A [i], where i is a subscript. However, in the drawing, an array element A [i] is within a region surrounded by a solid line. The boundary between the element A [i] and the element A [i + 1] is indicated by a dotted line. The subscript i of element A [i] is shown on the left side of element A [i]. Further, the subscript i of the array is represented by an integer starting from 0.

もう一度図３に戻ると、性別に関しては、表形式データのレコード０に対応する内部データの並び順番号は、配列ＯｒｄＳｅｔ３０１から「０」であることがわかる。並び順番号が「０」であるレコードに関する実際の性別の値、即ち、「男」又は「女」は、実際の値が所定の順序に従ってソートされた値リストである項目値配列３０３（以下、項目値配列、すなわち、値リストを「ＶＬ」のように略記する。）へのポインタ配列である項目値番号配列３０２（以下、項目値番号配列、すなわち、ポインタ配列を「ＶＮｏ」のように略記する。）を参照することによって取得できる。ポインタ配列３０２は、配列ＯｒｄＳｅｔ３０１に格納されている並び順番号の順に従って、実際の値リスト３０３中の要素を指し示すポインタを格納している。これにより、表形式データのレコード「０」に対応する性別の項目値は、（１）配列ＯｒｄＳｅｔ３０１からレコード「０」に対応する並び順番号「０」を取り出し、（２）値リストへのポインタ配列３０２から並び順番号「０」に対応する要素「１」を取り出し、（３）値リスト３０３から、値リストへのポインタ配列３０２から取り出された要素「１」によって指し示される要素「女」を取り出すことにより取得できる。 Returning to FIG. 3 again, regarding gender, it can be seen that the arrangement order number of the internal data corresponding to the record 0 of the tabular data is “0” from the array OrdSet 301. The actual gender value relating to the record whose arrangement order number is “0”, that is, “male” or “female” is an item value array 303 (hereinafter referred to as a value list) in which the actual values are sorted in a predetermined order. The item value array 302, which is a pointer array to the item value array, that is, the value list is abbreviated as “VL” (hereinafter, the item value number array, ie, the pointer array is abbreviated as “VNo”). Can be obtained by referring to The pointer array 302 stores pointers that point to elements in the actual value list 303 in the order of the arrangement order numbers stored in the array OrdSet 301. As a result, the gender item value corresponding to the record “0” of the tabular data is (1) the arrangement order number “0” corresponding to the record “0” is extracted from the array OrdSet 301 and (2) a pointer to the value list The element “1” corresponding to the sequence number “0” is extracted from the array 302, and (3) the element “woman” indicated by the element “1” extracted from the value array 303 from the pointer array 302 to the value list. Can be obtained by taking out

他のレコードに対しても、また、年齢及び身長に関しても同様に項目値を取得することができる。 The item values can be acquired in the same manner for other records and also for age and height.

このように表形式データは、値リストＶＬと、値リストへのポインタ配列ＶＮｏの組合せにより表現され、この組合せを、特に、「情報ブロック」と称する。図３には、性別、年齢及び身長に関する情報ブロックがそれぞれ情報ブロック３０８、３０９及び３１０として示されている。 In this way, the tabular data is expressed by a combination of the value list VL and the pointer array VNo to the value list, and this combination is particularly referred to as an “information block”. In FIG. 3, information blocks regarding gender, age, and height are shown as information blocks 308, 309, and 310, respectively.

単一のコンピュータが、単一のメモリ（物理的には複数であっても良いが、単一のアドレス空間に配置されアクセスされるという意味で単一のメモリ）であれば、当該メモリに、順序集合の配列ＯｒｄＳｅｔ、各情報ブロックを構成する値リストＶＬおよびポインタ配列ＶＮｏとを記憶しておけばよい。しかしながら、大量のレコードを保持するためには、その大きさに伴ってメモリ容量も大きくなるため、これらの大量のレコードを並列処理できるのが望ましい。 If a single computer is a single memory (which can be physically multiple, but a single memory in the sense that it is located and accessed in a single address space) An ordered set array OrdSet, a value list VL constituting each information block, and a pointer array VNo may be stored. However, in order to hold a large number of records, the memory capacity increases with the size, so it is desirable that these large numbers of records can be processed in parallel.

そこで、本実施形態においては、複数台のプロセッサが共有メモリに記憶されたレコードのデータにアクセスし、複数台のプロセッサの並列処理により、高速な集計を実現している。 Therefore, in this embodiment, a plurality of processors access the record data stored in the shared memory, and high-speed aggregation is realized by parallel processing of the plurality of processors.

［多次元集計処理］
次に、本実施形態によるメモリ共有型マルチプロセッサシステムにおける多次元集計処理を説明する。本実施形態による集計処理は、本質的に複数台のプロセッサが同じ処理を実行することによって達成される。すなわち、この集計処理は、単一の集計処理命令を複数台のプロセッサへ与えることにより、複数台のプロセッサが並列に動作して集計処理を実行できるように構成されている。複数台のプロセッサは、同じ動作を実行するので、一つのプログラムを作成するだけで、並列処理を実現できる。[Multidimensional aggregation processing]
Next, multidimensional tabulation processing in the memory shared multiprocessor system according to the present embodiment will be described. The aggregation process according to the present embodiment is achieved essentially by a plurality of processors executing the same process. In other words, this aggregation process is configured so that a plurality of processors can operate in parallel and execute the aggregation process by giving a single aggregation processing instruction to a plurality of processors. Since a plurality of processors execute the same operation, parallel processing can be realized only by creating one program.

図４は本発明の一実施形態による表形式データの多次元集計処理のフローチャートである。この多次元集計処理は、図１に示されるようなコンピュータシステム１０によって実行される。コンピュータシステム１０の共有メモリ１４は、表形式データのレコードのレコード番号が所定のレコード順に従って格納されたレコード番号配列、上記表形式データの上記レコードの所定の項目の項目値に対応する項目値番号がレコード番号に従って格納された項目値番号配列、及び、上記表形式データの項目値が当該項目値に対応する項目値番号の順序に従って格納された項目値配列を記憶する。複数台のプロセッサ１２−１，・・・，１２−ｐは共有メモリ１４にアクセス可能である。 FIG. 4 is a flowchart of multi-dimensional tabulation processing of tabular data according to an embodiment of the present invention. This multidimensional tabulation process is executed by a computer system 10 as shown in FIG. The shared memory 14 of the computer system 10 includes a record number array in which the record numbers of the tabular data records are stored in a predetermined record order, and an item value number corresponding to the item value of the predetermined item of the record of the tabular data. Is stored according to the record number, and the item value array in which the item values of the tabular data are stored according to the order of the item value numbers corresponding to the item values is stored. The plurality of processors 12-1,..., 12-p can access the shared memory 14.

各プロセッサは、上記所定の項目の組に関する項目値の組に対応する項目値番号の組の出現回数を並列的に数え上げ（ステップ４０１）、上記出現回数の累積度数分布を表現する累積度数分布配列を上記共有メモリに生成する（ステップ４０２）。所定の項目の組とは、多次元集計の際に用いられる次元の組に相当し、項目値の組は次元値の組に相当する。 Each processor enumerates in parallel the number of occurrences of the set of item value numbers corresponding to the set of item values relating to the set of predetermined items (step 401), and a cumulative frequency distribution array expressing the cumulative frequency distribution of the number of appearances Is generated in the shared memory (step 402). The predetermined item set corresponds to a dimension set used in multi-dimensional aggregation, and the item value set corresponds to a dimension value set.

次に、各プロセッサは、上記レコード番号配列の要素の値を添字として使って上記項目値番号の組を並列的に読み出し、上記項目値番号の組によって計算された値を添字として使って上記累積度数分布配列の要素を並列的に読み出し、上記累積度数分布配列の要素をインデックスポインタとして利用して上記レコード番号配列の要素の値をソートバージョンのレコード番号配列（すなわち、ソートされたレコード番号配列）に並列的に格納する（ステップ４０３）。たとえば、項目値番号の組は、各項目値番号が各桁に対応するように基数表現された値に変換することが可能である。この基数表現への変換は、乗算の代わりにビットシフト演算を用いることによって高速化される。しかし、項目値番号の組から値への変換は、基数表現に限定されることはなく、変換後の値から元の項目値番号の組へ逆変換できる演算であれば、どのような変換でも構わない。これにより、元のレコード番号配列の要素は、項目値の組に対応する項目値番号の組に関してソートされ、ソートされた要素が格納されたソートバージョンのレコード番号配列が生成される。 Next, each processor reads the item value number set in parallel using the element value of the record number array as a subscript, and uses the value calculated by the item value number set as a subscript to perform the accumulation. Reads the elements of the frequency distribution array in parallel and uses the elements of the cumulative frequency distribution array as an index pointer to convert the values of the elements of the record number array into a sorted version of the record number array (ie, the sorted record number array) (Step 403). For example, a set of item value numbers can be converted to a value expressed in a radix so that each item value number corresponds to each digit. The conversion to the radix representation is accelerated by using a bit shift operation instead of multiplication. However, the conversion from a pair of item value numbers to a value is not limited to the radix representation, and any conversion can be used as long as it can be converted back from the converted value to the original set of item value numbers. I do not care. As a result, the elements of the original record number array are sorted with respect to the set of item value numbers corresponding to the set of item values, and a sorted version of the record number array in which the sorted elements are stored is generated.

さらに、少なくとも１台のプロセッサが、上記インデックスポインタとして利用された上記累積度数分布配列の要素の値に従って上記ソートバージョンのレコード番号配列を上記一意の項目値の組を有するレコードのレコード番号のグループに区分する（ステップ４０４）。これにより、レコードは、同じ項目値の組、すなわち、同じ次元値の組を有するレコードのグループに分類される。 Further, at least one processor converts the record number array of the sorted version into the record number group of the record having the unique item value pair according to the value of the element of the cumulative frequency distribution array used as the index pointer. Partition (step 404). As a result, the records are classified into groups of records having the same set of field values, that is, the same set of dimension values.

少なくとも１台のプロセッサが、上記区分されたレコード番号の各グループから１個のレコード番号を抽出することにより、上記一意の項目値の組を特定する（ステップ４０５）。これにより、多次元集計の際に利用される一意の次元値の組が特定される。 At least one processor identifies one set of the unique item values by extracting one record number from each group of the divided record numbers (step 405). Thereby, a unique set of dimension values used in multidimensional aggregation is specified.

最後に、各プロセッサは、上記所定の項目値の組毎に別の項目に関する項目値を並列的に集計する（ステップ４０６）。 Finally, each processor totals item values related to different items for each set of the predetermined item values in parallel (step 406).

以下では、多次元並列集計処理において用いられる多次元並列ソート処理（多項目一括ソート処理又は多項目逐次ソート処理）、ユニーク行抽出処理、及び、測度並列算出処理（レコード番号昇順集計処理又はグループ番号昇順集計処理）について詳述する。 In the following, multidimensional parallel sort processing (multi-item batch sort processing or multi-item sequential sort processing), unique row extraction processing, and measure parallel calculation processing (record number ascending order summarization processing or group number) used in multidimensional parallel tabulation processing (Ascending process) will be described in detail.

さらに、本発明の種々の実施形態では、対称型マルチプロセッサシステムの特長を十分に発揮させるため、種々の配列に適用される処理は、各配列を複数台のプロセッサにほぼ均等に割り当てることにより、複数台のプロセッサによって並列処理される。以下では、発明の理解を助けるために、プロセッサの台数ｎが４であるとして説明するが、プロセッサの台数が４台に限定されないことは当業者に明白であろう。 Furthermore, in various embodiments of the present invention, in order to fully demonstrate the features of the symmetric multiprocessor system, the processing applied to the various arrays is performed by assigning each array to a plurality of processors substantially evenly. Parallel processing is performed by a plurality of processors. In the following, in order to help the understanding of the invention, it is assumed that the number of processors n is four, but it will be apparent to those skilled in the art that the number of processors is not limited to four.

［多項目一括ソート処理］
多項目一括ソート処理とは、表形式データの複数の項目に関するソートを一括に行う処理であり、（ｉ）ソート処理が高速化される、（ｉｉ）プログラムの記述が容易化される、（ｉｉｉ）同一の項目値の組を有するレコードの抽出が簡単化される、という利点がある。[Multi-item batch sort processing]
The multi-item batch sort process is a process that batch-sorts a plurality of items of tabular data, and (i) the sort process is accelerated, (ii) the program description is simplified, (iii) ) There is an advantage that the extraction of records having the same set of field values is simplified.

図５は、本発明の一実施形態による多項目一括ソート処理の説明図である。同図に示された表形式データ５０１は、レコード番号が０から１５で表される１６個のレコードを含み、各レコードには、「年齢」という項目Ａｇｅと「クラス」という項目Ｃｌａｓｓが含まれている。項目「年齢」は多次元集計の第１の次元Ｄｉｍ−１に対応し、項目「クラス」は多次元集計の第２の次元Ｄｉｍ−２に対応している。この表形式データは、本実施形態では、レコード番号配列５０３と、第１の次元に関する項目値番号配列５１１及び項目値配列５１２と、第２の次元に関する項目値番号配列５２１及び項目値配列５２２というデータ構造５０２の形で共有メモリ１４上に構築されている。 FIG. 5 is an explanatory diagram of multi-item batch sort processing according to an embodiment of the present invention. The tabular data 501 shown in the figure includes 16 records whose record numbers are represented by 0 to 15, and each record includes an item Age “age” and an item class “class”. ing. The item “age” corresponds to the first dimension Dim-1 of the multidimensional aggregation, and the item “class” corresponds to the second dimension Dim-2 of the multidimensional aggregation. In this embodiment, the tabular data is referred to as a record number array 503, an item value number array 511 and an item value array 512 for the first dimension, and an item value number array 521 and an item value array 522 for the second dimension. It is constructed on the shared memory 14 in the form of a data structure 502.

このような表形式データは、「クラス」を下位次元とし、「年齢」を上位次元として、「クラス」及び「年齢」の２次元に関してソートすると、同図の矢印の右側に示されるような表形式データに変換される。ソート順は、各項目の項目値配列に格納されている項目値の順番と一致する。よって、同図の例では、「年齢」は昇順にソートされ、「クラス」はアルファベット順にソートされている。しかし、実際のソートは、項目値自体ではなく、項目値に対応する項目値番号に関して行われるので、本実施形態は実データのデータ形式とは無関係に適用できるという利点がある。 Such a tabular data is a table such as that shown on the right side of the arrow in the figure when “Class” is the lower dimension, “Age” is the upper dimension, and the “Class” and “Age” dimensions are sorted. Converted to format data. The sort order matches the order of the item values stored in the item value array of each item. Therefore, in the example of FIG. 3, “Age” is sorted in ascending order, and “Class” is sorted in alphabetical order. However, since the actual sorting is performed not on the item value itself but on the item value number corresponding to the item value, this embodiment has an advantage that it can be applied regardless of the data format of the actual data.

表形式データ５５１は、多項目一括ソートの結果に従って表形式データの行の順序を入れ替えたものである。このソート結果は、本実施形態では、レコード番号配列５０３からレコード番号配列５５３への要素の順序の入れ替えによって表現されている。尚、本実施形態の説明において、多項目一括ソート処理の次元数は２として説明しているが、次数が２に限定されないことは当業者に明白であろう。 The tabular data 551 is obtained by changing the order of the rows of tabular data according to the result of the multi-item batch sort. In this embodiment, this sort result is expressed by changing the order of elements from the record number array 503 to the record number array 553. In the description of the present embodiment, the number of dimensions of the multi-item batch sort process is described as 2, but it will be apparent to those skilled in the art that the order is not limited to 2.

次に、図５に示された表形式データの例を用いて、多項目一括ソート処理を詳細に説明する。 Next, the multi-item batch sort process will be described in detail using an example of the tabular data shown in FIG.

多項目一括ソート処理は、複数の項目のそれぞれに関する各項目値を組み合わせて新たな仮想的な項目値番号（以下では、仮想項目値番号と称する）を生成し、この仮想項目値番号に基づいてレコード番号を一度にソートする処理である。項目値番号の組み合わせは、上位次元の項目値番号が上位の桁、下位次元の項目値番号が下位の桁となるように、項目値番号の組を基数形式で表現することによって仮想項目値番号に変換される。基数は、各項目値番号が取り得る値の範囲に応じて決められる。たとえば、図５の例のように、第１の次元の項目値番号が０から２までの値を取り、第２の次元の項目値番号が０から３の値を取るならば、基数として４以上の整数を設定することにより、項目値番号の組み合わせが、４を基数とする基数形式で表現された仮想項目値番号に変換できる。 The multi-item batch sort process generates a new virtual item value number (hereinafter referred to as a virtual item value number) by combining each item value related to each of a plurality of items, and based on this virtual item value number This process sorts record numbers at a time. The combination of field value numbers is a virtual field value number by expressing the field value number pair in radix format so that the field value number of the higher dimension is the upper digit and the field value number of the lower dimension is the lower digit. Is converted to The radix is determined according to the range of values that each item value number can take. For example, if the item value number of the first dimension takes a value from 0 to 2 and the item value number of the second dimension takes a value from 0 to 3, as in the example of FIG. By setting the above integers, the combination of item value numbers can be converted into virtual item value numbers expressed in a radix format with a base of 4.

図６は本発明の一実施形態による仮想項目値番号の計算を説明する図である。本実施形態では、４台のプロセッサＣＰＵ−０、ＣＰＵ−１、ＣＰＵ−２、ＣＰＵ−３のそれぞれは、レコード番号の個数と一致するサイズを有する項目値番号配列のそれぞれの部分を受け持ち、第１の次元の項目値番号と第２の次元の項目値番号を組み合わせて仮想項目値番号を生成する。 FIG. 6 is a diagram for explaining calculation of a virtual item value number according to an embodiment of the present invention. In the present embodiment, each of the four processors CPU-0, CPU-1, CPU-2, and CPU-3 is responsible for each part of the item value number array having a size that matches the number of record numbers. A virtual item value number is generated by combining the item value number of the first dimension and the item value number of the second dimension.

具体的には、ｉがレコード番号を表し（本例では、ｉ＝０，１，２，・・・，１５）、第１の次元の項目値番号配列をＶＮｏ＿１（図５及び６において５１１で示されている）、第２の次元の項目値番号配列ＶＮｏ＿２（図５及び６において５２１で示されている）、仮想項目値番号配列をＶ＿ＶＮｏ（図６において６０１で示されている）とすると、
ＣＰＵ−０が、受け持ち範囲６０２、すなわち、ｉ＝０，１，２，３に対して、
Ｖ＿ＶＮｏ［ｉ］＝ＶＮｏ＿１［ｉ］＊４＋ＶＮｏ＿２［ｉ］
を計算し、同様に、ＣＰＵ−１が、受け持ち範囲６０３、すなわち、ｉ＝４，５，６，７に対して、
Ｖ＿ＶＮｏ［ｉ］＝ＶＮｏ＿１［ｉ］＊４＋ＶＮｏ＿２［ｉ］
を計算し、ＣＰＵ−２が、受け持ち範囲６０４、すなわち、ｉ＝８，９，１０，１１に対して、
Ｖ＿ＶＮｏ［ｉ］＝ＶＮｏ＿１［ｉ］＊４＋ＶＮｏ＿２［ｉ］
を計算し、ＣＰＵ−３が、受け持ち範囲６０５、すなわち、ｉ＝１２，１３，１４，１５に対して、
Ｖ＿ＶＮｏ［ｉ］＝ＶＮｏ＿１［ｉ］＊４＋ＶＮｏ＿２［ｉ］
を計算する。Specifically, i represents a record number (in this example, i = 0, 1, 2,..., 15), and an item value number array of the first dimension is designated as VNo_1 (511 in FIGS. 5 and 6). The second dimension item value number array VNo_2 (indicated by 521 in FIGS. 5 and 6), and the virtual item value number array V_VNo (indicated by 601 in FIG. 6). ,
CPU-0 is in charge range 602, i.e., i = 0, 1, 2, 3,
V_VNo [i] = VNo_1 [i] * 4 + VNo_2 [i]
Similarly, CPU-1 is in charge range 603, i.e., for i = 4, 5, 6, 7
V_VNo [i] = VNo_1 [i] * 4 + VNo_2 [i]
CPU-2 calculates the range of responsibility 604, i.e., i = 8, 9, 10, 11
V_VNo [i] = VNo_1 [i] * 4 + VNo_2 [i]
CPU-3 calculates the range of responsibility 605, i.e., i = 12, 13, 14, 15,
V_VNo [i] = VNo_1 [i] * 4 + VNo_2 [i]
Calculate

上記の説明では、仮想項目値番号配列の計算は、乗算と加算によって実現されているが、計算を高速化するため、好ましくは、乗算の代わりにビットシフト演算が採用される。 In the above description, the calculation of the virtual item value number array is realized by multiplication and addition. However, in order to speed up the calculation, a bit shift operation is preferably employed instead of multiplication.

次に、各プロセッサは、それぞれの受け持つレコード番号配列の範囲内の要素に関して、仮想項目値番号の出現回数を数え上げる。図７は本発明の一実施形態による仮想項目値番号を数え上げる処理の説明図である。この数え上げ処理もまた、プロセッサが受け持ち範囲を分担する。本例では、レコード番号配列５０３は、４台のプロセッサのそれぞれの受け持ち範囲７０１、７０２、７０３及び７０４に分割される。 Next, each processor counts up the number of occurrences of the virtual item value number with respect to the elements within the range of the record number array that it handles. FIG. 7 is an explanatory diagram of processing for counting up virtual item value numbers according to an embodiment of the present invention. This counting process is also shared by the processor. In this example, the record number array 503 is divided into the respective handling ranges 701, 702, 703, and 704 of the four processors.

レコード番号をｉ、レコード番号配列をＯｒｄＳｅｔ、仮想項目値番号配列をＶ＿ＶＮｏ、プロセッサ番号をｊ（ｊ＝０，１，２，３）としてプロセッサｊが数え上げた仮想項目値番号の個数を格納するカウント配列をＣｏｕｎｔ［ｊ］とすると、本実施形態による数え上げ処理は、たとえば、Ｃ言語スタイルでは、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
Ｃｏｕｎｔ［ｊ］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］＋＋；
｝
｝
と記述される。図７の例では、初期化されたカウント配列７１０、７１１、７１２及び７１３は、仮想項目値番号の数え上げ処理によって、カウント配列７２０、７２１、７２２及び７２３となる。尚、本例において、第１の項目値配列をＶＬ＿１、第２の項目値配列をＶＬ＿２とすると、カウント配列Ｃｏｕｎｔのサイズは、（ＶＬ＿１のサイズ）×（ＶＬ＿２）のサイズに一致する。Count that stores the number of virtual item value numbers counted by processor j, with record number i, record number array OrdSet, virtual item value number array V_VNo, processor number j (j = 0, 1, 2, 3) When the array is Count [j], the counting process according to the present embodiment is, for example, in C language style.
for (i = 0; i <4; i ++) {
for (j = 0; j <4; j ++) {
Count [j] [V_VNo [OrdSet [j * 4 + i]]] ++;
}
}
Is described. In the example of FIG. 7, the initialized count arrays 710, 711, 712, and 713 become count arrays 720, 721, 722, and 723 by the virtual item value number counting process. In this example, when the first item value array is VL_1 and the second item value array is VL_2, the size of the count array Count is equal to the size of (VL_1 size) × (VL_2).

この数え上げ処理の結果は、たとえば、ＣＰＵ−０の受け持ち範囲内では、仮想順位番号＝０、２、５及び９がそれぞれ１個ずつ存在し、ＣＰＵ−１の受け持ち範囲内では、仮想順位番号＝６、７、１０及び１１がそれぞれ１個ずつ存在し、ＣＰＵ−２の受け持ち範囲内では、仮想順位番号０、３、４及び５がそれぞれ１個ずつ存在し、ＣＰＵ−３の受け持ち範囲内では、仮想順位番号＝２、５、１０及び１１がそれぞれ１個ずつ存在することを表している。 As a result of the counting process, for example, one virtual rank number = 0, 2, 5, and 9 exists in the CPU-0 range, and in the CPU-1 range, the virtual rank number = There is one each of 6, 7, 10 and 11, and within the CPU-2 range, there is one virtual rank number 0, 3, 4 and 5, and within the CPU-3 range. , The virtual rank numbers = 2, 5, 10 and 11 each exist.

続いて、本発明の一実施形態では、コンピュータシステム１０は、仮想項目値番号配列の各要素を累計数化して、仮想項目値番号の出現回数の累積度数分布配列を生成する。図８は、本発明の一実施形態による累積度数分布配列を生成する処理の説明図である。プロセッサｊの受け持ち範囲内のカウント配列をＣｏｕｎｔ［ｊ］、累積度数分布配列をＡｇｇｒ［ｊ］とすると、累積度数分布は、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ａｇｇｒ［３］［０］＋Ｃｏｕｎｔ［３］［０］
Ａｇｇｒ［１］［１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］
Ａｇｇｒ［２］［１］＝Ａｇｇｒ［１］［１］＋Ｃｏｕｎｔ［１］［１］
Ａｇｇｒ［３］［１］＝Ａｇｇｒ［２］［１］＋Ｃｏｕｎｔ［２］［１］
Ａｇｇｒ［０］［２］＝Ａｇｇｒ［３］［１］＋Ｃｏｕｎｔ［３］［１］
・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・
途中省略
・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・
Ａｇｇｒ［３］［１０］＝Ａｇｇｒ［２］［１０］＋Ｃｏｕｎｔ［２］［１０］
Ａｇｇｒ［０］［１１］＝Ａｇｇｒ［３］［１０］＋Ｃｏｕｎｔ［３］［１０］
Ａｇｇｒ［１］［１１］＝Ａｇｇｒ［０］［１１］＋Ｃｏｕｎｔ［０］［１１］
Ａｇｇｒ［２］［１１］＝Ａｇｇｒ［１］［１１］＋Ｃｏｕｎｔ［１］［１１］
Ａｇｇｒ［３］［１１］＝Ａｇｇｒ［２］［１１］＋Ｃｏｕｎｔ［２］［１１］
として計算される。この計算は、４台のプロセッサのうちのいずれか１台のプロセッサが単独で行ってもよいが、４台のプロセッサに仮想項目値番号の範囲を割り当てて、４台のプロセッサによって並列に行ってもよい。Subsequently, in one embodiment of the present invention, the computer system 10 accumulates each element of the virtual item value number array to generate a cumulative frequency distribution array of the number of appearances of the virtual item value number. FIG. 8 is an explanatory diagram of processing for generating a cumulative frequency distribution array according to an embodiment of the present invention. When the count array within the range of the processor j is Count [j] and the cumulative frequency distribution array is Aggr [j], the cumulative frequency distribution is
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Aggr [3] [0] + Count [3] [0]
Aggr [1] [1] = Aggr [0] [1] + Count [0] [1]
Aggr [2] [1] = Aggr [1] [1] + Count [1] [1]
Aggr [3] [1] = Aggr [2] [1] + Count [2] [1]
Aggr [0] [2] = Aggr [3] [1] + Count [3] [1]
・・・・・・・・・・・・・・・・・・・・・・・・・・
Omitted along the way ...
Aggr [3] [10] = Aggr [2] [10] + Count [2] [10]
Aggr [0] [11] = Aggr [3] [10] + Count [3] [10]
Aggr [1] [11] = Aggr [0] [11] + Count [0] [11]
Aggr [2] [11] = Aggr [1] [11] + Count [1] [11]
Aggr [3] [11] = Aggr [2] [11] + Count [2] [11]
Is calculated as This calculation may be performed by any one of the four processors alone, but the range of virtual item value numbers is assigned to the four processors and performed in parallel by the four processors. Also good.

４台のプロセッサによって並列的に累積度数分布を計算する場合には、最初に、各仮想項目値番号ｉの出現回数の合計値Ｓｕｍ［ｉ］を、たとえば、
ｆｏｒ（ｉ＝０；ｉ＜１２；ｉ＋＋）｛
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
Ｓｕｍ［ｉ］＝Ｃｏｕｎｔ［ｊ］［ｉ］；
｝
｝
によって、並列的に計算する。本実施形態では、たとえば、ＣＰＵ−０がｉ＝０，１，２のループを担当し、ＣＰＵ−１がｉ＝３，４，５のループを担当し、ＣＰＵ−２がｉ＝６，７，８のループを担当し、ＣＰＵ−３がｉ＝９，１０，１１のループを担当する。When the cumulative frequency distribution is calculated in parallel by four processors, first, the total value Sum [i] of the number of occurrences of each virtual item value number i is, for example,
for (i = 0; i <12; i ++) {
for (j = 0; j <4; j ++) {
Sum [i] = Count [j] [i];
}
}
To calculate in parallel. In this embodiment, for example, CPU-0 is in charge of a loop of i = 0, 1, 2, CPU-1 is in charge of a loop of i = 3, 4, 5, and CPU-2 is i = 6, 7 , 8 and CPU-3 is responsible for i = 9, 10, 11 loops.

続いて、たとえば、４台のプロセッサが並列的に、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ｓｕｍ［０］
Ａｇｇｒ［１］［１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］
Ａｇｇｒ［２］［１］＝Ａｇｇｒ［１］［１］＋Ｃｏｕｎｔ［１］［１］
Ａｇｇｒ［３］［１］＝Ａｇｇｒ［２］［１］＋Ｃｏｕｎｔ［２］［１］
Ａｇｇｒ［０］［２］＝Ｓｕｍ［１］
・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・
途中省略
・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・・
Ａｇｇｒ［３］［１０］＝Ａｇｇｒ［２］［１０］＋Ｃｏｕｎｔ［２］［１０］
Ａｇｇｒ［０］［１１］＝Ｓｕｍ［１０］
Ａｇｇｒ［１］［１１］＝Ａｇｇｒ［０］［１１］＋Ｃｏｕｎｔ［０］［１１］
Ａｇｇｒ［２］［１１］＝Ａｇｇｒ［１］［１１］＋Ｃｏｕｎｔ［１］［１１］
Ａｇｇｒ［３］［１１］＝Ａｇｇｒ［２］［１１］＋Ｃｏｕｎｔ［２］［１１］
を実行することによって、累積度数分布配列が得られる。Subsequently, for example, four processors are connected in parallel,
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Sum [0]
Aggr [1] [1] = Aggr [0] [1] + Count [0] [1]
Aggr [2] [1] = Aggr [1] [1] + Count [1] [1]
Aggr [3] [1] = Aggr [2] [1] + Count [2] [1]
Aggr [0] [2] = Sum [1]
・・・・・・・・・・・・・・・・・・・・・・・・・・
Omitted along the way ...
Aggr [3] [10] = Aggr [2] [10] + Count [2] [10]
Aggr [0] [11] = Sum [10]
Aggr [1] [11] = Aggr [0] [11] + Count [0] [11]
Aggr [2] [11] = Aggr [1] [11] + Count [1] [11]
Aggr [3] [11] = Aggr [2] [11] + Count [2] [11]
To obtain a cumulative frequency distribution array.

図８には、最終的な累積度数分布配列８０１、８０２、８０３及び８０４が示されている。 FIG. 8 shows final cumulative frequency distribution arrays 801, 802, 803, and 804.

本発明の一実施形態による多項目一括ソート処理では、最後に、元のレコード番号配列の要素をソート後のレコード番号配列に転送する。図９は、本発明の一実施形態によるレコード番号配列の要素を並べ替える処理の説明図である。この処理では、レコード番号配列５０３は、図７と同様に、４台のプロセッサの受け持ち範囲７０１、７０２、７０３及び７０４に分割され、４台のプロセッサが並列処理を実行する。元のレコード番号配列をＯｒｄＳｅｔ、ソート後のレコード番号配列をＯｒｄＳｅｔ’、仮想項目値番号配列をＶ＿ＶＮｏ、累積度数分布配列をＡｇｇｒ［ｊ］（ｊはプロセッサ番号）とすると、プロセッサ０番、すなわち、ＣＰＵ−０による並べ替え処理は、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ＯｒｄＳｅｔ’［Ａｇｇｒ［０］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｉ］］］］＝ＯｒｄＳｅｔ［ｉ］；
Ａｇｇｒ［０］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｉ］］］＋＋；
｝
と記述できる。ここで、Ａｇｇｒ［０］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｉ］］］＋＋から、累積度数分布配列の要素がインデックスポインタとして利用されていることがわかる。たとえば、図８では、レコード番号０に対して、
ＯｒｄＳｅｔ［０］＝０
Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［０］］＝Ｖ＿ＶＮｏ［０］＝５
Ａｇｇｒ［０］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［０］］］＝Ａｇｇｒ［０］［５］＝６
ＯｒｄＳｅｔ’［Ａｇｇｒ［０］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［０］］］］＝ＯｒｄＳｅｔ’［６］
となり、したがって、
ＯｒｄＳｅｔ’［６］＝０
Ａｇｇｒ［０］［５］＝７
となることが示されている。In the multi-item batch sort process according to the embodiment of the present invention, finally, the elements of the original record number array are transferred to the sorted record number array. FIG. 9 is an explanatory diagram of the process of rearranging the elements of the record number array according to one embodiment of the present invention. In this process, the record number array 503 is divided into four processor ranges 701, 702, 703, and 704 as in FIG. 7, and the four processors execute parallel processing. If the original record number array is OrdSet, the sorted record number array is OrdSet ', the virtual item value number array is V_VNo, and the cumulative frequency distribution array is Aggr [j] (j is a processor number), processor 0, The reordering process by CPU-0 is
for (i = 0; i <4; i ++) {
OrdSet ′ [Aggr [0] [V_VNo [OrdSet [i]]]] = OrdSet [i];
Aggr [0] [V_VNo [OrdSet [i]]] ++;
}
Can be described. Here, Aggr [0] [V_VNo [OrdSet [i]]] ++ indicates that the elements of the cumulative frequency distribution array are used as index pointers. For example, in FIG. 8, for record number 0,
OrdSet [0] = 0
V_VNo [OrdSet [0]] = V_VNo [0] = 5
Aggr [0] [V_VNo [OrdSet [0]]] = Aggr [0] [5] = 6
OrdSet ′ [Aggr [0] [V_VNo [OrdSet [0]]]] = OrdSet ′ [6]
And therefore
OrdSet ′ [6] = 0
Aggr [0] [5] = 7
It has been shown that

さらに、プロセッサ番号ｊをループ変数とすると、本実施形態によるレコード番号配列の要素を並べ替える処理は、
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ＯｒｄＳｅｔ’［Ａｇｇｒ［ｊ］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］］＝ＯｒｄＳｅｔ［ｊ＊４＋ｉ］；
Ａｇｇｒ［ｊ］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］＋＋；
｝
｝
と記述できる。Furthermore, assuming that the processor number j is a loop variable, the process of rearranging the elements of the record number array according to this embodiment is as follows:
for (j = 0; j <4; j ++) {
for (i = 0; i <4; i ++) {
OrdSet ′ [Aggr [j] [V_VNo [OrdSet [j * 4 + i]]]] = OrdSet [j * 4 + i];
Aggr [j] [V_VNo [OrdSet [j * 4 + i]]] ++;
}
}
Can be described.

図１０は、本発明の一実施形態によるレコード番号配列の要素を並べ替える処理の結果を説明する図である。累積度数分布配列１００１、１００２、１００３及び１００４は、ソート後の累積度数分布配列を表し、ソートバージョンのレコード番号配列１００５はソート後のレコード番号配列を表している。このように、本発明の一実施形態によるレコード番号配列の要素を並べ替える処理により、元のレコード番号配列５０３の要素は、第１の次元と第２の次元の二つの次元に関して一括的にソートされ、ソートバージョンのレコード番号配列１００５に格納されている。 FIG. 10 is a diagram for explaining the result of the process of rearranging the elements of the record number array according to an embodiment of the present invention. Cumulative frequency distribution arrays 1001, 1002, 1003, and 1004 represent sorted cumulative frequency distribution arrays, and a sorted version record number array 1005 represents a sorted record number array. As described above, the element of the original record number array 503 is collectively sorted with respect to the two dimensions of the first dimension and the second dimension by the process of rearranging the elements of the record number array according to the embodiment of the present invention. And stored in the sorted version record number array 1005.

［ユニーク行抽出処理］
多項目一括ソート処理によって、多次元集計の対象となるレコードは複数の次元に関して所定の順序でソートされている。多次元集計を行うためには、次元値を特定する必要がある。そのため、本発明の一実施形態では、同一の次元値を有するレコードの中から代表となるレコードを抽出する。本明細書では、このようなレコードを抽出する処理をユニーク行抽出処理とも称する。[Unique row extraction processing]
By the multi-item batch sort process, the records to be subjected to multi-dimensional aggregation are sorted in a predetermined order with respect to a plurality of dimensions. In order to perform multidimensional aggregation, it is necessary to specify a dimension value. Therefore, in one embodiment of the present invention, a representative record is extracted from records having the same dimension value. In this specification, the process of extracting such a record is also referred to as a unique row extraction process.

ユニーク行抽出処理では、複数台のプロセッサのうちの何れかのプロセッサが、インデックスポインタとして利用された累積度数分布配列の要素の値に従ってソートバージョンのレコード番号配列を一意の項目値の組を有するレコードのレコード番号のグループに区分し、区分されたレコード番号の各グループから１個のレコード番号を抽出することにより、一意の項目値の組を特定する。 In the unique row extraction process, a record in which any one of a plurality of processors has a set version of a record number array of a sorted version according to the value of an element of a cumulative frequency distribution array used as an index pointer. Are grouped into groups of record numbers, and one record number is extracted from each group of the segmented record numbers, thereby identifying a unique set of item values.

レコード番号をレコード番号配列に格納する際に、累積度数分布配列の要素をインデックスポインタとして使用すると、レコード番号を格納する毎に累積度数分布配列の要素がインクリメントされる。その結果、累積度数分布配列の要素は、レコード番号がレコード番号配列中に格納されている位置を表現している。よって、ソートバージョンのレコード番号配列の完成後に、累積度数分布配列の要素の値はソートバージョンのレコード番号配列中のレコード番号のグループの境界を指している。 When an element of the cumulative frequency distribution array is used as an index pointer when storing the record number in the record number array, the element of the cumulative frequency distribution array is incremented each time the record number is stored. As a result, the element of the cumulative frequency distribution array represents the position where the record number is stored in the record number array. Thus, after completion of the sorted version record number array, the value of the element of the cumulative frequency distribution array points to the boundary of the group of record numbers in the sorted version record number array.

図１１は、本発明の一実施形態によるユニーク行抽出処理の説明図である。ソート後の累積度数分布配列の要素、本例では、出現回数の数え上げを行う際に、レコード番号配列の最後の範囲を担当したＣＰＵ−４が生成したカウント配列に基づく累積度数分布配列１００４の要素は、レコード番号をグループに区分するため利用できる。より具体的に説明すると、累積度数分布配列がインデックスポインタとして利用されたという事実から、最終的な累積度数分布配列１００４の各要素が指し示しているソートバージョンのレコード番号配列１００５の要素、すなわち、
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［０］］＝ＯｒｄＳｅｔ’［２］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［１］］＝ＯｒｄＳｅｔ’［２］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［２］］＝ＯｒｄＳｅｔ’［４］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［３］］＝ＯｒｄＳｅｔ’［５］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［４］］＝ＯｒｄＳｅｔ’［６］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［５］］＝ＯｒｄＳｅｔ’［９］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［６］］＝ＯｒｄＳｅｔ’［１０］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［７］］＝ＯｒｄＳｅｔ’［１１］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［８］］＝ＯｒｄＳｅｔ’［１１］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［９］］＝ＯｒｄＳｅｔ’［１２］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［１０］］＝ＯｒｄＳｅｔ’［１４］
ＯｒｄＳｅｔ’［Ａｇｇｒ［３］［１１］］＝ＯｒｄＳｅｔ’［１６］
は、一意の項目値の組を有するレコードのグループの境界を表していることがわかる。FIG. 11 is an explanatory diagram of unique row extraction processing according to an embodiment of the present invention. Elements of the cumulative frequency distribution array after sorting, in this example, the elements of the cumulative frequency distribution array 1004 based on the count array generated by the CPU-4 in charge of the last range of the record number array when counting the number of appearances Can be used to divide record numbers into groups. More specifically, from the fact that the cumulative frequency distribution array is used as an index pointer, the elements of the sorted version of the record number array 1005 pointed to by each element of the final cumulative frequency distribution array 1004, that is,
OrdSet ′ [Aggr [3] [0]] = OrdSet ′ [2]
OrdSet ′ [Aggr [3] [1]] = OrdSet ′ [2]
OrdSet ′ [Aggr [3] [2]] = OrdSet ′ [4]
OrdSet ′ [Aggr [3] [3]] = OrdSet ′ [5]
OrdSet ′ [Aggr [3] [4]] = OrdSet ′ [6]
OrdSet ′ [Aggr [3] [5]] = OrdSet ′ [9]
OrdSet ′ [Aggr [3] [6]] = OrdSet ′ [10]
OrdSet ′ [Aggr [3] [7]] = OrdSet ′ [11]
OrdSet ′ [Aggr [3] [8]] = OrdSet ′ [11]
OrdSet ′ [Aggr [3] [9]] = OrdSet ′ [12]
OrdSet ′ [Aggr [3] [10]] = OrdSet ′ [14]
OrdSet ′ [Aggr [3] [11]] = OrdSet ′ [16]
Represents the boundary of a group of records having a unique set of field values.

これにより、図１１の例では、レコード番号は、
グループ０＝｛１，９｝（同図では、１１０１を参照）
グループ１＝｛３，１５｝（同図では、１１０２を参照）
グループ２＝｛１１｝（同図では、１１０３を参照）
グループ３＝｛８｝（同図では、１１０４を参照）
グループ４＝｛０，１０，１４｝（同図では、１１０５を参照）
グループ５＝｛５｝（同図では、１１０６を参照）
グループ６＝｛４｝（同図では、１１０７を参照）
グループ７＝｛２｝（同図では、１１０８を参照）
グループ８＝｛６，１２｝（同図では、１１０９を参照）
グループ９＝｛７，１３｝（同図では、１１１０を参照）
の１０個のグループに分類されることがわかる。Thus, in the example of FIG. 11, the record number is
Group 0 = {1, 9} (refer to 1101 in the figure)
Group 1 = {3, 15} (refer to 1102 in the figure)
Group 2 = {11} (refer to 1103 in the figure)
Group 3 = {8} (Refer to 1104 in the figure)
Group 4 = {0, 10, 14} (refer to 1105 in the figure)
Group 5 = {5} (See 1106 in the figure)
Group 6 = {4} (Refer to 1107 in the figure)
Group 7 = {2} (refer to 1108 in the figure)
Group 8 = {6,12} (refer to 1109 in the figure)
Group 9 = {7, 13} (refer to 1110 in the figure)
It can be seen that it is classified into 10 groups.

このようにグループは１０個しかないので、グループ毎に１個ずつのレコード番号、すなわち、全部で１０個のレコード番号を代表レコード番号として抽出することにより、一意の項目値の組を抽出することができる。 Since there are only 10 groups in this way, a unique set of item values is extracted by extracting one record number per group, that is, a total of 10 record numbers as representative record numbers. Can do.

図１１では、抽出された代表レコード番号を含む代表レコード番号配列ＯｒｄＳｅｔ’’が１１２０として示されている。 In FIG. 11, a representative record number array OrdSet ″ including the extracted representative record number is shown as 1120.

次に、ｉをループ変数、ＶＬ＿１及びＶＮｏ＿１を第１の次元の項目値配列及び項目値番号配列、ＶＬ＿２及びＶＮｏ＿２を第２の次元の項目値配列及び項目値番号配列とし、集計の第１の次元の次元値配列をＤｉｍ−１、第２の次元の次元値配列をＤｉｍ−２とすると、たとえば、
ｆｏｒ（ｉ＝０；ｉ＜１０；ｉ＋＋）｛
Ｄｉｍ−１［ｉ］＝ＶＬ＿１［ＶＮｏ＿１［ＯｒｄＳｅｔ’’［ｉ］］］；
Ｄｉｍ−２［ｉ］＝ＶＬ＿２［ＶＮｏ＿２［ＯｒｄＳｅｔ’’［ｉ］］］；
｝
を複数台のプロセッサで並列に実行することによって、集計の次元１１３０が作成される。Next, i is a loop variable, VL_1 and VNo_1 are the first dimension item value array and item value number array, VL_2 and VNo_2 are the second dimension item value array and item value number array, If the dimension value array of the dimension is Dim-1, and the dimension value array of the second dimension is Dim-2, for example,
for (i = 0; i <10; i ++) {
Dim-1 [i] = VL_1 [VNo_1 [OrdSet ''[i]]];
Dim-2 [i] = VL_2 [VNo_2 [OrdSet ''[i]]];
}
Is executed in parallel by a plurality of processors to create a total dimension 1130.

上記のユニーク行抽出処理において、一意の項目値の組は、項目値に関してソートされた順番に抽出されている。たとえば、一意の項目値の組を、元のレコード番号配列ＯｒｄＳｅｔに出現した順序で抽出するには、代表レコード番号配列の要素を元のレコード番号配列に出現した順序で並べ替えればよい。図１２及び１３は、本発明の一実施形態によるユニーク行抽出処理によって抽出された一意の項目値の組を元のレコード番号配列に出現した順序に並べ替える処理の説明図である。 In the above unique row extraction process, unique item value pairs are extracted in the order in which the item values are sorted. For example, in order to extract unique item value pairs in the order of appearance in the original record number array OrdSet, the elements of the representative record number array may be rearranged in the order of appearance in the original record number array. FIGS. 12 and 13 are explanatory diagrams of processing for rearranging the sets of unique item values extracted by the unique row extraction processing according to the embodiment of the present invention in the order of appearance in the original record number array.

この処理は、代表レコード番号配列に格納されたレコード番号に対応するビットマップフラグを作成するステップと、元のレコード番号の順にビットマップフラグをチェックし、フラグがセットされているレコード番号を元のレコード番号配列に格納されたレコード番号の順序で抽出するステップとを含む。これらのステップは、代表レコード番号配列及び元のレコード番号配列を複数台（本例では、４台）のＣＰＵに割り当てることによって並列処理可能である。 In this process, a bitmap flag corresponding to the record number stored in the representative record number array is created, and the bitmap flag is checked in the order of the original record number, and the record number in which the flag is set is changed to the original number. Extracting in the order of the record numbers stored in the record number array. These steps can be processed in parallel by assigning the representative record number array and the original record number array to a plurality of (four in this example) CPUs.

図１２を参照すると、代表レコード番号配列ＯｒｄＳｅｔ’’１１２０は、ＣＰＵ−０の受け持ち範囲１２００と、ＣＰＵ−１の受け持ち範囲１２０１と、ＣＰＵ−２の受け持ち範囲１２０２と、ＣＰＵ−３の受け持ち範囲１２０３とに分割されている。各プロセッサは、代表レコード番号配列１１２０からそのプロセッサ自体の受け持ち範囲内の要素、すなわち、レコード番号を読み出し、それぞれのビットマップフラグ配列１２１０、１２１１、１２１２及び１２１３の中で、読み出されたレコード番号に対応するビットをセットする。たとえば、ＣＰＵ−０は、ＯｒｄＳｅｔ’’［０］＝１、ＯｒｄＳｅｔ’’［１］＝３及びＯｒｄＳｅｔ’’［２］＝１１の３個のレコード番号を読み出し、ビットマップフラグ配列中の３個のフラグをセットする。すなわち、Ｆｌａｇ＿０［１］＝１、Ｆｌａｇ＿０［３］＝１、及び、Ｆｌａｇ＿０［１１］＝１である。ビットマップフラグ配列Ｆｌａｇ＿０のその他の要素は０のままである。ＣＰＵ−１、ＣＰＵ−２及びＣＰＵ−３も同様に、それぞれ、ビットマップフラグ配列Ｆｌａｇ＿１、Ｆｌａｇ＿２及びＦｌａｇ＿３を設定する。これらの処理は、複数台のプロセッサによって並列に実行される。 Referring to FIG. 12, the representative record number array OrdSet ″ 1120 includes a CPU-0 coverage range 1200, a CPU-1 coverage range 1201, a CPU-2 coverage range 1202, and a CPU-3 coverage range 1203. It is divided into and. Each processor reads an element within the range of the processor itself, that is, a record number, from the representative record number array 1120, and the record number read in each bitmap flag array 1210, 1211, 1212, and 1213. Set the bit corresponding to. For example, the CPU-0 reads out three record numbers of OrdSet ″ [0] = 1, OrdSet ″ [1] = 3 and OrdSet ″ [2] = 11, and outputs three in the bitmap flag array. Set the flag. That is, Flag_0 [1] = 1, Flag_0 [3] = 1, and Flag_0 [11] = 1. Other elements of the bitmap flag array Flag_0 remain 0. Similarly, CPU-1, CPU-2, and CPU-3 set bitmap flag arrays Flag_1, Flag_2, and Flag_3, respectively. These processes are executed in parallel by a plurality of processors.

次に、ビットマップフラグ配列１２１０、１２１１、１２１２及び１２１３のそれぞれが、先頭から順番に、ＣＰＵ−０、ＣＰＵ−１、ＣＰＵ−２及びＣＰＵ−３の４台のプロセッサに割り当てられ、全体的なビットマップフラグ配列Ｆｌａｇが作成される。具体的には、各プロセッサがそれぞれの受け持ち範囲１２２０、１２２１、１２２２、１２２３に関して、論理和演算を並列に実行することにより、全体として、
ｆｏｒ（ｉ＝０；ｉ＜１６；ｉ＋＋）｛
Ｆｌａｇ［ｉ］＝Ｆｌａｇ＿１［ｉ］ＯＲＦｌａｇ＿２［ｉ］ＯＲＦｌａｇ＿３［ｉ］ＯＲＦｌａｇ＿４［ｉ］；
｝
が実行される。これにより、図１２の全体的なビットマップフラグ配列１２３０が得られる。Next, each of the bitmap flag arrays 1210, 1211, 1212 and 1213 is assigned to four processors, CPU-0, CPU-1, CPU-2 and CPU-3, in order from the top, A bitmap flag array Flag is created. Specifically, each processor performs an OR operation in parallel on each of the responsible ranges 1220, 1221, 1222, and 1223, so that as a whole,
for (i = 0; i <16; i ++) {
Flag [i] = Flag_1 [i] OR Flag_2 [i] OR Flag_3 [i] OR Flag_4 [i];
}
Is executed. As a result, the overall bitmap flag array 1230 of FIG. 12 is obtained.

図１３を参照すると、各プロセッサは、全体的な元のレコード番号配列５０３の受け持ち範囲７０１、７０２、７０３、７０４の中の要素の順に、全体的なビットマップフラグ配列１２３０の要素のフラグがセットされているか否かをチェックし、フラグがセットされているレコード番号をチェック順に作業配列１３００に格納し、最後に作業配列をさらなる代表レコード番号配列１３１０に格納する。さらなる代表レコード番号配列１３１０は、代表レコード番号が元のレコード番号配列に格納されたレコード番号の順序で格納されている配列である。 Referring to FIG. 13, each processor sets the flags of the elements of the overall bitmap flag array 1230 in the order of the elements in the ranges 701, 702, 703, and 704 of the overall original record number array 503. The record number for which the flag is set is stored in the work array 1300 in the order of check, and finally the work array is stored in the further representative record number array 1310. The further representative record number array 1310 is an array in which the representative record numbers are stored in the order of the record numbers stored in the original record number array.

次に、ｉをループ変数、ＶＬ＿１及びＶＮｏ＿１を第１の次元の項目値配列及び項目値番号配列、ＶＬ＿２及びＶＮｏ＿２を第２の次元の項目値配列及び項目値番号配列とし、集計の第１の次元の次元値配列をＤｉｍ−１、第２の次元の次元値配列をＤｉｍ−２とすると、たとえば、
ｆｏｒ（ｉ＝０；ｉ＜１０；ｉ＋＋）｛
Ｄｉｍ−１［ｉ］＝ＶＬ＿１［ＶＮｏ＿１［ＯｒｄＳｅｔ’’’［ｉ］］］；
Ｄｉｍ−２［ｉ］＝ＶＬ＿２［ＶＮｏ＿２［ＯｒｄＳｅｔ’’’［ｉ］］］；
｝
を複数台のプロセッサで並列に実行することによって、集計の次元１３２０が作成される。Next, i is a loop variable, VL_1 and VNo_1 are the first dimension item value array and item value number array, VL_2 and VNo_2 are the second dimension item value array and item value number array, If the dimension value array of the dimension is Dim-1, and the dimension value array of the second dimension is Dim-2, for example,
for (i = 0; i <10; i ++) {
Dim-1 [i] = VL_1 [VNo_1 [OrdSet '''[i]]];
Dim-2 [i] = VL_2 [VNo_2 [OrdSet '''[i]]];
}
Is executed in parallel by a plurality of processors, thereby creating a total dimension 1320.

［多項目逐次ソート処理］
多項目並列ソート処理には、上述の多項目一括ソート処理の他に、これから説明する多項目逐次ソート処理がある。たとえば、仮想項目値番号のビット数がコンピュータ上で処理できる整数の範囲を超える場合、上記の多項目一括ソート処理を適用できなくなるので、この多項目逐次ソート処理が適用される。以下では、図５の表形式データを用いて、本発明の一実施形態による多項目逐次ソート処理を説明する。以下の例では、次元数を２として説明しているが、次元数２の多項目逐次ソート処理を多段階化することにより任意の次数の多項目逐次ソート処理を実現可能であることは当業者に明白であろう。[Multi-item sequential sorting]
The multi-item parallel sort process includes a multi-item sequential sort process described below in addition to the above-described multi-item batch sort process. For example, when the number of bits of the virtual item value number exceeds the range of integers that can be processed on the computer, the multi-item batch sort process cannot be applied, so this multi-item sequential sort process is applied. Hereinafter, the multi-item sequential sorting process according to the embodiment of the present invention will be described using the tabular data in FIG. In the following example, the number of dimensions is described as 2. However, it is understood by those skilled in the art that multi-item sequential sort processing of any order can be realized by multi-stage multi-item sequential sort processing of dimension 2 It will be obvious.

多項目逐次ソート処理は、上位次元の項目に関するソートより下位次元の項目に関するソートを先に実行する。上位次元と下位次元の考え方は、多項目一括ソート処理に関して説明した通りである。そして、ユニーク行抽出処理に関して説明したグループ番号の考え方を下位次元のソートと上位次元のソートのそれぞれにおいて導入することにより、比較演算を行うことなくグループ分けを行っている。下位次元のソートによって得られたグループ番号を上位次元のソートの際にレコード番号と共にソートすることにより、上位次元のグループ化が簡単化される。尚、項目値番号が０から連続的に使用されることが保証されるならば、項目値番号をグループ番号として使用することも可能であるが、一般には、データの追加・削除により実際に使用されている項目値番号が飛び飛びの値となることもあるので、好ましくは、項目値番号とは別個にグループ番号が使用される。 In the multi-item sequential sort process, the sort related to the items in the lower dimension is executed first than the sort related to the items in the higher dimension. The concept of the upper dimension and the lower dimension is as described for the multi-item batch sort process. Then, by introducing the concept of the group number described for the unique row extraction process in each of the lower dimension sort and the upper dimension sort, the grouping is performed without performing the comparison operation. By sorting the group numbers obtained by sorting the lower dimensions together with the record numbers when sorting the upper dimensions, the grouping of the upper dimensions is simplified. If it is guaranteed that the item value number is used continuously from 0, it is possible to use the item value number as a group number. However, in general, it is actually used by adding / deleting data. Since the item value number being used may become a skipped value, the group number is preferably used separately from the item value number.

図１４は、本発明の一実施形態による多項目逐次ソート処理の概要説明図である。ソート処理前の表形式データ５０１は、図５に示されているデータと同じである。下位次元である項目「クラス」に関して「クラス」の項目値の昇順（たとえば、アルファベット順）に表形式データをソートすると、下位次元でソートされた表形式データ１４００が得られる。このとき、「クラス」の項目値に基づく第１のグループ番号配列１４０１を表形式データ１４００に付与する。 FIG. 14 is a schematic explanatory diagram of multi-item sequential sorting processing according to an embodiment of the present invention. The tabular data 501 before the sorting process is the same as the data shown in FIG. When the tabular data is sorted in the ascending order (for example, alphabetical order) of the item values of “class” with respect to the item “class” which is the lower dimension, tabular data 1400 sorted in the lower dimension is obtained. At this time, a first group number array 1401 based on the item value of “class” is assigned to the tabular data 1400.

次に、表形式データ１４００と第１のグループ番号配列１４０１を、上位次元である項目「年齢」の項目値の昇順にソートすると、ソートされた表形式データ１４１０と、ソーとされた第１のグループ番号配列１４１１とが得られる。このとき、ソート結果に基づいて新たな第２のグループ番号配列１４１２を表形式データ１４１０に付与する。 Next, when the tabular data 1400 and the first group number array 1401 are sorted in ascending order of the item value of the item “age” which is the higher dimension, the sorted tabular data 1410 and the first saw-shaped data are sorted. A group number array 1411 is obtained. At this time, a new second group number array 1412 is assigned to the tabular data 1410 based on the sorting result.

最後に、第２のグループ番号配列１４１２を参照して、表形式データ１４１０からユニーク行を抽出すると、多項目一括ソート処理で作成された集計の次元と同じ集計の次元１１３０が得られる。 Finally, referring to the second group number array 1412 and extracting a unique row from the tabular data 1410, the same total dimension 1130 as the total dimension created by the multi-item batch sort process is obtained.

以下では、図面を参照して、本発明の一実施形態による多項目逐次ソート処理をより詳細に説明する。図１５は、本発明の一実施形態による多項目逐次ソート処理の下位次元ソート処理における並列カウントアップ処理の説明図である。下位次元ソート処理は、下位次元の項目に関する項目値番号の昇順にレコード番号配列に格納されたレコード番号を並べ替える処理である。この処理は、多項目一括ソート処理における仮想項目値番号の昇順にレコード番号を並べ替える処理と類似している。図１５の例では、下位次元の項目の項目値に対応する項目値番号の出現回数を数え上げる。この数え上げ処理もまた、プロセッサが受け持ち範囲を分担する。本例では、レコード番号配列５０３は、４台のプロセッサのそれぞれの受け持ち範囲１５０１、１５０２、１５０３及び１５０４に分割される。 Hereinafter, the multi-item sequential sorting process according to an embodiment of the present invention will be described in more detail with reference to the drawings. FIG. 15 is an explanatory diagram of the parallel count-up process in the low-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. The lower dimension sorting process is a process of rearranging the record numbers stored in the record number array in ascending order of the item value numbers related to the lower dimension items. This process is similar to the process of rearranging record numbers in ascending order of virtual item value numbers in the multi-item batch sort process. In the example of FIG. 15, the number of occurrences of the item value number corresponding to the item value of the item of the lower dimension is counted. This counting process is also shared by the processor. In this example, the record number array 503 is divided into the respective handling ranges 1501, 1502, 1503, and 1504 of the four processors.

レコード番号をｉ、レコード番号配列をＯｒｄＳｅｔ、項目値番号配列をＶＮｏ＿２、プロセッサ番号をｊ（ｊ＝０，１，２，３）としてプロセッサｊが数え上げた項目値番号の個数を格納するカウント配列をＣｏｕｎｔ［ｊ］とすると、本実施形態による数え上げ処理は、たとえば、Ｃ言語スタイルでは、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
Ｃｏｕｎｔ［ｊ］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］＋＋；
｝
｝
と記述される。図１５の例では、初期化されたカウント配列１５１０、１５１１、１５１２及び１５１３は、項目値番号の数え上げ処理によって、カウント配列１５２０、１５２１、１５２２及び１５２３となる。尚、本例において、カウント配列Ｃｏｕｎｔのサイズは、対象としている項目の項目値番号配列をＶＬとし、プロセッサの台数をＮとすると、（ＶＬのサイズ）×Ｎに一致する。A count array for storing the number of item value numbers counted by the processor j, where the record number is i, the record number array is OrdSet, the item value number array is VNo_2, and the processor number is j (j = 0, 1, 2, 3). If Count [j], the counting process according to the present embodiment is, for example, in C language style.
for (i = 0; i <4; i ++) {
for (j = 0; j <4; j ++) {
Count [j] [V_VNo [OrdSet [j * 4 + i]]] ++;
}
}
Is described. In the example of FIG. 15, the initialized count arrays 1510, 1511, 1512, and 1513 become count arrays 1520, 1521, 1522, and 1523 by the item value number counting process. In this example, the size of the count array Count is equal to (VL size) × N, where the item value number array of the target item is VL and the number of processors is N.

続いて、本発明の一実施形態では、コンピュータシステム１０は、項目値番号配列の各要素を累計数化して、項目値番号の出現回数の累積度数分布配列を生成する。図１６は、本発明の一実施形態による多項目逐次ソート処理の下位次元ソート処理における累積度数分布配列を生成する処理の説明図である。プロセッサｊの受け持ち範囲内のカウント配列をＣｏｕｎｔ［ｊ］、累積度数分布配列をＡｇｇｒ［ｊ］とすると、累積度数分布は、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ａｇｇｒ［３］［０］＋Ｃｏｕｎｔ［３］［０］
Ａｇｇｒ［１］［１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］
Ａｇｇｒ［２］［１］＝Ａｇｇｒ［１］［１］＋Ｃｏｕｎｔ［１］［１］
Ａｇｇｒ［３］［１］＝Ａｇｇｒ［２］［１］＋Ｃｏｕｎｔ［２］［１］
Ａｇｇｒ［０］［２］＝Ａｇｇｒ［３］［１］＋Ｃｏｕｎｔ［３］［１］
Ａｇｇｒ［１］［２］＝Ａｇｇｒ［０］［２］＋Ｃｏｕｎｔ［０］［２］
Ａｇｇｒ［２］［２］＝Ａｇｇｒ［１］［２］＋Ｃｏｕｎｔ［１］［２］
Ａｇｇｒ［３］［２］＝Ａｇｇｒ［２］［２］＋Ｃｏｕｎｔ［２］［２］
Ａｇｇｒ［０］［３］＝Ａｇｇｒ［３］［２］＋Ｃｏｕｎｔ［３］［２］
Ａｇｇｒ［１］［３］＝Ａｇｇｒ［０］［３］＋Ｃｏｕｎｔ［０］［３］
Ａｇｇｒ［２］［３］＝Ａｇｇｒ［１］［３］＋Ｃｏｕｎｔ［１］［３］
Ａｇｇｒ［３］［３］＝Ａｇｇｒ［２］［３］＋Ｃｏｕｎｔ［２］［３］
として計算される。この計算は、４台のプロセッサのうちのいずれか１台のプロセッサが単独で行ってもよいが、４台のプロセッサに仮想項目値番号の範囲を割り当てて、４台のプロセッサによって並列に行ってもよい。Subsequently, in one embodiment of the present invention, the computer system 10 generates a cumulative frequency distribution array of the number of occurrences of item value numbers by accumulating each element of the item value number array. FIG. 16 is an explanatory diagram of a process for generating a cumulative frequency distribution array in the low-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. When the count array within the range of the processor j is Count [j] and the cumulative frequency distribution array is Aggr [j], the cumulative frequency distribution is
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Aggr [3] [0] + Count [3] [0]
Aggr [1] [1] = Aggr [0] [1] + Count [0] [1]
Aggr [2] [1] = Aggr [1] [1] + Count [1] [1]
Aggr [3] [1] = Aggr [2] [1] + Count [2] [1]
Aggr [0] [2] = Aggr [3] [1] + Count [3] [1]
Aggr [1] [2] = Aggr [0] [2] + Count [0] [2]
Aggr [2] [2] = Aggr [1] [2] + Count [1] [2]
Aggr [3] [2] = Aggr [2] [2] + Count [2] [2]
Aggr [0] [3] = Aggr [3] [2] + Count [3] [2]
Aggr [1] [3] = Aggr [0] [3] + Count [0] [3]
Aggr [2] [3] = Aggr [1] [3] + Count [1] [3]
Aggr [3] [3] = Aggr [2] [3] + Count [2] [3]
Is calculated as This calculation may be performed by any one of the four processors alone, but the range of virtual item value numbers is assigned to the four processors and performed in parallel by the four processors. Also good.

４台のプロセッサによって並列的に累積度数分布を計算する場合には、最初に、各仮想項目値番号ｉの出現回数の合計値Ｓｕｍ［ｉ］を、たとえば、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
Ｓｕｍ［ｉ］＝Ｃｏｕｎｔ［ｊ］［ｉ］；
｝
｝
によって、並列的に計算する。本実施形態では、たとえば、ＣＰＵ−０がｉ＝０のループを担当し、ＣＰＵ−１がｉ＝１のループを担当し、ＣＰＵ−２がｉ＝２のループを担当し、ＣＰＵ−３がｉ＝３のループを担当する。When the cumulative frequency distribution is calculated in parallel by four processors, first, the total value Sum [i] of the number of occurrences of each virtual item value number i is, for example,
for (i = 0; i <4; i ++) {
for (j = 0; j <4; j ++) {
Sum [i] = Count [j] [i];
}
}
To calculate in parallel. In this embodiment, for example, CPU-0 is in charge of the i = 0 loop, CPU-1 is in charge of the i = 1 loop, CPU-2 is in charge of the i = 2 loop, and CPU-3 is in charge of the loop. Take charge of the loop of i = 3.

続いて、たとえば、４台のプロセッサが並列的に、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ｓｕｍ［０］
Ａｇｇｒ［１］［１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］
Ａｇｇｒ［２］［１］＝Ａｇｇｒ［１］［１］＋Ｃｏｕｎｔ［１］［１］
Ａｇｇｒ［３］［１］＝Ａｇｇｒ［２］［１］＋Ｃｏｕｎｔ［２］［１］
Ａｇｇｒ［０］［２］＝Ｓｕｍ［１］
Ａｇｇｒ［１］［２］＝Ａｇｇｒ［０］［２］＋Ｃｏｕｎｔ［０］［２］
Ａｇｇｒ［２］［２］＝Ａｇｇｒ［１］［２］＋Ｃｏｕｎｔ［１］［２］
Ａｇｇｒ［３］［２］＝Ａｇｇｒ［２］［２］＋Ｃｏｕｎｔ［２］［２］
Ａｇｇｒ［０］［３］＝Ｓｕｍ［２］
Ａｇｇｒ［１］［３］＝Ａｇｇｒ［０］［３］＋Ｃｏｕｎｔ［０］［３］
Ａｇｇｒ［２］［３］＝Ａｇｇｒ［１］［３］＋Ｃｏｕｎｔ［１］［３］
Ａｇｇｒ［３］［３］＝Ａｇｇｒ［２］［３］＋Ｃｏｕｎｔ［２］［３］
を実行することによって、累積度数分布配列が得られる。Subsequently, for example, four processors are connected in parallel,
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Sum [0]
Aggr [1] [1] = Aggr [0] [1] + Count [0] [1]
Aggr [2] [1] = Aggr [1] [1] + Count [1] [1]
Aggr [3] [1] = Aggr [2] [1] + Count [2] [1]
Aggr [0] [2] = Sum [1]
Aggr [1] [2] = Aggr [0] [2] + Count [0] [2]
Aggr [2] [2] = Aggr [1] [2] + Count [1] [2]
Aggr [3] [2] = Aggr [2] [2] + Count [2] [2]
Aggr [0] [3] = Sum [2]
Aggr [1] [3] = Aggr [0] [3] + Count [0] [3]
Aggr [2] [3] = Aggr [1] [3] + Count [1] [3]
Aggr [3] [3] = Aggr [2] [3] + Count [2] [3]
To obtain a cumulative frequency distribution array.

図１６には、最終的な累積度数分布配列１６００、１６０１、１６０２及び１６０３が示されている。 FIG. 16 shows final cumulative frequency distribution arrays 1600, 1601, 1602, and 1603.

次に、本発明の一実施形態による多項目逐次ソート処理の下位次元ソートでは、元のレコード番号配列の要素をソート後のレコード番号配列に転送する。図１７は、本発明の一実施形態によるレコード番号配列の要素を並べ替える処理の説明図である。この処理では、レコード番号配列５０３は、図１５と同様に、４台のプロセッサの受け持ち範囲１５０１、１５０２、１５０３及び１５０４に分割され、４台のプロセッサが並列処理を実行する。元のレコード番号配列をＯｒｄＳｅｔ、ソート後のレコード番号配列をＯｒｄＳｅｔ’、項目値番号配列をＶＮｏ＿２、累積度数分布配列をＡｇｇｒ［ｊ］（ｊはプロセッサ番号）とすると、プロセッサ０番、すなわち、ＣＰＵ−０による並べ替え処理は、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ＯｒｄＳｅｔ’［Ａｇｇｒ［０］［ＶＮｏ＿２［ＯｒｄＳｅｔ［ｉ］］］］＝ＯｒｄＳｅｔ［ｉ］；
Ａｇｇｒ［０］［ＶＮｏ＿２［ＯｒｄＳｅｔ［ｉ］］］＋＋；
｝
と記述できる。ここで、Ａｇｇｒ［０］［ＶＮｏ＿２［ＯｒｄＳｅｔ［ｉ］］］＋＋から、累積度数分布配列の要素がインデックスポインタとして利用されていることがわかる。Next, in the low-order sort of the multi-item sequential sort process according to the embodiment of the present invention, the elements of the original record number array are transferred to the sorted record number array. FIG. 17 is an explanatory diagram of processing for rearranging elements of a record number array according to an embodiment of the present invention. In this processing, the record number array 503 is divided into four processor ranges 1501, 1502, 1503, and 1504 as in FIG. 15, and the four processors execute parallel processing. If the original record number array is OrdSet, the sorted record number array is OrdSet ', the item value number array is VNo_2, and the cumulative frequency distribution array is Aggr [j] (j is a processor number), processor 0, that is, CPU Sorting by -0 is
for (i = 0; i <4; i ++) {
OrdSet ′ [Aggr [0] [VNo — 2 [OrdSet [i]]]] = OrdSet [i];
Aggr [0] [VNo_2 [OrdSet [i]]] ++;
}
Can be described. Here, Aggr [0] [VNo_2 [OrdSet [i]]] ++ indicates that the elements of the cumulative frequency distribution array are used as index pointers.

さらに、プロセッサ番号ｊをループ変数とすると、本実施形態によるレコード番号配列の要素を並べ替える処理は、
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ＯｒｄＳｅｔ’［Ａｇｇｒ［ｊ］［ＶＮｏ＿２［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］］＝ＯｒｄＳｅｔ［ｊ＊４＋ｉ］；
Ａｇｇｒ［ｊ］［ＶＮｏ＿２［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］＋＋；
｝
｝
と記述できる。Furthermore, assuming that the processor number j is a loop variable, the process of rearranging the elements of the record number array according to this embodiment is as follows:
for (j = 0; j <4; j ++) {
for (i = 0; i <4; i ++) {
OrdSet ′ [Aggr [j] [VNo — 2 [OrdSet [j * 4 + i]]]] = OrdSet [j * 4 + i];
Aggr [j] [VNo_2 [OrdSet [j * 4 + i]]] ++;
}
}
Can be described.

レコード番号配列の要素を並べ替える処理が完了すると、累積度数分布配列１７００、１７０１、１７０２及び１７０３は、ソート後の累積度数分布配列を表し、ソートバージョンのレコード番号配列１７１０はソート後のレコード番号配列を表している。このように、本発明の一実施形態によるレコード番号配列の要素を並べ替える処理により、元のレコード番号配列５０３の要素は、下位次元に関してソートされ、ソートバージョンのレコード番号配列１７１０に格納されている。表形式データ１４００は下位次元に関してソートされた表形式データを示している。 When the process of rearranging the elements of the record number array is completed, the cumulative frequency distribution arrays 1700, 1701, 1702, and 1703 represent the sorted cumulative frequency distribution arrays, and the sorted version record number array 1710 is the sorted record number array. Represents. Thus, by the process of rearranging the elements of the record number array according to one embodiment of the present invention, the elements of the original record number array 503 are sorted with respect to the lower dimension and stored in the sorted version of the record number array 1710. . Tabular data 1400 indicates tabular data sorted with respect to the lower dimensions.

さらに、ソート完了時の累積度数分布配列１７０３の値を用いることにより、グループ番号配列１７２０に示されるように、レコード番号をグループ分けすることができる。このグループ番号は、後で上位次元に関してレコード番号をソートするときに、レコード番号と共にソートされる。 Furthermore, the record numbers can be grouped as shown in the group number array 1720 by using the value of the cumulative frequency distribution array 1703 at the time of completion of sorting. This group number is sorted along with the record number when the record number is later sorted with respect to the upper dimension.

本発明の一実施形態による多次元逐次ソート処理では、下位次元に関するソートに続いて、上位次元に関するソートが行われる。上位次元に関するソートは下位次元に関するソートに類似しているので簡単に説明する。図１８は、本発明の一実施形態による多項目逐次ソート処理の上位次元ソート処理における並列カウントアップ処理の説明図である。ここでも、上位次元の項目値番号配列ＶＮｏ＿１中の要素の出現回数を４台のプロセッサで並列的にカウントアップする。カウントアップ結果がカウント配列１８２０、１８２１、１８２２、１８２３に示されている。 In the multi-dimensional sequential sorting process according to the embodiment of the present invention, the sorting for the upper dimension is performed following the sorting for the lower dimension. Since the sort related to the upper dimension is similar to the sort related to the lower dimension, it will be briefly described. FIG. 18 is an explanatory diagram of the parallel count-up process in the higher-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. Again, the number of appearances of elements in the higher-order item value number array VNo_1 is counted up in parallel by four processors. The count up results are shown in count arrays 1820, 1821, 1822, 1823.

図１９は、本発明の一実施形態による多項目逐次ソート処理の上位次元ソート処理における累積度数分布配列生成処理の説明図である。上位次元ソート処理の場合も、下位次元ソート処理と同様に、複数台のプロセッサを用いて並列的に累積度数分布配列１９００、１９０１、１９０２、１９０３が生成される。 FIG. 19 is an explanatory diagram of the cumulative frequency distribution array generation process in the higher-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. Also in the case of the higher-order sort processing, cumulative frequency distribution arrays 1900, 1901, 1902, and 1903 are generated in parallel using a plurality of processors, similarly to the lower-order sort processing.

図２０は、本発明の一実施形態による多項目逐次ソート処理の上位次元ソート処理におけるレコード番号配列の要素を並べ替える処理の説明図である。上位次元ソート処理の場合も、下位次元ソート処理と同様に、複数台のプロセッサを用いて、レコード番号配列１７１０がレコード番号配列２０１０へ並列的に並べ替えられる。このとき、同時に、グループ番号配列１７２０もグループ番号配列２０２０へ並列的に並べ替えられる。表形式データ１４１０は下位次元に続いて上位次元に関してソートされた表形式データを示している。 FIG. 20 is an explanatory diagram of the process of rearranging the elements of the record number array in the higher-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. Also in the case of the higher-order sort process, the record number array 1710 is rearranged in parallel to the record number array 2010 using a plurality of processors as in the lower-order sort process. At this time, the group number array 1720 is also rearranged in parallel to the group number array 2020. Tabular data 1410 indicates tabular data sorted with respect to the upper dimension following the lower dimension.

レコード番号配列２０１０は、図１０を参照して説明した多項目一括ソート処理によって得られたレコード番号配列１００５と同一であり、多項目一括ソート処理と多項目逐次ソート処理のどちらでも同じ結果が得られることが確かめられた。 The record number array 2010 is the same as the record number array 1005 obtained by the multi-item batch sort process described with reference to FIG. 10, and the same result is obtained in both the multi-item batch sort process and the multi-item sequential sort process. It was confirmed that

グループ番号配列２０２０は、最終的な累積度数分布配列２００３の要素によって示されているように、Ｇｒｏｕｐ２［０］からＧｒｏｕｐ２［４］の第１区間と、Ｇｒｏｕｐ２［５］からＧｒｏｕｐ２［１０］までの第２区間と、Ｇｒｏｕｐ２［１１］からＧｒｏｕｐ２［１５］までの第３区間の３区間にグループ化される。さらに、第１区間は、要素の値によって、０、２、３の３個に分割され、第２区間は０、１、２、３の４個に分割され、第３区間は１、２、３の３個に分割されている。よって、グループ番号配列２０２０を先頭から順次比較することにより、たとえば、ｆｏｒループを使ってカウントアップすることにより、完全なグループ番号配列２０３０を得ることが可能である。尚、このようなシーケンシャルなアクセスは、キャッシュのプリフェッチが有効であることに注意すべきである。 As shown by the elements of the final cumulative frequency distribution array 2003, the group number array 2020 includes a first section from Group2 [0] to Group2 [4], and from Group2 [5] to Group2 [10]. The second section and the third section from Group 2 [11] to Group 2 [15] are grouped into three sections. Further, the first section is divided into three parts 0, 2, 3 according to the element value, the second section is divided into four parts 0, 1, 2, 3, and the third section is 1, 2, 3 is divided into three. Therefore, the complete group number array 2030 can be obtained by sequentially comparing the group number array 2020 from the top, for example, by counting up using a for loop. It should be noted that the cache prefetch is effective for such a sequential access.

以上の通り、多項目逐次ソート処理によって得られたレコード番号配列２０１０に上記のユニーク行抽出処理を適用することによって、一意の固有値の組、すなわち、集計の次元を作成することができる。 As described above, by applying the above-described unique row extraction process to the record number array 2010 obtained by the multi-item sequential sort process, a unique eigenvalue pair, that is, a total dimension can be created.

［測度並列算出処理］
最後に、ユニーク行抽出処理によって得られた集計の次元に基づいて測度を算出することにより多次元集計処理が実現される。以下、この測度の算出について説明する。集計処理の結果は、グループ番号を行とし、次元及び測度を列とする集計結果テーブルによって表現される。ここでは、集計結果テーブルのサイズ、すなわち、行数と、キャッシュメモリのサイズの関係に応じて、２通りの測度算出方法が提案される。ユニーク行抽出処理によって得られた最終的なレコード番号配列とグループ番号配列について検討すると、グループ番号配列は要素が昇順に並んでいる。一方、レコード番号配列は要素がランダムに配置されている。そのため、測度の算出を実施する際に、集計結果テーブルの行数が小さい場合には、できるだけキャッシュヒット率を高めるために、レコード番号配列中の要素をシーケンシャルに並べ替える（レコード番号昇順タイプ）これによって、レコード番号配列へのアクセスがシーケンシャルアクセスになるため、キャッシュのプリフェッチ機構によりキャッシュヒット率が高くなる。一方、グループ番号配列は、配列中の要素の値の変動が比較的小さいのでキャッシュヒット率が高い。逆に、集計結果テーブルの行数が比較的大きい場合には、処理手順の簡略化と作業用メモリの使用量の削減化のため、レコード番号配列の並べ替えは行わない（グループ番号昇順タイプ）。[Measure parallel calculation processing]
Finally, a multi-dimensional tabulation process is realized by calculating a measure based on the tabulation dimension obtained by the unique row extraction process. Hereinafter, calculation of this measure will be described. The result of the tabulation process is expressed by a tabulation result table with group numbers as rows and dimensions and measures as columns. Here, two types of measure calculation methods are proposed depending on the size of the tabulation result table, that is, the relationship between the number of rows and the size of the cache memory. Considering the final record number array and group number array obtained by the unique row extraction process, elements are arranged in ascending order in the group number array. On the other hand, elements are randomly arranged in the record number array. Therefore, when the calculation of the measure is performed, if the number of rows in the aggregation result table is small, the elements in the record number array are rearranged sequentially (record number ascending order type) in order to increase the cache hit rate as much as possible. As a result, the access to the record number array becomes sequential access, and the cache prefetch mechanism increases the cache hit rate. On the other hand, the group number array has a high cache hit rate because fluctuations in the values of elements in the array are relatively small. Conversely, when the number of rows in the tabulation result table is relatively large, the record number array is not rearranged (group number ascending order type) to simplify the processing procedure and reduce the amount of work memory used. .

［レコード番号昇順タイプの測度並列算出処理］
以下、レコード番号昇順タイプの測度並列算出処理について説明する。この処理は、キャッシュヒット率が高くなるという利点がある。レコード番号配列の要素は、一意であるが、レコード番号昇順の形に変換することにより、シーケンシャルアクセスになるため、プリフェッチ機構によりキャッシュヒット率が高くなる。また、グループ番号配列は変動が小さいので、高いキャッシュヒット率が維持できる。この処理では、各プロセッサがグループ番号配列の全体にアクセスする可能性があるので、プロセッサ毎に集計作業領域を用意し、各プロセッサの集計結果をまとめて最終的な集計結果を作成する。尚、多数の測度が存在する場合には、測度１つ当たりのレコード番号配列の並べ替えのコストが低下すること、項目値番号配列から項目値配列へのアクセスはランダムアクセスであることに注意すべきである。[Measure number parallel calculation process of record number ascending order type]
Hereinafter, the measure parallel calculation processing of the record number ascending order type will be described. This process has the advantage of a high cache hit rate. The elements of the record number array are unique, but the cache hit rate is increased by the prefetch mechanism because sequential access is performed by converting the elements into the ascending order of the record numbers. In addition, since the group number array has a small fluctuation, a high cache hit rate can be maintained. In this process, since each processor may access the entire group number array, a totaling work area is prepared for each processor, and the totaling result of each processor is collected to create a final counting result. Note that when there are a large number of measures, the cost of rearranging the record number array per measure is reduced, and access from the item value number array to the item value array is random access. Should.

レコード番号昇順タイプの測度並列算出処理は、複数台のプロセッサを用いて、レコード番号配列の要素及びグループ番号配列の要素を、レコード番号配列の要素が昇順となるように並べ替えるステップと、複数台のプロセッサを用いて、測度を並列的に集計し、各プロセッサで算出された測度を合成するステップとを備える。 Ascending record number order type measure parallel calculation processing uses a plurality of processors to rearrange the elements of the record number array and the elements of the group number array so that the elements of the record number array are in ascending order, Using the above-mentioned processors, the measures are aggregated in parallel, and the measures calculated by the processors are synthesized.

図２１は、本発明の一実施形態によるレコード番号昇順タイプの測度並列算出処理における並べ替え処理の説明図である。 FIG. 21 is an explanatory diagram of the rearrangement process in the measure number ascending order type measure parallel calculation process according to an embodiment of the present invention.

並べ替え前のレコード番号配列をＯｒｄＳｅｔ’’、グループ番号配列をＧｒｏｕｐ３とし、並べ替え後のレコード番号配列をＯｒｄＳｅｔ’’、グループ番号配列をＧｒｏｕｐ４とし、ループ変数をｉとすると、この並べ替え処理は、
ｆｏｒ（ｉ＝０；ｉ＜１６；ｉ＋＋）｛
ＯｒｄＳｅｔ’’’［ＯｒｄＳｅｔ’’［ｉ］］＝ＯｒｄＳｅｔ’’［ｉ］；
Ｇｒｏｕｐ４［ＯｒｄＳｅｔ’’［ｉ］］＝Ｇｒｏｕｐ４［ｉ］；
｝
と記述することができる。この処理は、レコード番号配列２０１０を図２１に示されるようにプロセッサ毎の受け持ち範囲２１００、２１０１、２１０２、２１０３のように分割することによって、複数台のプロセッサによって並列に実行される。If the record number array before sorting is OrdSet ″, the group number array is Group3, the sorted record number array is OrdSet ″, the group number array is Group4, and the loop variable is i, this sorting process is ,
for (i = 0; i <16; i ++) {
OrdSet ′ ″ [OrdSet ″ [i]] = OrdSet ″ [i];
Group4 [OrdSet '' [i]] = Group4 [i];
}
Can be described. This processing is executed in parallel by a plurality of processors by dividing the record number array 2010 into the handling ranges 2100, 2101, 1022, 2103 for each processor as shown in FIG.

図２２は、本発明の一実施形態によるレコード番号昇順タイプの測度並列算出処理における集計処理の説明図である。レコード番号配列２１１０をプロセッサ毎の受け持ち範囲２２００、２２０１、２２０２、２２０３のように４分割することによって、測度並列算出処理は４台のプロセッサによって並列に実行される。図２２には、測度算出の対象となる項目「ポイント」の項目値番号配列ＶＮｏ＿３と項目値配列ＶＬ＿３がそれぞれ示されている。本例では、測度は「ポイント」の合計値である。よって、既に求められた集計の次元１１３０に関して、「ポイント」の合計値を算出する。 FIG. 22 is an explanatory diagram of the aggregation process in the measure parallel calculation process of the record number ascending order type according to one embodiment of the present invention. By dividing the record number array 2110 into four, such as the handling ranges 2200, 2201, 2202, 2203 for each processor, the measure parallel calculation processing is executed in parallel by the four processors. FIG. 22 shows an item value number array VNo_3 and an item value array VL_3 of the item “point” for which the measure is to be calculated. In this example, the measure is the total value of “points”. Therefore, the total value of “points” is calculated with respect to the already calculated dimension 1130.

４台のプロセッサが用いられるので、共有メモリ上には４個の中間結果格納配列Ｓｕｍ＿０、Ｓｕｍ＿１、Ｓｕｍ＿２、Ｓｕｍ＿３と、最終的な結果格納配列Ｓｕｍが確保され、要素の値が０に初期化される。たとえば、ＣＰＵ−０の処理は、ｉをループ変数として、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
Ｓｕｍ＿０［Ｇｒｏｕｐ４［ＯｒｄＳｅｔ’’’［ｉ］］＝＋ＶＬ＿３［ＶＮｏ＿３［ＯｒｄＳｅｔ’’’［ｉ］］］；
｝
として記述される。ＣＰＵ−１、ＣＰＵ−２、ＣＰＵ−３についても同様である。Since four processors are used, four intermediate result storage arrays Sum_0, Sum_1, Sum_2, Sum_3 and a final result storage array Sum are secured on the shared memory, and the element value is initialized to 0. The For example, the process of CPU-0 uses i as a loop variable.
for (i = 0; i <4; i ++) {
Sum_0 [Group4 [OrdSet ′ ″ [i]] = + VL_3 [VNo — 3 [OrdSet ′ ″ [i]]];
}
Is described as The same applies to CPU-1, CPU-2, and CPU-3.

次に、最終的な結果格納配列Ｓｕｍを４分割して、各プロセッサが受け持ち範囲について、
Ｓｕｍ［ｉ］＝Ｓｕｍ＿０［ｉ］＋Ｓｕｍ＿１［ｉ］＋Ｓｕｍ＿２［ｉ］＋Ｓｕｍ＿３［ｉ］
として記述される演算を実行することにより、同図に示されるような最終的な結果格納配列２２４０が得られる。この最終的な結果格納配列の要素が算出された測度であり、対応する次元が次元１１３０として図２２に示されている。Next, the final result storage array Sum is divided into four, and each processor has a responsibility range.
Sum [i] = Sum_0 [i] + Sum_1 [i] + Sum_2 [i] + Sum_3 [i]
As a result, the final result storage array 2240 as shown in the figure is obtained. The elements of this final result storage array are the calculated measures, and the corresponding dimension is shown in FIG. 22 as dimension 1130.

［グループ番号昇順タイプの測度並列算出処理］
次に、グループ番号昇順タイプの測度並列算出処理について説明する。この処理は、上記のＳｕｍ＿０、Ｓｕｍ＿１、Ｓｕｍ＿２、Ｓｕｍ＿３のような中間的な結果格納配列を確保しなくてもよいので、集計結果テーブルの行数が大きくても使用されるメモリ量が増加しないという利点がある。また、この処理は、レコード番号配列の並べ替えを必要としないので、処理時間が短縮されるという利点がある。一方、レコード番号配列の要素を添字として用いる項目値番号配列へのアクセスはランダムアクセスになることに注意する必要がある。また、項目値番号配列から項目値配列へのアクセスもランダムアクセスである。[Group number ascending type measure parallel calculation processing]
Next, group number ascending type measure parallel calculation processing will be described. This process does not require an intermediate result storage array such as Sum_0, Sum_1, Sum_2, and Sum_3, so that the amount of memory used does not increase even if the number of rows in the aggregation result table is large. There are advantages. Further, since this process does not require rearrangement of the record number array, there is an advantage that the processing time is shortened. On the other hand, it should be noted that access to the item value number array using the elements of the record number array as subscripts is random access. Access from the item value number array to the item value array is also random access.

図２３は、本発明の一実施形態によるグループ番号昇順タイプの測度並列算出処理における集計処理の説明図である。レコード番号配列２０１０を４台のプロセッサ毎の受け持ち範囲２３００、２３０１、２３０２、２３０３に分割する。ここで、注意すべきことは、同じグループに属するレコード番号が別々のプロセッサに割り当てられることがないように、かつ、各プロセッサの受け持ち範囲ができるだけ均等になるように受け持ち範囲２３００、２３０１、２３０２、２３０３を決定しなければならない。たとえば、受け持ち範囲は、たとえば、何れか１台のプロセッサが、１台当たりの受け持ち範囲の限界を設定し、グループ番号配列２０３０を先頭から順に走査し、同一のグループに属するレコード番号が同一のプロセッサに割り当てられるように、受け持ち範囲の限界内で境界を選択することによって決定される。各プロセッサの受け持ち範囲が決定されると、測度並列算出処理は４台のプロセッサによって並列に実行される。図２３には、測度算出の対象となる項目「ポイント」の項目値番号配列ＶＮｏ＿３と項目値配列ＶＬ＿３がそれぞれ示されている。本例では、測度は「ポイント」の合計値である。よって、既に求められた集計の次元１１３０に関して、「ポイント」の合計値を算出する。 FIG. 23 is an explanatory diagram of a counting process in the group number ascending type measure parallel calculation process according to an embodiment of the present invention. The record number array 2010 is divided into the handling ranges 2300, 2301, 2302, 2303 for each of the four processors. Here, it should be noted that the handling ranges 2300, 2301, and 2302, so that the record numbers belonging to the same group are not assigned to different processors and the handling ranges of the respective processors are made as uniform as possible. 2303 must be determined. For example, the coverage range is, for example, a processor in which any one processor sets the limit of the coverage range per unit, scans the group number array 2030 in order from the top, and has the same record number belonging to the same group. Determined by selecting a boundary within the bounds of the coverage. When the responsible range of each processor is determined, the measure parallel calculation processing is executed in parallel by the four processors. FIG. 23 shows an item value number array VNo_3 and an item value array VL_3 of the item “point” for which the measure is to be calculated. In this example, the measure is the total value of “points”. Therefore, the total value of “points” is calculated with respect to the already calculated dimension 1130.

レコード番号配列をＯｒｄＳｅｔ’’、グループ番号配列をＧｒｏｕｐ３、結果格納配列をＳｕｍ（初期値は０）とし、ループ変数をｉとすると、測度算出処理の全体は、
ｆｏｒ（ｉ＝０；ｉ＜１６；ｉ＋＋）｛
Ｓｕｍ［Ｇｒｏｕｐ３［ＯｒｄＳｅｔ’’［ｉ］］］＝＋ＶＬ＿３［ＶＮｏ＿３［ＯｒｄＳｅｔ’’［ｉ］］］；
｝
として記述される。実際には、このループ中の命令は、各プロセッサの処理に展開されて並列に実行される。If the record number array is OrdSet ″, the group number array is Group3, the result storage array is Sum (initial value is 0), and the loop variable is i, the entire measure calculation process is as follows:
for (i = 0; i <16; i ++) {
Sum [Group3 [OrdSet ″ [i]]] = + VL — 3 [VNo — 3 [OrdSet ″ [i]]];
}
Is described as In practice, the instructions in this loop are expanded into the processing of each processor and executed in parallel.

本発明は、以上の実施の形態に限定されることなく、特許請求の範囲に記載された発明の範囲内で、種々の変更が可能であり、それらも本発明の範囲内に包含されるものであることは言うまでもない。 The present invention is not limited to the above embodiments, and various modifications can be made within the scope of the invention described in the claims, and these are also included in the scope of the present invention. Needless to say.

［多段階並列ソート］
ここでは、特許文献３に記載されているような、カウンティングソートに基づく並列ソートと基数ソートの考え方とを組み合わせることにより実現される多段階並列ソートの実施例を説明する。この多段階並列ソートは、上述の多項目一括ソート処理及び多項目逐次ソート処理の基礎となる記述である。[Multi-stage parallel sort]
Here, an embodiment of multi-stage parallel sorting realized by combining parallel sorting based on counting sort and the concept of radix sort as described in Patent Document 3 will be described. This multi-stage parallel sort is a description that is the basis of the above-described multi-item batch sort processing and multi-item sequential sort processing.

項目値配列ＶＬのサイズが大きいとき、すなわち、項目値番号の個数が多数であるときには、項目値番号を基数で表現し、桁ごとに並列ソートを実施することにより、効率的なソートを実現することが可能である。本例の多段階並列ソートは、最下位の桁から始めて順番に現在の桁に関するソート処理を行い、最後に最上位の桁に関するソート処理を行うことによって最終的なソートを完了する。 When the size of the item value array VL is large, that is, when there are a large number of item value numbers, the item value numbers are expressed in radixes, and parallel sorting is performed for each digit to achieve efficient sorting. It is possible. In the multi-stage parallel sort of this example, the sorting process for the current digit is sequentially performed starting from the least significant digit, and the final sorting is completed by finally performing the sorting process for the most significant digit.

多段階並列ソート方法の一例では、図２４のデータ構造を利用する。本実施の形態では、ＣＰＵの台数は４台とし、すべてのＣＰＵが並列に動作する例を考える。システム内のＣＰＵの総数、及び、並列に動作するＣＰＵの台数はこの例に限定されないことに注意すべきである。また、以下では、説明の便宜上、年齢の項目に関して、年齢の昇順にソートする場合を考える。また、年齢の項目値配列の要素は年齢の昇順に並べられている。図４のデータ構造では、年齢に関する項目値番号ＶＮｏは０から４までの値を取り得るので、基数＝４として項目値番号を分解すると、項目値番号は下の桁と上の桁の２桁に分解される。具体的には、項目値番号のモジュロ（４）の値が下の桁の値であり、項目値番号を４で割った商が上の桁の値である。 In an example of the multistage parallel sorting method, the data structure of FIG. 24 is used. In the present embodiment, the number of CPUs is four, and an example in which all CPUs operate in parallel is considered. It should be noted that the total number of CPUs in the system and the number of CPUs operating in parallel are not limited to this example. In the following, for convenience of explanation, consider a case where the items of age are sorted in ascending order of age. The elements of the item value array for age are arranged in ascending order of age. In the data structure of FIG. 4, the item value number VNo relating to age can take a value from 0 to 4. Therefore, when the item value number is decomposed with the radix = 4, the item value number has two digits, the lower digit and the upper digit. Is broken down into Specifically, the modulo (4) value of the item value number is the lower digit value, and the quotient obtained by dividing the item value number by 4 is the upper digit value.

図２５は、本実施例にかかる多段階並列ソート方法のフローチャートである。多段階並列ソート方法は、ステップ２５０１からステップ２５０５の５ステップにより構成される。 FIG. 25 is a flowchart of the multi-stage parallel sorting method according to the present embodiment. The multi-stage parallel sorting method includes five steps from step 2501 to step 2505.

ステップ２５０１：項目値番号の範囲に応じて項目値番号の基数（本例では基数＝４）を選択し、初期のレコード番号配列ＯｒｄＳｅｔを現在のレコード番号配列に設定し、項目値番号の最下位の桁（本例では項目値番号のモジュロ（４）の値）を現在の桁に設定する。 Step 2501: Select the radix of the item value number (in this example, radix = 4) according to the range of the item value number, set the initial record number array OrdSet to the current record number array, and set the lowest item value number (In this example, the modulo (4) value of the item value number) is set to the current digit.

ステップ２５０２：現在のレコード番号配列を分割して４台のプロセッサに割り当てる。 Step 2502: The current record number array is divided and assigned to four processors.

ステップ２５０３：４台のプロセッサのうちの各プロセッサにおいて、割り当てられたレコード番号配列の部分に含まれるレコードに対応した項目値番号の現在の桁の値の出現回数をカウントする。 Step 2503: In each of the four processors, the number of occurrences of the current digit value of the item value number corresponding to the record included in the allocated record number array portion is counted.

ステップ２５０４：項目値番号の現在の桁の値の範囲を分割して４台のプロセッサに割り当てる。 Step 2504: The range of the current digit value of the item value number is divided and assigned to four processors.

ステップ２５０５：４台のプロセッサのうちの各プロセッサにおいて、項目値番号の現在の桁の値の順番に、項目値番号の現在の桁の値が一致する範囲内ではレコード番号配列の部分の順番に従って、割り当てられた項目値番号の現在の桁の値のそれぞれの出現回数を累計数に変換する。 Step 2505: In each of the four processors, the order of the current digit value of the item value number is in accordance with the order of the part of the record number array within the range where the current digit value of the item value number matches. The number of occurrences of the current digit value of the assigned item value number is converted into a cumulative number.

ステップ２５０６：４台のプロセッサのうちの各プロセッサにおいて、割り当てられたレコード番号配列の部分に含まれるレコードに対応した項目値番号の現在の桁の値の出現回数の累計数をポインタとして利用して、割り当てられたレコード番号配列の部分に含まれるレコード番号をさらなるレコード番号配列に格納する。 Step 2506: In each of the four processors, the cumulative number of occurrences of the current digit value of the item value number corresponding to the record included in the assigned record number array portion is used as a pointer. The record number included in the allocated record number array portion is stored in a further record number array.

ステップ２５０７：基数で表現された項目値番号の最上位桁までソート処理が行われたかどうかを判定し、最上位桁までソートされているならば、多段階並列ソート処理を終了する。 Step 2507: It is determined whether or not the sorting process has been performed up to the most significant digit of the item value number expressed in the radix. If the sorting process has been performed up to the most significant digit, the multistage parallel sorting process is terminated.

ステップ２５０８：未処理の桁が残っているならば、その桁を現在の桁に設定し、さらなるレコード番号配列を現在のレコード番号配列として、ステップ２５０２へ戻る。 Step 2508: If an unprocessed digit remains, the digit is set to the current digit, the further record number array is set as the current record number array, and the process returns to Step 2502.

上記の本実施例による多段階並列ソート方法において、ステップ２５０２からステップ２５０６までのソート処理は、上記の本発明の並列ソート方法と同様の処理であり、項目値番号の代わりに項目値番号の現在の桁の値が使用される点だけが異なっている。 In the multi-stage parallel sorting method according to the above-described embodiment, the sorting processing from step 2502 to step 2506 is the same processing as the parallel sorting method of the present invention described above, and the current item value number is used instead of the item value number. The only difference is that the value of the digit is used.

次に、本実施例による多段階並列ソート方法を具体的に説明する。本例では、図２４に示されたデータを、４台のＣＰＵを使用し、年齢の昇順でソートする。初期化ステップ２５０１は、１段階目のソート処理として、年齢の項目値番号のモジュロー４（ＭＯＤ４）の値（下位の桁の値）に関するソート処理を設定し、２段階目のソート処理として、年齢の項目値番号の４で割った商（ＤＩＶ４）の値に関するソート処理を設定する。 Next, the multi-stage parallel sorting method according to the present embodiment will be specifically described. In this example, the data shown in FIG. 24 is sorted in ascending order of age using four CPUs. The initialization step 2501 sets a sort process related to the value of the modulo 4 (MOD 4) of the age item value number (the value of the lower digit) as the first stage sort process, and as the second stage sort process, A sorting process is set for the value of the quotient (DIV 4) divided by 4 of the item value number of age.

初期化ステップ２５０１では、項目値番号の現在の桁の値の出現回数をカウントする配列が準備される。 In an initialization step 2501, an array for counting the number of occurrences of the value of the current digit of the item value number is prepared.

図２６Ａ乃至２８Ｂは、多段階並列ソート方法の第１段階のカウントステップ２５０３の説明図である。図２６Ａのサブステップ１では、たとえば、ＣＰＵ−０は、ＯｒｄＳｅｔ［０］の値０を読み出し、読み出された値０を添字として、ＶＮｏ［０］の値１を読み出し、この値１のモジュロー４（ＭＯＤ４）の値１を添字として、Ｃｏｕｎｔ−０［１］の値０を１にインクリメントする。同様に、ＣＰＵ−１は、ＯｒｄＳｅｔ［５］の値５を読み出し、この値５を添字として、ＶＮｏ［５］の値２を読み出し、この値２のＭＯＤ４の値２を添字として、Ｃｏｕｎｔ−１［２］の値０を１にインクリメントする。以下、図２６Ｂのサブステップ２、図２７Ａのサブステップ３、図２７Ｂのサブステップ４及び図２８Ａのサブステップ５を実行することにより、図２８Ｂに示されるようなカウントアップ結果が得られる。図２８Ｂの配列Ｃｏｕｎｔ−０の要素Ｃｏｕｎｔ−０［ｉ］は、ＣＰＵ−０が担当した配列ＯｒｄＳｅｔのＯｒｄＳｅｔ［０］からＯｒｄＳｅｔ［４］の範囲内の各レコードに対応する年齢の項目値番号の下位の桁の値ｉの出現回数を表わしている。たとえば、Ｃｏｕｎｔ−０［０］は、ＣＰＵ−０の担当範囲内の項目値番号の下位の桁の値０の出現回数が２回であることを表し、Ｃｏｕｎｔ−３［１］はＣＰＵ−３の担当範囲内の項目値番号の下位の桁の値１の出現回数が２回であることを表す。 FIGS. 26A to 28B are explanatory diagrams of the counting step 2503 of the first stage of the multi-stage parallel sorting method. In sub-step 1 of FIG. 26A, for example, CPU-0 reads the value 0 of OrdSet [0], reads the value 1 of VNo [0] using the read value 0 as a subscript, and modulo this value 1 4 (MOD4) value 1 is subscripted and Count-0 [1] value 0 is incremented to 1. Similarly, the CPU-1 reads the value 5 of OrdSet [5], reads the value 2 of VNo [5] using this value 5 as a subscript, and counts the value 2 of MOD4 of this value 2 as a subscript. The value 0 of [2] is incremented to 1. Subsequently, by executing sub-step 2 in FIG. 26, sub-step 3 in FIG. 27A, sub-step 4 in FIG. 27B and sub-step 5 in FIG. 28A, a count-up result as shown in FIG. 28B is obtained. The element Count-0 [i] of the array Count-0 in FIG. 28B is an item value number of the age corresponding to each record in the range of OrdSet [0] to OrdSet [4] of the array OrdSet that the CPU-0 was in charge of. It represents the number of appearances of the value i of the lower digit. For example, Count-0 [0] indicates that the number of occurrences of the value 0 of the lower digit of the item value number within the CPU-0's assigned range is 2, and Count-3 [1] is CPU-3. This indicates that the number of occurrences of the value 1 of the lower digit of the item value number in the assigned range is 2 times.

図２９Ａ、Ｂは多段階並列ソート方法の第１段階の累計数化ステップの説明図である。本例では、昇順ソートに対応して、項目値番号の下位の桁の値の昇順に累計数化を行う。ＣＰＵ−０は、配列Ｃｏｕｎｔの１行目（すなわち、項目値番号の下位の桁の値０）の累計数化を担当し、ＣＰＵ−１乃至ＣＰＵ−３は、それぞれ、配列Ｃｏｕｎｔの２乃至４行目（すなわち、項目値番号の下位の桁の値１乃至３）の累計数化を担当する。図２９Ａに示されるように、累計数化は配列Ｃｏｕｎｔの横方向（すなわち、添字が一致する行）を優先して行われ、次に、先行する行の累計数を後続する行の累計数に加算することにより、全体の累計数が決まる。尚、横方向の累計数化は、既に説明したように各ＣＰＵが並列に実行可能であるが、単一のＣＰＵが担当してもよい。 FIGS. 29A and 29B are explanatory diagrams of the cumulative number step in the first stage of the multi-stage parallel sorting method. In this example, in accordance with the ascending order sort, the cumulative number is performed in ascending order of the value of the lower digit of the item value number. CPU-0 is in charge of accumulating the first row of the array count (that is, the value 0 of the lower digit of the item value number), and CPU-1 to CPU-3 are respectively 2 to 4 of the array count. Responsible for accumulating the number of rows (that is, values 1 to 3 in the lower digits of the item value number). As shown in FIG. 29A, the cumulative numbering is performed by giving priority to the horizontal direction of the array count (that is, the row with the same subscript), and then the cumulative number of the preceding row is changed to the cumulative number of the subsequent row. By adding, the total number of totals is determined. The cumulative number in the horizontal direction can be executed in parallel by the CPUs as described above, but a single CPU may be in charge.

図３０Ａ乃至３２Ｂは多段階並列ソート方法の第１段階においてレコード番号をさらなるレコード番号配列に格納する転送ステップの説明図である。転送ステップでは、各ＣＰＵは、レコード番号配列ＯｒｄＳｅｔから自分が担当する範囲内のレコード番号を読み出し、次に、そのレコード番号を添字として、ポインタ配列ＶＮｏから項目値番号の下位の桁の値を読み出し、さらに、この項目値番号の下位の桁の値を添字として、自プロセッサに関連付けられた累計数化されたＣｏｕｎｔ配列から累計数値を読み出し、この読み出された累計数値をポイントしてさらなるレコード番号配列ＯｒｄＳｅｔ’にレコード番号を格納すると共に、Ｃｏｕｎｔ配列の累計数値を１ずつインクリメントする。図３２Ｂはこのような転送ステップの結果として第１段階で得られたレコード番号配列ＯｒｄＳｅｔ’を表す。 30A to 32B are explanatory diagrams of a transfer step of storing record numbers in a further record number array in the first stage of the multistage parallel sort method. In the transfer step, each CPU reads the record number within the range that it is in charge of from the record number array OrdSet, and then reads the value of the lower digit of the item value number from the pointer array VNo using the record number as a subscript. Further, using the value of the lower digit of this item value number as a subscript, the cumulative numerical value is read from the accumulated count array associated with the processor, and the further cumulative record number is pointed to the read cumulative numerical value. The record number is stored in the array OrdSet ′, and the cumulative value of the Count array is incremented by one. FIG. 32B shows the record number array OrdSet 'obtained in the first stage as a result of such a transfer step.

第２段階では、第１段階で得られたレコード番号配列ＯｒｄＳｅｔ’を初期条件として、年齢の項目値番号の上位の桁の値（ＤＩＶ４の値）に関する昇順ソートを実行する。 In the second stage, with the record number array OrdSet 'obtained in the first stage as an initial condition, ascending order sorting is performed on the value of the upper digit (value of DIV 4) of the item value number of age.

図３３は、本実施例による多段階並列ソート方法の第２段階のステップ２５０２において、現在のレコード番号配列ＯｒｄＳｅｔ’を４台のＣＰＵに割り当て、それぞれのＣｏｕｎｔ配列を準備した状態を示す図である。 FIG. 33 is a diagram showing a state in which the current record number array OrdSet ′ is assigned to four CPUs and the respective Count arrays are prepared in step 2502 of the second stage of the multi-stage parallel sorting method according to the present embodiment. .

図３４Ａ乃至３６Ｂは、多段階並列ソート方法の第２段階のカウントステップの説明図である。図３４Ａのサブステップ１では、たとえば、ＣＰＵ−０は、ＯｒｄＳｅｔ’［０］の値２を読み出し、読み出された値２を添字として、ＶＮｏ［２］の値４を読み出し、この値１の４で割った商（ＤＩＶ４）の値１を添字として、Ｃｏｕｎｔ−０［１］の値０を１にインクリメントする。同様に、ＣＰＵ−１は、ＯｒｄＳｅｔ’［５］の値１２を読み出し、この値１２を添字として、ＶＮｏ［１２］の値４を読み出し、この値４のＤＩＶ４の値１を添字として、Ｃｏｕｎｔ−１［１］の値０を１にインクリメントする。以下、図３４Ｂのサブステップ２、図３５Ａのサブステップ３、図３５Ｂのサブステップ４及び図３６Ａのサブステップ５を実行することにより、図３６Ｂに示されるような第２段階のカウントアップ結果が得られる。図３６Ｂの配列Ｃｏｕｎｔ−０の要素Ｃｏｕｎｔ−０［ｉ］は、ＣＰＵ−０が担当した配列ＯｒｄＳｅｔ’のＯｒｄＳｅｔ’［０］からＯｒｄＳｅｔ［４］の範囲内の各レコードに対応する年齢の項目値番号の上位の桁の値ｉの出現回数を表わしている。たとえば、Ｃｏｕｎｔ−０［０］は、ＣＰＵ−０の担当範囲内の項目値番号の上位の桁の値０の出現回数が４回であることを表し、Ｃｏｕｎｔ−３［１］はＣＰＵ−３の担当範囲内の項目値番号の上位の桁の値１の出現回数が０回であることを表す。 FIG. 34A thru | or 36B is explanatory drawing of the count step of the 2nd step of a multistep parallel sort method. In substep 1 of FIG. 34A, for example, CPU-0 reads the value 2 of OrdSet '[0], reads the value 4 of VNo [2] using the read value 2 as a subscript, Using the value 1 of the quotient (DIV4) divided by 4 as a subscript, the value 0 of Count-0 [1] is incremented to 1. Similarly, CPU-1 reads the value 12 of OrdSet '[5], reads this value 12 as a subscript, reads the value 4 of VNo [12], uses the value 1 of DIV4 of this value 4 as a subscript, and count- The value 0 of 1 [1] is incremented to 1. Subsequently, by executing sub-step 2 in FIG. 34B, sub-step 3 in FIG. 35A, sub-step 4 in FIG. 35B and sub-step 5 in FIG. 36A, the count-up result of the second stage as shown in FIG. 36B is obtained. can get. The element Count-0 [i] of the array Count-0 in FIG. 36B is the item value of the age corresponding to each record in the range of OrdSet ′ [0] to OrdSet [4] of the array OrdSet ′ that the CPU-0 was responsible for. It represents the number of occurrences of the value i of the upper digit of the number. For example, Count-0 [0] indicates that the number of occurrences of the value 0 of the upper digit of the item value number within the CPU-0's assigned range is four, and Count-3 [1] is CPU-3. This means that the number of occurrences of the value 1 of the upper digit of the item value number in the assigned range is 0 times.

図３７は多段階並列ソート方法の第２段階の累計数化ステップの説明図である。本例では、昇順ソートに対応して、項目値番号の上位の桁の値の昇順に累計数化を行う。多段階化によって項目値番号の上位の桁の値の個数は２個に削減されているので、本例では、たとえば、ＣＰＵ−０がすべての値の累計数化を担当する。図３７に示されるように、ＣＰＵ−０は、Ｃｏｕｎｔ−０［０］、Ｃｏｕｎｔ−１［０］、Ｃｏｕｎｔ−２［０］、Ｃｏｕｎｔ−３［０］、Ｃｏｕｎｔ−０［１］、Ｃｏｕｎｔ−１［１］、Ｃｏｕｎｔ−２［１］、及び、Ｃｏｕｎｔ−３［１］の順に累計数化を行う。勿論、本例の場合に、ＣＰＵ−０とＣＰＵ−１の２台のＣＰＵに項目値番号の上位の桁の値０と１を割り当て、２台のＣＰＵが累計数化演算を行ってもよい。 FIG. 37 is an explanatory diagram of the cumulative number step in the second stage of the multi-stage parallel sorting method. In this example, corresponding to the ascending sort, the cumulative number is performed in ascending order of the value of the upper digit of the item value number. Since the number of the upper digits of the item value number is reduced to two by multi-stepping, in this example, for example, CPU-0 is in charge of accumulating all values. As shown in FIG. 37, the CPU-0 counts Count-0 [0], Count-1 [0], Count-2 [0], Count-3 [0], Count-0 [1], Count- Accumulation is performed in the order of 1 [1], Count-2 [1], and Count-3 [1]. Of course, in the case of this example, the CPU 0 and CPU-1 may be assigned the upper digits 0 and 1 of the item value number to the two CPUs, and the two CPUs may perform the cumulative number calculation. .

図３８Ａ乃至４０Ｂは多段階並列ソート方法の第２段階においてレコード番号をさらなるレコード番号配列に格納する転送ステップの説明図である。転送ステップでは、各ＣＰＵは、レコード番号配列ＯｒｄＳｅｔから自分が担当する範囲内のレコード番号を読み出し、次に、そのレコード番号を添字として、ポインタ配列ＶＮｏから項目値番号の上位の桁の値を読み出し、さらに、この項目値番号の上位の桁の値を添字として、自プロセッサに関連付けられた累計数化されたＣｏｕｎｔ配列から累計数値を読み出し、この読み出された累計数値をポイントしてさらなるレコード番号配列ＯｒｄＳｅｔ”にレコード番号を格納すると共に、Ｃｏｕｎｔ配列の累計数値を１ずつインクリメントする。図４０Ｂはこのような転送ステップの結果として第２段階で得られたレコード番号配列ＯｒｄＳｅｔ”を表す。 FIGS. 38A to 40B are explanatory diagrams of a transfer step of storing record numbers in a further record number array in the second stage of the multistage parallel sort method. In the transfer step, each CPU reads the record number within the range that it is in charge of from the record number array OrdSet, and then reads the value of the upper digit of the item value number from the pointer array VNo using the record number as a subscript. Further, using the value of the upper digit of this item value number as a subscript, the cumulative numerical value is read from the accumulated count array associated with the processor, and the further cumulative record number is pointed to the read cumulative numerical value. The record number is stored in the array OrdSet ″, and the cumulative value of the Count array is incremented by 1. FIG. 40B shows the record number array OrdSet ″ obtained in the second stage as a result of such a transfer step.

本実施例の多段階並列ソート方法は項目値番号の下位の桁と上位の桁の２段階により構成されているので、これ以上のソート処理は行われない。したがって、第２段階で得られたレコード番号配列ＯｒｄＳｅｔ”が最初のレコード番号配列ＯｒｄＳｅｔを年齢に関して昇順にソートを行った結果である。 Since the multi-stage parallel sorting method of the present embodiment is composed of two stages of the lower digit and the upper digit of the item value number, no further sort processing is performed. Therefore, the record number array OrdSet "obtained in the second stage is the result of sorting the first record number array OrdSet in ascending order with respect to age.

図４１Ａ及び４２Ｂは、図４に示されたデータ構造に対して本発明の実施の形態にかかる昇順の多段階並列ソート方法を適用した結果を示す図である。本例では、年齢に関する昇順ソートを行ったので、結果のレコード番号配列ＯｒｄＳｅｔ”には、年齢の項目値として１６歳、１８歳、２０歳、２１歳及び２３歳を有するレコードが年齢順に並んでいることがわかる。また、年齢が一致するレコードの順番は、元のレコード番号配列ＯｒｄＳｅｔ中の順番が保存されている。 41A and 42B are diagrams showing the results of applying the ascending multi-stage parallel sorting method according to the embodiment of the present invention to the data structure shown in FIG. In this example, since the ascending order regarding age is performed, records having the age item values of 16, 18, 20, 21, and 23 are arranged in order of age in the resulting record number array OrdSet ”. In addition, the order of records having the same age is stored in the original record number array OrdSet.

また、上記の多段階並列ソート方法は昇順ソートであるが、本実施例による多段階並列ソートは降順ソートでも同様に動作する。さらに、多段階並列ソートの各段階における累計数化演算は、複数台のプロセッサで並列処理してもよく、或いは、少なくとも１台、好ましくは、１台のプロセッサが単独で処理してもよい。 Although the above-described multi-stage parallel sorting method is ascending order sorting, the multi-stage parallel sorting according to the present embodiment operates similarly in descending order sorting. Further, the cumulative number calculation in each stage of the multi-stage parallel sort may be processed in parallel by a plurality of processors, or may be processed by at least one, preferably one processor alone.

［多段階ソート］
上記の多段階並列ソートは、最下位の桁から始めて順番に現在の桁に関するソート処理を行い、最後に最上位の桁に関するソート処理を行うことによって最終的なソートを完了している。これに対して、最上位の桁から始めて順番に現在の桁に関するソート処理を行い、最後に最下位の桁に関するソート処理を行うことによって最終的なソートを完了することも可能である。以下では、このような最上位から最下位の順にソート処理を多段化する方法を簡単に説明する。この多段化手法は、上述の多項目逐次ソート処理に同様に適用される。[Multi-level sorting]
In the above multi-stage parallel sort, the final sort is completed by performing the sort process for the current digit in order starting from the least significant digit and finally the sort process for the most significant digit. On the other hand, it is also possible to complete the final sorting by performing the sorting process for the current digit in order starting from the most significant digit and finally the sorting process for the least significant digit. In the following, a method of performing multi-stage sort processing in order from the highest level to the lowest level will be briefly described. This multi-stage technique is similarly applied to the above-described multi-item sequential sorting process.

本例では、図４３に示されるようなデータ構造を利用する。また、本例では、ＣＰＵの台数は１台とする。また、以下では、年齢の項目に関して、年齢の昇順にソートする場合を考える。レコードの総数はレコード番号０からレコード番号１９までの２０個であり、項目値番号は０から８までの９個である。すなわち、実際の年齢の値は、１５、１６、１８、１９、２０、２１、２３、２５及び２８の９通りである。図４３のデータ構造では、年齢に関する項目値番号ＶＮｏは０から８までの値を取り得るので、基数＝４として項目値番号を分解すると、項目値番号を４で割った商が上の桁の値であり、項目値番号のモジュロ（４）の値が下の桁の値である。項目値番号の上の桁は０、１及び２の３通りの値を取り、下の桁は０、１、２及び３の４通りの値を取り得る。 In this example, a data structure as shown in FIG. 43 is used. In this example, the number of CPUs is one. In the following description, it is assumed that the items of age are sorted in ascending order of age. The total number of records is 20 from record number 0 to record number 19, and the item value number is 9 from 0 to 8. That is, there are nine actual age values of 15, 16, 18, 19, 20, 21, 23, 25, and 28. In the data structure of FIG. 43, the item value number VNo regarding age can take a value from 0 to 8, so when the item value number is decomposed with the radix = 4, the quotient obtained by dividing the item value number by 4 is the upper digit. It is a value, and the value of the modulo (4) of the item value number is the value of the lower digit. The upper digit of the item value number can take three values, 0, 1, and 2, and the lower digit can take four values, 0, 1, 2, and 3.

最初に、第１段階において、上の桁の値０、１及び２の出現回数をカウントするための配列Ｃｏｕｎｔ−１を準備し、要素を０で初期化する。たとえば、Ｃｏｕｎｔ−１［０］は、項目値番号の上位の桁の値が０であるレコードの個数をカウントするための領域である。 First, in the first stage, an array Count-1 for counting the number of occurrences of the upper digits 0, 1, and 2 is prepared, and the elements are initialized with 0. For example, Count-1 [0] is an area for counting the number of records in which the value of the upper digit of the item value number is 0.

次に、レコード番号配列ＯｒｄＳｅｔの先頭の要素（すなわち、レコード）から順番に、その要素に対応する項目値番号を配列ＶＮｏから読み出し、その項目値番号を４で割った商の値をポインタとして用いて、配列Ｃｏｕｎｔ−１の要素の値をインクリメントする。図４４Ａ乃至Ｄは、ＯｒｄＳｅｔ［０］＝０、ＯｒｄＳｅｔ［７］＝７、及び、ＯｒｄＳｅｔ［１９］＝１９の３個のレコード番号について、項目値番号の上位の桁の値を算出し、該当するカウンタをカウントアップし、次に累計数化する例の説明図である。図４４Ｃからわかるように、この第１段階のカウントアップ処理により、項目値番号の上位の桁の値が０であるレコードの個数は１２個、上位の桁の値が１であるレコードの個数は７個、上位の桁の値が２であるレコードの個数は１個である。さらに、図４４Ｄに示されるように、このカウント値を累計数化する。 Next, in order from the first element (that is, record) of the record number array OrdSet, the item value number corresponding to the element is read from the array VNo, and the quotient value obtained by dividing the item value number by 4 is used as a pointer. Then, the value of the element of the array Count-1 is incremented. 44A to 44D calculate the values of the upper digits of the item value numbers for the three record numbers of OrdSet [0] = 0, OrdSet [7] = 7, and OrdSet [19] = 19. It is explanatory drawing of the example which counts up the counter to perform, and makes it a cumulative number next. As can be seen from FIG. 44C, by this first-stage count-up process, the number of records in which the upper digit value of the item value number is 0 is 12, and the number of records in which the upper digit value is 1 is The number of records having 7 and the value of the upper digit is 2 is 1. Further, as shown in FIG. 44D, this count value is accumulated.

次に、項目値番号の上位の桁の値の出現回数が累計数化された配列Ａｇｇｒ−１を用いて、レコード番号配列ＯｒｄＳｅｔをさらなるレコード番号配列ＯｒｄＳｅｔ’に変換する。具体的には、ＯｒｄＳｅｔ［ｉ］＝ｊであるならば、ＶＮｏ［ｊ］を読み出し、このＶＮｏ［ｊ］を４で割った商（ＶＮｏ［ｊ］ＤＩＶ４）をｋとすると、Ａｇｇｒ−１［ｋ］の値を読み出し、ＯｒｄＳｅｔ［Ａｇｇｒ−１［ｋ］］にレコード番号ｊを設定し、Ａｇｇｒ−１［ｋ］をインクリメントする。図４５Ａ、Ｂは、このような多段階ソートにおけるレコード番号転送処理の説明図であり、図４５ＡはＯｒｄＳｅｔ［０］の転送を、図４５ＢはＯｒｄＳｅｔ［１９］の転送を表している。図４６は、第１段階のレコード番号転送の結果のレコード番号配列ＯｒｄＳｅｔ’と、上位の桁の値が分布する範囲とを表している。たとえば、上位の桁の値が０であるレコードはレコード番号配列ＯｒｄＳｅｔ’のＯｒｄＳｅｔ’［０］からＯｒｄＳｅｔ’［１１］の範囲（区間０）に分布し、上位の桁の値が１であるレコードはレコード番号配列ＯｒｄＳｅｔ’のＯｒｄＳｅｔ’［１２］からＯｒｄＳｅｔ’［１８］の範囲（区間１）に分布し、上位の桁の値が２であるレコードはレコード番号配列ＯｒｄＳｅｔ’のＯｒｄＳｅｔ’［１９］（区間２）に存在する。 Next, the record number array OrdSet is converted into a further record number array OrdSet 'using the array Aggr-1 in which the number of appearances of the upper digits of the item value number is accumulated. Specifically, if OrdSet [i] = j, VNo [j] is read, and if the quotient (VNo [j] DIV 4) obtained by dividing VNo [j] by 4 is k, Aggr-1 The value of [k] is read, record number j is set in OrdSet [Aggr-1 [k]], and Aggr-1 [k] is incremented. 45A and 45B are explanatory diagrams of record number transfer processing in such a multi-stage sort. FIG. 45A shows transfer of OrdSet [0], and FIG. 45B shows transfer of OrdSet [19]. FIG. 46 shows a record number array OrdSet 'as a result of the first-stage record number transfer and a range in which the upper digit values are distributed. For example, records whose upper digit value is 0 are distributed in the range (Section 0) from OrdSet ′ [0] to OrdSet ′ [11] of the record number array OrdSet ′, and the value of the upper digit is 1. Are distributed in the range (Section 1) from OrdSet ′ [12] to OrdSet ′ [18] of the record number array OrdSet ′, and the record whose upper digit value is 2 is OrdSet ′ [19] of the record number array OrdSet ′. It exists in (Section 2).

次に、多段階ソートの第２段階では、各区間内で、項目値番号の下位の桁の値によってレコード番号をソートする。たとえば、ＯｒｄＳｅｔ’の区間１は、ＯｒｄＳｅｔ”の対応した区間１へ転送される。第２段階のソートでは、既に上位の桁で区間が定められているので、レコード番号が区間外に転送されることはない。 Next, in the second stage of multi-stage sorting, the record numbers are sorted by the value of the lower digit of the item value number within each section. For example, section 1 of OrdSet ′ is transferred to section 1 corresponding to OrdSet ″. In the second sort, since the section is already determined by the upper digit, the record number is transferred outside the section. There is nothing.

図４７は、多段階ソートの第２段階の初期状態を表す図である。以下の説明では、ＯｒｄＳｅｔ’の区間１について説明する。たとえば、複数台のプロセッサが存在する場合には、区間ごとにプロセッサを割り当てることにより、以下の処理を並列化することも可能である。Ｃｏｕｎｔ−２は区間１内で項目値番号の下位の桁の値（０，１，２，３）の出現回数をカウントするための配列である。 FIG. 47 is a diagram illustrating an initial state of the second stage of the multistage sorting. In the following description, section 1 of OrdSet ′ will be described. For example, when there are a plurality of processors, it is possible to parallelize the following processes by assigning a processor to each section. Count-2 is an array for counting the number of appearances of the value (0, 1, 2, 3) of the lower digit of the item value number in section 1.

図４８Ａ乃至Ｃは、多段階ソートの第２段階のカウントアップ及び累計数化の説明図である。図４８Ａから始めて順番にカウントアップすることにより、図４８Ｂに示されるようなカウントアップ配列が得られる。このカウントアップ配列は、図４８Ｃに示されるように累計数化される。 FIGS. 48A to 48C are explanatory diagrams of the second stage count-up and accumulation in the multi-stage sort. By counting up sequentially starting from FIG. 48A, a count-up sequence as shown in FIG. 48B is obtained. This count-up array is accumulated as shown in FIG. 48C.

最後に、第２の累計数配列Ａｇｇｒ−２をポインタとして利用して、レコード番号配列ＯｒｄＳｅｔ’の区間１をレコード番号配列ＯｒｄＳｅｔ”の区間１へ転送することにより、多段階ソートが完了する。図４９Ａ、Ｂは、多段階ソートの第２段階のレコード番号転送の説明図である。具体的には、ＯｒｄＳｅｔ’［ｉ］＝ｊであるならば、ＶＮｏ［ｊ］を読み出し、このＶＮｏ［ｊ］を４で割った余り（ＶＮｏ［ｊ］ＭＯＤ４）をｋとすると、Ａｇｇｒ−２［ｋ］の値を読み出し、ＯｒｄＳｅｔ”［Ａｇｇｒ−２［ｋ］］にレコード番号ｊを設定し、Ａｇｇｒ−２［ｋ］をインクリメントする。図４９ＡはＯｒｄＳｅｔ’［１４］の転送を、図４９ＢはＯｒｄＳｅｔ’［１８］の転送を表している。図４９ＢのＯｒｄＳｅｔ”の区間１は、区間１の最終的なソート結果を表している。 Finally, using the second cumulative number array Aggr-2 as a pointer, the section 1 of the record number array OrdSet 'is transferred to the section 1 of the record number array OrdSet ", thereby completing the multi-stage sorting. 49A and 49B are explanatory diagrams of the transfer of the record number in the second stage of the multi-stage sort, specifically, if OrdSet ′ [i] = j, VNo [j] is read and this VNo [j ] Is divided by 4 (VNo [j] MOD 4), k is read, the value of Aggr-2 [k] is read, record number j is set in OrdSet ”[Aggr-2 [k]], and Aggr -2 [k] is incremented. 49A shows the transfer of OrdSet '[14], and FIG. 49B shows the transfer of OrdSet' [18]. The section 1 of “OrdSet” in FIG. 49B represents the final sorting result of the section 1.

区間１と同様に、その他の区間０、区間２についても第２段階のカウントアップ、累計数化、及び、レコード番号転送を適用することにより、レコード番号配列ＯｒｄＳｅｔの全体がレコード番号配列ＯｒｄＳｅｔ”へ転送され、ソートが完了する。 Similar to the section 1, by applying the second-stage count-up, accumulation, and record number transfer to the other sections 0 and 2, the entire record number array OrdSet is changed to the record number array OrdSet ”. Transfer and complete sorting.

実施例３では、１００万行（１Ｍ）、５００万行（５Ｍ）、１０００万行（１０Ｍ）、５０００万行（５０Ｍ）及び１億行（１００Ｍ）の３次元表形式テーブルをソートし、一意の項目値の組を抽出し、各項目値の組の出現回数をカウントする集計処理を以下の５通りの方式、すなわち、
（１）多段階並列ソート方式（旧方式）
（２）本発明による多項目逐次ソートとレコード番号昇順形式を組み合わせた方式（ＯｒｄＳｅｔ昇順方式）
（３）本発明による多項目一括ソートとレコード番号昇順形式を組み合わせた方式（ＯｒｄＳｅｔ昇順方式（３次元一括ソート））
（４）本発明による多項目逐次ソートとグループ番号昇順形式を組み合わせた方式（次元昇順方式）
（５）本発明による多項目一括ソートとグループ番号昇順形式を組み合わせた方式（次元昇順方式（３次元一括ソート））
を用いて実施した。そして、各方式の処理時間を計測し、本発明による方式が旧方式に対してどの程度高速化されたかを表す高速化倍率を算出した。本実施例の測定環境は次の通りである。
コンピュータ機種：ＤｅｌｌＰｏｗｅｒＥｄｇｅ^ＴＭ６８００
ＣＰＵ：Ｉｎｔｅｌ（登録商標）デュアルコアプロセッサ×２
クロック数：３．１６ＧＨｚ
メモリサイズ：３２ＧＢ
Ｌ２キャッシュ：１ＭＢ×４
ＯＳ：ＳｕＳＥ２．６．５−７−１９１−ｓｍｐ
図５０Ａ乃至Ｃは実施例３における集計処理の内容を説明する図である。図５０Ａは出力される集計結果テーブルを表し、図５０Bは１００万行からなる集計元テーブルを表し、図５０Ｃは６３２３３４個のレコードからなる集計結果テーブルを表している。In the third embodiment, the three-dimensional tabular table of 1 million rows (1M), 5 million rows (5M), 10 million rows (10M), 50 million rows (50M) and 100 million rows (100M) is sorted and unique. The following five methods, that is, a totaling process for extracting a set of item values and counting the number of occurrences of each set of item values, are as follows:
(1) Multi-stage parallel sort method (old method)
(2) A method combining multi-item sequential sorting and record number ascending order according to the present invention (OrdSet ascending order method)
(3) A method combining multi-item batch sorting and record number ascending order according to the present invention (OrdSet ascending order method (three-dimensional batch sorting))
(4) Method combining multi-item sequential sorting and group number ascending order according to the present invention (dimensional ascending order method)
(5) Method combining multi-item batch sorting and group number ascending order according to the present invention (dimensional ascending order method (three-dimensional batch sorting))
It carried out using. Then, the processing time of each method was measured, and a speed-up magnification representing how fast the method according to the present invention was compared with the old method was calculated. The measurement environment of this example is as follows.
Computer model: Dell PowerEdge ^™ 6800
CPU: Intel (registered trademark) dual-core processor x 2
Number of clocks: 3.16 GHz
Memory size: 32GB
L2 cache: 1MB x 4
OS: SuSE 2.6.5-7-191-smp
50A to 50C are diagrams for explaining the contents of the aggregation process in the third embodiment. FIG. 50A shows an output totaling result table, FIG. 50B shows a totaling source table consisting of 1 million rows, and FIG. 50C shows a totaling result table consisting of 632334 records.

図５１Ａ乃至Ｃは実施例３の測定結果を示す図である。図５１Ａは、上記の５方式による処理の所要時間（ミリ秒）を表し、図５１Ｂは本発明による上記の（２）から（５）の方式が旧方式（１）に対して高速化された倍率を表し、図５１Ｃは図５１Ｂと同じ高速化倍率をグラフ化したものである。 51A to 51C are diagrams showing measurement results of Example 3. FIG. FIG. 51A shows the required time (milliseconds) of processing by the above five methods, and FIG. 51B shows that the above methods (2) to (5) according to the present invention are faster than the old method (1). FIG. 51C is a graph showing the same high speed magnification as FIG. 51B.

（５）次元昇順方式（３次元一括ソート）、（３）ＯｒｄＳｅｔ昇順方式（３次元一括ソート）、（４）次元昇順方式、（２）ＯｒｄＳｅｔ昇順方式の順に高速化され、高速化倍率は、状況によっては、旧方式の１０倍に達している。 (5) Dimension ascending order method (3D batch sort), (3) OrdSet ascending order method (3D batch sort), (4) Dimension ascending order method, (2) OrdSet ascending order method. Depending on the situation, it has reached 10 times the old method.

実施例４では、実施例３と同じ測定環境において、１００万行（１Ｍ）、５００万行（５Ｍ）、１０００万行（１０Ｍ）、５０００万行（５０Ｍ）及び１億行（１００Ｍ）の３次元表形式テーブルをソートし、一意の項目値の組を抽出し、測度の合計を算出する集計処理を実施例２と同様に上記の５通りの方式を用いて実施した。そして、各方式の処理時間を計測し、本発明による方式が旧方式に対してどの程度高速化されたかを表す高速化倍率を算出した。 In Example 4, in the same measurement environment as Example 3, 3 million lines (1M), 5 million lines (5M), 10 million lines (10M), 50 million lines (50M), and 100 million lines (100M) A tabulation process for sorting the dimension table format table, extracting a set of unique item values, and calculating the total of the measures was performed using the above five methods in the same manner as in the second embodiment. Then, the processing time of each method was measured, and a speed-up magnification representing how fast the method according to the present invention was compared with the old method was calculated.

図５２Ａ乃至Ｃは実施例４における集計処理の内容を説明する図である。図５２Ａは出力される集計結果テーブルを表し、図５２Bは１００万行からなる集計元テーブルを表し、図５２Ｃは６３２３３４個のレコードからなる集計結果テーブルを表している。 52A to 52C are diagrams for explaining the contents of the aggregation process in the fourth embodiment. FIG. 52A shows an output totaling result table, FIG. 52B shows a totaling source table consisting of 1 million rows, and FIG. 52C shows a totaling result table consisting of 632334 records.

図５３Ａ乃至Ｃは実施例４の測定結果を示す図である。図５３Ａは、上記の５方式による処理の所要時間（ミリ秒）を表し、図５３Ｂは本発明による上記の（２）から（５）の方式が旧方式（１）に対して高速化された倍率を表し、図５３Ｃは図５３Ｂと同じ高速化倍率をグラフ化したものである。 53A to 53C are diagrams showing measurement results of Example 4. FIG. FIG. 53A shows the required time (milliseconds) of processing by the above five methods, and FIG. 53B shows that the above methods (2) to (5) according to the present invention are faster than the old method (1). FIG. 53C is a graph showing the same high speed magnification as FIG. 53B.

集計結果テーブルのサイズにかかわらず、３次元一括ソート方式が逐次ソート方式より高速に集計処理を実現している。さらに、集計結果テーブルのサイズが比較的小さい場合（５００万行未満の場合）、次元昇順方式がＯｒｄＳｅｔ昇順方式より高速化されているが、集計結果テーブルのサイズが大きくなると、ＯｒｄＳｅｔ昇順方式が次元昇順方式より高速化されることがわかった。 Regardless of the size of the tabulation result table, the three-dimensional batch sort method realizes tabulation processing faster than the sequential sort method. Furthermore, when the size of the aggregation result table is relatively small (less than 5 million rows), the dimension ascending method is faster than the OrdSet ascending method, but when the size of the aggregation result table is increased, the OrdSet ascending method is dimensioned. It was found that it was faster than the ascending method.

実施例３及び実施例４から、本発明の方式は、（２）から（５）の何れの方式であっても、大規模な表形式データの多次元集計処理を従来技術より高速に実現することが実証された。 From Example 3 and Example 4, the method of the present invention realizes multi-dimensional tabulation processing of large-scale tabular data at higher speed than the prior art, regardless of which method is (2) to (5). It was proved.

【００２０】
理は、たとえば、Ｃ言語スタイルでは、
ｆｏｒ（ｉ＝０；ｉ＜４；ｉ＋＋）｛
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
Ｃｏｕｎｔ［ｊ］［Ｖ＿ＶＮｏ［ＯｒｄＳｅｔ［ｊ＊４＋ｉ］］］＋＋；
｝
｝
と記述される。図７の例では、初期化されたカウント配列７１０、７１１、７１２及び７１３は、仮想項目値番号の数え上げ処理によって、カウント配列７２０、７２１、７２２及び７２３となる。尚、本例において、第１の項目値配列をＶＬ＿１、第２の項目値配列をＶＬ＿２とすると、カウント配列Ｃｏｕｎｔのサイズは、（ＶＬ＿１のサイズ）×（ＶＬ＿２）のサイズに一致する。
［００５７］
この数え上げ処理の結果は、たとえば、ＣＰＵ−０の受け持ち範囲内では、仮想項目値番号＝０、２、５及び９がそれぞれ１個ずつ存在し、ＣＰＵ−１の受け持ち範囲内では、仮想項目値番号＝６、７、１０及び１１がそれぞれ１個ずつ存在し、ＣＰＵ−２の受け持ち範囲内では、仮想項目値番号０、３、４及び５がそれぞれ１個ずつ存在し、ＣＰＵ−３の受け持ち範囲内では、仮想項目値番号＝２、５、１０及び１１がそれぞれ１個ずつ存在することを表している。
［００５８］
続いて、本発明の一実施形態では、コンピュータシステム１０は、仮想項目値番号配列の各要素を累計数化して、仮想項目値番号の出現回数の累積度数分布配列を生成する。図８は、本発明の一実施形態による累積度数分布配列を生成する処理の説明図である。プロセッサｊの受け持ち範囲内のカウント配列をＣｏｕｎｔ［ｊ］、累積度数分布配列をＡｇｇｒ［ｊ］とすると、累積度数分布は、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ａｇｇｒ［３］［０］＋Ｃｏｕｎｔ［３］［０］
Ａｇｇｒ［１］［１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］[0020]
For example, in C language style,
for (i = 0; i <4; i ++) {
for (j = 0; j <4; j ++) {
Count [j] [V_VNo [OrdSet [j * 4 + i]]] ++;
}
}
Is described. In the example of FIG. 7, the initialized count arrays 710, 711, 712, and 713 become count arrays 720, 721, 722, and 723 by the virtual item value number counting process. In this example, when the first item value array is VL_1 and the second item value array is VL_2, the size of the count array Count is equal to the size of (VL_1 size) × (VL_2).
[0057]
As a result of the counting process, for example, one virtual item value number = 0, 2, 5, and 9 exists in the CPU-0 range, and a virtual item value exists in the CPU-1 range. Number = 6, 7, 10 and 11 respectively, and within the CPU-2 range, there is one virtual item value number 0, 3, 4 and 5, respectively, and CPU-3 In the range, one virtual item value number = 2, 5, 10 and 11 is present.
[0058]
Subsequently, in one embodiment of the present invention, the computer system 10 accumulates each element of the virtual item value number array to generate a cumulative frequency distribution array of the number of appearances of the virtual item value number. FIG. 8 is an explanatory diagram of processing for generating a cumulative frequency distribution array according to an embodiment of the present invention. When the count array within the range of the processor j is Count [j] and the cumulative frequency distribution array is Aggr [j], the cumulative frequency distribution is
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Aggr [3] [0] + Count [3] [0]
Aggr [1] [1] = Aggr [0] [1] + Count [0] [1]

【００３０】
び１５２３となる。尚、本例において、カウント配列Ｃｏｕｎｔのサイズは、対象としている項目の項目値番号配列をＶＬとし、プロセッサの台数をＮとすると、（ＶＬのサイズ）×Ｎに一致する。
［００８６］
続いて、本発明の一実施形態では、コンピュータシステム１０は、項目値番号配列の各要素を累計数化して、項目値番号の出現回数の累積度数分布配列を生成する。図１６は、本発明の一実施形態による多項目逐次ソート処理の下位次元ソート処理における累積度数分布配列を生成する処理の説明図である。プロセッサｊの受け持ち範囲内のカウント配列をＣｏｕｎｔ［ｊ］、累積度数分布配列をＡｇｇｒ［ｊ］とすると、累積度数分布は、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ａｇｇｒ［３］［０］＋Ｃｏｕｎｔ［３］［０］
Ａｇｇｒ［１］［１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］
Ａｇｇｒ［２］［１］＝Ａｇｇｒ［１］［１］＋Ｃｏｕｎｔ［１］［１］
Ａｇｇｒ［３］［１］＝Ａｇｇｒ［２］［１］＋Ｃｏｕｎｔ［２］［１］
Ａｇｇｒ［０］［２］＝Ａｇｇｒ［３］［１］＋Ｃｏｕｎｔ［３］［１］
Ａｇｇｒ［１］［２］＝Ａｇｇｒ［０］［２］＋Ｃｏｕｎｔ［０］［２］
Ａｇｇｒ［２］［２］＝Ａｇｇｒ［１］［２］＋Ｃｏｕｎｔ［１］［２］
Ａｇｇｒ［３］［２］＝Ａｇｇｒ［２］［２］＋Ｃｏｕｎｔ［２］［２］
Ａｇｇｒ［０］［３］＝Ａｇｇｒ［３］［２］＋Ｃｏｕｎｔ［３］［２］
Ａｇｇｒ［１］［３］＝Ａｇｇｒ［０］［３］＋Ｃｏｕｎｔ［０］［３］
Ａｇｇｒ［２］［３］＝Ａｇｇｒ［１］［３］＋Ｃｏｕｎｔ［１］［３］
Ａｇｇｒ［３］［３］＝Ａｇｇｒ［２］［３］＋Ｃｏｕｎｔ［２］［３］
として計算される。この計算は、４台のプロセッサのうちのいずれか１台のプロセッサが単独で行ってもよいが、４台のプロセッサに項目値番号の範囲を割り当てて、４台のプロセッサによって並列に行ってもよい。[0030]
1523. In this example, the size of the count array Count is equal to (VL size) × N, where the item value number array of the target item is VL and the number of processors is N.
[0086]
Subsequently, in one embodiment of the present invention, the computer system 10 generates a cumulative frequency distribution array of the number of occurrences of item value numbers by accumulating each element of the item value number array. FIG. 16 is an explanatory diagram of a process for generating a cumulative frequency distribution array in the low-order sort process of the multi-item sequential sort process according to an embodiment of the present invention. When the count array within the range of the processor j is Count [j] and the cumulative frequency distribution array is Aggr [j], the cumulative frequency distribution is
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Aggr [3] [0] + Count [3] [0]
Aggr [1] [1] = Aggr [0] [1] + Count [0] [1]
Aggr [2] [1] = Aggr [1] [1] + Count [1] [1]
Aggr [3] [1] = Aggr [2] [1] + Count [2] [1]
Aggr [0] [2] = Aggr [3] [1] + Count [3] [1]
Aggr [1] [2] = Aggr [0] [2] + Count [0] [2]
Aggr [2] [2] = Aggr [1] [2] + Count [1] [2]
Aggr [3] [2] = Aggr [2] [2] + Count [2] [2]
Aggr [0] [3] = Aggr [3] [2] + Count [3] [2]
Aggr [1] [3] = Aggr [0] [3] + Count [0] [3]
Aggr [2] [3] = Aggr [1] [3] + Count [1] [3]
Aggr [3] [3] = Aggr [2] [3] + Count [2] [3]
Is calculated as This calculation may be performed by any one of the four processors alone, or may be performed in parallel by four processors by assigning a range of item value numbers to the four processors. Good.

【００３１】
［００８７］
４台のプロセッサによって並列的に累積度数分布を計算する場合には、最初に、各項目値番号ｉの出現回数の合計値Ｓｕｍ［ｉ］を、たとえば、
ｆｏｒ（ｉ＝０；ｉ＜４：ｉ＋＋）｛
ｆｏｒ（ｊ＝０；ｊ＜４；ｊ＋＋）｛
Ｓｕｍ［ｉ］＝Ｃｏｕｎｔ［ｊ］［ｉ］；
｝
｝
によって、並列的に計算する。本実施形態では、たとえば、ＣＰＵ−０がｉ＝０のループを担当し、ＣＰＵ−１がｉ＝１のループを担当し、ＣＰＵ−２がｉ＝２のループを担当し、ＣＰＵ−３がｉ＝３のループを担当する。
［００８８］
続いて、たとえば、４台のプロセッサが並列的に、
Ａｇｇｒ［０］［０］＝０
Ａｇｇｒ［１］［０］＝Ａｇｇｒ［０］［０］＋Ｃｏｕｎｔ［０］［０］
Ａｇｇｒ［２］［０］＝Ａｇｇｒ［１］［０］＋Ｃｏｕｎｔ［１］［０］
Ａｇｇｒ［３］［０］＝Ａｇｇｒ［２］［０］＋Ｃｏｕｎｔ［２］［０］
Ａｇｇｒ［０］［１］＝Ｓｕｍ［０］
Ａｇｇｒ［１］「１］＝Ａｇｇｒ［０］［１］＋Ｃｏｕｎｔ［０］［１］
Ａｇｇｒ［２］［１］＝Ａｇｇｒ［１］［１］＋Ｃｏｕｎｔ［１］［１］
Ａｇｇｒ［３］［１］＝Ａｇｇｒ［２］［１］＋Ｃｏｕｎｔ［２］［１］
Ａｇｇｒ［０］［２］＝Ｓｕｍ［１］
Ａｇｇｒ［１］［２］＝Ａｇｇｒ［０］［２］＋Ｃｏｕｎｔ［０］［２］
Ａｇｇｒ［２］［２］＝Ａｇｇｒ［１］［２］＋Ｃｏｕｎｔ［１］［２］
Ａｇｇｒ［３］［２］＝Ａｇｇｒ［２］［２］＋Ｃｏｕｎｔ［２］［２］
Ａｇｇｒ［０］［３］＝Ｓｕｍ［２］
Ａｇｇｒ［１］［３］＝Ａｇｇｒ［０］［３］＋Ｃｏｕｎｔ［０］［３］
Ａｇｇｒ［２］［３］＝Ａｇｇｒ［１］［３］＋Ｃｏｕｎｔ［１］［３］
Ａｇｇｒ［３］［３］＝Ａｇｇｒ［２］［３］＋Ｃｏｕｎｔ［２］［３］
を実行することによって、累積度数分布配列が得られる。[0031]
[0087]
When the cumulative frequency distribution is calculated in parallel by four processors, first, the total value Sum [i] of the number of occurrences of each item value number i is, for example,
for (i = 0; i <4: i ++) {
for (j = 0; j <4; j ++) {
Sum [i] = Count [j] [i];
}
}
To calculate in parallel. In this embodiment, for example, CPU-0 is in charge of the i = 0 loop, CPU-1 is in charge of the i = 1 loop, CPU-2 is in charge of the i = 2 loop, and CPU-3 is in charge of the loop. Take charge of the loop of i = 3.
[0088]
Subsequently, for example, four processors are connected in parallel,
Aggr [0] [0] = 0
Aggr [1] [0] = Aggr [0] [0] + Count [0] [0]
Aggr [2] [0] = Aggr [1] [0] + Count [1] [0]
Aggr [3] [0] = Aggr [2] [0] + Count [2] [0]
Aggr [0] [1] = Sum [0]
Aggr [1] “1” = Aggr [0] [1] + Count [0] [1]
Aggr [2] [1] = Aggr [1] [1] + Count [1] [1]
Aggr [3] [1] = Aggr [2] [1] + Count [2] [1]
Aggr [0] [2] = Sum [1]
Aggr [1] [2] = Aggr [0] [2] + Count [0] [2]
Aggr [2] [2] = Aggr [1] [2] + Count [1] [2]
Aggr [3] [2] = Aggr [2] [2] + Count [2] [2]
Aggr [0] [3] = Sum [2]
Aggr [1] [3] = Aggr [0] [3] + Count [0] [3]
Aggr [2] [3] = Aggr [1] [3] + Count [1] [3]
Aggr [3] [3] = Aggr [2] [3] + Count [2] [3]
To obtain a cumulative frequency distribution array.

【００３８】
ここでは、特許文献３に記載されているような、カウンティングソートに基づく並列ソートと基数ソートの考え方とを組み合わせることにより実現される多段階並列ソートの実施例を説明する。この多段階並列ソートは、上述の多項目一括ソート処理及び多項目逐次ソート処理の基礎となる技術である。
［０１１３］
項目値配列ＶＬのサイズが大きいとき、すなわち、項目値番号の個数が多数であるときには、項目値番号を基数で表現し、桁ごとに並列ソートを実施することにより、効率的なソートを実現することが可能である。本例の多段階並列ソートは、最下位の桁から始めて順番に現在の桁に関するソート処理を行い、最後に最上位の桁に関するソート処理を行うことによって最終的なソートを完了する。
［０１１４］
多段階並列ソート方法の一例では、図２４のデータ構造を利用する。本実施の形態では、ＣＰＵの台数は４台とし、すべてのＣＰＵが並列に動作する例を考える。システム内のＣＰＵの総数、及び、並列に動作するＣＰＵの台数はこの例に限定されないことに注意すべきである。また、以下では、説明の便宜上、年齢の項目に関して、年齢の昇順にソートする場合を考える。また、年齢の項目値配列の要素は年齢の昇順に並べられている。図４のデータ構造では、年齢に関する項目値番号ＶＮｏは０から４までの値を取り得るので、基数＝４として項目値番号を分解すると、項目値番号は下の桁と上の桁の２桁に分解される。具体的には、項目値番号のモジュロ（４）の値が下の桁の値であり、項目値番号を４で割った商が上の桁の値である。
［０１１５］
図２５は、本実施例にかかる多段階並列ソート方法のフローチャートである。多段階並列ソート方法は、ステップ２５０１からステップ２５０５の５ステップにより構成される。
［０１１６］
ステップ２５０１：項目値番号の範囲に応じて項目値番号の基数（本例では基数＝４）を選択し、初期のレコード番号配列ＯｒｄＳｅｔを現在のレコード番号配列に設定し、項目値番号の最下位の桁（本例では項目値番号のモジュロ（４）の値）を現在の桁に設定する。
［０１１７］
ステップ２５０２：現在のレコード番号配列を分割して４台のプロセッサに割り当てる。
［０１１８］
ステップ２５０３：４台のプロセッサのうちの各プロセッサにおいて、割り当てられたレコード番号配列の部分に含まれるレコードに対応した項目値番号の現在の桁の値の出現回数をカウントする。[0038]
Here, an embodiment of multi-stage parallel sorting realized by combining parallel sorting based on counting sort and the concept of radix sort as described in Patent Document 3 will be described. This multi-stage parallel sort is a technology that is the basis of the above-described multi-item batch sort process and multi-item sequential sort process.
[0113]
When the size of the item value array VL is large, that is, when there are a large number of item value numbers, the item value numbers are expressed in radixes, and parallel sorting is performed for each digit to achieve efficient sorting. It is possible. In the multi-stage parallel sort of this example, the sorting process for the current digit is sequentially performed starting from the least significant digit, and the final sorting is completed by finally performing the sorting process for the most significant digit.
[0114]
In an example of the multistage parallel sorting method, the data structure of FIG. 24 is used. In the present embodiment, the number of CPUs is four, and an example in which all CPUs operate in parallel is considered. It should be noted that the total number of CPUs in the system and the number of CPUs operating in parallel are not limited to this example. Further, in the following, for convenience of explanation, consider a case where the items of age are sorted in ascending order of age. The elements of the item value array for age are arranged in ascending order of age. In the data structure of FIG. 4, the item value number VNo relating to age can take a value from 0 to 4. Therefore, when the item value number is decomposed with the radix = 4, the item value number has two digits, the lower digit and the upper digit. Is broken down into Specifically, the modulo (4) value of the item value number is the lower digit value, and the quotient obtained by dividing the item value number by 4 is the upper digit value.
[0115]
FIG. 25 is a flowchart of the multi-stage parallel sorting method according to the present embodiment. The multi-stage parallel sorting method includes five steps from step 2501 to step 2505.
[0116]
Step 2501: Select the radix of the item value number (in this example, radix = 4) according to the range of the item value number, set the initial record number array OrdSet to the current record number array, and set the lowest item value number (In this example, the modulo (4) value of the item value number) is set to the current digit.
[0117]
Step 2502: The current record number array is divided and assigned to four processors.
[0118]
Step 2503: In each of the four processors, the number of occurrences of the current digit value of the item value number corresponding to the record included in the allocated record number array portion is counted.

Claims

Record number array in which record numbers of records in tabular data are stored according to a predetermined record order A shared memory that stores a number array and an item value array in which the item values of the tabular data are stored according to the order of the item value numbers corresponding to the item values;
A plurality of processors capable of accessing the shared memory;
In a shared memory multiprocessor system with
A method for extracting a unique set of item values for a given set of items, comprising:
Cumulative frequency distribution that uses the plurality of processors to count in parallel the number of occurrences of a set of item value numbers corresponding to a set of item values relating to the set of predetermined items, and expresses the cumulative frequency distribution of the number of appearances Generating an array in the shared memory;
Using the plurality of processors, reading the item value number pairs in parallel using the element value of the record number array as a subscript, and using the value calculated from the item value number set as a subscript Reading the elements of the cumulative frequency distribution array in parallel, and storing the values of the elements of the record number array in parallel in the sorted record number array using the elements of the cumulative frequency distribution array as an index pointer; ,
Dividing the sorted record number array according to the value of the element of the cumulative frequency distribution array used as the index pointer into a group of record numbers of records having the unique field value pairs;
Identifying the unique set of field values by extracting one record number from each group of the segmented record numbers;
A method comprising:

The step of generating the cumulative frequency distribution array in parallel in the shared memory is performed by using the plurality of processors, and for each record number stored in the record number array, an item value related to the predetermined item set By setting the radix according to the range of the item value numbers included in the set of item value numbers corresponding to the set of item values, the set of item value numbers is changed to each item value number included in the set of item value numbers. Converting to a virtual field value number in a radix format representing each digit,
The method of claim 1.

(I) generating the cumulative frequency distribution array in parallel in the shared memory for each item included in the set of predetermined items, and the record number array for each item included in the set of predetermined items. A step (ii) of storing element values in parallel in the sorted record number array, and the sorted record number array for each item included in the predetermined set of items, Performing the step (iii) of classifying records into groups of record numbers of records having
The step (i) of generating the cumulative frequency distribution array in the shared memory in parallel uses the plurality of processors to count in parallel the number of occurrences of the item value number corresponding to the item value related to the item, Generating a cumulative frequency distribution array representing the cumulative frequency distribution of the occurrence counts in parallel in the shared memory,
The step (ii) of storing the elements of the record number array in parallel in the sorted record number array is performed using the plurality of processors and the values of the element of the item value number array as subscripts. Read the item value number in parallel, read the element of the cumulative frequency distribution array in parallel using the item value number as a subscript, and use the element of the cumulative frequency distribution array as an index pointer in the record number array Storing element values in parallel in the sorted record number array;
The method of claim 1.

The method according to claim 1, further comprising the step of aggregating item values related to different items for each set of the predetermined item values in parallel using the plurality of processors.

Using the plurality of processors, the record number in the sorted record number array and the group number of the group corresponding to the record number are used as an auxiliary record number array and auxiliary number using the record number as a subscript. Storing in parallel group number arrays;
Using the plurality of processors, the elements of the auxiliary record number array are used as subscripts to read out elements in parallel from the item value number array relating to the other item, and the elements of the item value number array are used as subscripts. To read the item value in parallel from the item value array for the other item, to read the element of the auxiliary group number array using the element of the auxiliary record number array as a subscript, and to Storing the aggregation result of the read item values in the aggregation array using the elements of the number array as a subscript of the aggregation array;
The method of claim 1, further comprising:

Using the plurality of processors, the elements of the sorted record number array are used as subscripts to read out elements in parallel from the item value number array relating to the other item, and the elements of the item value number array are used as subscripts. To read the item value in parallel from the item value array for the other item, and read the item value using the group number of the group corresponding to the element of the sorted record number array as the subscript of the aggregate array The method according to claim 1, further comprising the step of storing the totaling result in the totaling array.

Record number array in which record numbers of records in tabular data are stored according to a predetermined record order A shared memory that stores a number array and an item value array in which the item values of the tabular data are stored according to the order of the item value numbers corresponding to the item values;
A plurality of processors capable of accessing the shared memory;
In a shared memory multiprocessor system with
A device that extracts a unique set of item values for a given set of items,
Cumulative frequency distribution that uses the plurality of processors to count in parallel the number of occurrences of the set of item value numbers corresponding to the set of item values related to the set of predetermined items, and express the cumulative frequency distribution of the number of appearances Means for generating an array in the shared memory;
Using the plurality of processors, the item value number pairs are read in parallel using the element value of the record number array as a subscript, and the value calculated from the item value number set is used as a subscript. Means for reading the elements of the cumulative frequency distribution array in parallel, and storing the values of the elements of the record number array in parallel in the sorted record number array using the elements of the cumulative frequency distribution array as an index pointer; ,
Means for dividing the record number array sorted according to the element value of the cumulative frequency distribution array used as the index pointer into a group of record numbers of records having the unique item value pairs;
Means for identifying the set of unique item values by extracting one record number from each group of the record numbers that are classified;
A device comprising:

The means for generating the cumulative frequency distribution array in parallel in the shared memory, using the plurality of processors, for each record number stored in the record number array, an item value relating to the predetermined set of items By setting the radix according to the range of the item value numbers included in the set of item value numbers corresponding to the set of item values, each item value number included in the set of item value numbers Comprises means for converting to a virtual item value number in a radix format representing each digit,
The apparatus of claim 7.

The means for generating the cumulative frequency distribution array in parallel in the shared memory uses the plurality of processors for each item included in the predetermined item set, and the item value corresponding to the item value related to the item Means for counting up the number of appearances of the number in parallel, and generating a cumulative frequency distribution array expressing the cumulative frequency distribution of the number of appearances in the shared memory in parallel;
The means for storing the elements of the record number array in parallel in the sorted record number array uses the plurality of processors for each item included in the set of the predetermined items, and the item value number array The item value number is read in parallel using the value of the element in the subscript, the element of the cumulative frequency distribution array is read in parallel using the item value number as the subscript, and the element of the cumulative frequency distribution array is indexed Means for storing the values of the elements of the record number array in parallel in the sorted record number array by using as pointers;
The apparatus of claim 7.

The apparatus according to claim 7, further comprising means for collecting, in parallel, item values related to different items for each set of the predetermined item values using the plurality of processors.

Using the plurality of processors, the record number in the sorted record number array and the group number of the group corresponding to the record number are used as an auxiliary record number array and auxiliary number using the record number as a subscript. Means for storing in parallel group number arrays;
Using the plurality of processors, the elements of the auxiliary record number array are used as subscripts to read out elements in parallel from the item value number array relating to the other item, and the elements of the item value number array are used as subscripts. To read the item value in parallel from the item value array for the other item, to read the element of the auxiliary group number array using the element of the auxiliary record number array as a subscript, and to Means for storing the aggregation result of the read item value in the aggregation array using an element of the number array as a subscript of the aggregation array;
The apparatus of claim 7, further comprising:

Using the plurality of processors, the elements of the sorted record number array are used as subscripts to read out elements in parallel from the item value number array relating to the other item, and the elements of the item value number array are used as subscripts. To read the item value in parallel from the item value array for the other item, and use the group number of the group corresponding to the element of the sorted record number array as the subscript of the aggregate array to read the item value The apparatus according to claim 7, further comprising means for storing a totaling result of the data in the totaling array.

Record number array in which record numbers of records in tabular data are stored according to a predetermined record order A shared memory that stores a number array and an item value array in which the item values of the tabular data are stored according to the order of the item value numbers corresponding to the item values;
A plurality of processors capable of accessing the shared memory;
A computer-readable program that causes a computer to execute code for extracting a unique set of item values for a predetermined set of items, loaded on a computer comprising:
Cumulative frequency distribution that uses the plurality of processors to count in parallel the number of occurrences of the set of item value numbers corresponding to the set of item values related to the set of predetermined items, and express the cumulative frequency distribution of the number of appearances Code for generating an array in the shared memory;
Using the plurality of processors, the item value number pairs are read in parallel using the element value of the record number array as a subscript, and the value calculated from the item value number set is used as a subscript. A code for reading the elements of the cumulative frequency distribution array in parallel, and storing the values of the elements of the record number array in parallel in the sorted record number array using the elements of the cumulative frequency distribution array as an index pointer; ,
A code for classifying the sorted record number array according to the value of the element of the cumulative frequency distribution array used as the index pointer into a group of record numbers of records having the unique set of field values;
A code that identifies the unique set of field values by extracting one record number from each group of the record numbers that are segmented;
Including programs.

The code for generating the cumulative frequency distribution array in parallel in the shared memory is an item value related to the predetermined set of items for each record number stored in the record number array using the plurality of processors. By setting the radix according to the range of the item value numbers included in the set of item value numbers corresponding to the set of item values, the set of item value numbers is changed to each item value number included in the set of item value numbers. Comprises a code that converts to a virtual field value number in radix format representing each digit,
The program according to claim 13.

Code (i) for generating the cumulative frequency distribution array in parallel in the shared memory for each item included in the predetermined item set, and the record number array for each item included in the predetermined item set A code (ii) for storing element values in parallel in the sorted record number array, and the sorted record number array for each item included in the predetermined set of items A code (iii) for executing a code for dividing the record into groups of record numbers of records having
The code (i) for generating the cumulative frequency distribution array in parallel in the shared memory counts in parallel the number of occurrences of the item value number corresponding to the item value related to the item using the plurality of processors, A code for generating a cumulative frequency distribution array expressing the cumulative frequency distribution of the appearance frequency in the shared memory in parallel;
A code (ii) for storing the elements of the record number array in parallel in the sorted record number array, using the plurality of processors, the value of the element of the item value number array as a subscript Read the item value number in parallel, read the element of the cumulative frequency distribution array in parallel using the item value number as a subscript, and use the element of the cumulative frequency distribution array as an index pointer in the record number array Comprising code for storing element values in parallel in the sorted record number array;
The program according to claim 13.

The program according to claim 13, further comprising a code that aggregates item values related to different items in parallel for each set of the predetermined item values using the plurality of processors.

Using the plurality of processors, the record number in the sorted record number array and the group number of the group corresponding to the record number are used as an auxiliary record number array and auxiliary number using the record number as a subscript. Code to store in parallel group number array,
Using the plurality of processors, the elements of the auxiliary record number array are used as subscripts to read the elements in parallel from the item value number array related to the other item, and the elements of the item value number array are used as subscripts. To read the item value in parallel from the item value array relating to the other item, to read the element of the auxiliary group number array using the element of the auxiliary record number array as a subscript, and to A code for storing the aggregation result of the read item value in the aggregation array using an element of the number array as a subscript of the aggregation array;
The program according to claim 13, further comprising:

Using the plurality of processors, the elements of the sorted record number array are used as subscripts to read out elements in parallel from the item value number array relating to the other item, and the elements of the item value number array are used as subscripts. To read the item value in parallel from the item value array for the other item, and use the group number of the group corresponding to the element of the sorted record number array as the subscript of the aggregate array to read the item value The program according to claim 13, further comprising a code for storing the totaling result in the totaling array.

A record number array in which the record numbers of the records in the tabular data are stored according to a predetermined record order, and an item value in which the item value numbers corresponding to the field values of the predetermined items in the above records of the tabular data are stored according to the record numbers A shared memory for storing an item value array in which an item value of the table data is stored in accordance with the order of the item value number corresponding to the item value, and a plurality of processors accessible to the shared memory And a computer program product for causing the computer to execute the method of claim 1 for extracting a unique set of item values for a predetermined set of items.

A record number array in which the record numbers of the records in the tabular data are stored according to a predetermined record order, and an item value in which the item value numbers corresponding to the field values of the predetermined items in the above records of the tabular data are stored according to the record numbers A shared memory for storing an item value array in which an item value of the table data is stored in accordance with the order of the item value number corresponding to the item value, and a plurality of processors accessible to the shared memory And a computer-readable storage medium storing a computer program for causing the computer to execute the method according to claim 1, which is loaded into a computer comprising: