JPH1153404A

JPH1153404A - Multidimensional totalizing process method

Info

Publication number: JPH1153404A
Application number: JP21109597A
Authority: JP
Inventors: Yoshinori Yamagishi; 義徳山岸
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 1997-08-06
Filing date: 1997-08-06
Publication date: 1999-02-26

Abstract

PROBLEM TO BE SOLVED: To shorten the time needed for a totalizing process by dividing a multidimensional data base into totalization units and not using a totalization unit which is used for totalization in each step in the next step. SOLUTION: A request for multidimensional totalization is inputted through an input device 1 and analyzed in a data controller 2. Then division into partial totalization units is performed on the basis of the analysis result. Further, parallelism is calculated by steps as to the divided partial totalization units and the execution procedure of partial totalizing process performed in the respective steps is determined in consideration of the number of DB dedicated processors 5. Then the partial totalization units are allocated to the DB dedicated processors 5 until all the steps of parallel execution end and the parallel execution is carried out. To perform execution in partial totalization units, a data file is read out of an external storage device 4 and after a totalizing process is performed, the result is written to the external storage device 4. The totalization result which is obtained finally is outputted to an output device 3.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、データベース処
理方法に関し、特に多次元データベース管理システムの
集計処理方法の改良に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a database processing method and, more particularly, to an improvement in a tallying method of a multidimensional database management system.

【０００２】[0002]

【従来の技術】一般に、多次元データベース管理システ
ムにおいては、高速な集計処理の実現のために索引を用
いる方法が採用されている。索引を用いる場合、予め索
引を生成して処理する方法と実行時に索引を生成して処
理する方法の大きくは２つに分けられる。多次元データ
ベースのように明細レベルのデータである明細データに
対しては、頻繁に更新されるため、検索の際には索引の
再生成が頻発することになる。2. Description of the Related Art In general, in a multidimensional database management system, a method using an index is employed for realizing a high-speed aggregation process. When an index is used, a method of generating and processing an index in advance and a method of generating and processing an index at the time of execution are roughly divided into two methods. Detailed data, which is data at the detailed level, such as a multidimensional database, is frequently updated, so that the index is frequently regenerated during a search.

【０００３】一方、別のアプローチとして索引を使用す
ることなしに高速なソートの実現により、集計処理を行
う方法がある。明細データから多次元の集計結果を得よ
うとする場合、各次元ごとの集計をソート中心のデータ
処理により行うことで最終的な集計結果を得る方法であ
る。明細データに対して全次元を一つのキーとして集計
した結果を一次集計データとした場合、Ｎ次元の集計は
一次集計データに対してＮ回の、各次元に対する集計処
理を行うことにより可能である。On the other hand, as another approach, there is a method of performing a totalizing process by realizing a high-speed sorting without using an index. When a multi-dimensional total result is to be obtained from the detailed data, this is a method of obtaining the final total result by performing totaling for each dimension by data processing centered on sorting. When the result of totaling all the dimensions of the detailed data as one key is used as primary aggregated data, N-dimensional aggregation can be performed by performing the aggregation processing for each dimension N times for the primary aggregated data. .

【０００４】次にこの索引を使用しない多次元集計処理
における従来技術の動作について説明する。図１３は、
４次元集計処理で算出される集計結果を示しており、51
は明細データに対する一次集計結果、52は一次集計デー
タ51を入力とした場合の次元１に対する集計結果、53は
51、52を入力とした場合の次元２に対する集計結果、54
は51、52、53を入力とした場合の次元３に対する集計結
果、55は51、52、53、54を入力とした場合の次元４に対
する集計結果である。従来、４次元の集計結果を算出す
る場合、図１４に示すように、一次集計結果０（黒で示
すデータ）に対して、ステップ１として集計結果０（黒
で示すデータ）を入力として集計結果10（黒で示すデー
タ）を算出して集計結果０、10を出力し、ステップ２と
して集計結果０、10を入力として集計結果20を算出して
集計結果０、10、20を出力し、ステップ３として集計結
果０、10、20を入力として集計結果30を算出して集計結
果０、10、20、30を出力し、ステップ４として集計結果
０、10、20、30を入力として集計結果40を算出して集計
結果０、10、20、30、40を出力していた。[0004] Next, the operation of the prior art in the multidimensional aggregation processing without using the index will be described. FIG.
The tabulation result calculated by the four-dimensional tabulation process is shown.
Is the primary aggregation result for the detail data, 52 is the aggregation result for dimension 1 when the primary aggregation data 51 is input, and 53 is the
Counting result for dimension 2 when 51 and 52 are input, 54
Is a totaling result for dimension 3 when 51, 52, 53 is input, and 55 is a totaling result for dimension 4 when 51, 52, 53, 54 is input. Conventionally, when calculating a four-dimensional tally result, as shown in FIG. 14, the first tally result 0 (data shown in black) is input to the tally result 0 (data shown in black) as a step 1 as shown in FIG. 10 (data shown in black) is calculated and the tally results 0 and 10 are output. In step 2, the tally results 0 and 10 are input to calculate tally results 20 and the tally results 0, 10, and 20 are output. The totaling result 0, 10, 20, 30 is calculated as the input, the totaling result 30 is calculated, the totaling result 0, 10, 20, 30 is output, and the totaling result 0, 10, 20, 30 is input as the step 4, the totaling result 40 is input as the step 4. And output the tally results 0, 10, 20, 30, and 40.

【０００５】[0005]

【発明が解決しようとする課題】従来の多次元集計処理
方法においては、ステップＮの次元Ｎに対する集計処理
を行うためにステップ（Ｎ−１）の全ての出力結果を待
つ必要があり、複数のＣＰＵあるいは複数のデータ処理
専用装置を持つシステムにおいては集計処理を分割した
並列実行を行うことができず、高速な処理には不向きで
あった。In the conventional multidimensional totaling method, it is necessary to wait for all the output results of the step (N-1) in order to perform the totaling process for the dimension N in the step N. In a system having a CPU or a plurality of data processing-dedicated devices, it is not possible to perform the parallel processing by dividing the tallying process, and it is not suitable for high-speed processing.

【０００６】また、ステップＮの処理はステップ（Ｎ−
１）の集計結果を入力した集計処理であり、出力結果と
しては次元Ｎの集計結果に加えて入力データも結果とし
て一緒に出力しているため、次元数が大きい場合にはス
テップが進むにしたがって入力データが雪だるま式に大
きくなり、処理時間が増大するという問題があった。The processing in step N is performed in step (N-
This is a tallying process in which the tallying result of 1) is input. As the output result, in addition to the tallying result of the dimension N, the input data is also output as a result. There has been a problem that the input data increases in a snowball manner and the processing time increases.

【０００７】この発明は、上記のような問題点を解消す
るためになされたもので、各ステップの集計処理におい
て並列実行可能な処理単位に分割し、その部分的集計単
位を高速に処理するための方法を提供することを目的と
する。SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned problems, and is intended to divide the totaling process of each step into processing units which can be executed in parallel and to process the partial totaling unit at high speed. The purpose is to provide a method.

【０００８】[0008]

【課題を解決するための手段】この発明に係る多次元集
計処理方法は、明細レベルのデータファイル及び各次元
の階層構造を保持した階層構造情報テーブルを有する多
次元データベースシステムにおいて、集計データからそ
れぞれ直接集計することにより得られる各次元の集計単
位である第１の集計データを集計するステップ、上記第
１の集計データからそれぞれ直接集計することにより得
られる各次元の集計単位である第２の集計データを集計
するステップからなるものである。According to the present invention, there is provided a multi-dimensional totaling method, comprising: a multi-dimensional database system having a detailed level data file and a hierarchical structure information table holding a hierarchical structure of each dimension; A step of totalizing the first aggregation data, which is a totaling unit of each dimension obtained by directly totaling, a second totaling, which is a totaling unit of each dimension, obtained by directly totaling each of the first totaling data; It consists of a step of counting data.

【０００９】また、複数の集計単位からなる上記第１の
集計データ又は上記第２の集計データをそれぞれ同時に
集計処理するものである。Further, the first total data or the second total data composed of a plurality of total units are totaled simultaneously.

【００１０】さらに、同時に集計処理する集計単位を２
回以上に分けて集計処理するものである。[0010] Furthermore, the totaling unit to be processed at the same time is 2
The tallying process is performed in more than one time.

【００１１】また、上記各次元ごとに集計値０の集計結
果レコードを生成し、上記各次元に対応した集計結果レ
コードであることを示す数字からなる昇順又は降順のブ
レークキー情報を上記集計結果レコードに付加するもの
である。In addition, a totaling result record having a totaling value of 0 is generated for each dimension, and break key information consisting of numbers indicating the totaling result record corresponding to each dimension is stored in the totaling result record in ascending or descending order. Is added to.

【００１２】さらにまた、上記複数の集計単位のいずれ
からでも同一の集計単位を集計することが可能なとき上
記複数の集計単位の内のいずれか一つを排他的に独占し
て集計するものである。Further, when it is possible to total the same total unit from any of the plural total units, one of the plural total units is exclusively monopolized and totaled. is there.

【００１３】また、上記複数の集計単位のそれぞれの集
計単位から求めた集計結果を比較検証するものである。[0013] The present invention also compares and verifies the tally results obtained from each of the plurality of tally units.

【００１４】さらに、明細レベルのデータファイル及び
各次元の階層構造を保持した階層構造情報テーブルを有
する多次元データベースを複数有するシステムにおい
て、第１の多次元データベースと第２の多次元データベ
ースの一次集計を同時に処理するものである。Further, in a system having a plurality of multi-dimensional databases having a data file at a detailed level and a hierarchical structure information table holding a hierarchical structure of each dimension, a first totalization of a first multi-dimensional database and a second multi-dimensional database is performed. Are simultaneously processed.

【００１５】また、上記第１の多次元データベースの一
次集計を集計処理し、上記第２の多次元データベースの
一次集計を上記第１の多次元データベースの第１のステ
ップと同時に集計処理し以降それぞれの多次元データベ
ースの連続する各ステップを逐次同時に集計処理するも
のである。[0015] The primary tabulation of the first multidimensional database is tabulated, and the primary tabulation of the second multidimensional database is tabulated simultaneously with the first step of the first multidimensional database. Of the multi-dimensional database are sequentially and simultaneously counted.

【００１６】さらにまた、上記第１の多次元データベー
スの一次集計を集計処理し、上記第２の多次元データベ
ースの一次集計を上記第１の多次元データベースの第２
のステップと同時に集計処理し以降それぞれの多次元デ
ータベースの連続する各ステップを逐次同時に集計処理
するものである。Furthermore, the primary tabulation of the first multidimensional database is tabulated, and the primary tabulation of the second multidimensional database is converted to the second tabulation of the first multidimensional database.
Is performed at the same time as the above step, and thereafter, successive steps of each multidimensional database are sequentially and simultaneously processed.

【００１７】[0017]

BEST MODE FOR CARRYING OUT THE INVENTION

実施の形態１．図１は本発明の実施の形態１における多
次元データベース処理装置の構成図を示し、１は多次元
集計の要求を入力する入力装置、２は集計処理を制御す
るデータ制御装置、３は集計結果を出力する出力装置、
４は多次元データベース及び集計結果を格納する外部記
憶装置、５は各次元の集計単位毎に集計処理をするデー
タベース専用処理装置（以下ＤＢ専用処理装置と称す
る）を示す。Embodiment 1 FIG. FIG. 1 is a configuration diagram of a multidimensional database processing apparatus according to Embodiment 1 of the present invention, in which 1 is an input device for inputting a request for multidimensional aggregation, 2 is a data control device for controlling aggregation processing, and 3 is an aggregation result. An output device that outputs
Reference numeral 4 denotes a multidimensional database and an external storage device for storing the results of counting, and reference numeral 5 denotes a database dedicated processing device (hereinafter, referred to as a DB dedicated processing device) that performs a counting process for each counting unit of each dimension.

【００１８】次に図２の処理手順に基づいて、図１の装
置を対応させながら処理の過程を説明する。まず、入力
装置１より多次元集計の要求を入力し（ステップＳ
１）、データ制御装置２内でその要求の解析を行う（ス
テップＳ２）。次にその解析結果を元に図３に示すよう
な部分的集計単位に分割する（ステップＳ３）。さらに
分割した部分的集計単位に関して図４で示すような各ス
テップごとでの並列度の計算を行い、本装置に接続され
ているＤＢ専用処理装置５の数を考慮して各ステップご
とで実行する部分的集計処理の実行手順を決める（ステ
ップＳ４）。次に、並列実行の全ステップが終了するま
で各部分的集計単位をＤＢ専用処理装置５に割り付けて
並列実行を行う（ステップＳ５）。部分的集計単位の実
行においては外部記憶装置４からデータファイルの読込
みを行い、集計処理を行った後、その結果を外部記憶装
置４に書込む。最後に、得られた集計結果を出力装置３
に出力する（ステップＳ６）。Next, based on the processing procedure of FIG. 2, the process of processing will be described while corresponding to the apparatus of FIG. First, a request for multidimensional aggregation is input from the input device 1 (Step S).
1), the request is analyzed in the data control device 2 (step S2). Next, based on the analysis result, the data is divided into partial tabulation units as shown in FIG. 3 (step S3). Further, the degree of parallelism is calculated for each step as shown in FIG. 4 with respect to the divided partial aggregation units, and is executed for each step in consideration of the number of DB dedicated processing devices 5 connected to the present apparatus. The execution procedure of the partial tabulation process is determined (step S4). Next, until all the steps of the parallel execution are completed, the respective partial aggregation units are allocated to the DB dedicated processing device 5 and the parallel execution is performed (step S5). In the execution of the partial counting unit, the data file is read from the external storage device 4, the counting process is performed, and the result is written in the external storage device 4. Finally, the obtained totaling result is output to the output device 3
(Step S6).

【００１９】図３は部分的集計単位に分割して部分的集
計処理を実行する様子を示しているが、従来技術と異な
るところは、集計単位を各次元に対応して細分化するこ
とにより、並列実行できる集計単位が発生することにあ
る。図３において、集計単位ごとに括弧書きで付記して
あるもの、例えば（←０）は一次集計データである集計
単位０の集計データから直接集計できることを示してお
り、同じく（←０）が付された集計単位は集計単位10と
集計単位30と集計単位40であり、それぞれが並列実行で
きることを意味している。FIG. 3 shows a state in which a partial tabulation process is executed by dividing the data into partial tabulation units. The difference from the prior art is that the tabulation unit is subdivided corresponding to each dimension. The point is that there is an aggregation unit that can be executed in parallel. In FIG. 3, the one added in parentheses for each tabulation unit, for example, (← 0) indicates that the tabulation data can be directly tabulated from the tabulation data of the tabulation unit 0 which is the primary tabulation data. The tallying units are the tallying unit 10, the tallying unit 30, and the tallying unit 40, which means that they can be executed in parallel.

【００２０】図４は、４次元集計処理おける各集計段階
での並列度を示した図である。一次集計の段階では並列
度１であり、第１のステップでは集計単位０を入力とし
て集計結果である集計単位10、20、30、40を算出するた
めの集計処理を並列実行することが可能であり、並列度
は４である。第２、第３、第４のステップの部分的集計
処理における並列度はそれぞれ６、４、１である。一般
的に、Ｎ次元の集計処理におけるステップｉでの並列度
はNＣｉ（Ｎ個の中からｉ個を取出す組合せ）となる。FIG. 4 is a diagram showing the degree of parallelism at each counting stage in the four-dimensional counting process. At the stage of the primary aggregation, the degree of parallelism is 1, and in the first step, the aggregation processing for calculating the aggregation results 10, 20, 30, and 40, which are the aggregation results, can be executed in parallel by inputting the aggregation unit 0. Yes, the degree of parallelism is 4. The degrees of parallelism in the partial aggregation processes in the second, third, and fourth steps are 6, 4, and 1, respectively. In general, the degree of parallelism in step i in the N-dimensional totaling process is NCi (a combination of extracting i pieces from N pieces).

【００２１】図５は、図３で示した４次元の集計結果に
対して、各ステップで算出される集計結果を示した図で
ある。各ステップでは入力データに対してその次元に対
する集計結果のみを出力することにより、データ転送量
を最小限に抑えている。図５において、ステップ１は集
計単位０の集計データから集計単位10、20、30、40の集
計単位からなる第１の集計データが集計される様子を示
しており、ステップ２は集計単位10、20、30、40の集計
単位からなる第１の集計データから集計単位21、31、3
2、41、42、44の集計単位からなる第２の集計データが
集計される様子を示している。同様にしてステップ３、
ステップ４と繰り返すことにより、最終的に４次元の全
ての次元について集計した集計結果が集計単位47に集計
される。このときの、ステップ１による集計結果の具体
例を図６に示す。図６における番号は、図５における集
計単位の番号を示している。FIG. 5 is a diagram showing the tally result calculated in each step with respect to the four-dimensional tally result shown in FIG. In each step, the data transfer amount is minimized by outputting only the aggregation result for the dimension to the input data. In FIG. 5, step 1 shows a state in which the first tabulation data including the tabulation units 10, 20, 30, and 40 is tabulated from the tabulation data of tabulation unit 0, and step 2 is a tabulation unit 10 From the first aggregation data consisting of 20, 30, and 40 aggregation units, the aggregation units 21, 31, 3
The figure shows how the second tabulation data composed of 2, 41, 42, and 44 tabulation units are tabulated. Similarly, step 3,
By repeating step 4, the totaling result finally totaled for all four dimensions is totaled in the totaling unit 47. FIG. 6 shows a specific example of the counting result in step 1 at this time. The numbers in FIG. 6 indicate the numbers of the aggregation units in FIG.

【００２２】上記のように、多次元集計の処理を部分的
集計単位に分割し、複数のＣＰＵあるいは外部のデータ
処理専用装置に割当て、並列に実行させることにより、
高速な多次元集計処理を行うことが可能となる。ここ
で、上記ＤＢ専用処理装置５又はＣＰＵ又は外部のデー
タ処理専用装置の台数に制限がある場合には各集計処理
の処理段階で２回以上に処理を分けて集計することも可
能である。また、入力データを除いた集計結果レコード
のみを次のステップでの入力データとするため処理時間
が短縮できる。As described above, the processing of multidimensional aggregation is divided into partial aggregation units, assigned to a plurality of CPUs or an external data processing device, and executed in parallel.
It is possible to perform high-speed multidimensional aggregation processing. Here, when there is a limit to the number of the DB dedicated processing device 5 or the number of CPUs or external data processing dedicated devices, it is possible to divide the processing into two or more times in the processing stage of each totaling processing. Further, since only the tally result record excluding the input data is used as the input data in the next step, the processing time can be reduced.

【００２３】図４に示すように並列実行可能な部分的集
計単位のセットにおいては、入力するデータファイルが
共通する場合がある。このような場合に部分的集計単位
を実行する各ＤＢ専用処理装置５へのデータ供給方法に
ついて、図７を用いて説明する。部分的集計処理単位の
数をＮとする（ステップＳ１１）。先ず、共通する入力
データファイルをデータ制御装置２内の記憶領域（図示
せず）に読込み（ステップＳ１２）、このデータを各Ｄ
Ｂ専用処理装置５に対して順番に書込むことにより、各
ＤＢ専用処理装置５はそれぞれ部分的集計処理単位に対
して集計し書込む処理を行い（ステップＳ１３）、デー
タの書込み回数がＮに等しくなるまで実行する（ステッ
プＳ１４）。書込んだデータが入力データファイルの最
後のデータでない場合、同様な処理を繰り返す（ステッ
プＳ１５）。このようなデータ供給の方式を採ることに
より、外部記憶装置４とデータ制御装置２との間のデー
タ転送を減らすことができ、データ転送によるオーバヘ
ッドを抑えることができる。As shown in FIG. 4, in a set of partial totalization units that can be executed in parallel, the input data file may be common. A method of supplying data to each DB dedicated processing device 5 that executes a partial totaling unit in such a case will be described with reference to FIG. The number of the partial aggregation processing units is set to N (step S11). First, a common input data file is read into a storage area (not shown) in the data control device 2 (step S12), and this data is stored in each D.
By writing to the B-dedicated processing device 5 in order, each DB-dedicated processing device 5 performs the process of totaling and writing for the partial totaling processing unit (step S13), and the number of times of data writing becomes N The process is executed until the values become equal (step S14). If the written data is not the last data of the input data file, the same processing is repeated (step S15). By adopting such a data supply system, data transfer between the external storage device 4 and the data control device 2 can be reduced, and overhead due to data transfer can be suppressed.

【００２４】図８は部分的集計単位を実行するＤＢ専用
処理装置５の構成を示し、データ処理部６とソート処理
部７とから構成される。データ処理部６はデータ制御装
置５およびソート処理部７とデータのやりとりをＤＭＡ
機能を用いて行うと同時に選択処理や集計演算等のデー
タ処理を行う。図９はデータ処理部６の処理手順につい
て示したものである。ここで明細データファイルとし
て、図１０で示した形式のテーブルを想定し、このテー
ブルのフィールドに対応したカテゴリ構造を図１１とす
る。まず、データ制御装置２よりカテゴリ情報をＤＢ専
用処理装置５内の記憶領域（図示せず）に読込む（ステ
ップＳ２１）。次に明細データファイルのデータを上記
記憶領域にＤＭＡ転送により読込む（ステップＳ２２）
と同時に要求に応じた選択／射影処理を行う（ステップ
Ｓ２３）。射影処理ではソート処理部７にデータを投入
するためのデータ形式に変換する。さらに、このデータ
に対してキーブレーク情報の設定（ステップＳ２４）を
行った後、ソート処理部へＤＭＡ転送によりデータを投
入する（ステップＳ２６）。FIG. 8 shows the configuration of the DB dedicated processing unit 5 that executes a partial tabulation unit, and comprises a data processing unit 6 and a sort processing unit 7. The data processor 6 exchanges data with the data controller 5 and the sort processor 7 by DMA.
At the same time as performing using the function, data processing such as selection processing and total calculation is performed. FIG. 9 shows a processing procedure of the data processing unit 6. Here, a table in the format shown in FIG. 10 is assumed as the detailed data file, and the category structure corresponding to the fields of this table is shown in FIG. First, the data control device 2 reads the category information into a storage area (not shown) in the DB dedicated processing device 5 (step S21). Next, the data of the detailed data file is read into the storage area by DMA transfer (step S22).
At the same time, selection / projection processing according to the request is performed (step S23). In the projection processing, the data is converted into a data format for inputting data to the sort processing unit 7. Further, after setting the key break information for this data (step S24), the data is input to the sort processing unit by DMA transfer (step S26).

【００２５】この処理は明細データファイルの読込みが
終了するまで繰り返す（ステップＳ２５）。明細データ
ファイルのデータの終わりを検知した場合にはカテゴリ
情報に従って集計値０のデータ生成を行い（ステップＳ
２７）、キーブレーク情報の設定（ステップＳ２４）を
行った後、ソート処理部７へその生成データを投入す
る。この処理は集計データの生成が終了する（ステップ
Ｓ２８）まで繰り返される。生成する集計データは、図
３で示したような各次元に対する部分的集計結果に対し
て行う。ソート処理部７によりソートされたデータはデ
ータ処理部６の記憶領域にＤＭＡ転送により読み出され
る（ステップＳ２９）。このデータに対して、集合演算
処理（ステップＳ３０）およびその集合演算結果に対す
る選択処理（ステップＳ３１）を行った後、その集計結
果をデータ制御装置２に出力する（ステップＳ３２）。
ここで出力するデータは、入力データを除いた、部分的
な集計結果のみを出力する。This process is repeated until reading of the detailed data file is completed (step S25). If the end of the data of the detailed data file is detected, data of the total value 0 is generated according to the category information (step S).
27) After setting the key break information (step S24), the generated data is input to the sort processing unit 7. This process is repeated until the generation of the tally data ends (step S28). The total data to be generated is performed on the partial total result for each dimension as shown in FIG. The data sorted by the sort processing unit 7 is read to the storage area of the data processing unit 6 by DMA transfer (step S29). After performing a set operation process (step S30) and a selection process for the set operation result (step S31) on the data, the totalized result is output to the data control device 2 (step S32).
As the data to be output here, only a partial aggregation result excluding the input data is output.

【００２６】図１２は、ソート処理部７に投入するデー
タ形式およびそのデータについて示している。ソート処
理部７に投入されるデータ形式はソートキー列と出力項
目列からなる。ソートキー列は図１１に示すような各次
元ごとのカテゴリ情報からユニークなＩＤに変換した値
を指定する。また、ソートキー列の最後にはブレークキ
ー情報を設定する。ソート処理部７には入力データの他
に部分的集計結果に対応し集計値０の生成データも一緒
に投入されてソートされる。これにより、ソート結果に
対して行うグループ化のためのキーブレーク処理は従来
キー比較を行っていたが、ブレークキー情報の設定によ
りこのキー比較の必要がなくなり、処理の高速化が期待
できる。FIG. 12 shows a data format to be input to the sort processing unit 7 and its data. The data format input to the sort processing unit 7 includes a sort key string and an output item string. The sort key column specifies a value obtained by converting the category information for each dimension as shown in FIG. 11 into a unique ID. At the end of the sort key sequence, break key information is set. In addition to the input data, the sort processing unit 7 also inputs and sorts the generated data of the tally value 0 corresponding to the partial tally result. As a result, the key break processing for grouping performed on the sorting result conventionally performed the key comparison, but the setting of the break key information eliminates the necessity of the key comparison, and the processing speed can be expected to be increased.

【００２７】実施の形態２．実施の形態２では、明細レ
ベルのデータファイル及び各次元の階層構造を保持した
階層構造情報テーブルを有する多次元データベースを複
数有するシステムについて述べる。図１のシステムにお
いて、多次元データベースを複数の外部記憶装置４に格
納しておき、データ制御装置２による制御により、ＤＢ
専用処理装置５を各多次元データベース間で共通に利用
することにより、複数の多次元データベースを同時実行
することが可能である。Embodiment 2 FIG. In the second embodiment, a system having a plurality of multidimensional databases having a data file at a specification level and a hierarchical structure information table holding a hierarchical structure of each dimension will be described. In the system of FIG. 1, the multidimensional database is stored in a plurality of external storage devices 4, and the database is controlled by the data control device 2.
By using the dedicated processing device 5 in common among the multidimensional databases, it is possible to execute a plurality of multidimensional databases simultaneously.

【００２８】ＤＢ専用処理装置５を各多次元データベー
ス間で共通に利用するには、スケジューリングが重要で
あり、図４の明細データ以下並列実行可能な部分的集計
単位のセット全体が複数ある場合に限られたＤＢ専用処
理装置５を有効利用するためには以下のスケジューリン
グが考えられる。Scheduling is important in order to use the DB dedicated processing device 5 in common among the multidimensional databases. In the case where there are a plurality of sets of partial aggregation units that can be executed in parallel from the detailed data in FIG. In order to effectively use the limited DB dedicated processing device 5, the following scheduling is conceivable.

【００２９】先ず、並列度１の一次集計の段階で第１及
び第２の多次元データベースの集計単位０を同時に実行
し、第２の多次元データベースの並列度４の段階である
第１のステップを第１の多次元データベースの並列度６
の段階である第２のステップと同時に集計処理し以降多
次元データベースの連続する各ステップを逐次同時に集
計処理するスケジューリングである。当然のことなが
ら、ここで第３の多次元データベースが存在する場合に
は、第１と第２の多次元データベースの関係と同様にし
て後続処理することになる。このスケジューリングによ
り一次集計の段階での負荷分散が可能になる。また、一
次集計の段階で第１及び第２の多次元データベースの集
計単位０を同時に実行した後のスケジュールは上記の場
合に限られず種々の場合が考えられる。また、ＤＢ専用
処理装置５の台数によっては、各多次元データベースの
各ステップにおいて、２回以上に分けて集計処理するこ
とも可能である。First, in the stage of the primary aggregation of the degree of parallelism 1, the aggregation unit 0 of the first and second multidimensional databases is simultaneously executed, and the first step of the stage of the degree of parallelism 4 of the second multidimensional database is performed. To the degree of parallelism of the first multidimensional database
This is a scheduling in which the tabulation process is performed simultaneously with the second step, which is the stage, and the successive steps of the multidimensional database are sequentially and simultaneously performed. Naturally, if the third multidimensional database exists, the subsequent processing is performed in the same manner as the relationship between the first and second multidimensional databases. This scheduling enables load distribution at the stage of primary aggregation. In addition, the schedule after the totaling unit 0 of the first and second multidimensional databases is simultaneously executed at the stage of the primary aggregation is not limited to the above-described case, and various cases can be considered. Further, depending on the number of DB-dedicated processing devices 5, in each step of each multidimensional database, it is also possible to carry out the tallying process in two or more steps.

【００３０】次は、第１の多次元データベースの一次集
計を集計処理した後、第２の多次元データベースの一次
集計を上記第１の多次元データベースの第１のステップ
と同時に集計処理し以降それぞれの多次元データベース
の連続する各ステップを逐次同時に集計処理するスケジ
ューリングである。このスケジューリングにより明細デ
ータの種類が少ない場合に負荷分散して比較的短時間に
集計処理するのに適している。Next, after the primary aggregation of the first multidimensional database is totalized, the primary aggregation of the second multidimensional database is aggregated simultaneously with the first step of the first multidimensional database. This is a scheduling in which successive steps of the multidimensional database are sequentially and simultaneously counted. This scheduling is suitable for distributing the load when the type of the detailed data is small and performing the totaling processing in a relatively short time.

【００３１】次は、第１の多次元データベースの一次集
計を集計処理した後、第２の多次元データベースの一次
集計を第１の多次元データベースの第２のステップと同
時に集計処理し以降それぞれの多次元データベースの連
続する各ステップを逐次同時に集計処理するスケジュー
リングである。このスケジューリングにより明細データ
の種類が多い場合に少ない台数のＤＢ専用処理装置５に
負荷分散して集計処理するのに適している。Next, after the primary tabulation of the first multidimensional database is processed, the primary tabulation of the second multidimensional database is tabulated simultaneously with the second step of the first multidimensional database. This is a scheduling in which successive steps of the multidimensional database are sequentially and simultaneously counted. With this scheduling, when the type of detailed data is large, it is suitable for distributing the load to a small number of DB-dedicated processing devices 5 and performing the aggregation process.

【００３２】[0032]

【発明の効果】この発明は、以上説明したように構成さ
れているので、以下に示すような効果を奏する。Since the present invention is configured as described above, it has the following effects.

【００３３】多次元データベースを集計単位に分割し各
ステップで集計に使用した集計単位は次のステップでは
使用しないように構成したので、集計処理に要する時間
を短縮することができる。Since the multidimensional database is divided into aggregation units and the aggregation units used for aggregation in each step are not used in the next step, the time required for aggregation processing can be shortened.

【００３４】また、各ステップにおける集計単位を同時
に集計処理するように構成したので、集計処理の処理速
度を早くすることができる。Further, since the totalizing unit in each step is configured to perform the totalizing process at the same time, the processing speed of the totalizing process can be increased.

【００３５】さらに、同時に集計処理する集計単位を２
回以上に分けて集計処理するように構成したので、ＣＰ
Ｕ又はデータ処理専用装置の台数が限られている場合で
も並列実行により処理速度を向上させることができる。Further, the totaling unit to be processed at the same time is 2
Since it is configured to perform the tallying process more than once,
Even when the number of U or dedicated devices for data processing is limited, the processing speed can be improved by parallel execution.

【００３６】また、数字からなる昇順又は降順のブレー
クキー情報を集計結果レコードに付加するように構成し
たので、集計データに集計結果レコードを投入してソー
トすることにより、キー比較をすることなくキーブレー
ク処理を達成することができる。Further, since the ascending or descending break key information consisting of numbers is added to the totaling result record, the totaling result record is added to the totaling data and sorted, so that the key can be compared without performing key comparison. Break processing can be achieved.

【００３７】さらにまた、共通する集計単位の内のいず
れか一つを排他的に独占して集計するように構成したの
で、入力の競合が発生することなく集計処理を実行する
ことが可能となる。Furthermore, since any one of the common tabulation units is exclusively and exclusively tabulated, the tabulation process can be executed without any input conflict. .

【００３８】また、共通する集計単位のそれぞれの集計
単位から求めた集計結果を比較検証するように構成した
ので、集計処理をする過程において集計結果の正誤をチ
ェックすることができる。In addition, since the counting results obtained from the respective counting units of the common counting unit are configured to be compared and verified, it is possible to check whether the counting result is correct or incorrect in the process of performing the counting process.

【００３９】さらに、多次元データベースを複数有する
システムにおいて、第１の多次元データベースと第２の
多次元データベースの一次集計を同時に処理するように
構成したので、複数の多次元データベースの一次集計を
高速に処理することができる。Furthermore, in a system having a plurality of multi-dimensional databases, the first multi-dimensional database and the second multi-dimensional database are configured to process the primary total at the same time. Can be processed.

【００４０】また、多次元データベースを複数有するシ
ステムにおいて、第１の多次元データベースと第２の多
次元データベースの集計処理ステップを１ステップずら
すように構成したので、明細レベルのデータファィルの
種類が少ない場合に負荷分散して比較的短時間に集計処
理することができる。Further, in a system having a plurality of multidimensional databases, the aggregation processing steps of the first multidimensional database and the second multidimensional database are shifted by one step, so that there are few types of detail level data files. In such a case, the load can be distributed and the aggregation process can be performed in a relatively short time.

【００４１】さらに、多次元データベースを複数有する
システムにおいて、第１の多次元データベースと第２の
多次元データベースの集計処理ステップを２ステップず
らすように構成したので、明細レベルのデータファィル
の種類が多い場合に少ない台数のＤＢ専用処理装置５に
負荷分散して集計処理することができる。Furthermore, in a system having a plurality of multi-dimensional databases, the total multi-steps of the first multi-dimensional database and the second multi-dimensional database are shifted by two steps, so that there are many types of detail level data files. In this case, the load can be distributed to a small number of DB-dedicated processing devices 5 to perform the aggregation process.

[Brief description of the drawings]

【図１】この発明の実施の形態１の多次元データベー
ス処理装置を示す構成図である。FIG. 1 is a configuration diagram illustrating a multidimensional database processing device according to a first embodiment of the present invention.

【図２】この発明の実施の形態１の多次元データベー
ス処理装置の処理手順を示すフローチャートである。FIG. 2 is a flowchart illustrating a processing procedure of the multidimensional database processing device according to the first embodiment of the present invention.

【図３】この発明の実施の形態１の多次元集計処理方
法を示す模式図である。FIG. 3 is a schematic diagram illustrating a multidimensional aggregation processing method according to the first embodiment of the present invention;

【図４】この発明の実施の形態１の多次元集計処理方
法を示すタイミングチャートである。FIG. 4 is a timing chart illustrating a multidimensional aggregation processing method according to the first embodiment of the present invention;

【図５】この発明の実施の形態１の多次元集計処理方
法を示す説明図である。FIG. 5 is an explanatory diagram showing a multidimensional aggregation processing method according to the first embodiment of the present invention;

【図６】この発明の実施の形態１の多次元集計処理方
法を示す適用事例である。FIG. 6 is an application example showing the multidimensional aggregation processing method according to the first embodiment of the present invention.

【図７】この発明の実施の形態１の同一ファイルを入
力とする場合の処理手順を示すフローチャートである。FIG. 7 is a flowchart showing a processing procedure when the same file is input according to the first embodiment of the present invention.

【図８】この発明の実施の形態１のＤＢ専用処理装置
を示す構成図である。FIG. 8 is a configuration diagram illustrating a DB-dedicated processing device according to the first embodiment of the present invention;

【図９】この発明の実施の形態１のＤＢ専用処理装置
の処理手順を示すフローチャートである。FIG. 9 is a flowchart illustrating a processing procedure of the DB dedicated processing device according to the first embodiment of the present invention;

【図１０】この発明の実施の形態１の明細データテー
ブルを示す表である。FIG. 10 is a table showing a detailed data table according to the first embodiment of the present invention.

【図１１】この発明の実施の形態１のカテゴリ構造を
示す構成図である。FIG. 11 is a configuration diagram showing a category structure according to the first embodiment of the present invention.

【図１２】ソート処理部へのデータ投入形式を示す説
明図である。FIG. 12 is an explanatory diagram showing a data input format to a sort processing unit.

【図１３】従来例の多次元集計処理方法を示す模式図
である。FIG. 13 is a schematic diagram showing a conventional multidimensional aggregation processing method.

【図１４】従来例の多次元集計処理方法を示す説明図
である。FIG. 14 is an explanatory diagram illustrating a conventional multidimensional aggregation processing method.

[Explanation of symbols]

１入力装置、２データ制御装置、３出力装置、４
外部記憶装置、５データベース（ＤＢ）専用処理装
置。1 input device, 2 data control device, 3 output device, 4
External storage device, 5 database (DB) dedicated processing device.

─────────────────────────────────────────────────────
────────────────────────────────────────────────── ───

【手続補正書】[Procedure amendment]

【提出日】平成９年１２月１０日[Submission date] December 10, 1997

【手続補正１】[Procedure amendment 1]

【補正対象書類名】図面[Document name to be amended] Drawing

【補正対象項目名】図１２[Correction target item name] FIG.

【補正方法】変更[Correction method] Change

【補正内容】[Correction contents]

【図１２】 FIG.

Claims

[Claims]

1. A multi-dimensional database system having a detailed level data file and a hierarchical structure information table holding a hierarchical structure of each dimension, a totaling unit of each dimension obtained by directly totaling the totaling data. 1. A multidimensional tabulation processing method, comprising: a step of tabulating one tabulated data; and a step of tabulating second tabulated data which is a tabulation unit of each dimension obtained by directly tabulating the first tabulated data.

2. The multidimensional aggregation processing method according to claim 1, wherein the first aggregation data or the second aggregation data comprising a plurality of aggregation units is aggregated simultaneously.

3. The multidimensional aggregation processing method according to claim 1, wherein the aggregation processing is performed by dividing the aggregation unit to be aggregated into two or more times at the same time.

4. A totaling result record having a totaling value of 0 is generated for each dimension, and break key information in ascending or descending order consisting of a number indicating a totaling result record corresponding to each dimension is stored in the totaling result record. 4. The multidimensional aggregation processing method according to claim 1, wherein the number is added to the total number.

5. The method according to claim 1, wherein when the same total unit can be totaled from any of the plural total units, one of the plural total units is exclusively monopolized and totaled. The multidimensional aggregation processing method according to any one of claims 1 to 4.

6. The multidimensional tabulation processing method according to claim 5, wherein a comparison result obtained from each of the plurality of tabulation units is compared and verified.

7. A system having a plurality of multidimensional databases each having a detailed level data file and a hierarchical structure information table holding a hierarchical structure of each dimension,
7. The multidimensional totaling method according to claim 1, wherein the first totalizing of the multidimensional database and the second totaling of the second multidimensional database are simultaneously performed.

8. A primary tabulation of the first multidimensional database is tabulated, and a primary tabulation of the second multidimensional database is tabulated simultaneously with the first step of the first multidimensional database. 7. The multidimensional totaling method according to claim 1, wherein the successive steps of the multidimensional database are sequentially and simultaneously processed.

9. A first tabulation of the first multidimensional database is tabulated, and a first tabulation of the second multidimensional database is tabulated simultaneously with the second step of the first multidimensional database. 7. The multidimensional totaling method according to claim 1, wherein the successive steps of the multidimensional database are sequentially and simultaneously processed.