JP3516365B2

JP3516365B2 - Method and apparatus for storing records in memory

Info

Publication number: JP3516365B2
Application number: JP13692495A
Authority: JP
Inventors: 清充日吉
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1995-06-02
Filing date: 1995-06-02
Publication date: 2004-04-05
Anticipated expiration: 2019-04-05
Also published as: JPH08328826A

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は情報処理の分野に係り、
入力された複数のレコードをメモリ内に読み込む方法と
その装置に関する。The present invention relates to the field of information processing,
The present invention relates to a method and apparatus for reading a plurality of input records into a memory.

【０００２】[0002]

【従来の技術】今日の情報処理においては、ソートマー
ジプログラムを始めとして、入力ファイル内の複数のレ
コードをメモリに読み込んで処理を行うプログラムが多
く用いられている。例えばソートマージプログラムによ
るソートマージ処理は、入力ファイル内のレコードを一
定の条件のもとでソートするために行われる。従来のソ
ートマージ処理等において、可変長レコードまたは不定
長レコードをメモリ内に読み込む時は、ユーザが入力フ
ァイル内のレコードの長さの上限（最大レコード長）を
指定し、処理装置は指定された値をもとに複数のレコー
ド格納領域（ＢＩＮ）を割り当てている。2. Description of the Related Art In today's information processing, many programs including a sort-merge program are used to read a plurality of records in an input file into a memory for processing. For example, sort merge processing by a sort merge program is performed in order to sort the records in the input file under a certain condition. When reading variable length records or indefinite length records into memory in conventional sort merge processing, etc., the user specifies the upper limit (maximum record length) of the record length in the input file, and the processing device is specified. A plurality of record storage areas (BIN) are allocated based on the value.

【０００３】図１３は、従来のソートマージプログラム
によるソートマージ処理を示している。図１３におい
て、まずユーザは入力ファイル２の最大レコード長１を
指定し、それをソートマージプログラム３に通知する。
もし、明確な最大レコード長が分からなければ、例えば
少し余裕のある大きめの値を最大レコード長１として指
定しておく。ソートマージプログラム３は、指定された
最大レコード長１の大きさのＢＩＮをメモリ４内にｎ個
割り当て、それらをレコード用領域として確保する。次
に、入力ファイル２の入力レコードを先頭から順に入力
バッファに読み込み、入力バッファからＢＩＮに入力レ
コードを移動して、ソートマージ処理を行う。FIG. 13 shows sort merge processing by a conventional sort merge program. In FIG. 13, the user first specifies the maximum record length 1 of the input file 2 and notifies the sort merge program 3 of it.
If the clear maximum record length is not known, for example, a large value with a little margin is designated as the maximum record length 1. The sort merge program 3 allocates n BINs each having the specified maximum record length 1 in the memory 4 and secures them as a record area. Next, the input records of the input file 2 are sequentially read into the input buffer from the beginning, the input records are moved from the input buffer to BIN, and the sort merge processing is performed.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら上述のよ
うな従来のソートマージ処理には次のような問題があ
る。However, the conventional sort merge processing as described above has the following problems.

【０００５】複数のＢＩＮをメモリ４内で効率よく割り
当てるためには、ユーザはソートマージプログラム３を
実行する前に、入力ファイル２内の最大レコード長を知
っている必要がある。もし、実際の最大レコード長より
著しく大きな値を指定した場合は、メモリ４内の割り当
て可能なＢＩＮの数が減少し、処理効率が低下する。逆
に、ユーザの指定した最大レコード長１より大きな入力
レコードが入力されると、ソートマージプログラム３は
処理を中断しなければならなくなる。In order to efficiently allocate a plurality of BINs in the memory 4, the user needs to know the maximum record length in the input file 2 before executing the sort merge program 3. If a value that is significantly larger than the actual maximum record length is specified, the number of allocable BINs in the memory 4 decreases and the processing efficiency decreases. On the contrary, when an input record larger than the maximum record length 1 specified by the user is input, the sort merge program 3 has to interrupt the processing.

【０００６】本発明は、ユーザが最大レコード長を指定
することなく、可変長レコードまたは不定長レコードを
効率良くメモリに格納するレコード格納方法およびその
装置を提供することを目的とする。It is an object of the present invention to provide a record storage method and apparatus for efficiently storing variable length records or indefinite length records in a memory without the user designating a maximum record length.

【０００７】[0007]

【課題を解決するための手段】図１は、本発明のレコー
ド格納方法の原理図である。図１において処理が開始さ
れると、まず１つの入力レコードを格納するための空き
のレコード格納領域がメモリ内にあるかどうかを調べ
（ステップＳ１）、そのような空きのレコード格納領域
がメモリ内にあれば、そこに上記入力レコードを格納す
る（ステップＳ３）。空きのレコード格納領域がメモリ
内になければ、現在のレコード格納領域をこれまでに入
力された最大のレコードの長さ以上の複数の領域に分割
して、空きのレコード格納領域を生成する（ステップＳ
２）。そして、生成された空きのレコード格納領域に上
記入力レコードを格納する（ステップＳ３）。次に、既
に格納された、上記入力レコードを含む複数の入力レコ
ードのうち、最大のレコードの長さを記憶して（ステッ
プＳ４）、処理を終了する。FIG. 1 is a principle diagram of a record storing method of the present invention. When the process is started in FIG. 1, it is first checked whether or not there is an empty record storage area for storing one input record in the memory (step S1), and such an empty record storage area is found in the memory. If so, the input record is stored there (step S3). If there is no free record storage area in the memory, the current record storage area is divided into a plurality of areas that are equal to or longer than the maximum record length input so far, and a free record storage area is generated (step S
2). Then, the input record is stored in the generated empty record storage area (step S3). Next, of the plurality of input records including the above-mentioned input record, which has already been stored, the maximum record length is stored (step S4), and the process ends.

【０００８】[0008]

【作用】入力ファイルの入力レコードを格納するための
レコード格納領域（ＢＩＮ）は、情報処理装置のメモリ
内に設けられ、これまでに入力された最大のレコードの
長さは最大入力レコード長として記憶されている。ま
ず、現在メモリ内に設定されているレコード格納領域が
空いているかどうかを調べ（ステップＳ１）、空きのレ
コード格納領域があれば、そこに新しい入力レコードを
格納する（ステップＳ３）。The record storage area (BIN) for storing the input record of the input file is provided in the memory of the information processing device, and the maximum record length input so far is stored as the maximum input record length. Has been done. First, it is checked whether or not the record storage area currently set in the memory is free (step S1), and if there is a free record storage area, a new input record is stored (step S3).

【０００９】もし、空きのレコード格納領域がメモリ内
になければ、既に入力レコードが格納されている現在の
レコード格納領域を、記憶されている最大入力レコード
長以上の長さの複数の領域に分割する（ステップＳ
２）。分割により生成される新しいレコード格納領域は
元のレコード格納領域よりも小さくなるが、これまでに
入力されたレコードの長さ以上であることは保証され
る。したがって、既存の入力レコードはすべて格納され
たままで、新たな空きのレコード格納領域が生成され、
新しい入力レコードを格納することができるようになる
（ステップＳ３）。こうして空きのレコード格納領域に
格納された新しい入力レコードを含む複数の入力レコー
ドのうち、最大のレコードの長さを改めて最大入力レコ
ード長として記憶する（ステップＳ４）このようなレコード格納方法によれば、メモリ内のレコ
ード格納領域の数と大きさを入力レコードに応じて変更
することができ、メモリの利用効率が高まる。また、最
大入力レコード長は自動的に決められるので、ユーザは
入力ファイル内のレコード長の上限を指定する必要がな
くなる。If there is no empty record storage area in the memory, the current record storage area in which the input record is already stored is divided into a plurality of areas having a length equal to or longer than the maximum input record length stored. Yes (Step S
2). The new record storage area generated by the division is smaller than the original record storage area, but it is guaranteed that it is longer than the length of the record input so far. Therefore, all existing input records are still stored and new empty record storage area is created.
A new input record can be stored (step S3). Of the plurality of input records including the new input record thus stored in the empty record storage area, the maximum record length is stored again as the maximum input record length (step S4). The number and size of the record storage areas in the memory can be changed according to the input record, and the memory utilization efficiency is improved. Further, since the maximum input record length is automatically determined, the user does not need to specify the upper limit of the record length in the input file.

【００１０】[0010]

【実施例】以下、図面を参照しながら、本発明の実施例
を詳細に説明する。図２は、実施例における情報処理装
置の構成図である。図２の情報処理装置は、入出力端末
１１、ＣＰＵ（中央処理装置）１２、外部記憶装置１
３、メモリ１４、およびこれらを結合するバス１５を備
える。入出力端末１１は、例えばキーボードやマウス等
の入力機器とディスプレイ装置等の出力装置を備えたユ
ーザ端末であり、レコードを処理するプログラムの起動
や処理結果の出力等に用いられる。外部記憶装置１３
は、例えば磁気ディスク装置や光ディスク装置等の記憶
装置であり、入力ファイルや出力ファイル等を格納す
る。メモリ１４は、ＣＰＵ１２により作業領域として使
用され、ここにレコード用領域等が設定される。Embodiments of the present invention will now be described in detail with reference to the drawings. FIG. 2 is a configuration diagram of the information processing apparatus in the embodiment. The information processing apparatus of FIG. 2 includes an input / output terminal 11, a CPU (central processing unit) 12, and an external storage device 1.
3, a memory 14, and a bus 15 coupling these. The input / output terminal 11 is a user terminal provided with an input device such as a keyboard and a mouse and an output device such as a display device, and is used for starting a program for processing a record and outputting a processing result. External storage device 13
Is a storage device such as a magnetic disk device or an optical disk device, and stores input files and output files. The memory 14 is used as a work area by the CPU 12, and a record area and the like are set therein.

【００１１】図３は、実施例におけるメモリ１４の内部
構成を示している。図３（ａ）はソート処理を行うとき
のメモリ１４を示しており、図３（ｂ）はマージ処理を
行うときのメモリ１４を示している。FIG. 3 shows the internal structure of the memory 14 in the embodiment. FIG. 3A shows the memory 14 when performing sort processing, and FIG. 3B shows the memory 14 when performing merge processing.

【００１２】図３（ａ）のメモリ１４内には、入力バッ
ファ１６、レコード用領域１７、作業バッファ１８、お
よびレコード長格納域１９が設けられる。入力バッファ
１６は入力ファイル２１のレコードをメモリ１４内に読
み込むためのバッファであり、作業バッファ１８は処理
の終わったレコードを作業ファイル２２へ出力するため
のバッファである。An input buffer 16, a record area 17, a work buffer 18, and a record length storage area 19 are provided in the memory 14 of FIG. 3A. The input buffer 16 is a buffer for reading the records of the input file 21 into the memory 14, and the work buffer 18 is a buffer for outputting the processed records to the work file 22.

【００１３】ＣＰＵ１２はプログラムを実行することに
より、外部記憶装置１３の入力ファイル２１に含まれる
入力レコードを入力バッファ１６に読み込み、複数のＢ
ＩＮ等から成るレコード用領域１７に移動してソート処
理を行う。そして、処理の中間結果を作業バッファ１８
に移し、一時的に作業ファイル２２として出力する。。The CPU 12 reads the input record contained in the input file 21 of the external storage device 13 into the input buffer 16 by executing the program, and a plurality of B's are read.
Sorting processing is performed by moving to the record area 17 composed of IN or the like. Then, the intermediate result of the processing is stored in the work buffer
And the work file 22 is temporarily output. .

【００１４】このとき、レコード長格納域１９には、レ
コード用領域１７内の最も大きなレコードのレコード長
（最大レコード長）が格納される。ＣＰＵ１２は、レコ
ード長格納域１９に格納された最大レコード長を参照し
て、現在のＢＩＮの長さ（ＢＩＮ長）を適当な整数ｎで
ｎ等分し、最大レコード長以上のｎ個の領域に分割す
る。そして、分割処理により生成されたｎ個の領域の各
々を改めてＢＩＮとして使用する。At this time, the record length storage area 19 stores the record length (maximum record length) of the largest record in the record area 17. The CPU 12 refers to the maximum record length stored in the record length storage area 19, divides the current BIN length (BIN length) into n equal parts by an appropriate integer n, and divides into n areas equal to or larger than the maximum record length. Split into. Then, each of the n areas generated by the division processing is used again as BIN.

【００１５】図３（ｂ）のメモリ１４内には、作業バッ
ファ１８、レコード用領域１７、および出力バッファ２
４が設けられる。作業バッファ１８は作業ファイル２２
をメモリ１４内に読み込むために使用され、出力バッフ
ァ２４はマージされたレコードを出力ファイル２３へ出
力するために使用される。入力ファイル２１に含まれる
入力レコードをすべて読み込むと、ＣＰＵ１２はメモリ
１４内でいくつかの作業ファイル２２のマージ処理等を
行い、出力ファイル２３として外部記憶装置１３に出力
する。In the memory 14 of FIG. 3B, a work buffer 18, a record area 17 and an output buffer 2 are provided.
4 are provided. The work buffer 18 is a work file 22.
Are read into the memory 14 and the output buffer 24 is used to output the merged records to the output file 23. When all the input records included in the input file 21 are read, the CPU 12 performs merge processing of some work files 22 in the memory 14 and outputs the work file 22 as an output file 23 to the external storage device 13.

【００１６】このように、本実施例では、プログラムが
レコード用領域１７内の最大レコード長をもとにして動
的にＢＩＮ長を変更するので、処理の進行状況に応じて
適当なＢＩＮ長が自動的に設定される。したがって、ユ
ーザはあらかじめ最大レコード長を入力する必要がな
い。As described above, in this embodiment, the program dynamically changes the BIN length based on the maximum record length in the record area 17, so that an appropriate BIN length can be set according to the progress of processing. It is set automatically. Therefore, the user does not need to input the maximum record length in advance.

【００１７】次に、図４から図１２までを参照しなが
ら、本発明のレコード格納方法を用いたソートマージ処
理について説明する。図４は、ソートマージ処理のフロ
ーチャートであり、図５から図９までは、処理の途中に
おけるメモリ１４内のレコード用領域１７の例を示して
いる。図４のソートマージ処理は、ＣＰＵ１２がソート
マージプログラムを実行することにより開始される。Next, the sort merge processing using the record storing method of the present invention will be described with reference to FIGS. FIG. 4 is a flowchart of the sort merge processing, and FIGS. 5 to 9 show an example of the record area 17 in the memory 14 during the processing. The sort merge processing of FIG. 4 is started by the CPU 12 executing the sort merge program.

【００１８】図４において処理が開始されると、ＣＰＵ
１２はまずメモリ１４内にレコード用領域１７を確保し
（ステップＳ１１）、ＢＩＮ長の初期設定を行う（ステ
ップＳ１２）。この初期設定により、ＢＩＮ長はレコー
ド用領域１７の領域長に設定される。このときのレコー
ド用領域１７内のＢＩＮの数（ＢＩＮ数）は１であり、
レコード用領域１７は図５のようになる。When the processing is started in FIG. 4, the CPU
First, the area 12 secures the record area 17 in the memory 14 (step S11) and initializes the BIN length (step S12). By this initial setting, the BIN length is set to the area length of the record area 17. At this time, the number of BINs (BIN number) in the record area 17 is 1,
The record area 17 is as shown in FIG.

【００１９】次に、入力ファイル２１からレコードを入
力バッファ１６に読み込み（ステップＳ１３）、入力フ
ァイル２１の最後（ＥＯＦ）かどうか、および処理対象
のレコードが入力バッファ１６内の最終のレコードかど
うかを判定する（ステップＳ１４、Ｓ１５）。ＥＯＦで
あればステップＳ２６以降の処理を行い、入力バッファ
１６の最終であればステップＳ１４以降の処理を繰り返
す。ＥＯＦでなく、入力バッファ１６の最終でもなけれ
ば、入力バッファ１６からレコード用領域１７に移動す
るレコードの長さを調べる（ステップＳ１６）。Next, a record is read from the input file 21 into the input buffer 16 (step S13), and it is determined whether the record at the end (EOF) of the input file 21 and whether the record to be processed is the last record in the input buffer 16. The determination is made (steps S14 and S15). If it is EOF, the processing from step S26 is performed, and if it is the last input buffer 16, the processing from step S14 is repeated. If it is neither the EOF nor the end of the input buffer 16, the length of the record moved from the input buffer 16 to the record area 17 is checked (step S16).

【００２０】移動するレコードの長さが現在のＢＩＮ長
よりも長ければソート処理を行い（ステップＳ２４）、
現在のＢＩＮ長以下であれば、そのレコードを入力バッ
ファ１６からレコード用領域１７内のＢＩＮに移動する
（ステップＳ１８）。そして、レコード用領域１７に格
納されたレコードの内で、最も大きなもののレコード長
をレコード長格納域１９に記憶する（ステップＳ１
９）。もし、直前に移動したレコードの長さが、既にレ
コード長格納域１９に記憶されていた最大入力レコード
長よりも長ければ、そのレコードのレコード長が新しく
最大入力レコード長として記憶される。逆に、移動した
レコードの長さの方が短ければ、レコード長格納域１９
内の最大入力レコード長は更新されない。If the length of the record to be moved is longer than the current BIN length, sort processing is performed (step S24),
If it is not more than the current BIN length, the record is moved from the input buffer 16 to the BIN in the record area 17 (step S18). Then, the record length of the largest record among the records stored in the record area 17 is stored in the record length storage area 19 (step S1).
9). If the length of the record moved immediately before is longer than the maximum input record length already stored in the record length storage area 19, the record length of the record is newly stored as the maximum input record length. On the contrary, if the length of the moved record is shorter, the record length storage area 19
The maximum input record length in is not updated.

【００２１】図５の状態で入力バッファ１６からレコー
ドを移動した場合は、図６に示すように、そのレコード
はレコード用領域１７の先頭に先頭レコードとして格納
され（ステップＳ１８）、そのレコード長が最大入力レ
コード長として、レコード長格納域１９に格納される
（ステップＳ１９）。When a record is moved from the input buffer 16 in the state of FIG. 5, the record is stored as the first record at the beginning of the record area 17 as shown in FIG. 6 (step S18), and the record length is changed. The maximum input record length is stored in the record length storage area 19 (step S19).

【００２２】次に、レコード用領域１７内に空きのＢＩ
Ｎがあるかどうかを調べ（ステップＳ２０）、空きのＢ
ＩＮがあればステップＳ１４以降の処理を繰り返す。も
し、空きのＢＩＮがなければ、レコード長格納域１９内
の最大入力レコード長のｎ倍の長さと現在のＢＩＮ長と
を比較する（ステップＳ２１）。現在のＢＩＮ長が最大
入力レコード長のｎ倍よりも小さければ、ステップＳ２
４以降の処理を行う。また、現在のＢＩＮ長が最大入力
レコード長のｎ倍以上であれば、現在のＢＩＮ長をｎ等
分して（ステップＳ２２）、空きのＢＩＮを確保し（ス
テップＳ２３）、ステップＳ１４以降の処理を繰り返
す。ここで、ｎは適当な分割数であり、あらかじめプロ
グラムに記述されている。Next, an empty BI in the record area 17
It is checked whether or not N is present (step S20), and empty B
If IN is present, the processing from step S14 is repeated. If there is no empty BIN, the length n times the maximum input record length in the record length storage area 19 is compared with the current BIN length (step S21). If the current BIN length is smaller than n times the maximum input record length, step S2
The processing after 4 is performed. If the current BIN length is n times the maximum input record length or more, the current BIN length is divided into n equal parts (step S22), a free BIN is secured (step S23), and the processes after step S14. repeat. Here, n is an appropriate number of divisions and is described in the program in advance.

【００２３】例えば、図６においては空きのＢＩＮがな
いので（ステップＳ２０、ＮＯ）、最大入力レコード長
の２倍（ｎ＝２）とＢＩＮ長とを比較する。現在のＢＩ
Ｎ長はレコード用領域１７の領域長に等しく、最大入力
レコード長の２倍より大きいので（ステップＳ２１、Ｙ
ＥＳ）、ＢＩＮを２等分する（ステップＳ２２）。この
結果、レコード用領域１７は図７のようになり、空きの
ＢＩＮが生成される（ステップＳ２３）。このときのＢ
ＩＮ数は２であり、ＢＩＮ長はレコード用領域１７の領
域長の１／２である。For example, in FIG. 6, since there is no empty BIN (step S20, NO), twice the maximum input record length (n = 2) is compared with the BIN length. Current BI
Since the N length is equal to the area length of the record area 17 and is larger than twice the maximum input record length (step S21, Y
ES) and BIN are equally divided into two (step S22). As a result, the record area 17 becomes as shown in FIG. 7, and an empty BIN is generated (step S23). B at this time
The number of INs is 2, and the BIN length is 1/2 of the area length of the record area 17.

【００２４】そして、先頭レコードに続く次のレコード
は、図８に示すように、生成された空きのＢＩＮに格納
される（ステップＳ１８）。このとき、最大入力レコー
ド長は、先頭レコードと次のレコードのうち長い方のレ
コード長となる（ステップＳ１９）。この後も同様にし
て、２つのＢＩＮはさらに２等分され、図９に示すよう
に、２つの新しい空きのＢＩＮが生成される。このとき
のＢＩＮ数は４になり、ＢＩＮ長はレコード用領域１７
の領域長の１／４になる。Then, the next record following the first record is stored in the generated empty BIN as shown in FIG. 8 (step S18). At this time, the maximum input record length is the longer record length of the first record and the next record (step S19). After this, similarly, the two BINs are further divided into two, and two new empty BINs are generated as shown in FIG. At this time, the number of BIN is 4, and the BIN length is the record area 17
It becomes 1/4 of the area length.

【００２５】このような処理は、ステップＳ１７でＢＩ
Ｎ長がレコードの長さよりも短くなるまで、あるいは、
ステップＳ２１でＢＩＮ長が最大入力レコード長の２倍
よりも小さくなるまで続けられる。ここでは、ＢＩＮ長
の分割数をｎ＝２として処理を行っているが、一般には
任意の分割数を用いることができ、ｎの値を処理の途中
で変更することもできる。Such processing is performed in step S17 in BI.
Until the N length is shorter than the record length, or
This is continued until the BIN length becomes smaller than twice the maximum input record length in step S21. Here, the processing is performed with the number of divisions of the BIN length set to n = 2, but generally any number of divisions can be used, and the value of n can be changed during the processing.

【００２６】次に、ステップＳ２４で行う処理について
説明する。ステップＳ２４では、レコード用領域１７内
の複数のレコードについて一旦ソートを行い、作業バッ
ファ１８に移動させる。ここでは、ソート方法として、
トーナメント法によるソート処理方式（特願平２−２７
９９５３、特開平４−１５３８２６）またはＱＵＩＣＫ
（クイック）ソートを用いる。Next, the processing performed in step S24 will be described. In step S24, the plurality of records in the record area 17 are once sorted and moved to the work buffer 18. Here, as a sorting method,
Sorting method by tournament method (Japanese Patent Application No. 2-27)
9953, JP-A-4-153826) or QUICK
Use (quick) sort.

【００２７】図１０は、トーナメント法によるソート処
理のフローチャートである。トーナメント法とは、２つ
のレコードを１組にした勝ち抜き戦を繰り返し、最も強
いレコードを選出する方法である。図１０において処理
が開始されると、ＣＰＵ１２はまずレコード用領域１７
内の複数のレコードについて初期トーナメントを行い、
優勝レコードを決定する（ステップＳ３１）。そして、
その優勝レコードを作業バッファ１８へ移動する（ステ
ップＳ３２）。FIG. 10 is a flowchart of the sorting process by the tournament method. The tournament method is a method of selecting the strongest record by repeating winning battles in which two records form a set. When the process is started in FIG. 10, the CPU 12 first determines the record area 17
Initial tournaments for multiple records in
The winning record is determined (step S31). And
The winning record is moved to the work buffer 18 (step S32).

【００２８】次に、入力バッファ１６に次の入力レコー
ドがあるかどうかを調べ（ステップＳ３３）、次の入力
レコードがあれば、そのレコード長とＢＩＮ長を比較す
る（ステップＳ３４）。ここで、次の入力レコードのレ
コード長がＢＩＮ長以下であれば、それが作業バッファ
１８へ移された優勝レコードより弱いかどうかを判定す
る（ステップＳ３５）。次の入力レコードが優勝レコー
ドより弱ければ、それを優勝レコードの格納されていた
ＢＩＮに格納し、再びトーナメントを行って次の優勝レ
コードを選出する（ステップＳ３６）。そして、ステッ
プＳ３２以降の処理を繰り返す。Next, it is checked whether or not there is a next input record in the input buffer 16 (step S33), and if there is a next input record, the record length is compared with the BIN length (step S34). If the record length of the next input record is equal to or less than the BIN length, it is determined whether or not it is weaker than the winning record transferred to the work buffer 18 (step S35). If the next input record is weaker than the winning record, it is stored in the BIN where the winning record was stored, the tournament is performed again, and the next winning record is selected (step S36). Then, the processing from step S32 is repeated.

【００２９】図１１は、レコード用領域１７内の４つの
レコードに対するトーナメントソートの例を示してい
る。例えば、図９に示す４つのＢＩＮにそれぞれレコー
ドＡ、Ｂ、Ｃ、Ｄが格納された後に、図１０の処理が開
始されたとする。このとき、レコードＢよりレコードＡ
が強く、レコードＤよりレコードＣが強く、またレコー
ドＣよりレコードＡが強いものとすると、初期トーナメ
ントによりレコードＡが優勝する（ステップＳ３１）。
そこで、レコードＡは作業バッファ１８に移され（ステ
ップＳ３２）、次の入力レコードＥがレコードＡの格納
されていたＢＩＮに移されて、レコードＥ、Ｂ、Ｃ、Ｄ
の間でトーナメントが行われる（ステップＳ３６）。FIG. 11 shows an example of tournament sort for four records in the record area 17. For example, it is assumed that the process of FIG. 10 is started after the records A, B, C, and D are stored in the four BINs shown in FIG. 9, respectively. At this time, from record B to record A
, Record C is stronger than record D, and record A is stronger than record C, record A wins the initial tournament (step S31).
Therefore, the record A is moved to the work buffer 18 (step S32), the next input record E is moved to the BIN in which the record A was stored, and the records E, B, C, D are stored.
A tournament is held between (step S36).

【００３０】このような処理が繰り返され、ステップＳ
３３で次の入力レコードがなくなった時、またはステッ
プＳ３４でレコード長がＢＩＮ長を超えた時、あるいは
ステップＳ３５で次の入力レコードが優勝レコードより
強い時、レコード用領域１７内の残りのレコードをソー
トして作業バッファ１８へ出力し（ステップＳ３７）、
処理を終了する。Such processing is repeated, and step S
When the next input record disappears in 33, or the record length exceeds the BIN length in step S34, or when the next input record is stronger than the winning record in step S35, the remaining records in the record area 17 are deleted. Sort and output to the work buffer 18 (step S37),
The process ends.

【００３１】例えば、図１１において、次の入力レコー
ドＥがなかったときは（ステップＳ３３、ＮＯ）、残り
のレコードＢ、Ｃ、Ｄがソートされて、作業バッファ１
８へ移される（ステップＳ３７）。For example, in FIG. 11, when there is no next input record E (step S33, NO), the remaining records B, C and D are sorted and the work buffer 1
8 (step S37).

【００３２】トーナメント法は、メモリ１４内のレコー
ドから１件の優勝レコードを選出した後、次の入力レコ
ードを優勝レコードがあったＢＩＮに格納することで、
初期入力時に格納したレコード数以上のレコードを１つ
のストリングとして出力できる点で、他のソート技法よ
り優れている。In the tournament method, one winning record is selected from the records in the memory 14, and the next input record is stored in the BIN where the winning record existed.
It is superior to other sorting techniques in that records more than the number of records stored at initial input can be output as one string.

【００３３】図１２は、もう一つのソート方法であるＱ
ＵＩＣＫ法によるソート処理のフローチャートである。
図１２において処理が開始されると、ＣＰＵ１２はまず
ＱＵＩＣＫソートにより、レコード用領域１７内のレコ
ードをすべて並べ替える（ステップＳ４１）。次に、ソ
ートされた全レコードを作業バッファ１８へ移動させて
（ステップＳ４２）、処理を終了する。FIG. 12 shows another sorting method Q.
It is a flowchart of the sorting process by the UICK method.
When the process is started in FIG. 12, the CPU 12 first sorts all the records in the record area 17 by QUICK sort (step S41). Next, all the sorted records are moved to the work buffer 18 (step S42), and the process ends.

【００３４】ＱＵＩＣＫ法では、処理中に入力レコード
を追加することがないため、一つのストリングに格納で
きるレコード数はトーナメントソートより少なくなる
が、ストリングの構造は簡単になる。In the QUICK method, since no input record is added during processing, the number of records that can be stored in one string is smaller than that in tournament sort, but the string structure is simple.

【００３５】こうしてステップＳ２４のソート処理が終
わると、ＣＰＵ１２は作業バッファ１８の内容を作業フ
ァイル２２に出力し（ステップＳ２５）、再び初期設定
を行って（ステップＳ２６）、ステップＳ１４以降の処
理を繰り返す。そして、ステップＳ２７でＥＯＦになる
と、次に作業ファイル２２を使用しているかどうかを調
べる（ステップＳ２７）。When the sort process of step S24 is completed in this way, the CPU 12 outputs the contents of the work buffer 18 to the work file 22 (step S25), initializes again (step S26), and repeats the processes of step S14 and thereafter. . When the EOF is reached in step S27, it is checked whether or not the work file 22 is used next (step S27).

【００３６】もし、一度もステップＳ２４の処理を行っ
ておらず、作業ファイル２２を使用していなければ、そ
のままレコード用領域１７内のレコードをソートして
（ステップＳ２８）、出力ファイル２３に出力し（ステ
ップＳ３０）、処理を終了する。しかし、作業ファイル
２２を使用している場合は、レコード用領域１７内のレ
コードと作業ファイル２２内のレコードのソートマージ
を行う（ステップＳ２９）。そして、マージ結果を出力
ファイル２３に出力し（ステップＳ３０）、処理を終了
する。If the processing of step S24 has never been performed and the work file 22 is not used, the records in the record area 17 are sorted (step S28) and output to the output file 23. (Step S30), the process ends. However, when the work file 22 is used, the records in the record area 17 and the records in the work file 22 are sorted and merged (step S29). Then, the merged result is output to the output file 23 (step S30), and the process ends.

【００３７】例えば、従来のレコード格納方法では、レ
コード用領域１７が１ＭＢ（メガバイト）であった場
合、ユーザが指定した最大レコード長が１ＫＢ（キロバ
イト）なら、レコード用領域１７に格納できるレコード
件数は常に１０００件（１ＭＢ／１ＫＢ）である。For example, in the conventional record storage method, when the record area 17 is 1 MB (megabytes) and the maximum record length specified by the user is 1 KB (kilobytes), the number of records that can be stored in the record area 17 is It is always 1000 (1MB / 1KB).

【００３８】しかし、本実施例では、実際の入力レコー
ドのレコード長の平均値により入力できる件数は異なっ
てくる。つまり、入力レコードの長さにばらつきがある
場合には、比較的小さなレコードは一度に多数入力し、
著しく大きなレコードについてのみ入力数が減少する。
例えば、平均的なレコード長が０．１ＫＢなら１０００
０件、０．５ＫＢなら２０００件のレコードを格納する
ことができ、最も効率の悪い１ＫＢの場合でも１０００
件のレコードが格納されることになる。この結果、メモ
リ１４内に読み込める入力レコード数が増加し、一度に
ソートできるデータ量が各段に増える。したがって、ソ
ートマージ処理の処理時間が短縮される。However, in this embodiment, the number of records that can be input differs depending on the average value of the record lengths of the actual input records. In other words, if there are variations in the length of the input records, enter a relatively small number of records at once,
The number of inputs is reduced only for records that are significantly larger.
For example, if the average record length is 0.1 KB, 1000
2000 records can be stored for 0 or 0.5 KB, and 1000 records can be stored even for the worst 1 KB.
Records will be stored. As a result, the number of input records that can be read in the memory 14 increases and the amount of data that can be sorted at one time increases. Therefore, the processing time of the sort merge processing is shortened.

【００３９】[0039]

【発明の効果】本発明によれば、可変長レコードまたは
不定長レコードを情報処理装置のメモリ内に読み込む処
理において、ユーザが入力レコードの最大値を指定する
必要がなくなる。また、メモリの利用効率が高まり、読
み込んだレコードの処理速度が向上する。According to the present invention, it is not necessary for the user to specify the maximum value of the input record in the process of reading the variable length record or the indefinite length record into the memory of the information processing apparatus. In addition, the memory utilization efficiency is improved, and the processing speed of the read record is improved.

[Brief description of drawings]

【図１】本発明の原理図である。FIG. 1 is a principle diagram of the present invention.

【図２】実施例の情報処理装置の構成図である。FIG. 2 is a configuration diagram of an information processing apparatus according to an embodiment.

【図３】実施例におけるメモリの内部構成を示す図であ
る。FIG. 3 is a diagram showing an internal configuration of a memory in the example.

【図４】実施例におけるソートマージ処理のフローチャ
ートである。FIG. 4 is a flowchart of sort merge processing according to the embodiment.

【図５】ＢＩＮ数１のレコード用領域を示す図である。FIG. 5 is a diagram showing a record area with a BIN number of 1;

【図６】先頭レコード格納後のレコード用領域を示す図
である。FIG. 6 is a diagram showing a record area after the first record is stored.

【図７】ＢＩＮ数２のレコード用領域を示す図である。FIG. 7 is a diagram showing a record area with a BIN number of 2;

【図８】次のレコード格納後のレコード用領域を示す図
である。FIG. 8 is a diagram showing a record area after the next record is stored.

【図９】ＢＩＮ数４のレコード用領域を示す図である。FIG. 9 is a diagram showing a record area with a BIN number of 4;

【図１０】実施例におけるトーナメント処理のフローチ
ャートである。FIG. 10 is a flowchart of tournament processing according to the embodiment.

【図１１】トーナメントソートの例を示す図である。FIG. 11 is a diagram showing an example of tournament sort.

【図１２】実施例におけるクイック処理のフローチャー
トである。FIG. 12 is a flowchart of a quick process according to the embodiment.

【図１３】従来のソートマージ処理を示す図である。FIG. 13 is a diagram showing a conventional sort merge process.

[Explanation of symbols]

１最大レコード長２、２１入力ファイル３ソートマージプログラム４、１４メモリ１１入出力端末１２ＣＰＵ１３外部記憶装置１６入力バッファ１７レコード用領域１８作業バッファ１９レコード長格納域２２作業ファイル２３出力ファイル２４出力バッファ 1 maximum record length 2,21 Input file 3 sort merge program 4,14 memory 11 Input / output terminals 12 CPU 13 External storage device 16 input buffers 17 record area 18 working buffer 19 record length storage area 22 working files 23 Output file 24 output buffer

Claims

(57) [Claims]

1. An information processing apparatus for reading a record into a memory for processing, checks whether or not a free record storage area for storing one input record exists in the memory, and determines the empty record storage area. If it is in the memory, the input record is stored in the empty record storage area, and if the empty record storage area is not in the memory, the current record storage area is stored as the largest record input so far. It is characterized in that the input record is divided into a plurality of areas having a length equal to or larger than the length, the input record is stored in a generated empty record storage area, and the maximum record length among the plurality of stored input records is stored. How to store records.

2. When there is one or more generated empty record storage areas, one or more input records including the input record are stored in one or more empty record storage areas, and the one or more When the input records are stored in all of the empty record storage areas, the record storage area is further divided into a plurality of areas to generate an empty record storage area. The record storage method described in 1.

3. A sorting method comprising: sorting a plurality of input records respectively stored in a plurality of record storage areas by the record storing method according to claim 1 and outputting the sorted records to a file.

4. A tournament in which the plurality of input records are sorted by tournament sort, the determined winning record is output to the file, and the next input record is stored in the record storage area where the winning record was stored. The sorting method according to claim 3, wherein the sorting is repeated.

5. When the length of the input record exceeds the length of the empty record storage area, a plurality of records input so far are sorted by tournament sort, output to a work file, and newly input. 4. The sorting method according to claim 3, wherein when there are no more input records, the contents of the record storage area and the contents of the work file are merged and output to the output file.

6. When the current record storage area cannot be divided into a plurality of areas that are longer than the maximum record length that has been input so far, the plurality of records that have been input so far are sorted by tournament sort. If you output to a work file and there are no new input records,
4. The contents of the record storage area and the contents of the work file are merged and output to an output file.
The sort method described.

7. The sorting method according to claim 3, wherein the plurality of input records are sorted by quick sort and output to the file.

8. When the length of the input record exceeds the length of the empty record storage area, a plurality of records input so far are sorted by a quick sort, output to a work file, and newly input. 4. The sorting method according to claim 3, wherein when there are no more input records, the contents of the record storage area and the contents of the work file are merged and output to the output file.

9. When the current record storage area cannot be divided into a plurality of areas that are longer than the maximum record length that has been input so far, the plurality of records that have been input so far are sorted by a quick sort. 4. The sorting method according to claim 3, wherein when a new input record is output to the work file, the contents of the record storage area and the work file are merged and output to the output file.

10. An information processing apparatus for reading a record into a memory for processing, a means for storing the maximum record length input so far as the maximum input record length, and one input record Means for checking whether or not there is a free record storage area in the memory, and if the free record storage area is not in the memory, the current record storage area is divided into a plurality of records having a length equal to or larger than the maximum input record length. And a means for generating an empty record storage area and a means for storing the input record in the empty record storage area.

11. The apparatus according to claim 1, further comprising means for sorting a plurality of input records respectively stored in the plurality of record storage areas and outputting the sorted records to a file.
0 record storage device.