JPH0728702A

JPH0728702A - Program converting method

Info

Publication number: JPH0728702A
Application number: JP5174375A
Authority: JP
Inventors: Ichiro Kushima; 伊知郎久島; Masahiro Uminaga; 正博海永
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1993-07-14
Filing date: 1993-07-14
Publication date: 1995-01-31

Abstract

PURPOSE:To convert a program in order to decrease the cache misses that are caused by the conflict of cache lines of a cache memory of a set associative system. CONSTITUTION:A part of a program that is repetitively carried out is specified in a step 101, and an array accessed in a loop is checked together with the access position in the array in a step 102. In a step 103, the array access position (subscript expression) changes at the same rate in the loop and the arrays are stored so that the arrays of the same reference and array types are put together. The arrays collected in the same group are changed into a single array in a step 104. In a step 105, the reference to the element of an unstructured array is changed to the reference to the element of a structured array.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、プログラムの変換もし
くはプログラムのコンパイルの技術に関し、特に、プロ
グラムを実行する計算機のキャッシュメモリのキャッシ
ュミスを低減できるようにプログラムを変換する技術に
関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a technique for converting a program or compiling a program, and more particularly to a technique for converting a program so as to reduce cache misses in a cache memory of a computer that executes the program.

【０００２】[0002]

【従来の技術】現在の多くの計算機システムでは、高速
に動作するＣＰＵとアクセス速度の遅い主メモリの速度
のギャップを埋めるために、キャッシュメモリと呼ばれ
るアクセス速度の速い小容量の記憶装置が、主メモリ
（主記憶）とＣＰＵとの間の中間バッファとし備えられ
ることが多い。このようなキャッシュメモリはＣＰＵが
高速にアクセス可能であるので、プログラム中の処理で
参照する多くのデータをキャッシュメモリ上にあるデー
タを参照する処理は、非常に高速に行うことができる。2. Description of the Related Art In many current computer systems, a small-capacity storage device having a high access speed called a cache memory is mainly used in order to fill a speed gap between a CPU operating at high speed and a main memory having a low access speed. It is often provided as an intermediate buffer between the memory (main memory) and the CPU. Since such a cache memory can be accessed by the CPU at high speed, the process of referring to the data in the cache memory that is referred to in the process in the program can be performed very quickly.

【０００３】さて、一般にプログラムがデータを参照す
る場合、次のような性質を持つことが多い。Generally, when a program refers to data, it often has the following properties.

【０００４】１．ある一定のアドレスのデータに対する
アクセスは比較的短い時間内に再発する。1. Access to data at a certain address will occur again within a relatively short time.

【０００５】２．ある一定時間内にアクセスされるデー
タは比較的近いアドレスに分布する。2. Data accessed within a certain period of time is distributed to relatively close addresses.

【０００６】前者は「時間的局所性」、後者は「空間的
局所性」と呼ばれる。キャッシュメモリを含む計算機シ
ステムはこれらの性質を利用するよう、次のように設計
されている。The former is called "temporal locality" and the latter is called "spatial locality". A computer system including a cache memory is designed as follows to utilize these properties.

【０００７】まず時間的局所性により、一度アクセスさ
れたアドレスは近い将来再びアクセスされる可能性が高
い。そこで、通常のキャッシュメモリでは、ＣＰＵが参
照するデータがキャッシュメモリ上にない場合は、それ
を必ずキャッシュメモリにフェッチして、将来の再参照
に備えるようにしている。ここでデータのキャッシュメ
モリへのフェッチはＣＰＵによって自動的に行われの
で、プログラムはそのような命令を明示的に出す必要は
ない。First, due to temporal locality, an address that has been accessed once is likely to be accessed again in the near future. Therefore, in a normal cache memory, if the data referred to by the CPU is not in the cache memory, it is always fetched in the cache memory to prepare for future re-reference. Since the CPU fetches the data to the cache memory automatically, the program does not need to explicitly issue such an instruction.

【０００８】また、空間的局所性により、あるアドレス
がアクセスされたら、近い将来その近くのアドレスもア
クセスされる可能性が高い。そこで、あるデータを主メ
モリからキャッシュメモリにフェッチする場合は、その
データだけでなく、メモリの記憶空間を一定長（十数バ
イトから百バイト程度）の単位に分割したメモリブロッ
クごとキャッシュメモリにフェッチするようにしてい
る。Further, due to the spatial locality, when an address is accessed, there is a high possibility that addresses near the address will be accessed in the near future. Therefore, when fetching certain data from the main memory to the cache memory, not only that data but also the memory blocks obtained by dividing the memory storage space into units of a fixed length (about 10 bytes to 100 bytes) are fetched into the cache memory. I am trying to do it.

【０００９】このようにキャッシュメモリはプログラム
の一般的性質を利用して設計されているので個々のプロ
グラムは特にキャッシュメモリの存在を意識しないで記
述されていても、結果的にキャッシュメモリ装置を有効
利用できることが多い。As described above, since the cache memory is designed by utilizing the general characteristics of the program, even if each program is written without paying attention to the existence of the cache memory, the cache memory device can be effectively used as a result. Often available.

【００１０】なお、一般のキャッシュメモリ技術に対す
る解説としては、情報処理、ボリューム３３、ナンバー
１１（１９９２）、第１３４８頁から１３５７頁に記載
がある。A description of general cache memory technology is given in Information Processing, Volume 33, Number 11 (1992), pages 1348 to 1357.

【００１１】さて、ここで簡単にキャッシュメモリのア
ドレス・マッピングについて説明する。Now, address mapping of the cache memory will be briefly described.

【００１２】キャッシュメモリの各エントリにはメモリ
ブロックが入るが、どのメモリブロックを、どのエント
リに入れるかという点に関しては、いくつかの方式があ
る。A memory block is placed in each entry of the cache memory, but there are some methods regarding which memory block is placed in which entry.

【００１３】このメモリブロックとキャッシュメモリの
エントリの対応付けをアドレス・マッピングと呼ぶ。こ
の方式は大きく分けて３つある。The association between the memory block and the cache memory entry is called address mapping. This method is roughly divided into three types.

【００１４】ダイレクト・マッピングと呼ばれる方式
は、メモリブロックのアドレスによって、入るべきエン
トリを一意に決める方法である。具体的には、アドレス
をキャッシュメモリのサイズで割った余りを、そのメモ
リブロックを入れるべきエントリの番号とする。キャッ
シュメモリのサイズは通常２のべき乗であるので、たと
えばそれが２＾Ｎであった場合には、アドレスの下位Ｎ
ビットにより入るべきエントリが決まる。この方法の場
合、２つのデータのアドレスの下位Ｎビットが偶然一致
していた場合、入るべきエントリが同じになるので、そ
の２つのデータは同時にはキャッシュメモリ上に存在で
きない。これをキャッシュメモリラインの競合と呼ぶ。The method called direct mapping is a method for uniquely determining an entry to be entered by the address of the memory block. Specifically, the remainder obtained by dividing the address by the size of the cache memory is used as the entry number in which the memory block should be inserted. Since the size of the cache memory is usually a power of 2, if it is 2 ^ N, the lower N addresses
The bit determines the entry to enter. In the case of this method, if the lower N bits of the addresses of the two data happen to coincide, the entries to be entered are the same, so the two data cannot exist in the cache memory at the same time. This is called cache memory line competition.

【００１５】セット・アソシアティブと呼ばれる方式
は、ダイレクトマップと同様にメモリアドレスの下位Ｎ
ビットよって入るエントリを決めるが、入るべきエント
リの候補が複数ある。この候補の数をセット数と呼ぶ。
たとえばセット数が２であれば、２つのデータのアドレ
スの下位Ｎビットが一致していても、（２つまでは）同
時にキャッシュメモリ上に存在できる。セット数は通常
２または４であることが多い。この方式でもやはりキャ
ッシュメモリラインの競合がおきる。The method called set associative is similar to the direct map in that the lower N addresses of memory are
The entry to be entered is determined by the bit, but there are multiple entry candidates that should be entered. The number of candidates is called the number of sets.
For example, if the number of sets is two, even if the lower N bits of the addresses of the two data match, they can simultaneously exist (up to two) in the cache memory. Usually, the number of sets is usually 2 or 4. Even in this method, competition of cache memory lines still occurs.

【００１６】完全・アソシアティブと呼ばれる方式は、
任意の空いているエントリにメモリブロックを入れられ
る方式である。この方式ではキャッシュメモリラインの
競合は起きないが、ハードウェア的に高価・低速である
ので実際には用いられることが少ない。The method called perfect associative is
This is a method in which a memory block can be put in any vacant entry. In this method, competition of cache memory lines does not occur, but it is rarely used in practice because it is expensive and slow in terms of hardware.

【００１７】[0017]

【発明が解決しようとする課題】しかし、あるデータを
主メモリからキャッシュメモリにフェッチする場合にメ
モリブロックごとキャッシュメモリにフェッチするのみ
では、キャッシュメモリを有効利用することのできない
プログラムがある。具体的には、プログラムの、繰り返
し実行される部分（ル−プ）でアクセスされる複数のデ
ータが、キャッシュメモリラインを競合する場合が、こ
れに当たる。However, when a certain data is fetched from the main memory to the cache memory, there is a program in which the cache memory cannot be effectively used only by fetching the entire memory block into the cache memory. Specifically, this corresponds to a case where a plurality of data accessed in a repeatedly executed part (loop) of a program compete for a cache memory line.

【００１８】いま、一例として図４に示すプログラムを
考えてみる。なお、ここでは、キャッシュメモリはセッ
ト数２のアソシアティブキャッシュメモリ、キャッシュ
メモリ１エントリは３２バイト（８語）、エントリ数は
１２８の場合を想定する（したがってサイズは２＊３２
＊１２８＝８１９２バイト）。またキャッシュメモリ置
換方式はＬＲＵ（最後にアクセスされた時刻が最も古い
エントリをキャッシュメモリから追い出す方式）とす
る。As an example, consider the program shown in FIG. It is assumed here that the cache memory is an associative cache memory with a set number of 2, the cache memory 1 entry is 32 bytes (8 words), and the number of entries is 128 (therefore, the size is 2 * 32).
* 128 = 8192 bytes). The cache memory replacement method is LRU (a method of expelling from the cache memory the entry with the oldest access time).

【００１９】さて、図４に示したプログラムは、Ｃとし
て知られているプログラミング言語を用いて記述したプ
ログラム例である。この例では、まず、４０１〜４０４
の定義文で、４つの配列ａ，ｂ，ｃ，ｄの要素の型ｉｎ
ｔ（整数型）と要素数２０４８が定義されている。The program shown in FIG. 4 is an example of a program written in a programming language known as C. In this example, first,
Definition statement of four array types a, b, c, d
t (integer type) and the number of elements 2048 are defined.

【００２０】そして、処理ｍａｉｎ４０５中の４０７で
ｉ，ｊの型ｉｎｔ（整数型）が定義された後、４０８で
ｉに関するル−プが、４０９でｊに関するループが宣言
されている。そして、ｉに関するル−プ４０９〜４１１
内のｊに関するループ４１０、４１１内で４つの配列要
素ａ［ｊ］，ｂ［ｊ］，ｃ［ｊ］，ｄ［ｊ］が参照さ
れ、ａ［ｊ］の値がｂ［ｊ］に代入され４１０、ｃ
［ｊ］の値がｄ［ｊ］に代入される４１１。After the type int (integer type) of i, j is defined at 407 in the processing main 405, a loop for i is declared at 408 and a loop for j is declared at 409. And loops 409-411 for i
The four array elements a [j], b [j], c [j], d [j] are referred to in the loops 410 and 411 for j in j, and the value of a [j] is assigned to b [j]. 410, c
The value of [j] is substituted into d [j] 411.

【００２１】ここで、これらの４つの要素はキャッシュ
メモリラインを競合する。なぜなら４つの配列の大きさ
はすべて８１９２（＝２０４８＊４）バイトであり、キ
ャッシュメモリの大きさと同じであるので、ａ［ｊ］，
ｂ［ｊ］，ｄ［ｊ］，ｃ［ｊ］のアドレスを８１９２で
割った余りはかならず等しくなるからである。Here, these four elements compete for a cache memory line. Because the sizes of all four arrays are 8192 (= 2048 * 4) bytes, which is the same as the size of the cache memory, a [j],
This is because the remainders obtained by dividing the addresses of b [j], d [j], and c [j] by 8192 will always be the same.

【００２２】いま、配列要素ａ［Ｊ］を参照した時点で
（Ｊはｊのある特定の値）、キャッシュミスが発生した
とする。すると、ａ［Ｊ］を含むブロック（ａ［Ｊ．．
Ｊ＋７］）がキャッシュメモリにフェッチされる。次
に、配列要素ｂ［Ｊ］を参照した時点で、キャッシュミ
スが発生するとｂ［Ｊ］を含むブロック（ｂ［Ｊ．．Ｊ
＋７］）がキャッシュメモリにフェッチされる。このと
き、これら２つのブロックはキャッシュメモリラインを
競合するが、キャッシュメモリのセット数が２であるの
で、ａ［Ｊ．．Ｊ＋７］とｂ［Ｊ．．Ｊ＋７］は同時に
キャッシュメモリ上に存在できる。It is assumed that a cache miss has occurred at the time when the array element a [J] is referenced (J is a certain value of j). Then, a block (a [J ..
J + 7]) is fetched into the cache memory. Next, when a cache miss occurs at the time when the array element b [J] is referred to, a block (b [J..J
+7]) is fetched into the cache memory. At this time, these two blocks compete for the cache memory line, but since the number of sets of the cache memory is 2, a [J. ． J + 7] and b [J. ． J + 7] can simultaneously exist in the cache memory.

【００２３】しかし、この場合、次に配列要素ｃ［Ｊ］
を参照した時点で、キャッシュミスが発生する。そし
て、ｃ［Ｊ］を含むブロック（ｃ［Ｊ．．Ｊ＋７］）が
キャッシュメモリにフェッチされ、ａ［Ｊ．．Ｊ＋７］
がキャッシュメモリから追い出される。したがい、この
時点で、ｂ［Ｊ．．Ｊ＋７］とｃ［Ｊ．．Ｊ＋７］がキ
ャッシュメモリ上に同時に存在することになる。また、
次に、配列要素ｄ［Ｊ］を参照すると、やはりキャッシ
ュミスが発生し、ｄ［Ｊ］を含むブロックｄ［Ｊ．．Ｊ
＋７］がキャッシュメモリにフェッチされ、ｂ［Ｊ．．
Ｊ＋７］がキャッシュメモリから追い出される。したが
い、この時点では、ｃ［Ｊ．．Ｊ＋７］とｄ［Ｊ．．Ｊ
＋７］がキャッシュメモリに同時に存在している。However, in this case, the array element c [J]
A cache miss occurs when you refer to. Then, the block (c [J..J + 7]) including c [J] is fetched into the cache memory, and a [J. ． J + 7]
Are flushed from cache memory. Therefore, at this point, b [J. ． J + 7] and c [J. ． J + 7] are simultaneously present in the cache memory. Also,
Next, referring to the array element d [J], a cache miss still occurs, and the block d [J. ． J
+7] is fetched into the cache memory, and b [J. ．
J + 7] is expelled from the cache memory. Therefore, at this point, c [J. ． J + 7] and d [J. ． J
+7] exists in the cache memory at the same time.

【００２４】さて、ループイタレーションの次の回では
以下のようになる。まず配列要素ａ［Ｊ＋１］を参照す
る。するとキャッシュミスが発生する。というのは競合
するキャッシュメモリラインにはｃ［Ｊ．．Ｊ＋７］と
ｄ［Ｊ．．Ｊ＋７］が格納されておりａ［Ｊ．．Ｊ＋
７］は追い出されているからである。このようにして配
列要素の参照ｂ［Ｊ＋１］，ｃ［Ｊ＋１］，ｄ［Ｊ＋
１］，．．．，ｄ［Ｊ＋７］毎に、ことごとくキャッシ
ュミスが発生することになる。また、同様に、ａ［Ｊ＋
８］以降のアクセスについても、同じ状況になるので、
結局すべての配列要素の参照についてキャシュミスが発
生する。Now, the next round of loop iteration is as follows. First, the array element a [J + 1] is referenced. Then, a cache miss occurs. This is because c [J. ． J + 7] and d [J. ． J + 7] is stored and a [J. ． J +
7] has been kicked out. In this way, array element references b [J + 1], c [J + 1], d [J +
1] ,. ．． , D [J + 7], a cache miss will occur. Similarly, a [J +
8] Since the same situation applies to access after that,
Eventually a cache miss will occur for all array element references.

【００２５】以上、図４のプログラムのル−プ内ではす
べての配列参照が、キャッシュミスを発生させることを
示した。このようなプログラムに関してキャッシュメモ
リを有効に利用するためには、キャッシュメモリの容量
の増大させるか、または、セット数の増大することが有
効である。As described above, it has been shown that all the array references in the program loop of FIG. 4 cause a cache miss. In order to effectively use the cache memory for such programs, it is effective to increase the capacity of the cache memory or increase the number of sets.

【００２６】すなわち、キャッシュメモリ容量を増大さ
せることにより、間接的にキャッシュメモリラインの競
合の機会を低減することができる。図４の例の場合、キ
ャッシュメモリの容量を２倍とすれば、その派生効果と
してキャッシュメモリラインの競合が４から２へ低減さ
れ、空間的局所性を活用できる。しかしキャッシュメモ
リ容量の増大はコスト増大を招くという大きな欠点があ
る。That is, by increasing the cache memory capacity, it is possible to indirectly reduce the chance of competition for the cache memory lines. In the case of the example of FIG. 4, if the capacity of the cache memory is doubled, the contention of the cache memory lines is reduced from 4 to 2 as a derivative effect, and the spatial locality can be utilized. However, there is a major drawback in that the increase in cache memory capacity causes an increase in cost.

【００２７】一方、セット数の増大は直接的にキャッシ
ュメモリライン競合の可能性を低減するものである。２
セットを４セットにした場合、ａ［Ｊ］，ｂ［Ｊ］，ｃ
［Ｊ］，ｄ［Ｊ］の４つの参照が同時期に発生したとし
てもａ［Ｊ］，ｂ［Ｊ］，ｃ［Ｊ］，ｄ［Ｊ］を含むラ
インは同時にキャッシュメモリに存在できる。だたし、
セット数の増大はキャッシュメモリアクセス時間を長く
してしまう欠点がある。そして、これはプロセッサの動
作周波数を低くしてしまう原因となる。On the other hand, the increase in the number of sets directly reduces the possibility of cache memory line competition. Two
When the number of sets is 4, a [J], b [J], c
Even if four references [J] and d [J] occur at the same time, a line including a [J], b [J], c [J], and d [J] can exist in the cache memory at the same time. However,
Increasing the number of sets has the drawback of increasing the cache memory access time. And this causes the operating frequency of the processor to be lowered.

【００２８】このように、キャッシュメモリ容量の増大
やセット数の増大等のハードウェアの面からの対策には
種々の問題が生じてしまう。As described above, various problems occur in the measures from the viewpoint of hardware such as an increase in the cache memory capacity and an increase in the number of sets.

【００２９】そこで、本発明は、ハードウェアの変更な
しに、プログラムの繰り返し実行される部分（ル−プ）
でアクセスされる複数のデータについても、キャッシュ
メモリのヒット率を向上することのできるプログラムの
変換方法を提供することを目的とする。Therefore, according to the present invention, a portion (loop) of a program that is repeatedly executed without changing hardware.
It is an object of the present invention to provide a program conversion method capable of improving the hit rate of a cache memory even for a plurality of data accessed by.

【００３０】[0030]

【課題を解決するための手段】前記目的は、キャッシュ
メモリ装置自体に変更を加えるのではなく、キャッシュ
メモリラインの競合を回避するようにプログラムを変換
することによって達成される。すなわち、ループ内で参
照される配列要素がキャッシュメモリラインを競合する
可能性があるプログラムに対して、それらの配列のメモ
リ上での配置が、キャッシュメモリラインを競合しない
ような配置に変更されるようにプログラムを変更すれば
よい。The above objects are achieved by transforming a program to avoid cache memory line conflicts, rather than making changes to the cache memory device itself. That is, for a program in which array elements referenced in a loop may conflict with cache memory lines, the placement of those arrays in memory is changed to an arrangement that does not conflict with cache memory lines. Change the program as follows.

【００３１】そこで、このために、本発明は、プログラ
ムを変換する方法であって、プログラムのループする処
理を記述している部分を判別するステップと、判別した
ループする処理を記述している部分内で、参照すること
を記述されている複数の配列のうち、配列の型が等し
く、かつ、前記ル−プ処理の各回の処理で参照される、
配列の要素の配列内の位置が同じ配列どうしを同じ類別
にまとめるステップと、同じ類別にまとめられた複数の
配列を定義する記述を、当該複数の配列により構成され
る１つの配列を定義する記述に変換するステップと、前
記ループ処理部分内の、同じ類別にまとめられた複数の
配列の要素を参照する記述を、当該複数の配列により構
成される１つの配列の対応する要素を参照する記述に変
換するステップとを有することを特徴とするプログラム
の変換方法を提供する。Therefore, for this purpose, the present invention is a method for converting a program, which comprises a step of determining a portion describing a looping process of the program, and a portion describing a determined looping process. Among the plurality of arrays described to be referred to, the types of the arrays are the same, and are referenced in each processing of the loop processing,
A step of grouping arrays that have the same position in the array of elements of the array into the same group, and a description that defines a plurality of arrays grouped into the same group, and a description that defines one array composed of the plurality of arrays. And a description that refers to elements of a plurality of arrays that are grouped in the same category in the loop processing part is changed to a description that refers to corresponding elements of one array configured by the plurality of arrays. And a converting step, which provides a program converting method.

【００３２】[0032]

【作用】本発明に係るプログラムの変換方法によれば、
プログラムのループする処理を記述している部分を判別
し、判別したループする処理を記述している部分内で、
参照することを記述されている複数の配列のうち、配列
の型が等しく、かつ、前記ル−プ処理の各回の処理で参
照される、配列の要素の配列内の位置が同じ配列どうし
を同じ類別にまとめ、プログラムの同じ類別にまとめら
れた複数の配列を定義する記述を、当該複数の配列によ
り構成される１つの配列を定義する記述に変換すると共
に、前記ループ処理部分内の、同じ類別にまとめられた
複数の配列の要素を参照する記述を、当該複数の配列に
より構成される１つの配列の対応する要素を参照する記
述に変換する。そして、このように記述を変換したプロ
グラムに対しては、前記１つの配列を構成する複数の配
列間のキャッシュラインの競合は生じない。また、前記
１つの配列を構成する複数の配列の配列内の位置が同じ
各配要素が、プログラムの実行時にプログラムを実行す
る計算機に主記憶上の連続した位置に、順次配置すれ
ば、より効率的にキャッシュメモリを利用するこができ
る。According to the program conversion method of the present invention,
Determine the part of the program that describes the looping process, and in the part that describes the determined looping process,
Among a plurality of arrays described to be referenced, the arrays have the same type, and the elements of the arrays referred to in each processing of the loop processing have the same position in the array. The description defining a plurality of arrays grouped into the same group and converted into a description defining one array composed of the plurality of arrays is performed, and the same grouping in the loop processing part is performed. The description referring to the elements of the plurality of arrays summarized in (1) is converted into the description referring to the corresponding element of one array configured by the plurality of arrays. Then, with respect to the program whose description is converted in this way, there is no competition of cache lines between a plurality of arrays forming the one array. Further, it is more efficient if the respective array elements having the same position in the array of the plurality of arrays forming the one array are sequentially arranged at the continuous position in the main memory in the computer executing the program at the time of executing the program. The cache memory can be used.

【００３３】[0033]

【実施例】以下、本発明の一実施例について説明する。EXAMPLES An example of the present invention will be described below.

【００３４】図２に、本実施例に係る計算機システムの
構成を示す。FIG. 2 shows the configuration of the computer system according to this embodiment.

【００３５】図示するように、計算機システムはＣＰＵ
２０１、主記憶装置２０２、外部記憶装置２０３、ディ
スプレイ装置２０４、キーボード２０５より構成されて
いる。As shown, the computer system is a CPU
It comprises 201, a main storage device 202, an external storage device 203, a display device 204, and a keyboard 205.

【００３６】外部記憶装置２０３にはソースプログラム
２０６と、オブジェクトプログラム２０７が格納され
る。主記憶装置２０２には、コンパイル処理を記述した
コンパイラプログラム２１１、コンパイル過程で必要と
なる中間語２０８、シンボルテーブル２０９、および、
ループ内配列参照テーブルテーブル２１０が格納され
る。コンパイル処理とは、ユ−ザによってＣ等の高級言
語によって記述されたソ−スプログラム２０６より、計
算機が直接解釈可能なアセンブリ言語やマシン語等によ
って記述したオブジェクトプログラム２０７を生成する
処理であり、ＣＰＵ２０１がコンパイラプログラムを実
行することにより実現される。A source program 206 and an object program 207 are stored in the external storage device 203. In the main storage device 202, a compiler program 211 describing a compiling process, an intermediate language 208 required in the compiling process, a symbol table 209, and
The in-loop array reference table table 210 is stored. The compiling process is a process for generating an object program 207 described by an assembly language or a machine language that can be directly interpreted by a computer from a source program 206 written by a user in a high-level language such as C. It is realized by the CPU 201 executing the compiler program.

【００３７】さて、キーボード２０５は、ユーザからの
コンパイラ起動命令を受け付ける。ディスプレイ装置２
０４は、コンパイル終了メッセージやエラーメッセージ
を表示する。The keyboard 205 receives a compiler activation instruction from the user. Display device 2
04 displays a compilation end message and an error message.

【００３８】以下、本実施例に係るコンパイル処理の詳
細について説明する。The details of the compiling process according to this embodiment will be described below.

【００３９】図３はコンパイラの処理を示すフローチャ
ートである。コンパイルは語彙解析３０１、構文解析３
０２、構造体化３０３、最適化３０４、コード生成３０
５の順に進む。FIG. 3 is a flow chart showing the processing of the compiler. Compile is lexical analysis 301, syntactic analysis 3
02, structuring 303, optimization 304, code generation 30
Proceed in the order of 5.

【００４０】このうち語彙解析、構文解析、最適化、コ
ード生成は従来のコンパイラにおける処理と同じである
ので簡単に説明する。Of these, the vocabulary analysis, the syntax analysis, the optimization, and the code generation are the same as the processing in the conventional compiler, and therefore will be briefly described.

【００４１】語彙解析３０１では、単に文字の列として
格納されているソースプログラムを外部記憶装置２０３
より逐次読み出し、単語（ｌｅｘｉｃｏｎ）の列にす
る。たとえば、前述した図４に示すプログラミング言語
Ｃで記述されたソ−スプログラムを語彙解析すると、図
５に示すような単語の列となる。In the lexical analysis 301, the source program stored simply as a character string is stored in the external storage device 203.
It is read out more sequentially and made into a row of words (lexicon). For example, when the source program described in the programming language C shown in FIG. 4 described above is lexically analyzed, a string of words as shown in FIG. 5 is obtained.

【００４２】ここで、図示するように、各単語は種別５
０１と字句５０２の組で表現される。また、各単語はソ
ースプログラムの出現順に並べられる。種別のｋｅｙｗ
ｏｒｄはプログラムのキーワード、ｉｄは識別子、ｐｕ
ｎｃは区切り記号、ｎｕｍは数字を表す。Here, as shown in the figure, each word is of type 5
It is expressed by a set of 01 and the token 502. The words are arranged in the order of appearance of the source program. Type of keyw
ord is a keyword of the program, id is an identifier, pu
nc represents a delimiter and num represents a number.

【００４３】次に、構文解析３０２は、単語の列を解析
する。構文解析は解析される文が宣言文であるか実行文
であるかによって異なる処理を行う。宣言文に対して
は、宣言される識別子をシンボルテーブル２０９に登録
し、実行文に対しては中間語２０８を主記憶装置２０２
上に作成する。Next, the syntactic analysis 302 analyzes the string of words. The syntax analysis performs different processing depending on whether the analyzed statement is a declarative statement or an executable statement. For the declaration statement, the declared identifier is registered in the symbol table 209, and for the execution statement, the intermediate word 208 is stored in the main storage device 202.
Create on top.

【００４４】ここで、シンボルテーブル２０９の例を図
６に示す。なお、図６に示した例は、先に説明した図４
のソ−スプログラムに対応している。An example of the symbol table 209 is shown in FIG. Note that the example shown in FIG. 6 corresponds to the example shown in FIG.
It corresponds to the source program of.

【００４５】図示するように、シンボルテーブル２０９
に登録されている情報は名称６０１、出現位置６０２、
型６０３、構造体化フラグ６０４等である。名称６０１
は識別子の名称、出現位置６０２は識別子が宣言された
位置（関数内か関数外か）を表す。型６０３は識別子の
型を表し、たとえば「ａｒｒａｙ（ｉｎｔ，２０４
８）」は「要素型がｉｎｔ（整数型）で要素数が２０４
８の配列」という型を表現する。構造体化フラグ６０４
は後述する構造体化処理３０３で設定されるフラグであ
り、最初はすべてｏｆｆとなっている。As shown, the symbol table 209
The information registered in is the name 601, the appearance position 602,
A mold 603, a structured flag 604, and the like. Name 601
Indicates the name of the identifier, and the appearance position 602 indicates the position where the identifier is declared (whether inside the function or outside the function). The type 603 represents the type of the identifier, and for example, “array (int, 204
8) ”is“ the element type is int (integer type) and the number of elements is 204
8 array ". Structured flag 604
Is a flag set in the structuring process 303 described later, and is initially off.

【００４６】次に、中間語２０８の例を図７に示す。な
お、この例も図４のプログラムに対応している。Next, an example of the intermediate word 208 is shown in FIG. This example also corresponds to the program of FIG.

【００４７】図示するように中間語２０８は木構造で表
現されている。木はノード（節）とエッジ（辺）の集合
である。図７でノードは四角で囲った部分、エッジはそ
れらを結ぶ線である。各ノードは１つの親ノードと０個
以上の子ノードを持つ（ただしルートと呼ばれる特別な
ノードだけは親ノードを持たない）。図では各ノードか
ら上に延びたエッジが親ノードを、下に延びたエッジが
子ノードを示す。子は左から第１子、第２子、…と呼
ぶ。たとえば「｛｝」（７０３）の親ノードは「ｆｕｃ
ｎ」（７０１）であり、子ノードは「ｆｏｒ」（７０
４）である。ルートノードは「ｆｕｎｃ」（７０１）で
ある。木はプログラムの論理構造を表現するのに適して
いるでコンパイラで多く用いられている。As shown in the figure, the intermediate word 208 is represented by a tree structure. A tree is a set of nodes and edges. In FIG. 7, nodes are parts surrounded by squares, and edges are lines connecting them. Each node has one parent node and zero or more child nodes (however, only a special node called the root has no parent node). In the figure, the edge extending upward from each node is a parent node, and the edge extending downward is a child node. The children are called the first child, the second child, ... From the left. For example, the parent node of “{}” (703) is “fuc
n ”(701), and the child node is“ for ”(70
4). The root node is "func" (701). Trees are suitable for representing the logical structure of programs and are often used in compilers.

【００４８】このようにして作成されたシンボルテーブ
ル２０９と中間語２０８は、ソ−スプログラムと等価な
プログラムということができる。The symbol table 209 and the intermediate language 208 thus created can be said to be a program equivalent to the source program.

【００４９】次に、最適化３０３では、木構造で表現さ
れた実行文の部分を操作する。そして冗長な部分を見つ
けてその冗長部分を削除するなどの最適化処理を行う。Next, in the optimization 303, the part of the executable statement represented by the tree structure is operated. Then, optimization processing such as finding a redundant portion and deleting the redundant portion is performed.

【００５０】最後のコード生成３０５では、アセンブリ
言語で表現されたオブジェクトプログラムを生成する。
なお、マシン語で表現されたオブジェクトモジュ−ルを
生成するものもある。In the final code generation 305, an object program expressed in assembly language is generated.
There is also one that generates an object module expressed in machine language.

【００５１】すなわち、コード生成３０５では、シンボ
ルテーブル２０９からはアセンブリ言語の領域定義命令
や定数定義命令を生成し、中間語２０８からはアセンブ
リ言語の機械語命令を生成する。That is, in the code generation 305, an assembly language area definition instruction and a constant definition instruction are generated from the symbol table 209, and an assembly language machine language instruction is generated from the intermediate language 208.

【００５２】次に、構造体化３０３の処理手順について
説明する。Next, the processing procedure of the structuring 303 will be described.

【００５３】図１は構造体化３０３の処理手順を、詳し
く表したフローチャートである。FIG. 1 is a flow chart showing in detail the processing procedure of the structuring 303.

【００５４】図示するように、構造体化３０３の処理
は、ループ構造の認識１０１、ループ内配列参照の解析
１０２、配列群の類別１０３、配列要素の構造体化１０
４、配列要素参照の構造体化メンバ参照への変更１０５
の順で進む。As shown in the figure, the processing of structuring 303 is performed by recognizing a loop structure 101, analyzing an array reference in a loop 102, classifying an array group 103, and structuring an array element 10
4. Change of array element reference to structured member reference 105
In order.

【００５５】以下これらの処理を具体的に説明する。These processes will be specifically described below.

【００５６】構造体化３０３では、まずステップ１０１
で中間語２０８よりループ構造の認識１０１を行う。こ
の処理では中間語２０８を走査し、ループを表すノード
を見つけ、そのループで繰り返し実行される文（これ
を、以降単に「ループ実行文」と呼ぶ）を認識する。ル
ープ認識処理は、木のルートから走査を始める。たとえ
ば図７の中間語において、ルートのｆｕｎｃ７０１は関
数定義を表し、その第１子が関数名ｍａｉｎを、第２子
が関数本体を表す。第２子は｛｝７０３である。｛｝の
第１子に移ると、ｆｏｒ７０４（ループノードの１種）
が見つかる（最初のｆｏｒ文）。ｆｏｒノードで実行さ
れる文を表すのは第４子である（第１子は初期値設定
文、第２子は繰り返し判定文、第３子は制御変数更新文
である）ので、第４子に移る。第４子のｆｏｒ７０８は
再びｆｏｒノードである（２番目のｆｏｒ文）。そこで
さらにその第４子へ移ると、それは｛｝７０９であり、
さらに｛｝７０９の子には２つの代入文（＝で示され
る）があることがわかる。したがって２番目のｆｏｒ文
のループ実行文は２つの代入文であることがわかる。以
上でループ構造の認識が終わる。In structuring 303, first, step 101
Then, the loop structure is recognized 101 from the intermediate word 208. In this process, the intermediate word 208 is scanned, a node representing a loop is found, and a statement repeatedly executed in the loop (hereinafter, simply referred to as “loop execution statement”) is recognized. The loop recognition process starts scanning from the root of the tree. For example, in the intermediate language of FIG. 7, the root func 701 represents a function definition, the first child of which represents the function name main and the second child represents the function body. The second child is {} 703. Moving to the first child of {}, for704 (a kind of loop node)
Is found (first for sentence). The fourth child represents the statement executed by the for node (the first child is the initial value setting statement, the second child is the repeat determination statement, and the third child is the control variable update statement). Move on to. The fourth child for708 is again a for node (second for sentence). So when I moved to the fourth child, it was {} 709,
Furthermore, it can be seen that the child of {} 709 has two assignment statements (indicated by =). Therefore, it is understood that the loop execution statement of the second for statement is two assignment statements. This is the end of recognition of the loop structure.

【００５７】次にステップ１０２で、ループ内の配列要
素参照解析を行う。この処理ではステップ１０１で認識
したループ実行文の中間語２０８を走査し、配列要素参
照ノードを見つけ、配列名、添字式、参照状況（使用／
定義の別）を調べ、これを、主記憶装置２０２の配列要
素参照テーブル２１０に登録する。ここで、添字式と
は、配列ａ［Ｆ（ｘ）］の関数Ｆが表す式をいう。Next, at step 102, the array element reference analysis in the loop is performed. In this processing, the intermediate word 208 of the loop execution statement recognized in step 101 is scanned to find the array element reference node, and the array name, subscript expression, reference status (use /
The definition) is checked and registered in the array element reference table 210 of the main memory 202. Here, the subscript expression is an expression represented by the function F of the array a [F (x)].

【００５８】たとえば、図７に示した中間語では配列要
素参照ノード（［］で示される）の第１子が配列名を、
第２子が添字式を表す。また代入ノード（＝で示され
る）の第１子が代入先（定義側）を、第２子が代入元
（使用側）を表すとする。図７の中間語におけるループ
実行文は２つの代入文である。For example, in the intermediate language shown in FIG. 7, the first child of the array element reference node (indicated by []) is the array name,
The second child represents a subscript expression. Further, it is assumed that the first child of the assignment node (denoted by =) represents the assignment destination (definition side) and the second child represents the assignment source (use side). The loop execution statement in the intermediate language of FIG. 7 is two assignment statements.

【００５９】最初の代入文（７１０）は、配列名ａ、添
字式ｊで指定される配列要素（ソースプログラム上の表
現ではａ［ｊ］に対応）を配列名ｂ、添字式ｊで指定さ
れる配列要素（ソースプログラム上の表現ではｂ［ｊ］
に対応）に代入する形である。以下、配列要素について
は説明を明瞭にするために、ソースプログラム上の表現
を便宜的に用いる。次の代入文（７１１）は、配列要素
ｃ［ｊ］を配列要素ｄ［ｊ］に代入する形である。よっ
て、図７に示した中間語より、ステップ１０２によって
図８に示す配列要素参照テーブル２１０が作成される。
なお、配列要素が定義され、かつ使用されている場合は
配列要素参照テーブル２１０の参照状況は「定義」とす
る。In the first assignment statement (710), the array element designated by the array name a and the subscript expression j (corresponding to a [j] in the expression on the source program) is designated by the array name b and the subscript expression j. Array element (b [j] in the source program representation)
Corresponding to) is the form to substitute. Hereinafter, for the sake of clarity, the expression on the source program will be used for the array elements for the sake of clarity. The following assignment statement (711) has a form of assigning the array element c [j] to the array element d [j]. Therefore, in step 102, the array element reference table 210 shown in FIG. 8 is created from the intermediate language shown in FIG.
When the array element is defined and used, the reference status of the array element reference table 210 is “definition”.

【００６０】次に、ステップ１０３で、配列群を類別す
る。Next, in step 103, the array groups are classified.

【００６１】この処理は、各配列要素参照が同じ類に属
するか否かを判定する。ここで、配列要素参照とは、配
列の要素の参照を表す記述をいい、ソ−スプログラム上
ではａ［ｊ］やｃ［ｊ］等の記述が該当し、中間語上で
は、配列要素参照ノード（［］で示される）と、その子
の記述が該当する。This process determines whether each array element reference belongs to the same class. Here, the array element reference refers to a description indicating a reference to an element of an array, which corresponds to a description such as a [j] or c [j] on the source program, and an array element reference on the intermediate language. The description of the node (indicated by []) and its children correspond.

【００６２】この処理のフローチャートを図９に示す。A flowchart of this processing is shown in FIG.

【００６３】この処理では、まずステップ９０１で、与
えられた配列要素参照の添字式がともにループ制御変数
の１次式となっているかを調べる。ここで１次式とは
「定数１＊ループ制御変数＋定数２」の形をしているこ
とである（ただし定数１、定数２は０でもよい）。少な
くともどちらか一方がループ制御変数の１次式になって
いない場合は、同じ類に属さないと判定して（９０６）
処理終了する。In this processing, first, at step 901, it is checked whether both the subscript expressions of the given array element references are linear expressions of the loop control variable. Here, the linear expression has a form of “constant 1 * loop control variable + constant 2” (however, constant 1 and constant 2 may be 0). If at least one of them is not a linear expression of the loop control variable, it is judged that they do not belong to the same class (906).
Processing ends.

【００６４】ともに１次式である場合は次にステップ９
０２で、添字式の係数と定数項がそれぞれ等しいかどう
か（定数１、定数２がそれぞれ等しいかどうか）を調べ
る。等しくない場合は、同じ類に属さないと判定して
（９０６）処理を終了する。If both are linear expressions, then step 9
In 02, it is checked whether the coefficient of the subscript expression and the constant term are equal (whether constant 1 and constant 2 are equal). If they are not equal, it is determined that they do not belong to the same class (906), and the process is terminated.

【００６５】等しい場合は次にステップ９０３で、２つ
の配列の型が等しいかを調べる。ここで型が等しいと
は、配列要素の型と配列の要素数がともに等しいことを
意味する。等しくない場合は、同じ類に属さないと判定
して（９０６）処理を終了する。If they are equal, then in step 903, it is checked whether the two arrays have the same type. Equal type here means that the type of array element and the number of elements of the array are equal. If they are not equal, it is determined that they do not belong to the same class (906), and the process is terminated.

【００６６】型が等しい場合は次にステップ９０４で、
参照状況が等しいかどうかを調べる。ここで参照状況が
等しいとは、２つの参照がいずれも定義であるか、また
はいずれも使用であるということを意味する。等しけれ
ば同じ類に属すると判定して（９０５）処理終了する。
等しくなければ同じ類に属さないと判定して（９０６）
処理を終了する。If the types are the same, then in step 904,
Check if the reference statuses are equal. Here, the reference situations being equal means that the two references are both definitions or both are uses. If they are equal, it is determined that they belong to the same class (905), and the process ends.
If they are not equal, it is determined that they do not belong to the same class (906)
The process ends.

【００６７】以上の処理によって、たとえば図４に示し
たソ−スのプログラムでは参照ａ［ｊ］とｃ［ｊ］が同
じ類に、また参照ｂ［ｊ］とｄ［ｊ］が同じ類に含まれ
ると判定される（ｊはループ制御変数である）。By the above processing, for example, in the source program shown in FIG. 4, the references a [j] and c [j] are in the same class, and the references b [j] and d [j] are in the same class. It is determined to be included (j is a loop control variable).

【００６８】さて、図１に戻り、次にステップ１０４
で、配列要素の構造体化を行う。Now, returning to FIG.
Then, the array elements are structured.

【００６９】この処理では、１つの類に属する配列参照
によって参照される複数の配列をまとめて１つの配列に
する。その配列の要素型は構造体型となるが、それは次
のように求める。１つの類に属する配列をＡ１，…，Ａ
ｎとし、それらの要素型をｔ、配列要素数をＮとする
（同類であれば配列の要素型・要素数は同じである）。
すると合体して１つにした配列の要素型は（Ｃ言語で表
せば）、という型になる。これは、合体して１つにした配列のｉ
番目の要素は、ｎ個の配列Ａ１，…，Ａｎそれぞれのｉ
番目の整数型要素による配列であることを表している。
なお、合体して１つにした配列の要素数はＮのままであ
る。In this processing, a plurality of arrays referred to by the array reference belonging to one class are put together into one array. The element type of the array is a structure type, which is calculated as follows. Sequences belonging to one class are A1, ..., A
Let n be the element type thereof, and t be the number of array elements (if they are of the same type, the element type and number of elements of the array are the same).
Then, the element types of the combined array are (in C language): It becomes the type. This is the i of the combined sequence.
The th element is the i of each of the n arrays A1, ..., An.
Indicates that the array is the th integer type element.
Note that the number of elements in the array that has been merged into one remains N.

【００７０】さて合体して１つにした配列の配列名はユ
ニークな（他の名前と競合しない）名前とするが、ここ
では元の配列名を単につなげたもの（Ａ１…Ａｎ）を便
宜的に用いることにする。The sequence names of the combined sequences are made unique (do not conflict with other names), but here, the original sequence names are simply combined (A1 ... An) for convenience. I will use it for.

【００７１】さて、このように新たに生成した配列は、
シンボルテーブル２０９に登録する。そして、もとの配
列に対しては、構造体化されたことを示す「構造体化フ
ラグ」をオンにする。さらに構造体化した後の配列シン
ボルを登録する。図４のプログラムに対してステップ１
０４の処理を行うと、シンボルテーブルは図１０に示す
ようになる。図１０では配列ａとｃが構造体化されて１
つの配列ａｃ（１００１）となり、配列ｂとｄが構造体
化されて１つの配列ｂｄ（１００２）になっている。こ
れらの配列の型はともに「ａｒｒａｙ（ｓｔｒｕｃｔ
（ｉｎｔ，ｉｎｔ），２０４８）」になる。そして配列
ａ，ｂ，ｃ，ｄに対しては構造体化フラグがオンにな
る。また構造体化した後のシンボルを示すフィールド
（６０５）には、配列ａ，ｃに対しては配列ａｃ、配列
ｂ，ｄに対しては配列ｂｄが登録される。Now, the array newly generated in this way is
Register in the symbol table 209. Then, the "structuring flag" indicating that the original array has been structured is turned on. The array symbol after further structuring is registered. Step 1 for the program in Figure 4
When the processing of 04 is performed, the symbol table becomes as shown in FIG. In FIG. 10, arrays a and c are structured into 1
One array ac (1001) is formed, and the arrays b and d are structured into one array bd (1002). The types of these arrays are both "array (struct
(Int, int), 2048) ”. The structuring flag is turned on for the arrays a, b, c and d. Further, in the field (605) indicating the symbol after being structured, the array ac is registered for the arrays a and c, and the array bd is registered for the arrays b and d.

【００７２】次にステップ１０５で、配列要素参照を構
造体化メンバ参照へ変更する。これは、中間語を再度走
査し、構造体化フラグがオンである配列要素への参照が
あれば、それを構造体化した後の配列要素参照（構造体
メンバ参照）に置き換える処理である。図１１はこの処
理での中間語の変更を示す図である。もとの配列参照を
Ａ、構造体化後の配列をＢとすると、１１０１が変更前
の木を、１１０２が変更後の木を表す。図１１で「．」
というノードは構造体のメンバ参照を表すノードであ
る。Next, at step 105, the array element reference is changed to a structured member reference. This is a process of scanning the intermediate word again, and if there is a reference to an array element whose structuring flag is on, replaces it with the array element reference (structure member reference) after structuring. FIG. 11 is a diagram showing the change of the intermediate language in this processing. Letting A be the original array reference and B be the array after structuring, 1101 represents the tree before the change and 1102 represents the tree after the change. In Fig. 11, "."
Is a node that represents the member reference of the structure.

【００７３】図１１の変更は、ソースプログラム表現で
表現すればＡ［ｅ］がＢ［ｅ］．Ａに変わることを意味
する。図４のプログラムでは図１０のシンボルテーブル
で示されるように配列ａ，ｂ，ｃ，ｄが書き換えの対象
となる。したがって図７の中間語を走査すると配列参照
ａ［ｊ］，ｂ［ｊ］，ｃ［ｊ］，ｄ［ｊ］が変更の対象
として見つかり、以下のように変更される（実際には、
図１１に示すように中間語レベルで変更する）ａ［ｊ］ → ａｃ［ｊ］．ａｂ［ｊ］ → ｂｄ［ｊ］．ｂｃ［ｊ］ → ａｃ［ｊ］．ｃｄ［ｊ］ → ｂｄ［ｊ］．ｄここで、ａｃ［ｊ］．ａは、配列ａｃの要素のうち、配
列ａｃに構造体化されている配列ａのｊ番目の要素を表
している。The modification of FIG. 11 is that A [e] is changed to B [e]. It means to change to A. In the program of FIG. 4, arrays a, b, c, and d are rewritten as shown in the symbol table of FIG. Therefore, when scanning the intermediate word of FIG. 7, array references a [j], b [j], c [j], d [j] are found to be changed and are changed as follows (actually,
Change at intermediate language level as shown in FIG. 11) a [j] → ac [j]. a b [j] → bd [j]. bc [j] → ac [j]. cd [j] → bd [j]. d where ac [j]. a represents the j-th element of the array a that is structured into the array ac among the elements of the array ac.

【００７４】以上で構造体化処理の説明を終わる。This is the end of the explanation of the structuring process.

【００７５】図１２は構造体化処理が終った時点でのプ
ログラムをソースプログラムイメージで示したものであ
る。ただし、以上の処理は、ソ−スプログラムと等価な
中間語２０８やシンボルテ−ブル２０９に対して行うの
で、実際にソースプログラムが変更されるわけではな
い。FIG. 12 is a source program image showing a program at the time when the structuring process is completed. However, since the above processing is performed on the intermediate language 208 and the symbol table 209 equivalent to the source program, the source program is not actually changed.

【００７６】さて、このようにして、構造体化３０３に
よって変更された中間語２０８やシンボルテ−ブル２０
９は、前述したように、最適化３０で、中間後の木構造
で表現された実行文の冗長部分を削除するなどの最適化
処理を行われる。そして、コード生成３０５では、構造
体化３０３によって変更された中間語２０８やシンボル
テ−ブル２０９より、アセンブリ言語もしくはマシン語
で表現されたオブジェクトプログラムを生成する。すな
わち、前述したように、コード生成３０５は、シンボル
テーブル２０９からはアセンブリ言語の領域定義命令や
定数定義命令を生成し、中間語２０８からはアセンブリ
言語の機械語命令を生成する。Now, the intermediate language 208 and the symbol table 20 changed by the structuring 303 in this way are described.
As described above, 9 is the optimization 30 in which optimization processing such as deleting the redundant portion of the executable statement represented by the tree structure after the intermediate processing is performed. Then, in the code generation 305, an object program expressed in assembly language or machine language is generated from the intermediate language 208 and the symbol table 209 changed by the structuring 303. That is, as described above, the code generator 305 generates an assembly language area definition instruction and a constant definition instruction from the symbol table 209, and generates an assembly language machine language instruction from the intermediate language 208.

【００７７】この、シンボルテーブル２０９からのアセ
ンブリ言語の領域定義命令の生成に際して、コード生成
３０５は、シンボルテーブル２０９の構造体化フラグ６
０４がオンに設定されている配列についての領域定義命
令は生成しない。すなわち、たとえば、構造体化された
配列ａｃについて領域は定義するが、この元となった配
列ａ，ｃそれぞれについては領域の定義を行わない。When generating the assembly language area definition instruction from the symbol table 209, the code generator 305 uses the structuring flag 6 of the symbol table 209.
No area definition instruction is generated for an array in which 04 is set to ON. That is, for example, the area is defined for the structured array ac, but the area is not defined for each of the original arrays a and c.

【００７８】また、構造体化された配列は、オブジェク
トプログラムの実行時、オブジェクトプログラムを実行
する計算機の主記憶上の、配列ａｃについての領域定義
命令によって定義された領域にａｃ［０］．ａ，ａｃ
［０］．ｃ，ａｃ［１］．ａ，ａｃ［１］．ｃ，…，ａ
ｃ［２０４７］．ａ，ａｃ［２０４７］．ｃの順に格納
される。When the object program is executed, the structured array is stored in the area defined by the area definition command for the array ac on the main memory of the computer that executes the object program. a, ac
[0]. c, ac [1]. a, ac [1]. c, ..., a
c [2047]. a, ac [2047]. It is stored in the order of c.

【００７９】以下、このようにして生成したオブジェク
トプログラム２０７が実行される際に、キャッシュメモ
リででキャッシュミスがどの程度発生するかを説明す
る。Hereinafter, it will be described how many cache misses occur in the cache memory when the object program 207 thus generated is executed.

【００８０】いま、図１３に示すようなキャッシュメモ
リを想定する。Now, assume a cache memory as shown in FIG.

【００８１】図１３に示したキャッシュメモリは、セッ
ト数２のアソシアティブキャッシュである。キャッシュ
容量は全体で８ｋバイト、すなわち１つのセットの容量
は４ｋバイトである。１セットは１２８個のエントリを
持つ。エントリの番号をインデクスと呼び、同じインデ
クスをもつ２つのエントリを合わせてラインと呼ぶ。１
ラインには１つのラインは制御ビット（１２０１）、タ
グ（１２０２）、データ（１２０３）の３つの部分から
なる。データ部には３２バイト（８語）分のメモリデー
タが格納される。タグはデータ部に格納しているデータ
が、どのメモリアドレスに対応するものかを特定するフ
ィールドである。制御ビットはｅｍｐｔｙビット（１２
０４）、ｒｅｃｅｎｔビット（１２０５）、ｄｉｒｔｙ
ビット（１２０６）等からなる。ｅｍｐｔｙビットは当
ラインが空きであるかを示す（真のとき空き）。ｒｅｃ
ｅｎｔビットは当エントリが、他方のエントリよりも後
にアクセスされたかどうかを示す（真のとき後）。ｄｉ
ｒｔｙビットは当エントリに書き込みがあったかを示す
（真のとき書き込み有り）。The cache memory shown in FIG. 13 is an associative cache having two sets. The total cache capacity is 8 kbytes, that is, the capacity of one set is 4 kbytes. One set has 128 entries. The entry number is called an index, and two entries having the same index are collectively called a line. 1
In each line, one line consists of three parts: control bit (1201), tag (1202), and data (1203). 32 bytes (8 words) of memory data are stored in the data section. The tag is a field that specifies to which memory address the data stored in the data section corresponds. The control bit is the empty bit (12
04), recent bit (1205), dirty
It consists of bits (1206) and the like. The empty bit indicates whether this line is empty (when it is true, it is empty). rec
The ent bit indicates whether this entry is accessed later than the other entry (when true, after). di
The rty bit indicates whether or not there is a write in this entry (when true, there is a write).

【００８２】メモリアクセスが発生すると、メモリアド
レス（１２０７）のうち、下位から６番目から１２番目
の７ビットの値をインデクス（１２０９）としてキャッ
シュライン中のラインを特定する。セット数２であるの
で、１つのラインに２つのエントリがある。次に当該の
メモリアドレスの上位２０ビットの値をタグ（１２０
８）として取り出し、それとエントリ内に格納されたタ
グ（１２０２）の値をそれぞれ比較器（１２１０）で比
較する。一致するエントリがあればキャッシュヒットと
なる。そうでなければキャッシュミスになる。キャッシ
ュミスの場合はデータをメモリからフェッチし、２つの
エントリのうち空（制御ビットのｅｍｔｐｙビットが
真）のエントリに格納する。いずれのエントリも空でな
ければ、最後にアクセスされた時刻がより古い（ｒｅｃ
ｅｎｔビットが真でない）エントリに格納する。ただし
格納する前に、そのエントリに書き込みがあったかどう
か（ｄｉｒｔｙビットが真かどうか）を調べ、書き込み
があれば以前のエントリ内容を主メモリに書き戻す。書
き込みがなければ主メモリに書き戻す必要はない。When a memory access occurs, the line in the cache line is specified by using the 6th to 12th lowest 7-bit value of the memory address (1207) as an index (1209). Since the number of sets is 2, there are two entries in one line. Next, the value of the upper 20 bits of the relevant memory address is set to the tag (120
8), and the value of the tag (1202) stored in the entry is compared by the comparator (1210). If there is a matching entry, it is a cache hit. Otherwise, you will get a cache miss. In the case of a cache miss, the data is fetched from the memory and stored in the empty entry (the emtppy bit of the control bit is true) of the two entries. If neither entry is empty, the last accessed time is older (rec
ent bit is not true) Store in entry. However, before storing, it is checked whether or not the entry has been written (whether the dirty bit is true), and if there is the writing, the previous entry contents are written back to the main memory. If there is no writing, there is no need to write it back to the main memory.

【００８３】このようなキャッシュメモリを備えた計算
機において、先に生成したオブジェクトプログラムを実
行すると、各配列データは、主メモリに、図１４に示す
ように配置されることになる。When the previously created object program is executed in a computer having such a cache memory, each array data is arranged in the main memory as shown in FIG.

【００８４】図１４は、構造体化した配列ａｃ，ｂｄの
主メモリ内での配置を示した図である。図示するよう
に、配列ａｃは００００００００番地から００００２Ｆ
ＦＣ番地に、ａｃ［０］．ａ，ａｃ［０］．ｃ，ａｃ
［１］．ａ，ａｃ［１］．ｃ，…，ａｃ［２０４７］．
ａ，ａｃ［２０４７］．ｃの順に格納される。配列ｂｄ
は００００３０００番地から００００４ＦＦＣ番地に、
ｂｄ［０］．ｂ，ｂｄ［０］．ｄ，ｂｄ［１］．ｂ，ｂ
ｄ［１］．ｄ，…，ｂｄ［２０４７］．ｂ，ｂｄ［２０
４７］．ｄの順に格納される。FIG. 14 is a diagram showing the arrangement of the structured arrays ac and bd in the main memory. As shown in the figure, the array ac is from 00000000 to 00002F
At the FC address, ac [0]. a, ac [0]. c, ac
[1]. a, ac [1]. c, ..., ac [2047].
a, ac [2047]. It is stored in the order of c. Array bd
From address 003000 to address 00004FFC,
bd [0]. b, bd [0]. d, bd [1]. b, b
d [1]. d, ..., bd [2047]. b, bd [20
47]. They are stored in the order of d.

【００８５】配列要素データがキャッシュメモリにフェ
ッチされるときは８要素（８語）分が１度にフェッチさ
れる。たとえば、配列要素ａｃ［０］．ａがアクセスさ
れ、もしそれがキャッシュになかった場合、ａｃ
［０］．ａを含む８語分のメモリブロック（ａｃ
［０］．ａ，…，ａｃ［３］．ｃ）がキャッシュにフェ
ッチされる。また、この場合は、任意のＩに対して、ａ
ｃ［Ｉ］とｂｄ［Ｉ］のアドレスの下位１２ビット部分
の値は等しくなる。たとえばａｃ［１］のアドレスは０
００００００４、ｂｄ［１］のアドレスは００００３０
０４で、下位１２ビットはともに００４である。下位１
２ビットによりキャッシュラインが決まるので、これら
は同じキャッシュラインに格納される。すなわち、ａｃ
［Ｉ］とｂｄ［Ｉ］はキャッシュラインを競合する。When array element data is fetched into the cache memory, 8 elements (8 words) are fetched at once. For example, array element ac [0]. If a is accessed and it is not in the cache, ac
[0]. 8 word memory block including a (ac
[0]. a, ..., ac [3]. c) is fetched into the cache. In this case, for any I, a
The values of the lower 12 bits of the addresses of c [I] and bd [I] are equal. For example, the address of ac [1] is 0
The address of 00000004, bd [1] is 000030
In 04, the lower 12 bits are both 004. Bottom 1
Since the cache line is determined by 2 bits, they are stored in the same cache line. That is, ac
[I] and bd [I] compete for a cache line.

【００８６】さて、図４に示したソ−スプログラムのオ
ブジェクトプログラム実行に沿って、キャッシュメモリ
の振舞いを示すと次のようになる。The behavior of the cache memory along with the object program execution of the source program shown in FIG. 4 is as follows.

【００８７】先ず配列要素ａｃ［０］．ａが参照され
る。最初のアクセスでありａｃ［０］．ａを含むメモリ
ブロック（ａｃ［０］．ａ，…，ａｃ［３］．ｃ）がキ
ャッシュメモリにフェッチされる。次にｂｃ［０］．ｂ
が参照される。これも最初の参照でありｂｄ［０］．ｂ
を含むメモリブロック（ｂｄ［０］．ｂ，…，ｂｄ
［３］．ｄ）がキャッシュメモリにフェッチされる。こ
れら２つのメモリブロックはキャッシュラインを競合す
るが、セット数が２であるので共にキャッシュメモリに
残る。以降、ａｃ［０］．ｃ，ｂｄ［０］．ｄ，ａｃ
［１］．ａ，ｂｄ［１］．ｂ，ａｃ［１］．ｃ，ｂｄ
［１］．ｄ，…，ｂｄ［３］．ｄの順に参照するが、こ
れらはすべて既にキャッシュメモリにフェッチしてある
のでキャッシュヒットとなる。まとめると、１６回の参
照でキャッシュミスが２回である。その次の配列参照は
ａｃ［４］．ａであるが、この部分は最初のａｃ
［０］．ａの参照と同じ状況となる。したがって、１６
回の参照毎に２回キャッシュミスが発生することにな
る。First, the array element ac [0]. Reference is made to a. This is the first access and ac [0]. A memory block (ac [0] .a, ..., ac [3] .c) including a is fetched into the cache memory. Next, bc [0]. b
Is referred to. This is also the first reference, bd [0]. b
, Including memory block (bd [0] .b, ..., bd
[3]. d) is fetched into the cache memory. These two memory blocks compete for the cache line, but both sets remain in the cache memory because the number of sets is two. After that, ac [0]. c, bd [0]. d, ac
[1]. a, bd [1]. b, ac [1]. c, bd
[1]. d, ..., bd [3]. Although they are referred to in the order of d, they are cache hits because they have already been fetched into the cache memory. In summary, 16 references refer to two cache misses. The next sequence reference is ac [4]. a, but this part is the first ac
[0]. It becomes the same situation as the reference of a. Therefore, 16
A cache miss will occur twice for each reference.

【００８８】ここで、比較のために、図１５に、配列を
構造体化しなかった場合の、配列ａ，ｂ，ｃ，ｄの主メ
モリ内での配置を示す。この場合、配列ａは０００００
０００番地から００００１ＦＦＣ番地に、配列ｂは００
００２０００番地から００００２ＦＦＣ番地に、配列ｃ
は００００３０００番地から００００３ＦＦＣ番地に、
配列ｄは００００４０００番地から００００４ＦＦＣ番
地に連続して格納される。また、任意のＩに対して、ａ
［Ｉ］、ｂ［Ｉ］、ｃ［Ｉ］、ｄ［Ｉ］のアドレスの下
位１２ビット部分の値はすべて等しい。すなわち、これ
らはキャッシュラインを競合する。したがい、この場
合、前述したように１６回の参照で１６回、すなわち毎
回キャッシュミスが発生することになる。For comparison, FIG. 15 shows the arrangement of the arrays a, b, c and d in the main memory when the arrays are not structured. In this case, the array a is 00000
Array b is 00 from address 000 to address 00001FFC
Array c from address 002000 to address 00002FFC
From 00003000 to 0000FFFC,
The array d is continuously stored from the address 0004000 to the address 0000440FFC. Also, for any I, a
The values of the lower 12 bits of the addresses of [I], b [I], c [I], and d [I] are all equal. That is, they compete for cache lines. Therefore, in this case, as described above, the cache miss occurs 16 times, that is, every time the reference is made 16 times.

【００８９】したがい、本実施例によれば、ず４に示し
たソ−スプログラムについて、キャッシュミスを１／８
（＝２／１６）に減ずることができたことになる。Therefore, according to the present embodiment, with respect to the source program shown in step 4, the cache miss is 1/8.
This means that we were able to reduce it to (= 2/16).

【００９０】なお、本実施例によれば、参照状況が同じ
配列をまとめて一つの配列に構造体化しているので、参
照状況が「使用」の配列をまとめた配列は、キャッシュ
メモリ上で変更されない。したがい、この配列について
主メモリへの書き戻しが不要になる。すなわち、配列ａ
ｃは、キャッシュメモリ上で変更されないので、配列ａ
ｃの部分を含むキャッシュラインがキャッシュから追い
出されるとき、ｄｉｒｔｙはセットされておらず主メモ
リへの書き戻しが不要となる。このことは、前述したよ
うな、特に主メモリへの書き戻し方式としてストアイン
方式を採用する計算機にとって有利である。なぜなら
ば、主メモリへの書き戻しを行う必要が生じるデータ、
メモリブロックを局在化できるので書き戻し回数を低減
することができるからである。しかし、もし、書き戻し
方式としてストアスル−方式を採用する計算機上でのみ
実行されるプログラムであれば、このような効果は期待
できない。そこで、このような場合等には、参照状況が
異なる配列であっても、まとめて一つの配列に構造体化
するようにしてもよい。すなわち、図９の９０４のステ
ップを省略するようにしてもよい。もちろん、書き戻し
方式としてストアイン方式を採用する計算機で実行され
るプログラムにおいて、参照状況が異なる配列であって
も、まとめて一つの配列に構造体化するようにしても前
述したキャッシュミスを低減する効果を達成することが
できる。この場合、図４に示した例では、配列ａ，ｂ，
ｃ，ｄが一つの配列ａｂｃｄに構造体化されることにな
る。そして、この場合、キャッシュラインの競合は発生
せず、８回の参照毎に１度、キャッシュメモリへのフェ
ッチが生じることになる。これは、ダイレクト・マッピ
ング方式においても同様である、したがい、このように
することにより、ダイレクト・マッピング方式において
もキャッシュミスを低減することができる。According to the present embodiment, since the arrays having the same reference status are grouped into a single array, the array in which the reference status is "used" is changed in the cache memory. Not done. Therefore, there is no need to write back this array to main memory. That is, array a
Since c is not changed in the cache memory, array a
When the cache line including the part of c is evicted from the cache, the dirty is not set and the writing back to the main memory is unnecessary. This is advantageous for a computer that employs the store-in method as a method for writing back to the main memory, as described above. Because the data that needs to be written back to the main memory,
This is because the memory blocks can be localized and the number of write backs can be reduced. However, if the program is executed only on a computer that adopts the store-through method as the write-back method, such an effect cannot be expected. Therefore, in such a case, even arrays having different reference situations may be collectively structured into one array. That is, step 904 in FIG. 9 may be omitted. Of course, in a program executed on a computer that uses the store-in method as the write-back method, even if the arrays have different reference conditions, the cache misses described above are reduced even if they are structured into a single array. The effect of doing can be achieved. In this case, in the example shown in FIG. 4, the arrays a, b,
c and d are structured into one array abcd. In this case, cache line contention does not occur, and a fetch to the cache memory occurs once every eight references. This is also the case with the direct mapping method. Therefore, by doing so, cache misses can be reduced also in the direct mapping method.

【００９１】なお、以上の実施例では、Ｃとして知られ
るプログラミング言語により記述されたソ−スプログラ
ムを対象とする場合を例にとり説明したが、たとえば、
ＦＯＲＴＲＡＮとして知られるプログラミング言語等、
他のプログラミング言語についても同様に実施すること
ができる。In the above embodiments, the source program written in the programming language known as C is described as an example.
A programming language known as FORTRAN,
Other programming languages can be implemented similarly.

【００９２】また、本実施例では、コンパイル時に、中
間言語やシンボルテ−ブルに対して、配列の構造体化や
配列要素参照の変換を行ったが、この結果を反映するよ
うにソ−スプログラムを変換するようにしてもよい。た
とえば、図４に示したソ−スプログラムを図１２に示し
たソ−スプログラムに変換するようにしてもよい。この
場合は、この後、図１２に示したソ−スプログラムを従
来と同様にコンパイルすればよい。In the present embodiment, the structure of the array and the conversion of the array element reference are performed for the intermediate language and the symbol table at the time of compiling, but the source program is made to reflect the result. May be converted. For example, the source program shown in FIG. 4 may be converted into the source program shown in FIG. In this case, after that, the source program shown in FIG. 12 may be compiled in the same manner as the conventional one.

【００９３】以上述べたように本実施例によれば、キャ
ッシュラインの競合によりキャッシュミスが頻発するプ
ログラムを、自動的にキャッシュミスの発生が少ないプ
ログラムに変換することができる。これによりキャッシ
ュ装置の変更なしに、プログラムの実行性能を向上する
ことができる。また、本実施例によれば、このようなプ
ログラムの変換が計算機によって自動的にできるので、
プログラマがプログラムを人手で変更するのに比べて手
間や間違いが少くなるという効果がある。As described above, according to this embodiment, a program in which cache misses frequently occur due to cache line competition can be automatically converted into a program in which cache misses are less likely to occur. As a result, the execution performance of the program can be improved without changing the cache device. Further, according to the present embodiment, since such program conversion can be automatically performed by the computer,
This has the effect of reducing the effort and error compared to the programmer manually changing the program.

【００９４】[0094]

【発明の効果】以上のように本発明によれば、ハードウ
ェアの変更なしに、プログラムの繰り返し実行される部
分（ル−プ）でアクセスされる複数のデータについて
も、キャッシュメモリのヒット率を向上することのでき
るプログラムの変換方法を提供することができる。As described above, according to the present invention, the hit rate of the cache memory can be set even for a plurality of data accessed in the repeatedly executed part (loop) of the program without changing the hardware. It is possible to provide a program conversion method that can be improved.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の一実施例に係る構造体化処理の処理手
順を示すフロ−チャ−トである。FIG. 1 is a flow chart showing a processing procedure of a structuring processing according to an embodiment of the present invention.

【図２】本発明の一実施例に係る計算機システムの構成
を示すブロック図である。FIG. 2 is a block diagram showing a configuration of a computer system according to an embodiment of the present invention.

【図３】本発明の一実施例に係るコンパイル処理の処理
手順を示すフロ−チャ−トである。FIG. 3 is a flowchart showing a processing procedure of a compile processing according to an embodiment of the present invention.

【図４】ソ−スプログラムの例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of a source program.

【図５】語彙解析結果の例を示す説明図である。FIG. 5 is an explanatory diagram showing an example of a vocabulary analysis result.

【図６】シンボルテ−ブルの例を示す説明図である。FIG. 6 is an explanatory diagram showing an example of a symbol table.

【図７】中間語の例を示す説明図である。FIG. 7 is an explanatory diagram showing an example of an intermediate language.

【図８】本発明の一実施例に係る語彙要素算所テ−ブル
の構成を示す説明図である。FIG. 8 is an explanatory diagram showing the structure of a vocabulary element place table according to an embodiment of the present invention.

【図９】本発明の一実施例に係る構造体化処理後のシン
ボルテ−ブルを示す説明図である。FIG. 9 is an explanatory diagram showing a symbol table after structuring processing according to an embodiment of the present invention.

【図１０】本発明の一実施例に係る構造体化処理による
中間語の変更のようすを示す説明図である。FIG. 10 is an explanatory diagram showing how an intermediate language is changed by the structuring processing according to the embodiment of the present invention.

【図１１】本発明の一実施例に係る構造体化処理による
変換後のプログラムをソ−スプログラムレベルで示した
説明図である。FIG. 11 is an explanatory diagram showing a program after conversion by a structuring process according to an embodiment of the present invention at a source program level.

【図１２】キャッシュメモリの構成例を示した説明図で
ある。FIG. 12 is an explanatory diagram showing a configuration example of a cache memory.

【図１３】本発明の一実施例に係る配列要素参照の類判
定を行う処理の処理手順を示すフロ−チャ−トである。FIG. 13 is a flowchart showing a processing procedure of processing for making a type determination of array element reference according to an embodiment of the present invention.

【図１４】本発明の一実施例に係る構造体化された配列
の主メモリ上の配置を示した説明図である。FIG. 14 is an explanatory diagram showing an arrangement on a main memory of a structured array according to an embodiment of the present invention.

【図１５】従来の技術に係る配列の主メモリ上の配置を
示した説明図である。FIG. 15 is an explanatory diagram showing an arrangement on a main memory of an array according to a conventional technique.

[Explanation of symbols]

１０１…ループ構造の認識１０２…ループ内配列参照の解析１０３…配列群の類別１０４…配列要素の構造体化１０５…配列要素参照の構造体メンバ参照への変更 101 ... Recognition of loop structure 102 ... Analysis of in-loop array reference 103 ... Classification of array group 104 ... Structuring of array element 105 ... Change of array element reference to structure member reference

Claims

[Claims]

1. A method for converting a program, which comprises a step of determining a part describing a looping process of the program, and a reference in the part describing the determined looping process. A plurality of arranged arrays, the types of the arrays are the same, and the steps of grouping the arrays having the same position in the array elements of the arrays referred to in each processing of the loop processing into the same category, , A description defining multiple arrays grouped into the same category,
The step of converting into a description defining one array composed of the plurality of arrays, and the description referring to the elements of the plurality of arrays grouped in the same category in the loop processing part, by the plurality of arrays. And a step of converting a description that refers to a corresponding element of one configured array, to a program conversion method.

2. The program conversion method according to claim 1, wherein the step of grouping the plurality of arrays includes a plurality of arrays described to be referred to in a part describing the determined looping process. Among them, the types of the arrays are the same, and the positions of the elements of the arrays referred to in each processing of the loop processing are the same in the arrays, and further, the contents of the description that refers to the array However, both of the arrays that are descriptions that change the contents of the array are grouped in the same class,
A method of converting a program, characterized in that the contents of the description referring to the array are the steps of grouping the arrays which are descriptions that do not change the content of the array.

3. The program conversion method according to claim 1, wherein in the step of classifying, the position of an element referred to in each process of the loop process in the description that refers to the array. When the contents of the subscript expressions that specify are equal, the positions of the elements of the array referred to in the processing of each time in the array are the same array, and the program conversion method.

4. A method of compiling a program for generating an object code sequence to be executed on a computer having a main memory and a cache memory from a program, the portion describing a looping process of the program. Among the plurality of arrays that are described to be referred to in the part that describes the determining step and the determined looping processing, the array types are the same, and the processing of each time of the loop processing is performed. The steps of grouping arrays that have the same position in the array of the elements of the array referred to in, and the description that defines multiple arrays grouped in the same group of the program are configured by the multiple arrays. The step of converting into a description defining one array, and the description referring to the elements of a plurality of arrays grouped in the same category in the loop processing part, A step of converting a description that refers to a corresponding element of one array configured by an array of numbers, and a plurality of arrays that form the one array have the same position in the array, according to the converted program. An object code in which an element includes an object code for arranging one array composed of the plurality of arrays so as to be sequentially arranged at consecutive positions on a main memory of a computer that executes the program when the program is executed And a step of generating a sequence.

5. A method of compiling a program for generating an object code to be executed on a computer having a main memory and a cache memory from a program, and determining a part describing a looping process of the program. In the part that describes the looping process and the step that is determined, among the multiple arrays that are described to be referenced, the array types are the same, and in each processing of the loop processing The step of grouping arrays that are referred to and having the same position in the array within the same group, and the description that refers to the elements of a plurality of arrays grouped in the same group within the loop processing part From the step of converting the description that refers to the corresponding element of one array configured by the array of
Specifies that the elements of each array that have the same position in an array of multiple arrays that are grouped into the same category of the program are sequentially arranged at consecutive positions in the main memory of the computer that executes the program when the program is executed. And a step of generating an object code string including object code.

6. A storage device storing a source program,
A computer system comprising a processing device for converting the source program, wherein the processing device describes means for determining a portion describing a looping process of the program, and the determined looping process. Position of the elements of the array in the array that are of the same array type and are referenced in each processing of the loop processing among the plurality of arrays described to be referenced. Means for assigning the same category to the same sequences, and a description defining a plurality of arrays to which the same category of the program is assigned is configured by the plurality of arrays.
A means for changing to a description that defines one array and a description that refers to elements of a plurality of arrays that are grouped in the same category in the portion that describes the loop processing are configured by the plurality of arrays. And a means for changing the description to refer to the corresponding element of one array.

7. The computer system according to claim 6, wherein the processor further includes, from the program whose description has been converted, each element having the same position in the array of a plurality of arrays forming the one array. When the program is executed, an object code sequence including an object code for arranging one array composed of the plurality of arrays so as to be sequentially arranged at consecutive positions in the main memory of the computer that executes the program A computer system having means for generating.