JP2021009443A

JP2021009443A - Information processing apparatus and compiler program

Info

Publication number: JP2021009443A
Application number: JP2019121513A
Authority: JP
Inventors: 恭伸谷村; Yasunobu Tanimura; 山中　栄次; Eiji Yamanaka; 栄次山中; 俊鎌塚; Shun Kamatsuka
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2019-06-28
Filing date: 2019-06-28
Publication date: 2021-01-28
Anticipated expiration: 2039-06-28
Also published as: JP7239827B2

Abstract

To provide an information processing apparatus and a compiler program that enable high-speed division of loops included in a source code.SOLUTION: Dependency information indicating a dependency relationship between instructions included in a loop of an intermediate language generated from a source code is generated, a first matrix is generated by converting the generated dependency information into a matrix, the instructions included in the loop are assigned to a plurality of groups based on a degree of dependency among the instructions calculated from the generated first matrix, and the loop is divided for each assigned group.SELECTED DRAWING: Figure 19

Description

本発明は、情報処理装置及びコンパイラプログラムに関する。 The present invention relates to an information processing device and a compiler program.

例えば、ＨＰＣ（ＨｉｇｈＰｅｒｆｏｒｍａｎｃｅＣｏｎｐｕｔｉｎｇ）等に用いられるコンパイラプログラム（以下、単にコンパイラとも呼ぶ）は、ソースコードのコンパイルを行う際に、処理速度等の処理性能を向上させるための最適化処理を行う。具体的に、コンパイラは、例えば、ソースコードに含まれるループ（以下、分割対象のループとも呼ぶ）を複数のループに分割するループ分割を行う。 For example, a compiler program (hereinafter, also simply referred to as a compiler) used for HPC (High Performance Computing) or the like performs optimization processing for improving processing performance such as processing speed when compiling source code. Specifically, the compiler performs loop splitting, for example, to split a loop included in the source code (hereinafter, also referred to as a loop to be split) into a plurality of loops.

これにより、コンパイラは、例えば、ハード資源不足等に起因する最適化阻害要因の発生を抑制することが可能になる。また、コンパイラは、例えば、キャッシュ効率の低下を抑制することが可能になる（例えば、特許文献１及び２参照）。 This makes it possible for the compiler to suppress the occurrence of optimization-inhibiting factors caused by, for example, lack of hardware resources. Further, the compiler can suppress a decrease in cache efficiency, for example (see, for example, Patent Documents 1 and 2).

特開２００２−１２３５６３号公報JP-A-2002-123563 特開２００９−１０４４２２号公報JP-A-2009-104422

ここで、上記のようなコンパイラは、例えば、分割対象のループに含まれる各命令間の依存関係をそれぞれ解析し、その解析結果に基づいて各命令を複数の分割ループにそれぞれ振り分けることにより、分析対象のループの分割を行う。 Here, a compiler as described above analyzes, for example, the dependency between each instruction included in the loop to be divided, and distributes each instruction to a plurality of divided loops based on the analysis result. Split the target loop.

しかしながら、依存関係を解析する必要がある命令の組合せ数は、分割対象のループに含まれる命令の数に従って多くなる。そのため、分割対象のループに含まれる命令の数が膨大である場合、コンパイラは、ループ分割に多くの時間を要することになり、ソースコードのコンパイルを効率的に行うことができなくなる。 However, the number of instruction combinations for which dependency analysis needs to be analyzed increases according to the number of instructions contained in the loop to be divided. Therefore, if the number of instructions contained in the loop to be split is enormous, the compiler will take a lot of time to split the loop, and the source code cannot be compiled efficiently.

そこで、一つの側面では、本発明は、ソースコードに含まれるループの分割を高速に行うことを可能とする情報処理装置及びコンパイラプログラムを提供することを目的とする。 Therefore, in one aspect, it is an object of the present invention to provide an information processing apparatus and a compiler program capable of dividing a loop included in a source code at high speed.

実施の形態の一態様では、ソースコードから生成した中間言語のループに含まれる各命令間の依存関係を示す依存情報を生成し、生成した前記依存情報を行列に変換することによって第１行列を生成する情報生成部と、生成した前記第１行列から算出した各命令間における依存度合いに基づいて、前記ループに含まれる各命令を複数のグループに振り分けるグループ振分部と、振り分けた前記複数のグループごとに、前記ループの分割を行うループ分割部と、を有する。 In one aspect of the embodiment, the first matrix is formed by generating dependency information indicating the dependency between each instruction included in the loop of the intermediate language generated from the source code and converting the generated dependency information into a matrix. A group distribution unit that distributes each instruction included in the loop to a plurality of groups based on the degree of dependence between the information generation unit to be generated and each instruction calculated from the generated first matrix, and the plurality of distribution units. Each group has a loop dividing portion for dividing the loop.

一つの側面によれば、ソースコードに含まれるループの分割を高速に行うことを可能とする。 According to one aspect, it is possible to divide the loop included in the source code at high speed.

図１は、情報処理システム１０の構成について説明する図である。FIG. 1 is a diagram illustrating a configuration of the information processing system 10. 図２は、情報処理装置１が行うコンパイル処理を説明するフローチャートである。FIG. 2 is a flowchart illustrating a compilation process performed by the information processing apparatus 1. 図３は、情報処理装置１のハードウエア構成を説明する図である。FIG. 3 is a diagram illustrating a hardware configuration of the information processing device 1. 図４は、情報処理装置１の機能のブロック図である。FIG. 4 is a block diagram of the function of the information processing device 1. 図５は、Ｓ２の処理の概略について説明するフローチャートである。FIG. 5 is a flowchart illustrating an outline of the process of S2. 図６は、Ｓ２の処理の詳細を説明するフローチャート図である。FIG. 6 is a flowchart illustrating the details of the process of S2. 図７は、Ｓ２の処理の詳細を説明するフローチャート図である。FIG. 7 is a flowchart illustrating the details of the process of S2. 図８は、Ｓ２の処理の詳細を説明するフローチャート図である。FIG. 8 is a flowchart illustrating the details of the process of S2. 図９は、中間言語２２の内容を説明する具体例である。FIG. 9 is a specific example for explaining the contents of the intermediate language 22. 図１０は、依存情報１３１の具体例について説明する図である。FIG. 10 is a diagram illustrating a specific example of the dependency information 131. 図１１は、依存グラフ１３１ａの具体例について説明する図である。FIG. 11 is a diagram illustrating a specific example of the dependency graph 131a. 図１２は、第１行列１３２の具体例について説明する図である。FIG. 12 is a diagram illustrating a specific example of the first matrix 132. 図１３は、第２行列１３３の具体例について説明する図である。FIG. 13 is a diagram illustrating a specific example of the second matrix 133. 図１４は、依存情報１３１の具体例について説明する図である。FIG. 14 is a diagram illustrating a specific example of the dependency information 131. 図１５は、第１行列１３２の具体例について説明する図である。FIG. 15 is a diagram illustrating a specific example of the first matrix 132. 図１６は、第２行列１３３の具体例について説明する図である。FIG. 16 is a diagram illustrating a specific example of the second matrix 133. 図１７は、依存情報１３１の具体例について説明する図である。FIG. 17 is a diagram illustrating a specific example of the dependency information 131. 図１８は、第１行列１３２の具体例について説明する図である。FIG. 18 is a diagram illustrating a specific example of the first matrix 132. 図１９は、依存グラフ１３１ａの具体例について説明する図である。FIG. 19 is a diagram illustrating a specific example of the dependency graph 131a. 図２０は、分割ループの内容を説明する具体例である。FIG. 20 is a specific example for explaining the contents of the split loop. 図２１は、第２の実施の形態におけるＳ２の処理を説明するフローチャート図である。FIG. 21 is a flowchart illustrating the process of S2 in the second embodiment. 図２２は、第２の実施の形態におけるＳ２の処理を説明するフローチャート図である。FIG. 22 is a flowchart illustrating the process of S2 in the second embodiment. 図２３は、第２の実施の形態におけるＳ２の処理を説明するフローチャート図である。FIG. 23 is a flowchart illustrating the process of S2 in the second embodiment. 図２４は、依存情報１３１の具体例について説明する図である。FIG. 24 is a diagram illustrating a specific example of the dependency information 131. 図２５は、第１行列１３２の具体例について説明する図である。FIG. 25 is a diagram illustrating a specific example of the first matrix 132. 図２６は、第１行列１３２の具体例について説明する図である。FIG. 26 is a diagram illustrating a specific example of the first matrix 132. 図２７は、依存グラフ１３１ａの具体例について説明する図である。FIG. 27 is a diagram illustrating a specific example of the dependency graph 131a. 図２８は、分割ループの内容を説明する具体例である。FIG. 28 is a specific example for explaining the contents of the split loop.

［情報処理システムの構成］
初めに、情報処理システム１０の構成について説明を行う。図１は、情報処理システム１０の構成について説明する図である。 [Information processing system configuration]
First, the configuration of the information processing system 10 will be described. FIG. 1 is a diagram illustrating a configuration of the information processing system 10.

図１に示すように、情報処理システム１０は、例えば、１台以上の物理マシンからなる情報処理装置１と、情報処理装置１の内部または外部に設けられる記憶部１３０と、操作端末５とを含む。操作端末５は、例えば、ソースコードのコンパイルを行う作業者が使用するＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）であり、ネットワークＮＷを介して情報処理装置１と接続する。 As shown in FIG. 1, the information processing system 10 includes, for example, an information processing device 1 composed of one or more physical machines, a storage unit 130 provided inside or outside the information processing device 1, and an operation terminal 5. Including. The operation terminal 5 is, for example, a PC (Personal Computer) used by a worker who compiles the source code, and is connected to the information processing device 1 via a network NW.

情報処理装置１（情報処理装置１において動作するコンパイラ）は、コンパイルを開始するタイミング（以下、コンパイル開始タイミングとも呼ぶ）になった場合、例えば、記憶部１３０に記憶されたソースコード２１を取得し、取得したソースコード２１のコンパイルを行う処理（以下、コンパイル処理とも呼ぶ）を行うことによって中間言語２２を生成し、さらに、生成した中間言語２２からオブジェクトコード２３を生成する。コンパイル開始タイミングは、例えば、作業者が操作端末５を介してコンパイル処理を開始する旨の指示を行ったタイミングであってよい。 When the information processing device 1 (compiler operating in the information processing device 1) comes to the timing of starting compilation (hereinafter, also referred to as compilation start timing), for example, the source code 21 stored in the storage unit 130 is acquired. , The intermediate language 22 is generated by performing a process of compiling the acquired source code 21 (hereinafter, also referred to as a compilation process), and further, an object code 23 is generated from the generated intermediate language 22. The compilation start timing may be, for example, the timing at which the operator gives an instruction to start the compilation process via the operation terminal 5.

また、情報処理装置１は、オブジェクトコード２３を実行するタイミング（以下、コード実行タイミングとも呼ぶ）になった場合、記憶部１３０に記憶されたオブジェクトコード２３を取得し、コンパイル処理によって生成されたオブジェクトコード２３を実行する処理（以下、コード実行処理とも呼ぶ）を行う。以下、情報処理装置１が行うコンパイル処理及びコード実行処理について説明を行う。 Further, when the timing for executing the object code 23 (hereinafter, also referred to as the code execution timing) is reached, the information processing device 1 acquires the object code 23 stored in the storage unit 130, and the object generated by the compilation process. A process for executing the code 23 (hereinafter, also referred to as a code execution process) is performed. Hereinafter, the compilation process and the code execution process performed by the information processing apparatus 1 will be described.

［情報処理装置によるコンパイル処理］
初めに、情報処理装置１が行うコンパイル処理について説明を行う。図２は、情報処理装置１が行うコンパイル処理を説明するフローチャートである。 [Compilation process by information processing device]
First, the compilation process performed by the information processing apparatus 1 will be described. FIG. 2 is a flowchart illustrating a compilation process performed by the information processing apparatus 1.

情報処理装置１は、図２に示すように、ソースコード２１の字句解析及び構文解析を行うことにより、中間言語２２を生成する（Ｓ１）。そして、情報処理装置１は、例えば、生成した中間言語２２を情報格納領域１３０に記憶する。 As shown in FIG. 2, the information processing apparatus 1 generates the intermediate language 22 by performing lexical analysis and syntactic analysis of the source code 21 (S1). Then, the information processing device 1 stores, for example, the generated intermediate language 22 in the information storage area 130.

その後、情報処理装置１は、Ｓ１の処理において生成された中間言語２２の最適化を行う（Ｓ２）。具体的に、情報処理装置１は、中間言語２２に含まれるループのそれぞれに対して、ループ分割等の処理を行う。 After that, the information processing apparatus 1 optimizes the intermediate language 22 generated in the process of S1 (S2). Specifically, the information processing device 1 performs processing such as loop splitting for each of the loops included in the intermediate language 22.

続いて、情報処理装置１は、例えば、Ｓ１で最適化を行った中間言語２２からオブジェクトコード２３の生成を行う（Ｓ３）。そして、情報処理装置１は、例えば、生成したオブジェクトコード２３を記憶部１３０に記憶する。 Subsequently, the information processing apparatus 1 generates the object code 23 from the intermediate language 22 optimized in S1, for example (S3). Then, the information processing device 1 stores, for example, the generated object code 23 in the storage unit 130.

ここで、図２で説明したＳ２の処理を行う場合、情報処理装置１は、例えば、ループに含まれる各命令間の依存関係をそれぞれ解析し、その解析結果に基づいて各命令を複数の分割ループにそれぞれ振り分けることにより、分析対象のループの分割を行う。 Here, when performing the processing of S2 described with reference to FIG. 2, the information processing apparatus 1 analyzes, for example, the dependency relationship between each instruction included in the loop, and divides each instruction into a plurality of units based on the analysis result. By allocating to each loop, the loop to be analyzed is divided.

しかしながら、依存関係を解析する必要がある命令の組合せ数は、分割対象のループに含まれる命令の数に従って多くなる。そのため、例えば、分割対象のループに含まれる命令の数が膨大である場合、情報処理装置１は、ループ分割に多くの時間を要することになり、ソースコードのコンパイルを効率的に行うことができなくなる。 However, the number of instruction combinations for which dependency analysis needs to be analyzed increases according to the number of instructions contained in the loop to be divided. Therefore, for example, when the number of instructions included in the loop to be divided is enormous, the information processing apparatus 1 requires a lot of time for loop division, and the source code can be compiled efficiently. It disappears.

そこで、本実施の形態における情報処理装置１は、ソースコード２１から生成した中間言語２２のループに含まれる各命令間の依存関係を示す情報（以下、依存情報とも呼ぶ）を生成し、生成した依存情報を変換することによって行列（以下、第１行列とも呼ぶ）を生成する。 Therefore, the information processing device 1 in the present embodiment generates and generates information (hereinafter, also referred to as dependency information) indicating the dependency relationship between each instruction included in the loop of the intermediate language 22 generated from the source code 21. A matrix (hereinafter, also referred to as a first matrix) is generated by transforming the dependency information.

そして、情報処理装置１は、生成した第１行列から算出した各命令間における依存度合いに基づいて、ループに含まれる各命令を複数のグループに振り分け、振り分けたグループごとにループの分割を行う。 Then, the information processing apparatus 1 distributes each instruction included in the loop into a plurality of groups based on the degree of dependence between the instructions calculated from the generated first matrix, and divides the loop into each of the distributed groups.

すなわち、本実施の形態における情報処理装置１は、依存情報から生成された第１行列を用いる演算を行うことで、分割対象のループに含まれる命令の組合せごとの依存関係の解析等を行うことなく、各命令の振り分け先を決定する。 That is, the information processing apparatus 1 in the present embodiment analyzes the dependency relationship for each combination of instructions included in the loop to be divided by performing an operation using the first matrix generated from the dependency information. Instead, determine the distribution destination of each command.

これにより、情報処理装置１は、ループ分割を効率的に行うことが可能になり、ソースコード２１のコンパイルに要する時間を短縮することが可能になる。 As a result, the information processing apparatus 1 can efficiently perform loop splitting, and the time required for compiling the source code 21 can be shortened.

また、情報処理装置１は、分割対象のループに含まれる各命令の振り分け先を計算によって決定することで、異なる分割ループに含まれる命令間の依存関係が最も疎になるループ分割の方法を特定することが可能になる。そのため、情報処理装置１は、特定した方法に従ってループの分割を行うことにより、ソースコード２１から生成されるオブジェクトコード２３の実行時間についても短縮させることが可能になる。 Further, the information processing device 1 specifies a loop splitting method in which the dependency between the instructions included in the different split loops is the sparsest by determining the distribution destination of each instruction included in the loop to be split by calculation. It becomes possible to do. Therefore, the information processing apparatus 1 can also shorten the execution time of the object code 23 generated from the source code 21 by dividing the loop according to the specified method.

［情報処理システムのハードウエア構成］
次に、情報処理システム１０のハードウエア構成について説明する。図３は、情報処理装置１のハードウエア構成を説明する図である。 [Hardware configuration of information processing system]
Next, the hardware configuration of the information processing system 10 will be described. FIG. 3 is a diagram illustrating a hardware configuration of the information processing device 1.

情報処理装置１は、図３に示すように、プロセッサであるＣＰＵ１０１と、メモリ１０２と、外部インターフェース（Ｉ／Ｏユニット）１０３と、記憶媒体１０４とを有する。各部は、バス１０５を介して互いに接続される。 As shown in FIG. 3, the information processing device 1 includes a CPU 101 which is a processor, a memory 102, an external interface (I / O unit) 103, and a storage medium 104. The parts are connected to each other via the bus 105.

記憶媒体１０４は、例えば、コンパイル処理を行うためのプログラム１１０（コンパイラ）を記憶するプログラム格納領域（図示しない）を有する。また、記憶媒体１０４は、例えば、コンパイル処理を行う際に用いられる情報を記憶する記憶部１３０（以下、情報格納領域１３０とも呼ぶ）を有する。なお、記憶媒体１０４は、例えば、ＨＤＤ（ＨａｒｄＤｉｓｋＤｒｉｖｅ）やＳＳＤ（ＳｏｋｉｄＳｔａｔｅＤｒｉｖｅ）であってよい。 The storage medium 104 has, for example, a program storage area (not shown) for storing a program 110 (compiler) for performing a compilation process. Further, the storage medium 104 has, for example, a storage unit 130 (hereinafter, also referred to as an information storage area 130) for storing information used when performing a compilation process. The storage medium 104 may be, for example, an HDD (Hard Disk Drive) or an SSD (Sock State Drive).

ＣＰＵ１０１は、記憶媒体１０４からメモリ１０２にロードされたプログラム１１０を実行してコンパイル処理を行う。 The CPU 101 executes the program 110 loaded from the storage medium 104 into the memory 102 to perform a compilation process.

また、外部インターフェース１０３は、例えば、操作端末５と通信を行う。 Further, the external interface 103 communicates with, for example, the operation terminal 5.

［情報処理システムの機能］
次に、情報処理システム１０の機能について説明を行う。図４は、情報処理装置１の機能のブロック図である。 [Information processing system functions]
Next, the function of the information processing system 10 will be described. FIG. 4 is a block diagram of the function of the information processing device 1.

情報処理装置１は、図４に示すように、例えば、ＣＰＵ１０１やメモリ１０２等のハードウエアとプログラム１１０とが有機的に協働することにより、分割判定部１１１と、情報生成部１１２と、情報管理部１１３と、グループ振分部１１４と、ループ分割部１１５と、近接判定部１１６とを含む各種機能を実現する。 As shown in FIG. 4, the information processing device 1 organically cooperates with hardware such as the CPU 101 and the memory 102 and the program 110 to form a division determination unit 111, an information generation unit 112, and information. Various functions including the management unit 113, the group distribution unit 114, the loop splitting unit 115, and the proximity determination unit 116 are realized.

また、情報処理装置１は、例えば、図４に示すように、依存情報１３１と、第１行列１３２と、第２行列１３３とを情報格納領域１３０に記憶する。 Further, the information processing apparatus 1 stores, for example, the dependency information 131, the first matrix 132, and the second matrix 133 in the information storage area 130, as shown in FIG.

分割判定部１１１は、例えば、ソースコード２１から生成した中間言語２２に含まれる各ループについてループ分割を行うか否かを判定する。具体的に、分割判定部１１１は、例えば、実行時に用いられるレジスタ数やストリーム数がＣＰＵ１０１のレジスタ数やストリーム数を超えるループが中間言語２２に含まれている場合、そのループについてループ分割を行う旨の判定を行う。 The division determination unit 111 determines, for example, whether or not to perform loop division for each loop included in the intermediate language 22 generated from the source code 21. Specifically, for example, when the intermediate language 22 contains a loop in which the number of registers and the number of streams used at the time of execution exceeds the number of registers and the number of streams of the CPU 101, the division determination unit 111 performs loop division for the loop. Make a judgment to that effect.

情報生成部１１２は、分割対象のループに含まれる各命令間の依存関係を示す依存情報１３１を生成する。具体的に、情報生成部１１２は、分割判定部１１１がループ分割を行う旨の判定を行ったループが存在する場合、そのループに対応する依存情報１３１を生成する。また、情報生成部１１２は、生成した依存情報１３１を行列に変換することによって第１行列１３２を生成する。 The information generation unit 112 generates dependency information 131 indicating the dependency relationship between each instruction included in the loop to be divided. Specifically, the information generation unit 112 generates the dependency information 131 corresponding to the loop when the division determination unit 111 determines that the loop split is performed. Further, the information generation unit 112 generates the first matrix 132 by converting the generated dependency information 131 into a matrix.

情報管理部１１３は、例えば、情報生成部１１２が生成した依存情報１３１や第１行列１３２を情報格納領域１３０に記憶する。 The information management unit 113 stores, for example, the dependency information 131 and the first matrix 132 generated by the information generation unit 112 in the information storage area 130.

グループ振分部１１４は、情報生成部１１２が生成した第１行列１３２から算出した各命令間における依存度合いに基づいて、分割対象のループに含まれる各命令を複数のグループに振り分ける。 The group distribution unit 114 distributes each instruction included in the loop to be divided into a plurality of groups based on the degree of dependence between the instructions calculated from the first matrix 132 generated by the information generation unit 112.

ループ分割部１１５は、グループ振分部１１４が振り分けたグループごとに、分割対象のループについてのループ分割を行う。 The loop splitting unit 115 performs loop splitting on the loop to be divided for each group distributed by the group sorting unit 114.

近接判定部１１６は、例えば、メモリ１０２内において近接するアドレスに格納された各データに対してアクセスを行う複数の命令が分割対象のループに含まれているか否かを判定する。 The proximity determination unit 116 determines, for example, whether or not a plurality of instructions for accessing each data stored at adjacent addresses in the memory 102 are included in the loop to be divided.

そして、例えば、メモリ１０２内において近接するアドレスに格納された各データに対してアクセスを行う複数の命令が分割対象のループに含まれていると近接判定部１１６が判定した場合、情報生成部１１２は、分割対象のループに含まれる各命令間の依存関係と、各命令がアクセスするデータのメモリ１０２内における位置関係とに対応する依存情報１３１を生成する。第２行列１３３の説明については後述する。 Then, for example, when the proximity determination unit 116 determines that a plurality of instructions for accessing each data stored at adjacent addresses in the memory 102 are included in the loop to be divided, the information generation unit 112 Generates dependency information 131 corresponding to the dependency relationship between each instruction included in the loop to be divided and the positional relationship in the memory 102 of the data accessed by each instruction. The explanation of the second matrix 133 will be described later.

［第１の実施の形態の概略］
次に、第１の実施の形態の概略について説明する。具体的に、図２で説明したＳ２の処理の概略について説明する。図５は、Ｓ２の処理の概略について説明するフローチャートである。 [Outline of the first embodiment]
Next, the outline of the first embodiment will be described. Specifically, the outline of the process of S2 described with reference to FIG. 2 will be described. FIG. 5 is a flowchart illustrating an outline of the process of S2.

情報処理装置１の情報生成部１１２は、ソースコード２１から生成した中間言語２２のループに含まれる各命令間の依存関係を示す依存情報１３１を生成する（Ｓ１１）。 The information generation unit 112 of the information processing device 1 generates dependency information 131 indicating the dependency relationship between each instruction included in the loop of the intermediate language 22 generated from the source code 21 (S11).

そして、情報生成部１１２は、Ｓ１１の処理で生成した依存情報１３１を行列に変換することによって第１行列１３２を生成する（Ｓ１２）。 Then, the information generation unit 112 generates the first matrix 132 by converting the dependency information 131 generated in the process of S11 into a matrix (S12).

続いて、情報処理装置１のグループ振分部１１４は、Ｓ１２の処理で生成した第１行列１３２から算出した各命令間の依存度合いに基づいて、分割対象のループに含まれる各命令を複数のグループに振り分ける（Ｓ１３）。 Subsequently, the group distribution unit 114 of the information processing apparatus 1 sets a plurality of instructions included in the loop to be divided based on the degree of dependence between the instructions calculated from the first matrix 132 generated in the process of S12. Allocate to groups (S13).

その後、情報処理装置のループ分割部１１５は、Ｓ１３の処理で振り分けたグループごとに、分割対象のループの分割を行う（Ｓ１４）。 After that, the loop splitting unit 115 of the information processing apparatus divides the loop to be split for each group sorted in the process of S13 (S14).

［第１の実施の形態の詳細］
次に、第１の実施の形態の詳細について説明する。図６から図９は、図２で説明したＳ２の処理の詳細を説明するフローチャート図である。また、図９から図２０は、Ｓ２の処理の詳細を説明する図である。 [Details of the first embodiment]
Next, the details of the first embodiment will be described. 6 to 9 are flowcharts illustrating the details of the process of S2 described with reference to FIG. 9 to 20 are views for explaining the details of the process of S2.

情報処理装置１の分割判定部１１１は、図６に示すように、情報格納領域１３０に記憶された中間言語２２に、分割対象のループが含まれているか否かを判定する（Ｓ２１）。以下、中間言語２２の具体例について説明を行う。 As shown in FIG. 6, the division determination unit 111 of the information processing device 1 determines whether or not the intermediate language 22 stored in the information storage area 130 includes a loop to be divided (S21). Hereinafter, specific examples of the intermediate language 22 will be described.

［中間言語の具体例］
図９は、中間言語２２の内容を説明する具体例である。具体的に、図９は、ソースコード２１をアセンブラ命令相当で表現した場合の中間言語２２を説明する具体例である。 [Specific examples of intermediate languages]
FIG. 9 is a specific example for explaining the contents of the intermediate language 22. Specifically, FIG. 9 is a specific example for explaining the intermediate language 22 when the source code 21 is represented by an assembler instruction.

図９に示す中間言語２２は、変数ｒｅｇ＿ｉに初期値として１を設定することを示す命令である「ｍｏｖｒｅｇ＿ｉ，１」と、ループの開始位置を示すラベルである「ＬＡＢＥＬ１：」とを含む。 The intermediate language 22 shown in FIG. 9 includes "mov reg_i, 1" which is an instruction indicating that the variable reg_i is set to 1 as an initial value, and "LABEL1:" which is a label indicating the start position of the loop.

また、図９に示す中間言語２２は、配列Ａのｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ１に設定することを示す命令である「ｌｏａｄｒｅｇ１，ｍｅｍ“Ａ（ｒｅｇ＿ｉ）”」と、配列Ｂのｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ２に設定することを示す命令である「ｌｏａｄｒｅｇ２，ｍｅｍ“Ｂ（ｒｅｇ＿ｉ）”」とを含む。 Further, the intermediate language 22 shown in FIG. 9 is an instruction indicating that the value stored in the reg_ith position of the array A is set in the variable reg1, which is an instruction "load reg1, mem" A (reg_i) "" and the array B. Includes "load reg2, mem" B (reg_i) "" which is an instruction indicating that the value stored in the reg_i th position of is set in the variable reg2.

また、図９に示す中間言語２２は、変数ｒｅｇ１に設定されている値と変数ｒｅｇ２に設定されている値とを加算することによって算出した値を、変数ｒｅｇ３に設定することを示す命令である「ａｄｄｒｅｇ３，ｒｅｇ１，ｒｅｇ２」と、配列Ｃのｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ４に設定することを示す命令である「ｌｏａｄｒｅｇ４，ｍｅｍ“Ｃ（ｒｅｇ＿ｉ）”」とを含む。 Further, the intermediate language 22 shown in FIG. 9 is an instruction indicating that the value calculated by adding the value set in the variable reg1 and the value set in the variable reg2 is set in the variable reg3. It includes "add reg3, reg1, reg2" and "load reg4, mem" C (reg_i) "" which is an instruction indicating that the value stored in the reg_ith position of the array C is set in the variable reg4.

また、図９に示す中間言語２２は、配列Ｄのｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ５に設定することを示す命令である「ｌｏａｄｒｅｇ５，ｍｅｍ“Ｄ（ｒｅｇ＿ｉ）”」と、配列Ｃのｒｅｇ＿ｉ＋１０番目に格納されているデータを変数ｒｅｇ６に設定することを示す命令である「ｌｏａｄｒｅｇ６，ｍｅｍ“Ｃ（ｒｅｇ＿ｉ＋１０）”」とを含む。 Further, the intermediate language 22 shown in FIG. 9 is an instruction indicating that the value stored in the reg_i position of the array D is set in the variable reg5, which is an instruction "load reg5, mem" D (reg_i) "" and the array C Includes "load reg6, mem" C (reg_i + 10) "" which is an instruction indicating that the data stored in the reg_i + 10th position of the above is set in the variable reg6.

また、配列Ｅのｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ７に設定することを示す命令である「ｌｏａｄｒｅｇ７，ｍｅｍ“Ｅ（ｒｅｇ＿ｉ）”」と、変数ｒｅｇ３に設定されている値と変数ｒｅｇ４に設定されている値とを加算することによって算出した値を、変数ｒｅｇ８に設定することを示す命令である「ａｄｄｒｅｇ８，ｒｅｇ３，ｒｅｇ４」とを含む。 In addition, "load reg7, mem" E (reg_i) "", which is an instruction indicating that the value stored in the reg_ith position of the array E is set in the variable reg7, and the value set in the variable reg3 and the variable reg4. Includes "add reg8, reg3, reg4" which is an instruction indicating that the value calculated by adding the value set in is set in the variable reg8.

また、図９に示す中間言語２２は、変数ｒｅｇ４に設定されている値と変数ｒｅｇ８に設定されている値とを加算することによって算出した値を、変数ｒｅｇ９に設定することを示す命令である「ａｄｄｒｅｇ９，ｒｅｇ４，ｒｅｇ８」と、変数ｒｅｇ３に設定されている値と変数ｒｅｇ５に設定されている値とを乗算することによって算出した値に、変数ｒｅｇ９に設定されている値を加算することによって算出した値を、変数ｒｅｇ１０に設定に設定することを示す命令である「ｍａｄｄｒｅｇ１０，ｒｅｇ３，ｒｅｇ５，ｒｅｇ９」とを含む。 Further, the intermediate language 22 shown in FIG. 9 is an instruction indicating that the value calculated by adding the value set in the variable reg4 and the value set in the variable reg8 is set in the variable reg9. Adding the value set in the variable reg9 to the value calculated by multiplying "add reg9, reg4, reg8" by the value set in the variable reg3 and the value set in the variable reg5. Includes "madd reg10, reg3, reg5, reg9" which is an instruction indicating that the value calculated by the above is set in the variable reg10.

また、図９に示す中間言語２２は、変数ｒｅｇ５に設定されている値と変数ｒｅｇ６に設定されている値とを乗算することによって算出した値に、変数ｒｅｇ１０に設定されている値を加算することによって算出した値を、変数ｒｅｇ１１に設定することを示す命令である「ｍａｄｄｒｅｇ１１，ｒｅｇ５，ｒｅｇ６，ｒｅｇ１０」と、変数ｒｅｇ７に設定されている値と変数ｒｅｇ１１に設定されている値とを加算することによって算出した値を、変数ｒｅｇ１２に設定することを示す命令である「ａｄｄｒｅｇ１２，ｒｅｇ７，ｒｅｇ１１」とを含む。 Further, in the intermediate language 22 shown in FIG. 9, the value set in the variable reg10 is added to the value calculated by multiplying the value set in the variable reg5 and the value set in the variable reg6. Add the value set in the variable reg7 and the value set in the variable reg11 to the command "madd reg11, reg5, reg6, reg10" indicating that the value calculated by this is set in the variable reg11. It includes "add reg12, reg7, reg11" which is an instruction indicating that the value calculated by the above is set in the variable reg12.

また、図９に示す中間言語２２は、変数ｒｅｇ１２に設定されている値を配列Ｆのｒｅｇ＿ｉ番目に格納することを示す命令である「ｓｔｏｒｅｍｅｍ“Ｆ（ｒｅｇ＿ｉ）”，ｒｅｇ１２」と、変数ｒｅｇ＿ｉに設定されている値に１を加算することによって算出した値を、変数ｒｅｇ＿ｉに設定することを示す命令である「ａｄｄｒｅｇ＿ｉ，ｒｅｇ＿ｉ，１」とを含む。 Further, the intermediate language 22 shown in FIG. 9 is an instruction indicating that the value set in the variable reg12 is stored in the reg_ith position of the array F, which is an instruction "store mem" F (reg_i) ", reg12" and the variable reg_i. It includes "add reg_i, reg_i, 1" which is an instruction indicating that the value calculated by adding 1 to the value set in is set in the variable reg_i.

さらに、図９に示す中間言語２２は、変数ｒｅｇ＿ｉに設定されている値と１００とを比較することを示す命令である「ｃｍｐｒｅｇ＿ｉ，１００」と、変数ｒｅｇ＿ｉに設定されている値が１００以下である場合、ループの開始位置を示す「ＬＡＢＥＬ１：」に分岐し、変数ｒｅｇ＿ｉに設定されている値が１００を上回る場合、ループを終了することを示す命令である「ｂｌｅｉｃｃ，ＬＡＢＡＬ１」とを含む。 Further, the intermediate language 22 shown in FIG. 9 has "cmp reg_i, 100" which is an instruction indicating to compare the value set in the variable reg_i with 100, and the value set in the variable reg_i is 100 or less. In the case of, it branches to "LABEL1:" indicating the start position of the loop, and when the value set in the variable reg_i exceeds 100, it is an instruction indicating the end of the loop "bleicc, LABAL1". Including.

なお、以下、図９に示す中間言語２２において各命令の左端に記載されている番号を、各命令の命令番号とも呼ぶ。また、以下、命令番号が「１」から「１３」である命令のそれぞれを命令１から命令１３とも呼ぶ。 Hereinafter, the number described at the left end of each instruction in the intermediate language 22 shown in FIG. 9 is also referred to as an instruction number of each instruction. Further, hereinafter, each of the instructions whose instruction numbers are "1" to "13" will also be referred to as instruction 1 to instruction 13.

図６に戻り、情報格納領域１３０に記憶された中間言語２２に、分割対象のループが含まれている場合（Ｓ２２のＹＥＳ）、情報処理装置１の情報生成部１１２は、Ｓ２１の処理で分割対象のループであると判定したループに含まれる各命令間の依存関係を示す依存情報１３１を生成する（Ｓ２３）。以下、依存情報１３１の具体例について説明を行う。 Returning to FIG. 6, when the intermediate language 22 stored in the information storage area 130 includes a loop to be divided (YES in S22), the information generation unit 112 of the information processing apparatus 1 is divided by the process of S21. Dependency information 131 indicating the dependency relationship between each instruction included in the loop determined to be the target loop is generated (S23). Hereinafter, a specific example of the dependency information 131 will be described.

［依存情報の具体例］
図１０は、依存情報１３１の具体例について説明する図である。 [Specific example of dependency information]
FIG. 10 is a diagram illustrating a specific example of the dependency information 131.

図１０に示す依存情報１３１は、中間言語２２に含まれる各命令の命令番号が記憶される「命令番号」と、各命令と依存関係にある命令の命令番号についてのリストである「データ依存リスト」と、各命令が含まれるグループ（分割ループ）の名称が記憶される「グループ名」とを項目として有する。 The dependency information 131 shown in FIG. 10 is a "data dependency list" which is a list of "instruction numbers" in which instruction numbers of each instruction included in the intermediate language 22 are stored and instruction numbers of instructions having a dependency relationship with each instruction. , And a "group name" in which the name of the group (split loop) including each instruction is stored is included as an item.

具体的に、図９で説明した中間言語２２において、命令１で値が設定された変数ｒｅｇ１は、命令３のみにおいて参照されている。そのため、情報生成部１１２は、例えば、図１０に示すように、「命令番号」が「１」である情報の「データ依存リスト」に「３」を記憶する。 Specifically, in the intermediate language 22 described with reference to FIG. 9, the variable reg1 whose value is set by the instruction 1 is referred to only by the instruction 3. Therefore, for example, as shown in FIG. 10, the information generation unit 112 stores "3" in the "data dependency list" of the information whose "instruction number" is "1".

また、図９で説明した中間言語２２において、命令２で値が設定された変数ｒｅｇ２は、命令３のみにおいて参照されている。そのため、情報生成部１１２は、例えば、図１０に示すように、「命令番号」が「２」である情報の「データ依存リスト」に「３」を記憶する。 Further, in the intermediate language 22 described with reference to FIG. 9, the variable reg2 whose value is set by the instruction 2 is referred to only by the instruction 3. Therefore, for example, as shown in FIG. 10, the information generation unit 112 stores "3" in the "data dependency list" of the information whose "instruction number" is "2".

さらに、図９で説明した中間言語２２において、命令３は、命令１で値が設定された変数ｒｅｇ１と命令２で値が設定された変数ｒｅｇ２とを参照している。また、図９で説明した中間言語２２において、命令３で値が設定された変数ｒｅｇ３は、命令８及び命令１０において参照されている。そのため、情報生成部１１２は、例えば、図１０に示すように、「命令番号」が「３」である情報の「データ依存リスト」に「１」、「２」、「８」及び「１０」を記憶する。 Further, in the intermediate language 22 described with reference to FIG. 9, the instruction 3 refers to the variable reg1 whose value is set by the instruction 1 and the variable reg2 whose value is set by the instruction 2. Further, in the intermediate language 22 described with reference to FIG. 9, the variable reg3 whose value is set by the instruction 3 is referred to by the instruction 8 and the instruction 10. Therefore, for example, as shown in FIG. 10, the information generation unit 112 adds "1", "2", "8", and "10" to the "data dependency list" of the information whose "instruction number" is "3". Remember.

また、情報生成部１１２は、例えば、図１０に示すように、各命令に対応する「グループ名」の初期値として各命令の命令番号を記憶する。図１０に含まれる他の情報についての説明は省略する。 Further, the information generation unit 112 stores, for example, the instruction number of each instruction as the initial value of the "group name" corresponding to each instruction, as shown in FIG. The description of other information included in FIG. 10 will be omitted.

なお、情報生成部１１２は、Ｓ２３の処理において、依存情報１３１に含まれる内容に対応する依存グラフ１３１ａを生成するものであってもよい。以下、依存グラフ１３１ａの具体例について説明を行う。 The information generation unit 112 may generate the dependency graph 131a corresponding to the content included in the dependency information 131 in the process of S23. Hereinafter, a specific example of the dependency graph 131a will be described.

［依存グラフの具体例（１）］
図１１は、依存グラフ１３１ａの具体例について説明する図である。以下、双方向の関係を示すエッジを双方向エッジとも呼び、単方向の関係を示すエッジを単方向エッジとも呼ぶ。 [Specific example of dependency graph (1)]
FIG. 11 is a diagram illustrating a specific example of the dependency graph 131a. Hereinafter, an edge showing a bidirectional relationship is also referred to as a bidirectional edge, and an edge showing a unidirectional relationship is also referred to as a unidirectional edge.

具体的に、図１１に示す依存グラフ１３１ａでは、命令１に対応するノードと命令３に対応するノードとの間、命令２に対応するノードと命令３に対応するノードとの間、命令３に対応するノードと命令８に対応するノードとの間、命令３に対応するノードと命令１０に対応するノードとの間、及び、命令４に対応するノードと命令８に対応するノードとの間のそれぞれに双方向エッジが設定されている。また、図１１に示す依存グラフ１３１ａでは、命令４に対応するノードと命令９に対応するノードとの間、命令５に対応するノードと命令１０に対応するノードとの間、命令５に対応するノードと命令１１に対応するノードとの間、命令６に対応するノードと命令１１に対応するノードとの間、及び、命令７に対応するノードと命令１２に対応するノードとの間のそれぞれに双方向エッジが設定されている。さらに、図１１に示す依存グラフ１３１ａでは、命令８に対応するノードと命令９に対応するノードとの間、命令９に対応するノードと命令１０に対応するノードとの間、命令１０に対応するノードと命令１１に対応するノードとの間、命令１１に対応するノードと命令１２に対応するノードとの間、及び、命令１２に対応するノードと命令１３に対応するノードとの間のそれぞれに双方向エッジが設定されている。すなわち、図１１に示す依存グラフ１３１ａには、１５本の双方向エッジが設定されている。 Specifically, in the dependency graph 131a shown in FIG. 11, between the node corresponding to the instruction 1 and the node corresponding to the instruction 3, the node corresponding to the instruction 2 and the node corresponding to the instruction 3 are connected to the instruction 3. Between the corresponding node and the node corresponding to the instruction 8, between the node corresponding to the instruction 3 and the node corresponding to the instruction 10, and between the node corresponding to the instruction 4 and the node corresponding to the instruction 8. Bidirectional edges are set for each. Further, in the dependency graph 131a shown in FIG. 11, between the node corresponding to the instruction 4 and the node corresponding to the instruction 9, the node corresponding to the instruction 5 and the node corresponding to the instruction 10 correspond to the instruction 5. Between the node and the node corresponding to the instruction 11, between the node corresponding to the instruction 6 and the node corresponding to the instruction 11, and between the node corresponding to the instruction 7 and the node corresponding to the instruction 12. Bidirectional edges are set. Further, in the dependency graph 131a shown in FIG. 11, the node corresponding to the instruction 8 and the node corresponding to the instruction 9 correspond to the instruction 10, and the node corresponding to the instruction 9 and the node corresponding to the instruction 10 correspond to the instruction 10. Between the node and the node corresponding to the instruction 11, between the node corresponding to the instruction 11 and the node corresponding to the instruction 12, and between the node corresponding to the instruction 12 and the node corresponding to the instruction 13. Bidirectional edges are set. That is, 15 bidirectional edges are set in the dependency graph 131a shown in FIG.

図１０で説明した依存情報１３１において、例えば、「命令番号」が「１」である情報の「データ依存リスト」には、「３」が記憶されており、「命令番号」が「３」である情報の「データ依存リスト」には、「１」が記憶されている。そのため、情報生成部１１２は、例えば、図１１に示すように、命令１に対応するノードと命令３に対応するノードとの間に双方向エッジを設定する。 In the dependency information 131 described with reference to FIG. 10, for example, "3" is stored in the "data dependency list" of the information in which the "instruction number" is "1", and the "instruction number" is "3". "1" is stored in the "data dependency list" of certain information. Therefore, for example, as shown in FIG. 11, the information generation unit 112 sets a bidirectional edge between the node corresponding to the instruction 1 and the node corresponding to the instruction 3.

また、図１０で説明した依存情報１３１において、例えば、「命令番号」が「２」である情報の「データ依存リスト」には、「３」が記憶されており、「命令番号」が「３」である情報の「データ依存リスト」には、「２」が記憶されている。そのため、情報生成部１１２は、例えば、図１１に示すように、命令２に対応するノードと命令３に対応するノードとの間に双方向エッジを設定する。 Further, in the dependency information 131 described with reference to FIG. 10, for example, "3" is stored in the "data dependency list" of the information in which the "instruction number" is "2", and the "instruction number" is "3". "2" is stored in the "data dependency list" of the information. Therefore, for example, as shown in FIG. 11, the information generation unit 112 sets a bidirectional edge between the node corresponding to the instruction 2 and the node corresponding to the instruction 3.

さらに、図１０で説明した依存情報１３１において、例えば、「命令番号」が「３」である情報の「データ依存リスト」には、「８」が記憶されており、「命令番号」が「８」である情報の「データ依存リスト」には、「３」が記憶されている。そのため、情報生成部１１２は、例えば、図１１に示すように、命令３に対応するノードと命令８に対応するノードとの間に双方向エッジを設定する。図１１に含まれる他の情報についての説明は省略する。 Further, in the dependency information 131 described with reference to FIG. 10, for example, "8" is stored in the "data dependency list" of the information in which the "instruction number" is "3", and the "instruction number" is "8". "3" is stored in the "data dependency list" of the information. Therefore, for example, as shown in FIG. 11, the information generation unit 112 sets a bidirectional edge between the node corresponding to the instruction 3 and the node corresponding to the instruction 8. The description of other information included in FIG. 11 will be omitted.

図６に戻り、情報生成部１１２は、Ｓ２３の処理で生成した依存情報１３１を行列に変換することによって第１行列１３２を生成する（Ｓ２４）。以下、第１行列１３２の具体例について説明を行う。 Returning to FIG. 6, the information generation unit 112 generates the first matrix 132 by converting the dependency information 131 generated in the process of S23 into a matrix (S24). Hereinafter, a specific example of the first matrix 132 will be described.

［第１行列の具体例］
図１２は、第１行列１３２の具体例について説明する図である。 [Specific example of the first matrix]
FIG. 12 is a diagram illustrating a specific example of the first matrix 132.

図１２に示す第１行列１３２において、「１」から「１３」のそれぞれに対応する行は、命令１から命令１３のそれぞれに対応する行であり、「１」から「１３」のそれぞれに対応する列は、命令１から命令１３のそれぞれに対応する列である。また、図１２に示す第１行列１３２の各要素（各欄）には、行に対応する命令と列に対応する命令との間に依存関係が存在することを示す値である「１」、または、行に対応する命令と列に対応する命令との間に依存関係が存在しないことを示す値である「０」が記憶される。 In the first matrix 132 shown in FIG. 12, the rows corresponding to each of "1" to "13" are the rows corresponding to each of the instructions 1 to 13, and correspond to each of "1" to "13". The columns to be used are the columns corresponding to each of the instructions 1 to 13. Further, each element (each column) of the first matrix 132 shown in FIG. 12 has a value of "1" indicating that there is a dependency between the instruction corresponding to the row and the instruction corresponding to the column. Alternatively, "0", which is a value indicating that there is no dependency between the instruction corresponding to the row and the instruction corresponding to the column, is stored.

具体的に、図１０に示す依存情報１３１において、「命令番号」が「１」である情報の「データ依存リスト」には、「３」が記憶されている。そのため、情報生成部１１２は、図１２に示すように、例えば、命令１に対応する行に含まれる欄のうち、命令３に対応する列に含まれる欄に「１」を記憶する。 Specifically, in the dependency information 131 shown in FIG. 10, "3" is stored in the "data dependency list" of the information in which the "instruction number" is "1". Therefore, as shown in FIG. 12, the information generation unit 112 stores, for example, "1" in the column included in the column corresponding to the instruction 3 among the columns included in the row corresponding to the instruction 1.

また、図１０に示す依存情報１３１において、「命令番号」が「２」である情報の「データ依存リスト」には、「３」が記憶されている。そのため、情報生成部１１２は、図１２に示すように、例えば、命令２に対応する行に含まれる欄のうち、命令３に対応する列に含まれる欄に「１」を記憶する。 Further, in the dependency information 131 shown in FIG. 10, "3" is stored in the "data dependency list" of the information in which the "instruction number" is "2". Therefore, as shown in FIG. 12, the information generation unit 112 stores, for example, "1" in the column included in the column corresponding to the instruction 3 among the columns included in the row corresponding to the instruction 2.

さらに、図１０に示す依存情報１３１において、「命令番号」が「３」である情報の「データ依存リスト」には、「１」、「２」、「８」及び「１０」が記憶されている。そのため、情報生成部１１２は、図１２に示すように、例えば、命令３に対応する行に含まれる欄のうち、命令１に対応する列に含まれる欄と、命令２に対応する列に含まれる欄と、命令８に対応する列に含まれる欄と、命令１０に対応する列に含まれる欄とのそれぞれに「１」を記憶する。図１２に含まれる他の情報についての説明は省略する。 Further, in the dependency information 131 shown in FIG. 10, "1", "2", "8" and "10" are stored in the "data dependency list" of the information whose "instruction number" is "3". There is. Therefore, as shown in FIG. 12, the information generation unit 112 is included in, for example, a column included in the column corresponding to the instruction 1 and a column corresponding to the instruction 2 among the columns included in the row corresponding to the instruction 3. "1" is stored in each of the column, the column included in the column corresponding to the instruction 8, and the column included in the column corresponding to the instruction 10. The description of other information included in FIG. 12 will be omitted.

図６に戻り、情報処理装置１のグループ振分部１１４は、Ｓ２４の処理で生成した第１行列１３２から、各命令間における依存度合いを示す第２行列１３３を生成する（Ｓ２５）。以下、第２行列の具体例について説明を行う。 Returning to FIG. 6, the group distribution unit 114 of the information processing apparatus 1 generates a second matrix 133 indicating the degree of dependence between each instruction from the first matrix 132 generated in the process of S24 (S25). Hereinafter, a specific example of the second matrix will be described.

［第２行列の具体例］
図１３は、第２行列１３３の具体例について説明する図である。 [Specific example of the second matrix]
FIG. 13 is a diagram illustrating a specific example of the second matrix 133.

図１３に示す第２行列１３３において、「１」から「１３」のそれぞれに対応する行は、命令１から命令１３のそれぞれに対応する行であり、「１」から「１３」のそれぞれに対応する列は、命令１から命令１３のそれぞれに対応する列である。また、図１３に示す第２行列１３３の各要素（各欄）には、行に対応する命令と列に対応する命令との間の依存度合いが記憶される。 In the second matrix 133 shown in FIG. 13, the rows corresponding to each of "1" to "13" are the rows corresponding to each of the instructions 1 to 13, and correspond to each of "1" to "13". The columns to be used are the columns corresponding to each of the instructions 1 to 13. Further, in each element (each column) of the second matrix 133 shown in FIG. 13, the degree of dependence between the instruction corresponding to the row and the instruction corresponding to the column is stored.

ここで、各命令間の依存度合いは、例えば、Ｎｅｗｍａｎアルゴリズムによって算出されるクラスタリング指標値であってよい。この場合、グループ振分部１１４は、例えば、以下の式１を用いることによってクラスタリング指標値ΔＱを算出する。 Here, the degree of dependence between the instructions may be, for example, a clustering index value calculated by the Newman algorithm. In this case, the group distribution unit 114 calculates the clustering index value ΔQ by using, for example, the following equation 1.

ΔＱ＝ｅ_ｉｊ＋ｅ_ｊｉ−２ａ_ｉａ_ｊ＝２（ｅ_ｉｊ−ａ_ｉａ_ｊ）（式１） ΔQ = e _ij + e _ji -2a _i a _j = 2 (e _ij- a _i a _j ) (Equation 1)

上記の式１において、変数ａ_ｉは、図１１で説明した依存グラフ１３１ａに含まれる単方向エッジのうち、命令ｉに対応するノードと他の命令に対するノードとの間における単方向エッジの数の割合を示し、変数ａ_ｊは、図１１で説明した依存グラフ１３１ａに含まれる単方向エッジのうち、命令ｊに対応するノードと他の命令に対するノードとの間における単方向エッジの数の割合を示す。また、式１において、変数ｅ_ｉｊは、図１１で説明した依存グラフ１３１ａに含まれる単方向エッジのうち、命令ｉに対応するノードと命令ｊに対応するノードとの間における単方向エッジの数の割合を示す。 In Equation 1 above, the variable _ai is the number of unidirectional edges between the node corresponding to the instruction i and the node for another instruction among the unidirectional edges included in the dependency graph 131a described with reference to FIG. The variable a _j indicates the ratio, and the variable a _j indicates the ratio of the number of unidirectional edges between the node corresponding to the instruction j and the node for other instructions among the unidirectional edges included in the dependency graph 131a described with reference to FIG. Shown. Further, in Equation 1, the variable e _ij is the number of unidirectional edges between the node corresponding to the instruction i and the node corresponding to the instruction j among the unidirectional edges included in the dependency graph 131a described with reference to FIG. Indicates the ratio of.

具体的に、図１１で説明した依存グラフ１３１ａが示す状態は、双方向エッジが１５本含まれている状態であり、単方向エッジが３０本含まれている場合と同じ状態である。また、図１１で説明した依存グラフ１３１ａにおいて、命令１２に対応するノードと他のノードとの間における単方向エッジの数、命令１３に対応するノードと他のノードとの間における単方向エッジの数及び命令１２に対応するノードと命令１３に対応するノードとの間における単方向エッジの数は、それぞれ３本、１本及び１本である。そのため、グループ振分部１１４は、例えば、命令ｉが命令１２であって命令ｊが命令１３である場合、以下の式（２）のように、クラスタリング指標値ΔＱとして「０．０６０」を算出する。 Specifically, the state shown by the dependency graph 131a described with reference to FIG. 11 is a state in which 15 bidirectional edges are included, which is the same as a state in which 30 unidirectional edges are included. Further, in the dependency graph 131a described with reference to FIG. 11, the number of unidirectional edges between the node corresponding to the instruction 12 and the other node, and the unidirectional edge between the node corresponding to the instruction 13 and the other node. The number and the number of unidirectional edges between the node corresponding to the instruction 12 and the node corresponding to the instruction 13 are 3, 1, and 1, respectively. Therefore, for example, when the instruction i is the instruction 12 and the instruction j is the instruction 13, the group distribution unit 114 calculates “0.060” as the clustering index value ΔQ as in the following equation (2). To do.

２＊（（１／３０）−（３／３０）＊（１／３０））＝０．０６・・・（式２） 2 * ((1/30)-(3/30) * (1/30)) = 0.06 ... (Equation 2)

そのため、グループ振分部１１４は、図１３に示すように、例えば、命令１２に対応する行に含まれる欄のうち、命令１３に対応する列に含まれる欄に「０．０６０」を記憶する。 Therefore, as shown in FIG. 13, the group distribution unit 114 stores, for example, "0.060" in the column included in the column corresponding to the instruction 13 among the columns included in the row corresponding to the instruction 12. ..

なお、命令ｉが命令１３であって命令ｊが１２である場合のクラスタリング指標値ΔＱは、命令ｉが命令１２であって命令ｊが１３である場合のクラスタリング指標値ΔＱと同じ値になる。そのため、グループ振分部１１４は、例えば、命令ｉが命令１２であって命令ｊが１３である場合のクラスタリング指標値ΔＱについての算出を行った場合、命令ｉが命令１３であって命令ｊが１２である場合のクラスタリング指標値ΔＱについての算出を行わないものであってもよい。そして、グループ振分部１１４は、この場合、図１３に示すように、命令１３に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄に、クラスタリング指標値ΔＱの算出を行っていないことを示す値である「０．０００」を記憶するものであってよい。図１３に含まれる他の情報についての説明は省略する。 The clustering index value ΔQ when the instruction i is the instruction 13 and the instruction j is 12, is the same as the clustering index value ΔQ when the instruction i is the instruction 12 and the instruction j is 13. Therefore, for example, when the group distribution unit 114 calculates the clustering index value ΔQ when the instruction i is the instruction 12 and the instruction j is 13, the instruction i is the instruction 13 and the instruction j is The clustering index value ΔQ in the case of 12 may not be calculated. Then, in this case, as shown in FIG. 13, the group distribution unit 114 calculates the clustering index value ΔQ in the column included in the column corresponding to the instruction 12 among the columns included in the row corresponding to the instruction 13. It may store "0.000" which is a value indicating that the above is not performed. The description of other information included in FIG. 13 will be omitted.

図７に戻り、グループ振分部１１４は、Ｓ２５の処理で生成した第２行列１３３の要素の値のうち、最大の値に対応する各命令の振り分け先を同じグループに決定する（Ｓ３１）。 Returning to FIG. 7, the group distribution unit 114 determines the distribution destination of each instruction corresponding to the maximum value among the values of the elements of the second matrix 133 generated in the process of S25 to the same group (S31).

具体的に、図１３で説明した第２行列１３３において、命令１２に対応する行に含まれる欄のうち、命令１３に対応する列に含まれる欄と、命令７に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄とのそれぞれには、最大の値である「０．０６０」が設定されている。そのため、グループ振分部１１４は、図１３で説明した第２行列１３３の各要素に対応する命令の組合せのうち、命令１２及び命令１３の組合せまたは命令７及び命令１２の組合せを特定する。そして、グループ振分部１１４は、例えば、命令１２及び命令１３の組合せを特定した場合、命令１２と命令１３とを同じグループに振り分ける旨の決定を行う。 Specifically, in the second matrix 133 described with reference to FIG. 13, among the columns included in the row corresponding to the instruction 12, the column included in the column corresponding to the instruction 13 and the column included in the row corresponding to the instruction 7. Among them, the maximum value "0.060" is set in each of the columns included in the column corresponding to the instruction 12. Therefore, the group distribution unit 114 specifies the combination of the instruction 12 and the instruction 13 or the combination of the instruction 7 and the instruction 12 among the combinations of the instructions corresponding to each element of the second matrix 133 described with reference to FIG. Then, for example, when the combination of the instruction 12 and the instruction 13 is specified, the group distribution unit 114 determines that the instruction 12 and the instruction 13 are distributed to the same group.

そして、グループ振分部１１４は、Ｓ２５の処理で生成した第２行列１３３またはＳ３３の処理（後述する処理）で再生成した第２行列１３３における行のうち、Ｓ３１の処理で振り分け先を決定した各命令に対応する複数の行を、その複数の行における同一列ごとの要素の和を要素とする単一の行に変換し、Ｓ２５の処理で生成した第２行列１３３またはＳ３３の処理で再生成した第２行列１３３における列のうち、Ｓ３１の処理で振り分け先を決定した各命令に対応する複数の列を、その複数の列における同一行ごとの要素の和を要素とする単一の列に変換することにより、第１行列１３２を再生成する（Ｓ３２）。 Then, the group distribution unit 114 determines the distribution destination in the process of S31 among the rows in the second matrix 133 generated in the process of S25 or the row in the second matrix 133 regenerated in the process of S33 (process described later). A plurality of rows corresponding to each instruction are converted into a single row having the sum of the elements of the same column in the plurality of rows as an element, and reproduced by the processing of the second matrix 133 or S33 generated in the processing of S25. Among the columns in the second matrix 133 formed, a single column having a plurality of columns corresponding to each instruction whose distribution destination is determined in the process of S31 and the sum of the elements of the same row in the plurality of columns as elements. The first matrix 132 is regenerated by converting to (S32).

具体的に、グループ振分部１１４は、例えば、Ｓ３１の処理における決定結果に基づいて依存情報１３１を更新する。そして、グループ振分部１１４は、例えば、更新した依存情報１３１を参照することによって第１行列１３２の再生成を行う。以下、Ｓ３２の処理の具体例について説明を行う。 Specifically, the group distribution unit 114 updates the dependency information 131 based on the determination result in the process of S31, for example. Then, the group distribution unit 114 regenerates the first matrix 132 by referring to the updated dependency information 131, for example. Hereinafter, a specific example of the processing of S32 will be described.

［Ｓ３２の処理の具体例］
図１４及び図１５は、Ｓ３２の処理の具体例を説明する図である。 [Specific example of processing of S32]
14 and 15 are diagrams for explaining a specific example of the process of S32.

例えば、Ｓ３１の処理で同じグループに振り分けることを決定した各命令が命令１２及び命令１３である場合、グループ振分部１１４は、図１４に示すように、「命令番号」が「１３」である情報の「グループ名」に記憶された値を、「命令番号」が「１２」である情報の「グループ名」に記憶された値である「１２」に更新するように、依存情報１３１を更新する。 For example, when the instructions 12 and the instruction 13 are determined to be distributed to the same group in the process of S31, the group distribution unit 114 has the "instruction number" of "13" as shown in FIG. Dependency information 131 is updated so that the value stored in the "group name" of the information is updated to "12" which is the value stored in the "group name" of the information whose "instruction number" is "12". To do.

その後、グループ振分部１１４は、例えば、図１５に示すように、命令１２に対応する行に含まれる欄に、命令１２に対応する行に含まれる欄に設定されている値と、命令１３に対応する行に含まれる欄に設定されている値とを同一列ごとに加算することによって算出した値をそれぞれ記憶する。また、グループ振分部１１４は、例えば、図１５に示すように、命令１２に対応する列に含まれる欄に、命令１２に対応する列に含まれる欄に設定されている値と、命令１３に対応する列に含まれる欄に設定されている値とを同一行ごとに加算することによって算出した値をそれぞれ記憶する。 After that, as shown in FIG. 15, for example, the group distribution unit 114 sets a value set in a column included in the line corresponding to the instruction 12 in a column included in the line corresponding to the instruction 12 and an instruction 13 The value calculated by adding the value set in the column included in the row corresponding to is added for each same column is stored. Further, as shown in FIG. 15, for example, the group distribution unit 114 has a value set in a column included in the column corresponding to the instruction 12 and a value set in the column included in the column corresponding to the instruction 12, and the instruction 13 The value calculated by adding the value set in the column included in the column corresponding to is added for each row is stored.

具体的に、図１２で説明した第１行列１３２において、例えば、命令１２に対応する行に含まれる欄のうち、命令７に対応する列に含まれる欄には、「１」が記憶されており、命令１３に対応する行に含まれる欄のうち、命令７に対応する列に含まれる欄には、「０」が記憶されている。そのため、グループ振分部１１４は、図１５に示すように、例えば、命令１２に対応する行に含まれる欄のうち、命令７に対応する列に含まれる欄に、「１」及び「０」の和である「１」を記憶する。 Specifically, in the first matrix 132 described with reference to FIG. 12, for example, among the columns included in the row corresponding to the instruction 12, "1" is stored in the column included in the column corresponding to the instruction 7. Therefore, among the columns included in the row corresponding to the instruction 13, "0" is stored in the column included in the column corresponding to the instruction 7. Therefore, as shown in FIG. 15, the group distribution unit 114 has, for example, "1" and "0" in the columns included in the column corresponding to the instruction 7 among the columns included in the row corresponding to the instruction 12. Memorize "1" which is the sum of.

また、図１２で説明した第１行列１３２において、例えば、命令１１に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄には、「１」が記憶されており、命令１１に対応する行に含まれる欄のうち、命令１３に対応する列に含まれる欄には、「０」が記憶されている。そのため、グループ振分部１１４は、図１５に示すように、例えば、命令１１に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄に、「１」及び「０」の和である「１」を記憶する。 Further, in the first matrix 132 described with reference to FIG. 12, for example, among the columns included in the row corresponding to the instruction 11, "1" is stored in the column included in the column corresponding to the instruction 12. Among the columns included in the row corresponding to the instruction 11, "0" is stored in the column included in the column corresponding to the instruction 13. Therefore, as shown in FIG. 15, the group distribution unit 114 has, for example, "1" and "0" in the columns included in the column corresponding to the instruction 12 among the columns included in the row corresponding to the instruction 11. Memorize "1" which is the sum of.

さらに、図１２で説明した第１行列１３２において、例えば、命令１２に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄には、「０」が記憶されており、命令１２に対応する行に含まれる欄のうち、命令１３に対応する列に含まれる欄には、「１」が記憶されており、命令１３に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄には、「１」が記憶されており、命令１３に対応する行に含まれる欄のうち、命令１３に対応する列に含まれる欄には、「０」が記憶されている。そのため、グループ振分部１１４は、図１５に示すように、例えば、命令１２に対応する行に含まれる欄のうち、命令１２に対応する列に含まれる欄に、「０」、「１」、「１」及び「０」の和である「２」を記憶する。図１５に含まれる他の情報についての説明は省略する。 Further, in the first matrix 132 described with reference to FIG. 12, for example, among the columns included in the row corresponding to the instruction 12, "0" is stored in the column included in the column corresponding to the instruction 12. Of the columns included in the row corresponding to the instruction 12, "1" is stored in the column included in the column corresponding to the instruction 13, and among the columns included in the row corresponding to the instruction 13, the instruction 12 "1" is stored in the column corresponding to the column corresponding to, and "0" is stored in the column included in the column corresponding to the instruction 13 among the columns included in the row corresponding to the instruction 13. It is remembered. Therefore, as shown in FIG. 15, the group distribution unit 114 has, for example, "0" and "1" in the columns included in the column corresponding to the instruction 12 among the columns included in the row corresponding to the instruction 12. , "2" which is the sum of "1" and "0" is stored. The description of other information included in FIG. 15 will be omitted.

図７に戻り、グループ振分部１１４は、Ｓ３２の処理で再生成した第１行列１３２から第２行列１３３を再生成する（Ｓ３３）。 Returning to FIG. 7, the group distribution unit 114 regenerates the second matrix 133 from the first matrix 132 regenerated in the process of S32 (S33).

具体的に、グループ振分部１１４は、例えば、図１５で説明した第１行列１３２に対してＳ２５の処理と同じ処理を行うことにより、図１６に示す第２行列１３３を生成（再生成）する。 Specifically, for example, the group distribution unit 114 generates (regenerates) the second matrix 133 shown in FIG. 16 by performing the same processing as the processing of S25 on the first matrix 132 described with reference to FIG. To do.

そして、グループ振分部１１４は、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達したか否かを判定する（Ｓ３４）。 Then, the group distribution unit 114 determines whether or not the number of distribution destination groups of each instruction included in the loop determined to be the division target in the process of S21 has reached a predetermined number or less (S34).

具体的に、図１４で説明した依存情報１３１の「グループ名」には、「１」から「１２」までの値（１２種類の値）が記憶されている。そのため、グループ振分部１１４は、この場合、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数として「１２」を特定する。そして、例えば、Ｓ３４の処理における所定数が「２」である場合、グループ振分部１１４は、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達していないと判定する。 Specifically, values (12 types of values) from "1" to "12" are stored in the "group name" of the dependency information 131 described with reference to FIG. Therefore, in this case, the group distribution unit 114 specifies "12" as the number of distribution destination groups of each instruction included in the loop determined to be the division target in the process of S21. Then, for example, when the predetermined number in the process of S34 is "2", the group distribution unit 114 has the number of groups to which each instruction is distributed included in the loop determined to be the division target in the process of S21. It is determined that the number has not reached the predetermined number or less.

一方、図１７に示す依存情報１３１の「グループ名」には、「１」及び「５」のみ（２種類の値）が記憶されている。そのため、グループ振分部１１４は、この場合、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数として「２」を特定する。そして、例えば、Ｓ３４の処理における所定数が「２」である場合、グループ振分部１１４は、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達したと判定する。 On the other hand, only "1" and "5" (two kinds of values) are stored in the "group name" of the dependency information 131 shown in FIG. Therefore, in this case, the group distribution unit 114 specifies "2" as the number of distribution destination groups of each instruction included in the loop determined to be the division target in the process of S21. Then, for example, when the predetermined number in the process of S34 is "2", the group distribution unit 114 has the number of groups to which each instruction is distributed included in the loop determined to be the division target in the process of S21. It is determined that the number has reached a predetermined number or less.

なお、グループ振分部１１４は、図１７に示す依存情報１３１を生成したことに応じて、例えば、図１８に示す第１行列１３２の生成を行う。また、グループ振分部１１４は、図１７に示す依存情報１３１を生成したことに応じて、例えば、図１９に示す依存グラフ１３１ａを生成する。以下、図１９に示す依存グラフ１３１ａの具体例について説明を行う。 The group distribution unit 114 generates, for example, the first matrix 132 shown in FIG. 18 in response to the generation of the dependency information 131 shown in FIG. Further, the group distribution unit 114 generates, for example, the dependency graph 131a shown in FIG. 19 in response to the generation of the dependency information 131 shown in FIG. Hereinafter, a specific example of the dependency graph 131a shown in FIG. 19 will be described.

［依存グラフの具体例（２）］
図１９に示す依存グラフ１３１ａにおいて、命令１、命令２、命令３、命令４、命令８及び命令９のそれぞれに対応するノード群と、命令５、命令６、命令７、命令１０、命令１１、命令１２及び命令１３のそれぞれに対応するノード群とは、それぞれ異なるグループに含まれている。 [Specific example of dependency graph (2)]
In the dependency graph 131a shown in FIG. 19, the node group corresponding to each of instruction 1, instruction 2, instruction 3, instruction 4, instruction 8 and instruction 9, and instruction 5, instruction 6, instruction 7, instruction 10, instruction 11, The node group corresponding to each of the instruction 12 and the instruction 13 is included in a different group.

そして、図１９に示す依存グラフ１３１ａにおいて、命令１に対応するノードを含むグループと、命令５に対応するノードを含むグループとの間には、命令３に対応するノードと命令１０に対応するノードとの間のエッジと、命令９に対応するノードと命令１０に対応するノードとの間のエッジとが設定されている。 Then, in the dependency graph 131a shown in FIG. 19, between the group including the node corresponding to the instruction 1 and the group including the node corresponding to the instruction 5, the node corresponding to the instruction 3 and the node corresponding to the instruction 10 are connected. An edge between the two, and an edge between the node corresponding to the instruction 9 and the node corresponding to the instruction 10 are set.

すなわち、図１９に示す依存グラフ１３１ａは、命令１、命令２、命令３、命令４、命令８及び命令９が一方の分割ループに含まれ、かつ、命令５、命令６、命令７、命令１０、命令１１、命令１２及び命令１３が他方の分割ループに含まれるようにループ分割を行った場合、異なる分割ループのそれぞれに含まれる命令間におけるエッジの数を２本に抑えることが可能になることを示している。 That is, in the dependency graph 131a shown in FIG. 19, instruction 1, instruction 2, instruction 3, instruction 4, instruction 8, and instruction 9 are included in one of the split loops, and instruction 5, instruction 6, instruction 7, and instruction 10 are included. When loop splitting is performed so that the instructions 11, instruction 12, and instruction 13 are included in the other division loop, the number of edges between the instructions included in each of the different division loops can be suppressed to two. It is shown that.

図９に戻り、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達したと判定した場合（Ｓ４１のＹＥＳ）、情報処理装置１のループ分割部１１５は、Ｓ３３の処理で生成された第２行列１３３の内容に従って、Ｓ２１の処理で分割対象であると判定したループのループ分割を行う（Ｓ４２）。 Returning to FIG. 9, when it is determined that the number of distribution destination groups of each instruction included in the loop determined to be the division target in the process of S21 has reached a predetermined number or less (YES in S41), the information processing device 1 The loop splitting unit 115 of the above performs loop splitting of the loop determined to be the split target in the processing of S21 according to the contents of the second matrix 133 generated in the processing of S33 (S42).

その後、情報処理装置１は、Ｓ２の処理を終了する。なお、情報処理装置１は、情報格納領域１３０に記憶された中間言語２２に、分割対象のループが含まれていないと判定した場合も同様に（Ｓ２２のＮＯ）、Ｓ２の処理を終了する。 After that, the information processing device 1 ends the processing of S2. When it is determined that the intermediate language 22 stored in the information storage area 130 does not include the loop to be divided, the information processing device 1 similarly ends the process of S2 (NO in S22).

一方、Ｓ２１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達していないと判定した場合（Ｓ４１のＮＯ）、グループ振分部１１４は、Ｓ３２以降の処理を再度行う。以下、ループ分割を行った後の分割ループの具体例について説明を行う。 On the other hand, when it is determined that the number of distribution destination groups of each instruction included in the loop determined to be the division target in the processing of S21 has not reached a predetermined number or less (NO in S41), the group distribution unit 114 Performs the processing after S32 again. Hereinafter, a specific example of the split loop after performing the loop split will be described.

［分割ループの具体例］
図２０は、分割ループの内容を説明する具体例である。図２０（Ａ）は、分割ループのうちの一方を説明する具体例であり、図２０（Ｂ）は、分割ループのうちの他方を説明する具体例である。 [Specific example of split loop]
FIG. 20 is a specific example for explaining the contents of the split loop. FIG. 20A is a specific example for explaining one of the divided loops, and FIG. 20B is a specific example for explaining the other of the divided loops.

図２０（Ａ）に示す分割ループには、図９で説明した中間言語のうち、命令１、命令２、命令３、命令４、命令８及び命令９が含まれている。 The split loop shown in FIG. 20A includes instruction 1, instruction 2, instruction 3, instruction 4, instruction 8 and instruction 9 among the intermediate languages described in FIG.

そして、図２０（Ａ）に示す分割ループは、命令１等の後に、変数ｒｅｇ３に設定された値を、配列ｔｍｐ＿ａｒｒａｙ１のｒｅｇ＿ｉ番目に格納することを示す命令である「ｓｔｏｒｅｔｍｐ＿ａｒｒａｙ１（ｒｅｇ＿ｉ），ｒｅｇ３」と、変数ｒｅｇ９に設定された値を、配列ｔｍｐ＿ａｒｒａｙ２のｒｅｇ＿ｉ番目に格納することを示す命令である「ｓｔｏｒｅｔｍｐ＿ａｒｒａｙ２（ｒｅｇ＿ｉ），ｒｅｇ９」とを含む。 Then, the split loop shown in FIG. 20 (A) is an instruction indicating that the value set in the variable reg3 is stored in the reg_ith position of the array tp_array1 after the instruction 1 or the like, "store tp_array1 (reg_i), reg3". , And "store tp_array2 (reg_i), reg9", which is an instruction indicating that the value set in the variable reg9 is stored in the reg_ith position of the array tp_array2.

一方、図２０（Ｂ）に示す分割ループには、図９で説明した中間言語のうち、命令５、命令６、命令７、命令１０、命令１１、命令１２及び命令１３が含まれている。 On the other hand, the split loop shown in FIG. 20B includes instruction 5, instruction 6, instruction 7, instruction 10, instruction 11, instruction 12, and instruction 13 among the intermediate languages described in FIG.

そして、図２０（Ｂ）に示す分割ループは、命令５等の前に、配列ｔｍｐ＿ａｒｒａｙ１のｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ３に設定することを示す命令である「ｌｏａｄｒｅｇ３，ｔｍｐ＿ａｒｒａｙ１（ｒｅｇ＿ｉ）」と、配列ｔｍｐ＿ａｒｒａｙ２のｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ９に設定することを示す命令である「ｌｏａｄｒｅｇ９，ｔｍｐ＿ａｒｒａｙ２（ｒｅｇ＿ｉ）」とを含む。 Then, the split loop shown in FIG. 20 (B) is an instruction indicating that the value stored in the reg_i th of the array tm_array1 is set in the variable reg3 before the instruction 5 and the like, "load reg3, tp_array1 (reg_i). ) ”, And“ load reg9, tp_array2 (reg_i) ”which is an instruction indicating that the value stored in the rig_i th of the array tp_array2 is set in the variable reg9.

すなわち、図１９で説明したように、変数ｒｅｇ３に設定された値（命令３に対応する値）、及び、変数ｒｅｇ９に設定された値（命令９に対応する値）は、図２０（Ｂ）に示す分割ループに含まれる命令１０において参照される。そのため、図２０（Ａ）に示す分割ループには、変数ｒｅｇ３に設定された値及び変数ｒｅｇ９に設定された値を一時配列に格納する命令が含まれている。また、図２０（Ｂ）に示す分割ループには、一時配列に格納されている値を取り出す命令が含まれている。 That is, as described with reference to FIG. 19, the value set in the variable reg3 (value corresponding to the instruction 3) and the value set in the variable reg9 (value corresponding to the instruction 9) are shown in FIG. 20B. It is referred to in instruction 10 included in the split loop shown in. Therefore, the split loop shown in FIG. 20A includes an instruction to store the value set in the variable reg3 and the value set in the variable reg9 in a temporary array. Further, the division loop shown in FIG. 20B includes an instruction for fetching a value stored in a temporary array.

このように、本実施の形態における情報処理装置１は、ソースコード２１から生成した中間言語２２のループに含まれる各命令間の依存関係を示す依存情報１３１を生成し、生成した依存情報１３１を行列に変換することによって第１行列１３２を生成する。 As described above, the information processing apparatus 1 in the present embodiment generates the dependency information 131 indicating the dependency relationship between each instruction included in the loop of the intermediate language 22 generated from the source code 21, and generates the dependency information 131. The first matrix 132 is generated by converting to a matrix.

そして、情報処理装置１は、生成した第１行列１３２から算出した各命令間における依存度合いに基づいて、ループに含まれる各命令を複数のグループに振り分け、振り分けたグループごとにループの分割を行う。 Then, the information processing device 1 distributes each instruction included in the loop into a plurality of groups based on the degree of dependence between the instructions calculated from the generated first matrix 132, and divides the loop into each of the distributed groups. ..

すなわち、本実施の形態における情報処理装置１は、依存情報１３１から生成された第１行列１３２を用いた演算を行うことで、分割対象のループに含まれる命令の組合せごとの依存関係の解析等を行うことなく、各命令の振り分け先を決定する。 That is, the information processing device 1 in the present embodiment performs an operation using the first matrix 132 generated from the dependency information 131, thereby analyzing the dependency relationship for each combination of instructions included in the loop to be divided. To determine the distribution destination of each command without performing.

なお、情報処理装置１は、Ｓ２１及びＳ２２の処理において、情報格納領域１３０に記憶された中間言語２２に分割対象のループが複数含まれていると判定した場合、Ｓ２３以降の処理を分割対象のループごとに行うものであってよい。 When the information processing device 1 determines in the processing of S21 and S22 that the intermediate language 22 stored in the information storage area 130 includes a plurality of loops to be divided, the processing after S23 is to be divided. It may be performed for each loop.

また、情報処理装置１は、Ｓ２５に処理においてクラスタリング指標値を算出する場合、Ｎｅｗｍａｎアルゴリズム以外のアルゴリズム（例えば、ラベル伝搬法やＫ−ｍｅａｎｓ法等のアルゴリズム）を用いるものであってもよい。 Further, the information processing apparatus 1 may use an algorithm other than the Newsman algorithm (for example, an algorithm such as a label propagation method or a K-means method) when calculating the clustering index value in the processing in S25.

［第２の実施の形態］
次に、第２の実施の形態の詳細について説明する。図２１から図２３は、第２の実施の形態におけるＳ２の処理を説明するフローチャート図である。また、図２４から図２８は、第２の実施の形態におけるＳ２の処理の詳細を説明する図である。 [Second Embodiment]
Next, the details of the second embodiment will be described. 21 to 23 are flowcharts illustrating the process of S2 in the second embodiment. In addition, FIGS. 24 to 28 are diagrams for explaining the details of the processing of S2 in the second embodiment.

第２の実施の形態におけるコンパイル処理は、第１の実施の形態におけるコンパイル処理と異なり、各命令がアクセスするデータのメモリ１０２内における位置関係についても参照してループの分割を行う。 The compilation process in the second embodiment is different from the compilation process in the first embodiment, and the loop is divided by referring to the positional relationship of the data accessed by each instruction in the memory 102.

分割判定部１１１は、図２１に示すように、情報格納領域１３０に記憶された中間言語２２に、分割対象のループが含まれるか否かを判定する（Ｓ５１）。 As shown in FIG. 21, the division determination unit 111 determines whether or not the intermediate language 22 stored in the information storage area 130 includes a loop to be divided (S51).

そして、情報格納領域１３０に記憶された中間言語２２に、分割対象のループが含まれていると判定した場合（Ｓ５２のＹＥＳ）、情報処理装置１の近接判定部１１６は、例えば、メモリ１０２内において近接するアドレスに格納された各データに対してアクセスを行う複数の命令が分割対象のループに含まれているか否かを判定する（Ｓ５３）。 Then, when it is determined that the intermediate language 22 stored in the information storage area 130 includes the loop to be divided (YES in S52), the proximity determination unit 116 of the information processing device 1 is, for example, in the memory 102. In (S53), it is determined whether or not a plurality of instructions for accessing each data stored in adjacent addresses are included in the loop to be divided (S53).

具体的に、近接判定部１１６は、例えば、同一の配列に格納された各データに対してアクセスを行う複数の命令が同一の分割対象のループに含まれているか否かを判定する。 Specifically, the proximity determination unit 116 determines, for example, whether or not a plurality of instructions for accessing each data stored in the same array are included in the same loop to be divided.

すなわち、オブジェクトコード２３の実行時において、例えば、第１命令の実行に伴って第１配列の第１データに対するアクセスが発生する場合、ＣＰＵ１０１は、メモリ１０２に格納されている第１配列の各データを含む所定サイズのデータをキャッシュメモリ（図示しない）に一時的に格納し、キャッシュメモリに格納した第１データに対してアクセスを行う。 That is, when the object code 23 is executed, for example, when the first data of the first array is accessed with the execution of the first instruction, the CPU 101 performs each data of the first array stored in the memory 102. Data of a predetermined size including the above is temporarily stored in a cache memory (not shown), and the first data stored in the cache memory is accessed.

そして、例えば、第１命令と異なる第２命令の実行に伴って第１配列の第２データに対するアクセスが発生する場合、ＣＰＵ１０１は、第１配列の各データがキャッシュメモリにまだ格納されていれば、キャッシュメモリに格納されている第２データに対してアクセスを行う。一方、他のデータに対するアクセスの発生等に伴って第１配列の各データがキャッシュメモリから既に追い出されている場合、ＣＰＵ１０１は、メモリ１０２に格納されている第１配列の各データを含む所定サイズのデータをキャッシュメモリに再度に格納し、キャッシュメモリに格納した第２データに対してアクセスを行う。 Then, for example, when access to the second data in the first array occurs due to the execution of the second instruction different from the first instruction, the CPU 101 determines that each data in the first array is still stored in the cache memory. , Access the second data stored in the cache memory. On the other hand, when each data in the first array has already been expelled from the cache memory due to the occurrence of access to other data, the CPU 101 has a predetermined size including each data in the first array stored in the memory 102. Data is stored in the cache memory again, and the second data stored in the cache memory is accessed.

そのため、情報処理装置１は、例えば、上記のような第１命令及び第２命令が分割対象のループに含まれている場合、第１命令と第２命令が同じ分割ループに含まれるようにループ分割を行い、第１命令の実行タイミングと第２命令の実行タイミングとを近接させる。 Therefore, for example, when the first instruction and the second instruction as described above are included in the loop to be divided, the information processing apparatus 1 loops so that the first instruction and the second instruction are included in the same division loop. The division is performed so that the execution timing of the first instruction and the execution timing of the second instruction are brought close to each other.

これにより、情報処理装置１は、分割対象のループの実行中に第１データがキャッシュメモリから追い出される確率を抑えることが可能になる。そのため、情報処理装置１は、第１データをキャッシュメモリに再格納する処理の発生頻度を抑えることが可能になり、オブジェクトコード２３の実行時間を短縮させることが可能になる。 As a result, the information processing apparatus 1 can suppress the probability that the first data is expelled from the cache memory during the execution of the loop to be divided. Therefore, the information processing device 1 can reduce the frequency of occurrence of the process of re-storing the first data in the cache memory, and can shorten the execution time of the object code 23.

したがって、近接判定部１１６は、Ｓ５３の処理において、例えば、分割対象のループに含まれる命令から、同一の配列に含まれる各データに対してアクセスを行う複数の命令の特定を行う。 Therefore, in the process of S53, the proximity determination unit 116 identifies a plurality of instructions for accessing each data included in the same array from the instructions included in the loop to be divided, for example.

なお、近接判定部１１６は、例えば、ある配列に含まれるデータのうち、所定の回転数に対応する範囲内のデータに対してアクセスを行う複数の命令の特定を行うものであってもよい。 Note that the proximity determination unit 116 may specify, for example, a plurality of instructions for accessing data within a range corresponding to a predetermined rotation speed among the data included in a certain array.

図２１に戻り、情報生成部１１２は、Ｓ５１の処理で分割対象であると判定したループに含まれる各命令間の依存関係と、各命令がアクセスするデータのメモリ１０２内における位置関係とを示す依存情報１３１を生成する（Ｓ５４）。 Returning to FIG. 21, the information generation unit 112 shows the dependency relationship between each instruction included in the loop determined to be the division target in the process of S51, and the positional relationship of the data accessed by each instruction in the memory 102. Dependency information 131 is generated (S54).

［依存情報の具体例］
図２４は、依存情報１３１の具体例について説明する図である。 [Specific example of dependency information]
FIG. 24 is a diagram illustrating a specific example of the dependency information 131.

図２４に示す依存情報１３１は、図１０で説明した依存情報１３１が有する項目に加え、メモリ１０２内における近接したアドレスに格納された各データに対してアクセスを行う複数の命令の命令番号についてのリストである「キャッシュ共有依存リスト」を項目として有する。 The dependency information 131 shown in FIG. 24 refers to the instruction numbers of a plurality of instructions for accessing each data stored at adjacent addresses in the memory 102, in addition to the items included in the dependency information 131 described with reference to FIG. It has a "cache sharing dependency list" which is a list as an item.

具体的に、図９で説明した中間言語２２において、配列Ｃに格納されているデータは、命令４及び命令６のそれぞれにおいて参照されている。そのため、情報生成部１１２は、例えば、図２４に示すように、「命令番号」が「４」である情報の「キャッシュ共有依存リスト」に「６」を記憶し、「命令番号」が「６」である情報の「キャッシュ共有依存リスト」に「４」を記憶する。また、情報生成部１１２は、この場合、例えば、図２４に示すように、「命令番号」が「４」及び「６」以外である情報の「キャッシュ共有依存リスト」に、情報が存在しないことを示す「−」を記憶する。 Specifically, in the intermediate language 22 described with reference to FIG. 9, the data stored in the array C is referred to in each of the instruction 4 and the instruction 6. Therefore, for example, as shown in FIG. 24, the information generation unit 112 stores "6" in the "cache sharing dependency list" of the information whose "instruction number" is "4", and the "instruction number" is "6". "4" is stored in the "cache sharing dependency list" of the information. Further, in this case, the information generation unit 112 does not have any information in the "cache sharing dependency list" of the information whose "instruction number" is other than "4" and "6", for example, as shown in FIG. Memorize the "-" indicating.

図２１に戻り、情報生成部１１２は、Ｓ５４の処理で生成した依存情報１３１を行列に変換することによって第１行列１３２を生成する（Ｓ５５）。以下、第１行列１３２の具体例について説明を行う。 Returning to FIG. 21, the information generation unit 112 generates the first matrix 132 by converting the dependency information 131 generated in the process of S54 into a matrix (S55). Hereinafter, a specific example of the first matrix 132 will be described.

［第１行列の具体例］
図２５は、第１行列１３２の具体例について説明する図である。 [Specific example of the first matrix]
FIG. 25 is a diagram illustrating a specific example of the first matrix 132.

図２５に示す第１行列１３２の各要素（各欄）には、行に対応する命令と列に対応する命令との間に依存関係が存在することを示す値である「１」、行に対応する命令及び列に対応する命令のそれぞれがアクセスするデータのメモリ１０２内における位置が近接していることを示す値である「５」、または、行に対応する命令と列に対応する命令との間に依存関係がせず、かつ、行に対応する命令及び列に対応する命令のそれぞれがアクセスするデータのメモリ１０２内における位置が近接していないことを示す値である「０」が記憶される。 Each element (each column) of the first matrix 132 shown in FIG. 25 has a value of "1" indicating that there is a dependency between the instruction corresponding to the row and the instruction corresponding to the column, in the row. "5", which is a value indicating that the positions of the data to be accessed by the corresponding instruction and the instruction corresponding to the column are close to each other in the memory 102, or the instruction corresponding to the row and the instruction corresponding to the column. There is no dependency between the two, and "0", which is a value indicating that the positions in the memory 102 of the data accessed by each of the instruction corresponding to the row and the instruction corresponding to the column are not close to each other, is stored. Will be done.

具体的に、図２４で説明した依存情報１３１において、「命令番号」が「４」である情報の「キャッシュ共有依存リスト」には、「６」が記憶されており、「命令番号」が「６」である情報の「キャッシュ共有依存リスト」には、「４」が記憶されている。そのため、情報生成部１１２は、図２５に示すように、例えば、命令４に対応する行に含まれる欄のうち、命令６に対応する列に含まれる欄に「５」を記憶する。また、情報生成部１１２は、図２５に示すように、例えば、命令６に対応する行に含まれる欄のうち、命令４に対応する列に含まれる欄に「５」を記憶する。 Specifically, in the dependency information 131 described with reference to FIG. 24, "6" is stored in the "cache sharing dependency list" of the information in which the "instruction number" is "4", and the "instruction number" is "4". "4" is stored in the "cache sharing dependency list" of the information which is "6". Therefore, as shown in FIG. 25, the information generation unit 112 stores, for example, "5" in the column included in the column corresponding to the instruction 6 among the columns included in the row corresponding to the instruction 4. Further, as shown in FIG. 25, the information generation unit 112 stores, for example, "5" in the column included in the column corresponding to the instruction 4 among the columns included in the row corresponding to the instruction 6.

すなわち、依存関係にある複数の命令が異なる分割ループに含まれることによるオブジェクトコード２３の実行時間に対する影響よりも、キャッシュミスの発生回数が増加することによるオブジェクトコード２３の実行時間に対する影響の方が大きいと判断できる。そのため、情報生成部１１２は、図２５に示すように、例えば、各命令がアクセスするデータのメモリ１０２内における位置が近接していることを示す値が、各命令間に依存関係が存在することを示す値よりも大きくなるように、第１行列１３２の生成を行う。 That is, the effect on the execution time of the object code 23 due to the increase in the number of cache misses is greater than the effect on the execution time of the object code 23 due to the inclusion of a plurality of dependent instructions in different split loops. It can be judged that it is large. Therefore, as shown in FIG. 25, the information generation unit 112 has, for example, that the values indicating that the positions of the data accessed by each instruction in the memory 102 are close to each other have a dependency relationship between the instructions. The first matrix 132 is generated so as to be larger than the value indicating.

図２１に戻り、グループ振分部１１４は、Ｓ５５の処理で生成した第１行列１３２から、各命令間における依存度合いを示す第２行列１３３を生成する（Ｓ５６）。 Returning to FIG. 21, the group distribution unit 114 generates a second matrix 133 indicating the degree of dependence between each instruction from the first matrix 132 generated in the process of S55 (S56).

そして、グループ振分部１１４は、図２２に示すように、Ｓ５６の処理で生成した第２行列１３３の要素の値のうち、最大の値に対応する複数の命令の振り分け先を同じグループに決定する（Ｓ６１）。 Then, as shown in FIG. 22, the group distribution unit 114 determines the distribution destination of a plurality of instructions corresponding to the maximum values among the values of the elements of the second matrix 133 generated in the process of S56 to the same group. (S61).

続いて、グループ振分部１１４は、Ｓ５６の処理で生成した第２行列１３３またはＳ６３の処理で再生成した第２行列１３３における行のうち、Ｓ６１の処理で振り分け先を決定した各命令に対応する複数の行を、その複数の行における同一列ごとの要素の和を要素とする単一の行に変換し、Ｓ５６の処理で生成した第２行列１３３またはＳ６３の処理で再生成した第２行列１３３における列のうち、Ｓ６１の処理で振り分け先を決定した各命令に対応する複数の列を、その複数の列における同一行ごとの要素の和を要素とする単一の列に変換することにより、第１行列１３２を再生成する（Ｓ６２）。 Subsequently, the group distribution unit 114 corresponds to each instruction whose distribution destination is determined by the processing of S61 among the rows in the second matrix 133 generated by the processing of S56 or the second matrix 133 regenerated by the processing of S63. The second row is converted into a single row whose elements are the sum of the elements of the same column in the plurality of rows, and regenerated by the processing of the second matrix 133 or S63 generated by the processing of S56. Of the columns in the matrix 133, a plurality of columns corresponding to each instruction whose distribution destination is determined in the process of S61 are converted into a single column having the sum of the elements of the same row in the plurality of columns as elements. Regenerates the first matrix 132 (S62).

さらに、グループ振分部１１４は、Ｓ６２の処理で再生成した第１行列１３２から第２行列１３３を再生成する（Ｓ６３）。 Further, the group distribution unit 114 regenerates the second matrix 133 from the first matrix 132 regenerated in the process of S62 (S63).

その後、グループ振分部１１４は、Ｓ５１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達したか否かを判定する（Ｓ６４）。 After that, the group distribution unit 114 determines whether or not the number of distribution destination groups of each instruction included in the loop determined to be the division target in the process of S51 has reached a predetermined number or less (S64).

具体的に、例えば、Ｓ６４の処理における所定数が「２」である場合において、図２６に示す第１行列１３２が生成されている場合、グループ振分部１１４は、Ｓ５１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達したと判定する。 Specifically, for example, when the predetermined number in the processing of S64 is "2" and the first matrix 132 shown in FIG. 26 is generated, the group distribution unit 114 is a division target in the processing of S51. It is determined that the number of distribution destination groups of each instruction included in the loop determined to be present has reached a predetermined number or less.

なお、図２７に示すように、図２６に示す第１行列１３２に対応する依存グラフ１３１ａでは、同じ配列に格納された各データを参照する各命令（命令４及び命令６）が同じグループに振り分けられている。 As shown in FIG. 27, in the dependency graph 131a corresponding to the first matrix 132 shown in FIG. 26, each instruction (instruction 4 and instruction 6) that refers to each data stored in the same array is distributed to the same group. Has been done.

続いて、Ｓ５１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達したと判定した場合（Ｓ７１のＹＥＳ）、ループ分割部１１５は、Ｓ６３の処理で生成された第２行列１３３の内容に従って、Ｓ５１の処理で分割対象であると判定したループのループ分割を行う（Ｓ７２）。 Subsequently, when it is determined that the number of the distribution destination groups of each instruction included in the loop determined to be the division target in the processing of S51 has reached a predetermined number or less (YES in S71), the loop division unit 115 determines. According to the contents of the second matrix 133 generated in the process of S63, the loop split of the loop determined to be the split target in the process of S51 is performed (S72).

そして、情報処理装置１は、Ｓ２の処理を終了する。なお、情報処理装置１は、情報格納領域１３０に記憶された中間言語２２に、分割対象のループが含まれていないと判定した場合も同様に（Ｓ５２のＮＯ）、Ｓ２の処理を終了する。 Then, the information processing device 1 ends the processing of S2. When it is determined that the intermediate language 22 stored in the information storage area 130 does not include the loop to be divided, the information processing device 1 also ends the processing of S2 (NO in S52).

一方、Ｓ５１の処理で分割対象であると判定したループに含まれる各命令の振り分け先のグループの数が所定数以下に到達していないと判定した場合（Ｓ７１のＮＯ）、グループ振分部１１４は、Ｓ６２以降の処理を再度行う。以下、ループ分割を行った後の分割ループの具体例について説明を行う。 On the other hand, when it is determined that the number of the distribution destination groups of each instruction included in the loop determined to be the division target in the processing of S51 has not reached the predetermined number or less (NO in S71), the group distribution unit 114 Performs the processing after S62 again. Hereinafter, a specific example of the split loop after performing the loop split will be described.

［分割ループの具体例］
図２８は、分割ループの内容を説明する具体例である。図２８（Ａ）は、分割ループのうちの一方を説明する具体例であり、図２８（Ｂ）は、分割ループのうちの他方を説明する具体例である。 [Specific example of split loop]
FIG. 28 is a specific example for explaining the contents of the split loop. FIG. 28 (A) is a specific example for explaining one of the divided loops, and FIG. 28 (B) is a specific example for explaining the other of the divided loops.

図２８（Ａ）に示す分割ループには、図９で説明した中間言語のうち、命令１、命令２、命令３、命令４、命令６、命令８及び命令９が含まれている。 The split loop shown in FIG. 28A includes instruction 1, instruction 2, instruction 3, instruction 4, instruction 6, instruction 8 and instruction 9 among the intermediate languages described in FIG.

また、図２８（Ａ）に示す分割ループは、命令１等の後に、変数ｒｅｇ３に設定された値を、配列ｔｍｐ＿ａｒｒａｙ１のｒｅｇ＿ｉ番目に格納することを示す命令である「ｓｔｏｒｅｔｍｐ＿ａｒｒａｙ１（ｒｅｇ＿ｉ），ｒｅｇ３」と、変数ｒｅｇ６に設定された値を、配列ｔｍｐ＿ａｒｒａｙ２のｒｅｇ＿ｉ番目に格納することを示す命令である「ｓｔｏｒｅｔｍｐ＿ａｒｒａｙ２（ｒｅｇ＿ｉ），ｒｅｇ６」と、変数ｒｅｇ９に設定された値を、配列ｔｍｐ＿ａｒｒａｙ３のｒｅｇ＿ｉ番目に格納することを示す命令である「ｓｔｏｒｅｔｍｐ＿ａｒｒａｙ３（ｒｅｇ＿ｉ），ｒｅｇ９」とを含む。 Further, the split loop shown in FIG. 28 (A) is an instruction indicating that the value set in the variable reg3 is stored in the reg_ith position of the array tp_array1 after the instruction 1 or the like, "store tp_array1 (reg_i), reg3". , Which is an instruction indicating that the value set in the variable reg6 is stored in the reg_ith position of the array tp_array2, and the value set in the variable reg9 is set in the reg_i of the array tp_array3. It includes "store tp_array3 (reg_i), reg9" which is an instruction indicating to store in the third position.

一方、図２８（Ｂ）に示す分割ループには、図９で説明した中間言語のうち、命令５、命令７、命令１０、命令１１、命令１２及び命令１３が含まれている。 On the other hand, the split loop shown in FIG. 28B includes instruction 5, instruction 7, instruction 10, instruction 11, instruction 12, and instruction 13 among the intermediate languages described in FIG.

また、図２８（Ｂ）に示す分割ループは、命令５等の前に、配列ｔｍｐ＿ａｒｒａｙ１のｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ３に設定することを示す命令である「ｌｏａｄｒｅｇ３，ｔｍｐ＿ａｒｒａｙ１（ｒｅｇ＿ｉ）」と、配列ｔｍｐ＿ａｒｒａｙ２のｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ６に設定することを示す命令である「ｌｏａｄｒｅｇ６，ｔｍｐ＿ａｒｒａｙ２（ｒｅｇ＿ｉ）」と、配列ｔｍｐ＿ａｒｒａｙ３のｒｅｇ＿ｉ番目に格納されている値を変数ｒｅｇ９に設定することを示す命令である「ｌｏａｄｒｅｇ９，ｔｍｐ＿ａｒｒａｙ３（ｒｅｇ＿ｉ）」とを含む。 Further, the split loop shown in FIG. 28 (B) is an instruction indicating that the value stored in the reg_i th of the array tp_array1 is set in the variable reg3 before the instruction 5 and the like, "load reg3, tp_array1 (reg_i). ) ”,“ Load reg6, tp_array2 (reg_i) ”, which is an instruction to set the value stored in the reg_ith position of the array tp_array2 in the variable reg6, and the value stored in the rig_ith position of the array tp_ary3 It includes "load reg9, tp_array3 (reg_i)" which is an instruction indicating that the variable reg9 is set.

すなわち、図２８に示す分割ループは、図２０で説明した分割ループよりも一時配列の数が増加している。しかしながら、図２８に示す分割ループでは、同じ配列に格納された各データを参照する各命令（命令４及び命令６）が同じ分割ループに含まれている。 That is, the number of temporary sequences in the split loop shown in FIG. 28 is larger than that in the split loop described in FIG. However, in the division loop shown in FIG. 28, each instruction (instruction 4 and instruction 6) that refers to each data stored in the same array is included in the same division loop.

これにより、情報処理装置１は、オブジェクトコード２３の実行時間をより短縮させることが可能になる。 As a result, the information processing device 1 can further shorten the execution time of the object code 23.

以上の実施の形態をまとめると、以下の付記のとおりである。 The above embodiments can be summarized as follows.

（付記１）
ソースコードから生成した中間言語のループに含まれる各命令間の依存関係を示す依存情報を生成し、生成した前記依存情報を行列に変換することによって第１行列を生成する情報生成部と、
生成した前記第１行列から算出した各命令間における依存度合いに基づいて、前記ループに含まれる各命令を複数のグループに振り分けるグループ振分部と、
振り分けた前記複数のグループごとに、前記ループの分割を行うループ分割部と、を有する、
ことを特徴とする情報処理装置。 (Appendix 1)
An information generation unit that generates a first matrix by generating dependency information indicating the dependency between each instruction included in the loop of the intermediate language generated from the source code and converting the generated dependency information into a matrix.
A group distribution unit that distributes each instruction included in the loop into a plurality of groups based on the degree of dependence between each instruction calculated from the generated first matrix.
Each of the plurality of sorted groups has a loop splitting unit for splitting the loop.
An information processing device characterized by this.

（付記２）
付記１において、
前記情報生成部は、
前記ループに含まれる命令の組合せごとに、各組合せに含まれる命令間に依存関係があることを前記依存情報が示しているか否かを判定し、
前記命令の組合せのうち、各組合せに含まれる命令間に依存関係があることを前記依存情報が示している組合せに対応する要素を第１の値とし、前記命令の組合せのうち、各組合せに含まれる命令間に依存関係があることを前記依存情報が示していない組合せに対応する要素を第２の値とすることにより、前記第１行列の生成を行う、
ことを特徴とする情報処理装置。 (Appendix 2)
In Appendix 1,
The information generation unit
For each combination of instructions included in the loop, it is determined whether or not the dependency information indicates that there is a dependency between the instructions included in each combination.
Among the combinations of the instructions, the element corresponding to the combination in which the dependency information indicates that there is a dependency between the instructions included in each combination is set as the first value, and each combination of the combinations of the instructions has The first matrix is generated by setting the element corresponding to the combination in which the dependency information does not indicate that there is a dependency between the included instructions as the second value.
An information processing device characterized by this.

（付記３）
付記１において、
前記グループ振分部は、異なるグループにそれぞれ含まれる命令間において存在する依存関係が少なくなるように、前記ループに含まれる各命令を複数のグループに振り分ける、
ことを特徴とする情報処理装置。 (Appendix 3)
In Appendix 1,
The group distribution unit distributes each instruction included in the loop to a plurality of groups so that the dependency relationship existing between the instructions included in the different groups is reduced.
An information processing device characterized by this.

（付記４）
付記１において、
前記グループ振分部は、
前記第１行列から各命令間における依存度合いを示す第２行列を生成し、
生成した前記第２行列に基づいて、前記ループに含まれる各命令を前記複数のグループに振り分ける、
ことを特徴とする情報処理装置。 (Appendix 4)
In Appendix 1,
The group distribution section
From the first matrix, a second matrix showing the degree of dependence between each instruction is generated.
Based on the generated second matrix, each instruction included in the loop is distributed to the plurality of groups.
An information processing device characterized by this.

（付記５）
付記４において、
前記グループ振分部は、
前記ループに含まれる命令の組合せごとに、各組合せに含まれる命令間における依存度合いを算出し、
算出した前記依存度合いのそれぞれを要素とすることにより、前記第２行列を生成する、
ことを特徴とする情報処理装置。 (Appendix 5)
In Appendix 4,
The group distribution section
For each combination of instructions included in the loop, the degree of dependence between the instructions included in each combination is calculated.
The second matrix is generated by using each of the calculated degrees of dependence as an element.
An information processing device characterized by this.

（付記６）
付記５において、
前記グループ振分部は、Ｎｅｗｍａｎアルゴリズムを用いることにより、前記依存度合いを算出する、
ことを特徴とする情報処理装置。 (Appendix 6)
In Appendix 5,
The group distribution unit calculates the degree of dependence by using the Newman algorithm.
An information processing device characterized by this.

（付記７）
付記５において、
前記グループ振分部は、
前記第２行列の要素の値のうち、最大の値に対応する複数の命令の振り分け先を同じグループに決定し、
前記第２行列における前記複数の命令に対応する複数の行を、前記複数の行における同一列ごとの要素の和を要素とする単一の行に変換し、かつ、前記第２行列における前記複数の命令に対応する複数の列を、前記複数の列における同一行ごとの要素の和を要素とする単一の列に変換することによって、前記第１行列を再生成し、
再生成した前記第１行列から前記第２行列を再生成し、
前記ループに含まれる各命令の振り分け先として決定したグループの数が所定数以下になるまで、前記決定する処理と前記第１行列を再生成する処理と前記第２行列を再生成する処理とを繰り返す、
ことを特徴とする情報処理装置。 (Appendix 7)
In Appendix 5,
The group distribution section
Among the values of the elements of the second matrix, the distribution destinations of the plurality of instructions corresponding to the maximum values are determined in the same group.
The plurality of rows corresponding to the plurality of instructions in the second matrix are converted into a single row having the sum of the elements of the same column in the plurality of rows as elements, and the plurality of rows in the second matrix. The first matrix is regenerated by converting the plurality of columns corresponding to the instruction of to a single column having the sum of the elements of the same row in the plurality of columns as elements.
The second matrix is regenerated from the regenerated first matrix, and the second matrix is regenerated.
Until the number of groups determined as the distribution destination of each instruction included in the loop becomes a predetermined number or less, the process of determining, the process of regenerating the first matrix, and the process of regenerating the second matrix are performed. repeat,
An information processing device characterized by this.

（付記８）
付記２において、さらに、
近接する記憶領域に格納された各データに対してアクセスを行う複数の命令が前記ループに含まれているか否かを判定する近接判定部を有し、
前記情報生成部は、前記複数の命令が前記ループに含まれているか否かの判定結果に基づいて、前記第１行列の生成を行う、
ことを特徴とする情報処理装置。 (Appendix 8)
In Appendix 2, further
It has a proximity determination unit that determines whether or not a plurality of instructions for accessing each data stored in adjacent storage areas are included in the loop.
The information generation unit generates the first matrix based on the determination result of whether or not the plurality of instructions are included in the loop.
An information processing device characterized by this.

（付記９）
付記８において、
前記情報生成部は、前記複数の命令が前記ループに含まれていると判定した場合、前記複数の命令間に対応する要素を前記第１の値よりも大きい第３の値とすることにより、前記第１行列の生成を行う、
ことを特徴とする情報処理装置。 (Appendix 9)
In Appendix 8,
When the information generation unit determines that the plurality of instructions are included in the loop, the information generation unit sets the element corresponding to the plurality of instructions to a third value larger than the first value. The first matrix is generated.
An information processing device characterized by this.

（付記１０）
付記８において、
前記近接する記憶領域に格納された各データは、同一の配列である、
ことを特徴とする情報処理装置。 (Appendix 10)
In Appendix 8,
Each data stored in the adjacent storage area is the same array.
An information processing device characterized by this.

（付記１１）
ソースコードから生成した中間言語のループに含まれる各命令間の依存関係を示す依存情報を生成し、
生成した前記依存情報を行列に変換することによって第１行列を生成し、
生成した前記第１行列から算出した各命令間における依存度合いに基づいて、前記ループに含まれる各命令を複数のグループに振り分け、
振り分けた前記複数のグループごとに、前記ループの分割を行う、
処理をコンピュータに実行させることを特徴とするコンパイラプログラム。 (Appendix 11)
Generates dependency information that shows the dependency between each instruction included in the intermediate language loop generated from the source code.
The first matrix is generated by converting the generated dependency information into a matrix.
Based on the degree of dependence between each instruction calculated from the generated first matrix, each instruction included in the loop is divided into a plurality of groups.
The loop is divided for each of the plurality of sorted groups.
A compiler program characterized by having a computer perform processing.

（付記１２）
付記１１において、
前記第１行列を生成する処理では、
前記ループに含まれる命令の組合せごとに、各組合せに含まれる命令間に依存関係があることを前記依存情報が示しているか否かを判定し、
前記命令の組合せのうち、各組合せに含まれる命令間に依存関係があることを前記依存情報が示している組合せに対応する要素を第１の値とし、前記命令の組合せのうち、各組合せに含まれる命令間に依存関係があることを前記依存情報が示していない組合せに対応する要素を第２の値とすることにより、前記第１行列の生成を行う、
ことを特徴とするコンパイラプログラム。 (Appendix 12)
In Appendix 11,
In the process of generating the first matrix,
For each combination of instructions included in the loop, it is determined whether or not the dependency information indicates that there is a dependency between the instructions included in each combination.
Among the combinations of the instructions, the element corresponding to the combination in which the dependency information indicates that there is a dependency between the instructions included in each combination is set as the first value, and each combination of the combinations of the instructions has The first matrix is generated by setting the element corresponding to the combination in which the dependency information does not indicate that there is a dependency between the included instructions as the second value.
A compiler program characterized by that.

（付記１３）
付記１１において、
前記複数のグループに振り分ける処理では、異なるグループにそれぞれ含まれる命令間において存在する依存関係が少なくなるように、前記ループに含まれる各命令を複数のグループに振り分ける、
ことを特徴とするコンパイラプログラム。 (Appendix 13)
In Appendix 11,
In the process of distributing to a plurality of groups, each instruction included in the loop is distributed to a plurality of groups so that the dependency existing between the instructions included in the different groups is reduced.
A compiler program characterized by that.

（付記１４）
付記１１において、
前記複数のグループに振り分ける処理では、
前記第１行列から各命令間における依存度合いを示す第２行列を生成し、
生成した前記第２行列に基づいて、前記ループに含まれる各命令を前記複数のグループに振り分ける、
ことを特徴とするコンパイラプログラム。 (Appendix 14)
In Appendix 11,
In the process of distributing to a plurality of groups,
From the first matrix, a second matrix showing the degree of dependence between each instruction is generated.
Based on the generated second matrix, each instruction included in the loop is distributed to the plurality of groups.
A compiler program characterized by that.

１：情報処理装置５：操作端末
２１：ソースコード２２：中間言語
２３：オブジェクトコード１３０：記憶部
ＮＷ：ネットワーク 1: Information processing device 5: Operation terminal 21: Source code 22: Intermediate language 23: Object code 130: Storage unit NW: Network

Claims

An information generation unit that generates a first matrix by generating dependency information indicating the dependency between each instruction included in the loop of the intermediate language generated from the source code and converting the generated dependency information into a matrix.
A group distribution unit that distributes each instruction included in the loop into a plurality of groups based on the degree of dependence between each instruction calculated from the generated first matrix.
Each of the plurality of sorted groups has a loop splitting unit for splitting the loop.
An information processing device characterized by this.

In claim 1,
The information generation unit
For each combination of instructions included in the loop, it is determined whether or not the dependency information indicates that there is a dependency between the instructions included in each combination.
Among the combinations of the instructions, the element corresponding to the combination in which the dependency information indicates that there is a dependency between the instructions included in each combination is set as the first value, and each combination of the combinations of the instructions has The first matrix is generated by setting the element corresponding to the combination in which the dependency information does not indicate that there is a dependency between the included instructions as the second value.
An information processing device characterized by this.

In claim 1,
The group distribution unit distributes each instruction included in the loop to a plurality of groups so that the dependency relationship existing between the instructions included in the different groups is reduced.
An information processing device characterized by this.

In claim 1,
The group distribution section
From the first matrix, a second matrix showing the degree of dependence between each instruction is generated.
Based on the generated second matrix, each instruction included in the loop is distributed to the plurality of groups.
An information processing device characterized by this.

In claim 4,
The group distribution section
For each combination of instructions included in the loop, the degree of dependence between the instructions included in each combination is calculated.
The second matrix is generated by using each of the calculated degrees of dependence as an element.
An information processing device characterized by this.

In claim 5,
The group distribution unit calculates the degree of dependence by using the Newman algorithm.
An information processing device characterized by this.

In claim 5,
The group distribution section
Among the values of the elements of the second matrix, the distribution destinations of the plurality of instructions corresponding to the maximum values are determined in the same group.
The plurality of rows corresponding to the plurality of instructions in the second matrix are converted into a single row having the sum of the elements of the same column in the plurality of rows as elements, and the plurality of rows in the second matrix. The first matrix is regenerated by converting the plurality of columns corresponding to the instruction of to a single column having the sum of the elements of the same row in the plurality of columns as elements.
The second matrix is regenerated from the regenerated first matrix, and the second matrix is regenerated.
Until the number of groups determined as the distribution destination of each instruction included in the loop becomes a predetermined number or less, the process of determining, the process of regenerating the first matrix, and the process of regenerating the second matrix are performed. repeat,
An information processing device characterized by this.

In claim 2, further
It has a proximity determination unit that determines whether or not a plurality of instructions for accessing each data stored in adjacent storage areas are included in the loop.
The information generation unit generates the first matrix based on the determination result of whether or not the plurality of instructions are included in the loop.
An information processing device characterized by this.

In claim 8.
When the information generation unit determines that the plurality of instructions are included in the loop, the information generation unit sets the element corresponding to the plurality of instructions to a third value larger than the first value. The first matrix is generated.
An information processing device characterized by this.

In claim 8.
Each data stored in the adjacent storage area is the same array.
An information processing device characterized by this.

Generates dependency information that shows the dependency between each instruction included in the intermediate language loop generated from the source code.
The first matrix is generated by converting the generated dependency information into a matrix.
Based on the degree of dependence between each instruction calculated from the generated first matrix, each instruction included in the loop is divided into a plurality of groups.
The loop is divided for each of the plurality of sorted groups.
A compiler program characterized by having a computer perform processing.