
JP2015210740A - Compilation method, compilation device, and compilation program

Info

Publication number: JP2015210740A
Application number: JP2014093212A
Authority: JP
Inventors: Takahiro Ishikawa (石川 貴洋)
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2014-04-28
Filing date: 2014-04-28
Publication date: 2015-11-24
Anticipated expiration: 2034-04-28
Also published as: JP6264179B2

Abstract

PROBLEM TO BE SOLVED: To shorten the compilation time of a source program. SOLUTION: A compiling device 100 extracts duplicate method candidates from the group of methods in intermediate language I code obtained through syntax analysis and semantic analysis of a source program sc. Based on the processing logic specified by the intermediate language I code of each extracted duplicate method candidate, the compiling device 100 generates correspondence relationship information 110 between the methods of the duplicate method candidates. Based on the generated correspondence relationship information 110, the compiling device 100 determines whether the methods of the duplicate method candidates are redundant in terms of execution, and performs merge processing on the duplicate method candidates whose methods are redundant.

Description

The present invention relates to a compiling method, a compiling device, and a compiling program.

Conventionally, software in the field of computational science, and in High Performance Computing (HPC) in particular, has been written mainly in Fortran. In recent years, however, this field has also seen an increase in software written not only in C but also in object-oriented languages such as C++ and Java (registered trademark), because of their high productivity and reusability.

Prior art includes, for example, a technique for encapsulating a reference to a method in an object-based programming system and guaranteeing that the reference is safe. There is also a technique that, in a script language processing apparatus that executes an object-oriented program with an interpreter, allows a grouped description when the same method is described for a plurality of objects.

JP-T-2002-517815
JP-A-10-171656

However, with the conventional techniques, a source program written in an object-oriented language often contains a large number of methods because of the linguistic features of such languages, and this increases the compilation time of the source program.

In one aspect, an object of the present invention is to provide a compiling method, a compiling device, and a compiling program that shorten the compilation time of a source program.

According to one aspect of the present invention, a compiling method, a compiling device, and a compiling program are proposed that extract a plurality of merge candidate methods from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program, based on at least one of an inheritance relationship and a name; extract, from the plurality of merge candidate methods, a plurality of mergeable methods based on the processing logic specified by the intermediate language code of each merge candidate method; and perform merge processing on the plurality of mergeable methods.

According to one aspect of the present invention, the compilation time of a source program can be shortened.

FIG. 1 is an explanatory diagram illustrating an example of the compiling method according to the embodiment.
FIG. 2 is a block diagram illustrating a hardware configuration example of the compiling device 100.
FIG. 3 is an explanatory diagram illustrating a specific example of the source program sc.
FIG. 4 is an explanatory diagram illustrating a specific example of the intermediate language I code.
FIG. 5 is a block diagram illustrating a functional configuration example of the compiling device 100.
FIG. 6 is an explanatory diagram (part 1) illustrating an example of comparing data between the methods of duplicate method candidates.
FIG. 7 is an explanatory diagram (part 2) illustrating an example of comparing data between the methods of duplicate method candidates.
FIG. 8 is an explanatory diagram (part 3) illustrating an example of comparing data between the methods of duplicate method candidates.
FIG. 9 is an explanatory diagram (part 4) illustrating an example of comparing data between the methods of duplicate method candidates.
FIG. 10 is an explanatory diagram illustrating merge processing of the SetColor methods.
FIG. 11 is an explanatory diagram illustrating an example of the intermediate language I code of the method body obtained by merging the SetColor methods.
FIG. 12 is an explanatory diagram illustrating an example of a source program in which duplicate method candidates are not merged.
FIG. 13 is a flowchart illustrating an example of the compile processing procedure of the compiling device 100.
FIG. 14 is a flowchart illustrating an example of a specific processing procedure of the duplicate method candidate extraction processing.
FIG. 15 is a flowchart illustrating an example of a specific processing procedure of the correspondence relationship information generation processing.
FIG. 16 is a flowchart illustrating an example of a specific processing procedure of the merge processing.

Embodiments of a compiling method, a compiling device, and a compiling program according to the present invention are described below in detail with reference to the drawings.

(Example of the compiling method)
FIG. 1 is an explanatory diagram illustrating an example of the compiling method according to the embodiment. In FIG. 1, the compiling device 100 is a computer that compiles a source program sc. The source program sc contains source code written in an object-oriented language, for example Java or C++.

The compilation process of a compiler is broadly divided into a syntax/semantic analysis section, an optimization section, and a code generation section. The syntax/semantic analysis section takes the source program sc as input, performs syntax analysis and semantic analysis on it, and generates intermediate language I code, which is the interface used within the syntax/semantic analysis section.

The intermediate language I code represents the methods (functions) in the source program sc. A method is a group of instructions that implements a particular process. The syntax/semantic analysis section also takes the intermediate language I code as input and generates intermediate language II code, which is the interface used within the compiler.

The optimization section takes the intermediate language II code as input and optimizes it. In the optimization process, for example, methods regarded as unnecessary are deleted from the intermediate language II code. An example of a method regarded as unnecessary is a static method that is limited to its translation unit and is never used.
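The deletion of unused translation-unit-local static methods described above can be sketched as follows. This is a minimal illustration, not the optimizer of the compiling device 100: the `Method` record, its `is_static` and `calls` fields, and the function `remove_unused_static_methods` are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field

@dataclass
class Method:
    """Hypothetical stand-in for one method in intermediate language II code."""
    name: str
    is_static: bool = False                    # visible only inside its translation unit
    calls: set = field(default_factory=set)    # names of the methods this one calls

def remove_unused_static_methods(methods):
    """Repeatedly drop static methods that no remaining method calls."""
    kept = list(methods)
    changed = True
    while changed:
        called = set()
        for m in kept:
            called |= m.calls - {m.name}       # a self-recursive call does not count as a use
        survivors = [m for m in kept if not m.is_static or m.name in called]
        changed = len(survivors) != len(kept)
        kept = survivors
    return kept

unit = [
    Method("main", calls={"helper"}),
    Method("helper", is_static=True),
    Method("unused", is_static=True, calls={"unused_leaf"}),
    Method("unused_leaf", is_static=True),
]
remaining = [m.name for m in remove_unused_static_methods(unit)]
print(remaining)
```

The loop runs to a fixed point so that a static method used only by another deleted static method is removed as well.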

The code generation section takes the intermediate language II code as input and generates an object file f. The object file f contains object code in which instructions and data are expressed in machine language. The object file f is passed, for example, to a program called a linker, which ultimately generates an executable program.

Object-oriented languages offer higher productivity and reusability than languages such as Fortran and C. On the other hand, because of the concepts of encapsulation, overriding, and overloading that characterize object-oriented languages, many methods are created that bundle data operations and processing logic for each data type.

Furthermore, for template functions in an object-oriented language, the compiler creates multiple methods with the same processing logic, depending on the types and values of the template arguments. Encapsulation means combining data and the procedures (methods) that operate on that data into a single "object" and hiding the detailed specifications and structure inside the object from the outside. The methods in a class can be said to be written according to the concept of encapsulation. A class defines the template of an object and describes which fields (variables) and methods the object has.

Overriding means redefining, in a derived class (subclass) that inherits from a base class (superclass), a method defined in the base class. Overloading means defining multiple methods or operators with the same name that differ in the type, number, or order of their arguments.

As the number of methods in the source program sc grows, both the compilation time of the source program sc and the data size of the generated object file f increase. In other words, in an object-oriented language, the price of high productivity and reusability is a longer compilation time for the source program sc and a larger object file f.

Reducing the absolute number of methods is an effective way to shorten the compilation time of the source program sc and to reduce the data size of the object file f. Conventional compilers delete methods regarded as unnecessary during the optimization process in the optimization section.

In a conventional compiler, however, every method in the source program sc is translated into the object file f unless it is regarded as unnecessary. As described above, an object-oriented language produces a large number of methods, so deleting only the methods regarded as unnecessary does not change the absolute number of methods much, and little reduction in compilation time can be expected.

The absolute number of methods can be reduced by merging into a single method a plurality of methods whose data and processing logic are identical, that is, methods that are redundant in terms of execution. For example, methods that override or overload one another are likely to be redundant in terms of execution and are therefore candidates for merging.

One conceivable approach is therefore for the optimization process in the optimization section not only to delete methods regarded as unnecessary but also to identify, from the linguistic features of the object-oriented language, methods that are redundant in terms of execution and merge them into a single method. The optimization section, however, basically performs language-independent processing that does not depend on whether the language is Fortran, C, C++, or another language. As a result, the optimization section has little information specific to the object-oriented language, and it is difficult to perform optimization specific to object-oriented languages there.

In other words, because the optimization section has little information specific to the object-oriented language, it is difficult to determine which of the many methods are redundant in terms of execution as a result of the linguistic features of the object-oriented language. Therefore, even if the optimization process in the optimization section attempts to merge methods, it cannot effectively reduce the number of methods within a realistic, finite time.

In addition, in an object-oriented language, redundancy in the executed code does not necessarily appear in the source program sc; for example, the compiler itself may generate code that is redundant in terms of execution. Furthermore, encapsulation, overriding, and overloading cannot be identified from the surface text of the source program sc alone; they can be identified accurately only by performing syntax and semantic analysis.

Moreover, most of the compilation time of the source program sc is spent on the optimization process in the optimization section, so the compilation time cannot be expected to decrease unless the number of methods has already been reduced when the code enters the optimization section. For these reasons, shortening the compilation time and reducing the data size of the object file f requires, in addition to the optimization process in the optimization section, optimization at the source program level (closer to the source program sc) in the syntax/semantic analysis section, where the linguistic features of the object-oriented language can be determined.

This embodiment therefore describes a compiling method that reduces the absolute number of methods: in the syntax/semantic analysis section, where the linguistic features of the object-oriented language can be determined, methods that are redundant in terms of execution are identified from those linguistic features and merged into a single method. An example of the compile processing of the compiling device 100 is described below.

(1) In the syntax/semantic analysis section, the compiling device 100 performs syntax analysis and semantic analysis of the source program sc. Syntax analysis checks whether the statements and structures written in the source program sc conform to the language specification. Semantic analysis checks whether the variable types and statements written in the source program sc are semantically correct.

Performing syntax analysis and semantic analysis of the source program sc yields the intermediate language I code, which is the interface used within the syntax/semantic analysis section. The intermediate language I code represents the methods in the source program sc. For methods that have object-oriented characteristics, the intermediate language I code also contains information representing those characteristics (marked "(オ)" in FIG. 1).

(2) In the syntax/semantic analysis section, the compiling device 100 extracts a plurality of merge candidate methods from the plurality of methods in the intermediate language I code obtained by syntax analysis and semantic analysis of the source program sc, based on at least one of an inheritance relationship and a name. The merge candidate methods are methods that may be redundant in terms of execution.

In the following description, a plurality of merge candidate methods may be referred to collectively as "duplicate method candidates". Examples of duplicate method candidates include overridden methods and overloaded methods.

Specifically, for example, the compiling device 100 refers to the intermediate language I code of the source program sc and extracts, as duplicate method candidates, a plurality of methods that have the same name and belong to classes in a parent-child relationship. In this way, overridden methods can be extracted as duplicate method candidates.
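The extraction of same-named methods across classes in a parent-child relationship can be sketched as follows. This is a simplified illustration under assumptions: the `MethodInfo` record, the `parent_of` mapping from class name to base class name, and all function names are hypothetical, and the sketch groups purely by method name without separating unrelated hierarchies that happen to share a name.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass(frozen=True)
class MethodInfo:
    """Hypothetical summary of one method found in the intermediate language I code."""
    class_name: str
    method_name: str

def ancestors(cls, parent_of):
    """Yield cls followed by each of its base classes, nearest first."""
    while cls is not None:
        yield cls
        cls = parent_of.get(cls)

def related(a, b, parent_of):
    """True if one class is the other class or one of its ancestors."""
    return a in ancestors(b, parent_of) or b in ancestors(a, parent_of)

def extract_duplicate_candidates(methods, parent_of):
    """Group same-named methods whose classes are in a parent-child relationship."""
    by_name = defaultdict(list)
    for m in methods:
        by_name[m.method_name].append(m)
    groups = []
    for same_name in by_name.values():
        group = [m for m in same_name
                 if any(o.class_name != m.class_name
                        and related(m.class_name, o.class_name, parent_of)
                        for o in same_name)]
        if group:                     # a non-empty group always has at least two members
            groups.append(group)
    return groups

parent_of = {"COLOR": None, "RED": "COLOR", "SHAPE": None}
methods = [
    MethodInfo("COLOR", "SetColor"),
    MethodInfo("RED", "SetColor"),     # overrides COLOR.SetColor
    MethodInfo("SHAPE", "SetColor"),   # same name, unrelated class
    MethodInfo("COLOR", "GetColor"),
]
candidates = extract_duplicate_candidates(methods, parent_of)
print(candidates)
```

Only the two `SetColor` methods of `COLOR` and `RED` end up in a candidate group; the `SHAPE` method is excluded because its class has no parent-child relationship with the others.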

(3) In the syntax/semantic analysis section, the compiling device 100 generates correspondence relationship information 110 between the methods of the duplicate method candidates, based on the processing logic specified by the intermediate language I code of each extracted duplicate method candidate. The correspondence relationship information 110 is used to determine whether the methods of the duplicate method candidates are redundant in terms of execution.

Specifically, for example, the compiling device 100 extracts, from the intermediate language I code of the source program sc, the intermediate language I code corresponding to each duplicate method candidate. Next, the compiling device 100 compares the data between the methods for each item of declaration, statement, and expression data in the extracted intermediate language I code.

Then, based on the comparison result for each item of data, the compiling device 100 generates, as the correspondence relationship information 110, information that identifies which parts of the intermediate language I code of the duplicate method candidates are semantically identical between the methods and which parts are semantically different (the parts unique to each method).
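The per-item comparison and the resulting same/different marking can be sketched as follows. This is only an illustration: the intermediate language I code is modeled here as a list of hypothetical `(kind, text)` tuples, plain equality stands in for a real semantic-equivalence check, and the dictionary layout of the result is an assumption rather than the format of the correspondence relationship information 110.

```python
from itertools import zip_longest

def build_correspondence(il_a, il_b):
    """Compare two methods item by item and record which items match.

    Each item is a (kind, text) tuple, where kind is "declaration",
    "statement", or "expression". Equality of the tuples is used as a
    stand-in for semantic identity.
    """
    info = []
    for index, (a, b) in enumerate(zip_longest(il_a, il_b)):
        kind = (a or b)[0]            # one side may be missing if the lengths differ
        info.append({"index": index, "kind": kind, "same": a == b, "a": a, "b": b})
    return info

# Hypothetical item lists for a base-class method and its override.
base_method = [
    ("declaration", "void SetColor(int c)"),
    ("statement", "assignment"),
    ("expression", "value = c"),
]
derived_method = [
    ("declaration", "void SetColor(int c)"),
    ("statement", "assignment"),
    ("expression", "value = c + offset"),
]
correspondence = build_correspondence(base_method, derived_method)
same_flags = [entry["same"] for entry in correspondence]
print(same_flags)
```

Entries with `same` set to `False` mark the method-specific parts that a later merge step would have to keep separate.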

(4) In the syntax/semantic analysis section, the compiling device 100 determines, based on the generated correspondence relationship information 110, whether the methods of the duplicate method candidates are redundant in terms of execution. Specifically, for example, the compiling device 100 may determine that the methods are redundant in terms of execution when at least one of the declaration, statement, and expression data items is semantically identical. Duplicate method candidates whose methods are redundant in terms of execution constitute the plurality of mergeable methods.
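The example criterion in step (4), that at least one data item is semantically identical, reduces to a simple predicate. The sketch below assumes a hypothetical list-of-dictionaries form for the correspondence relationship information, with a boolean `same` flag per compared item; neither the layout nor the function name comes from the source.

```python
def has_execution_redundancy(correspondence):
    """Return True if at least one compared data item is marked identical."""
    return any(entry["same"] for entry in correspondence)

# Hypothetical correspondence entries for two pairs of methods.
partly_shared = [
    {"kind": "declaration", "same": True},
    {"kind": "statement", "same": True},
    {"kind": "expression", "same": False},
]
nothing_shared = [
    {"kind": "declaration", "same": False},
    {"kind": "statement", "same": False},
]
mergeable = has_execution_redundancy(partly_shared)
not_mergeable = has_execution_redundancy(nothing_shared)
print(mergeable, not_mergeable)
```

A stricter threshold (for example, requiring a minimum share of identical items) would be a different design choice; the source only gives the "at least one" condition as an example.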

(5) In the syntax/semantic analysis section, when the compiling device 100 determines that the methods of the duplicate method candidates are redundant in terms of execution, it performs merge processing on the duplicate method candidates based on the correspondence relationship information 110. That is, the compiling device 100 extracts a plurality of mergeable methods from the duplicate method candidates and merges them. Merge processing here means combining the plurality of methods that are duplicate method candidates into a single method that can be called from every caller of any of those methods.

For example, suppose that three methods, method A, method B, and method C, are redundant with one another. Merging these three methods into one, for example as method ABC, reduces the number of methods to one third. For method-specific parts that cannot be shared, the method-specific information is passed to the merged method, for example as an argument, so that the method-specific processing can be selected.

Specifically, for example, the compiling device 100 generates intermediate language I code for the processing common to the methods of the duplicate method candidates, and also generates intermediate language I code for the processing unique to each method. The compiling device 100 then merges the generated intermediate language I code to produce the intermediate language I code of the method body in which the duplicate method candidates are merged.
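The effect of the merge can be pictured at the source level as follows. The device described here merges intermediate language I code, not source text, so this is only an analogy: the two original functions, their bodies, the `+ 100` offset, and the `variant` argument are all hypothetical and chosen to show a common part plus a method-specific part selected by an argument.

```python
# Hypothetical originals: a base-class method and its override.
def set_color_base(obj, value):
    obj["color"] = value            # method-specific part
    obj["updated"] = True           # common part

def set_color_red(obj, value):
    obj["color"] = value + 100      # method-specific part (assumed offset)
    obj["updated"] = True           # common part

# Merged method: callable from the call sites of both originals.
def set_color_merged(obj, value, variant):
    if variant == "RED":            # method-specific information passed as an argument
        obj["color"] = value + 100
    else:                           # "COLOR"
        obj["color"] = value
    obj["updated"] = True           # common part, emitted once

def run(fn, *args):
    """Apply fn to a fresh object and return the resulting state."""
    obj = {}
    fn(obj, *args)
    return obj

same_for_base = run(set_color_base, 5) == run(set_color_merged, 5, "COLOR")
same_for_red = run(set_color_red, 5) == run(set_color_merged, 5, "RED")
print(same_for_base, same_for_red)
```

Each original call site would be rewritten to call the merged method with the matching `variant`, which leaves one method body for the later compilation stages instead of two.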

(6) In the syntax/semantic analysis section, the compiling device 100 generates intermediate language II code, which is the interface used within the compiler, based on the intermediate language I code of the merged method body. Specifically, for example, the compiling device 100 converts the methods expressed in intermediate language I code (both merged method bodies and unmerged methods) into the intermediate language II code format. The generated intermediate language II code is input to the optimization section.

In FIG. 1, the dotted frames in the intermediate language II code input to the optimization section indicate methods that were deleted because they were merged. In the example of FIG. 1, the 12 methods present at the intermediate language I code stage of the source program sc have been reduced to 5 at the intermediate language II code stage, so the absolute number of methods is greatly reduced by the time the code enters the optimization section.

(7) In the optimization section, the compiling device 100 optimizes the intermediate language II code of the source program sc. In the optimization process, for example, methods regarded as unnecessary are deleted from the intermediate language II code. In the example of FIG. 1, one method regarded as unnecessary is deleted from the intermediate language II code. The optimized intermediate language II code is input to the code generation section.

(8) In the code generation section, the compiling device 100 generates the object file f based on the optimized intermediate language II code. The generated object file f is passed, for example, to a linker, which ultimately generates an executable program.

As described above, the compiling device 100 according to the embodiment can reduce the absolute number of methods by performing, in the syntax/semantic analysis section, effective optimization that matches the characteristics specific to object-oriented languages. Because the absolute number of methods is already reduced when the code enters the optimization section, the optimization process takes less time, and as a result the compilation time of the source program sc is shortened. Reducing the absolute number of methods also reduces the data size of the object file f.

(Hardware configuration example of the compiling device 100)
FIG. 2 is a block diagram illustrating a hardware configuration example of the compiling device 100. In FIG. 2, the compiling device 100 includes a CPU (Central Processing Unit) 201, a memory 202, a disk drive 203, a disk 204, an I/F (Interface) 205, a display 206, a keyboard 207, and a mouse 208. These components are connected to one another by a bus 200.

The CPU 201 controls the entire compiling device 100. The memory 202 includes, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), and a flash ROM. Specifically, for example, the flash ROM and the ROM store various programs (for example, the compiling program), and the RAM is used as a work area for the CPU 201. A program stored in the memory 202 is loaded into the CPU 201 and causes the CPU 201 to execute the coded processing.

The disk drive 203 controls the reading and writing of data from and to the disk 204 under the control of the CPU 201. The disk 204 stores the data written under the control of the disk drive 203. Examples of the disk 204 include a magnetic disk and an optical disk.

The I/F 205 is connected to a network 210 through a communication line and, via the network 210, to other computers. The I/F 205 serves as the interface between the network 210 and the inside of the device, and controls the input and output of data from and to other computers. A modem or a LAN adapter, for example, can be used as the I/F 205.

The display 206 displays a cursor, icons, and toolboxes, as well as data such as documents, images, and function information. A CRT (Cathode Ray Tube), a TFT (Thin Film Transistor) liquid crystal display, or a plasma display, for example, can be used as the display 206.

The keyboard 207 has keys for entering characters, numbers, and various instructions, and is used to input data. The keyboard 207 may be a touch-panel input pad or a numeric keypad. The mouse 208 is used to move the cursor, select a range, move a window, change its size, and so on.

(Specific example of the source program sc)
FIG. 3 is an explanatory diagram illustrating a specific example of the source program sc. In FIG. 3, a source program 300 is an example of a typical object-oriented program written in an object-oriented language.

In the source program 300, the base class COLOR on line 3 defines the basic data and processing logic for a color. Line 10 defines the SetColor method of the base class COLOR. The derived class RED on line 22 inherits from the base class COLOR. Line 24 defines, as a method of the RED class, an override of the SetColor method of the base class COLOR.

(Specific example of intermediate language I code)
FIG. 4 is an explanatory diagram showing a specific example of the intermediate language I code. In FIG. 4, intermediate language I codes 410 and 420 schematically show the intermediate language I code obtained by syntax analysis and semantic analysis of the source program 300 (FIG. 3).

Specifically, the intermediate language I code 410 represents the SetColor method defined on line 10 of the source program 300. The intermediate language I code 420 represents the SetColor method defined on line 24 of the source program 300.

Here, the intermediate language I code 410 includes data 411 to 414. Data 411 represents the syntax and meaning of the declaration information of the SetColor method of the base class COLOR described in the source program 300. Data 412 to 414 represent the syntax and meaning of the statement information of the SetColor method of the base class COLOR, and include data 415 to 417, respectively. Data 415 to 417 each represent the syntax and meaning of the expression information of the SetColor method of the base class COLOR.

Similarly, the intermediate language I code 420 includes data 421 to 424. Data 421 represents the syntax and meaning of the declaration information of the SetColor method of the derived class RED described in the source program 300. Data 422 to 424 represent the syntax and meaning of the statement information of the SetColor method of the derived class RED, and include data 425 to 427, respectively. Data 425 to 427 each represent the syntax and meaning of the expression information of the SetColor method of the derived class RED.

The SetColor method uses the override (redefinition) feature of object-oriented languages: the SetColor method of the derived class RED overrides the SetColor method of the base class COLOR. The two SetColor methods are therefore methods that may have redundancy between them in terms of execution.

(Functional configuration example of the compiling device 100)
FIG. 5 is a block diagram illustrating a functional configuration example of the compiling device 100. In FIG. 5, the compiling device 100 includes an acquisition unit 501, an analysis unit 502, an extraction unit 503, a generation unit 504, a determination unit 505, a merge processing unit 506, an optimization unit 507, a code generation unit 508, and an output unit 509. The acquisition unit 501 through the output unit 509 function as a control unit. Specifically, for example, their functions are realized by causing the CPU 201 to execute a program stored in a storage device such as the memory 202 or the disk 204 illustrated in FIG. 2, or by the I/F 205. The processing result of each functional unit is stored in a storage device such as the memory 202 or the disk 204.

The acquisition unit 501 acquires the source program sc. The source program sc includes source code written in an object-oriented language such as Java or C++, and may also include source code written in, for example, Fortran or C.

Specifically, for example, the acquisition unit 501 acquires the source program sc (for example, the source program 300) through a user's operation input using the keyboard 207 or the mouse 208. Alternatively, the acquisition unit 501 may acquire the source program sc from an external computer through the I/F 205.

The analysis unit 502 performs syntax analysis and semantic analysis of the source program sc. Specifically, for example, the analysis unit 502 first performs lexical analysis of the source program sc. Lexical analysis converts the sequence of characters that make up the source program sc into a sequence of tokens such as keywords, variables, and operators.
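The lexical analysis step described above can be illustrated with a minimal sketch. The token classes, the keyword list, and the function name `tokenize` are assumptions made for illustration; the patent only states that characters are converted into tokens such as keywords, variables, and operators.

```python
import re

# Hypothetical token classes; the keyword list is illustrative only.
TOKEN_SPEC = [
    ("KEYWORD", r"\b(?:class|void|int|return)\b"),
    ("IDENT", r"[A-Za-z_]\w*"),
    ("NUMBER", r"\d+"),
    ("OPERATOR", r"[=+\-*/]"),
    ("PUNCT", r"[(){};,]"),
    ("SKIP", r"\s+"),
]
TOKEN_RE = re.compile("|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_SPEC))


def tokenize(text):
    """Convert a character sequence into a list of (token_class, lexeme) pairs."""
    tokens = []
    pos = 0
    while pos < len(text):
        match = TOKEN_RE.match(text, pos)
        if match is None:
            raise ValueError(f"unexpected character {text[pos]!r} at offset {pos}")
        if match.lastgroup != "SKIP":  # whitespace separates tokens but is not one
            tokens.append((match.lastgroup, match.group()))
        pos = match.end()
    return tokens
```

Keywords are listed before identifiers so that a reserved word is not classified as a variable name.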

The analysis unit 502 then performs syntax analysis and semantic analysis of the source program sc by analyzing the relationships between the tokens in the sequence obtained by lexical analysis. This yields the intermediate language I code, an interface internal to the syntax and semantic analysis phase, such as that shown in FIG. 4.

In the intermediate language I code, base classes and derived classes are represented in a tree-like structure, so that the parent-child relationships between classes can be identified. Each class and the methods belonging to it are likewise represented in a tree-like structure, so that it is possible to identify which method belongs to which class.

The extraction unit 503 extracts duplicate method candidates from the methods in the intermediate language I code obtained by syntax analysis and semantic analysis of the source program sc. As described above, duplicate method candidates are merge candidate methods that may have redundancy between them in terms of execution, for example, overridden or overloaded methods.

Specifically, for example, the extraction unit 503 refers to the generated intermediate language I code and extracts, as duplicate method candidates, combinations of methods that have the same name and belong to classes in a parent-child relationship. In this way, overridden methods can be extracted as duplicate method candidates.

Taking the source program 300 shown in FIG. 3 as an example, the base class COLOR and the derived class RED are in a parent-child relationship. The extraction unit 503 therefore refers to the intermediate language I code of the source program 300 (the intermediate language I code that includes the intermediate language I codes 410 and 420 shown in FIG. 4) and extracts, as duplicate method candidates, the SetColor methods that have the same method name in the base class COLOR and the derived class RED.
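As a rough illustration of this extraction, the sketch below walks a class tree and pairs same-named methods of a parent class and its direct child. The `ClassNode` type, the function `override_candidates`, and the `GetColor` method used in the usage note are hypothetical; the patent does not specify the data layout of the intermediate language I code, and whether indirect ancestors are also considered is left open here.

```python
from dataclasses import dataclass, field


@dataclass
class ClassNode:
    """Hypothetical tree node: a class, its method names, and its derived classes."""
    name: str
    methods: list
    children: list = field(default_factory=list)


def override_candidates(node):
    """Return (parent, child, method) triples for same-named methods
    in classes that have a direct parent-child relationship."""
    pairs = []
    for child in node.children:
        for method in child.methods:
            if method in node.methods:
                pairs.append((node.name, child.name, method))
        pairs.extend(override_candidates(child))  # descend into grandchildren
    return pairs
```

For a tree built as `ClassNode("COLOR", ["SetColor", "GetColor"], [ClassNode("RED", ["SetColor"])])`, the only candidate returned is the `SetColor` pair of COLOR and RED.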

The extraction unit 503 may also refer to the generated intermediate language I code and extract, as duplicate method candidates, combinations of methods with the same name among the methods in the same class, or among the methods that do not belong to any class. In this way, overloaded methods can be extracted as duplicate method candidates.

Likewise, the extraction unit 503 may refer to the generated intermediate language I code and extract, as duplicate method candidates, combinations of operators with the same name among the operators in the same class, or among the operators that do not belong to any class. In this way, overloaded operators can be extracted as duplicate method candidates.

Overloaded operators are operators that have the same name (method name) but differ in at least one of the type, number, or order of their arguments.
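A minimal sketch of overload detection follows, assuming each method or operator is summarized as an (owner class, name, parameter types) triple, with `None` as the owner of a method that belongs to no class. This representation and the function name `overload_candidates` are assumptions, not the patent's data format.

```python
from collections import defaultdict


def overload_candidates(methods):
    """Group methods by (owner, name) and keep groups whose parameter
    lists differ in type, number, or order.

    methods: iterable of (owner_class_or_None, method_name, param_types_tuple).
    """
    groups = defaultdict(list)
    for owner, name, params in methods:
        groups[(owner, name)].append(params)
    # Tuples compare by length, element, and position, so a set of more than
    # one signature means the type, count, or order of the arguments differs.
    return {key: sigs for key, sigs in groups.items() if len(set(sigs)) > 1}
```

Because operator functions can be keyed by name in the same way (for example `operator+`), the same grouping would cover overloaded operators.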

The generation unit 504 generates correspondence information 110 between the methods of the duplicate method candidates, based on the processing logic specified by the intermediate language I code of each extracted duplicate method candidate. Specifically, for example, the generation unit 504 extracts, from the intermediate language I code of the source program sc, the intermediate language I code corresponding to each duplicate method candidate.

Next, the generation unit 504 compares the data between the methods, for each item of declaration, statement, and expression data in the extracted intermediate language I code. In the intermediate language I code, the structure of declarations, statements, and expressions follows the order in which they are written in the source program sc. The generation unit 504 therefore compares the data between the methods sequentially, following the structure of the intermediate language I code.

Expression data in the intermediate language I code is contained in statement data. Therefore, comparing two items of statement data that contain expression data means comparing the expression data they contain, and if the expression data differ, the statement data differ as well.

Then, based on the comparison result for each item of data, the generation unit 504 generates, as the correspondence information 110, information that identifies which parts of the extracted intermediate language I code are semantically identical between the methods and which parts are semantically different between the methods (the parts specific to each method). Examples of comparing data between methods and of generating the correspondence information 110 are described later with reference to FIGS. 6 to 9.
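The sequential comparison can be sketched as follows. Each method is modeled as a list of (kind, payload) entries in source order, and plain equality of payloads stands in for the semantic comparison; both choices, the function name `build_correspondence`, and the placeholder payloads in the usage note are assumptions, since the contents of FIG. 4 are not reproduced in the text.

```python
def build_correspondence(method_a, method_b):
    """Compare two methods entry by entry, in source order.

    Each method is a list of (kind, payload) pairs, where kind is e.g.
    "Parameter" or "Statement". Returns one flag per position:
    True marks a part that differs between the methods,
    False marks a part that is identical.
    Assumes both methods have the same number of entries.
    """
    marks = []
    for (kind_a, data_a), (kind_b, data_b) in zip(method_a, method_b):
        marks.append(kind_a != kind_b or data_a != data_b)
    return marks
```

With one parameter entry and three statement entries per method, where the parameters and the second statement differ, the result is `[True, False, True, False]`.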

Based on the generated correspondence information 110, the determination unit 505 determines whether the methods of the duplicate method candidates have redundancy in terms of execution. Specifically, for example, the determination unit 505 may determine that the methods have redundancy in terms of execution when at least one item of the declaration, statement, and expression data in the intermediate language I code is semantically identical.

Alternatively, the determination unit 505 may determine that the methods of the duplicate method candidates have redundancy in terms of execution when at least a predetermined number X of the items of declaration, statement, and expression data in the intermediate language I code are semantically identical. The predetermined number X may be set arbitrarily in advance and stored in a storage device such as the memory 202 or the disk 204.

The predetermined number X may also be set to the total number of items of declaration, statement, and expression data in the intermediate language I code of one of the duplicate method candidates, multiplied by a ratio r. The ratio r is a value of 1 or less. The larger the ratio r, the stricter the criterion for determining redundancy between methods in terms of execution.
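The ratio-based criterion can be written as a short check. In this sketch, `marks` is a hypothetical list with one flag per item of data (True where the item differs between the methods), and the lower bound `0 < ratio` is an added assumption; the patent only states that r is 1 or less.

```python
def has_execution_redundancy(marks, ratio):
    """Return True when at least X = len(marks) * ratio items are identical.

    marks: one boolean per item of data, True where the methods differ.
    ratio: the ratio r (assumed here to satisfy 0 < r <= 1).
    """
    if not 0 < ratio <= 1:
        raise ValueError("ratio must satisfy 0 < ratio <= 1")
    threshold = len(marks) * ratio  # the predetermined number X
    identical = sum(1 for differs in marks if not differs)
    return identical >= threshold
```

With four items of which two are identical, r = 0.5 gives X = 2 and the methods are judged redundant, while r = 0.75 gives X = 3 and they are not.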

Based on the correspondence information 110, the merge processing unit 506 merges the duplicate method candidates that have been determined to have redundancy between methods in terms of execution. Specifically, for example, the merge processing unit 506 scans the intermediate language I code of each method of such duplicate method candidates, guided by the correspondence information 110, and generates the intermediate language I code of a method body in which the duplicate method candidates are merged.

More specifically, for example, the merge processing unit 506 generates, based on the correspondence information 110, the intermediate language I code for the processing that is common to the methods of the duplicate method candidates. The processing common to the methods is identified, for example, from the data of the parts that are semantically identical between the methods.

The merge processing unit 506 also generates, based on the correspondence information 110, the intermediate language I code for the processing that is specific to each method of the duplicate method candidates. The processing specific to each method is identified, for example, from the data of the parts that are semantically different between the methods. The merge processing unit 506 then merges the generated intermediate language I codes to produce the intermediate language I code of the method body in which the duplicate method candidates are merged.

The intermediate language I code of the merged method body is structured so that the class to which each original method belongs can be identified, which allows the method intended by the caller to be determined. An example of the merge processing of duplicate method candidates is described later with reference to FIGS. 10 and 11.
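The merge step can be sketched as a single pass over the two method bodies. Here each body is a list of statements, `marks` is a hypothetical list with one flag per statement (True where the statements differ), and the tags `"common"` and `"if_owner"` are illustrative names for the shared parts and the class-selected parts; none of these names come from the patent.

```python
def merge_bodies(body_a, body_b, marks, owner_a, owner_b):
    """Build a merged method body from two bodies and their difference flags.

    Identical statements are emitted once; differing statements are wrapped
    in a branch keyed by the class that owns each original method.
    Assumes both bodies and marks have the same length.
    """
    merged = []
    for stmt_a, stmt_b, differs in zip(body_a, body_b, marks):
        if differs:
            merged.append(("if_owner", {owner_a: stmt_a, owner_b: stmt_b}))
        else:
            merged.append(("common", stmt_a))
    return merged
```

Keeping the owning class as the branch key is one simple way to preserve the information needed to select the caller's intended method.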

In addition, based on the intermediate language I code of the merged method body, the merge processing unit 506 generates the intermediate language II code of the source program sc, which is an interface internal to the compiler (compile program). Specifically, for example, the merge processing unit 506 converts the methods expressed in the intermediate language I code (both merged method bodies and methods that were not merged) into the format of the intermediate language II code.

This makes it possible to generate an intermediate language II code in which the absolute number of methods is reduced, relative to the methods in the intermediate language I code of the source program sc, by the number of methods that were merged. The intermediate language II code is structured so that the caller of each merged method can reference the merged method body.

The optimization unit 507 optimizes the generated intermediate language II code of the source program sc. Specifically, for example, the optimization unit 507 treats any method that is not called from anywhere as unnecessary and deletes it from the intermediate language II code of the source program sc. This removes unused methods from the intermediate language II code and reduces the absolute number of methods.
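A minimal sketch of removing uncalled methods is shown below, assuming the call relationships are available as a mapping from each caller to the set of methods it calls. The `entry_points` parameter (methods kept even though nothing calls them, such as a program entry) and all method names in the usage note are assumptions for illustration.

```python
def remove_uncalled(methods, calls, entry_points):
    """Keep only methods that are called from somewhere or are entry points.

    methods: set of method names in the intermediate code.
    calls: dict mapping a caller name to the set of callee names.
    entry_points: names that must be kept although nothing calls them.
    """
    called = set(entry_points)
    for callees in calls.values():
        called |= callees
    return {name for name in methods if name in called}
```

For example, if `main` calls only a merged `SetColor` body, the two original `SetColor` methods have no remaining callers and are dropped. This is a single pass: a method called only by another deleted method would need a repeated pass or a reachability analysis.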

The code generation unit 508 generates an object file f of the source program sc based on the optimized intermediate language II code of the source program sc. Specifically, for example, the code generation unit 508 generates the object file f by converting the optimized intermediate language II code into a sequence of machine language instructions.

The output unit 509 outputs the generated object file f of the source program sc. Specifically, for example, the output unit 509 outputs the object file f to a program called a linker. A linker is a program that analyzes the object file f, adds the necessary libraries and the like, and generates an executable file. In this way, an executable file of the source program sc can be obtained.

The description above takes as an example the case where combinations of methods with the same method name are extracted as duplicate method candidates, but methods with different names may also have redundancy between them in terms of execution. However, determining execution redundancy for every combination of differently named methods would increase the processing time.

Methods of classes in a parent-child relationship, for example, tend to share more of their processing logic than other pairs of methods. Therefore, when redundancy in terms of execution is also to be determined between methods with different names, the extraction unit 503 may extract combinations of methods between classes having a parent-child relationship as duplicate method candidates.

(Example of data comparison between methods)
Next, an example of comparing data between the methods of duplicate method candidates is described, using the intermediate language I codes 410 and 420 shown in FIG. 4.

FIGS. 6 to 9 are explanatory diagrams illustrating an example of comparing data between the methods of duplicate method candidates. In FIG. 6, the generation unit 504 first selects the Parameter (declaration) data 411 from the intermediate language I code 410. Next, the generation unit 504 selects, from the intermediate language I code 420, the Parameter data 421, which is of the same kind as the data 411.

The generation unit 504 then compares the selected Parameter data 411 and 421. Here, the type, number, and order of the arguments (formal parameters) all differ between the data 411 and 421. The generation unit 504 therefore determines that the data 411 and 421 are semantically different, and generates, as the correspondence information 110, information identifying a part that is semantically different between the methods. In the example of FIG. 6, this information is expressed by attaching a check symbol. That is, the check symbols attached within the intermediate language I code correspond to the correspondence information 110.

In FIG. 7, the generation unit 504 first selects the Statement data 412 from the intermediate language I code 410. Next, the generation unit 504 selects, from the intermediate language I code 420, the Statement data 422, which is of the same kind as the data 412. The generation unit 504 then compares the Operator (expression) data 415 and 425 contained in the selected Statement data 412 and 422, respectively.

Here, the expressions represented by the data 415 and 425 are the same. The generation unit 504 therefore determines that the data 412 and 422 are semantically identical, and generates, as the correspondence information 110, information identifying a part that is semantically identical between the methods. In the example of FIG. 7, this information is expressed by not attaching a check symbol.

In FIG. 8, the generation unit 504 first selects the Statement data 413 from the intermediate language I code 410. Next, the generation unit 504 selects, from the intermediate language I code 420, the Statement data 423, which is of the same kind as the data 413. The generation unit 504 then compares the Operator data 416 and 426 contained in the selected Statement data 413 and 423, respectively.

Here, the expressions represented by the data 416 and 426 differ in part. The generation unit 504 therefore determines that the data 413 and 423 are semantically different, and generates, as the correspondence information 110, information identifying a part that is semantically different between the methods. In the example of FIG. 8, this information is expressed by attaching a check symbol.

In FIG. 9, the generation unit 504 first selects the Statement data 414 from the intermediate language I code 410. Next, the generation unit 504 selects, from the intermediate language I code 420, the Statement data 424, which is of the same kind as the data 414. The generation unit 504 then compares the Operator data 417 and 427 contained in the selected Statement data 414 and 424, respectively.

Here, the expressions represented by the data 417 and 427 are the same. The generation unit 504 therefore determines that the data 414 and 424 are semantically identical, and generates, as the correspondence information 110, information identifying a part that is semantically identical between the methods. In the example of FIG. 9, this information is expressed by not attaching a check symbol.

(Example of merge processing of duplicate method candidates)
Next, an example of the merge processing of duplicate method candidates is described, taking as the duplicate method candidates the SetColor methods that have the same method name in the base class COLOR and the derived class RED in the source program 300. First, the merge processing of the SetColor methods is outlined in terms of the source program 300.

FIG. 10 is an explanatory diagram showing the merge processing of the SetColor methods. In FIG. 10, code 1001 on lines 10 to 14 of the source program 300 describes the SetColor method of the base class COLOR. Code 1002 on lines 24 to 29 of the source program 300 describes the SetColor method of the derived class RED.

The merge processing unit 506 merges the intermediate language I code corresponding to the code 1001 with the intermediate language I code corresponding to the code 1002, and generates an intermediate language I code such as that depicted by image 1010. Comparing the code 1001 with the code 1002, the arguments on lines 10 and 24 and the expressions on lines 12 and 26 are specific to the respective SetColor methods.

The merge processing unit 506 therefore generates, as shown by codes 1011 to 1015 in the image 1010, the intermediate language I code for the processing common to the SetColor methods of the base class COLOR and the derived class RED, and the intermediate language I code for the processing specific to each SetColor method.

The code 1011 is used to select whether the processing of the SetColor method of the base class COLOR or that of the derived class RED is to be performed, and defines the information passed by the caller as the first argument. The code 1012 represents the arguments of the SetColor method.

The codes 1013 and 1015 correspond to the intermediate language I code for the processing common to the SetColor methods of the base class COLOR and the derived class RED. The code 1014, on the other hand, corresponds to the intermediate language I code for the processing specific to each of those SetColor methods. The code 1014 contains a conditional statement (if) that determines whether the SetColor method of the base class COLOR or of the derived class RED is being executed.

Line 36 of the source program 300 calls the SetColor method of the base class COLOR, and line 37 calls the SetColor method of the derived class RED. To allow the SetColor method of the base class COLOR or of the derived class RED to be called, the merge processing unit 506 also generates an intermediate language I code such as that depicted by image 1020.
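The shape of the merged method and its call sites can be sketched as follows. Only the structure follows the description (a selector passed as the first argument, common parts before and after, and an `if` around the class-specific part); the function name `set_color_merged`, the `state` dictionary, and every statement in the body are hypothetical, since the contents of FIGS. 3 and 10 are not reproduced in the text.

```python
COLOR, RED = "COLOR", "RED"  # hypothetical selector values (role of code 1011)


def set_color_merged(owner, state, *args):
    """Merged SetColor body; `owner` selects the original method's behavior."""
    # Common processing (stands in for code 1013)
    state["calls"] = state.get("calls", 0) + 1
    # Class-specific processing guarded by a selector check (stands in for code 1014)
    if owner == COLOR:
        state["rgb"] = tuple(args)
    else:
        state["rgb"] = (args[0], 0, 0)
    # Common processing (stands in for code 1015)
    state["dirty"] = True
    return state


# Call sites in the style of image 1020: each caller names the class it intends.
base_result = set_color_merged(COLOR, {}, 1, 2, 3)
derived_result = set_color_merged(RED, {}, 200)
```

Passing the selector explicitly keeps a single method body while still letting each original call site reach the behavior of the method it called before the merge.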

The intermediate language I code corresponding to the image 1010 is described next.

FIG. 11 is an explanatory diagram showing an example of the intermediate language I code of the method body in which the SetColor methods are merged. In FIG. 11, an intermediate language I code 1100 schematically shows the intermediate language I code of the method body obtained by merging the SetColor method of the base class COLOR with the SetColor method of the derived class RED, and corresponds to the image 1010 shown in FIG. 10.

In the intermediate language I code 1100, data 1101 is the intermediate language I code for selecting whether the processing of the SetColor method of the base class COLOR or that of the derived class RED is to be performed. The data 1101 corresponds to the code 1011 in the image 1010 shown in FIG. 10.

Data 1102 is the intermediate language I code representing the arguments of the SetColor method, and corresponds to the code 1012 in the image 1010. Data 1103 is the intermediate language I code for processing common to the SetColor methods of the base class COLOR and the derived class RED, and corresponds to the code 1013 in the image 1010.

Data 1104 is the intermediate language I code for the processing specific to each SetColor method, and corresponds to the code 1014 in the image 1010. Data 1105 is the intermediate language I code for processing common to the SetColor methods, and corresponds to the code 1015 in the image 1010.

(Example of a source program in which duplicate method candidates are not merged)
Next, a source program sc in which the duplicate method candidates are not merged is described, taking as an example the case where overloaded methods are extracted as duplicate method candidates.

FIG. 12 is an explanatory diagram showing an example of a source program whose duplicate method candidates are not merged. In FIG. 12, a source program 1200 is an example of a program that is coded in an object-oriented language and is characteristic of object-oriented programming.

A print method is defined on line 4 of the source program 1200. A print method with the same name as the one on line 4 is defined on line 9 of the source program 1200. That is, the print method is overloaded.

Code 1201 on lines 4 to 6 of the source program 1200 is the description of the print method defined on line 4. Code 1202 on lines 9 to 22 of the source program 1200 is the description of the print method defined on line 9. Comparing the code 1201 with the code 1202 shows that, apart from the method name, no part is semantically identical between the methods.

When few parts are semantically identical between methods and the redundancy in terms of execution between them is therefore low, merging the print methods to reduce the absolute number of methods cannot be expected to shorten the compile time of the source program sc. For this reason, the compiling device 100 does not treat methods such as the print methods in the source program 1200 as merge targets.

(Compile processing procedure of the compiling device 100)
Next, the compile processing procedure of the compiling device 100 will be described.

FIG. 13 is a flowchart showing an example of the compile processing procedure of the compiling device 100. In the flowchart of FIG. 13, the compiling device 100 first reads the source program sc (step S1301).

Next, the compiling device 100 performs lexical analysis of the source program sc (step S1302). The compiling device 100 then performs syntax analysis and semantic analysis of the source program sc by analyzing the relationships between the tokens in the token sequence obtained by the lexical analysis (step S1303).

Next, the compiling device 100 performs duplicate method candidate extraction processing, which extracts duplicate method candidates from the plurality of methods in the intermediate language I code obtained by the syntax analysis and semantic analysis of the source program sc (step S1304). The specific content of the duplicate method candidate extraction processing will be described later with reference to FIG. 14.

Next, the compiling device 100 selects an unselected duplicate method candidate from among the extracted duplicate method candidates (step S1305). The compiling device 100 then performs correspondence information generation processing, which generates correspondence information 110 between the methods of the duplicate method candidate (step S1306). The specific procedure of the correspondence information generation processing will be described later with reference to FIG. 15.

Next, based on the generated correspondence information 110 of the duplicate method candidate, the compiling device 100 determines whether the methods of the duplicate method candidate have redundancy in terms of execution (step S1307). If it determines that they have no such redundancy (step S1307: No), the compiling device 100 proceeds to step S1309.

On the other hand, if it determines that the methods have redundancy in terms of execution (step S1307: Yes), the compiling device 100 performs merge processing, which generates the intermediate language I code of a method body in which the duplicate method candidate is merged (step S1308). The specific procedure of the merge processing will be described later with reference to FIG. 16.

Next, the compiling device 100 determines whether any of the extracted duplicate method candidates remains unselected (step S1309). If an unselected duplicate method candidate remains (step S1309: Yes), the compiling device 100 returns to step S1305.

On the other hand, if no unselected duplicate method candidate remains (step S1309: No), the compiling device 100 generates the intermediate language II code of the source program sc based on the intermediate language I code of the method bodies in which duplicate method candidates are merged and the intermediate language I code of the methods that were not merged (step S1310).

The compiling device 100 then optimizes the generated intermediate language II code of the source program sc (step S1311). Next, the compiling device 100 generates an object file f of the source program sc based on the optimized intermediate language II code (step S1312).

The compiling device 100 then outputs the generated object file f of the source program sc (step S1313) and ends the series of processing according to this flowchart.

As a result, in the syntax/semantic analysis section of the compiler, a plurality of methods that have redundancy in terms of execution can be merged into one method, and the absolute number of methods can be reduced by the time they are input to the optimization section.
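The loop formed by steps S1305 to S1309 can be sketched as follows. This is a minimal illustration rather than the device's actual implementation: the function name `merge_phase` and the three callables passed to it are hypothetical hooks standing in for the processing of FIG. 15, the determination of step S1307, and the processing of FIG. 16.

```python
def merge_phase(candidates, build_correspondence, has_redundancy, merge):
    """Loop over duplicate method candidates (steps S1305 to S1309).

    The three callables are hypothetical hooks for the correspondence
    information generation (S1306), the redundancy check (S1307), and
    the merge processing (S1308).
    """
    merged_bodies = []   # method bodies produced by merge processing
    left_as_is = []      # candidates whose methods are kept separate
    for candidate in candidates:                    # S1305, repeated via S1309
        info = build_correspondence(candidate)      # S1306
        if has_redundancy(info):                    # S1307
            merged_bodies.append(merge(candidate, info))  # S1308
        else:
            left_as_is.append(candidate)
    # Both lists would feed intermediate language II generation (S1310).
    return merged_bodies, left_as_is
```

Passing the three steps in as callables keeps the control flow of FIG. 13 separate from the details of FIGS. 15 and 16.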

<Duplicate method candidate extraction processing procedure>
Next, the specific procedure of the duplicate method candidate extraction processing in step S1304 shown in FIG. 13 will be described.

FIG. 14 is a flowchart showing an example of the specific procedure of the duplicate method candidate extraction processing. In the flowchart of FIG. 14, the compiling device 100 first refers to the intermediate language I code of the source program sc and determines whether methods with the same name exist between classes having a parent-child relationship (step S1401).

If methods with the same name exist between classes having a parent-child relationship (step S1401: Yes), the compiling device 100 extracts the combination of those same-name methods as a duplicate method candidate (step S1402). The compiling device 100 then returns to step S1401.

On the other hand, if no methods with the same name exist between classes having a parent-child relationship (step S1401: No), the compiling device 100 refers to the intermediate language I code of the source program sc and determines whether methods with the same name or overloaded operators exist (step S1403).

If methods with the same name or overloaded operators exist (step S1403: Yes), the compiling device 100 extracts the combination of those same-name methods or overloaded operators as a duplicate method candidate (step S1404). The compiling device 100 then returns to step S1403.

On the other hand, if no methods with the same name and no overloaded operators exist (step S1403: No), the compiling device 100 returns to the step that called the duplicate method candidate extraction processing. In this way, overridden methods and overloaded methods can be extracted as duplicate method candidates, that is, as candidates that may have redundancy in terms of execution between methods.
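The two extraction passes of FIG. 14 can be sketched as follows. The `Method` record, the `parent_of` map, and the choice to treat any ancestor class as a parent are assumptions made for illustration; the description does not specify how the intermediate language I code represents methods or inheritance.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Method:
    name: str                      # method or operator name, e.g. "print"
    owner: Optional[str]           # owning class, or None for a non-member
    params: Tuple[str, ...] = ()   # parameter types, which distinguish overloads


def ancestors(cls, parent_of):
    """Return every ancestor of cls, following single inheritance links."""
    found = []
    while parent_of.get(cls) is not None:
        cls = parent_of[cls]
        found.append(cls)
    return found


def extract_candidates(methods, parent_of):
    candidates = []
    # S1401-S1402: same-name methods in classes with a parent-child relationship.
    for i, a in enumerate(methods):
        for b in methods[i + 1:]:
            if a.name != b.name or a.owner is None or b.owner is None:
                continue
            if (a.owner in ancestors(b.owner, parent_of)
                    or b.owner in ancestors(a.owner, parent_of)):
                candidates.append((a, b))
    # S1403-S1404: same-name methods or operators in one class, or in no class.
    by_scope = defaultdict(list)
    for m in methods:
        by_scope[(m.owner, m.name)].append(m)
    for group in by_scope.values():
        if len(group) > 1:
            candidates.append(tuple(group))
    return candidates
```

Each returned tuple is one duplicate method candidate, that is, one combination of methods to be examined further.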

<Correspondence information generation processing procedure>
Next, the specific procedure of the correspondence information generation processing in step S1306 shown in FIG. 13 will be described.

FIG. 15 is a flowchart showing an example of the specific procedure of the correspondence information generation processing. In the flowchart of FIG. 15, the compiling device 100 first extracts, from the intermediate language I code of the source program sc, the intermediate language I code corresponding to each of the duplicate method candidates (step S1501).

Next, the compiling device 100 refers to the extracted intermediate language I code and sequentially compares the declaration, statement, and expression data between the methods of the duplicate method candidates (step S1502). Based on the comparison result, the compiling device 100 then determines whether the extracted intermediate language I code contains a part that differs semantically between the methods (step S1503).

If there is a part that differs semantically between the methods (step S1503: Yes), the compiling device 100 marks that part of the extracted intermediate language I code (step S1504). The compiling device 100 then returns to the step that called the correspondence information generation processing. The marking corresponds, for example, to the check symbols shown in FIG. 6.

On the other hand, if no part differs semantically between the methods (step S1503: No), the compiling device 100 returns to the step that called the correspondence information generation processing. In this way, correspondence information 110 can be generated that distinguishes, within the intermediate language I code corresponding to each of the duplicate method candidates, the parts that are semantically identical between the methods from the parts that differ semantically between the methods.
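The comparison and marking of FIG. 15 can be sketched as follows. This is a simplified illustration, not the patented procedure itself: it assumes each declaration, statement, or expression has already been normalized into a comparable value, and it substitutes plain equality of those values for the semantic comparison. The row layout with keys `a`, `b`, and `marked` is likewise hypothetical.

```python
from itertools import zip_longest


def build_correspondence(items_a, items_b):
    """Pair up the items of two methods in order and mark pairs that differ.

    Each item is an already-normalized description of one declaration,
    statement, or expression (a hypothetical encoding). A missing partner
    is represented by None and therefore counts as different.
    """
    rows = []
    for a, b in zip_longest(items_a, items_b):
        # S1502 compares the pair; S1503/S1504 mark it when it differs.
        rows.append({"a": a, "b": b, "marked": a != b})
    return rows
```

Unmarked rows are the parts treated as semantically identical between the methods, and marked rows are the parts treated as different.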

<Merge processing procedure>
Next, the specific procedure of the merge processing in step S1308 shown in FIG. 13 will be described.

FIG. 16 is a flowchart showing an example of the specific procedure of the merge processing. In the flowchart of FIG. 16, the compiling device 100 first selects declaration, statement, or expression data from the intermediate language I code of the duplicate method candidates (step S1601). At this time, if there is data that was compared with the selected data in step S1502 shown in FIG. 15, the compiling device 100 selects that data as well.

Next, the compiling device 100 determines whether the selected data is marked (step S1602). If it is marked (step S1602: Yes), the compiling device 100 generates, based on the selected data, the intermediate language I code of the method-specific processing of the duplicate method candidates (step S1603).

The compiling device 100 then determines whether any data in the intermediate language I code of the duplicate method candidates remains unselected (step S1604). If unselected data remains (step S1604: Yes), the compiling device 100 returns to step S1601.

If the selected data is not marked in step S1602 (step S1602: No), the compiling device 100 generates, based on the selected data, the intermediate language I code of the processing common to the methods of the duplicate method candidates (step S1605), and proceeds to step S1604.

If no unselected data remains in step S1604 (step S1604: No), the compiling device 100 merges the generated intermediate language I code of each processing, thereby generating the intermediate language I code of the method body in which the duplicate method candidates are merged (step S1606), and returns to the step that called the merge processing.

In this way, the intermediate language I code of a method body can be generated in which duplicate method candidates having redundancy in terms of execution between methods are merged. The intermediate language I code of the merged method body includes data for selecting which of the merged methods' processing is to be performed (for example, the data 1101 shown in FIG. 11). In step S1606, intermediate language I code for calling any of the merged methods is also generated.
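The merge loop of FIG. 16 can be sketched as follows. The sketch assumes that the correspondence information is available as a list of rows with the hypothetical keys `a`, `b`, and `marked`, and that a merged body can be modeled as a list of tagged entries; neither representation comes from the description.

```python
def merge_methods(rows, names):
    """Build a merged method body from correspondence rows (S1601 to S1606).

    rows  : list of dicts with keys "a", "b", and "marked" (hypothetical layout)
    names : pair of labels identifying the two original methods
    """
    body = []
    for row in rows:                                   # S1601, repeated via S1604
        if row["marked"]:                              # S1602: Yes
            # S1603: method-specific processing, kept per original method.
            body.append(("select", {names[0]: row["a"], names[1]: row["b"]}))
        else:                                          # S1602: No
            # S1605: processing common to both methods, emitted once.
            body.append(("common", row["a"]))
    return body                                        # S1606
```

A `select` entry plays the role of the selection data: at run time only the branch belonging to the called method would execute, while `common` entries run for every caller.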

As described above, the compiling device 100 according to the embodiment can extract, from the plurality of methods in the intermediate language I code obtained by syntax analysis and semantic analysis of the source program sc, duplicate method candidates that may have redundancy in terms of execution between methods. Thus, in the syntax/semantic analysis section, where the linguistic features of an object-oriented language can be identified, a plurality of methods whose data and processing logic may be identical can be extracted as duplicate method candidates.

For example, the compiling device 100 may extract, from the group of methods in the intermediate language I code of the source program sc, a combination of methods that have the same name in classes having a parent-child relationship as a duplicate method candidate. Thus, overridden methods, which may have redundancy in terms of execution between methods, can be extracted as duplicate method candidates.

For example, the compiling device 100 may extract, from the group of methods in the intermediate language I code of the source program sc, a combination of same-name methods among the methods in the same class, or among the methods that belong to no class, as a duplicate method candidate. Thus, overloaded methods, which may have redundancy in terms of execution between methods, can be extracted as duplicate method candidates.

For example, the compiling device 100 may extract, from the group of methods in the intermediate language I code of the source program sc, a combination of same-name operators among the operators in the same class, or among the operators that belong to no class, as a duplicate method candidate. Thus, overloaded operators, which may have redundancy in terms of execution between methods (operators), can be extracted as duplicate method candidates.

The compiling device 100 can also generate correspondence information 110 between the methods of the duplicate method candidates based on the processing logic specified by the intermediate language I code of each extracted duplicate method candidate. This yields the correspondence information 110 used to determine whether the methods of the duplicate method candidates have redundancy in terms of execution.

For example, the compiling device 100 compares data between the methods for each item of declaration, statement, and expression data in the intermediate language I code of each of the duplicate method candidates. Based on the comparison result, the compiling device 100 may then generate correspondence information 110 that distinguishes, within the intermediate language I code of each of the duplicate method candidates, the parts that are semantically identical between the methods from the parts that differ semantically between the methods. This makes it possible to tell the semantically identical parts of the duplicate method candidates apart from the semantically different parts.

The compiling device 100 can also determine, based on the generated correspondence information 110, whether the methods of the duplicate method candidates have redundancy in terms of execution, and can perform merge processing on the duplicate method candidates whose methods have such redundancy. Thus, a plurality of methods having redundancy in terms of execution can be merged into one method, and the absolute number of methods can be reduced at the stage of the syntax/semantic analysis section.

For example, the compiling device 100 may determine that the methods have redundancy in terms of execution when at least one item of the declaration, statement, and expression data in the intermediate language I code of each of the duplicate method candidates is semantically identical between the methods. Thus, duplicate method candidates that contain a part that is semantically identical between methods can be made merge targets, enabling effective merge processing.

For example, the compiling device 100 may determine that the methods have redundancy in terms of execution when a predetermined number X or more of the items of declaration, statement, and expression data in the intermediate language I code of each of the duplicate method candidates are semantically identical between the methods. Thus, duplicate method candidates that contain many parts that are semantically identical between methods can be made merge targets, enabling more effective merge processing.
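The two determination rules above differ only in their threshold, which the following sketch makes explicit. The function name, the `min_identical` parameter, and the row layout with a `marked` key are hypothetical; the description states only that at least one item, or a predetermined number X or more of items, must be semantically identical.

```python
def has_redundancy(rows, min_identical=1):
    """Return True when at least min_identical compared items are unmarked.

    min_identical=1 matches the "at least one item is identical" rule;
    a larger value plays the role of the predetermined number X.
    """
    identical = sum(1 for row in rows if not row["marked"])
    return identical >= min_identical
```

With the overloaded print methods of FIG. 12, where no part other than the name is identical, every row would be marked and the function would return False under either rule.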

For these reasons, the compiling device 100 can reduce the absolute number of methods by the time they are input to the optimization section, which shortens the compile time of the source program sc and reduces the data size of the object file f. In addition, because the compiling device 100 can use the linguistic features of an object-oriented language to narrow down the duplicate method candidates that may have redundancy in terms of execution between methods, it can identify the methods to be merged efficiently and effectively.

This can increase the speed of software development. In addition, reducing the data size of the object file f shortens the load time of the program at execution and saves computer system resources (for example, CPU load, memory capacity, disk capacity, and network load).

The compiling method described in the present embodiment can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. The compiling program is recorded on a computer-readable recording medium such as a hard disk, a flexible disk, a CD-ROM, an MO, or a DVD, and is executed by being read from the recording medium by the computer. The compiling program may also be distributed via a network such as the Internet.

The following supplementary notes are further disclosed with respect to the embodiment described above.

(Supplementary note 1) A compiling method characterized in that a computer executes processing of:
extracting, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program, a plurality of merge candidate methods based on at least one of an inheritance relationship and a name;
extracting, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and
performing merge processing of the plurality of mergeable methods.

(Supplementary note 2) The compiling method according to supplementary note 1, wherein
the computer executes processing of generating correspondence information between the methods of the plurality of merge candidate methods based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods, and
the processing of extracting the plurality of mergeable methods extracts, from the plurality of merge candidate methods, the plurality of mergeable methods that can be merged, based on the correspondence information.

(Supplementary note 3) The compiling method according to supplementary note 1 or 2, wherein
the processing of extracting the plurality of merge candidate methods extracts, from the plurality of methods in the intermediate language code, a combination of methods that have the same name in classes having a parent-child relationship as the plurality of merge candidate methods.

(Supplementary note 4) The compiling method according to any one of supplementary notes 1 to 3, wherein
the processing of extracting the plurality of merge candidate methods extracts, from the plurality of methods in the intermediate language code, a combination of same-name methods among the methods in the same class, or among the methods that belong to no class, as the plurality of merge candidate methods.

(Supplementary note 5) The compiling method according to any one of supplementary notes 1 to 4, wherein
the processing of extracting the plurality of merge candidate methods extracts, from the plurality of methods in the intermediate language code, a combination of same-name operators among the operators in the same class, or among the operators that belong to no class, as the plurality of merge candidate methods.

(Supplementary note 6) The compiling method according to supplementary note 2, wherein
the computer executes processing of comparing data between the methods of the plurality of merge candidate methods for each item of declaration, statement, and expression data in the intermediate language code of each of the plurality of merge candidate methods, and
the generating processing generates, based on the comparison result for each item of data, correspondence information that distinguishes, within the intermediate language code of each of the plurality of merge candidate methods, a part that is semantically identical between the methods from a part that differs semantically between the methods.

(Supplementary note 7) The compiling method according to supplementary note 6, wherein
the processing of extracting the plurality of mergeable methods refers to the correspondence information and extracts, as the plurality of mergeable methods, the plurality of merge candidate methods for which at least one item of the declaration, statement, and expression data in the intermediate language code of each of the plurality of merge candidate methods is semantically identical.

(Supplementary note 8) The compiling method according to supplementary note 6, wherein
the processing of extracting the plurality of mergeable methods refers to the correspondence information and extracts, as the plurality of mergeable methods, the plurality of merge candidate methods for which a predetermined number or more of the items of declaration, statement, and expression data in the intermediate language code of each of the plurality of merge candidate methods are semantically identical.

(Supplementary note 9) The compiling method according to supplementary note 2, wherein
the processing of performing the merge processing merges, based on the correspondence information, the intermediate language code of processing common to the methods of the plurality of mergeable methods with the intermediate language code of processing unique to each of the plurality of mergeable methods, thereby generating the intermediate language code of a method body in which the plurality of mergeable methods are merged.

(Supplementary note 10) The compiling method according to any one of supplementary notes 1 to 9, wherein the source program is a program coded in an object-oriented language.

(Supplementary note 11) A compiling device characterized by comprising a control unit that extracts, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program, a plurality of merge candidate methods based on at least one of an inheritance relationship and a name; extracts, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and performs merge processing of the plurality of mergeable methods.

(Supplementary Note 12) A compilation program that causes a computer to execute processing comprising:
extracting a plurality of merge candidate methods, based on at least one of an inheritance relationship and a name, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program;
extracting, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and
performing a merge process on the plurality of mergeable methods.

(Supplementary Note 13) A computer-readable recording medium on which is recorded a compilation program that causes a computer to execute processing comprising:
extracting a plurality of merge candidate methods, based on at least one of an inheritance relationship and a name, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program;
extracting, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and
performing a merge process on the plurality of mergeable methods.

DESCRIPTION OF SYMBOLS
100 Compiling device
501 Acquisition unit
502 Analysis unit
503 Extraction unit
504 Generation unit
505 Determination unit
506 Merge processing unit
507 Optimization unit
508 Code generation unit
509 Output unit

Claims

1. A compiling method in which a computer executes processing comprising:
extracting a plurality of merge candidate methods, based on at least one of an inheritance relationship and a name, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program;
extracting, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and
performing a merge process on the plurality of mergeable methods.

2. The compiling method according to claim 1, wherein the computer further executes processing for generating correspondence information between the plurality of merge candidate methods based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods, and
the process of extracting the plurality of mergeable methods extracts the plurality of mergeable methods from the plurality of merge candidate methods based on the correspondence information.

3. The compiling method according to claim 1, wherein the process of extracting the plurality of merge candidate methods extracts, from the plurality of methods in the intermediate language code, a combination of methods having the same name in classes having a parent-child relationship as the plurality of merge candidate methods.

4. The compiling method according to any one of claims 1 to 3, wherein the process of extracting the plurality of merge candidate methods extracts, from the plurality of methods in the intermediate language code, a combination of methods having the same name among methods in the same class or among methods not belonging to any class as the plurality of merge candidate methods.
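The name-based and inheritance-based candidate extraction described above can be sketched as follows. This is only an illustration under stated assumptions: the `Method` record, the `parents` map (child class name to parent class name), and the function names are hypothetical, and a real compiler would work on its intermediate language representation rather than on strings.

```python
from collections import defaultdict
from itertools import combinations
from typing import NamedTuple, Optional


class Method(NamedTuple):
    owner: Optional[str]  # class name, or None for a function outside any class
    name: str
    signature: str        # distinguishes overloads with the same name


def is_ancestor(parents, ancestor, cls):
    """Return True if `ancestor` is reachable from `cls` via the parent map."""
    while cls is not None:
        cls = parents.get(cls)
        if cls == ancestor:
            return True
    return False


def merge_candidates(methods, parents):
    """Pair same-named methods that share a scope or an inheritance chain."""
    by_name = defaultdict(list)
    for m in methods:
        by_name[m.name].append(m)
    pairs = []
    for group in by_name.values():
        for a, b in combinations(group, 2):
            same_scope = a.owner == b.owner  # same class, or both class-less
            related = (
                a.owner is not None
                and b.owner is not None
                and (is_ancestor(parents, a.owner, b.owner)
                     or is_ancestor(parents, b.owner, a.owner))
            )
            if same_scope or related:
                pairs.append((a, b))
    return pairs


methods = [
    Method("Base", "area", "()"),
    Method("Derived", "area", "()"),
    Method("Other", "area", "()"),
    Method(None, "norm", "(int)"),
    Method(None, "norm", "(double)"),
]
candidates = merge_candidates(methods, {"Derived": "Base"})
```

Here `Base.area` and `Derived.area` are paired because of the parent-child relationship, the two `norm` overloads are paired because neither belongs to a class, and `Other.area` is left out because it shares only a name.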

5. The compiling method according to any one of claims 1 to 4, wherein the process of extracting the plurality of merge candidate methods extracts, from the plurality of methods in the intermediate language code, a combination of operators having the same name among operators in the same class or among operators not belonging to any class as the plurality of merge candidate methods.

6. The compiling method according to claim 2, wherein the computer further executes processing for comparing, for each item of declaration, statement, and expression data in the intermediate language code of each of the plurality of merge candidate methods, the data between the plurality of merge candidate methods, and
the generating process generates, based on the comparison result for each item of data, the correspondence information identifying, within the intermediate language code of each of the plurality of merge candidate methods, the portions that are semantically identical between the methods and the portions that are semantically different between the methods.
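One way to picture the correspondence information is as an alignment of two statement sequences into identical and differing portions. The sketch below uses Python's standard `difflib.SequenceMatcher` for the alignment. This is a stand-in technique chosen for illustration, not the comparison the patent defines: it treats two statements as semantically identical only when their text is equal, and the `("same", ...)` / `("diff", ...)` entry format is an assumption.

```python
from difflib import SequenceMatcher


def correspondence(stmts_a, stmts_b):
    """Align two statement lists into identical and differing portions."""
    matcher = SequenceMatcher(a=stmts_a, b=stmts_b, autojunk=False)
    info = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Portions identical between the two methods.
            info.extend(("same", s) for s in stmts_a[i1:i2])
        else:
            # Portions that differ (replace, delete, or insert).
            info.append(("diff", stmts_a[i1:i2], stmts_b[j1:j2]))
    return info


method_a = ["t = x * 2", "r = t + 1", "return r"]
method_b = ["t = x * 2", "r = t - 1", "return r"]
info = correspondence(method_a, method_b)
```

A comparison on real intermediate language code would likely normalize identifiers and types before comparing, so that differently named but equivalent declarations still count as identical.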

7. The compiling method according to claim 6, wherein the process of extracting the plurality of mergeable methods refers to the correspondence information and extracts, as the plurality of mergeable methods, the merge candidate methods in which at least one item of the declaration, statement, and expression data in the intermediate language code of each of the plurality of merge candidate methods is semantically identical.
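The mergeability decision can be sketched as a simple count over the correspondence information. The entry format and the `min_same` parameter are assumptions for illustration; `min_same=1` corresponds to the "at least one" wording of this claim, and a larger value corresponds to the "predetermined number or more" variant.

```python
def is_mergeable(info, min_same=1):
    """Decide mergeability from hypothetical correspondence info.

    `info` holds ("same", stmt) and ("diff", stmts_a, stmts_b) entries;
    the candidates are mergeable when enough entries are identical.
    """
    same_count = sum(1 for entry in info if entry[0] == "same")
    return same_count >= min_same


sample = [
    ("same", "t = x * 2"),
    ("diff", ["r = t + 1"], ["r = t - 1"]),
    ("same", "return r"),
]
mergeable_default = is_mergeable(sample)
mergeable_strict = is_mergeable(sample, min_same=3)
```

With the sample above, two entries are identical, so the pair passes the default threshold but not a threshold of three.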

8. The compiling method according to claim 1, wherein the source program is a program coded in an object-oriented language.

9. A compiling device comprising a control unit that extracts a plurality of merge candidate methods, based on at least one of an inheritance relationship and a name, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program; extracts, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and performs a merge process on the plurality of mergeable methods.

10. A compilation program that causes a computer to execute processing comprising:
extracting a plurality of merge candidate methods, based on at least one of an inheritance relationship and a name, from a plurality of methods in intermediate language code obtained by syntax analysis and semantic analysis of a source program;
extracting, from the plurality of merge candidate methods, a plurality of mergeable methods that can be merged, based on the processing logic specified by the intermediate language code of each of the plurality of merge candidate methods; and
performing a merge process on the plurality of mergeable methods.