JP3726992B2

JP3726992B2 - Batch function call method

Info

Publication number: JP3726992B2
Application number: JP03441798A
Authority: JP
Inventors: 教安森; 純男菊池
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1998-02-17
Filing date: 1998-02-17
Publication date: 2005-12-14
Anticipated expiration: 2018-02-17
Also published as: JPH11232118A

Description

【０００１】
【発明の属する技術分野】
本発明はコンパイラなどのプログラム変換技術に係わり、特に、関数呼出機構を持つ言語に対して、関数および関数呼出の関係を解析して関数情報および呼出情報として生成するプログラム変換の際の一括関数呼出化方法に関する。
【０００２】
【従来の技術】
高級言語によるプログラミングでは、関数（手続きとも呼ばれるが、以降では関数で統一する）による構造化を行うのが一般的である。関数は、一連の処理をまとめてパラメータ化して記述したもの（以下、これを関数定義と呼ぶ）で、それを呼出す（以下単に呼出と呼ぶ）ことにより関数定義内の処理を実現できる。関数定義側のパラメータには、仮引数、返却変数（以下、仮返数と呼ぶ）があり、関数定義内の記述では、仮引数・仮返数（以下、仮パラメータと呼ぶ）を使って処理を記述する。呼出側のパラメータには、実引数、返却変数（以下、実返数と呼ぶ）があり、呼出側の変数／式を使ってパラメータの指定を行う（この呼出インタフェースとなる変数／式を実パラメータと呼ぶ）。尚、以下では、仮パラメータと実パラメータを総称して、インタフェース変数／式と呼ぶ。実パラメータには様々な値を指定することが可能で、これにより同様の処理（＝関数定義）を繰り返し実行することができる。
【０００３】
このように、関数はプログラミング上便利な機構である一方で、呼出時にオーバヘッドがかかるという問題点を抱えている。即ち、引数の受け渡しや、呼出時のスタックポインタの更新／退避回復、命令ジャンプに伴うキャッシュミスの可能性の増大といったことにより、処理性能の低下を招く。特に、処理内容が簡単な関数を反復呼出する場合には、呼出オーバヘッドは性能上無視できないものとなる。
【０００４】
このような状況に対処するため、高級言語では、インライン展開の機能を備えている場合が多い。インライン展開は、関数呼出を関数の定義本体の内容で置き換え、呼出を抑止するものである。具体的には、各呼出点で、関数定義の内容を展開し、仮パラメータを実パラメータで置き換えることにより実現する。インライン展開に関しては、佐々政孝著岩波講座ソフトウェア科学５「プログラミング言語処理系」（岩波書店、１９８９）のｐｐ．４５０―４５１に詳しい。
【０００５】
【発明が解決しようとする課題】
インライン展開は、高級言語における呼出オーバヘッド削減のほとんど唯一の手段である。しかし、インライン展開の適用によりコードサイズが大きくなり過ぎると、キャッシュミスの機会が増え、かえって性能が低下する場合もある。そのため、大きな関数の呼出や、呼出個所が多い場合には、インライン展開を制限するのが普通である。即ち、コード量の極端な増加を伴う場合には、インライン展開による呼出オーバヘッドの削減は行うことができない。すなわち、従来技術においては、インライン展開が適用できない場合に、呼出オーバヘッドの削減を行う手段がないという課題があった。
本発明の目的は、インライン展開によらない呼出オーバヘッドの削減手段を設けることにより、インライン展開が適用できない場合にも、呼出オーバヘッドの削減を行い、オブジェクトプログラムの処理性能の向上を図ることが可能な一括関数呼出化方法を提供することにある。
【０００６】
【課題を解決するための手段】
インライン展開は、各呼出点で関数定義の処理内容を展開し、（その呼出点での）呼出を除去する。従って、インライン展開の適用の有無は、各呼出点の呼出回数を０とするか否かを判断することになる。複数回の同一関数の呼出の場合、インライン展開が適用できない場合でも、関数定義側に複数呼出分の処理を展開すれば、コードサイズを極端に増大させることなく、呼出回数を削減することが可能である。即ち、複数回の同一関数の処理を一度に実行する一括関数を生成し、複数回の同一関数の呼出を、１回の一括関数の呼出に置き換える一括化呼出生成手段を設けることで、インライン展開によらない呼出オーバヘッドの削減手段を設けることができる。
これにより、インライン展開が適用できない場合にも、呼出オーバヘッドの削減を行うことが可能となり、オブジェクトプログラムの処理性能の向上に寄与する。
【０００７】
【発明の実施の形態】
本発明は、同一関数の複数呼び出しを、一括処理する一括関数の呼び出しに変換する方式に関するものである。
以下では、本発明の１実施例を図面を用いて説明する。
尚、本発明は、ソースプログラムから変換後のソースプログラムを生成する場合にも、コンパイラが中間語から変換後の中間語に変換する場合にも、同様に適用可能であるが、実施例としてコンパイラに適用した場合（但し、中間語は同等なソースプログラムの形式で示す）を説明することにする。
【０００８】
図１は、本発明のコンパイル方法を実施するコンパイラが動作する計算機システムの構成図である。計算機システムは、ＣＰＵ１０１、ディスプレイ装置１０２、キーボード１０３、主記憶装置１０４、外部記憶装置１０５より構成されている。
キーボード１０３より、ユーザからのコンパイラ起動命令を受け付ける。コンパイラ終了メッセージや、エラーメッセージは、ディスプレイ装置１０２に表示される。外部記憶装置１０５には、ソースプログラム１０６とオブジェクトプログラム１０７が格納される。
主記憶装置１０４には、コンパイラ１０８があり、コンパイル過程で必要となる中間語３、関数情報テーブル４、呼出情報テーブル５、一括呼出情報テーブル６、一括関数情報テーブル７が格納される。
コンパイル処理は、ＣＰＵ１０１によって、制御される。
【０００９】
図２に本実施例のコンパイラの処理手順を示す。
構文解析２０１では、ソースプログラム１０６を入力として字句構文解析を行ない、中間語３を出力する。
ループ展開処理２０２は中間後３を入力とし、反復処理を展開して反復回数を低減するループ展開を行い、中間語３を出力する。
一括関数化処理２は、中間語３を入力とし、複数の関数呼び出しを一括命令関数呼び出しに変換する一括関数化処理を行ない、中間語３を出力する。
コード生成処理２０３は、中間語を最終的なオブジェクトプログラム１０７の形式に変換する。
尚、ループ展開２０２は、本発明の適用機会を増やすため望ましい処理であるが、必須の処理ではない。
【００１０】
図３は、本発明における関数呼出一括化方法の１実施例を示すフローチャート図であり、図２の関数呼出一括化処理２の手順を表している。
先ず、関数情報テーブル生成処理２１により、ソースプログラム中の関数定義の各種情報を抽出し、関数情報テーブル４を作成する。同様に、呼出情報テーブル生成処理２２により、関数呼出に関する情報を抽出し、呼出情報テーブル５を生成する。
【００１１】
次に、一括呼出情報テーブル生成処理２３により、呼出情報テーブル５から複数の同一関数呼出を一括処理する一括関数に関する情報を抽出し、一括関数情報テーブル６を生成する。この一括呼出情報テーブル生成処理２３は、同一関数呼出に関する情報を抽出する同型呼出抽出処理２３０と、同型呼出集合から一括呼出化可能な呼出を抽出する一括呼出抽出処理２３１、抽出した一括呼出を一括呼出情報テーブルに登録する一括呼出情報テーブル登録処理２３２からなる。各処理に関しては図１７から図２３で詳しく述べる。
【００１２】
次に、一括関数生成処理２４により、一括関数情報テーブル７に基づき、中間後３に一括関数を生成する。
最後に、一括呼出化変換処理２５により、一括呼出情報テーブル６に基づき、中間語３中の関数呼出を一括関数呼出に変換する。
【００１３】
図４は、図３で示した関数呼出一括化関連処理と、入出力となる中間語やテーブル類の関係を示した処理構成図である。但し、中間語３は全般的に参照するため入出力関係は、主要なもののみを示した。
関数情報テーブル生成処理２１は、一括関数呼出化前中間語３１を入力とし、関数定義の情報を格納した関数情報テーブル４を生成する。関数情報テーブル４は、関数の一覧を格納する関数テーブル４１と、関数のインタフェースとなる仮引数および仮返数の情報を格納する仮パラメータテーブル４２から成る。
呼出情報テーブル生成処理２２は、一括呼出化前中間語３１を入力とし、関数呼出の情報を格納した呼出情報テーブル５を生成する。呼出情報テーブル５は、関数呼出の一覧を格納する呼出テーブル５１と、関数呼出のインタフェースとなる実引数および実返数の情報を格納する実パラメータテーブル５２から成る。
【００１４】
一括呼出化情報テーブル生成処理２３は、呼出情報テーブル５を入力とし、一括呼出化可能な関数呼出の情報を格納した一括呼出情報テーブル６と、一括呼出される関数定義の情報を格納した一括関数情報テーブル７を生成する。一括呼出情報テーブル６は、一括呼出化可能な呼出の一覧を格納する一括呼出テーブル６１と、一括呼出のインタフェースとなる実引数および実返数の情報を格納する実パラメータテーブル６２から成る。同様に、一括関数情報テーブル７は、一括呼出化される関数定義の一覧を格納する一括関数テーブル７１と、一括関数のインタフェースとなる仮引数および仮返数の情報を格納する仮パラメータテーブル７２から成る。
【００１５】
一括関数生成処理２４は、一括呼出化前中間語３１と一括関数情報テーブル７を入力とし、一括呼出化後中間語３２に一括関数を生成する。
一括呼出化変換処理２５は、一括呼出化前中間語３１と一括呼出情報テーブル６を入力とし、一括呼出化前中間語３１の関数呼出を一括関数呼出に変換し、一括呼出化後中間語３２に生成する。
【００１６】
次に、具体的な適用例の説明のため、処理対象たるソースプログラムの例を挙げておく。
図５は、本発明の一実施例の説明のためのソースプログラム例（Ｃ言語で書かれたプログラム例）である。
本実施例において、中間語３はソースプログラムと同等の情報を有しており特別に中間語の形式である必要はない。そこで、中間語３の例示に当たっては、ソースプログラムの形式で記述するものとする。従って、図５はソースプログラム例であるとともに、（一括呼出化前の）中間語３１の例ともなっている。
また、図２の説明でも述べたとおり、本実施例では（本質的には不要な）ループ展開を行っている。図５では、ループ展開は適用されていないが、ループ展開後に同一関数の呼出が生成される場合にも、一般性を失うことなく適用できる。
図５のプログラム例では、関数ｆ、ｇ、ｍａｉｎの３関数があり、関数ｍａｉｎが関数ｇを１回、関数ｇが関数ｆを２回呼び出している。１７行と２０行の関数ｆの呼出が、一括呼出可能な関数である。
【００１７】
図６は、図５のソースプログラム例に対する一括関数呼出化後のソースプログラム例であり、一括関数呼出化後中間語３２に対応している。図６では、図５にはなかった一括関数Ｆが新たに生成されている（３０行から３６行）。また、図５では関数ｇは関数ｆを２回呼び出していたが、図６では関数ｆの２回の呼出と同等の処理を行う一括関数Ｆの呼出１回（１６行）となっている。即ち、関数ｆの２回の呼出を一括関数Ｆの一回の呼出に変換しており、一括呼出化変換を行った結果の例である。
【００１８】
以降では、一括関数呼出化処理２の細部の説明を行なう。
説明にあたっては、図５の中間語を入力例とし、各種テーブルの出力例を逐次示していく。そこで、先ず、各種テーブルの構成例を説明してから、処理手順の説明を行なう。
以下、関数情報テーブル４、呼出情報テーブル５、一括呼出情報テーブル６、一括関数情報テーブル７の順にその構成例を述べる。
【００１９】
図７は、図５の例に対して求めた関数テーブル４１の構成例である。
関数テーブルは、プログラム中に出現する関数定義毎に、エントリが作成される。各関数には、一意な関数番号が付与されており、その関数番号により各エントリがアクセス可能となっている。各エントリは、関数番号欄４１１、関数名称欄４１２、定義行番号欄４１３、仮返数欄４１４、仮引数集合欄４１５、呼出集合欄４１６で構成される。
【００２０】
関数番号欄４１１には、関数テーブルのエントリ番号、関数名称欄４１２には、対応する関数の名称が格納される。
定義行番号欄４１３には、当該関数の中間語３での関数定義情報が格納される。この定義行番号欄により、対応する関数定義の全ての情報を得ることができる。例えば、関数の各種インタフェース情報の取得や、関数定義本体の読み出し／複写等も可能である。本実施例では、図７で示したように、ソースプログラムの行番号の形式で示してある。
仮返数欄４１４および仮引数集合欄４１５は、当該関数の仮返数および仮引数の情報が、仮パラメータテーブル４２のエントリ番号の形式で格納される。
呼出集合欄４１６は、当該関数中に存在する関数呼出の情報が、呼出テーブル５１のエントリ番号の形式で格納される。
【００２１】
図８は、図５の例に対して求めた仮パラメータテーブル４２の構成例である。
仮パラメータテーブルは、関数テーブル４１の関数インタフェース情報を格納するための補助テーブルである。仮返数や仮引数毎に、エントリが作成され、そのエントリ番号が関数テーブル４１の仮返数欄や仮引数集合欄に格納される。
各エントリは、仮パラメータ番号欄４２１、仮パラメータ位置番号欄４２２、仮パラメータ型欄４２３、仮パラメータ変数欄４２４で構成される。
【００２２】
仮パラメータ番号欄４２１には、仮パラメータテーブルのエントリ番号が格納される。
仮パラメータ位置番号欄４２２には、当該仮パラメータの出現位置が番号で示してある。仮返数の位置番号は０、ｉ番めの仮引数は位置番号ｉとなる。
仮パラメータ型欄４２３には、仮パラメータの型情報が格納される。
仮パラメータ変数欄４２４には、対応する仮パラメータの名称が格納される。
【００２３】
図９は、図５の例に対して求めた呼出テーブル５１の構成例である。
呼出テーブルは、プログラム中に出現する関数呼出毎に、エントリが作成される。各エントリは、呼出番号欄５１１、呼出行番号欄５１２、呼出関数番号欄５１３、実返数欄５１４、実引数集合欄５１５で構成される。
【００２４】
呼出番号欄５１１には、呼出テーブルのエントリ番号、呼出行番号欄５１２には、対応する関数呼出の行番号が格納される。
呼出関数番号欄５１３には、当該呼出が呼び出す関数が、関数テーブル４１の関数番号が格納される。
実返数欄５１４および実引数集合欄５１５は、当該呼出の実返数および実引数の情報が、実パラメータテーブル５２のエントリ番号の形式で格納される。
【００２５】
図１０は、図５の例に対して求めた実パラメータテーブル５２の構成例である。
実パラメータテーブルは、呼出テーブル５１の呼出インタフェース情報を格納するための補助テーブルである。実返数や実引数毎に、エントリが作成され、そのエントリ番号が呼出テーブル５１の実返数欄や実引数集合欄に格納される。各エントリは、実パラメータ番号欄５２１、実パラメータ位置番号欄５２２、実パラメータ式欄５２２で構成される。
【００２６】
実パラメータ番号欄５２１には、実パラメータテーブルのエントリ番号が格納される。
実パラメータ位置番号欄５２２には、当該実パラメータの出現位置が番号で示してある。実返数の位置番号は、（仮パラメータテーブル４２の仮パラメータ位置番号欄４２２と同様に）０、ｉ番めの実引数は位置番号ｉとなっている。
実パラメータ式欄５２３には、対応する実パラメータの名称が格納される。
【００２７】
図１１は、図５の例に対して求めた一括呼出テーブル６１の構成例である。
一括呼出テーブルは、一括関数の（一括）呼出毎に、エントリが作成される。各エントリは、一括呼出番号欄６１１、生成元呼出集合欄６１２、一括関数番号欄６１３、実返数名称欄６１４、実返数型欄６１５、実返数集合欄６１６、実引数集合欄６１７で構成される。
【００２８】
一括呼出番号欄６１１には、一括呼出テーブルのエントリ番号が格納される。
生成元呼出集合欄６１２には、一括呼出化する呼出の集合が、呼出テーブル５１のエントリ番号の形式で格納される。
一括関数番号欄６１３には、当該一括呼出が呼び出す一括関数が、一括関数テーブル７１のエントリ番号（一括関数関数番号）で格納される。
実返数名称欄６１４および実返数型欄６１５には、返数の名称および返数の型が格納される。
実返数集合欄６１６および実引数集合欄６１７は、当該一括呼出の実返数および実引数の情報が、一括実パラメータテーブル６２のエントリ番号の形式で格納される。
【００２９】
図１２は、図５の例に対して求めた一括実パラメータテーブル６２の構成例である。
一括実パラメータテーブルは、一括呼出テーブル６１の呼出インタフェース情報を格納するための補助テーブルである。実返数や実引数毎に、エントリが作成され、そのエントリ番号が一括呼出テーブル６１の実返数欄や実引数集合欄に格納される。各エントリは、一括実パラメータ番号欄６２１、一括実パラメータ位置番号欄６２２、生成元実パラメータ番号欄６２３、変換前名称欄６２４、変換後名称欄６２５で構成される。
【００３０】
一括実パラメータ番号欄６２１には、一括呼出実パラメータテーブルのエントリ番号が格納される。
一括実パラメータ位置番号欄６２２には、当該実パラメータの出現位置が番号で示してある。
生成元呼出集合欄６２３には、一括化する呼出が、呼出テーブル５１のエントリ番号の集合形式で格納される。
変換前名称欄７２４には、生成元呼出における実パラメータ名称、変換後名称欄７２５には、一括呼出化後の（一括）実パラメータ名称がそれぞれ格納される。尚、変換前名称欄７２４と変換後名称欄７２５は、実返数に対してのみ有効な欄である（実引数に関しては、常に空白となる）。
【００３１】
図１３は、図５の例に対して求めた一括関数テーブル７１の構成例である。
一括関数テーブルは、一括化関数生成を行う関数毎に、エントリが作成される。各関数には、一意な一括関数番号が付与されており、その一括関数番号により各エントリがアクセス可能となっている。各エントリは、一括関数番号欄７１１、一括関数名称欄７１２、一括化係数欄７１３、生成元関数番号欄７１４、仮返数型欄７１５、仮返数名称欄７１６、仮返数集合欄７１７、仮引数数集合欄７１８で構成される。
【００３２】
一括関数番号欄７１１には、一括関数テーブルのエントリ番号、
一括関数名称欄７１２には、生成する一括関数の名称が格納される。
一括化係数欄７１３には、何回の呼出を一括化するかの個数が格納される。
生成元関数番号欄７１４には、一括化する関数が関数テーブル４１の関数番号の形式で格納される。
仮返数型欄７１５および仮返数名称欄７１６には、返数の型および返数の名称が格納される。
仮返数集合欄７１７および仮引数集合欄７１８は、当該一括関数の仮返数および仮引数の情報が、一括仮パラメータテーブル７２のエントリ番号の形式で格納される。
【００３３】
図１４は、図５の例に対して求めた一括仮パラメータテーブル７２の構成例である。
一括仮パラメータテーブルは、一括関数テーブル７１の関数インタフェース情報を格納するための補助テーブルである。仮返数や仮引数毎に、エントリが作成され、そのエントリ番号が一括関数テーブル７１の仮返数集合欄や仮引数集合欄に格納される。各エントリは、一括仮パラメータ番号欄７２１、生成元仮パラメータ番号欄７２２、一括仮パラメータ位置番号欄７２３、反復番号欄７２４、変換前名称欄７２５、変換後名称欄７２６で構成される。
【００３４】
一括仮パラメータ番号欄７２１には、一括仮パラメータテーブルのエントリ番号が格納される。
生成元仮パラメータ番号欄７２２には、生成元関数の対応する仮パラメータが、仮パラメータテーブル４２の仮パラメータ番号の形式で格納される。
一括仮パラメータ位置番号欄７２３には、当該仮パラメータの出現位置が番号で示してある。一括仮パラメータ位置番号は、仮返数と仮引数は独立に数えられ、ｉ番めの仮返数／仮引数は位置番号ｉとなる。
【００３５】
反復番号欄７２４には、一括化の際に何番めの関数の仮パラメータであったかが格納される。この反復番号は０から数え始め、一括化係数がｎのときｎ−１までの番号が付与されることになる。
変換前名称欄７２５には、生成元関数における仮パラメータ名称、変換後名称欄７２６には、一括関数化後の（一括）仮パラメータ名称がそれぞれ格納される。
以上で、本実施例で用いる各種テーブル構成の説明を終る。
【００３６】
以下、これらのテーブルを生成／参照する各種処理の説明を順に行なう。
図１５は、関数情報テーブル生成処理２１のフローチャート図である。関数情報テーブル生成処理は、中間語３を逐次走査し、関数定義を検出したらその関数定義情報を収集・登録した関数情報テーブル４を生成する。以下、その手順を説明する。
【００３７】
先ず、関数番号ｆｉ、仮パラメータ番号ｆｐｉを０で初期化する（ステップ１５０１）。
次に、未処理関数があるか否かを判定（ステップ１５０２）し、未処理関数がなくなるまで、ステップ１５０３以下の処理を行なう。
未処理関数ｆがある場合には、先ず、関数エントリＦを生成し、関数番号ｆｉをインクリメントする（ステップ１５０３）。以下、ステップ１５０４以降でこの関数エントリＦに関数ｆの関数情報を格納する。
【００３８】
先ず、ｆの定義行番号をｆｌ、関数名をｆｎとする（ステップ１５０４）。次に、仮パラメータ集合ＦＰＳを求めるため、ＦＰＳを空集合で初期化する（ステップ１５０５）。そして、未処理仮パラメータがあるか否かを判定（ステップ１５０６）し、未処理仮パラメータがなくなるまで、ステップ１５０７以下の処理を行なう。
未処理仮パラメータｆｐがある場合には、仮パラメータエントリＦＰを生成し、仮パラメータ番号ｆｐｉをインクリメントする（ステップ１５０７）。
【００３９】
次に、ｆｐの仮パラメータ情報を仮パラメータテーブル４２に格納（ステップ１５０８）し、そのエントリ番号ｆｐｉを仮パラメータ集合ＦＰＳに加える（ステップ１５０９）。以下、未処理仮パラメータがなくなるまで、ステップ１５０６からステップ１５０９を反復実行する。尚、ステップ１５０８は仮パラメータ情報である、仮パラメータ番号、仮パラメータ位置、仮パラメータ種別、仮パラメータ型、仮パラメータ名称をエントリＦＰに格納するが、詳細は略す（図８の仮パラメータテーブル４２の説明を参照）。
【００４０】
ステップ１５０６の判定がｎｏとなった時点で、仮パラメータ集合ＦＰＳの収集が終わるので、Ｆに全ての関数情報を格納（ステップ１５１０）して、次の未処理関数判定（ステップ１５０２）を行う。全ての関数を処理し終わった時点で処理を終了する。
【００４１】
図７および図８は、図５で示されたソースプログラムに対し、上記の処理を行なった結果を示す関数情報テーブルの例となっている。
例えば、エントリ４１７は、図５の００３行から００８行までで定義されている関数ｆの情報が格納されている。また、仮返数欄により、名称ｒで仮パラメータ位置０（＝仮返数）、仮パラメータ型がｉｎｔである仮返数があることが分かる。
【００４２】
図１６は、呼出情報テーブル生成処理２２のフローチャート図である。
呼出情報テーブル生成処理は、中間語３を逐次走査し、関数呼出を検出したらその呼出情報を収集・登録した呼出情報テーブル５を生成する。以下、その手順を説明する。
【００４３】
先ず、呼出番号ｃｉ、実パラメータ番号ａｐｉを０で初期化する（ステップ１６０１）。
次に、未処理関数があるか否かを判定（ステップ１６０２）し、未処理関数がなくなるまで、ステップ１６０３以下の処理を行なう。
未処理関数ｆがある場合には、先ず、その関数エントリをＦとし（ステップ１６０３）、呼出集合ＣＳを空集合で初期化する（ステップ１６０４）。
次に、ｆに未処理の関数呼出があるか否かを判定（ステップ１６０５）し、存在する場合には、ステップ１６０６以降で呼出情報の登録を行う。
呼出情報の登録に当たっては、先ず、呼出エントリＣを生成し、呼出番号ｃｉをインクリメントする（ステップ１６０６）。
次に、呼出行番号ｃｌ、呼出関数番号ｆｉを求める（ステップ１６０７）。尚、呼出関数番号ｆｉの取得は、関数情報テーブル４１を検索することで容易に実現できる（関数情報テーブルに存在しない場合には空欄としておき、一括化対象から外す）。
【００４４】
次に、実パラメータ集合ＡＰＳを求めるため、ＡＰＳを空集合で初期化する（ステップ１６０８）。そして、未処理実パラメータがあるか否かを判定（ステップ１６０９）し、未処理実パラメータがなくなるまで、ステップ１６１０以下の処理を行なう。
即ち、未処理実パラメータａｐがある場合には、実パラメータエントリＡＰを生成し、実パラメータ番号ａｐｉをインクリメント（ステップ１６１０）後、ａｐの実パラメータ情報を実パラメータテーブル５２に格納（ステップ１６１１）し、そのエントリ番号ａｐｉを実パラメータ集合ＡＰＳに加える（ステップ１６１２）。尚、ステップ１６１１は、実パラメータ情報である、実パラメータ番号、実パラメータ位置番号、実パラメータ式をエントリＡＰに格納するが、詳細は略す（図１４の実パラメータテーブル５２の説明を参照）。
【００４５】
ステップ１６０９の判定がｎｏとなった時点で、実パラメータ集合ＡＰＳの収集が終わるので、Ｃに全ての呼出情報を格納（ステップ１６１３）する（実パラメータ集合ＡＰＳは、実返数と実引数を合わせたもので、呼出情報テーブルの実返数欄と実引数欄に格納する）。その後、ＣＳにｃｉを加え（ステップ１６１４）、次の未処理呼出判定（ステップ１６０５）を行う。ステップ１６０５の判定がｎｏとなった段階で、関数ｆにおける全ての呼出を処理し終わるので、Ｆの呼出集合欄にＣＳを格納（ステップ１６１５）し、次の未処理関数の判定（ステップ１６０２）に進む。全ての関数を処理し終わった時点で処理を終了する。
【００４６】
図９および図１０は、図５のソースプログラムの例に対し、上記の処理を行なった結果を示す呼出情報テーブル５の例となっている。例えば、図９のエントリ５１６は、１７行目の関数ｆの呼出情報を格納したものである。この呼出では、実引数として（ｐ＊ｐ，ｑ＊ｑ）を渡し、実返数ｒ１に返却値が格納されることが分かる。
【００４７】
図１７は、一括呼出化情報テーブル生成処理２３のフローチャート図である。
一括呼出化情報テーブル生成処理は、呼出情報テーブル５１を逐次走査し、一括呼出化可能な呼出を検出したら、その一括呼出情報を収集・登録した一括呼出情報テーブル６および一括関数情報テーブル７を生成する。以下、その手順を説明する。
【００４８】
先ず、一括呼出番号ａｃｉ、一括実パラメータ番号ａａｐｉ、一括関数番号ａｆｉ、一括実パラメータ番号ａｆｐｉを０で初期化する（ステップ１７０１）。次に、未処理関数エントリがあるか否かを判定（ステップ１７０２）し、未処理関数エントリがなくなるまで、ステップ１７０３以下の処理を行なう。
未処理関数エントリＦがある場合には、先ず、その呼出集合をＣＳとする（ステップ１７０３）。
【００４９】
次に、ＣＳから同一関数の呼出の集合である同型呼出集合ＳＣＳの抽出処理（ステップ１７０４からステップ１７１１）に移る。この同型呼出集合ＳＣＳの抽出処理は、図３のステップ２３０に相当する。
先ず、ＣＳをワーク集合ＷＣＳに格納し、ＳＣＳを空集合で初期化する（ステップ１７０４）。
次に、ＣＳに未処理の呼出エントリＣがあるか否かを判定（ステップ１７０５）し、存在する場合には、Ｃの呼出番号ｃｉと呼出関数番号ｆｉを取得（ステップ１７０６）し、ＷＣＳからｃｉを除く（ステップ１７０７）。
次に、ＷＣＳに未処理エントリＷＣがあるかどうかを判定（ステップ１７０８）し、存在する場合には、ＷＣの呼出番号ｗｃｉと呼出関数番号ｗｆｉを取得（ステップ１７０９）する。
そして、ｆｉとｗｆｉの一致判定（ステップ１７１０）により、同一関数の呼出であるかを調べる。同一関数である場合には、ＳＣＳにｗｃｉを加えて（ステップ１７１１）からステップ１７０８に進む。
【００５０】
ステップ１７０８で全てのワーク呼出集合の要素を判定した時点で、同型呼出集合ＳＣＳの抽出が完了する。
ここで、ＳＣＳが複数要素を持つか否かを判定（ステップ１７１２）し、複数存在する場合にのみ、一括呼出化情報抽出処理（ステップ２３１。詳細は図１８を参照）を実行し、ステップ１７０５以降で次の同型呼出集合ＳＣＳの処理に進む。
ステップ１７０５で全エントリを処理したら、ステップ１７０２に進み、次の関数エントリを処理する。
全ての関数エントリを処理しおわった時点で、一括呼出化情報テーブル生成処理を終了する。
【００５１】
図１８は、一括呼出化情報抽出処理２３１のフローチャート図である。一括呼出化情報抽出処理は、与えられた同型呼出集合ＳＣＳから、一括呼出化可能な呼出を集めた一括呼出集合ＡＳＣＳを抽出し、一括呼出情報テーブル６および一括関数情報テーブル７を生成する。以下、その手順を説明する。
先ず、ＳＣＳに未処理の呼出エントリＳＣがあるか否かを判定（ステップ１８０１）し、存在する場合には、Ｃの呼出番号ｓｃｉを取得（ステップ１８０２）する。
次に、ＳＣＳをワーク集合ＷＳＣＳに格納し、ＡＳＣＳを空集合で初期化（ステップ１８０３）後、ＷＳＣＳからｓｃｉを除く（ステップ１８０４）。
次に、ＷＳＣＳに未処理エントリＷＳＣがあるかどうかを判定（ステップ１８０５）し、存在する場合には、ＷＳＣの呼出番号ｗｓｃｉを取得（ステップ１８０６）する。
【００５２】
そして、ｓｃｉとｗｓｃｉが一括呼出化可能であるかを判定（ステップ１８０７）し、可能である場合には、ＡＳＣＳにｗｓｃｉを加える（ステップ１８０８）。尚、この一括呼出化の可能性判定は、ｗｓｃｉの呼出が、ｓｃｉの呼出行番号の位置に移動可能か等により判定するが、本発明には直接関係しないので詳細は略す。
その後、ステップ１８０５に進み、次のＷＳＣの要素を調べる。
ステップ１８０５で全てのワーク呼出集合ＷＳＣＳの要素を判定した時点で、一括化呼出集合ＡＳＣＳの抽出が完了する。
ここで、ＡＳＣＳが複数要素を持つか否かを判定（ステップ１８０９）し、複数存在する場合にのみ、一括呼出化情報登録処理（ステップ２３２。詳細は図１９を参照）を実行する。
次に、ＳＣＳからＡＳＣＳを除き（ステップ１８１０）、ステップ１８０１以降で次の同型呼出集合ＳＣＳの要素の処理に進む。
ステップ１８０１で全エントリを処理しおわった時点で、一括呼出化情報抽出処理を終了する。
【００５３】
図１９は、一括呼出化情報登録処理２３２のフローチャート図である。一括呼出化情報登録処理は、与えられた一括呼出集合ＡＳＣＳから、一括呼出情報テーブル６および一括関数情報テーブル７を生成する。以下、その手順を説明する。
先ず、一括呼出エントリＡＣを作成して、ａｃｉをインクリメント（ステップ１９０１）後、ＡＳＣＳをＡＣの生成元呼出集合欄に格納する（ステップ１９０２）。
次に、ＡＳＣＳの（任意の要素の）呼出関数番号をｃｉ、要素数をａｆｋとする（ステップ１９０３）。
そして、ＡＳＣＳから変数となる型ａｒｔを生成（ステップ１９０４）し、ＡＣの返数型欄に格納（ステップ１９０５）する。尚、ａｒｔは、本実施例では配列型としたが、生成元呼出集合の全ての返数が表現できる型であれば何でもよい。その生成方法は、本発明にとって本質的ではないので詳細は略す。
【００５４】
次に、一意な名称ａｃｎを生成し、ＡＣの実返数名称欄に格納（ステップ１９０６）後、一括呼出実引数登録処理（ステップ２３３。詳細は図２０）、一括呼出実返数登録処理（ステップ２３４。詳細は図２１）を行って、一括呼出情報テーブルへの登録を完了する。
【００５５】
次に、一括関数情報テーブルへの登録を行う。
先ず、一括関数エントリＡＦを作成して、ａｆｉをインクリメントする（ステップ１９１１）。
次に、ａｒｔをＡＦの返数型欄に格納（ステップ１９１２）する。
次に、一意な名称ａｆｎを生成し、ＡＦの仮返数名称欄に格納（ステップ１９１３）後、一括化係数欄にａｆｋを格納する（ステップ１９１４）。
最後に、一括関数仮引数登録処理（ステップ２３５。詳細は図２２）、一括関数仮返数登録処理（ステップ２３６。詳細は図２３）を行って、一括関数情報テーブルへの登録を完了する。
以上で、一括呼出情報テーブルと一括関数情報テーブルへの登録が完了するので、一括化呼出情報登録処理を終了する。
【００５６】
図２０は、一括呼出実返数登録処理２３３のフローチャート図である。一括呼出実返数登録処理は、一括呼出テーブルの各エントリにある生成元呼出集合から一括実パラメータテーブル６２を生成し、一括呼出テーブル６１の一括呼出実返数集合欄６１６に登録する。以下、その手順を説明する。
先ず、一括呼出実返数集合ＡＡＲＳを空集合で初期化する（ステップ２００１）。また、一括呼出実返数位置番号ａａｐｒも０で初期化する（ステップ２００２）。
【００５７】
そして、ＡＣの生成呼出元集合をＡＣＳ（ステップ２００３）として、ＡＣＳに未処理の呼出Ｃが存在かを判定（ステップ２００４）し、存在するときは、ステップ２００５からステップ２０１３を反復実行する。
この反復処理では、Ｃの実返数集合をＡＲＳ（ステップ２００５）とし、ＡＲＳに未処理の実返数ＡＲが存在かを判定（ステップ２００６）し、存在するときは、ステップ２００７からステップ２０１２を反復実行する。
この反復処理では、先ず、ａａｐｒをインクリメントする（ステップ２００７）。
次に、ＡＲの実パラメータ番号と名称を、それぞれａｒｉ、ａｒｎとする（ステップ２００８）。
次に、一括呼出実返数名称ａａｒｎを生成（ステップ２００９）する。
次に、一括呼出実パラメータエントリＡＡＰを生成し、ａａｐｉをインクリメント（ステップ２０１０）する。
【００５８】
次に、ＡＡＰに一括呼出実返数情報を登録（ステップ２０１１）後、ＡＡＲＳにａａｒｉを加える（ステップ２０１２）。
ステップ２００６の判定でｎｏとなったとき、全ての実返数の処理が終了するので、ＡＡＲＳをＡＣの一括呼出実返数集合欄に格納（ステップ２０１３）後、次のＡＣＳの処理を行う。
ＡＣＳの全ての呼出エントリを処理し終えた時点で、一括呼出実返数登録処理２３３が終了する。
【００５９】
図２１は、一括呼出実引数登録処理２３４のフローチャート図である。一括呼出実引数登録処理は、一括呼出テーブルの各エントリにある生成元呼出集合から一括実パラメータテーブル６２を生成し、一括呼出テーブル６１の一括呼出実引数集合欄６１７に登録する。以下、その手順を説明する。
【００６０】
先ず、一括呼出実引数集合ＡＡＡＳを空集合で初期化する（ステップ２１０１）。また、一括呼出実引数位置番号ａａｐａも０で初期化する（ステップ２１０２）。
そして、ＡＣの生成呼出元集合をＡＣＳ（ステップ２１０３）として、ＡＣＳに未処理の呼出Ｃが存在かを判定（ステップ２１０４）し、存在するときは、ステップ２1０５からステップ２１１３を反復実行する。
この反復処理では、Ｃの実引数集合をＡＰＳ（ステップ２１０５）とし、ＡＰＳに未処理の実引数ＡＰが存在かを判定（ステップ２１０６）し、存在するするときは、ステップ２１０７からステップ２１１２を反復実行する。
この反復処理では、先ず、ａａｐaをインクリメントする（ステップ２１０７）。
【００６１】
次に、ＡＰの実パラメータ番号と名称を、それぞれａｐｉ、ａｐｎとする（ステップ２１０８）。
次に、一括呼出実引数名称ａａｐｎを生成（ステップ２１０９）する。
次に、一括呼出実パラメータエントリＡＡＰを生成し、ａａｐｉをインクリメント（ステップ２１１０）する。
次に、ＡＡＰに一括呼出実引数情報を登録（ステップ２１１１）後、ＡＡＡＳにａａｐｉを加える（ステップ２１１２）。
ステップ２１０６の判定でｎｏとなったとき、全ての実引数の処理が終了するので、ＡＡＡＳをＡＣの一括呼出実返数集合欄に格納（ステップ２０１３）後、次のＡＣＳの処理を行う。
ＡＣＳの全ての呼出エントリを処理し終えた時点で、一括呼出実引数登録処理２３４が終了する。
【００６２】
図１１および図１２は、図９および図１０で示された呼出情報テーブル５を入力とし、図２０および図２１の処理を行なった結果を示す一括呼出情報テーブル６の例となっている。
例えば、図１１のエントリ６１８は、図９に示された呼出テーブル５１のエントリ５１６とエントリ５１７の呼出が一括呼出化可能であることを示している。この一括呼出は、実引数として（ｐ＊ｐ，ｑ＊ｑ，ｐ＊ｑ，ｑ＊ｐ）を渡し、型がｉｎｔの配列である実返数ａｒｆに返却値を格納すればよいことが分かる。
【００６３】
図２２は、一括関数仮返数登録処理２３５のフローチャート図である。一括関数仮返数登録処理は、一括関数テーブルの各エントリにある生成元関数番号と一括化係数から、一括仮パラメータテーブル７２を生成し、一括関数テーブル７１の一括関数仮返数集合欄７１７に登録する。以下、その手順を説明する。
先ず、一括関数仮返数集合ＡＦＲＳを空集合で初期化する（ステップ２２０１）。また、一括関数仮返数位置番号ａｆｐｒも０で初期化する（ステップ２２０２）。
次に、ＡＦの一括化係数をａｆｋ、ＡＦの生成元関数エントリをＦとする（ステップ２２０３。両者は、既に一括呼出化常用登録処理で算出済み）。
さらに、反復回数ｉを０で初期化（ステップ２２０４）後、ｉがａｆｋと等しいかどうかを判定（ステップ２２０５）することで、ｉがａｆｋと等しくなるまで、ステップ２２０６からステップ２２１２を反復実行する。
この反復処理では、先ず、ｉをインクリメントする（ステップ２２０６）。
次に、ａｆｐｒをインクリメントする（ステップ２２０７）。
【００６４】
次に、ＡＦの仮パラメータ番号と名称を、それぞれｆｐｉ、ｆｐｎとする（ステップ２２０８）。
次に、一括関数仮返数名称ａｆｐｎを生成（ステップ２２０９）する。
次に、一括関数仮パラメータエントリＡＦＰを生成し、ａｆｐｉをインクリメント（ステップ２２１０）する。
次に、ＡＦＰに一括関数仮返数情報を登録（ステップ２２１１）後、ＡＦＲＳにａｆｐｉを加える（ステップ２２１２）。
ステップ２２０５の判定でｎｏとなったとき、ＡＦＲＳをＡＦの一括関数仮返数集合欄に格納（ステップ２２１３）し、一括関数仮返数登録処理２３５が終了する。
【００６５】
図２３は、一括関数仮引数登録処理２３６のフローチャート図である。一括関数仮引数登録処理は、一括関数テーブルの各エントリにある生成元関数番号と一括化係数から、一括仮パラメータテーブル７２を生成し、一括関数テーブル７１の一括関数仮引数集合欄７１８に登録する。以下、その手順を説明する。
先ず、一括関数仮引数集合ＡＦＡＳを空集合で初期化する（ステップ２３０１）。また、一括関数仮引数位置番号ａｆｐａも０で初期化する（ステップ２３０２）。
次に、ＡＦの一括化係数をａｆｋ、ＡＦの生成元関数エントリをＦとする（ステップ２３０３。両者は、既に一括呼出化常用登録処理で算出済み）。
【００６６】
さらに、反復回数ｉを０で初期化（ステップ２３０４）後、ｉがａｆｋと等しいかどうかを判定（ステップ２３０５）することで、ｉがａｆｋと等しくなるまで、ステップ２３０６からステップ２３１５を反復実行する。
この反復処理では、先ず、ｉをインクリメントする（ステップ２２０６）。次に、Ｆの仮引数集合をＦＡＳ（ステップ２３０７）とし、ＦＡＳに未処理の仮引数ＦＡが存在かを判定（ステップ２３０８）し、存在するするときは、ステップ２３０９からステップ２３１４を反復実行する。
この反復処理では、先ず、ａｆｐａをインクリメントする（ステップ２３０９）。
次に、ＡＦの仮パラメータ番号と名称を、それぞれｆｐｉ、ｆｐｎとする（ステップ２３１０）。
【００６７】
次に、一括関数仮引数名称ａｆｐｎを生成（ステップ２３１１）する。
次に、一括関数仮パラメータエントリＡＦＰを生成し、ａｆｐｉをインクリメント（ステップ２３１２）する。
次に、ＡＦＰに一括関数仮引数情報を登録（ステップ２３１３）後、ＡＦＡＳにａｆｐｉを加える（ステップ２３１４）。
ステップ２３０８の判定でｎｏとなったとき、全ての仮引数の処理が終了するので、ＡＦＡＳをＡＦの一括関数仮引数集合欄に格納（ステップ２３１５）後、
次の反復処理を行う。
ｉとａｆｋが一致するまで、一括化係数分反復実行した時点で、一括関数仮引数登録処理２３６が終了する。
【００６８】
図１３および図１４は、図７から図１０に示された関数情報テーブル４および呼出情報テーブル５を入力とし、図１７から図２３の処理を行なった結果を示す一括関数情報テーブル７の例となっている。
例えば、図１３のエントリ７１９は、図７に示された関数テーブル４１のエントリ４１７を、一括化係数２で一括関数化したものであることを示している。この一括関数は、仮引数として（ｘ０，ｙ０，ｘ１，ｙ１）を渡され、型がｉｎｔの配列である仮返数ｆｒｆを返却する。
以上で、一括呼出化情報テーブル生成処理２３の説明を終える。
一括呼出化情報テーブル生成処理２３の後、一括化関数生成処理２４、一括呼出化変換処理２５を行う。以下順に説明する。
【００６９】
図２４は、一括関数生成処理２４のフローチャート図である。
一括関数生成処理は、一括関数情報テーブル７の各エントリから、中間語３中に一括関数を生成する。以下、その手順を説明する。
未処理の一括関数エントリＡＦがあるかどうかを調べ（ステップ２４０１）、存在する全ての一括関数エントリに対し、ステップ２４０２以下の処理を行う。
先ず、ＡＦの生成元関数番号をｆｉ、ＡＦの一括化係数をａｆｋとする（ステップ２４０２）。
次に、ＡＦの仮返数集合ＡＦＲＳから、一括関数仮返数定義を生成する（ステップ２４０３）。
次に、ＡＦの仮引数集合ＡＦＡＳから、一括関数仮引数定義を生成する（ステップ２４０４）。
以上で、一括関数の呼出インタフェースの生成が完了するので、次に、ステップ２４０６からステップ２４１３で一括関数本体を生成する。
【００７０】
先ず、定義部反復回数ｍを０で初期化（ステップ２４０５）後、ｆｉの定義部ＦＢをコピーする（ステップ２４０６）。
次に、このコピーされた領域ＦＢ中の仮パラメータの変換を行う。
先ず、ＡＦの仮返数集合ＡＦＲＳと仮引数集合ＡＦＡＳの和集合を一括関数仮パラメータ集合ＡＦＰＳとする（ステップ２４０７）。
ＡＦＰＳに未処理の仮パラメータエントリＡＦＰがあるか否かを判定（ステップ２４０８）し、存在する場合はステップ２４０９以降で仮パラメータの変換を行う。
【００７１】
先ず、ＡＦＰの変換前名称をｂｃｎ、変換後名称をａｃｎとする（ステップ２４０９）。
次に、ＦＢに出現する名称ｂｃｎをａｃｎに変換する（ステップ２４１０）後、さらに次の仮パラメータの変換のため、ステップ２４０８に進む。
ＡＦＰＳの全要素の処理が終わったら、ｍをインクリメント（ステップ２４１１）し、一括化係数ａｆｋと一致するか否かを判定する（ステップ２４１２）。
一致しない間は、ステップ２４０６に進み反復実行し、ｍがａｆｋと一致したら、ＡＦの仮返数名称を用いて一括関数仮返数返却を生成（ステップ２４１３）し、ステップ２４０１に進んで次の一括関数エントリの処理を行う。
全ての一括関数エントリの処理が終わった時点で、一括関数生成処理が終了する。
【００７２】
図６は、図１３および図１４に示された一括関数情報テーブル７を入力とし、図２４の処理を行なった結果を示す（ソースプログラム形式による）中間語の例となっている。図６の３０行から３６行に、図１３のエントリ７１９で示された一括関数が生成されている。３２行には、一括関数仮返数定義が、３５行には一括関数仮返数返却分が、それぞれ生成されている。
【００７３】
図２５は、一括呼出化変換処理２５のフローチャート図である。
一括呼出化変換処理は、一括呼出情報テーブル６を使って、中間語３中の一括化可能同型呼出を、一括呼出に変換する。以下、その手順を説明する。
未処理の一括呼出エントリＡＣがあるかどうかを調べ（ステップ２５０１）、存在する全ての一括呼出エントリに対し、ステップ２５０２以下の処理を行う。
先ず、ＡＣの実返数名称から、一括呼出実返数定義を生成する（ステップ２５０２）。
次に、ＡＣの生成元呼出集合をＣＳ、ＡＣの実返数集合をＡＡＰＳとする（ステップ２５０３）。
次に、ＣＳの任意の要素Ｃを取り出し、呼出関数番号ｆｉを取得する（ステップ２５０４）。
【００７４】
次に、Ｃの呼出行番号から一括呼出生成位置ａｃｐを決定する（ステップ２５０５）。この一括呼出生成位置は、一括呼出化の可能性の判定により保証された場所ならば、何処でもよいが、本実施例では、便宜上、最初の呼出位置の直前とする。
次に、ＡＣから一括呼出を作成し、ａｃｐに生成する（ステップ２５０６）。尚、一括呼出の作成は、一括呼出実返数名称、一括関数番号、一括呼出実引数集合により容易に作成できるので、詳細は略す。
次に、ＣＳに未処理の呼出エントリＣがあるかどうかを判定（ステップ２５０７）し、存在する場合は、Ｃの呼出行番号を見て呼出を削除する（ステップ２５０８）。
【００７５】
ＣＳの全ての呼出を削除したら、ＡＡＲＳに未処理の一括呼出実引数エントリＡＡＲがあるかどうかを判定（ステップ２５０９）し、存在する場合には、ステップ２５１０からステップ２５１１で、実返数の変換を行う。
ＡＡＲの変換前名称をｂｃｎ、変換後名称をａｃｎ（ステップ２５１０）とし、ｆｉの関数本体領域に出現するｂｃｎをａｃｎに変換する（ステップ２５１１）。
ステップ２５０９の判定のｎｏとなったとき、全ての一括呼出実返数の変換が終わり、ＡＣに関する変換処理が終了する。
その後、ステップ２５０１に進み、次の一括呼出エントリの一括呼出化変換処理を行う。
全ての一括呼出エントリの処理が終わった時点で、一括呼出化変換処理が終了する。
【００７６】
図６は、図１３および図１４に示された一括呼出情報テーブル６を入力とし、図２５の処理を行なった結果を示す（ソースプログラム形式による）中間語の例となっている。
図５の１７行と２１行の２つの関数呼出が、図６では１６行の一括呼出へと変換されている。また、１４行には、一括呼出実返数定義生成されている。
本実施例によれば、関数呼出の回数を低減するにより、呼出オーバヘッドの削減を行うことが可能となり、オブジェクトプログラムの処理性能の向上に寄与する。
【００７７】
図２６に本発明の他の本実施例のコンパイラの処理手順を示す。図２６は、ループ展開時に一括関数呼出化する第２の実施例と、ループ並列化時に一括関数呼出化する第３の実施例に共通のコンパイラの処理手順である。
構文解析２０１とコード生成処理２０３は図のそれと同様なので説明は省略する。
ループ展開ないしループ並列化と一括関数呼出化処理２０５は中間後３を入力とし、反復処理を直列／並列展開するループ展開／ループ並列化を行い、一括関数呼出化変換を行った中間語３を出力する。ループ展開の場合が第２実施例、ループ並列化の場合が第３実施例となる。尚、処理の大部分とテーブル構成等は、第１の実施例と同様であるため、以下では、概略のみを説明する。
【００７８】
図２７は、本発明の第２の実施例であるループ展開時に一括関数呼出化する場合の図２６のステップ２０５の処理手順を示すフローチャート図である。以下順に説明する。
先ず、関数情報テーブル生成処理２７０１により、ソースプログラム中の関数定義の各種情報を抽出し、関数情報テーブル４を作成する。
次に、未処理のループＬが存在するか否かを判定（ステップ２７０２）し、存在する場合にはステップ２７０３からステップ２７１０までを反復実行する。
この反復処理では、先ずＬがループ展開可能であるかどうかを判定（ステップ２７０４）し、存在する場合にはステップ２７０５からステップ２７１０までを反復実行する。
【００７９】
この反復処理では、先ずＬをｎ回ループ展開を行う（ステップ２７０４）。
次に、呼出情報テーブル生成処理（ステップ２７０５）を行い、関数呼出に関する情報を抽出し、呼出情報テーブル５を生成する。
次に、Ｌからループ展開によって生成されたｎ個の同一関数の呼出を同型呼出集合ＳＣＳとする（ステップ２７０６）。
次に、同型呼出集合ＳＣＳから一括呼出化可能な呼出を抽出する一括呼出抽出処理（ステップ２７０７）を行い、抽出した一括呼出を一括呼出情報テーブルに登録する（ステップ２７０８）。
ステップ２７０２でｎｏとなった時、全てのループに対して処理を終え、一括関数生成処理（ステップ２７０９）、一括呼出化変換処理（ステップ２７１０）を行い、処理を終了する。
【００８０】
尚、以下に図２７のステップと図３のステップの対応関係を示す。以下の各ステップの処理内容は、括弧内で示した図３のステップと同様の処理である。
ステップ２７０１（図３のステップ２１）
ステップ２７０５（図３のステップ２２）
ステップ２７０９（図３のステップ２３１）
ステップ２７１０（図３のステップ２３２）
ステップ２７１１（図３のステップ２４）
ステップ２７１２（図３のステップ２５）
【００８１】
図２８は、本発明の第２および第３実施例の説明のためのソースプログラム例である。本実施例においても、中間語３ソースプログラムの形式で記述するものとする。従って、図２８はソースプログラム例であるとともに、（一括呼出化前の）中間語３１の例ともなっている。図２８のプログラム例では、関数ｇの１６行から２２行に、関数ｆを呼び出すループが存在する。
【００８２】
図２９は、図２７のステップ２７０４でループ展開した段階の中間語３を、ソースプログラム形式で表現したものである。図２８の１６行から２２行に関数呼出を含むループが、図２９の１６行から２２行へとループ展開されている。
【００８３】
図３０は、図２７ソースプログラムを入力とし、図２６の処理を行った結果の中間語３を、ソースプログラム形式で表現したものである。図２９の１６行から２２行のループ展開中の関数ｆの呼出が一括関数呼出化されている。
本実施例によれば、ループ展開時に一括関数呼出化を行うため、ループ展開数に応じた効率的な一括関数呼出化を行うことができる。
【００８４】
図３１は、本発明の第３の実施例であるループ並列化時に一括関数呼出化する場合の図２６のステップ２０５の処理手順を示すフローチャート図である。本処理手順は、図２７とほとんど同じなので説明は省略する（ステップ３１０４とステップ３１０６が若干異なる）。
【００８５】
図３２は、図３０のステップ２７０４でループ並列化した段階の中間語３を、ソースプログラム形式で表現したものである。図３２の１８行および１９行にある「ｘ｜｜ｙ；」の記法は、ｘとｙが並列に実行されることを示している。図２８の１６行から２２行に関数呼出を含むループが、図３２の１６行から２１行へとループ並列化されていることが分かる。
【００８６】
図３３は、図２７のソースプログラムを入力とし、図３０の処理を行った結果の中間語３を、ソースプログラム形式で表現したものである。図３１の１６行から２２行のループ並列化中の関数ｆの呼出が一括関数呼出化されていることがわかる。
本実施例によれば、ループ並列化時に一括関数呼出化を行うため、ループの並列度に応じた効率的な一括関数呼出化を行うことができる。
【００８７】
【発明の効果】
本発明によれば、コンパイラないしソース変換プログラムは、同一関数の呼出を一括関数の呼出に変換することができる。即ち、インライン展開によらない呼出オーバヘッドの削減手段を提供している。、
これにより、インライン展開が適用できない場合にも、呼出オーバヘッドの削減を行うことが可能となり、オブジェクトプログラムの処理性能の向上に寄与する。
【図面の簡単な説明】
【図１】本発明を実施するコンパイラが稼働する計算機システムの構成図である。
【図２】本発明を実施するコンパイラの処理手順を示すフローチャート図である。
【図３】コンパイラの一括関数呼出化処理２のフローチャート図である。
【図４】本発明におけるコンパイラの一括関数呼出化処理２の処理構成図である。
【図５】ソースプログラムの例（一括関数呼出化前の中間語３１のソースプログラム形式による構成例）である。
【図６】一括関数呼出化後の中間語３２のソースプログラム形式による構成例である。
【図７】関数テーブル４１の構成例である。
【図８】仮パラメータテーブル４２の構成例である。
【図９】呼出テーブル５１の構成例である。
【図１０】実パラメータテーブル５２の構成例である。
【図１１】一括呼出テーブル６１の構成例である。
【図１２】一括実パラメータテーブル６２の構成例である。
【図１３】一括関数テーブル７１の構成例である。
【図１４】一括仮パラメータテーブル７２の構成例である。
【図１５】関数情報テーブル生成処理２１のフローチャート図である。
【図１６】呼出情報テーブル生成処理２２のフローチャート図である。
【図１７】一括呼出化情報テーブル生成処理２３のフローチャート図である。
【図１８】一括呼出化情報抽出処理２３１のフローチャート図である。
【図１９】一括呼出化情報登録処理２３２のフローチャート図である。
【図２０】一括呼出実返数登録処理２３３のフローチャート図である。
【図２１】一括呼出実引数登録処理２３４のフローチャート図である。
【図２２】一括関数仮返数登録処理２３５のフローチャート図である。
【図２３】一括関数仮引数登録処理２３４のフローチャート図である。
【図２４】一括関数生成処理２４のフローチャート図である。
【図２５】一括呼出化変換処理２５のフローチャート図である。
【図２６】第２および第３実施例のコンパイラの処理手順である。
【図２７】ループ展開用一括関数呼出化処理のフローチャート図である。
【図２８】第２および第３実施例の説明のためのソースプログラムの例である。
【図２９】ループ展開後の中間語のソースプログラム形式による構成例である。
【図３０】ループ展開用一括関数呼出化変換後の中間語のソースプログラム形式による構成例である。
【図３１】ループ並列化用一括関数呼出化処理のフローチャート図である。
【図３２】ループ展開後の中間語３のソースプログラム形式による構成例である。
【図３３】ループ並列化用一括関数呼出化変換後の中間語のソースプログラム形式による構成例である。[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a program conversion technique such as a compiler, and in particular, for a language having a function call mechanism, collective function calls at the time of program conversion for analyzing the relationship between functions and function calls and generating them as function information and call information. It relates to the conversion method.
[0002]
[Prior art]
In programming in a high-level language, structuring is generally performed by a function (also called a procedure, but hereinafter unified by function). A function is a series of processes described in a parameterized form (hereinafter referred to as a function definition), and by calling it (hereinafter simply referred to as a call), the process within the function definition can be realized. The parameters on the function definition side include dummy arguments and return variables (hereinafter referred to as temporary return numbers). In the description in the function definition, processing is performed using dummy arguments and temporary return numbers (hereinafter referred to as temporary parameters). Is described. Caller parameters include actual arguments and return variables (hereinafter referred to as actual return numbers), and the parameters are specified using the caller variables / expressions (the variables / expressions used as this call interface are the actual parameters). Called). Hereinafter, the temporary parameter and the actual parameter are collectively referred to as an interface variable / expression. Various values can be specified for the actual parameter, and the same process (= function definition) can be repeatedly executed.
[0003]
Thus, while functions are a convenient mechanism for programming, they have a problem that overhead is required at the time of calling. That is, processing performance is reduced by passing arguments, updating / saving / recovering the stack pointer at the time of calling, and increasing the possibility of a cache miss accompanying an instruction jump. In particular, when a function with simple processing contents is repeatedly called, the call overhead cannot be ignored in terms of performance.
[0004]
In order to cope with such a situation, high-level languages often have an inline expansion function. Inline expansion replaces a function call with the contents of the function definition body and suppresses the call. Specifically, it is realized by expanding the contents of the function definition at each call point and replacing the temporary parameter with the actual parameter. For inline development, see Masataka Sasa's Iwanami lecture, Software Science 5 "Programming Language Processor" (Iwanami Shoten, 1989) Details on 450-451.
[0005]
[Problems to be solved by the invention]
Inline expansion is almost the only means of reducing call overhead in high-level languages. However, if the code size becomes too large due to the application of inline expansion, the chance of a cache miss increases and the performance may decrease. For this reason, inline expansion is usually restricted when a large function is called or there are many call locations. That is, when the code amount is extremely increased, the call overhead cannot be reduced by inline expansion. That is, the conventional technique has a problem that there is no means for reducing the call overhead when inline expansion cannot be applied.
An object of the present invention is to provide a means for reducing call overhead that does not depend on inline expansion, so that even when inline expansion cannot be applied, call overhead can be reduced and the processing performance of the object program can be improved. It is to provide a batch function call method.
[0006]
[Means for Solving the Problems]
Inline expansion expands the processing contents of the function definition at each call point and removes the call (at that call point). Therefore, whether or not inline expansion is applied determines whether or not the number of calls at each call point is zero. In the case of multiple calls to the same function, even if inline expansion cannot be applied, if the processing for multiple calls is expanded on the function definition side, the number of calls can be reduced without significantly increasing the code size. It is. In other words, by creating a batch function that executes multiple times of processing of the same function at once and providing a batch call generation means for replacing multiple calls of the same function with a single batch function call, inline expansion It is possible to provide a means for reducing call overhead that does not depend on.
As a result, even when inline expansion cannot be applied, it is possible to reduce the call overhead and contribute to the improvement of the processing performance of the object program.
[0007]
DETAILED DESCRIPTION OF THE INVENTION
The present invention relates to a method for converting a plurality of calls of the same function into a call of a batch function for batch processing.
Hereinafter, an embodiment of the present invention will be described with reference to the drawings.
Note that the present invention can be similarly applied when generating a converted source program from a source program and when the compiler converts an intermediate language into an intermediate language after conversion. Will be explained (however, the intermediate language is shown in the form of an equivalent source program).
[0008]
FIG. 1 is a configuration diagram of a computer system in which a compiler that implements the compiling method of the present invention operates. The computer system includes a CPU 101, a display device 102, a keyboard 103, a main storage device 104, and an external storage device 105.
A compiler activation command from the user is received from the keyboard 103. The compiler end message and the error message are displayed on the display device 102. A source program 106 and an object program 107 are stored in the external storage device 105.
The main storage device 104 includes a compiler 108, which stores an intermediate language 3, a function information table 4, a call information table 5, a collective call information table 6, and a collective function information table 7 that are necessary in the compiling process.
The compilation process is controlled by the CPU 101.
[0009]
FIG. 2 shows the processing procedure of the compiler of this embodiment.
In the syntax analysis 201, the lexical syntax analysis is performed with the source program 106 as an input, and the intermediate language 3 is output.
The loop expansion process 202 receives 3 after the middle, performs loop expansion that expands the iterative process to reduce the number of iterations, and outputs the intermediate word 3.
The batch functioning process 2 receives the intermediate language 3 as input, performs batch functioning processing that converts a plurality of function calls into batch command function calls, and outputs the intermediate language 3.
The code generation process 203 converts the intermediate language into the final object program 107 format.
The loop expansion 202 is a desirable process for increasing the application opportunities of the present invention, but is not an essential process.
[0010]
FIG. 3 is a flowchart showing an embodiment of the function call batching method according to the present invention, and shows the procedure of the function call batching process 2 of FIG.
First, the function information table generation process 21 extracts various information of the function definition in the source program and creates the function information table 4. Similarly, the call information table generation process 22 extracts information related to function calls and generates the call information table 5.
[0011]
Next, the batch call information table generation processing 23 extracts information on a batch function for batch processing a plurality of identical function calls from the call information table 5 to generate a batch function information table 6. The batch call information table generation processing 23 includes a homomorphic call extraction processing 230 that extracts information on the same function call, a batch call extraction processing 231 that extracts calls that can be batch-called from the same-type call set, and batches the extracted batch calls. It consists of a batch call information table registration process 232 to be registered in the call information table. Each process will be described in detail with reference to FIGS.
[0012]
Next, a collective function generation process 24 generates a collective function 3 after the middle based on the collective function information table 7.
Finally, the function call in the intermediate language 3 is converted into a batch function call based on the batch call information table 6 by the batch call conversion process 25.
[0013]
FIG. 4 is a processing configuration diagram showing the relationship between the function call batching related processing shown in FIG. 3 and intermediate words and tables as input / output. However, since the intermediate language 3 is generally referred to, only the main input / output relationships are shown.
The function information table generation process 21 receives the intermediate language 31 before calling a batch function as an input, and generates a function information table 4 storing function definition information. The function information table 4 includes a function table 41 that stores a list of functions, and a temporary parameter table 42 that stores information on a dummy argument and a temporary return number serving as a function interface.
The call information table generation process 22 receives the pre-call intermediate language 31 and generates a call information table 5 storing function call information. The call information table 5 includes a call table 51 that stores a list of function calls, and an actual parameter table 52 that stores information on actual arguments and actual return numbers serving as function call interfaces.
[0014]
The batch call information table generation processing 23 receives the call information table 5 and receives a batch call information table 6 that stores information on function calls that can be batch-called, and a batch function that stores information on function definitions to be collectively called. An information table 7 is generated. The batch call information table 6 includes a batch call table 61 that stores a list of calls that can be batch-called, and an actual parameter table 62 that stores information on actual arguments and the number of actual returns serving as a batch call interface. Similarly, the collective function information table 7 includes a collective function table 71 that stores a list of function definitions to be collectively called, and a formal parameter table 72 that stores information on dummy arguments and provisional numbers that serve as an interface for the collective function. Become.
[0015]
The batch function generation process 24 receives the intermediate language 31 before batch calling and the batch function information table 7 as input, and generates a batch function in the intermediate language 32 after batch calling.
The batch call conversion processing 25 receives the intermediate language 31 before batch calling and the batch call information table 6 as input, converts the function call of the intermediate language 31 before batch calling into a batch function call, and performs the intermediate language 32 after batch calling. To generate.
[0016]
Next, in order to explain a specific application example, an example of a source program to be processed is given.
FIG. 5 is an example of a source program (an example of a program written in C language) for explaining an embodiment of the present invention.
In this embodiment, the intermediate language 3 has information equivalent to that of the source program and does not need to be in the form of an intermediate language. Therefore, in the example of the intermediate language 3, it is described in the form of a source program. Therefore, FIG. 5 is an example of the source program and also an example of the intermediate language 31 (before the batch call).
Further, as described in the description of FIG. 2, in this embodiment, loop expansion (essentially unnecessary) is performed. In FIG. 5, loop expansion is not applied, but it can also be applied without loss of generality when a call to the same function is generated after loop expansion.
In the example of the program shown in FIG. 5, there are three functions f, g, and main. The function main calls the function g once, and the function g calls the function f twice. Calling the function f on lines 17 and 20 is a function that can be called collectively.
[0017]
FIG. 6 shows an example of a source program after the batch function call with respect to the source program example of FIG. 5, and corresponds to the intermediate language 32 after the batch function call. In FIG. 6, a collective function F that was not shown in FIG. 5 is newly generated (from line 30 to line 36). In FIG. 5, the function g called the function f twice, but in FIG. Performs processing equivalent to two calls of function f The batch function F is called once (16 lines). That is, this is an example of the result of performing batch call conversion by converting two calls of the function f into one call of the batch function F.
[0018]
Hereinafter, details of the batch function call processing 2 will be described.
In the description, the intermediate language in FIG. 5 is used as an input example, and output examples of various tables are sequentially shown. Therefore, first, a configuration example of various tables will be described, and then a processing procedure will be described.
Hereinafter, configuration examples of the function information table 4, the call information table 5, the collective call information table 6, and the collective function information table 7 will be described in this order.
[0019]
FIG. 7 is a configuration example of the function table 41 obtained for the example of FIG.
In the function table, an entry is created for each function definition that appears in the program. Each function is given a unique function number, and each entry can be accessed by the function number. Each entry includes a function number column 411, a function name column 412, a definition line number column 413, a provisional return number column 414, a formal argument set column 415, and a call set column 416.
[0020]
The function number column 411 stores the entry number of the function table, and the function name column 412 stores the name of the corresponding function.
The definition line number column 413 stores function definition information in the intermediate language 3 of the function. With this definition line number column, all information of the corresponding function definition can be obtained. For example, it is possible to acquire various interface information of a function and read / copy the function definition body. In this embodiment, as shown in FIG. 7, the line numbers of the source program are shown.
The temporary return number column 414 and the temporary argument set column 415 store the temporary return number and the dummy argument information of the function in the form of entry numbers in the temporary parameter table 42.
In the call set column 416, information on function calls existing in the function is stored in the entry number format of the call table 51.
[0021]
FIG. 8 is a configuration example of the temporary parameter table 42 obtained for the example of FIG.
The temporary parameter table is an auxiliary table for storing the function interface information of the function table 41. An entry is created for each temporary return number or dummy argument, and the entry number is stored in the temporary return number column or dummy argument set column of the function table 41.
Each entry includes a temporary parameter number column 421, a temporary parameter position number column 422, a temporary parameter type column 423, and a temporary parameter variable column 424.
[0022]
The temporary parameter number field 421 stores the entry number of the temporary parameter table.
In the temporary parameter position number field 422, the appearance position of the temporary parameter is indicated by a number. The position number of the temporary return number is 0, and the i-th dummy argument is the position number i.
The temporary parameter type field 423 stores temporary parameter type information.
The temporary parameter variable column 424 stores the name of the corresponding temporary parameter.
[0023]
FIG. 9 is a configuration example of the call table 51 obtained for the example of FIG.
An entry is created in the call table for each function call that appears in the program. Each entry includes a call number column 511, a call line number column 512, a call function number column 513, an actual return number column 514, and an actual argument set column 515.
[0024]
The call number column 511 stores the entry number of the call table, and the call line number column 512 stores the line number of the corresponding function call.
The call function number column 513 stores the function number of the function table 41 for the function called by the call.
The actual return number column 514 and the actual argument set column 515 store the actual return number and actual argument information of the call in the form of entry numbers in the actual parameter table 52.
[0025]
FIG. 10 is a configuration example of the actual parameter table 52 obtained for the example of FIG.
The actual parameter table is an auxiliary table for storing call interface information of the call table 51. An entry is created for each actual return number or actual argument, and the entry number is stored in the actual return number column or actual argument set column of the call table 51. Each entry includes an actual parameter number field 521, an actual parameter position number field 522, and an actual parameter expression field 522.
[0026]
The actual parameter number column 521 stores the entry number of the actual parameter table.
In the actual parameter position number column 522, the appearance position of the actual parameter is indicated by a number. The position number of the actual return number is 0 (similar to the temporary parameter position number field 422 of the temporary parameter table 42), and the i-th actual argument is the position number i.
The actual parameter expression field 523 stores the name of the corresponding actual parameter.
[0027]
FIG. 11 is a configuration example of the collective call table 61 obtained for the example of FIG.
In the batch call table, an entry is created for each (batch) call of a batch function. Each entry includes a batch call number column 611, a generation source call set column 612, a batch function number column 613, an actual return number column 614, an actual return type column 615, an actual return number set column 616, and an actual argument set column 617. Composed.
[0028]
The batch call number column 611 stores the entry number of the batch call table.
In the generation call set column 612, a set of calls to be collectively called is stored in the entry number format of the call table 51.
In the batch function number column 613, a batch function called by the batch call is stored as an entry number (batch function function number) of the batch function table 71.
The actual return number column 614 and the actual return type column 615 store the return name and return type.
The actual return number column 616 and the actual argument set column 617 store the actual return number and actual argument information of the batch call in the form of the entry number of the batch actual parameter table 62.
[0029]
FIG. 12 is a configuration example of the collective actual parameter table 62 obtained for the example of FIG.
The collective actual parameter table is an auxiliary table for storing the call interface information of the collective call table 61. An entry is created for each actual return number or actual argument, and the entry number is stored in the actual return number column or actual argument set column of the batch call table 61. Each entry includes a collective actual parameter number column 621, a collective actual parameter position number column 622, a generation source actual parameter number column 623, a pre-conversion name column 624, and a post-conversion name column 625.
[0030]
The batch actual parameter number field 621 stores the entry number of the batch call actual parameter table.
In the collective actual parameter position number column 622, the appearance position of the actual parameter is indicated by a number.
In the generation call set column 623, calls to be batched are stored in the form of a set of entry numbers in the call table 51.
The pre-conversion name column 724 stores the actual parameter name in the generation source call, and the post-conversion name column 725 stores the (collective) actual parameter name after the batch call. The pre-conversion name field 724 and the post-conversion name field 725 are fields that are valid only for the actual return number (the actual argument is always blank).
[0031]
FIG. 13 is a configuration example of the collective function table 71 obtained for the example of FIG.
In the batch function table, an entry is created for each function that generates a batch function. Each function is given a unique batch function number, and each entry can be accessed by the batch function number. Each entry includes a batch function number column 711, a batch function name column 712, a batch coefficient factor column 713, a generation function number column 714, a provisional return type column 715, a provisional return number name column 716, a provisional return number set column 717, It consists of a dummy argument number set column 718.
[0032]
In the batch function number column 711, the entry number of the batch function table,
The collective function name column 712 stores the name of the collective function to be generated.
The batching coefficient column 713 stores the number of calls to be batched.
In the generation function number column 714, functions to be grouped are stored in the function number format of the function table 41.
The provisional return type column 715 and provisional return number name column 716 store the return type and return number name.
The temporary return number column 717 and the temporary argument set column 718 store the temporary return number and dummy argument information of the batch function in the form of the entry number of the batch temporary parameter table 72.
[0033]
FIG. 14 is a configuration example of the collective temporary parameter table 72 obtained for the example of FIG.
The batch temporary parameter table is an auxiliary table for storing the function interface information of the batch function table 71. An entry is created for each temporary return number or dummy argument, and the entry number is stored in the temporary return number set column or dummy argument set column of the batch function table 71. Each entry includes a batch temporary parameter number column 721, a generation source temporary parameter number column 722, a batch temporary parameter position number column 723, a repetition number column 724, a pre-conversion name column 725, and a post-conversion name column 726.
[0034]
The batch temporary parameter number field 721 stores the entry number of the batch temporary parameter table.
In the generation source temporary parameter number column 722, the corresponding temporary parameter of the generation source function is stored in the format of the temporary parameter number in the temporary parameter table 42.
In the batch temporary parameter position number column 723, the appearance position of the temporary parameter is indicated by a number. In the batch temporary parameter position number, the temporary return number and the dummy argument are counted independently, and the i-th temporary return number / the dummy argument is the position number i.
[0035]
The iteration number column 724 stores the number of the temporary function parameter at the time of batching. This repetition number starts counting from 0, and when the batching coefficient is n, numbers up to n-1 are given.
The pre-conversion name field 725 stores a temporary parameter name in the generation source function, and the post-conversion name field 726 stores a (collective) temporary parameter name after batch function conversion.
This is the end of the description of various table configurations used in this embodiment.
[0036]
Hereinafter, various processes for generating / referencing these tables will be described in order.
FIG. 15 is a flowchart of the function information table generation process 21. In the function information table generation process, the intermediate language 3 is sequentially scanned, and when a function definition is detected, a function information table 4 in which the function definition information is collected and registered is generated. The procedure will be described below.
[0037]
First, the function number fi and the temporary parameter number fpi are initialized with 0 (step 1501).
Next, it is determined whether or not there is an unprocessed function (step 1502), and the processing after step 1503 is performed until there is no unprocessed function.
If there is an unprocessed function f, first, a function entry F is generated, and the function number fi is incremented (step 1503). Thereafter, the function information of the function f is stored in the function entry F in step 1504 and subsequent steps.
[0038]
First, the definition line number of f is set to fl, and the function name is set to fn (step 1504). Next, in order to obtain a temporary parameter set FPS, the FPS is initialized with an empty set (step 1505). Then, it is determined whether or not there is an unprocessed temporary parameter (step 1506), and processing from step 1507 is performed until there is no unprocessed temporary parameter.
If there is an unprocessed temporary parameter fp, a temporary parameter entry FP is generated and the temporary parameter number fpi is incremented (step 1507).
[0039]
Next, the temporary parameter information of fp is stored in the temporary parameter table 42 (step 1508), and the entry number fpi is added to the temporary parameter set FPS (step 1509). Thereafter, step 1506 to step 1509 are repeatedly executed until there is no unprocessed temporary parameter. Step 1508 stores temporary parameter information, temporary parameter number, temporary parameter position, temporary parameter type, temporary parameter type, and temporary parameter name in the entry FP, but details are omitted (in the temporary parameter table 42 of FIG. 8). See description).
[0040]
When the determination in step 1506 becomes no, collection of the temporary parameter set FPS ends, so all function information is stored in F (step 1510), and the next unprocessed function determination (step 1502) is performed. The process ends when all functions have been processed.
[0041]
7 and 8 are examples of function information tables indicating the results of performing the above processing on the source program shown in FIG.
For example, the entry 417 stores information on the function f defined in lines 003 to 008 in FIG. Further, it can be seen from the provisional return number column that there is a provisional return number having the name r, the provisional parameter position 0 (= provisional return number), and the provisional parameter type int.
[0042]
FIG. 16 is a flowchart of the call information table generation process 22.
In the call information table generation process, the intermediate language 3 is sequentially scanned, and when a function call is detected, a call information table 5 that collects and registers the call information is generated. The procedure will be described below.
[0043]
First, the call number ci and the actual parameter number api are initialized with 0 (step 1601).
Next, it is determined whether or not there is an unprocessed function (step 1602), and the processing from step 1603 is performed until there is no unprocessed function.
If there is an unprocessed function f, first, the function entry is set to F (step 1603), and the call set CS is initialized with an empty set (step 1604).
Next, it is determined whether or not there is an unprocessed function call in f (step 1605), and if it exists, call information is registered in step 1606 and subsequent steps.
In registering the call information, first, a call entry C is generated, and the call number ci is incremented (step 1606).
Next, the call line number cl and the call function number fi are obtained (step 1607). The call function number fi can be easily obtained by searching the function information table 41 (if it does not exist in the function information table, it is left blank and excluded from the target of batching).
[0044]
Next, in order to obtain the actual parameter set APS, the APS is initialized with an empty set (step 1608). Then, it is determined whether or not there is an unprocessed actual parameter (step 1609), and processing from step 1610 is performed until there is no unprocessed actual parameter.
That is, if there is an unprocessed actual parameter ap, an actual parameter entry AP is generated, the actual parameter number api is incremented (step 1610), and the actual parameter information of ap is stored in the actual parameter table 52 (step 1611). The entry number api is added to the actual parameter set APS (step 1612). In step 1611, the actual parameter number, the actual parameter position number, and the actual parameter expression, which are actual parameter information, are stored in the entry AP, but the details are omitted (see the description of the actual parameter table 52 in FIG. 14).
[0045]
Since the collection of the actual parameter set APS ends when the determination in step 1609 becomes no, all call information is stored in C (step 1613). (The actual parameter set APS combines the actual return number and the actual argument. Stored in the actual return number field and the actual argument field of the call information table). Then, ci is added to CS (step 1614), and the next unprocessed call determination (step 1605) is performed. Since all calls in the function f have been processed when the determination in step 1605 becomes no, CS is stored in the call set column of F (step 1615), and the next unprocessed function is determined (step 1602). Proceed to The process ends when all functions have been processed.
[0046]
9 and 10 are examples of the call information table 5 indicating the result of performing the above processing on the example of the source program of FIG. For example, the entry 516 in FIG. 9 stores the call information of the function f on the 17th line. In this call, (p * p, q * q) is passed as an actual argument, and the return value is stored in the actual return number r1.
[0047]
FIG. 17 is a flowchart of the batch call information table generation process 23.
The batch call information table generation process sequentially scans the call information table 51 and, when a call that can be batch-called is detected, generates a batch call information table 6 and a batch function information table 7 that collect and register the batch call information. To do. The procedure will be described below.
[0048]
First, the batch call number aci, the batch actual parameter number aapi, the batch function number afi, and the batch actual parameter number afpi are initialized to 0 (step 1701). Next, it is determined whether or not there is an unprocessed function entry (step 1702), and the processes in and after step 1703 are performed until there is no unprocessed function entry.
If there is an unprocessed function entry F, first, the call set is set as CS (step 1703).
[0049]
Next, the process proceeds to extraction processing (from step 1704 to step 1711) of the same type call set SCS which is a set of calls of the same function from CS. This extraction process of the isomorphic call set SCS corresponds to step 230 in FIG.
First, CS is stored in the work set WCS, and the SCS is initialized with an empty set (step 1704).
Next, it is determined whether or not there is an unprocessed call entry C in the CS (step 1705). If there is a call entry C, the call number ci and call function number fi of C are obtained (step 1706). ci is excluded (step 1707).
Next, it is determined whether or not there is an unprocessed entry WC in the WCS (step 1708), and if it exists, the WC call number wci and call function number wfi are obtained (step 1709).
Then, it is checked whether the same function is called by determining whether fi and wfi match (step 1710). If the functions are the same, wci is added to the SCS (step 1711) and the process proceeds to step 1708.
[0050]
When the elements of all work call sets are determined in step 1708, the extraction of the isomorphic call set SCS is completed.
Here, it is determined whether or not the SCS has a plurality of elements (step 1712). Only when there are a plurality of SCSs, collective call information extraction processing (step 231; see FIG. 18 for details) is executed, and step 1705 is performed. Thereafter, the process proceeds to the next isomorphic call set SCS.
When all entries have been processed in step 1705, the process proceeds to step 1702 to process the next function entry.
When all the function entries have been processed, the batch call information table generation process ends.
[0051]
FIG. 18 is a flowchart of the batch call information extraction process 231. In the batch call information extraction process, a batch call set ASCS obtained by collecting calls that can be batch-called is extracted from the given same-type call set SCS, and a batch call information table 6 and a batch function information table 7 are generated. The procedure will be described below.
First, it is determined whether or not there is an unprocessed call entry SC in the SCS (step 1801), and if it exists, the C call number sci is obtained (step 1802).
Next, the SCS is stored in the work set WSCS, the ASCS is initialized with an empty set (step 1803), and then sci is removed from the WSCS (step 1804).
Next, it is determined whether or not there is an unprocessed entry WSC in the WSCS (step 1805), and if it exists, the call number wsci of the WSC is acquired (step 1806).
[0052]
Then, it is determined whether sci and wsci can be collectively called (step 1807). If yes, wsci is added to ASCS (step 1808). The possibility of batch call is determined based on whether the call of wsci can be moved to the position of the call line number of sci or the like, but the details are omitted because it is not directly related to the present invention.
Thereafter, the process proceeds to step 1805 to examine the next WSC element.
When the elements of all work call sets WSCS are determined in step 1805, the extraction of the batch call set ASCS is completed.
Here, it is determined whether or not the ASCS has a plurality of elements (step 1809), and the batch call information registration process (step 232, see FIG. 19 for details) is executed only when there are a plurality of elements.
Next, ASCS is removed from the SCS (step 1810), and the process proceeds to the processing of the elements of the next isomorphic call set SCS after step 1801.
When all entries have been processed in step 1801, the batch call information extraction processing is terminated.
[0053]
FIG. 19 is a flowchart of the batch call information registration process 232. In the batch call information registration process, the batch call information table 6 and the batch function information table 7 are generated from the given batch call set ASCS. The procedure will be described below.
First, a batch call entry AC is created, aci is incremented (step 1901), and then ASCS is stored in the AC generation source call set column (step 1902).
Next, the call function number (of any element) of ASCS is ci and the number of elements is afk (step 1903).
Then, a type art as a variable is generated from the ASCS (step 1904), and stored in the return type column of the AC (step 1905). Although art is an array type in this embodiment, any type may be used as long as it can represent all the return numbers of the generation call set. Since the generation method is not essential to the present invention, details are omitted.
[0054]
Next, a unique name acn is generated and stored in the AC actual return name column (step 1906), and then a batch call actual argument registration process (step 233, details are shown in FIG. 20), a batch call actual return number registration process ( Step 234. Details are performed as shown in FIG.
[0055]
Next, registration to the batch function information table is performed.
First, a batch function entry AF is created, and afi is incremented (step 1911).
Next, art is stored in the AF return type column (step 1912).
Next, a unique name afn is generated and stored in the AF temporary return name column (step 1913), and then afk is stored in the batch coefficient column (step 1914).
Finally, a batch function dummy argument registration process (step 235, details are shown in FIG. 22) and a batch function temporary return number registration process (step 236, details are shown in FIG. 23) are performed to complete registration in the batch function information table.
Since the registration into the batch call information table and the batch function information table is completed, the batch call information registration process is terminated.
[0056]
FIG. 20 is a flowchart of the collective call actual return number registration process 233. In the batch call actual return number registration process, the batch actual parameter table 62 is generated from the source call set in each entry of the batch call table and registered in the batch call actual return number set column 616 of the batch call table 61. The procedure will be described below.
First, the batch call actual return number set AARS is initialized with an empty set (step 2001). The batch call actual return number position number aapr is also initialized to 0 (step 2002).
[0057]
Then, the generation call source set of AC is set as ACS (step 2003), and it is determined whether there is an unprocessed call C in the ACS (step 2004). If there is, the steps 2005 to 2013 are repeatedly executed.
In this iterative process, an actual return number set of C is set to ARS (step 2005), it is determined whether there is an unprocessed actual return number AR in the ARS (step 2006), and if there is, step 2007 to step 2012 are performed. Run repeatedly.
In this iterative process, first, aapr is incremented (step 2007).
Next, the actual parameter number and name of AR are set to ari and arn, respectively (step 2008).
Next, a collective call actual return number name aarn is generated (step 2009).
Next, a batch call actual parameter entry AAP is generated, and aapi is incremented (step 2010).
[0058]
Next, the batch call actual return number information is registered in the AAP (step 2011), and aari is added to the AARS (step 2012).
When the result of determination in step 2006 is no, the processing of all the actual return numbers is completed. Therefore, after the AARS is stored in the AC collective call actual return number set column (step 2013), the next ACS process is performed.
When all the call entries of ACS have been processed, the batch call actual return number registration process 233 ends.
[0059]
FIG. 21 is a flowchart of the batch call actual argument registration process 234. In the batch call actual argument registration process, the batch actual parameter table 62 is generated from the source call set in each entry of the batch call table and registered in the batch call actual argument set column 617 of the batch call table 61. The procedure will be described below.
[0060]
First, the batch call actual argument set AAAS is initialized with an empty set (step 2101). The batch call actual argument position number aapa is also initialized with 0 (step 2102).
Then, the generation call source set of AC is set as ACS (step 2103), and it is determined whether there is an unprocessed call C in the ACS (step 2104). If there is, the steps 2105 to 2113 are repeatedly executed.
In this iterative process, the actual argument set of C is set to APS (step 2105), it is determined whether there is an unprocessed actual argument AP in the APS (step 2106), and if there is, the process from step 2107 to step 2112 is repeated. Execute.
In this iterative process, first, aapa is incremented (step 2107).
[0061]
Next, the actual parameter number and name of the AP are set to api and apn, respectively (step 2108).
Next, a batch call actual argument name aapn is generated (step 2109).
Next, a batch call actual parameter entry AAP is generated, and aapi is incremented (step 2110).
Next, the batch call actual argument information is registered in the AAP (step 2111), and then aapi is added to the AAAS (step 2112).
When the determination in step 2106 is no, processing of all actual arguments is completed, so AAAS is stored in the AC batch call actual return number column (step 2013), and then the next ACS processing is performed.
When all the ACS call entries have been processed, the batch call actual argument registration process 234 ends.
[0062]
11 and 12 are examples of the collective call information table 6 that shows the results of performing the processes of FIGS. 20 and 21 with the call information table 5 shown in FIGS. 9 and 10 as an input.
For example, entry 618 in FIG. 11 indicates that calls of entry 516 and entry 517 of call table 51 shown in FIG. 9 can be collectively called. In this batch call, it is understood that (p * p, q * q, p * q, q * p) is passed as an actual argument, and the return value is stored in the actual return number arf that is an array of type int. .
[0063]
FIG. 22 is a flowchart of the batch function temporary return number registration process 235. In the batch function temporary return number registration process, the batch temporary parameter table 72 is generated from the source function number and the batching coefficient in each entry of the batch function table, and the batch function temporary return number set column 717 of the batch function table 71 is entered. register. The procedure will be described below.
First, the batch function temporary return number set AFRS is initialized with an empty set (step 2201). The batch function temporary return position number afpr is also initialized to 0 (step 2202).
Next, the AF batching coefficient is afk, and the AF source function entry is F (step 2203. Both have already been calculated in the batch call regular registration process).
Further, after initializing the number of iterations i to 0 (step 2204), it is determined whether i is equal to afk (step 2205), so that steps i2206 to 2212 are repeatedly executed until i becomes equal to afk. .
In this iterative process, first, i is incremented (step 2206).
Next, afpr is incremented (step 2207).
[0064]
Next, the temporary parameter number and name of AF are set to fpi and fpn, respectively (step 2208).
Next, a batch function temporary return name apfn is generated (step 2209).
Next, a batch function formal parameter entry AFP is generated, and afpi is incremented (step 2210).
Next, the batch function temporary return number information is registered in the AFP (step 2211), and afpi is added to the AFRS (step 2212).
When the determination in step 2205 is no, AFRS is stored in the AF batch function provisional number set column (step 2213), and the batch function provisional number registration process 235 ends.
[0065]
FIG. 23 is a flowchart of the batch function dummy argument registration process 236. In the batch function dummy argument registration process, a batch temporary parameter table 72 is generated from the source function number and batching coefficient in each entry of the batch function table and registered in the batch function dummy argument set column 718 of the batch function table 71. . The procedure will be described below.
First, the batch function dummy argument set AFAS is initialized with an empty set (step 2301). The batch function dummy argument position number afpa is also initialized to 0 (step 2302).
Next, the AF batching coefficient is afk, and the AF source function entry is F (step 2303. Both have already been calculated in the batch call regular registration process).
[0066]
Further, after initializing the number of iterations i to 0 (step 2304), it is determined whether i is equal to afk (step 2305), so that steps 2306 to 2315 are repeatedly executed until i becomes equal to afk. .
In this iterative process, first, i is incremented (step 2206). Next, the F parameter set of F is set to FAS (step 2307), and it is determined whether there is an unprocessed dummy parameter FA in the FAS (step 2308). If there is, the steps 2309 to 2314 are repeatedly executed. .
In this iterative process, first, afpa is incremented (step 2309).
Next, the temporary parameter number and name of AF are set to fpi and fpn, respectively (step 2310).
[0067]
Next, a collective function formal argument name apfn is generated (step 2311).
Next, a batch function formal parameter entry AFP is generated, and afpi is incremented (step 2312).
Next, the batch function dummy argument information is registered in the AFP (step 2313), and afpi is added to the AFAS (step 2314).
When the determination in step 2308 results in no, the processing of all dummy arguments is completed, so AFAS is stored in the AF batch function dummy argument set column (step 2315).
Perform the next iteration.
The batch function formal argument registration process 236 ends when the batch coefficient is repeatedly executed until i and afk match.
[0068]
FIGS. 13 and 14 are examples of the collective function information table 7 showing the results of performing the processes of FIGS. 17 to 23 with the function information table 4 and the call information table 5 shown in FIGS. 7 to 10 as inputs. It has become.
For example, the entry 719 in FIG. 13 indicates that the entry 417 in the function table 41 shown in FIG. This batch function is passed (x0, y0, x1, y1) as a dummy argument, and returns a temporary return number frf that is an array of type int.
This is the end of the description of the batch call information table generation process 23.
After the batch call information table generation process 23, a batch function generation process 24 and a batch call conversion process 25 are performed. This will be described in order below.
[0069]
FIG. 24 is a flowchart of the batch function generation process 24.
The batch function generation process generates a batch function in the intermediate language 3 from each entry of the batch function information table 7. The procedure will be described below.
It is checked whether or not there is an unprocessed batch function entry AF (step 2401), and the processing from step 2402 is performed on all existing batch function entries.
First, it is assumed that the AF generation function number is fi, and the AF batch coefficient is afk (step 2402).
Next, a batch function provisional number definition is generated from the AF provisional number set AFRS (step 2403).
Next, a batch function dummy argument definition is generated from the AF dummy argument set AFAS (step 2404).
Thus, generation of the batch function call interface is completed. Next, in step 2406 to step 2413, a batch function body is generated.
[0070]
First, after the definition section iteration count m is initialized to 0 (step 2405), the definition section FB of fi is copied (step 2406).
Next, the temporary parameters in the copied area FB are converted.
First, the union set of the AF temporary return number set AFRS and the dummy argument set AFAS is set as a batch function temporary parameter set AFPS (step 2407).
It is determined whether or not there is an unprocessed temporary parameter entry AFP in the AFPS (step 2408). If it exists, the temporary parameter is converted in step 2409 and subsequent steps.
[0071]
First, the pre-conversion name of AFP is bcn, and the post-conversion name is acn (step 2409).
Next, after the name bcn appearing in the FB is converted to acn (step 2410), the process proceeds to step 2408 to convert the next formal parameter.
When all the AFPS elements have been processed, m is incremented (step 2411), and it is determined whether or not it matches the batch coefficient afk (step 2412).
If not, the process proceeds to step 2406 and is repeatedly executed. If m matches afk, a batch function temporary return is generated using the AF temporary return name (step 2413). Perform batch function entry processing.
When all the batch function entry processes have been completed, the batch function generation process ends.
[0072]
FIG. 6 is an example of an intermediate language (in the source program format) showing the result of performing the processing of FIG. 24 using the collective function information table 7 shown in FIGS. 13 and 14 as an input. The batch function indicated by the entry 719 in FIG. 13 is generated from the 30th line to the 36th line in FIG. Line 32 generates a batch function provisional return number definition, and line 35 generates a batch function provisional return number.
[0073]
FIG. 25 is a flowchart of the batch call conversion process 25.
The batch call conversion processing uses the batch call information table 6 to convert batchable isomorphic calls in the intermediate language 3 into batch calls. The procedure will be described below.
It is checked whether or not there is an unprocessed batch call entry AC (step 2501), and the processing from step 2502 is performed on all existing batch call entries.
First, a batch call actual return number definition is generated from the AC actual return number name (step 2502).
Next, it is assumed that the generation call set of AC is CS and the actual return number set of AC is AAPS (step 2503).
Next, an arbitrary element C of CS is taken out and the call function number fi is acquired (step 2504).
[0074]
Next, the batch call generation position acp is determined from the call line number of C (step 2505). The collective call generation position may be anywhere as long as it is guaranteed by the determination of the possibility of collective call. In this embodiment, for convenience, it is set immediately before the first call position.
Next, a batch call is created from the AC and generated in an acp (step 2506). The batch call can be easily created by the batch call actual return number name, the batch function number, and the batch call actual argument set, and the details are omitted.
Next, it is determined whether or not there is an unprocessed call entry C in the CS (step 2507), and if it exists, the call is deleted by looking at the call line number of C (step 2508).
[0075]
When all calls of CS are deleted, it is determined whether or not there is an unprocessed batch call actual argument entry AAR in AARS (step 2509). If there is, conversion of the actual return number is performed in steps 2510 to 2511. I do.
The name of AAR before conversion is bcn, the name after conversion is acn (step 2510), and bcn that appears in the function body area of fi is converted to acn (step 2511).
When the determination in step 2509 is no, the conversion of all the batch call actual return numbers is completed, and the conversion process for AC is completed.
Thereafter, the process proceeds to step 2501 to perform batch call conversion processing of the next batch call entry.
When all the batch call entries have been processed, the batch call conversion process ends.
[0076]
FIG. 6 is an example of an intermediate language (in the source program format) showing the result of performing the processing of FIG. 25 using the collective call information table 6 shown in FIGS. 13 and 14 as an input.
The two function calls of lines 17 and 21 in FIG. 5 are converted into a batch call of 16 lines in FIG. In line 14, a batch call actual return number definition is generated.
According to this embodiment, it is possible to reduce the call overhead by reducing the number of function calls, which contributes to the improvement of the processing performance of the object program.
[0077]
FIG. 26 shows the processing procedure of the compiler according to another embodiment of the present invention. FIG. 26 shows a compiler processing procedure common to the second embodiment in which a batch function is called during loop expansion and the third embodiment in which a batch function is called during loop parallelization.
Since the syntax analysis 201 and the code generation process 203 are the same as those in the figure, description thereof is omitted.
Loop expansion or loop parallelization and batch function call processing 205 receives 3 after the middle, and performs loop expansion / loop parallelization that repeats serial processing in parallel / loop parallel processing, and intermediate language 3 subjected to batch function call conversion is obtained. Output. The case of loop expansion is the second embodiment, and the case of loop parallelization is the third embodiment. Since most of the processing and the table configuration are the same as in the first embodiment, only the outline will be described below.
[0078]
FIG. 27 is a flowchart showing the processing procedure of step 205 in FIG. 26 when the batch function is called at the time of loop expansion according to the second embodiment of the present invention. This will be described in order below.
First, the function information table generation process 2701 extracts various information of the function definition in the source program, and creates the function information table 4.
Next, it is determined whether or not an unprocessed loop L exists (step 2702). If there is an unprocessed loop L, steps 2703 to 2710 are repeatedly executed.
In this iterative process, it is first determined whether or not L can be loop-expanded (step 2704), and if it exists, step 2705 to step 2710 are repeatedly executed.
[0079]
In this iterative process, L is first loop expanded n times (step 2704).
Next, call information table generation processing (step 2705) is performed, information related to function calls is extracted, and the call information table 5 is generated.
Next, n calls of the same function generated by loop expansion from L are defined as an isomorphic call set SCS (step 2706).
Next, a batch call extraction process (step 2707) for extracting calls that can be batch-called from the same type call set SCS is performed (step 2707), and the extracted batch calls are registered in the batch call information table (step 2708).
When the answer is no in step 2702, the processing is completed for all the loops, batch function generation processing (step 2709) and batch call conversion processing (step 2710) are performed, and the processing ends.
[0080]
The correspondence between the steps of FIG. 27 and the steps of FIG. 3 is shown below. The processing content of each step below is the same processing as the step of FIG. 3 shown in parentheses.
Step 2701 (Step 21 in FIG. 3)
Step 2705 (Step 22 in FIG. 3)
Step 2709 (Step 231 in FIG. 3)
Step 2710 (Step 232 in FIG. 3)
Step 2711 (Step 24 in FIG. 3)
Step 2712 (Step 25 in FIG. 3)
[0081]
FIG. 28 shows an example of a source program for explaining the second and third embodiments of the present invention. Also in this embodiment, it is described in the form of an intermediate language 3 source program. Therefore, FIG. 28 is an example of a source program and also an example of the intermediate language 31 (before the collective call). In the program example of FIG. 28, there is a loop that calls the function f on the 16th to 22nd lines of the function g.
[0082]
FIG. 29 is a representation of the intermediate language 3 at the stage of loop expansion in step 2704 of FIG. 27 in a source program format. A loop including function calls on lines 16 to 22 in FIG. 28 is expanded from lines 16 to 22 in FIG.
[0083]
FIG. 30 represents the intermediate language 3 as a result of performing the processing of FIG. 26 with the source program of FIG. 27 as an input, in the source program format. In FIG. 29, the call of the function f during the loop expansion from the 16th line to the 22nd line is made into a batch function call.
According to this embodiment, since the batch function call is performed at the time of loop expansion, efficient batch function call according to the number of loop expansions can be performed.
[0084]
FIG. 31 is a flowchart showing the processing procedure of step 205 in FIG. 26 in the case of performing a batch function call during loop parallelization according to the third embodiment of the present invention. Since this processing procedure is almost the same as that in FIG. 27, a description thereof will be omitted (steps 3104 and 3106 are slightly different).
[0085]
FIG. 32 shows the intermediate language 3 at the stage where the loop is parallelized in step 2704 of FIG. 30 in a source program format. The notation “x || y;” on lines 18 and 19 in FIG. 32 indicates that x and y are executed in parallel. It can be seen that the loop including the function call from the 16th line to the 22nd line in FIG. 28 is loop-parallelized from the 16th line to the 21st line in FIG.
[0086]
FIG. 33 represents the intermediate language 3 as a result of performing the processing of FIG. 30 with the source program of FIG. 27 as an input, in a source program format. It can be seen that the call to the function f during the loop parallelization from the 16th line to the 22nd line in FIG. 31 is a batch function call.
According to the present embodiment, since the batch function call is performed at the time of loop parallelization, efficient batch function call according to the parallel degree of the loop can be performed.
[0087]
【The invention's effect】
According to the present invention, the compiler or the source conversion program can convert a call of the same function into a call of a batch function. In other words, a means for reducing call overhead that does not depend on inline expansion is provided. ,
As a result, even when inline expansion cannot be applied, it is possible to reduce the call overhead and contribute to the improvement of the processing performance of the object program.
[Brief description of the drawings]
FIG. 1 is a configuration diagram of a computer system in which a compiler that implements the present invention operates.
FIG. 2 is a flowchart showing a processing procedure of a compiler implementing the present invention.
FIG. 3 is a flowchart of batch function call processing 2 of a compiler.
FIG. 4 is a processing configuration diagram of compiler batch function call processing 2 according to the present invention;
FIG. 5 is an example of a source program (a configuration example of the intermediate language 31 before calling a batch function in a source program format).
FIG. 6 is a configuration example of the intermediate language 32 after the batch function call in the source program format.
7 is a configuration example of a function table 41. FIG.
8 is a configuration example of a temporary parameter table 42. FIG.
9 is a configuration example of a call table 51. FIG.
10 is a configuration example of an actual parameter table 52. FIG.
11 is a configuration example of a collective call table 61. FIG.
12 is a configuration example of a collective actual parameter table 62. FIG.
13 is a configuration example of a batch function table 71. FIG.
14 is a configuration example of a collective temporary parameter table 72. FIG.
15 is a flowchart of function information table generation processing 21. FIG.
16 is a flowchart of a call information table generation process 22. FIG.
FIG. 17 is a flowchart of batch call information table generation processing 23;
FIG. 18 is a flowchart of batch call information extraction processing 231;
FIG. 19 is a flowchart of batch call information registration processing 232;
FIG. 20 is a flowchart of batch call actual return number registration processing 233;
FIG. 21 is a flowchart of batch call actual argument registration processing 234;
FIG. 22 is a flowchart of a batch function temporary return number registration process 235;
FIG. 23 is a flowchart of a batch function formal argument registration process 234;
24 is a flowchart of a batch function generation process 24. FIG.
FIG. 25 is a flowchart of batch call conversion processing 25;
FIG. 26 shows the processing procedure of the compilers of the second and third embodiments.
FIG. 27 is a flowchart of a loop expansion batch function calling process;
FIG. 28 is an example of a source program for explaining the second and third embodiments.
FIG. 29 is a configuration example of an intermediate language source program format after loop expansion;
FIG. 30 is a configuration example of the intermediate language source program format after the loop expansion batch function call conversion.
FIG. 31 is a flowchart of batch function call processing for loop parallelization;
FIG. 32 is a configuration example of the intermediate language 3 after the loop expansion in the source program format;
FIG. 33 is a configuration example of the intermediate language in a source program format after the conversion into a batch function for loop parallelization conversion.

Claims

A function call method for analyzing a relationship between a function and a function call to generate function information and function call information, and performing a program conversion process using the function information and the function call information,
A plurality of instruction sequences constituting the a call, collective call-information generating step of generating information for an instruction sequence which constitutes a call equivalent single function of the same function,
From the top Symbol collective call-information, and collectively function generation step of generating an instruction sequence that constitutes the bulk function of performing an instruction sequence composing the multiple execution of the same function at a time,
A collective call conversion step for converting from the collective call information into an instruction sequence constituting a plurality of calls of the same function into an instruction sequence constituting a single collective function call; call Shutsuka way.

In the batch function calling method according to claim 1,
The collective call information generation step recognizes an isomorphic function call that is a call of an instruction sequence that constitutes the same function using the function call information, and generates an isomorphic call information step; and
A batch function call method comprising: a batch call selection step for selecting a call that can be converted into an instruction sequence constituting a batch function call from the same type call generated by the same type call recognition step.

In the batch function calling method according to claim 2,
A batch call determination step for determining whether or not the plurality of isomorphic calls can be converted into an instruction sequence constituting a batch function call;
And a collective call information registration step for generating collective call information of a group of functions determined to be convertible into an instruction sequence constituting a collective function call by the collective call determination step. Batch function call method.

In the batch function calling method according to claim 2,
The isomorphic call recognition step is a loop expansion isomorphic call recognition step that recognizes a call of the same function generated at the time of loop expansion and generates isomorphic call information at the time of loop expansion.
The batch call selection step generates batch call information in which a call that can be converted into a call of an instruction sequence constituting a batch function is selected from the loop expansion type isomorphic call information generated by the loop expansion type call recognition step. A collective function calling method characterized by

The collective function calling method according to claim 4 ,
A collective function calling method, wherein the collective coefficient determined in the collective call information generating step is a loop expansion number.

In the batch function calling method according to claim 2,
The isomorphic call recognition step is a loop parallelization isomorphic call recognition step for recognizing a call of the same function generated at the time of loop parallelization and generating isomorphic call information at the time of loop parallelization.
The batch call selection step generates batch call information by selecting a call that can be converted into a call to an instruction sequence constituting a batch function from the loop parallel type isomorphic call information generated by the loop parallelization isomorphic call recognition step. A collective function calling method characterized by:

In the batch function calling method according to claim 6 ,
A collective function calling method, wherein the collective coefficient determined in the collective call information generating step is the number of loop parallelizations.