JP3619861B2

JP3619861B2 - Pipeline information output method, output device therefor, and computer-readable recording medium

Info

Publication number: JP3619861B2
Application number: JP12397099A
Authority: JP
Inventors: 延佳山地; 正樹青木; 清文鈴木; 浩二高原; 豊山中; 治道小泉; 政人森島; 賢一山本; 恭伸谷村; 明日下部; 直史杉本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 1999-04-30
Filing date: 1999-04-30
Publication date: 2005-02-16
Anticipated expiration: 2019-04-30
Also published as: JP2000315161A

Description

【０００１】
【発明の属する技術分野】
本発明は、パイプライン情報の出力方法などに関し、特にソ−スプログラムをコンバイラやアセンブラで翻訳したときのパイプライン部分についての命令群を出力することにより、利用者が、パイプライン化の状況やパイプライン処理で用いられる各命令の種別などをあらかじめ把握した上でソ−スプログラムのチュ−ニングを行なえるようにしたものである。なお、本明細書では必要に応じて「ソ−スプログラム」を単に「プログラム」と記す。
【０００２】
一般に、プログラムの実行性能を高めるためには、利用者がソ−スプログラムの翻訳内容を確認しながら当該プログラムのチュ−ニング作業をより効率的に行なっていくことが望ましく、本発明はソ−スプログラムのパイプライン処理の対象部分についてもこの要請に応えられるようにしたものである。
【０００３】
【従来の技術】
従来、利用者は、ソ−スプログラム翻訳時の最適化やベクトル化などの状況を示す出力情報を確認した上でハ−ドウェアの特性を考慮しながら、ベクトル演算などのようなパイプライン処理部分を含むソ−スプログラムのチュ−ニングをおこなっていた。
【０００４】
すなわち、チュ−ニングの実行主体の利用者側には、パイプライン処理で用いられる命令の種別・順序や命令数、またパイプライン内での演算数などを示すパイプライン情報が提供されていなかった。
【０００５】
【発明が解決しようとする課題】
このように、従来のプログラムチュ−ニングは、パイプライン情報をその判断材料としていないので、作業効率が悪いという問題点があった。
【０００６】
そこで、本発明では、ソ−スプログラム翻訳時のパイプライン情報を求めて出力し、利用者がこの出力内容を参照できるようにして、プログラムチュ−ニングの効率化を図り、また、オブジェクトプログラムの実行性能を高めることを目的とする。
【０００７】
【課題を解決するための手段】
本発明はこの課題を次のようにして解決する。
パイプライン処理の対象部分を含むソースプログラムの翻訳に際し、コンピュータを用いて、前記対象部分の翻訳により生成される命令群についてパイプライン処理で用いられる命令の種別、順序、命令数を示すパイプライン情報を求め、命令の種別ごとに時系列に前記パイプライン情報が示す処理手順がわかる画面を表示する。
【０００８】
本発明においては、ソ−スプログラム翻訳時のパイプライン処理の命令群に関するパイプライン情報を出力するので、利用者はこの出力内容を参考にしてプログラムチュ−ニングを効率的に実行できる。
【０００９】
本発明は、このような解決手段からなるパイプライン情報の出力方法や出力装置、および、パイプライン情報の出力処理をコンピュ−タに実現させるためのプログラムを格納したコンピュ−タ読み取り可能なプログラム記憶媒体、を対象としている。
【００１０】
【発明の実施の形態】
図１乃至図８を参照して本発明の実施の形態を説明する。
【００１１】
図１はソ−スプログラムと、それに対応のパイプライン情報の画面表示例とを示す説明図であり、１はソ−スプログラム，２はソ−スプログラム１をコンパイルしたときに画面表示されるパイプライン情報を示している。
【００１２】
図２は図１のソ−スプログラムに対するパイプライン処理の概念を示す説明図であり、１１はメモリ，１２はロ−ド／ストアパイプライン（Ｌ／ＳＴパイプライン），１３ベクトルレジスタ，１４は乗算・加算パイプライン（Ｍ＆Ａパイプライン），１５は除算パイプライン，１６は加算パイプラインをそれぞれ示している。
【００１３】
ソ−スプログラム１は、配列ａ，ｂ，ｃ，ｄ，ｅ，ｆについての、
ａ（ｉ）＝（ｂ（ｉ）＋ｃ（ｉ）×ｄ（ｉ））÷ｅ（ｉ）＋ｆ（ｉ）
の計１０２４回のル−プ演算からなっている。
【００１４】
パイプライン情報２は、
・ベクトルロ−ド命令Ｌ，ベクトルストア命令Ｓ（Ｌ／ＳＴパイプライン）
・ベクトル乗算命令Ｍ，ベクトル加算命令Ａ（Ｍ＆Ａパイプライン）
・ベクトル除算命令（ＤＩＶパイプライン）
・ベクトルマスク命令（ＭＡＳＫパイプライン）
からなっている。
【００１５】
パイプライン情報２が示す処理手順は、
（１）配列ｂをメモリ１１からロ−ド／ストアパイプライン１２経由でベクトルレジスタ１３にロ−ドし、
（２）配列ｃをメモリ１１からロ−ド／ストアパイプライン１２経由でベクトルレジスタ１３にロ−ドし、
（３）配列ｄをメモリ１１からロ−ド／ストアパイプライン１２経由でベクトルレジスタ１３にロ−ドし、
（４）乗算・加算パイプライン１４で、ベクトルレジスタ１３の配列ｃと配列ｄとを乗算し、
（５）乗算・加算パイプライン１４で、ベクトルレジスタ１３の配列ｂと（４）の乗算結果とを加算し、
（６）配列ｅをメモリ１１からロ−ド／ストアパイプライン１２経由でベクトルレジスタ１３にロ−ドし、
（７）配列ｆをメモリ１１からロ−ド／ストアパイプライン１２経由でベクトルレジスタ１３にロ−ドし、
（８）除算パイプライン１５で、（５）の加算結果を配列ｅで除算し、
（９）加算パイプライン１６で、配列ｆと（８）の結果とを加算し、
（１０）ベクトルレジスタ１３の配列ａに（９）の加算結果をストアする、
となっている。
【００１６】
このとき、
・ロ−ド／ストアパイプライン１２への上記（３）のベクトルロ−ド命令の割り付け
・乗算・加算パイプライン１４への上記（４）のベクトル乗算命令の割り付け
が同じタイミングで行われる。
【００１７】
また、
・ロ−ド／ストアパイプライン１２への上記（７）のベクトルロ−ド命令の割り付け
・除算パイプライン１５への上記（８）のベクトル除算命令の割り付け
・加算パイプライン１６への上記（９）のベクトル加算命令の割り付け
が同じタイミングで行われる。
【００１８】
もっとも同じタイミングでベクトル命令が割りつけられた各パイプラインに対応のハ−ドウェア（演算回路）は、そこでの処理に必要なデ−タを受け取った上で命令内容を実行するように制御される。
【００１９】
例えば加算パイプライン１６に対応の演算回路は、除算パイプライン１５の出力と、上記（７）のベクトルロ−ド命令でベクトルレジスタ１３にロ−ドされた配列ｆとを受け取った上で、両者の加算処理を実行する。
【００２０】
図１のパイプライン情報２から利用者が確認できるのは、
・乗算命令と加算命令が個々に存在し、乗算と加算に関する複合命令Ｕが生成されていないこと
・全体の命令数は１０個（ロ−ド命令が５，ストア命令が１，乗算命令が１，加算命令が２，除算命令が１）なのでパイプライン内の演算が少ないこと
などである。
【００２１】
以上の確認内容に基づいて以下のようなチュ−ニング項目を抽出できる。
▲１▼複合命令Ｕを使用するオプションをコマンドプロセッサに指定する。
▲２▼複合命令Ｕを使用する最適化制御行をソ−スプログラムに付加する。
▲３▼ル−プ演算の回数が半分となるソ−スプログラム内容に修正して、パイプライン内の演算数を倍にする。
【００２２】
上記▲１▼および▲２▼のチュ−ニング項目はコンパイラの最適化機能を利用するものである。また、上記▲１▼のチュ−ニング項目はソ−スプログラムの修正を対象とせず、上記▲２▼および▲３▼のチュ−ニング項目はそれぞれソ−スプログラムの修正を対象としている。
【００２３】
図３は、複合命令Ｕを使用する最適化制御行を図１のＤＯル−プに指定したソ−スプログラム３を示している。上記▲２▼のチュ−ニング項目の場合に相当し、最適化制御行は「！ｏｃｌｆｍａｄｄ」である。
【００２４】
図４は、上記▲１▼や▲２▼のチュ−ニング項目に対処した場合のパイプライン情報４を示している。
【００２５】
図４のパイプライン情報４は、
・ソ−スプログラム１の翻訳時に、ベクトル乗算命令Ｍとベクトル加算命令Ａとの複合命令Ｕを用い、
・その結果、図１のパイプライン情報に比べてベクトル命令数が一つ少ないこと、すなわちベクトル演算処理が全体で１命令分だけ早く終了すること、
を示している。
【００２６】
このとき、
・ロ−ド／ストアパイプライン１２へのベクトルロ−ド命令（配列ｄ）の割り付け
・乗算・加算パイプライン１４への複合命令Ｕの割り付け
が同じタイミングで行われる。
【００２７】
また、
・ロ−ド／ストアパイプライン１２へのベクトルロ−ド命令（配列ｆ）の割り付け
・除算パイプライン１５へのベクトル除算命令の割り付け
・加算パイプライン１６へのベクトル加算命令の割り付け
が同じタイミングで行われる。
【００２８】
パイプライン情報４が示す処理手順は上記（１）〜（１０）と同様であり、また、複数のベクトル命令が同一のタイミングで割り付けられたパイプライン間の関連動作も図１の場合と同様に制御される。
【００２９】
図５は、図１のＤＯル−プの繰り返し回数を半分にしたソ−スプログラム５とそれに対応のパイプライン情報６の画面表示例とを示している。上記▲３▼のチュ−ニング項目の場合に相当する。
【００３０】
図５のパイプライン情報６が示す処理手順は、
・先ずソ−スプログラム５の前半の式に関する上記（１）〜（１０）の処理
・次にソ−スプログラム５の後半の式に関する上記（１）〜（１０）の処理
からなっている。
【００３１】
図５のパイプライン情報６は、図１のそれに比べて、
・ＤＯル−プの繰り返し回数が半分の「５１２」であり、
・ベクトル命令数が２倍の「２０」であること、
を示している。
【００３２】
この上記▲３▼のチュ−ニング項目の場合は、ＤＯル−プの繰り返し回数を減らして１命令当たりの処理時間を短くし、また命令数が増えてパイプライン内の命令スケジュ−リングが促進されることにより、プログラムの実行性能の向上を図っている。
【００３３】
図６は、コンパイラのシステム構成を示す説明図であり、
２１はソ−スプログラム
２２はコンパイラ
２３はソ−スプログラムの入力部
２４は入力されたソ−スプログラムの字句解析，構文解析や意味解析などを実行する解析部
２５はコンピュ−タの内部表現で表した中間言語を生成する中間言語生成部
２６は例えば演算の冗長性をなくすなどして処理効率の向上化を図るための変更を加えるえる最適化部
２７はパイプライン情報の作成機能を持つパイプラインスケジュ−リング部
２８はコ−ド生成部
２９はパイプライン情報の出力機能を有するリスタ部
３０はバッファなどの記憶手段
３１はコンパイラ２２の出力であるオブジェクトプログラム
３２はパイプライン情報の表示手段
をそれぞれ示している。なお、コンパイラ２２に相当の処理部と表示手段３２との距離は任意である。
【００３４】
以上の構成要素の中、
・パイプラインスケジューリング部２７が「前記対象部分の翻訳により生成される命令群についてパイプライン処理で用いられる命令の種別、順序、命令数を示すパイプライン情報を求める手段」に相当し、
・表示手段３２やリスタ部２９が「命令の種別ごとに時系列に前記パイプライン情報が示す処理手順がわかる画面を表示する出力手段」に相当する。
【００３５】
図７は、パイプライン情報の出力手順を示す説明図であり、その内容は次のようになっている。
（Ｓ１）入力部２３は、利用者がパイプライン情報の出力オプションをコマンドプロセッサに指定しているかどうかを判断し、「ＹＥＳ」の場合は次のステップに進み、「ＮＯ」の場合は出力処理を終了する。
（Ｓ２）パイプラインスケジュ−リング部２７は、入力されたソ−スプログラム２１がベクトル演算を対象としているかどうかを判断し、「ＹＥＳ」の場合は次のステップに進み、「ＮＯ」の場合は出力処理を終了する。
（Ｓ３）パイプラインスケジュ−リング部２７は、ソ−スプログラム２１のベクトル演算部分についてのプロセッサ情報（例えば解析後の命令の並び）を記憶手段３０に保存して、次のステップに進む。
（Ｓ４）リスタ部２９は、記憶手段３０にプロセッサ情報が保存されているかどうかを判断し、「ＹＥＳ」の場合は次のステップに進み、「ＮＯ」の場合は出力処理を終了する。
（Ｓ５）リスタ部２９は、記憶手段３０にプロセッサ情報を出力して表示手段３２に画面表示し、出力処理を終了する。
【００３６】
なお、ステップ（Ｓ４）で、記憶手段３０にプロセッサ情報が保存されているかどうかを確認するのは、リスタ部２９の動作内容に基づいている。
【００３７】
例えば、リスタ部２９はコンパイル処理の中断後に再走行する機能を持っており、この再走行の際にパイプライン情報が記憶手段３０に保存されていないときの、当該記憶手段に対するリスタ部２９の処理を効率的に行なうためである。
【００３８】
図８は、コンピュ−タ読み取り可能な記録媒体からプログラムを読み取って実行するコンピュ−タシステムの概要を示す説明図であり、４はコンピュ−タシステム、４１はＣＰＵやディスクドライブ装置などを内蔵した本体部、４２は本体部４１からの指示により画像を表示するディスプレイ、４３は表示画面、４４はコンピュ−タシステム４に種々の情報を入力するためのキ−ボ−ド、４５は表示画面４３の任意の位置を指定するマウス、４６は外部のデ−タベ−ス（ＤＡＳＤなどの回線先メモリ）、４７は外部のデ−タベ−ス４６にアクセスするモデム、４８はＣＤ−ＲＯＭやフロッピ−ディスクなどの可搬型記録媒体をそれぞれ示している。
【００３９】
プログラムを格納する記録媒体としては、
・プログラム提供者側のデ−タベ−ス４６（回線先メモリ）
・可搬型記録媒体４８
・本体部４１側のＲＡＭやハ−ドディスク
などのいずれでもよく、当該プログラムは本体部４１にロ−デイングされてその主メモリ上で実行される。
【００４０】
【発明の効果】
本発明は、このように、ソ−スプログラム翻訳時のパイプライン処理の命令群に関するパイプライン情報を出力しているので、利用者はこの出力内容を参考にしながらプログラムチュ−ニングを効率的に実行でき、また、オブジェクトプログラムの実行性能を高めることができる。
【図面の簡単な説明】
【図１】本発明の、ソ−スプログラムと、それに対応のパイプライン情報の画面表示例とを示す説明図である。
【図２】本発明の、図１のソ−スプログラムに対するパイプライン処理の概念を示す説明図である。
【図３】本発明の、複合命令Ｕを使用する最適化制御行を図１のＤＯル−プに指定したソ−スプログラムを示す説明図である。
【図４】本発明の、▲１▼，▲２▼のチュ−ニング項目に対処した場合のパイプライン情報を示す説明図である。
【図５】本発明の、図１のＤＯル−プの繰り返し回数を半分にしたソ−スプログラムとそれに対応のパイプライン情報の画面表示例とを示す説明図である。
【図６】本発明の、コンパイラのシステム構成を示す説明図である。
【図７】本発明の、パイプライン情報の出力手順を示す説明図である。
【図８】本発明の、コンピュ−タ読み取り可能な記録媒体からプログラムを読み取って実行するコンピュ−タシステムの概要を示す説明図である。
【符号の説明】
１：ソ−スプログラム
２：ソ−スプログラム１のパイプライン情報
３：▲２▼のチュ−ニング項目に対処したソ−スプログラム
４：▲１▼，▲２▼のチュ−ニング項目に対処した場合のパイプライン情報
５：▲３▼のチュ−ニング項目に対処したソ−スプログラム
６：ソ−スプログラム５のパイプライン情報
１１：メモリ
１２：ロ−ド／ストアパイプライン（Ｌ／ＳＴパイプライン）
１３：ベクトルレジスタ
１４：乗算・加算パイプライン（Ｍ＆Ａパイプライン）
１５：除算パイプライン
１６：加算パイプライン
２１：ソ−スプログラム
２２：コンパイラ
２３：ソ−スプログラムの入力部
２４：解析部
２５：中間言語生成部
２６：最適化部
２７：パイプラインスケジュ−リング部
２８：コ−ド生成部
２９：パイプライン情報の出力機能を有するリスタ部
３０：バッファなどの記憶手段
３１：オブジェクトプログラム
３２：表示手段
４：コンピュ−タシステム
４１：ＣＰＵやディスクドライブ装置などを内蔵した本体部
４２：ディスプレイ
４３：表示画面
４４：キ−ボ−ド
４５：マウス
４６：外部のデ−タベ−ス（ＤＡＳＤなどの回線先メモリ）
４７：モデム
４８：ＣＤ−ＲＯＭやフロッピ−ディスクなどの可搬型記録媒体[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a pipeline information output method and the like, and in particular, by outputting a group of instructions for a pipeline portion when a source program is translated by a combiner or an assembler, a user can The source program can be tuned after knowing in advance the type of each instruction used in the pipeline processing. In this specification, “source program” is simply referred to as “program” as necessary.
[0002]
In general, in order to improve the execution performance of a program, it is desirable for the user to perform the tuning operation of the program more efficiently while confirming the translated content of the source program. The target part of the pipeline processing of the program is designed to meet this demand.
[0003]
[Prior art]
Conventionally, the user has checked the output information indicating the status of optimization and vectorization at the time of translation of the source program, and then considered the hardware characteristics, and the pipeline processing part such as vector calculation Tuning of the source program including
[0004]
That is, pipeline information indicating the type / order of instructions used in pipeline processing, the number of instructions, the number of operations in the pipeline, etc. has not been provided to the user who performs the tuning execution. .
[0005]
[Problems to be solved by the invention]
As described above, the conventional program tuning has a problem in that work efficiency is poor because the pipeline information is not used as the determination material.
[0006]
Therefore, in the present invention, pipeline information at the time of source program translation is obtained and output so that the user can refer to the output contents to improve the efficiency of program tuning, and the object program The purpose is to improve execution performance.
[0007]
[Means for Solving the Problems]
The present invention solves this problem as follows.
Pipeline information indicating the type, order, and number of instructions used in pipeline processing for a group of instructions generated by translation of the target part using a computer when translating the source program including the target part for pipeline processing And displays a screen showing the processing procedure indicated by the pipeline information in time series for each type of instruction.
[0008]
In the present invention, pipeline information relating to a group of pipeline processing instructions at the time of source program translation is output, so that the user can efficiently execute program tuning with reference to the output contents.
[0009]
The present invention provides a pipeline information output method and output apparatus comprising such a solution, and a computer-readable program storage storing a program for causing a computer to execute pipeline information output processing. Targeted at medium.
[0010]
DETAILED DESCRIPTION OF THE INVENTION
An embodiment of the present invention will be described with reference to FIGS.
[0011]
FIG. 1 is an explanatory view showing a source program and a screen display example of pipeline information corresponding to the source program. 1 is a source program, and 2 is a screen displayed when the source program 1 is compiled. Pipeline information is shown.
[0012]
FIG. 2 is an explanatory diagram showing the concept of pipeline processing for the source program of FIG. 1, wherein 11 is a memory, 12 is a load / store pipeline (L / ST pipeline), 13 vector registers, and 14 is A multiplication / addition pipeline (M & A pipeline), 15 is a division pipeline, and 16 is an addition pipeline.
[0013]
Source program 1 is for arrays a, b, c, d, e, and f.
a (i) = (b (i) + c (i) × d (i)) ÷ e (i) + f (i)
It consists of a total of 1024 loop calculations.
[0014]
Pipeline information 2 is
-Vector load instruction L, vector store instruction S (L / ST pipeline)
-Vector multiplication instruction M, vector addition instruction A (M & A pipeline)
-Vector division instruction (DIV pipeline)
-Vector mask instruction (MASK pipeline)
It is made up of.
[0015]
The processing procedure indicated by the pipeline information 2 is as follows:
(1) Load the array b from the memory 11 into the vector register 13 via the load / store pipeline 12;
(2) Load the array c from the memory 11 to the vector register 13 via the load / store pipeline 12;
(3) The array d is loaded from the memory 11 to the vector register 13 via the load / store pipeline 12, and
(4) Multiply the array c and the array d of the vector register 13 by the multiplication / addition pipeline 14;
(5) In the multiplication / addition pipeline 14, add the array b of the vector register 13 and the multiplication result of (4),
(6) The array e is loaded from the memory 11 to the vector register 13 via the load / store pipeline 12, and
(7) The array f is loaded from the memory 11 to the vector register 13 via the load / store pipeline 12, and
(8) In the division pipeline 15, divide the addition result of (5) by the array e,
(9) The addition pipeline 16 adds the array f and the result of (8),
(10) Store the addition result of (9) in the array a of the vector register 13.
It has become.
[0016]
At this time,
Allocation / multiplication / addition of the vector load instruction (4) to the load / store pipeline 12 is performed at the same timing.
[0017]
Also,
-Allocation of vector load instruction (7) to load / store pipeline 12-Allocation of vector division instruction (8) to division pipeline 15-(9) to addition pipeline 16 All vector addition instructions are assigned at the same timing.
[0018]
Hardware (arithmetic circuit) corresponding to each pipeline to which a vector instruction is assigned at the same timing is controlled so as to execute the instruction contents after receiving data necessary for processing there. .
[0019]
For example, the arithmetic circuit corresponding to the addition pipeline 16 receives the output of the division pipeline 15 and the array f loaded into the vector register 13 by the vector load instruction (7) above, Addition processing is executed.
[0020]
The user can confirm from the pipeline information 2 in FIG.
・ Multiplication instructions and addition instructions exist individually, and compound instruction U related to multiplication and addition is not generated. ・ Total number of instructions is 10 (load instruction is 5, store instruction is 1, multiplication instruction is 1. Because the add instruction is 2, the divide instruction is 1), there are few operations in the pipeline.
[0021]
Based on the above confirmation contents, the following tuning items can be extracted.
(1) Designate an option to use the compound instruction U to the command processor.
(2) An optimization control line using the compound instruction U is added to the source program.
(3) Modify the source program so that the number of loop operations is halved, and double the number of operations in the pipeline.
[0022]
The tuning items (1) and (2) above use the optimization function of the compiler. Further, the tuning item (1) does not target the correction of the source program, and the tuning items (2) and (3) described above target the correction of the source program.
[0023]
FIG. 3 shows a source program 3 in which the optimization control line using the compound instruction U is designated in the DO loop of FIG. This corresponds to the tuning item (2) above, and the optimization control line is “! Ocl fmadd”.
[0024]
FIG. 4 shows pipeline information 4 when the tuning items (1) and (2) are dealt with.
[0025]
The pipeline information 4 in FIG.
-When the source program 1 is translated, a compound instruction U of a vector multiplication instruction M and a vector addition instruction A is used,
-As a result, the number of vector instructions is one less than that of the pipeline information of FIG. 1, that is, the vector operation processing is completed earlier by one instruction as a whole,
Is shown.
[0026]
At this time,
Allocation / multiplication / addition of the compound load U to the load / store pipeline 12 is performed at the same timing.
[0027]
Also,
-Allocation of vector load instruction (array f) to load / store pipeline 12-Allocation of vector division instruction to division pipeline 15-Allocation of vector addition instruction to addition pipeline 16 at the same timing Is called.
[0028]
The processing procedure indicated by the pipeline information 4 is the same as the above (1) to (10), and related operations between pipelines in which a plurality of vector instructions are allocated at the same timing are the same as in FIG. Be controlled.
[0029]
FIG. 5 shows a source program 5 in which the number of repetitions of the DO loop of FIG. 1 is halved and a screen display example of pipeline information 6 corresponding thereto. This corresponds to the tuning item (3) above.
[0030]
The processing procedure indicated by the pipeline information 6 in FIG.
First, the processing of the above (1) to (10) relating to the first half of the source program 5, and then the processing of (1) to (10) relating to the second half of the source program 5.
[0031]
The pipeline information 6 in FIG. 5 is compared with that in FIG.
・ The number of repetitions of the DO loop is half “512”,
-The number of vector instructions is twice "20",
Is shown.
[0032]
In the case of the above item (3), the number of DO loop repetitions is reduced to shorten the processing time per instruction, and the number of instructions is increased to facilitate instruction scheduling in the pipeline. As a result, the execution performance of the program is improved.
[0033]
FIG. 6 is an explanatory diagram showing the system configuration of the compiler.
21 is a source program 22, a compiler 23 is a source program 24, an input unit 24 is a lexical analysis, syntax analysis, and semantic analysis of the input source program. An analysis unit 25 is an internal representation of the computer. The intermediate language generation unit 26 that generates the intermediate language represented by (2) has a function of creating pipeline information, for example, by adding a change for improving processing efficiency by eliminating the redundancy of operations. The pipeline scheduling unit 28 is a code generation unit 29 is a pipeline information output function, the lister unit 30 is a buffer, etc. The storage unit 31 is the output of the compiler 22 The object program 32 is the pipeline information display unit Respectively. The distance between the processing unit corresponding to the compiler 22 and the display unit 32 is arbitrary.
[0034]
Among the above components,
The pipeline scheduling unit 27 corresponds to “means for obtaining pipeline information indicating the type, order, and number of instructions used in pipeline processing for an instruction group generated by translation of the target part ”;
The display unit 32 and the lister unit 29 correspond to “an output unit that displays a screen showing the processing procedure indicated by the pipeline information in time series for each type of instruction” .
[0035]
FIG. 7 is an explanatory diagram showing a procedure for outputting pipeline information, and the contents thereof are as follows.
(S1) The input unit 23 determines whether or not the user designates the output option of the pipeline information to the command processor. If “YES”, the process proceeds to the next step, and if “NO”, the output process is performed. Exit.
(S2) The pipeline scheduling unit 27 determines whether or not the input source program 21 is intended for vector operation. If “YES”, the process proceeds to the next step, and if “NO”, End the output process.
(S3) The pipeline scheduling unit 27 stores the processor information (for example, the sequence of the analyzed instructions) for the vector operation part of the source program 21 in the storage unit 30, and proceeds to the next step.
(S4) The lister unit 29 determines whether the processor information is stored in the storage unit 30. If “YES”, the process proceeds to the next step, and if “NO”, the output process ends.
(S5) The lister unit 29 outputs the processor information to the storage means 30, displays it on the display means 32, and ends the output process.
[0036]
Whether or not the processor information is stored in the storage unit 30 in step (S4) is based on the operation content of the lister unit 29.
[0037]
For example, the lister unit 29 has a function of rerunning after the compilation process is interrupted, and processing of the lister unit 29 for the storage unit when the pipeline information is not stored in the storage unit 30 at the time of the rerunning. This is to efficiently perform the above.
[0038]
FIG. 8 is an explanatory diagram showing an outline of a computer system that reads and executes a program from a computer-readable recording medium. , 42 is a display for displaying an image in accordance with an instruction from the main body 41, 43 is a display screen, 44 is a keyboard for inputting various information to the computer system 4, and 45 is an arbitrary screen on the display screen 43. Mouse for specifying the position, 46 is an external database (line-destination memory such as DASD), 47 is a modem for accessing the external database 46, 48 is a CD-ROM or floppy disk, etc. Each of the portable recording media is shown.
[0039]
As a recording medium for storing the program,
・ Program provider's database 46 (line destination memory)
・ Portable recording medium 48
Any of a RAM, a hard disk, and the like on the main unit 41 side may be used, and the program is loaded into the main unit 41 and executed on the main memory.
[0040]
【The invention's effect】
In this way, the present invention outputs the pipeline information related to the instruction group of the pipeline processing at the time of the source program translation, so that the user can efficiently perform the program tuning while referring to the output contents. It can be executed and the execution performance of the object program can be improved.
[Brief description of the drawings]
FIG. 1 is an explanatory diagram showing a source program and a screen display example of corresponding pipeline information according to the present invention.
FIG. 2 is an explanatory diagram showing the concept of pipeline processing for the source program of FIG. 1 according to the present invention.
FIG. 3 is an explanatory diagram showing a source program according to the present invention in which an optimization control line using a compound instruction U is designated in the DO loop of FIG. 1;
FIG. 4 is an explanatory diagram showing pipeline information when dealing with the tuning items (1) and (2) according to the present invention.
FIG. 5 is an explanatory diagram showing a source program according to the present invention in which the number of repetitions of the DO loop of FIG. 1 is halved and a screen display example of corresponding pipeline information.
FIG. 6 is an explanatory diagram showing a system configuration of a compiler according to the present invention.
FIG. 7 is an explanatory diagram illustrating a procedure for outputting pipeline information according to the present invention.
FIG. 8 is an explanatory diagram showing an outline of a computer system for reading and executing a program from a computer-readable recording medium according to the present invention.
[Explanation of symbols]
1: Source program 2: Source program 1 pipeline information 3: Source program 4 that addresses the tuning item (2): Addresses the tuning items (1) and (2) Pipeline information 5: Source program 6 corresponding to the tuning item (3): Pipeline information 11 of the source program 5: Memory 12: Load / store pipeline (L / ST pipeline)
13: Vector register 14: Multiplication / addition pipeline (M & A pipeline)
15: Division pipeline 16: Addition pipeline 21: Source program 22: Compiler 23: Source program input unit 24: Analysis unit 25: Intermediate language generation unit 26: Optimization unit 27: Pipeline scheduling Unit 28: Code generation unit 29: Lister unit 30 having pipeline information output function 30: Storage unit 31 such as a buffer 31: Object program 32: Display unit 4: Computer system 41: Built-in CPU, disk drive device, etc. Main unit 42: display 43: display screen 44: keyboard 45: mouse 46: external database (line-destination memory such as DASD)
47: Modem 48: Portable recording medium such as a CD-ROM or floppy disk

Claims

When translating the source program including the target part of the pipeline processing,
Using a computer
A step of the type of instructions used for instructions that are generated by the target portion of the translation in the pipeline processing, order, Ru seek pipeline information indicating the number of instructions,
Displaying a screen showing a processing procedure indicated by the pipeline information in time series for each type of instruction;
A method for outputting pipeline information, characterized in that:

In a translation apparatus for a source program including a target part of pipeline processing,
Means for obtaining pipeline information indicating the type, order, and number of instructions used in pipeline processing for an instruction group generated by translation of the target portion;
Output means for displaying a screen for understanding the processing procedure indicated by the pipeline information in time series for each type of instruction;
An apparatus for outputting pipeline information.

A recording medium storing a translation program for a source program including a target part of pipeline processing,
The translation program, the type of instructions used in the pipeline processing with the instructions generated by the target portion of the translation, the order, asking the pipeline information indicating the number of instructions, wherein the time series for each type of instruction This is to make a computer realize the function of displaying a screen that shows the processing procedure indicated by the pipeline information .
A computer-readable recording medium.