JP7271957B2

JP7271957B2 - Dynamic link device, dynamic load device, computer system, dynamic link method, dynamic load method, dynamic link program, and dynamic load program

Info

Publication number: JP7271957B2
Application number: JP2019003516A
Authority: JP
Inventors: 照之今井
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2019-01-11
Filing date: 2019-01-11
Publication date: 2023-05-12
Anticipated expiration: 2039-01-11
Also published as: JP2020113050A

Description

The present invention relates to a dynamic link device, a dynamic load device, a computer system, a dynamic link method, a dynamic load method, a dynamic link program, and a dynamic load program.

In today's computer systems, constraints of semiconductor technology and power consumption have caused the performance gains obtained by raising the clock frequency of general-purpose computers to level off. As a result, heterogeneous computer systems, which combine computers with different characteristics such as accelerators suited to particular kinds of processing, are becoming more common. One example is a computer system composed of a host having a general-purpose processor that performs ordinary computation and input/output, and an accelerator having a processor suited to numerical computation. Some accelerators, such as Intel Xeon Phi (registered trademark) and NEC SX-Aurora TSUBASA (registered trademark), support operating system functions and shared libraries.

Shared libraries are widely used in computer systems because they improve software productivity by increasing program reusability, and because they reduce the storage and memory consumed by duplicate code for functions used by multiple programs. Standard operating systems support shared libraries. For example, Linux (registered trademark) and Unix (registered trademark) operating systems support ELF (Executable and Linking Format) and ELF shared objects, which are shared libraries that can be dynamically linked and dynamically loaded.

In a heterogeneous computer system, an executable file running on the computer that starts execution of a program (the host) invokes another computer suited to specific processing (the accelerator). This specific processing is sometimes provided as a shared library so that the optimized code can be reused.

In a distributed program that runs on the multiple computers of a heterogeneous computer system, RPC (Remote Procedure Call) is used when a program uses a facility (for example, a function or method) that runs on a computer other than its own (hereinafter also referred to as a server). RPC requires cumbersome descriptions and processing that ordinary function calls do not. For example, when RPC is used, the client converts the arguments to be passed to the remotely called function into a block of data for transfer, a process called marshalling or serialization, and transfers the converted data to the server that executes the function. The server restores the marshalled or serialized data to its original form (unmarshalling, deserialization) and actually calls the function. The server then marshals the result of executing the function and returns it to the client. The client unmarshals the result returned by the server.
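The marshalling round trip described above can be sketched in a few lines. This is only an illustration: the wire format (a 4-byte function identifier followed by two 8-byte integers), the function identifier 7, and all helper names are assumptions made for this example and do not come from ONC RPC or from the invention.

```python
import struct

# Hypothetical wire formats (assumptions for illustration only):
# a call is a 4-byte function id plus two 8-byte signed integers,
# a result is a single 8-byte signed integer. "<" means little-endian, no padding.
CALL_FORMAT = "<Iqq"
RESULT_FORMAT = "<q"


def marshal_call(func_id: int, a: int, b: int) -> bytes:
    """Client side: turn the call arguments into a block of bytes for transfer."""
    return struct.pack(CALL_FORMAT, func_id, a, b)


def unmarshal_call(payload: bytes) -> tuple:
    """Server side: restore the function id and arguments from the bytes."""
    return struct.unpack(CALL_FORMAT, payload)


def marshal_result(value: int) -> bytes:
    """Server side: turn the return value into bytes for the reply."""
    return struct.pack(RESULT_FORMAT, value)


def unmarshal_result(payload: bytes) -> int:
    """Client side: restore the return value from the reply."""
    return struct.unpack(RESULT_FORMAT, payload)[0]


# Round trip: the "server" here simply adds the two arguments.
payload = marshal_call(7, 40, 2)
func_id, a, b = unmarshal_call(payload)
reply = marshal_result(a + b)
result = unmarshal_result(reply)
```

Even in this toy form, every remote call needs four conversion steps that a local function call does not, which is the overhead the passage refers to.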

In general, a program that calls a function on another computer by RPC uses a client stub and a server stub. The client stub performs marshalling on the client side, issues a request to the server, receives the return value from the server, and returns that value to the caller. The server stub accepts the request on the server side, performs unmarshalling, passes the arguments to the function on the server, executes the function, and sends the return value to the client. The developer writes a program that calls the client stub and links the client stub into it.
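The division of labor between the two stubs might look like the following sketch. The transport is replaced by a direct in-process function call, and the function table, identifier 1, and names such as `client_stub_add` are hypothetical, chosen only to make the roles visible.

```python
import struct


def add(a: int, b: int) -> int:
    """The function that actually lives on the server."""
    return a + b


# Hypothetical table mapping function ids to server-side functions.
SERVER_FUNCTIONS = {1: add}


def server_stub(request: bytes) -> bytes:
    """Accept a request, unmarshal it, call the real function, marshal the result."""
    func_id, a, b = struct.unpack("<Iqq", request)
    result = SERVER_FUNCTIONS[func_id](a, b)
    return struct.pack("<q", result)


def client_stub_add(a: int, b: int, transport=server_stub) -> int:
    """Marshal the arguments, send the request, and unmarshal the reply.

    `transport` stands in for the network or I/O link; here it is a direct call.
    """
    request = struct.pack("<Iqq", 1, a, b)
    reply = transport(request)
    return struct.unpack("<q", reply)[0]


# The application calls the client stub as if it were the remote function.
total = client_stub_add(2, 3)
```

The application code only ever sees `client_stub_add`, but someone still has to write or generate that stub and link it in.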

Non-Patent Document 1 describes ONC (Open Network Computing) RPC as an example of RPC.

Patent Document 1 describes a graphics system including a host and an accelerator connected to the host. In this graphics system, a structure management unit is provided in a library on the host. Through the structure management unit, a client can store structure edit requests directly in shared memory within the accelerator.

When a library for a guest platform needs to be loaded, the technology described in Patent Document 2 determines whether an alternative library for the host exists and, if so, loads and uses that alternative library. If no alternative library for the host exists, the technology described in Patent Document 2 uses the guest library through emulation or binary translation.

Patent Document 1: JP-A-09-319856
Patent Document 2: Japanese Patent Publication No. 2013-511782

Non-Patent Document 1: R. Srinivasan, "RPC: Remote Procedure Call Protocol Specification Version 2", [online], August 1995, Request for Comments: 1831 (RFC 1831), [retrieved December 25, 2018], Internet <http://www.ring.gr.jp/pub/doc/RFC/rfc1831.txt>

Automatically generating the client stub and server stub used for RPC (hereinafter collectively referred to as stubs) reduces the cost of developing the stubs. For example, the interface of a function is described in an IDL (Interface Definition Language), and an IDL compiler generates the stubs. Specifically, RPCGEN is known as an IDL compiler for ONC RPC. RPCGEN is a compiler that generates C-language server and client stubs from an interface definition file.

However, although automatic stub generation from IDL reduces the cost of stub development, the stubs still have to be linked by the program.

The present invention has been made to solve the above problem, and an object thereof is to provide a technology that allows an accelerator program to link a shared library of a host without writing stubs.

The dynamic link device of the present invention includes: loading means for loading a shared library into the memory of its own device and linking it from a calling program; and platform switching means for identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the own device or another device.

The dynamic load device of the present invention includes: loading means for loading a shared library into the memory of its own device and linking it from a calling program; and platform switching means for identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the own device or another device.

The computer system of the present invention includes a first information processing device and a second information processing device connected to the first information processing device. The first information processing device includes: loading means for loading a shared library into the memory of the first information processing device and linking it from a calling program; and platform switching means for identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the first information processing device or the second information processing device.

The dynamic link method of the present invention includes: loading a shared library into the memory of the own device and linking it from a calling program; and identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the own device or another device.

The dynamic load method of the present invention includes: loading a shared library into the memory of the own device and linking it from a calling program; and identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the own device or another device.

The dynamic link program of the present invention causes a computer to execute: a function of loading a shared library into the memory of the own device and linking it from a calling program; and a function of identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the own device or another device.

The dynamic load program of the present invention causes a computer to execute: a function of loading a shared library into the memory of the own device and linking it from a calling program; and a function of identifying the target of the shared library and switching the function call facility by which the calling program calls the shared library to either the own device or another device.

According to the present invention, an accelerator program can link a shared library of a host without writing stubs.

FIG. 1 is a block diagram showing a configuration example of a dynamic link device according to a first embodiment of the present invention.
FIG. 2 is a block diagram showing a configuration example of a computer system as an example of an environment in which the dynamic link device shown in FIG. 1 operates.
FIG. 3 is a flowchart explaining an operation example (a dynamic link method, a dynamic load method, a dynamic link program, and a dynamic load program) of the dynamic link device operating on the computer system shown in FIG. 2.
FIG. 4 is a diagram showing an example of the layout of the memory space of the accelerator in the initial state when the program "a.out" is started in the first embodiment.
FIG. 5 is a diagram showing an example of the layout of the memory space of the accelerator after a dynamically linked shared library is loaded in the first embodiment.
FIG. 6 is a diagram showing an example of the layout of the memory spaces of the accelerator and the host upon completion of delayed linking of a dynamically linked shared library in the first embodiment.
FIG. 7 is a block diagram showing a configuration example of a dynamic link device according to a second embodiment of the present invention.
FIG. 8 is a block diagram showing a configuration example of a dynamic load device according to a third embodiment of the present invention.
FIG. 9 is a block diagram showing a configuration example of a computer system according to a fourth embodiment of the present invention.

[First embodiment]
(Description of configuration)
FIG. 1 is a block diagram showing a configuration example of a dynamic link device 10 according to the first embodiment of the present invention. The dynamic link device 10 shown in FIG. 1 includes a loading unit 11 (an example of loading means) and a platform switching unit 12 (an example of platform switching means).

The loading unit 11 is a dynamic linker or dynamic loader that loads a shared library into the memory of the accelerator and links it from the calling program. The loading unit 11 loads both dynamic objects that are shared libraries targeting the accelerator and dynamic objects that are shared libraries targeting the host.

The platform switching unit 12 identifies the target of the shared library loaded by the loading unit 11. The platform switching unit 12 also switches the function call facility by which the calling program calls functions of the shared library.

Each of the above units in this embodiment can be realized by hardware or in an environment that combines hardware and software. For example, the units can be realized in the environment shown in FIG. 2.

FIG. 2 is a block diagram showing a configuration example of a computer system 20 as an example of an environment in which the dynamic link device 10 shown in FIG. 1 operates. The computer system 20 shown in FIG. 2 includes a host 21 and an accelerator 22. The host 21 and the accelerator 22 are connected via an I/O (Input/Output) interface 23. The host 21 and the accelerator 22 each include one or more processors and a memory. In the example shown in FIG. 2, the host 21 includes n processors 211-1 to 211-n and a memory 212. The accelerator 22 includes a processor 221 and a memory 222. The host 21 also includes a disk device 213, on which the file system used by the host 21 is built. The disk device 213 stores the operating system of the host 21, the programs executed by the host 21 and the accelerator 22, and the shared libraries. The shared libraries are dynamically linked to the programs that call them.

The host 21 and the accelerator 22 are, for example, devices equipped with hardware that provides information (data, services, and the like) over a network. Here, hardware means, for example, an electronic circuit such as an IC or an FPGA, or a computer (for example, a device that realizes functions by having a CPU or GPU execute a program stored in memory). Hardware may also refer to a device that combines an electronic circuit and a computer. IC stands for Integrated Circuit, FPGA for Field Programmable Gate Array, CPU for Central Processing Unit, and GPU for Graphics Processing Unit.

The following description covers the case where the dynamic link device 10 is realized in the accelerator 22. This does not limit the first embodiment, however. The dynamic link device 10 may operate on the host 21. In other words, the dynamic link device 10 only needs to operate on either the accelerator 22 or the host 21 (hereinafter also referred to as the first information processing device or the own device). The dynamic link device 10 switches function calls to a shared library made in the first information processing device to either the first information processing device or a device connected to the first information processing device (hereinafter also referred to as the second information processing device or the other device). For example, when the dynamic link device 10 operates on the accelerator 22, the accelerator 22 is the first information processing device and the host 21 is the second information processing device. When the dynamic link device 10 operates on the host 21, the host 21 is the first information processing device and the accelerator 22 is the second information processing device.

(Description of operation)
FIG. 3 is a flowchart explaining an operation example of the dynamic link device 10 operating on the computer system 20 shown in FIG. 2. In other words, this flowchart describes the dynamic link method, the dynamic load method, the dynamic link program, and the dynamic load program.

When the program "a.out" is started on the accelerator 22, the loading unit 11 loads the libraries that the program "a.out" dynamically links (step S10).

Once a dynamically linked library has been loaded, the platform switching unit 12 determines whether the target of the library is the host 21 or the accelerator 22 (step S11).
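The text does not say how the target is identified. One plausible approach, shown here purely as an assumption, is to read the `e_machine` field of the library's ELF header and compare it with the host's machine type. The sketch assumes an x86-64 host (`EM_X86_64` is 62 in the ELF specification); `fake_header` and the value `0xBEEF` are made-up test fixtures, not real machine codes.

```python
import struct

EM_X86_64 = 62  # e_machine value for x86-64; the host being x86-64 is an assumption


def elf_machine(header: bytes) -> int:
    """Return the e_machine field of an ELF header."""
    if header[:4] != b"\x7fELF":
        raise ValueError("not an ELF file")
    # e_ident[EI_DATA] (byte 5): 1 = little-endian, 2 = big-endian
    byte_order = "<" if header[5] == 1 else ">"
    # e_machine is a 16-bit field at offset 18,
    # after e_ident (16 bytes) and e_type (2 bytes).
    (machine,) = struct.unpack_from(byte_order + "H", header, 18)
    return machine


def targets_host(header: bytes, host_machine: int = EM_X86_64) -> bool:
    """Hypothetical check corresponding to step S11: does the library target the host?"""
    return elf_machine(header) == host_machine


def fake_header(machine: int) -> bytes:
    """Build a minimal 64-bit little-endian ELF header prefix for demonstration."""
    e_ident = b"\x7fELF" + bytes([2, 1, 1]) + bytes(9)  # 16 bytes in total
    return e_ident + struct.pack("<HH", 3, machine)      # e_type = ET_DYN (3)


host_lib = fake_header(EM_X86_64)
accel_lib = fake_header(0xBEEF)  # made-up machine code standing in for the accelerator
```

Other identification schemes, such as a naming convention or a separate search path per target, would fit the description equally well.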

If the library targets the host (Yes in step S11), the platform switching unit 12 requests that the same library also be loaded on the host 21 (step S14). After the library is loaded, the platform switching unit 12 makes the settings so that a call from the program "a.out" to a function contained in the library on the host 21 jumps to the host stub. Specifically, the platform switching unit 12 sets the PLT (Procedure Linkage Table), which stores the code for calling the function, so that it jumps to the host stub (step S15).

If, on the other hand, the library targets the accelerator (No in step S11), the platform switching unit 12 executes the initialization processing of the library loaded in step S10 (step S12). The platform switching unit 12 then sets the GOT (Global Offset Table), thereby completing relocation as shown in FIG. 5, described later (step S13). Steps S12 and S13 are also performed by a typical dynamic linker.
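The branch structure of steps S10 to S15 can be summarized in a short sketch. The classes `Library` and `LinkState` and the log strings are hypothetical bookkeeping invented for this illustration; a real dynamic linker would map files and patch tables rather than append to lists.

```python
from dataclasses import dataclass, field


@dataclass
class Library:
    name: str
    target: str  # "host" or "accelerator"


@dataclass
class LinkState:
    loaded_on_accelerator: list = field(default_factory=list)
    loaded_on_host: list = field(default_factory=list)
    log: list = field(default_factory=list)


def link_library(lib: Library, state: LinkState) -> LinkState:
    """Follow the flow of FIG. 3 for one dynamically linked library."""
    # S10: the loading unit loads the library into accelerator memory.
    state.loaded_on_accelerator.append(lib.name)
    # S11: the platform switching unit identifies the target.
    if lib.target == "host":
        # S14: request that the same library also be loaded on the host.
        state.loaded_on_host.append(lib.name)
        # S15: arrange for calls through the PLT to reach the host stub.
        state.log.append("S15: route PLT to host stub")
    else:
        # S12 and S13: the same work an ordinary dynamic linker does.
        state.log.append("S12: run library initialization")
        state.log.append("S13: set GOT and complete relocation")
    return state


state = LinkState()
link_library(Library("libhost.so", "host"), state)
link_library(Library("libaccel.so", "accelerator"), state)  # hypothetical library name
```

Note that a host library is loaded in both memory spaces, while an accelerator library takes the conventional path.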

The processing of step S10 shown in FIG. 3 will be described in detail with reference to FIGS. 4 and 5.

FIG. 4 is a diagram showing an example of the layout of the memory space 3 of the memory 222 of the accelerator 22 in the initial state when the program "a.out" is started in this embodiment. In this example, "a.out" is, for example, an ELF dynamically linked executable file.

As shown in FIG. 4, the memory space 3 of the accelerator 22 contains a text area 311, which holds the header and text of the program "a.out", and a data area 312, which stores the data of "a.out". The memory space 3 of the accelerator 22 also contains a text area 321, which holds the header and text of the dynamic linker "ld.so" that loaded the program "a.out", and a data area 322, which stores the data of the dynamic linker "ld.so".

The program "a.out" links the shared library "libhost.so" and calls the function func contained in "libhost.so".

The text area 311 contains the PLT for calling dynamically linked functions. The PLT includes func@plt 3111, the entry used to call the function func.

The data area 312 has GOT 3121, the entry corresponding to the function func in ".got.plt", which stores the addresses of functions in dynamically linked libraries. GOT 3121 is used as the operand of the indirect jump instruction stored in the area designated by func@plt. The initial value of GOT 3121 is the address of the instruction that follows the indirect jump instruction in func@plt 3111. This initial value is what makes delayed linking (lazy binding) possible.

The text area 321 of the dynamic linker contains a shared library stub 3211 and a host stub 3212.

Using the symbol table of the dynamic linker "ld.so", the shared library stub 3211 resolves the address at which the function for a given PLT entry is located and relocates the PLT. That is, the shared library stub 3211 updates the corresponding entry in ".got.plt" and jumps to the address of the function. After relocation by the shared library stub 3211, the function in the shared library is called by the jump instruction in the PLT without passing through the shared library stub.
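The conventional delayed-linking behavior of the PLT, the ".got.plt" entry, and the shared library stub can be modeled roughly as follows. This is a simulation, not machine code: a dictionary plays the role of ".got.plt", a missing key stands for an entry that still holds its initial value, and the class name `LazyBinder` is an assumption of this sketch.

```python
class LazyBinder:
    """Toy model of PLT / .got.plt delayed linking for an ordinary shared library."""

    def __init__(self, symbol_table):
        self.symbol_table = symbol_table  # symbol name -> callable ("address")
        self.got = {}                     # models .got.plt; empty means unresolved
        self.resolutions = 0              # how often the resolver stub ran

    def _shared_library_stub(self, name):
        # Plays the role of shared library stub 3211: look up the symbol,
        # update the .got.plt entry, and hand back the resolved function.
        self.resolutions += 1
        target = self.symbol_table[name]
        self.got[name] = target
        return target

    def call_via_plt(self, name, *args):
        # Plays the role of func@plt: jump through the GOT entry if it has
        # been resolved, otherwise fall through to the resolver stub.
        target = self.got.get(name)
        if target is None:
            target = self._shared_library_stub(name)
        return target(*args)


binder = LazyBinder({"func": lambda x: x * 2})
first = binder.call_via_plt("func", 21)   # resolves, then calls
second = binder.call_via_plt("func", 4)   # goes straight through the GOT entry
```

Only the first call pays the cost of symbol lookup; later calls bypass the stub, which mirrors the last sentence of the paragraph above.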

The host stub 3212 notifies the host 21, from the accelerator 22, of the values passed as arguments of the function call together with the function to be called on the host 21 side, and requests that the function be called on the host 21. As one example, the host stub 3212 marshals the values of the registers specified by the ABI (Application Binary Interface) and the address of the function to be called, and transfers them to the host 21 via the I/O interface 23.
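A message of the kind the host stub might send can be sketched as a fixed-size record holding the function address followed by the argument registers. The layout, the count of six argument registers (borrowed from the x86-64 System V convention purely for illustration), and the function names are assumptions; the accelerator's actual ABI determines which registers matter.

```python
import struct

# Assumption for illustration: six 64-bit integer argument registers.
NUM_ARG_REGISTERS = 6
# One 64-bit function address followed by the register values, little-endian.
REQUEST_FORMAT = "<Q" + "Q" * NUM_ARG_REGISTERS


def marshal_host_call(func_addr: int, regs: list) -> bytes:
    """Pack the host function address and argument register values into bytes."""
    if len(regs) > NUM_ARG_REGISTERS:
        raise ValueError("too many register arguments for this sketch")
    # Unused registers are sent as zero so that the message has a fixed size.
    padded = list(regs) + [0] * (NUM_ARG_REGISTERS - len(regs))
    return struct.pack(REQUEST_FORMAT, func_addr, *padded)


def unmarshal_host_call(message: bytes):
    """Host side: recover the function address and register values."""
    func_addr, *regs = struct.unpack(REQUEST_FORMAT, message)
    return func_addr, regs


message = marshal_host_call(0x7F0000001000, [1, 2, 3])
addr, regs = unmarshal_host_call(message)
```

Because the stub works at the register level, it needs no knowledge of the function's prototype, which is what removes the need for per-function IDL descriptions.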

The loading unit 11 loads the libraries that the program "a.out" requires, that is, the libraries it dynamically links, from the file system into the memory space 3. The loading unit 11 then executes the initialization routine of "a.out". In this example, the text area and data area of "libhost.so" are mapped from the file system on the disk device 213 into the memory space 3 of the accelerator 22.

FIG. 5 is a diagram showing an example of the layout of the memory space 3 of the accelerator 22 after the dynamically linked shared library has been loaded by the initialization processing of the dynamic linker "ld.so" in this example. Areas that are the same as in FIG. 4 are given the same numbers as in FIG. 4, and their detailed description is omitted. As in FIG. 4, the memory space 3 of the accelerator 22 shown in FIG. 5 contains the text area 311 and data area 312 of "a.out" and the text area 321 and data area 322 of the dynamic linker "ld.so". In addition, the memory space 3 shown in FIG. 5 contains a text area 431, which stores the header and text of "libhost.so", and a data area 432, which stores the data of "libhost.so". The text area 431 contains the function func 4311. However, the function func 4311 is a binary to be executed by the host 21 and cannot be executed by the processor 221 of the accelerator 22.

The PLT setting processing performed by the platform switching unit 12 in this example, that is, delayed linking (step S15), will be described in detail with reference to FIG. 6.

FIG. 6 is a diagram showing an example of the layout of the memory space 3 of the memory 222 of the accelerator 22 and the memory space 5 of the memory 212 of the host 21 upon completion of the delayed linking that the platform switching unit 12 performs at the time of a function call in this embodiment. Areas that are the same as in FIGS. 4 and 5 are given the same numbers as in FIGS. 4 and 5, and their description is omitted. The memory space 5 of the host 21 contains a text area 531 and a data area 532 of "libhost.so". The text area 531 and the data area 532 are the areas loaded by the loading unit 11 in step S14. The text area 531 contains the function func 5311. The func stub 541 placed in the memory space 3 of the accelerator 22 is a stub for calling the function func 5311 on the host 21.

When the program "a.out" calls the function func, the accelerator 22 first executes the code of func@plt 3111 in the text area 311 of "a.out". In a program that supports dynamic linking, a PLT entry contains an indirect jump instruction to the address held in the corresponding ".got.plt" entry, and an instruction that sets the PLT number and jumps to the shared library stub 3211. Because GOT 3121, the ".got.plt" entry in the data area 312, has not yet been set and still holds its initial value, the indirect jump transfers control to the next instruction. Processing on the accelerator 22 then jumps to the shared library stub 3211.

The processing of the shared library stub 3211 in this embodiment is as follows. The shared library stub 3211 looks up the corresponding address in the symbol table. The platform switching unit 12 identifies whether the text area containing the address obtained by the lookup belongs to a library for the host or to a library for the accelerator.
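Deciding which library a resolved address belongs to amounts to a range lookup over the loaded text areas. A minimal sketch, assuming the linker keeps a sorted list of text areas tagged with their target, is shown below; the class `TextAreaMap` and the addresses are hypothetical.

```python
import bisect


class TextAreaMap:
    """Maps an address to the target ("host" or "accelerator") of its text area."""

    def __init__(self):
        self._starts = []  # start addresses, kept sorted
        self._areas = []   # (start, end, library name, target), same order

    def add(self, start: int, size: int, name: str, target: str) -> None:
        idx = bisect.bisect_left(self._starts, start)
        self._starts.insert(idx, start)
        self._areas.insert(idx, (start, start + size, name, target))

    def target_of(self, addr: int):
        # Find the last area whose start is not greater than addr.
        idx = bisect.bisect_right(self._starts, addr) - 1
        if idx >= 0:
            start, end, _name, target = self._areas[idx]
            if start <= addr < end:
                return target
        return None  # the address is not inside any known text area


areas = TextAreaMap()
areas.add(0x1000, 0x1000, "a.out", "accelerator")
areas.add(0x4000, 0x2000, "libhost.so", "host")
```

With this map, `areas.target_of(addr)` gives the branch condition for the two cases described next.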

If the address belongs to a library for the accelerator, the platform switching unit 12 updates GOT 3121, the ".got.plt" entry, with the obtained address, just as a typical dynamic linker does.

If the address belongs to a library for the host, the platform switching unit 12 allocates an area in the memory space 3 of the accelerator 22 and creates the func stub 541. The func stub 541 is a sequence of accelerator instructions that sets the address of the function func 5311 on the host 21 and jumps to the host stub 3212. The address of the function func 5311 on the host 21 is set in one of the registers that the ABI allows a function call to clobber. Alternatively, the address of the function func 5311 on the host 21 may be placed on the top of the stack, or a TLS (Thread Local Storage) area large enough for one address may be reserved for this purpose. After creating the func stub 541, the platform switching unit 12 sets GOT 3121, the ".got.plt" entry, to the address of the func stub 541, and the accelerator 22 jumps to the host stub 3212. The host stub 3212 calls the function func 5311 on the host 21.

In the second and subsequent calls to the function func, ".got.plt" has already been updated, so the accelerator 22 goes from the PLT to the func stub 541, which specifies the address of the function func 5311, and then jumps to the host stub 3212. As a result, the function func 5311 of the host 21 is called from the host stub 3212.
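
The flow described above (resolve on first call, pick the platform, then patch the ".got.plt" entry) can be illustrated with a small simulation. This is only a sketch under stated assumptions, not the patented implementation: the `Process` and `Symbol` classes, the `platform` labels, and the two counters are hypothetical names introduced for illustration, and Python callables stand in for machine-code stubs and addresses.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class Symbol:
    name: str
    platform: str                 # "accelerator" or "host" (hypothetical labels)
    impl: Callable[..., int]      # stands in for the function's address


@dataclass
class Process:
    symbols: Dict[str, Symbol]
    got: Dict[str, Callable[..., int]] = field(default_factory=dict)
    resolutions: int = 0          # how often the resolver ran
    host_calls: int = 0           # how often the host stub was entered

    def host_stub(self, target: Callable[..., int], *args: int) -> int:
        # Plays the role of the host stub 3212: forwards the call to the host.
        self.host_calls += 1
        return target(*args)

    def resolve(self, name: str) -> Callable[..., int]:
        # Plays the role of the shared library stub 3211 together with the
        # platform switching unit 12: look up the symbol, then patch the GOT.
        self.resolutions += 1
        sym = self.symbols[name]
        if sym.platform == "accelerator":
            # Same as an ordinary dynamic linker: GOT points at the function.
            self.got[name] = sym.impl
        else:
            # Per-function stub (like the func stub 541): remembers the host
            # function and jumps to the shared host stub.
            def func_stub(*args: int, _target=sym.impl) -> int:
                return self.host_stub(_target, *args)
            self.got[name] = func_stub
        return self.got[name]

    def call(self, name: str, *args: int) -> int:
        # Plays the role of a PLT entry: jump through the GOT, or fall back
        # to the resolver when the entry has not been patched yet.
        entry = self.got.get(name)
        if entry is None:
            entry = self.resolve(name)
        return entry(*args)


proc = Process({
    "func": Symbol("func", "host", lambda x: x + 1),
    "local": Symbol("local", "accelerator", lambda x: x * 2),
})
first = proc.call("func", 1)    # resolver runs, stub is created
second = proc.call("func", 2)   # goes straight through the patched GOT
```

In this sketch the second call to `func` skips `resolve` entirely, which mirrors the behavior the text describes for the second and subsequent calls.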

(Explanation of effect)
According to the first embodiment described above, because the platform switching unit 12 is provided, a program on the accelerator 22 can link a shared library of the host 21 without the programmer writing a stub.

(Modification)
The first embodiment described above uses the standard Linux (registered trademark) dynamic linker as an example. However, this is only an example, and the first embodiment is not limited to such a dynamic linker.

The dynamic loader may also operate through a call to an API (Application Programming Interface) such as the dlopen function, instead of loading libraries based on dependencies when the program is loaded.

Lazy linking is also not essential. For example, the func stub 541 may be generated immediately after loading, before the first call, and the ".got.plt" entry may be set to the address of the func stub 541 at that point.
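
The eager variant can be sketched in the same spirit: every entry is filled in at load time, so no resolver runs on the first call. The function name `bind_eagerly`, the `(platform, impl)` tuple layout, and the `forward_to_host` callback are assumptions made for this illustration only.

```python
from typing import Callable, Dict, List, Tuple

SymbolTable = Dict[str, Tuple[str, Callable[..., int]]]


def bind_eagerly(symbols: SymbolTable,
                 forward_to_host: Callable[..., int]) -> Dict[str, Callable[..., int]]:
    """Fill a GOT-like table for every symbol right after loading."""
    got: Dict[str, Callable[..., int]] = {}
    for name, (platform, impl) in symbols.items():
        if platform == "accelerator":
            got[name] = impl
        else:
            # Per-function stub created up front instead of on first call.
            got[name] = lambda *args, _impl=impl: forward_to_host(_impl, *args)
    return got


forwarded: List[tuple] = []   # records calls that went through the host path


def forward_to_host(impl: Callable[..., int], *args: int) -> int:
    forwarded.append(args)
    return impl(*args)


got = bind_eagerly(
    {"func": ("host", lambda x: x + 1), "local": ("accelerator", lambda x: x * 2)},
    forward_to_host,
)
```

The trade-off is the usual one between lazy and eager binding: eager binding pays the stub-creation cost for every symbol at load time, including symbols that are never called.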

Depending on the ISA (Instruction Set Architecture) or ABI, it may be possible to point directly at the host stub without preparing a stub for each function (the equivalent of the func stub 541). For example, if the address of the function call (func@plt) is still available at the time of the jump to the stub, the corresponding PLT entry can be determined from that address; that is, the function func to be called can be identified.
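
Identifying the function from a PLT address amounts to simple index arithmetic when PLT entries have a fixed size. The sketch below assumes a layout with a 16-byte header followed by 16-byte entries, as in the traditional x86-64 PLT; the base address, the symbol list, and the helper names are made up for illustration, and an accelerator ISA may use a different layout.

```python
from typing import List

PLT_BASE = 0x1020        # hypothetical start address of the .plt section
PLT_HEADER_SIZE = 16     # assumed size of the resolver header entry
PLT_ENTRY_SIZE = 16      # assumed size of each per-function entry

# Symbols in the order of their PLT entries (hypothetical contents).
PLT_SYMBOLS: List[str] = ["func", "other_func", "third_func"]


def plt_index(addr: int) -> int:
    """Map an address inside the PLT to the index of its entry."""
    offset = addr - PLT_BASE - PLT_HEADER_SIZE
    if offset < 0:
        raise ValueError("address is before the first PLT entry")
    index = offset // PLT_ENTRY_SIZE
    if index >= len(PLT_SYMBOLS):
        raise ValueError("address is past the last PLT entry")
    return index


def symbol_for_plt_address(addr: int) -> str:
    """Recover which function a call through the PLT was aimed at."""
    return PLT_SYMBOLS[plt_index(addr)]
```

With this information available to a single shared stub, a separate per-function stub is no longer needed to carry the target's identity.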

The embodiment described above assumes that every general-purpose register that might hold a numerical argument is transferred, but the present embodiment is not limited to this.

For example, on an ABI such as x86-64, in which arguments are stored in different registers depending on their type, the dynamic loader marshals and sends all arguments. x86-64 is one of the instruction sets that extend the instruction set architecture of Intel (registered trademark) to 64 bits.
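
On x86-64 under the System V ABI, integer arguments travel in general-purpose registers and floating-point arguments in SSE registers, so forwarding a call means gathering values from both register classes into one buffer. The following sketch shows one possible wire format; the tag bytes, the little-endian layout, and the function names are assumptions chosen for illustration and are not taken from the patent.

```python
import struct
from typing import List, Sequence, Union

Arg = Union[int, float]


def marshal_args(args: Sequence[Arg]) -> bytes:
    """Pack integer and floating-point arguments into one tagged buffer."""
    parts = [struct.pack("<I", len(args))]        # argument count
    for arg in args:
        if isinstance(arg, bool):
            raise TypeError("bool is not supported in this sketch")
        if isinstance(arg, int):
            parts.append(b"i" + struct.pack("<q", arg))   # 64-bit signed integer
        elif isinstance(arg, float):
            parts.append(b"d" + struct.pack("<d", arg))   # 64-bit double
        else:
            raise TypeError(f"unsupported argument type: {type(arg).__name__}")
    return b"".join(parts)


def unmarshal_args(buf: bytes) -> List[Arg]:
    """Rebuild the argument list on the receiving side."""
    (count,) = struct.unpack_from("<I", buf, 0)
    pos = 4
    args: List[Arg] = []
    for _ in range(count):
        tag = buf[pos:pos + 1]
        pos += 1
        if tag == b"i":
            (value,) = struct.unpack_from("<q", buf, pos)
        elif tag == b"d":
            (value,) = struct.unpack_from("<d", buf, pos)
        else:
            raise ValueError(f"unknown tag: {tag!r}")
        pos += 8
        args.append(value)
    return args
```

A type tag per argument lets the receiving side place each value back into the register class its own ABI expects.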

When the dynamic loader uses DWARF debug information or name-mangled function names (as in C++, for example), it may operate as follows. The dynamic loader obtains the type and size of each argument from information contained in the binary and, when there is a structure pointer argument, copies in and marshals the pointed-to data for transfer. DWARF is a widely used standard for debugging data formats.

[Second embodiment]
FIG. 7 is a block diagram showing a configuration example of a dynamic link device 300 according to the second embodiment of the present invention. The dynamic link device 300 includes a loading unit 302 (an example of loading means) and a platform switching unit 304 (an example of platform switching means).

The loading unit 302 loads a shared library into the memory 222 of the accelerator 22 and links it from the calling program. The platform switching unit 304 identifies the target of the shared library and switches the handling of shared library function calls from the calling program to its own device or to another device. That is, the loading unit 302 and the platform switching unit 304 operate in the same way as the loading unit 11 and the platform switching unit 12, respectively.

According to this embodiment, a program on the accelerator 22 can link a shared library of the host 21 without the programmer writing a stub. This is because the loading unit 302 and the platform switching unit 304 of the dynamic link device 300 operate in the same way as the corresponding components of the first embodiment.

[Third embodiment]
FIG. 8 is a block diagram showing a configuration example of a dynamic load device 400 according to the third embodiment of the present invention. The dynamic load device 400 includes a loading unit 402 (an example of loading means) and a platform switching unit 404 (an example of platform switching means).

The loading unit 402 loads a shared library into the memory 222 of the accelerator 22 and links it from the calling program. The platform switching unit 404 identifies the target of the shared library and switches the handling of shared library function calls from the calling program to its own device or to another device. That is, the loading unit 402 and the platform switching unit 404 operate in the same way as the loading unit 11 and the platform switching unit 12, respectively.

According to this embodiment, a program on the accelerator 22 can link a shared library of the host 21 without the programmer writing a stub. This is because the loading unit 402 and the platform switching unit 404 of the dynamic load device 400 operate in the same way as the corresponding components of the first embodiment.

[Fourth embodiment]
FIG. 9 is a block diagram showing a configuration example of a computer system 500 according to the fourth embodiment of the present invention.

The computer system 500 includes a host 600 and an accelerator 700 connected to the host 600.

When the dynamic link device 10 or the like is included in the accelerator 700, the accelerator 700 loads a shared library into the memory of the accelerator 700 and links it from the calling program. The accelerator 700 also identifies the target of the shared library and switches the handling of shared library function calls from the calling program to the accelerator 700 or to the host 600. In other words, the accelerator 700 switches the function-call handling to the first information processing device or to the second information processing device. In this way, the accelerator 700 operates in the same way as the accelerator 22.

According to this embodiment, a program on the accelerator 700 can link a shared library of the host 600 without the programmer writing a stub. This is because the accelerator 700 operates in the same way as the accelerator 22 of the first embodiment.

When the dynamic link device 10 or the like is included in the host 600, the host 600 loads a shared library into the memory of the host 600 and links it from the calling program. The host 600 also identifies the target of the shared library and switches the handling of shared library function calls from the calling program. In this case, the host 600 implements the same functions as the dynamic link device 10 or the like.

Each of the embodiments described above is widely applicable to heterogeneous computer systems that support shared libraries, and to their operating systems, dynamic linkers, dynamic loaders, and the like.

The present invention has been described above using the embodiments, but the technical scope of the present invention is not limited to the description of those embodiments. It is obvious to those skilled in the art that various changes or improvements can be made to the embodiments, and forms incorporating such changes or improvements are also included in the technical scope of the present invention. The numerical values, component names, and the like used in the embodiments are examples and can be changed as appropriate. The directions of the arrows in the drawings are likewise examples and do not limit the directions of signals between blocks.

3 Memory space
5 Memory space
10 Dynamic link device
11 Loading unit
12 Platform switching unit
20 Computer system
21 Host
22 Accelerator
23 I/O interface
211-1 to 211-n Processor
212 Memory
213 Disk device
221 Processor
222 Memory
300 Dynamic link device
302 Loading unit
304 Platform switching unit
400 Dynamic load device
402 Loading unit
404 Platform switching unit
500 Computer system
600 Host
700 Accelerator

Claims

A dynamic link device comprising:
loading means for loading a shared library into the memory of its own device and linking it from a calling program; and
platform switching means for identifying the target of the shared library, and switching the function of calling functions of the shared library from the calling program to its own device or another device,
wherein, when the target of the shared library is the own device, the platform switching means completes relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table), and
sets the GOT to jump to the other device's stub.

A dynamic load device comprising:
loading means for loading a shared library into the memory of its own device and linking it from a calling program; and
platform switching means for identifying the target of the shared library, and switching the function of calling functions of the shared library from the calling program to its own device or another device,
wherein, when the target of the shared library is the own device, the platform switching means completes relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table), and
sets the GOT to jump to the other device's stub.

A computer system including a first information processing device and a second information processing device connected to the first information processing device,
wherein the first information processing device:
loads a shared library into the memory of the first information processing device and links it from a calling program;
identifies the target of the shared library, and switches the function of calling functions of the shared library from the calling program to the first information processing device or the second information processing device;
when the target of the shared library is the first information processing device, completes relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table); and
sets the GOT to jump to the stub of the second information processing device.

A dynamic link method comprising:
loading a shared library into the memory of an own device;
linking the shared library from a calling program;
identifying the target of the shared library;
switching the function of calling functions of the shared library from the calling program to the own device or another device;
when the target of the shared library is the own device, completing relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table); and
setting the GOT to jump to the other device's stub.

A dynamic load method comprising:
loading a shared library into the memory of an own device;
linking the shared library from a calling program;
identifying the target of the shared library;
switching the function of calling functions of the shared library from the calling program to the own device or another device;
when the target of the shared library is the own device, completing relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table); and
setting the GOT to jump to the other device's stub.

A dynamic link program that causes a computer to implement:
a function of loading a shared library into the memory of its own device and linking it from a calling program;
a function of identifying the target of the shared library, and switching the function of calling functions of the shared library from the calling program to its own device or another device;
a function of completing, when the target of the shared library is the own device, relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table); and
a function of setting the GOT to jump to the other device's stub.

A dynamic load program that causes a computer to implement:
a function of loading a shared library into the memory of its own device and linking it from a calling program;
a function of identifying the target of the shared library, and switching the function of calling functions of the shared library from the calling program to its own device or another device;
a function of completing, when the target of the shared library is the own device, relocation of the function-calling function of the shared library by executing initialization processing of the loaded shared library and setting a GOT (Global Offset Table); and
a function of setting the GOT to jump to the other device's stub.