JP2004078636A

JP2004078636A - Memory management system, memory arranging method and its program and recording medium

Info

Publication number: JP2004078636A
Application number: JP2002238938A
Authority: JP
Inventors: Takashi Ikebe; 池邉　隆; Shinji Tanaka; 田中　慎司
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2002-08-20
Filing date: 2002-08-20
Publication date: 2004-03-11

Abstract

<P>PROBLEM TO BE SOLVED: To improve the hit rates of data cache and TLB cache by collecting objects frequently referred to from one thread as a data taken into a data cache and TLB at the time of process execution. <P>SOLUTION: By providing a memory control part 10 dynamically controlling an arrangement destination of a memory of an object generated by a plurality of threads in the processes execution in a computer (calculator) and arranging frequently-used objects nearby, and assigning a unique area for every thread in a heap area of a logical memory space in the process, the object generated by one thread is continuously arranged on a memory or a memory device. <P>COPYRIGHT: (C)2004,JPO

Description

【０００１】
【発明の属する技術分野】
本発明は、計算機におけるプロセス実行時に生成されるオブジェクトの動的メモリ配置技術に係わり、例えば、キャッシュ性能の向上等を図るのに好適なメモリ管理技術に関するものである。
【０００２】
【従来の技術】
通常の計算機（コンピュータ）においては、プロセスの実行単位はスレッドであり、プロセス実行時に個々のスレッドより生成される個々のオブジェクトは、当該プロセス用のヒープ領域に、生成順に格納され、その後、スレッド単位でプロセスが実行される際、中央演算装置（ＣＰＵ）が前出のオブジェクトをヒープ領域から読み出して実行する。
【０００３】
この際、使用頻度の高いオブジェクトは、ヒープ領域よりもアクセス速度の速いデータキャッシュに保持し、実行速度を高速化することが従来行われている。
【０００４】
しかしながら、一般的に、データキャッシュのサイズは有限であり、データキャッシュに取り込まれるデータサイズは固定長であり、基本的に、データキャッシュに取り込まれるデータは、プロセスが使用するメモリの部分的な複製であり、ヒープ領域の部分的な複製でもある。従って、データキャッシュにおいても、オブジェクトはその生成順に保持された状態である。
【０００５】
このように、データキャッシュにおいては、１つのプロセスの各スレッドが生成した全てのオブジェクトを保持するものではない。そのため、中央演算装置がデータキャッシュから読み出したデータ内のオブジェクト全てが、１つのスレッドにおいて使用頻度の高いオブジェクトとは限らず、キャッシュミスが発生する。
【０００６】
また、一般的に、１つのプロセスで使用される論理メモリ空間は、そのプロセス内のみで有効であり、かつ、一般的に、プロセス毎に使用する論理メモリ空間を切り替えるため、論理メモリ空間と物理メモリ空間のマッピングが必要になる。
【０００７】
プロセスから見える論理メモリ空間のアドレス（仮想アドレス）から実際のメモリ・アドレス（物理アドレス）への対応表の中で頻繁に使用されるものを一時的に保存するバッファをＴＬＢ（Ｔｒａｎｓｌａｔｉｏｎ　Ｌｏｏｋ−ａｓｉｄｅ　Ｂｕｆｆｅｒ）といい、このＴＬＢを用いることにより、これら仮想アドレスと物理アドレスの変換を高速に実行できる。
【０００８】
ところが、前出の論理メモリ空間は、上述したヒープ領域を含んでおりオブジェクトが生成順にヒープ領域上に配置されていけば、ＴＬＢに保存された１つの変換に含まれるオブジェクトの全てが１つのスレッドで頻繁に参照されるとは限らない。
【０００９】
【発明が解決しようとする課題】
解決しようとする問題点は、従来の技術では、データキャッシュおよびＴＬＢに含まれるオブジェクトの全てが１つのスレッドで頻繁に使用されるとは限らない点である。
【００１０】
本発明の目的は、これら従来技術の課題を解決し、プロセス実行時にデータキャッシュおよびＴＬＢに取り込まれるデータ、すなわちヒープ領域に、１つのスレッドから頻繁に参照されるオブジェクトを集約させることで、データキャッシュおよびＴＬＢキャッシュのヒット率を向上させることである。
【００１１】
【課題を解決するための手段】
上記目的を達成するため、本発明では、計算機におけるプロセス実行時に複数スレッドが生成するオブジェクトのメモリ配置先を動的に制御し、使用頻度の高いオブジェクト同士を近傍に配置することを特徴とする。すなわち、プロセスにおける論理メモリ空間のヒープ領域に、スレッド毎の固有領域を割り当てることで、１つのスレッドが生成するオブジェクトを、メモリ、もしくは記憶装置上で連続して配置させる。このことにより、ヒープ領域内に生成されるオブジェクトがスレッド単位でまとまり、データキャッシュに取り込まれる単位データ内の複数のオブジェクトに、同一のスレッドのオブジェクトが入る可能性が高くなり、結果として、データキャッシュのヒット率が向上する。
【００１２】
【発明の実施の形態】
以下、本発明の実施の形態を、図面により詳細に説明する。
【００１３】
図１は、本発明に係わるメモリ管理システムの構成例を示すブロック図であり、図２は、図１におけるメモリ管理システムを実装したコンピュータのハードウェア構成例を示すブロック図である。
【００１４】
図２において、２１はＣＲＴ（Ｃａｔｈｏｄｅ　Ｒａｙ　Ｔｕｂｅ）やＬＣＤ（Ｌｉｑｕｉｄ　Ｃｒｙｓｔａｌ　Ｄｉｓｐｌａｙ）等からなる表示装置、２２はキーボードやマウス等からなる入力装置、２３はＨＤＤ（Ｈａｒｄ　Ｄｉｓｋ　Ｄｒｉｖｅ）等からなる外部記憶装置、２４はＣＰＵ（Ｃｅｎｔｒａｌ　Ｐｒｏｃｅｓｓｉｎｇ　Ｕｎｉｔ）２４ａや主メモリ２４ｂおよび入出力インタフェース２４ｃ等を具備してコンピュータ処理を行なう情報処理装置、２５は本発明に係わるプログラムやデータを記録したＣＤ−ＲＯＭ（Ｃｏｍｐａｃｔ　Ｄｉｓｃ−Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ）もしくはＤＶＤ（Ｄｉｇｉｔａｌ　Ｖｉｄｅｏ　Ｄｉｓｃ／Ｄｉｇｉｔａｌ　Ｖｅｒｓａｔｉｌｅ　Ｄｉｓｃ）等からなる光ディスク、２６は光ディスク２５に記録されたプログラムおよびデータを読み出すための駆動装置、２７はＬＡＮ（Ｌｏｃａｌ　Ａｒｅａ　Ｎｅｔｗｏｒｋ）カードやモデム等からなる通信装置である。
【００１５】
光ディスク２５に格納されたプログラムおよびデータを情報処理装置２４により駆動装置２６を介して外部記憶装置２３内にインストールした後、外部記憶装置２３から主メモリ２４ｂに読み込みＣＰＵ２４ａで処理することにより、情報処理装置２４内に図１に示すメモリ管理システムを含む各処理部が構成される。
【００１６】
図１において、１はメモリ管理システム、２はプロセス実行部であり、メモリ管理システム１は、領域割当部１１、マップ生成部１２、マップ情報記憶部１３オブジェクト配置部１４により構成されるメモリ管理部１０を有する。
【００１７】
このような構成により、本例のメモリ管理システム１では、コンピュータにおけるプロセス実行時に、複数スレッドが生成するオブジェクトをメモリに配置する際、スレッド単位でメモリ領域を割り当て、スレッドが生成したオブジェクトを、当該スレッドに割り当てたメモリ領域に配置する。
【００１８】
すなわち、領域割当部１１により、プロセス用のヒープ領域をスレッド単位で割り当て、マップ生成部１２において、スレッドに割り当てた領域と当該スレッドとの対応付け情報を生成し、マップ情報記憶部１３で、その情報をテーブル（スレッドヒープ領域マップ１５）に記憶し、オブジェクト配置部１４において、スレッドが生成するオブジェクトを、マップ情報記憶部１３でスレッドヒープ領域マップ１５に記憶している対応付け情報を参照して、当該スレッドに割り当てた領域に配置する。
【００１９】
スレッドヒープ領域マップ１５においては、各スレッドの特定に用いるスレッド識別情報が登録されるスレッドＩＤ１５ａと、各スレッドに割り当てたメモリ領域の特定に用いる領域情報１５ｂとの項目が設けられており、このスレッドヒープ領域マップ１５の内容に基づき、例えば、スレッドＡａが生成するオブジェクトは、アドレス情報Ａａで特定されるメモリ領域に配置される。
【００２０】
このように、本例では、メモリ管理部１０を新たに設けることにより、コンピュータのプロセス実行過程において、プロセスの実行単位であるスレッドによるオブジェクトの生成時に、スレッド毎にオブジェクトの生成場所を動的に制御することで、使用頻度の高いオブジェクト同士を近傍に配置する。
【００２１】
ここでの動的な制御は、メモリ管理部１０において、当該プロセスにおける論理メモリ空間のヒープ領域に、スレッド毎の固有領域を割り当てることで行われる。また、オブジェクトの近傍配置とは、メモリ、もしくはデータキャッシュやＴＬＢ（Ｔｒａｎｓｌａｔｉｏｎ　Ｌｏｏｋ−ａｓｉｄｅ　Ｂｕｆｆｅｒ）等の記憶装置上でオブジェクトが連続して配置されていることを意味する。
【００２２】
このオブジェクトの近傍配置により、ヒープ領域内に生成されるオブジェクトがスレッド単位でまとまり、例えばデータキャッシュに取り込まれる単位データ内の複数のオブジェクトに、同一のスレッドのオブジェクトが入る可能性が高くなる。
【００２３】
プロセス実行時には複数のスレッドが実行されるが、一般的に、データキャッシュは１つのスレッドに対してのみ有効であり、スレッド切り替えが生じればデータキャッシュの有効性はなく、有効なデータキャッシュが再構築される。
【００２４】
上述したように、本例では、データキャッシュの中に同一スレッドのオブジェクトが入る可能性が高く、結果として、従来の技術よりもデータキャッシュのヒット率が向上する。
【００２５】
同様に、ＴＬＢキャッシュにおいても、１スレッドに使用されるオブジェクトが同一ページ上に存在する可能性が高く、それらのページをＴＬＢキャッシュに取り込むことで、同一スレッド実行時におけるＴＬＢキャッシュヒット率が向上する。
【００２６】
さらに、スレッド切り替え時には、常に一定数のページファイルの切り替えが要求されるが、このページファイル切り替え数は、オブジェクトが生成順にヒープ領域に格納されている場合よりも、スレッド数が十分多い場合には、少なくなるので、大規模アプリケーションに対して、本例が極めて有効であることがわかる。
【００２７】
以下、図３から図５を用いて、本例のメモリ管理システムの処理動作を説明する。
【００２８】
図３は、図１におけるメモリ管理システムによるメモリ配置処理例を示すフローチャートであり、図４は、図１におけるメモリ管理システムの第１の処理動作例を示す説明図、図５は、図１におけるメモリ管理システムの第２の処理動作例を示す説明図である。
【００２９】
図３に示すように、プロセスＡにおけるスレッドＡａの生成要求があれば（ステップ３０１）、図１におけるメモリ管理システム１は、当該スレッドＡａに、プロセスＡ用のヒープ領域における領域Ａａを割り当て（ステップ３０２）、その割当結果を、例えば、図１におけるスレッドヒープ領域マップ１５に示すように、各スレッドの特定に用いるスレッド識別情報（スレッドＩＤ：Ａａ）と、各スレッドに割り当てたメモリ領域の特定に用いる領域情報（アドレスＡａ）とを対応付け、記憶装置に記憶する（ステップ３０３）。
【００３０】
その後、スレッドＡａによるオブジェクトの生成要求があれば（ステップ３０４）、図１におけるスレッドヒープ領域マップ１５を参照して、当該スレッドＡａに割り当てた領域Ａａを特定し、この領域Ａａに、当該オブジェクトを配置する（ステップ３０５）。
【００３１】
次の図４および図５の処理においては、「ＪＡＶＡ（登録商標）　ＶＭ」における例を説明する。
【００３２】
「ＪＡＶＡ（登録商標）　ＶＭ」ではヒープ領域管理技術として、「Ｇａｒｂａｇｅ　Ｃｏｌｌｅｃｔｉｏｎ」（ＧＣ）を使用しており、本例では、ＧＣの技術として「Ｇｅｎｅｒａｔｉｏｎ　ＧＣ」を使用した場合について述べる。
【００３３】
「Ｇｅｎｅｒａｔｉｏｎ　ＧＣ」では図４に示すように、ヒープ領域は大きく「Ｎｅｗ　Ｇｅｎｅｒａｔｉｏｎ」、「Ｏｌｄ　Ｇｅｎｅｒａｔｉｏｎ」、「Ｐｅｒｍａｎｅｎｔ　Ｇｅｎｅｒａｔｉｏｎ」という３つの領域に分けられ、その中でも更に「Ｎｅｗ　Ｇｅｎｅｒａｔｉｏｎ」は「Ｅｄｅｎ」と「Ｓｕｒｖｉｖｏｒ」という２つの領域に分けられる。
【００３４】
上記構造において新規にオブジェクトが生成される場合には常に、「Ｅｄｅｎ」領域からオブジェクトが生成される。
【００３５】
本例ではメモリ管理部１０を新たに設け、プロセス実行時にスレッド生成要求が生じた場合、そのスレッド生成要求をメモリ管理部１０が取得し、メモリ管理部１０はプロセスのヒープ領域をスレッドヒープ領域マップ１５で管理する。
【００３６】
メモリ管理部１０は、スレッド毎に一定ヒープ領域を「ＧｅｎｅｒａｔｉｏｎＧＣ」に使用される複数ジェネレーションにわたって確保を行い、その領域をスレッドに対して通知を行う。
【００３７】
尚、この際に生成されるスレッド毎のヒープ領域は、事前に静的に決定されているものであり、ユーザからの要求によって任意の領域を割り当てることが可能である。
【００３８】
例えば、プロセスを実行中にスレッドＡａの生成要求が生じれば（図４中（１））、メモリ管理部１０は、その通知を取得し、メモリ管理部１０が持つスレッドヒープ領域マップ１５から新たなスレッド用エリアを判断し（図４中（２））、実ヒープ領域の確保を行う（図４中（３））。その情報を基にスレッドＡａが作成される（図４中（４），（５），（６））。
【００３９】
図５においては、スレッド実行時のオブジェクト生成例を示しており、オブジェクト生成要求が生じた場合、生成要求をメモリ管理部１０に通知し、メモリ管理部１０は、スレッドヒープ領域マップ１５に従って、要求されたオブジェクトを「Ｎｅｗ　Ｇｅｎｅｒａｔｉｏｎ」の「Ｅｄｅｎ」領域に生成する。
【００４０】
例えば、プロセス実行中にスレッドＡａに処理が移った場合（図５中（１））、スレッドＡを実行するに当たってオブジェクトの生成要求はメモリ管理部１０に通知され（図５中（２））、メモリ管理部１０がそのスレッドＡａに割り当てられたヒープ領域（「Ｅｄｅｎ」領域）に、要求されたオブジェクトを生成する（図５中（３））。
【００４１】
メモリ管理部１０は、生成されたヒープ領域のアドレスを返り値としてスレッドＡａに通知する。
【００４２】
以上、図１〜図５を用いて説明したように、本例では、コンピュータ（計算機）におけるプロセス実行時に複数スレッドが生成するオブジェクトのメモリ配置先を動的に制御し、使用頻度の高いオブジェクト同士を近傍に配置する。すなわち、プロセスにおける論理メモリ空間のヒープ領域に、スレッド毎の固有領域を割り当てることで、１つのスレッドが生成するオブジェクトを、メモリ、もしくは記憶装置上で連続して配置させる。
【００４３】
このことにより、ヒープ領域内に生成されるオブジェクトがスレッド単位でまとまり、使用頻度の高いオブジェクト同士が近傍に配置されることにより、例えばデータキャッシュに取り込まれる単位データ内の複数のオブジェクトに、同一のスレッドのオブジェクトが入る可能性が高くなり、結果として、データキャッシュのヒット率が向上する。
【００４４】
さらに、１つのプロセス内でスレッド数が十分多い場合には、使用頻度の高いオブジェクト同士が近傍に配置されることにより、同一ページ内に配置される可能性が高くなり、その結果としてＴＬＢのヒット率を向上させることができる。
【００４５】
尚、本発明は、図１〜図５を用いて説明した例に限定されるものではなく、その要旨を逸脱しない範囲において種々変更可能である。例えば、本例では、「ＪＡＶＡ（登録商標）　ＶＭ」を例として説明したが、「ＪＡＶＡ（登録商標）」以外の他のプログラム言語で動作するコンピュータにおいても適用可能である。
【００４６】
また、各スレッドに割り当てるヒープ領域（スレッド割り当てヒープ）のサイズに関しては、全てのスレッドに対して同じ大きさにすることでも、それぞれ異なるように設定しても良い。
【００４７】
また、本例では、コンピュータ（計算機）の構成として図２の構成例を示したが、キーボードや光ディスクの駆動装置の無いコンピュータ構成としても良い。また、本例では、光ディスクを記録媒体として用いているが、ＦＤ（Ｆｌｅｘｉｂｌｅ　Ｄｉｓｋ）等を記録媒体として用いることでも良い。また、プログラムのインストールに関しても、通信装置を介してネットワーク経由でプログラムをダウンロードしてインストールすることでも良い。
【００４８】
【発明の効果】
本発明によれば、使用頻度の高いオブジェクト同士が近傍に配置され、このことにより、例えばデータキャッシュのデータ取り込み時に１つのスレッドからの使用頻度の高いオブジェクトが取り込まれ、その結果としてデータキャッシュのヒット率を向上させることができ、かつ、１つのプロセス内でスレッド数が十分多い場合には、使用頻度の高いオブジェクト同士が近傍に配置されることにより、同一ページ内に配置される可能性が高くなり、その結果としてＴＬＢのヒット率を向上させることが可能である。
【図面の簡単な説明】
【図１】本発明に係わるメモリ管理システムの構成例を示すブロック図である。
【図２】図１におけるメモリ管理システムを実装したコンピュータのハードウェア構成例を示すブロック図である。
【図３】図１におけるメモリ管理システムによるメモリ配置処理例を示すフローチャートである。
【図４】図１におけるメモリ管理システムの第１の処理動作例を示す説明図である。
【図５】図１におけるメモリ管理システムの第２の処理動作例を示す説明図である。
【符号の説明】
１：メモリ管理システム、２：プロセス実行部、１０：メモリ管理部、１１：領域割当部、１２：マップ生成部、１３：マップ情報記憶部、１４オブジェクト配置部、１５：スレッドヒープ領域マップ、１５ａ：スレッドＩＤ、１５ｂ：領域情報、２１：表示装置、２２：入力装置、２３：外部記憶装置、２４：情報処理装置、２４ａ：ＣＰＵ、２４ｂ：主メモリ、２４ｃ：入出力インタフェース、２５：光ディスク、２６：駆動装置、２７：通信装置。[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a technique for dynamically allocating objects generated when a process is executed in a computer, and more particularly to a memory management technique suitable for improving cache performance and the like.
[0002]
[Prior art]
In a normal computer (computer), the execution unit of a process is a thread, and individual objects generated by the individual threads when the process is executed are stored in the heap area for the process in the order of generation, and thereafter, the thread unit When the process is executed by the CPU, the central processing unit (CPU) reads the above-mentioned object from the heap area and executes it.
[0003]
At this time, conventionally, frequently used objects are stored in a data cache having an access speed higher than that of a heap area, thereby increasing the execution speed.
[0004]
However, in general, the size of the data cache is finite, the size of the data taken into the data cache is fixed, and basically, the data taken into the data cache is a partial copy of the memory used by the process. And is also a partial copy of the heap area. Therefore, even in the data cache, the objects are held in the order of generation.
[0005]
As described above, the data cache does not hold all the objects generated by each thread of one process. Therefore, all objects in the data read from the data cache by the central processing unit are not necessarily objects frequently used in one thread, and a cache miss occurs.
[0006]
In general, the logical memory space used by one process is effective only within that process, and generally, the logical memory space used for each process is switched. Memory space mapping is required.
[0007]
A buffer for temporarily storing frequently used ones in a correspondence table from an address (virtual address) of a logical memory space visible to a process to an actual memory address (physical address) is provided in a TLB (Translation Look-Aside Buffer). ), The use of this TLB enables high-speed conversion between these virtual addresses and physical addresses.
[0008]
However, the above-described logical memory space includes the above-described heap area, and if objects are arranged on the heap area in the order of generation, all of the objects included in one conversion stored in the TLB are one thread. Is not always referred to.
[0009]
[Problems to be solved by the invention]
The problem to be solved is that in the related art, not all objects included in the data cache and TLB are frequently used by one thread.
[0010]
SUMMARY OF THE INVENTION An object of the present invention is to solve the problems of the prior art, and to collect data frequently fetched from one thread into a data fetched into a data cache and a TLB at the time of executing a process, that is, a heap area. And to improve the hit rate of the TLB cache.
[0011]
[Means for Solving the Problems]
In order to achieve the above object, the present invention is characterized in that a memory allocation destination of an object generated by a plurality of threads when a process is executed in a computer is dynamically controlled, and frequently used objects are arranged near each other. That is, by allocating a unique area for each thread to a heap area of a logical memory space in a process, objects generated by one thread are continuously arranged on a memory or a storage device. As a result, the objects generated in the heap area are grouped in units of threads, and the possibility that an object of the same thread is included in a plurality of objects in the unit data taken into the data cache is increased. Hit rate is improved.
[0012]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
[0013]
FIG. 1 is a block diagram showing a configuration example of a memory management system according to the present invention, and FIG. 2 is a block diagram showing a hardware configuration example of a computer on which the memory management system in FIG. 1 is mounted.
[0014]
2, reference numeral 21 denotes a display device such as a CRT (Cathode Ray Tube) or an LCD (Liquid Crystal Display); 22, an input device including a keyboard and a mouse; and 23, an external storage device including a HDD (Hard Disk Drive). , 24 are an information processing apparatus having a CPU (Central Processing Unit) 24a, a main memory 24b, an input / output interface 24c and the like for performing computer processing, and 25 is a CD-ROM (Compact Disc) recording programs and data according to the present invention. -An optical disk such as a read only memory (DVD) or a DVD (Digital Video Disc / Digital Versatile Disc); Programs and data drive for reading, 27 is a communication device comprising a LAN (Local Area Network) card or a modem.
[0015]
After the programs and data stored in the optical disk 25 are installed in the external storage device 23 by the information processing device 24 via the driving device 26, the programs and data are read from the external storage device 23 into the main memory 24b, and processed by the CPU 24a. Each processing unit including the memory management system shown in FIG.
[0016]
In FIG. 1, 1 is a memory management system, 2 is a process execution unit, and the memory management system 1 is a memory management unit including an area allocation unit 11, a map generation unit 12, a map information storage unit 13, and an object arrangement unit 14. With 10.
[0017]
With such a configuration, in the memory management system 1 of the present example, when an object generated by a plurality of threads is arranged in the memory at the time of executing a process in the computer, a memory area is allocated in units of threads, and the object generated by the thread Allocate in the memory area allocated to the thread.
[0018]
That is, the process allocating unit 11 allocates a process heap region in units of threads, the map generation unit 12 generates association information between the region allocated to the thread and the thread, and the map information storage unit 13 stores the association information. The information is stored in a table (thread heap area map 15), and the object placement unit 14 refers to the object generated by the thread with reference to the association information stored in the thread heap area map 15 in the map information storage unit 13. , In the area allocated to the thread.
[0019]
In the thread heap area map 15, items of a thread ID 15a in which thread identification information used to specify each thread is registered and area information 15b used to specify a memory area allocated to each thread are provided. Based on the contents of the heap area map 15, for example, an object generated by the thread Aa is arranged in a memory area specified by the address information Aa.
[0020]
As described above, in the present example, by newly providing the memory management unit 10, in the process of executing a process of a computer, when an object is generated by a thread which is a unit of execution of a process, an object generation location is dynamically changed for each thread. By controlling, objects that are frequently used are arranged close to each other.
[0021]
The dynamic control here is performed by the memory management unit 10 allocating a unique area for each thread to a heap area of the logical memory space in the process. In addition, the vicinity arrangement of objects means that the objects are continuously arranged on a memory or a storage device such as a data cache or a TLB (Translation Look-aside Buffer).
[0022]
Due to the close arrangement of the objects, the objects generated in the heap area are grouped in units of threads, and for example, there is a high possibility that an object of the same thread enters a plurality of objects in the unit data taken into the data cache.
[0023]
When a process is executed, a plurality of threads are executed. Generally, however, the data cache is valid for only one thread, and if a thread switch occurs, the data cache is not valid, and the valid data cache is restored. Be built.
[0024]
As described above, in this example, there is a high possibility that an object of the same thread enters the data cache, and as a result, the hit ratio of the data cache is improved as compared with the related art.
[0025]
Similarly, in the TLB cache, there is a high possibility that an object used for one thread exists on the same page, and by taking those pages into the TLB cache, the TLB cache hit rate during execution of the same thread is improved. .
[0026]
Furthermore, when switching threads, switching of a certain number of page files is always required.However, this number of page files is switched when the number of threads is sufficiently larger than when the objects are stored in the heap area in the order of generation. It can be seen that this example is extremely effective for large-scale applications.
[0027]
Hereinafter, the processing operation of the memory management system of this example will be described with reference to FIGS.
[0028]
FIG. 3 is a flowchart showing an example of a memory arrangement process by the memory management system in FIG. 1, FIG. 4 is an explanatory diagram showing a first processing operation example of the memory management system in FIG. 1, and FIG. FIG. 13 is an explanatory diagram illustrating a second processing operation example of the memory management system.
[0029]
As shown in FIG. 3, when there is a request to create a thread Aa in the process A (step 301), the memory management system 1 in FIG. 1 allocates an area Aa in the heap area for the process A to the thread Aa (step 301). 302), for example, as shown in the thread heap area map 15 in FIG. 1, the thread identification information (thread ID: Aa) used for specifying each thread and the memory area allocated to each thread are specified. The area information (address Aa) to be used is associated with and stored in the storage device (step 303).
[0030]
Thereafter, when there is a request to generate an object by the thread Aa (step 304), the area Aa assigned to the thread Aa is specified with reference to the thread heap area map 15 in FIG. 1, and the object is stored in this area Aa. It is arranged (step 305).
[0031]
In the following processing in FIGS. 4 and 5, an example in “JAVA (registered trademark) VM” will be described.
[0032]
“JAVA (registered trademark) VM” uses “Garbage Collection” (GC) as a heap area management technique. In this example, a case where “Generation GC” is used as a GC technique will be described.
[0033]
As shown in FIG. 4, in the “Generation GC”, the heap area is largely divided into three areas “New Generation”, “Old Generation”, and “Permanent Generation”, and among them, “New Generation” is further referred to as “Eden”. It is divided into two areas called “Survivor”.
[0034]
Whenever a new object is created in the above structure, an object is created from the “Eden” area.
[0035]
In this example, a memory management unit 10 is newly provided, and when a thread generation request is generated during the execution of a process, the memory management unit 10 acquires the thread generation request, and the memory management unit 10 maps a heap area of the process to a thread heap area map. It manages at 15.
[0036]
The memory management unit 10 secures a constant heap area for each thread over a plurality of generations used for “Generation GC”, and notifies the thread of the area.
[0037]
Note that the heap area for each thread generated at this time is statically determined in advance, and an arbitrary area can be allocated according to a request from the user.
[0038]
For example, if a request to create a thread Aa occurs during the execution of the process ((1) in FIG. 4), the memory management unit 10 obtains the notification and newly obtains the notification from the thread heap area map 15 of the memory management unit 10. An appropriate heap area is determined ((2) in FIG. 4), and a real heap area is secured ((3) in FIG. 4). A thread Aa is created based on the information ((4), (5), (6) in FIG. 4).
[0039]
FIG. 5 shows an example of object generation at the time of thread execution. When an object generation request occurs, the generation request is notified to the memory management unit 10, and the memory management unit 10 requests the object according to the thread heap area map 15. The generated object is generated in the “Eden” area of the “New Generation”.
[0040]
For example, when the process is transferred to the thread Aa during the execution of the process ((1) in FIG. 5), a request to create an object is notified to the memory management unit 10 when executing the thread A ((2) in FIG. 5). The memory management unit 10 generates the requested object in the heap area (“Eden” area) allocated to the thread Aa ((3) in FIG. 5).
[0041]
The memory management unit 10 notifies the thread Aa of the address of the generated heap area as a return value.
[0042]
As described above with reference to FIGS. 1 to 5, in the present example, the memory allocation destination of the objects generated by a plurality of threads when a process is executed in a computer (computer) is dynamically controlled, and objects frequently used are Is placed in the vicinity. That is, by allocating a unique area for each thread to a heap area of a logical memory space in a process, objects generated by one thread are continuously arranged on a memory or a storage device.
[0043]
As a result, objects generated in the heap area are grouped in units of threads, and frequently used objects are arranged close to each other, so that, for example, the same object is assigned to a plurality of objects in unit data taken into the data cache. The likelihood of thread objects entering is increased, resulting in an improved data cache hit rate.
[0044]
Furthermore, if the number of threads in one process is sufficiently large, the objects that are frequently used are arranged close to each other, so that the possibility that the objects are arranged in the same page increases. Rate can be improved.
[0045]
The present invention is not limited to the examples described with reference to FIGS. 1 to 5 and can be variously modified without departing from the gist thereof. For example, in this example, “JAVA (registered trademark) VM” has been described as an example, but the present invention is also applicable to a computer that operates in a program language other than “JAVA (registered trademark)”.
[0046]
Further, the size of the heap area (thread allocation heap) allocated to each thread may be the same for all threads or may be set differently.
[0047]
Further, in this example, the configuration example of FIG. 2 is shown as a configuration of a computer (computer), but a computer configuration without a keyboard or a drive device of an optical disk may be used. In this example, the optical disk is used as the recording medium, but an FD (Flexible Disk) or the like may be used as the recording medium. As for the installation of the program, the program may be downloaded and installed via a network via a communication device.
[0048]
【The invention's effect】
According to the present invention, frequently used objects are arranged close to each other, whereby, for example, a frequently used object from one thread is fetched when data is fetched from the data cache, and as a result, a hit in the data cache occurs. When the ratio can be improved and the number of threads in one process is sufficiently large, the objects that are frequently used are arranged close to each other, so that there is a high possibility that the objects are arranged in the same page. As a result, it is possible to improve the TLB hit rate.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating a configuration example of a memory management system according to the present invention.
FIG. 2 is a block diagram illustrating a hardware configuration example of a computer on which the memory management system in FIG. 1 is mounted.
FIG. 3 is a flowchart illustrating an example of a memory allocation process by the memory management system in FIG. 1;
FIG. 4 is an explanatory diagram showing a first processing operation example of the memory management system in FIG. 1;
FIG. 5 is an explanatory diagram showing a second processing operation example of the memory management system in FIG. 1;
[Explanation of symbols]
1: memory management system, 2: process execution unit, 10: memory management unit, 11: area allocation unit, 12: map generation unit, 13: map information storage unit, 14 object arrangement unit, 15: thread heap area map, 15a : Thread ID, 15b: area information, 21: display device, 22: input device, 23: external storage device, 24: information processing device, 24a: CPU, 24b: main memory, 24c: input / output interface, 25: optical disk, 26: drive device, 27: communication device.

Claims

A memory management system for controlling memory allocation of objects generated by a plurality of threads when a process of a computer is executed,
A memory management system, comprising: a memory management unit that allocates a memory area in units of threads, and arranges an object generated by the thread in the memory area allocated to the thread.

A memory management system for controlling memory allocation of objects generated by a plurality of threads when a process of a computer is executed,
Allocating means for allocating a heap area for the process in units of threads,
Storage means for storing information associated with an area allocated to a thread and the thread;
Means for arranging an object generated by a thread in an area allocated to the thread with reference to the association information stored in the storage means.

A memory allocation method of an object generated by a plurality of threads when a process is executed in a computer,
A procedure for allocating a memory area for each thread;
Allocating an object generated by a thread to a memory area allocated to the thread.

A memory allocation method of an object generated by a plurality of threads when a process is executed in a computer,
A procedure for allocating a heap area for the above process in units of threads,
Storing in a storage device association information between an area allocated to a thread and the thread;
Allocating an object generated by a thread to an area allocated to the thread with reference to the association information stored in the storage device.

A program for causing a computer to execute each procedure in the memory arrangement method according to claim 3.

A computer-readable recording medium having recorded thereon a program for causing a computer to execute each procedure in the memory arrangement method according to claim 3.