JP2009199367A

JP2009199367A - Computer system, i/o scheduler and i/o scheduling method

Info

Publication number: JP2009199367A
Application number: JP2008040633A
Authority: JP
Inventors: Teruyuki Baba; 輝幸馬場
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2008-02-21
Filing date: 2008-02-21
Publication date: 2009-09-03

Abstract

PROBLEM TO BE SOLVED: To increase a throughput under a condition that all processes execute disk access at least once within a prescribed time. SOLUTION: An I/O scheduler performs the scheduling of access requests from a plurality of processes. The scheduling includes a step (A) for reading process characteristic information indicating an access target file and file position information, a step (B) for storing an access request in a wait queue, a step (C) for determining the order of wait queues so that the access target file is accessed in the arrangement order on a disk while referring to the process characteristic information and the file position information, a step (D) for determining the number of access requests to be taken out from respective wait queues so that all processes execute disk access at least once within the prescribed time, and a step (E) for acquiring the access requests from the wait queues and sending the acquired access requests to the disk in accordance with the determined order and number of access requests. COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、計算機システムに関する。特に、本発明は、計算機システムにおいて、ディスクへのアクセス要求をスケジューリングするＩ／Ｏスケジューリング技術に関する。 The present invention relates to a computer system. In particular, the present invention relates to an I / O scheduling technique for scheduling a disk access request in a computer system.

計算機システムにおいて、ＣＰＵが実行する多くのプロセスは、ディスクへのアクセス処理（データ読み込み、データ書き込み）を必要とする。従って、ディスクアクセス処理を高速化することは、計算機システムの高速化のために重要な課題の一つである。ディスクアクセス処理を高速化するためには、多くのプロセスからのディスクアクセス要求をそのままディスクに渡すのではなく、ディスクへ送られるディスクアクセス要求の順番を適切に調整することが重要である。そのような調整（調停）を行うことは、「スケジューリング」と呼ばれる。複数のプロセスからのディスクアクセス要求のスケジューリングを行うのが、ＯＳ内の「Ｉ／Ｏスケジューラ」である。 In a computer system, many processes executed by a CPU require a disk access process (data reading, data writing). Therefore, speeding up the disk access processing is one of the important issues for speeding up the computer system. In order to increase the speed of the disk access processing, it is important to appropriately adjust the order of the disk access requests sent to the disk, instead of directly passing the disk access requests from many processes to the disk. Performing such adjustment (arbitration) is called “scheduling”. The “I / O scheduler” in the OS performs scheduling of disk access requests from a plurality of processes.

特許文献１には、複数のコンピュータで共有される外部記憶装置が開示されている。その外部記憶装置は、データを格納する複数の記憶ディスク部と、外部記憶制御部とを備える。記憶ディスク部は、データに対する読み出し／書き込みコマンドをキューイング可能である。外部記憶制御部は、コンピュータからのコマンドを受入キューに蓄え記憶ディスク部に順次発行することによって、コンピュータと記憶ディスク部との間のデータ入出力処理を制御する。より詳細には、外部記憶制御部は、予測処理時間生成手段と、送出キューと、処理時間予測手段と、コマンドキューイング手段と、コマンドバッチ生成手段とを含む。予測処理時間生成手段は、コマンドの処理に要すると予測される時間であるコマンド処理時間予測値を生成する。送出キューは、コマンドの待ち行列である。送出キューには、コマンド処理時間予測値の和が所定の処理タイムスライスに応じた値となる個数だけ、受入キュー内のコマンドが所定のタイミングで格納される。処理時間予測手段は、予測処理時間生成手段に基づいて、各記憶ディスク部別の処理時間を予測する。コマンドキューイング手段は、予測された処理時間が最大となる記憶ディスク部に対するコマンドを送出キューから取り出して、対応する記憶ディスク部にキューイングする。コマンドバッチ生成手段は、各送出キューが空になると、受入キュー内のコマンドを送出キューに格納する。 Patent Document 1 discloses an external storage device shared by a plurality of computers. The external storage device includes a plurality of storage disk units for storing data and an external storage control unit. The storage disk unit can queue read / write commands for data. The external storage control unit controls data input / output processing between the computer and the storage disk unit by storing commands from the computer in an acceptance queue and sequentially issuing the commands to the storage disk unit. More specifically, the external storage control unit includes a predicted processing time generation unit, a sending queue, a processing time prediction unit, a command queuing unit, and a command batch generation unit. The predicted processing time generation unit generates a command processing time predicted value that is a predicted time required for command processing. The send queue is a queue of commands. In the sending queue, the commands in the receiving queue are stored at a predetermined timing as many times as the sum of the command processing time predicted values becomes a value corresponding to a predetermined processing time slice. The processing time prediction means predicts the processing time for each storage disk unit based on the predicted processing time generation means. The command queuing means takes out a command for the storage disk unit having the maximum estimated processing time from the transmission queue and queues the command to the corresponding storage disk unit. The command batch generation means stores the commands in the receiving queue in the sending queue when each sending queue becomes empty.

特開平９−２５８９０７号公報JP-A-9-258907

近年、ＣＰＵ性能は急速に向上しているが、ディスク性能はＣＰＵほど急速には向上していない。そのため、ディスクアクセス要求の発行からレスポンスを受け取るまでの待ち時間が非常に長くなるプロセスやＣＰＵが発生し得る。待ち時間が長くなったり、頻繁に待ち状態が発生したりすると、システム全体の効率が低下してしまう。従って、Ｉ／Ｏスケジューラは、待ち時間が短くなるように、複数のプロセスからのディスクアクセス要求をスケジューリングすることが好適である。 In recent years, CPU performance has improved rapidly, but disk performance has not improved as rapidly as CPU. For this reason, there may be a process or CPU in which the waiting time from when a disk access request is issued until a response is received becomes very long. If the waiting time becomes long or the waiting state frequently occurs, the efficiency of the entire system is lowered. Therefore, it is preferable that the I / O scheduler schedules disk access requests from a plurality of processes so that the waiting time is shortened.

また、ディスクアクセス処理を高速化するために、ディスクとＣＰＵとの間の単位時間あたりのデータ転送速度であるディスク帯域幅（スループット）を向上させることも重要である。よって、Ｉ／Ｏスケジューラは、スループットが向上するように、複数のプロセスからのディスクアクセス要求をスケジューリングすることが好適である。 It is also important to improve the disk bandwidth (throughput), which is the data transfer rate per unit time between the disk and the CPU, in order to speed up the disk access process. Therefore, it is preferable that the I / O scheduler schedules disk access requests from a plurality of processes so as to improve the throughput.

しかしながら、一般的に、「待ち時間の短縮」と「スループットの向上」とは互いに相反する関係にある。そのことを、図１で示される例を参照して説明する。 However, in general, “reduction of waiting time” and “improvement of throughput” are in a mutually contradictory relationship. This will be described with reference to the example shown in FIG.

図１において、プロセスＰ１〜Ｐ４が、ディスクに記録されているファイルＦ１〜Ｆ４のそれぞれへアクセスを行うとする。Ｉ／Ｏスケジューラは、プロセスＰ１〜Ｐ４のそれぞれから発行されるディスクアクセス要求をどのような順番でディスクへ送るかを決定する。より詳細には、Ｉ／Ｏスケジューラは、ディスクアクセス要求を一時的に保持しておく待ちキューＱ１〜Ｑ４を有している。待ちキューはプロセス毎に割り当てられており、図１の例では、待ちキューＱ１〜Ｑ４がそれぞれプロセスＰ１〜Ｐ４に割り当てられている。つまり、プロセスＰ１〜Ｐ４からのディスクアクセス要求は、待ちキューＱ１〜Ｑ４のそれぞれに一旦格納される。リクエスト選択部は、“所定の規則”に従って、待ちキューＱ１〜Ｑ４からディスクアクセス要求を取り出し、取り出したものを順次ディスクへ送る。 In FIG. 1, it is assumed that processes P1 to P4 access files F1 to F4 recorded on the disc. The I / O scheduler determines in what order the disk access requests issued from each of the processes P1 to P4 are sent to the disk. More specifically, the I / O scheduler has wait queues Q1 to Q4 that temporarily hold disk access requests. The wait queue is assigned for each process. In the example of FIG. 1, the wait queues Q1 to Q4 are assigned to the processes P1 to P4, respectively. That is, disk access requests from the processes P1 to P4 are temporarily stored in the waiting queues Q1 to Q4. The request selection unit takes out the disk access requests from the waiting queues Q1 to Q4 according to the “predetermined rule”, and sequentially sends the taken out requests to the disk.

ここで、その所定の規則としては、（１）それぞれの待ちキューからディスクアクセス要求を１個ずつ順番に取り出す、あるいは、（２）１つの待ちキューに格納されている全てのディスクアクセス要求を取り出した後、次の待ちキューへ移る、が考えられる。これら２つの規則はそれぞれ長所と短所を有している。 Here, as the predetermined rule, (1) one disk access request is sequentially taken out from each waiting queue, or (2) all disk access requests stored in one waiting queue are taken out. After that, it is possible to move to the next waiting queue. Each of these two rules has advantages and disadvantages.

規則（１）の場合、リクエスト選択部は、ある待ちキューから１個のディスクアクセス要求を取り出した後、すぐに次の待ちキューへ移る。その結果、あるプロセスがディスク上のあるファイルにアクセスした後、すぐに別のプロセスがディスク上の別のファイルにアクセスすることになる。従って、それぞれのプロセスが短い待ち時間で次々とディスクアクセスすることが可能となる。待ち時間が極端に長くなるプロセスが無いので、タイムアウトは発生せず、また、各プロセスの処理効率が向上する。その一方で、ディスク上のアクセス対象のファイルが次々と切り換わるため、ディスクヘッドは頻繁に移動する必要があり、ファイル間のシークに時間がとられてしまう。ファイル間のシーク時間が増大すると、ある一定の期間のうちに各ファイルから読み出せる（あるいは書き込める）データ量が必然的に少なくなってしまう。このことは、単位時間あたりのデータ伝送速度であるスループットが低下することを意味する。このように、規則（１）の場合、待ち時間は短縮されるが、スループットが低下する。 In the case of rule (1), the request selection unit takes out one disk access request from a certain waiting queue and immediately moves to the next waiting queue. As a result, after one process accesses a file on the disk, another process immediately accesses another file on the disk. Therefore, each process can access the disk one after another with a short waiting time. Since no process has an extremely long waiting time, no timeout occurs and the processing efficiency of each process is improved. On the other hand, since the files to be accessed on the disk are switched one after another, it is necessary to move the disk head frequently, and it takes time to seek between the files. When the seek time between files increases, the amount of data that can be read (or written) from each file within a certain period of time inevitably decreases. This means that the throughput, which is the data transmission rate per unit time, decreases. Thus, in the case of rule (1), the waiting time is shortened, but the throughput is reduced.

規則（２）の場合、リクエスト選択部は、１つの待ちキューに格納されている全てのディスクアクセス要求を取り出した後、次の待ちキューへ移る。そのため、アクセス対象のファイルが頻繁に切り換わることがない。また、一般的に、同一プロセスがアクセスするデータはディスク上の近い位置に存在している。従って、ディスクヘッドのシーク時間が短くなり、その分、単位時間あたりのデータ転送速度であるスループットが増加する。その一方で、あるプロセスからの全てのディスクアクセス要求の処理が終了するまで、次のプロセスは待機する必要がある。つまり、後のプロセスほど待ち時間は長くなり、タイムアウトが発生する可能性が高くなる。この傾向は、同時に動作するプロセスの数が多くなるほど、また、１つのプロセスが発行するディスクアクセス要求の数が多くなるほど、強くなる。このように、規則（２）の場合、スループットは向上するが、待ち時間が長くなる。 In the case of rule (2), the request selection unit extracts all disk access requests stored in one wait queue, and then moves to the next wait queue. Therefore, the file to be accessed does not switch frequently. In general, data accessed by the same process exists at a close position on the disk. Accordingly, the seek time of the disk head is shortened, and the throughput, which is the data transfer rate per unit time, is increased accordingly. On the other hand, the next process needs to wait until processing of all disk access requests from a process is completed. In other words, the later process has a longer waiting time and the possibility of a timeout occurring. This tendency becomes stronger as the number of processes operating simultaneously increases and as the number of disk access requests issued by one process increases. Thus, in the case of rule (2), the throughput is improved, but the waiting time is increased.

本願発明者は次の点に着目した。実際のシステム運用においては、各プロセスの待ち時間は最小である必要はなく、多くの場合、各プロセスは予め定められた時間（例えば、アプリケーションのタイムアウトが発生しない時間）内に処理を開始すればよい。つまり、所定の時間内に、各プロセスは少なくとも１回ディスクアクセスを行えばよい。このような条件の下で、スループットを可能な限り大きくすることが望まれる。 The inventor of the present application paid attention to the following points. In actual system operation, the waiting time of each process does not need to be minimum, and in many cases, if each process starts processing within a predetermined time (for example, a time when an application timeout does not occur) Good. In other words, each process may perform disk access at least once within a predetermined time. Under such conditions, it is desired to increase the throughput as much as possible.

本発明の１つの目的は、所定の時間内に全てのプロセスがディスクアクセスを少なくとも１回実行するという条件の下で、スループットをできるだけ大きくすることができるスケジューリング技術を提供することにある。 One object of the present invention is to provide a scheduling technique capable of increasing the throughput as much as possible under the condition that all processes execute disk access at least once within a predetermined time.

本発明の第１の観点において、Ｉ／Ｏスケジューラが提供される。Ｉ／Ｏスケジューラは、複数のプロセスがディスクにアクセスする際に、それぞれのプロセスから発行されるディスクアクセス要求のスケジューリング処理をコンピュータに実行させる。そのスケジューリング処理は、（Ａ）複数のプロセスのそれぞれがアクセスするアクセス対象ファイルを示すプロセス特性情報、及びディスクに記録されているファイルの位置を示すファイル位置情報を、記憶装置から読み出すステップと、（Ｂ）複数のプロセスからのディスクアクセス要求を、複数の待ちキューのそれぞれに格納するステップと、（Ｃ）プロセス特性情報及びファイル位置情報を参照し、アクセス対象ファイルがディスク上の並び順で順番にアクセスされるように、ディスクアクセス要求が取り出される複数の待ちキューの順番を決定するステップと、（Ｄ）所定の時間内に複数のプロセスの全てが少なくとも１回ディスクアクセスを行うように、当該所定の時間内に複数の待ちキューのそれぞれから取り出されるディスクアクセス要求の数を決定するステップと、（Ｅ）上記決定された順番及び数に従って、複数の待ちキューからディスクアクセス要求を取得し、取得したディスクアクセス要求をディスクに送るステップと、を含む。 In a first aspect of the present invention, an I / O scheduler is provided. When a plurality of processes access a disk, the I / O scheduler causes a computer to execute a scheduling process of a disk access request issued from each process. The scheduling process includes (A) reading process characteristic information indicating an access target file accessed by each of a plurality of processes and file position information indicating a position of a file recorded on the disk from a storage device; B) A step of storing disk access requests from a plurality of processes in each of a plurality of wait queues, and (C) referring to process characteristic information and file position information, and the access target files are sequentially arranged in the order of arrangement on the disk. Determining the order of a plurality of wait queues from which disk access requests are retrieved so as to be accessed, and (D) the predetermined process so that all of the plurality of processes perform disk access at least once within a predetermined time period. The disk is removed from each of the multiple waiting queues within Determining a number of access requests, according to the order and the number determined above (E), comprising the steps of: sending from a plurality of waiting queue to get the disk access request, the disk access request to the disk obtained, the.

本発明の第２の観点において、複数のプロセスがディスクにアクセスする際に、それぞれのプロセスから発行されるディスクアクセス要求のスケジューリング処理を行うＩ／Ｏスケジューリング方法が提供される。そのＩ／Ｏスケジューリング方法は、（Ａ）複数のプロセスのそれぞれがアクセスするアクセス対象ファイルを示すプロセス特性情報、及びディスクに記録されているファイルの位置を示すファイル位置情報を、記憶装置から読み出すステップと、（Ｂ）複数のプロセスからのディスクアクセス要求を、複数の待ちキューのそれぞれに格納するステップと、（Ｃ）プロセス特性情報及びファイル位置情報を参照し、アクセス対象ファイルがディスク上の並び順で順番にアクセスされるように、ディスクアクセス要求が取り出される複数の待ちキューの順番を決定するステップと、（Ｄ）所定の時間内に複数のプロセスの全てが少なくとも１回ディスクアクセスを行うように、当該所定の時間内に複数の待ちキューのそれぞれから取り出されるディスクアクセス要求の数を決定するステップと、（Ｅ）上記決定された順番及び数に従って、複数の待ちキューからディスクアクセス要求を取得し、取得したディスクアクセス要求をディスクに送るステップと、を含む。 In a second aspect of the present invention, there is provided an I / O scheduling method for performing a scheduling process of a disk access request issued from each process when a plurality of processes access the disk. In the I / O scheduling method, (A) a step of reading process characteristic information indicating an access target file accessed by each of a plurality of processes and file position information indicating a position of a file recorded on a disk from a storage device (B) a step of storing disk access requests from a plurality of processes in each of a plurality of waiting queues; and (C) referring to process characteristic information and file position information, and the access target files are arranged in order on the disk. (D) determining the order of a plurality of waiting queues from which disk access requests are taken out, so that all of the plurality of processes perform disk access at least once within a predetermined time period. And is taken out from each of the plurality of waiting queues within the predetermined time. And determining the number of I disk access request, in accordance with (E) the determined order and number, and sending a plurality of waiting queue to get the disk access request, the disk access request to the disk obtained, the.

本発明の第３の観点において、計算機システムが提供される。計算機システムは、ディスクと、そのディスクにアクセスする複数のプロセスを実行する処理装置と、Ｉ／Ｏスケジューラと、記憶装置とを備える。記憶装置には、複数のプロセスのそれぞれがアクセスするアクセス対象ファイルを示すプロセス特性情報、及びディスクに記録されているファイルの位置を示すファイル位置情報が格納される。Ｉ／Ｏスケジューラは、処理装置によって実行され、複数のプロセスのそれぞれから発行されるディスクアクセス要求のスケジューリング処理を行う。より詳細には、Ｉ／Ｏスケジューラは、複数のプロセスからのディスクアクセス要求を複数の待ちキューのそれぞれに格納する。Ｉ／Ｏスケジューラは、プロセス特性情報及びファイル位置情報を参照し、アクセス対象ファイルがディスク上の並び順で順番にアクセスされるように、ディスクアクセス要求が取り出される複数の待ちキューの順番を決定する。Ｉ／Ｏスケジューラは、所定の時間内に複数のプロセスの全てが少なくとも１回ディスクアクセスを行うように、所定の時間内に複数の待ちキューのそれぞれから取り出されるディスクアクセス要求の数を決定する。そして、Ｉ／Ｏスケジューラは、決定された順番及び数に従って、複数の待ちキューからディスクアクセス要求を取得し、取得したディスクアクセス要求をディスクに送る。 In a third aspect of the present invention, a computer system is provided. The computer system includes a disk, a processing device that executes a plurality of processes that access the disk, an I / O scheduler, and a storage device. The storage device stores process characteristic information indicating an access target file accessed by each of a plurality of processes, and file position information indicating a position of a file recorded on the disk. The I / O scheduler is executed by the processing device and performs a scheduling process of a disk access request issued from each of a plurality of processes. More specifically, the I / O scheduler stores disk access requests from a plurality of processes in each of a plurality of wait queues. The I / O scheduler refers to the process characteristic information and the file position information, and determines the order of a plurality of wait queues from which disk access requests are retrieved so that the access target file is accessed in order in the arrangement order on the disk. . The I / O scheduler determines the number of disk access requests fetched from each of the plurality of wait queues within a predetermined time period so that all of the plurality of processes perform disk access at least once within the predetermined time period. Then, the I / O scheduler acquires disk access requests from a plurality of waiting queues according to the determined order and number, and sends the acquired disk access requests to the disk.

本発明のスケジューリング技術によれば、所定の時間内に全てのプロセスがディスクアクセスを少なくとも１回実行するという条件の下で、スループットをできるだけ大きくすることが可能となる。 According to the scheduling technique of the present invention, it is possible to maximize the throughput under the condition that all processes execute disk access at least once within a predetermined time.

添付図面を参照して、本発明の実施の形態に係る計算機システム及びＩ／Ｏスケジューラを説明する。 A computer system and an I / O scheduler according to an embodiment of the present invention will be described with reference to the accompanying drawings.

１．計算機システム
図２は、本発明の実施の形態に係る計算機システム（コンピュータシステム）１の構成の一例を示している。計算機システム１は、処理装置２、記憶装置３、ディスク４、入力装置５及び出力装置６を備えている。処理装置２は、１つあるいは複数のＣＰＵを含んでいる。記憶装置３は例えばＲＡＭ（Random Access Memory）である。ディスク４は例えば複数のファイルが記録されたＨＤＤ（Hard Disk Drive）である。ディスク４は、ネットワークを介して複数の計算機システム１で共有されていてもよい。入力装置５は例えばキーボードやマウスである。出力装置６は例えばディスプレイである。計算機システム１は、仮想化技術によって構築されていてもよい。 1. Computer System FIG. 2 shows an example of the configuration of a computer system (computer system) 1 according to an embodiment of the present invention. The computer system 1 includes a processing device 2, a storage device 3, a disk 4, an input device 5, and an output device 6. The processing device 2 includes one or a plurality of CPUs. The storage device 3 is, for example, a RAM (Random Access Memory). The disk 4 is, for example, an HDD (Hard Disk Drive) in which a plurality of files are recorded. The disk 4 may be shared by a plurality of computer systems 1 via a network. The input device 5 is, for example, a keyboard or a mouse. The output device 6 is a display, for example. The computer system 1 may be constructed by a virtualization technique.

処理装置２は、ＯＳの他、様々なアプリケーションを実行する。このとき、処理装置２は、ＯＳやアプリケーション等により発生するプロセスを実行する。多くのプロセスは、ディスク４へのアクセス処理（データ読み込み、データ書き込み）を必要とし、ディスクアクセス要求をＯＳ内のＩ／Ｏスケジューラ１０に発行する。Ｉ／Ｏスケジューラ１０は、処理装置２によって実行されるプログラムの一種であり、複数のプロセスのそれぞれから発行されるディスクアクセス要求のスケジューリング処理を行う。尚、ＯＳやＩ／Ｏスケジューラ１０は、ディスク４やコンピュータ読み取り可能な記録媒体に記録されている。そして、ＯＳやＩ／Ｏスケジューラ１０は、記憶装置３に読み出され、処理装置２によって実行される。 The processing device 2 executes various applications in addition to the OS. At this time, the processing device 2 executes a process generated by the OS, application, or the like. Many processes require access processing (data read, data write) to the disk 4 and issue a disk access request to the I / O scheduler 10 in the OS. The I / O scheduler 10 is a kind of program executed by the processing device 2, and performs scheduling processing of disk access requests issued from each of a plurality of processes. The OS and the I / O scheduler 10 are recorded on the disk 4 or a computer-readable recording medium. Then, the OS and the I / O scheduler 10 are read into the storage device 3 and executed by the processing device 2.

後に詳しく説明されるように、本実施の形態に係るＩ／Ｏスケジューラ１０は、所定の時間内に全てのプロセスがディスクアクセスを少なくとも１回実行するという条件の下で、スループットができるだけ大きくなるようにスケジューリング処理を行う。そのために、Ｉ／Ｏスケジューラ１０は、プロセス特性情報ＰＲＯＣ、ファイル位置情報ＬＯＣ、シーク時間情報ＳＥＫ、アクセス時間情報ＡＣＳ等を利用する。それらプロセス特性情報ＰＲＯＣ、ファイル位置情報ＬＯＣ、シーク時間情報ＳＥＫ及びアクセス時間情報ＡＣＳは、記憶装置３に格納される。まず、各情報の詳細を説明する。 As will be described in detail later, the I / O scheduler 10 according to the present embodiment increases the throughput as much as possible under the condition that all processes execute disk access at least once within a predetermined time. Perform scheduling processing. For this purpose, the I / O scheduler 10 uses process characteristic information PROC, file position information LOC, seek time information SEK, access time information ACS, and the like. The process characteristic information PROC, file position information LOC, seek time information SEK, and access time information ACS are stored in the storage device 3. First, details of each information will be described.

（プロセス特性情報ＰＲＯＣ）
図３は、プロセス特性情報ＰＲＯＣの一例を示している。プロセス特性情報ＰＲＯＣは、それぞれのプロセスがアクセスするディスク４上のファイル（アクセス対象ファイル）を示している。例えば図３の例では、プロセス特性情報ＰＲＯＣは、複数のプロセスＰ１〜Ｐ４がそれぞれ複数のファイルＦ１〜Ｆ４へアクセスすることを示している。プロセス特性情報ＰＲＯＣは、プロセス（Ｐ１〜Ｐ４）とアクセス対象ファイル（Ｆ１〜Ｆ４）との対応関係を示しているとも言える。このプロセス特性情報ＰＲＯＣを参照することによって、Ｉ／Ｏスケジューラ１０は、各プロセスがディスク４上のどのアクセス対象ファイルにアクセスするか把握することができる。 (Process characteristic information PROC)
FIG. 3 shows an example of the process characteristic information PROC. The process characteristic information PROC indicates a file (access target file) on the disk 4 accessed by each process. For example, in the example of FIG. 3, the process characteristic information PROC indicates that the plurality of processes P1 to P4 access the plurality of files F1 to F4, respectively. It can be said that the process characteristic information PROC indicates the correspondence between the processes (P1 to P4) and the access target files (F1 to F4). By referring to the process characteristic information PROC, the I / O scheduler 10 can grasp which access target file on the disk 4 is accessed by each process.

（ファイル位置情報ＬＯＣ）
図４Ａ及び図４Ｂは、ファイル位置情報ＬＯＣの例を示している。ファイル位置情報ＬＯＣは、ディスク４に記録されているファイルの位置を示す情報である。図４Ａの例では、ファイル名、開始位置及び終了位置の対応関係がファイル毎に示されている。図４Ｂの例では、ファイル名、開始位置及びファイルサイズの対応関係がファイル毎に示されている。図４Ｂの場合、各ファイルの終了位置は、開始位置とファイルサイズから求めることができる。このファイル位置情報ＬＯＣを参照することによって、Ｉ／Ｏスケジューラ１０は、ディスク４上のファイルの配置関係及びファイル間の距離を計算することができる。尚、図４Ａ及び図４Ｂで示された例では、ファイル位置を表す単位として、ディスク４の先頭からのバイト数が用いられているが、それに限られない。ファイル位置は、ディスク４の先頭からのシリンダ数などで表されてもよい。 (File location information LOC)
4A and 4B show examples of the file position information LOC. The file position information LOC is information indicating the position of the file recorded on the disk 4. In the example of FIG. 4A, the correspondence between the file name, the start position, and the end position is shown for each file. In the example of FIG. 4B, the correspondence between the file name, the start position, and the file size is shown for each file. In the case of FIG. 4B, the end position of each file can be obtained from the start position and the file size. By referring to the file position information LOC, the I / O scheduler 10 can calculate the arrangement relationship of the files on the disk 4 and the distance between the files. In the example shown in FIGS. 4A and 4B, the number of bytes from the top of the disk 4 is used as a unit representing the file position, but is not limited thereto. The file position may be represented by the number of cylinders from the top of the disk 4.

（シーク時間情報ＳＥＫ）
図５Ａ及び図５Ｂは、シーク時間情報ＳＥＫの例を示している。シーク時間情報ＳＥＫは、ディスク４上でヘッドが移動する距離である「シーク距離」とその移動時間である「シーク時間」との対応関係を示す情報である。図５Ａは、その対応関係ｙ＝ｆ（ｘ）を示すグラフである。図５Ａにおいて、横軸はシーク距離ｘを表し、縦軸はシーク時間ｙを表している。あるいは、図５Ｂに示されるように、シーク時間情報ＳＥＫは、シーク距離とシーク時間との対応関係をテーブル形式で与えてもよい。図５Ｂによれば、例えば、シーク距離が０．５ＧＢｙｔｅのときにシーク時間は８．０ミリ秒であり、シーク距離が６４ＧＢｙｔｅのときにシーク時間は１７．５ミリ秒であることがわかる。テーブルのエントリに直接示されていないシーク距離に対応するシーク時間は、エントリ間を補間すること等によって算出することができる。尚、シーク距離は、バイト数だけでなくシリンダ数でも表され得る。 (Seek time information SEK)
5A and 5B show examples of seek time information SEK. The seek time information SEK is information indicating the correspondence between the “seek distance” that is the distance the head moves on the disk 4 and the “seek time” that is the movement time. FIG. 5A is a graph showing the correspondence y = f (x). In FIG. 5A, the horizontal axis represents the seek distance x, and the vertical axis represents the seek time y. Alternatively, as shown in FIG. 5B, the seek time information SEK may give the correspondence between the seek distance and the seek time in a table format. According to FIG. 5B, for example, when the seek distance is 0.5 GB, the seek time is 8.0 milliseconds, and when the seek distance is 64 GB, the seek time is 17.5 milliseconds. The seek time corresponding to the seek distance not directly shown in the table entry can be calculated by interpolating between the entries. The seek distance can be expressed not only by the number of bytes but also by the number of cylinders.

（アクセス時間情報ＡＣＳ）
図６Ａ及び図６Ｂは、アクセス時間情報ＡＣＳの例を示している。アクセス時間情報ＡＣＳは、アクセスされるデータサイズとそのアクセスに要する時間の関係を示す情報である。ディスクアクセスにはデータ読み出しとデータ書き込みがあり、それぞれに対してアクセス時間情報ＡＣＳが定義され得る。例えばデータ読み出しの場合、アクセス時間情報ＡＣＳは、「読み出しデータサイズ」とその読み出し動作に要する「読み出し時間」との対応関係を示す。図６Ａは、その対応関係ｙ＝ｇ（ｘ）を示すグラフである。図６Ａにおいて、横軸は読み出しデータサイズｘを表し、縦軸は読み出し時間ｙを表している。あるいは、図６Ｂに示されるように、アクセス時間情報ＡＣＳは、読み出しデータサイズと読み出し時間との対応関係をテーブル形式で与えてもよい。図６Ｂによれば、例えば、読み出しデータサイズが４ｋＢのときに読み出し時間は０．１０ミリ秒であり、読み出しデータサイズが１２８ｋＢのときに読み出し時間は３．２ミリ秒であることがわかる。テーブルのエントリに直接示されていない読み出しデータサイズに対応する読み出し時間は、エントリ間を補間すること等によって算出することができる。 (Access time information ACS)
6A and 6B show examples of access time information ACS. The access time information ACS is information indicating the relationship between the data size to be accessed and the time required for the access. Disk access includes data read and data write, and access time information ACS can be defined for each. For example, in the case of data reading, the access time information ACS indicates the correspondence between “read data size” and “read time” required for the read operation. FIG. 6A is a graph showing the correspondence y = g (x). In FIG. 6A, the horizontal axis represents the read data size x, and the vertical axis represents the read time y. Alternatively, as shown in FIG. 6B, the access time information ACS may give the correspondence between the read data size and the read time in a table format. According to FIG. 6B, for example, it can be seen that when the read data size is 4 kB, the read time is 0.10 milliseconds, and when the read data size is 128 kB, the read time is 3.2 milliseconds. The read time corresponding to the read data size not directly shown in the table entry can be calculated by interpolating between the entries.

２．Ｉ／Ｏスケジューラ
本実施の形態に係るＩ／Ｏスケジューラ１０は、上述の情報を適宜利用することによって、スケジューリング処理を行う。このスケジューリング処理において、次の点が留意される。第一に、本実施の形態では、プロセスのディスクアクセスに対して所定のレスポンスタイム（以下、「要求ピリオドＴｐｒｄ」と参照される）が要求される。全てのプロセスは、その要求ピリオドＴｐｒｄ内に少なくとも１回ディスクアクセスを行うことが要求される。要求ピリオドＴｐｒｄは、システム効率が劣化しない範囲でユーザにより適宜設定され得る。例えば、要求ピリオドＴｐｒｄは、計算機システム１が実行するアプリケーションのタイムアウト時間以下に設定される。第二に、それぞれのプロセスのディスクアクセスに関するスループットの合計ができるだけ大きくなるように、スケジューリングが行われる。すなわち、本実施の形態に係るＩ／Ｏスケジューラ１０は、要求ピリオドＴｐｒｄ内に全てのプロセスがディスクアクセスを少なくとも１回実行するという条件の下で、スループットができるだけ大きくなるようにスケジューリング処理を行う。 2. I / O scheduler The I / O scheduler 10 according to the present embodiment performs scheduling processing by appropriately using the above information. The following points are noted in this scheduling process. First, in this embodiment, a predetermined response time (hereinafter referred to as “request period Tprd”) is required for the disk access of the process. All processes are required to make a disk access at least once within the requested period Tprd. The request period Tprd can be set as appropriate by the user as long as the system efficiency does not deteriorate. For example, the request period Tprd is set to be equal to or less than the timeout time of the application executed by the computer system 1. Second, scheduling is performed so that the total throughput of each process regarding disk access is as large as possible. That is, the I / O scheduler 10 according to the present embodiment performs the scheduling process so that the throughput becomes as large as possible under the condition that all processes execute disk access at least once within the request period Tprd.

２−１．概要
図７に示されるように、複数のプロセスＰ１〜Ｐ４がＩ／Ｏスケジューラ１０を介してディスクアクセスを行う場合を考える。複数のプロセスＰ１〜Ｐ４は、ディスク４に記録された複数のファイルＦ１〜Ｆ４のそれぞれにアクセスする。このとき、それぞれのプロセスＰ１〜Ｐ４はディスクアクセス要求をＩ／Ｏスケジューラ１０に発行する。Ｉ／Ｏスケジューラ１０は、プロセスＰ１〜Ｐ４のそれぞれから発行されるディスクアクセス要求のスケジューリング処理を行う。 2-1. Overview As shown in FIG. 7, consider a case where a plurality of processes P 1 to P 4 perform disk access via the I / O scheduler 10. The plurality of processes P1 to P4 access each of the plurality of files F1 to F4 recorded on the disk 4. At this time, each of the processes P 1 to P 4 issues a disk access request to the I / O scheduler 10. The I / O scheduler 10 performs scheduling processing of disk access requests issued from the processes P1 to P4.

Ｉ／Ｏスケジューラ１０は、ディスクアクセス要求を一時的に保持しておく複数の待ちキューＱ１〜Ｑ４を有している。待ちキューはプロセス毎に割り当てられており、図７の例では、待ちキューＱ１〜Ｑ４がそれぞれプロセスＰ１〜Ｐ４に割り当てられている。つまり、プロセスＰ１〜Ｐ４からのディスクアクセス要求は、待ちキューＱ１〜Ｑ４のそれぞれに一旦格納される。 The I / O scheduler 10 has a plurality of wait queues Q1 to Q4 that temporarily hold disk access requests. The wait queue is assigned for each process. In the example of FIG. 7, the wait queues Q1 to Q4 are assigned to the processes P1 to P4, respectively. That is, disk access requests from the processes P1 to P4 are temporarily stored in the waiting queues Q1 to Q4.

更に、Ｉ／Ｏスケジューラ１０は、プロセスモニタ部２０、スケジューリング決定部３０、及びリクエスト選択部４０を有している。プロセスモニタ部２０は、動作中のプロセスをモニタし、プロセスの情報を取得する機能を有する。スケジューリング決定部３０は、ディスクアクセス要求が取り出される待ちキューの順番や、各待ちキューから取り出すべきディスクアクセス要求の数を決定する機能を有する。リクエスト選択部４０は、決定された順番及び数に従って、待ちキューからディスクアクセス要求を取得し、取得したものを順次ディスク４へ送る機能を有する。 Further, the I / O scheduler 10 includes a process monitor unit 20, a scheduling determination unit 30, and a request selection unit 40. The process monitor unit 20 has a function of monitoring an operating process and acquiring process information. The scheduling determination unit 30 has a function of determining the order of waiting queues from which disk access requests are taken out and the number of disk access requests to be taken out from each waiting queue. The request selection unit 40 has a function of acquiring disk access requests from the waiting queue according to the determined order and number, and sequentially transmitting the acquired ones to the disk 4.

図８は、本実施の形態に係るスケジューリング処理を示すフローチャートである。図７及び図８を参照して、本実施の形態に係るスケジューリング処理を説明する。 FIG. 8 is a flowchart showing the scheduling process according to the present embodiment. The scheduling process according to the present embodiment will be described with reference to FIGS.

（ステップＳ１０）
スケジューリング決定部３０は、プロセス特性情報ＰＲＯＣ、ファイル位置情報ＬＯＣ、シーク時間情報ＳＥＫ及びアクセス時間情報ＡＣＳを、記憶装置３から読み出す。また、ＯＳは、予め設定されている要求ピリオドＴｐｒｄの情報をスケジューリング決定部３０に与える。要求ピリオドＴｐｒｄは、全てのプロセスＰ１〜Ｐ４で共通であるとする。 (Step S10)
The scheduling determination unit 30 reads process characteristic information PROC, file position information LOC, seek time information SEK, and access time information ACS from the storage device 3. In addition, the OS provides the scheduling determination unit 30 with information on the preset request period Tprd. The request period Tprd is common to all the processes P1 to P4.

（ステップＳ２０）
Ｉ／Ｏスケジューラ１０は、プロセスＰ１〜Ｐ４のそれぞれからのディスクアクセス要求を、待ちキューＱ１〜Ｑ４のそれぞれに一旦格納する。また、プロセスモニタ部２０は、動作中のプロセスＰ１〜Ｐ４をモニタし、それらプロセスＰ１〜Ｐ４の情報を取得する。取得する情報としては、プロセス番号やプロセス名が挙げられる。例えば、ＯＳがＬｉｎｕｘの場合、ｐｓコマンドを利用することによって、プロセス番号やプロセス名を取得することができる。 (Step S20)
The I / O scheduler 10 temporarily stores disk access requests from the processes P1 to P4 in the waiting queues Q1 to Q4, respectively. In addition, the process monitor unit 20 monitors the processes P1 to P4 that are operating, and acquires information about the processes P1 to P4. Information to be acquired includes a process number and a process name. For example, when the OS is Linux, the process number and process name can be acquired by using the ps command.

（ステップＳ３０）
スケジューリング決定部３０は、要求ピリオドＴｐｒｄ内のスケジューリング規則を示す「スケジューリングパラメータ」を決定する。スケジューリングパラメータには、ディスクアクセス要求が取り出される待ちキューＱ１〜Ｑ４の順番や、要求ピリオドＴｐｒｄ内に待ちキューＱ１〜Ｑ４のそれぞれから取り出すべきディスクアクセス要求の数Ａｉが含まれる。ここで、添え字ｉは、待ちキューＱ１〜Ｑ４、すなわち、プロセスＰ１〜Ｐ４のそれぞれを示す。本実施の形態によれば、スケジューリング決定部３０は、要求ピリオドＴｐｒｄ内に全プロセスＰ１〜Ｐ４が少なくとも１回ディスクアクセスを行い、且つ、プロセスＰ１〜Ｐ４のそれぞれに関するスループットの合計ができるだけ大きくなるように、スケジューリングパラメータを決定する。その詳細は後述される。 (Step S30)
The scheduling determination unit 30 determines a “scheduling parameter” indicating a scheduling rule in the request period Tprd. The scheduling parameters include the order of the waiting queues Q1 to Q4 from which disk access requests are taken out, and the number Ai of disk access requests to be taken out from each of the waiting queues Q1 to Q4 in the request period Tprd. Here, the suffix i indicates each of the waiting queues Q1 to Q4, that is, the processes P1 to P4. According to the present embodiment, the scheduling determination unit 30 causes all the processes P1 to P4 to access the disk at least once within the request period Tprd, and the total throughput for each of the processes P1 to P4 is as large as possible. Next, scheduling parameters are determined. Details thereof will be described later.

（ステップＳ４０）
リクエスト選択部４０は、スケジューリング決定部３０によって決定されたスケジューリングパラメータに従って、それぞれの待ちキューＱ１〜Ｑ４からのディスクアクセス要求の選択及び出力を行う。つまり、リクエスト選択部４０は、スケジューリングパラメータで示される順番に従って待ちキュー（ｉ＝Ｑ１〜Ｑ４）を１つずつ選択し、且つ、選択中の１つの待ちキューｉからはスケジューリングパラメータで示される数Ａｉだけディスクアクセス要求を取り出す。そして、リクエスト選択部４０は、取り出したディスクアクセス要求を順次ディスク４に送る。 (Step S40)
The request selection unit 40 selects and outputs disk access requests from the respective waiting queues Q1 to Q4 in accordance with the scheduling parameters determined by the scheduling determination unit 30. That is, the request selection unit 40 selects the waiting queues (i = Q1 to Q4) one by one according to the order indicated by the scheduling parameter, and the number Ai indicated by the scheduling parameter from one waiting queue i being selected. Only take out disk access requests. Then, the request selection unit 40 sequentially sends the extracted disk access requests to the disk 4.

ディスクアクセス要求に応答して、ディスク４ではアクセス対象ファイルに対するデータ読み出しあるいはデータ書き込みが行われる。そして、その結果が、当該ディスクアクセス要求を発行したプロセスに返される。本実施の形態では、要求ピリオドＴｐｒｄの間に、全てのプロセスＰ１〜Ｐ４が少なくとも１回ディスクアクセスを行う。 In response to the disk access request, the disk 4 performs data reading or data writing with respect to the access target file. The result is returned to the process that issued the disk access request. In the present embodiment, all the processes P1 to P4 perform disk access at least once during the request period Tprd.

要求ピリオドＴｐｒｄ毎に、ステップＳ４０は繰り返される。いずれかの待ちキューに蓄積されたディスクアクセス要求が無くなると、処理はステップＳ２０に戻る。また、上述のプロセスモニタ部２０は、動作中のプロセスのモニタを継続している。動作中のプロセスが増減すると、処理はステップＳ２０に戻る。 Step S40 is repeated for each request period Tprd. When there is no disk access request stored in any of the waiting queues, the process returns to step S20. Further, the process monitor unit 20 described above continues to monitor the process in operation. When the number of operating processes increases or decreases, the process returns to step S20.

２−２．スケジューリングパラメータの決定（ステップＳ３０）
次に、上述のステップＳ３０を詳しく説明する。図９は、本実施の形態におけるスケジューリングパラメータの決定方法を示すフローチャートである。 2-2. Determination of scheduling parameters (step S30)
Next, step S30 described above will be described in detail. FIG. 9 is a flowchart showing a method for determining a scheduling parameter in the present embodiment.

（ステップＳ３１０）
まず、スケジューリング決定部３０は、要求ピリオドＴｐｒｄにおいてディスクアクセス要求が取り出される待ちキューの順番を決定する。要求ピリオドＴｐｒｄにおけるスループットを増加させるためには、その要求ピリオドＴｐｒｄ中でアクセス対象ファイル間のシークにとられる時間をできるだけ削減することが望ましい。そのためには、要求ピリオドＴｐｒｄにおけるアクセス対象ファイル間のシーク距離の合計をできるだけ小さくすることが望ましい。よって、スケジューリング決定部３０は、アクセス対象ファイル間のシーク距離の合計がなるべく小さくなるように順番を決定する。 (Step S310)
First, the scheduling determination unit 30 determines the order of the waiting queue from which the disk access request is taken out in the request period Tprd. In order to increase the throughput in the request period Tprd, it is desirable to reduce as much as possible the time taken for seeking between the files to be accessed in the request period Tprd. For that purpose, it is desirable to make the total of the seek distances between the files to be accessed in the request period Tprd as small as possible. Therefore, the scheduling determination unit 30 determines the order so that the total seek distance between the access target files is as small as possible.

スケジューリング決定部３０は、プロセスモニタ部２０から動作中のプロセスＰ１〜Ｐ４の情報（例：プロセス名）を受け取る。そして、スケジューリング決定部３０は、プロセス特性情報ＰＲＯＣを参照することにより、動作中のプロセスＰ１〜Ｐ４のそれぞれがアクセスするアクセス対象ファイルＦ１〜Ｆ４を把握する。更に、スケジューリング決定部３０は、ファイル位置情報ＬＯＣを参照することにより、アクセス対象ファイルＦ１〜Ｆ４のディスク４上での配置関係を把握する。そして、スケジューリング決定部３０は、アクセス対象ファイルＦ１〜Ｆ４がディスク４上の“並び順（配置順）”で順番にアクセスされるように、ディスクアクセス要求が取り出される待ちキューＱ１〜Ｑ４の順番を決定する。 The scheduling determination unit 30 receives information (for example, process name) of the processes P1 to P4 that are operating from the process monitor unit 20. Then, the scheduling determination unit 30 refers to the process characteristic information PROC to grasp the access target files F1 to F4 accessed by each of the operating processes P1 to P4. Further, the scheduling determination unit 30 refers to the file position information LOC to grasp the arrangement relationship of the access target files F1 to F4 on the disk 4. Then, the scheduling determining unit 30 sets the order of the waiting queues Q1 to Q4 from which the disk access request is taken out so that the access target files F1 to F4 are sequentially accessed in the “arrangement order (arrangement order)” on the disk 4. decide.

例えば、図４Ａ及び図４Ｂで示されたファイル位置情報ＬＯＣの例の場合、アクセス対象ファイルＦ１〜Ｆ４は、ディスク４の先頭から「Ｆ１、Ｆ２、Ｆ３、Ｆ４」の順番で配置されている。従って、図１０Ａに示されるように、アクセス対象ファイルＦ１〜Ｆ４が「Ｆ１、Ｆ２、Ｆ３、Ｆ４、Ｆ１」の順番でアクセスされる場合に、シーク距離の合計が最も小さくなる。しかしながら、図１０Ｂに示されるように、アクセス対象ファイルＦ１〜Ｆ４が「Ｆ１、Ｆ４、Ｆ２、Ｆ３、Ｆ１」といった順番でアクセスされると、シーク距離の合計は大きくなってしまう。上述の通り、スループットを大きくするためには、アクセス対象ファイルＦ１〜Ｆ４間のシーク距離の合計はなるべく小さいことが望ましい。従って、アクセス対象ファイルＦ１〜Ｆ４のアクセス順としては、図１０Ａで示されたような順番が好適である。あるいは、図１０Ａと逆の順番「Ｆ４、Ｆ３、Ｆ２、Ｆ１、Ｆ４」であっても同じである。いずれにせよ、アクセス対象ファイルＦ１〜Ｆ４のアクセス順としては、ディスク４上の“並び順（配置順）”に沿っていることが望ましい。 For example, in the example of the file position information LOC shown in FIGS. 4A and 4B, the access target files F1 to F4 are arranged in the order of “F1, F2, F3, F4” from the top of the disk 4. Therefore, as shown in FIG. 10A, when the access target files F1 to F4 are accessed in the order of “F1, F2, F3, F4, F1”, the total seek distance becomes the smallest. However, as shown in FIG. 10B, when the access target files F1 to F4 are accessed in the order of “F1, F4, F2, F3, F1”, the total seek distance becomes large. As described above, in order to increase the throughput, the total seek distance between the access target files F1 to F4 is preferably as small as possible. Therefore, as the access order of the access target files F1 to F4, the order shown in FIG. 10A is preferable. Or it is the same even if it is order "F4, F3, F2, F1, F4" reverse to FIG. 10A. In any case, it is desirable that the access order of the access target files F1 to F4 is in accordance with the “arrangement order (arrangement order)” on the disk 4.

スケジューリング決定部３０は、要求ピリオドＴｐｒｄにおいて図１０Ａで示されたようなアクセス順が再現されるように、ディスクアクセス要求が取り出される待ちキューＱ１〜Ｑ４の順番を決定する。すなわち、本例では、取り出し順は、「Ｑ１、Ｑ２、Ｑ３、Ｑ４」あるいは「Ｑ２、Ｑ３、Ｑ４、Ｑ１」あるいは「Ｑ３、Ｑ４、Ｑ１、Ｑ２」あるいは「Ｑ４、Ｑ３、Ｑ２、Ｑ１」と決定される。これにより、要求ピリオドＴｐｒｄ中でアクセス対象ファイルＦ１〜Ｆ４間のシークに割かれる時間が極力抑えられ、データ読み出しあるいはデータ書き込みに割り当て可能な時間が増えることになる。このことは、要求ピリオドＴｐｒｄにおけるスループットの合計が増加することを意味する。 The scheduling determining unit 30 determines the order of the waiting queues Q1 to Q4 from which the disk access requests are taken out so that the access order as shown in FIG. 10A is reproduced in the request period Tprd. That is, in this example, the extraction order is “Q1, Q2, Q3, Q4” or “Q2, Q3, Q4, Q1” or “Q3, Q4, Q1, Q2” or “Q4, Q3, Q2, Q1”. It is determined. As a result, the time allocated for seeking between the access target files F1 to F4 in the requested period Tprd is suppressed as much as possible, and the time allocatable for data reading or data writing is increased. This means that the total throughput in the requested period Tprd increases.

尚、例えば図１０Ａに示されるように、アクセス対象ファイルＦ１〜Ｆ４に対するアクセスは、要求ピリオドＴｐｒｄでちょうど一巡する。図１０Ａの例では、１回の要求ピリオドＴｐｒｄにおいて、ディスクヘッドはファイルＦ１の位置からファイルＦ４の位置まで順次移動した後、最初のファイルＦ１の位置まで戻る。つまり、アクセス対象ファイルＦ１〜Ｆ４に対するアクセスは循環的に行われ、要求ピリオドＴｐｒｄ毎に同じ動作が繰り返される。 For example, as shown in FIG. 10A, the access to the access target files F1 to F4 is completed once in the request period Tprd. In the example of FIG. 10A, in one request period Tprd, the disk head sequentially moves from the position of the file F1 to the position of the file F4, and then returns to the position of the first file F1. That is, access to the access target files F1 to F4 is performed cyclically, and the same operation is repeated for each request period Tprd.

（ステップＳ３２０）
次に、スケジューリング決定部３０は、ステップＳ３１０で決定された順番でディスクアクセス要求が取り出された場合の、アクセス対象ファイルＦ１〜Ｆ４間のシーク距離（ファイル間シーク距離）を算出する。つまり、スケジューリング決定部３０は、要求ピリオドＴｐｒｄにおいて上記決定順でアクセス対象ファイルＦ１〜Ｆ４にアクセスが発生した場合の、ファイル間シーク距離を算出する。プロセスＰ１〜Ｐ４のそれぞれのアクセス対象ファイルＦ１〜Ｆ４はプロセス特性情報ＰＲＯＣに示されており、それらアクセス対象ファイルＦ１〜Ｆ４のディスク４上の位置はファイル位置情報ＬＯＣに示されている。従って、その情報を参照することによりファイル間シーク距離を計算することができる。 (Step S320)
Next, the scheduling determining unit 30 calculates the seek distance (access seek distance between files) between the access target files F1 to F4 when the disk access requests are extracted in the order determined in step S310. That is, the scheduling determination unit 30 calculates the inter-file seek distance when the access target files F1 to F4 are accessed in the determination order in the request period Tprd. The access target files F1 to F4 of the processes P1 to P4 are indicated in the process characteristic information PROC, and the positions of the access target files F1 to F4 on the disk 4 are indicated in the file position information LOC. Therefore, the seek distance between files can be calculated by referring to the information.

各ファイルはディスク４上でファイルサイズ分の幅を有しているため、ファイル間シーク距離の算出にあたって、各ファイルの代表位置を決定する必要がある。その代表位置としては、各ファイルの開始位置、終了位置、あるいは開始位置と終了位置との中間位置などが挙げられる。例えば中間位置が代表位置として用いられる場合、あるファイルの中間位置と次にアクセスされるファイルの中間位置との間の距離が、ファイル間シーク距離として算出される。 Since each file has a width corresponding to the file size on the disk 4, it is necessary to determine a representative position of each file when calculating the seek distance between files. Examples of the representative position include a start position, an end position, or an intermediate position between the start position and the end position of each file. For example, when the intermediate position is used as the representative position, the distance between the intermediate position of a certain file and the intermediate position of the next accessed file is calculated as the seek distance between files.

図１０Ａで示された例の場合を考える。この場合、要求ピリオドＴｐｒｄにおけるアクセス順は「Ｆ１、Ｆ２、Ｆ３、Ｆ４、Ｆ１」である。従って、ファイルＦ１とＦ２の間のシーク距離Ｌ１２、ファイルＦ２とＦ３の間のシーク距離Ｌ２３、ファイルＦ３とＦ４の間のシーク距離Ｌ３４、及びファイルＦ４とＦ１の間のシーク距離Ｌ４１が算出される。 Consider the case of the example shown in FIG. 10A. In this case, the access order in the request period Tprd is “F1, F2, F3, F4, F1”. Accordingly, the seek distance L12 between the files F1 and F2, the seek distance L23 between the files F2 and F3, the seek distance L34 between the files F3 and F4, and the seek distance L41 between the files F4 and F1 are calculated. .

（ステップＳ３３０）
次に、スケジューリング決定部３０は、シーク時間情報ＳＥＫ（図５Ａ、図５Ｂ参照）を参照することによって、ステップＳ３２０で算出されたファイル間シーク距離に対応するファイル間シーク時間を算出する。図５Ａで示されたような関数ｙ＝ｆ（ｘ）が用いられる場合、変数ｘにファイル間シーク距離を代入することによって、対応するファイル間シーク時間ｙを得ることができる。図５Ｂで示されたようなテーブルが用いられる場合、エントリ間を補間することによって、ファイル間シーク距離に対応するファイル間シーク時間を得ることができる。 (Step S330)
Next, the scheduling determination unit 30 refers to the seek time information SEK (see FIGS. 5A and 5B) to calculate an inter-file seek time corresponding to the inter-file seek distance calculated in step S320. When the function y = f (x) as shown in FIG. 5A is used, the corresponding inter-file seek time y can be obtained by substituting the inter-file seek distance into the variable x. When a table as shown in FIG. 5B is used, an inter-file seek time corresponding to an inter-file seek distance can be obtained by interpolating between entries.

図１０Ａで示された例の場合、上述のファイル間シーク距離Ｌ１２、Ｌ２３、Ｌ３４及びＬ４１のそれぞれに対応するファイル間シーク時間Ｔ１２、Ｔ２３、Ｔ３４及びＴ４１が算出される。算出されたファイル間シーク時間は、要求ピリオドＴｐｒｄ中でアクセス対象ファイルＦ１〜Ｆ４間のシークに割かれる時間に相当する。ステップＳ３１０で説明された順番決めの工夫の結果、ファイル間シーク時間の総和（Ｔ１２＋Ｔ２３＋Ｔ３４＋Ｔ４１）は最小限に抑えられている。その分、各ファイルでのデータ読み出しあるいはデータ書き込みに割り当て可能な時間が増えることになる。このことは、要求ピリオドＴｐｒｄにおけるスループットの合計が増加することを意味する。 In the example shown in FIG. 10A, the inter-file seek times T12, T23, T34, and T41 corresponding to the inter-file seek distances L12, L23, L34, and L41, respectively, are calculated. The calculated seek time between files corresponds to the time allocated for seeking between the access target files F1 to F4 in the request period Tprd. As a result of the devising of ordering described in step S310, the total inter-file seek time (T12 + T23 + T34 + T41) is minimized. Accordingly, the time allocatable for data reading or data writing in each file increases. This means that the total throughput in the requested period Tprd increases.

（ステップＳ３４０）
次に、スケジューリング決定部３０は、ステップＳ３３０で算出されたファイル間シーク時間のそれぞれを要求ピリオドＴｐｒｄから引くことによって、「残り時間Ｔｒｓｔ」を算出する。図１０Ａで示された例の場合、残り時間Ｔｒｓｔは、Ｔｐｒｄ−（Ｔ１２＋Ｔ２３＋Ｔ３４＋Ｔ４１）と算出される。この残り時間Ｔｒｓｔは、アクセス対象ファイルでのデータ読み出しあるいはデータ書き込みに利用可能な時間に相当する。上述の通り、ファイル間シーク時間の総和は最小限に抑えられているため、残り時間Ｔｒｓｔとして最大限の時間が確保される。 (Step S340)
Next, the scheduling determination unit 30 calculates the “remaining time Trst” by subtracting each of the inter-file seek times calculated in step S330 from the request period Tprd. In the case of the example shown in FIG. 10A, the remaining time Trst is calculated as Tprd− (T12 + T23 + T34 + T41). This remaining time Trst corresponds to the time available for data reading or data writing in the access target file. As described above, since the total sum of seek times between files is minimized, the maximum time is secured as the remaining time Trst.

ステップＳ３４０の処理は、データ読み出しあるいはデータ書き込みに割り当て可能な残り時間Ｔｒｓｔを見積もることだけでなく、要求ピリオドＴｐｒｄ中で全てのファイル間シーク動作に必要な時間をあらかじめ確保しておくことにも相当する。全てのファイル間シーク動作に必要な時間が確保されることは、全てのプロセスＰ１〜Ｐ４が要求ピリオドＴｐｒｄ内に少なくとも１回ディスクアクセスを行うことが可能となることを意味する。言い換えれば、ステップＳ３４０によって、全てのプロセスＰ１〜Ｐ４が要求ピリオドＴｐｒｄ内に少なくとも１回ディスクアクセスを行うことが保証される。 The processing in step S340 is equivalent to not only estimating the remaining time Trst that can be allocated to data reading or data writing, but also securing in advance the time required for all file-to-file seek operations in the requested period Tprd. To do. Securing the time required for all inter-file seek operations means that all the processes P1 to P4 can perform disk access at least once within the requested period Tprd. In other words, step S340 ensures that all processes P1-P4 make a disk access at least once within the requested period Tprd.

（ステップＳ３５０）
以上に説明されたように、ステップＳ３１０での順番決めにより、要求ピリオドＴｐｒｄにおけるスループットの合計の向上が期待される。また、ステップＳ３４０での残り時間Ｔｒｓｔの算出により、全てのプロセスＰ１〜Ｐ４が要求ピリオドＴｐｒｄ内に少なくとも１回ディスクアクセスを行うことが保証される。後は、スケジューリング決定部３０が、残り時間ＴｒｓｔをプロセスＰ１〜Ｐ４間で適宜分配すればよい。それぞれのプロセスＰ１〜Ｐ４に分配される時間に基づいて、残り時間Ｔｒｓｔ（すなわち要求ピリオドＴｐｒｄ）内にそれぞれの待ちキューＱ１〜Ｑ４から取り出すことができるディスクアクセス要求の数Ａｉを決定することができる。その方法としては様々考えられる。 (Step S350)
As described above, the total throughput in the request period Tprd is expected to be improved by the order determination in step S310. Further, the calculation of the remaining time Trst in step S340 ensures that all the processes P1 to P4 perform disk access at least once within the request period Tprd. Thereafter, the scheduling determination unit 30 may appropriately distribute the remaining time Trst among the processes P1 to P4. Based on the time distributed to the respective processes P1 to P4, the number Ai of disk access requests that can be taken out from the respective waiting queues Q1 to Q4 within the remaining time Trst (that is, the request period Tprd) can be determined. . Various methods are conceivable.

＜第１の例＞
例えば、ディスクアクセス要求の取得数Ａｉは複数のプロセスＰ１〜Ｐ４間で均等になるように決定される。そのために、スケジューリング決定部３０は例えば次のような処理を行う。ディスクアクセスがデータ読み出しの場合を例示するが、データ書き込みの場合も同様である。 <First example>
For example, the number Ai of disk access request acquisitions is determined to be equal among the plurality of processes P1 to P4. For this purpose, the scheduling determination unit 30 performs the following processing, for example. Although the case where the disk access is data reading is illustrated, the same applies to the case of data writing.

あるプロセスｉが１回のディスクアクセス要求によってアクセス対象ファイルから読み出すデータサイズはＤａｃｓ＿ｉであるとする。この１回の読み出しデータサイズＤａｃｓ＿ｉは、ユーザによって与えられてもよいし、システムで予め決まっていてもよい。また、読み出しデータサイズＤａｃｓ＿ｉは、プロセス間で同じであってもよいし異なっていてもよい。 Assume that a data size read from a file to be accessed by a process i by a single disk access request is Dacs_i. This one-time read data size Dacs_i may be given by the user or may be predetermined by the system. Further, the read data size Dacs_i may be the same or different between processes.

スケジューリング決定部３０は、アクセス時間情報ＡＣＳ（図６Ａ、図６Ｂ参照）を参照することによって、１回の読み出しデータサイズＤａｃｓ＿ｉに対応する読み出し時間Ｔａｃｓ＿ｉを算出する。図６Ａで示されたような関数ｙ＝ｇ（ｘ）が用いられる場合、変数ｘに読み出しデータサイズＤａｃｓ＿ｉを代入することによって、対応する読み出し時間Ｔａｃｓ＿ｉを得ることができる。図６Ｂで示されたようなテーブルが用いられる場合、エントリ間を補間することによって、読み出しデータサイズＤａｃｓ＿ｉに対応する読み出し時間Ｔａｃｓ＿ｉを得ることができる。また、スケジューリング決定部３０は、シーク時間情報ＳＥＫ（図５Ａ、図５Ｂ参照）を参照することによって、アクセス対象ファイルの中で読み出しデータ間を移動するのに要するファイル内シーク時間Ｔｓｅｋ＿ｉを算出する。このファイル内シーク時間Ｔｓｅｋ＿ｉは、１回の読み出しデータサイズＤａｃｓ＿ｉに対応する１回のシーク時間である。 The scheduling determination unit 30 calculates the read time Tacs_i corresponding to one read data size Dacs_i by referring to the access time information ACS (see FIGS. 6A and 6B). When the function y = g (x) as shown in FIG. 6A is used, the corresponding read time Tacs_i can be obtained by substituting the read data size Dacs_i for the variable x. When the table as shown in FIG. 6B is used, the read time Tacs_i corresponding to the read data size Dacs_i can be obtained by interpolating between the entries. Further, the scheduling determining unit 30 refers to the seek time information SEK (see FIGS. 5A and 5B) to calculate the in-file seek time Tsek_i required for moving between read data in the access target file. This in-file seek time Tsek_i is one seek time corresponding to one read data size Dacs_i.

上記読み出し時間Ｔａｃｓ＿ｉ及びファイル内シーク時間Ｔｓｅｋ＿ｉの和（Ｔａｃｓ＿ｉ＋Ｔｓｅｋ＿ｉ）は、あるプロセスｉからの１回のディスクアクセス要求の処理に必要な時間に相当する。従って、各プロセスｉに関して、残り時間Ｔｒｓｔ内に待ちキューから取り出すことができるディスクアクセス要求の数Ａｉは、次の式（１）で与えられ得る。 The sum of the read time Tacs_i and the file seek time Tsek_i (Tacs_i + Tsek_i) corresponds to the time required to process one disk access request from a certain process i. Therefore, for each process i, the number Ai of disk access requests that can be taken out from the waiting queue within the remaining time Trst can be given by the following equation (1).

式（１）において、Ｎは動作中のプロセスＰ１〜Ｐ４の全体を意味する。よって、式（１）の分母は、全てのプロセスＰ１〜Ｐ４からのディスクアクセス要求を１回ずつ処理するのに必要な時間の総和を表す。残り時間Ｔｒｓｔをその総和で割ることによって、各プロセスｉに関するディスクアクセス要求の取得数Ａｉを算出することができる。この場合、取得数ＡｉはプロセスＰ１〜Ｐ４間で等しくなる。尚、実際には取得数Ａｉは整数である必要があるため、式（１）により求まる値の小数点以下を切り捨てることにより取得数Ａｉが決定される。 In the formula (1), N means the entire processes P1 to P4 in operation. Therefore, the denominator of Expression (1) represents the total time required to process the disk access requests from all the processes P1 to P4 once. By dividing the remaining time Trst by the sum, it is possible to calculate the number of disk access request acquisitions Ai for each process i. In this case, the acquisition number Ai is equal between the processes P1 to P4. Actually, since the acquisition number Ai needs to be an integer, the acquisition number Ai is determined by rounding down the decimal part of the value obtained by the equation (1).

＜第２の例＞
プロセスＰ１〜Ｐ４のそれぞれに要求されるスループットに対して“重み”が与えられている場合を考える。この場合、得られるスループットにその重み付けが反映されるように、ディスクアクセス要求の取得数Ａｉを決定することが可能である。プロセスＰ１〜Ｐ４のそれぞれに要求されるスループットの重み付けを指定するパラメータは、以下「重み付けパラメータ」と参照される。重み付けパラメータは、例えばプロセス特性情報ＰＲＯＣに含まれる。 <Second example>
Consider a case where a “weight” is given to the throughput required for each of the processes P1 to P4. In this case, the number Ai of disk access request acquisitions can be determined so that the weighting is reflected in the obtained throughput. The parameter that designates the weighting of the throughput required for each of the processes P1 to P4 is hereinafter referred to as “weighting parameter”. The weighting parameter is included in the process characteristic information PROC, for example.

図１１は、本例におけるプロセス特性情報ＰＲＯＣを示している。図１１において、プロセス特性情報ＰＲＯＣは、プロセスＰ１〜Ｐ４のそれぞれに関するアクセス対象ファイルＦ１〜Ｆ４に加えて、重み付けパラメータを示している。本例では、重み付けパラメータは、要求スループットの「重み係数」である。重み係数は、それぞれのプロセスＰ１〜Ｐ４に要求されるスループット間の比率である。例えば、プロセスＰ１、Ｐ２に指定された重み係数がそれぞれ１と２である場合、プロセスＰ２にはプロセスＰ１の２倍のスループットが要求されている。 FIG. 11 shows the process characteristic information PROC in this example. In FIG. 11, the process characteristic information PROC indicates a weighting parameter in addition to the access target files F1 to F4 related to each of the processes P1 to P4. In this example, the weighting parameter is a “weighting factor” of the required throughput. The weighting factor is a ratio between the throughputs required for the respective processes P1 to P4. For example, when the weighting factors specified for the processes P1 and P2 are 1 and 2, respectively, the process P2 is required to have twice the throughput of the process P1.

スケジューリング決定部３０は、この重み係数に応じて残り時間Ｔｒｓｔを分配し、得られるスループットにその重み係数が反映されるようにディスクアクセス要求の取得数Ａｉを決定する。具体的には、既出の第１の例と同様に、スケジューリング決定部３０は、１回あたりの読み出し時間Ｔａｃｓ＿ｉ及びファイル内シーク時間Ｔｓｅｋ＿ｉを算出する。また、スケジューリング決定部３０は、プロセス特性情報ＰＲＯＣを参照し、それぞれのプロセスＰ１〜Ｐ４に指定された重み係数Ｗｉを取得する。この場合、ディスクアクセス要求の取得数Ａｉは、次の式（２）で与えられる。 The scheduling determination unit 30 distributes the remaining time Trst according to the weighting factor, and determines the number Ai of disk access request acquisitions so that the weighting factor is reflected in the obtained throughput. Specifically, as in the first example described above, the scheduling determination unit 30 calculates the read time Tacs_i and the in-file seek time Tsek_i per time. Further, the scheduling determination unit 30 refers to the process characteristic information PROC and acquires the weighting factor Wi specified for each of the processes P1 to P4. In this case, the acquisition number Ai of the disk access request is given by the following equation (2).

実際には取得数Ａｉは整数である必要があるため、式（２）により求まる値の小数点以下を切り捨てることにより取得数Ａｉが決定される。尚、式（２）により求まる値が１未満となるプロセスがある場合、そのプロセスに関する取得数を１とし、代わりに他のプロセスに関する２以上の取得数から１だけ減算する。減算対象として取得数が最大であるプロセスを選択すると、当該プロセスに割り当てられるスループットの減少率が小さいため、影響を小さく抑えることができる。このような処理によっても全てのプロセスに関する取得数が１以上にならない場合、Ｉ／Ｏスケジューラ１０はエラーを管理者に通知することもできる。 Actually, since the acquisition number Ai needs to be an integer, the acquisition number Ai is determined by rounding down the decimal point of the value obtained by the equation (2). When there is a process whose value obtained by the expression (2) is less than 1, the number of acquisitions related to that process is set to 1, and instead, 1 is subtracted from two or more acquisition numbers related to other processes. When the process with the maximum number of acquisitions is selected as the subtraction target, the reduction rate of the throughput allocated to the process is small, so that the influence can be suppressed small. If the number of acquisitions related to all processes does not become 1 or more by such processing, the I / O scheduler 10 can also notify the administrator of an error.

＜第３の例＞
要求スループットの重み付けは、上記重み係数に限られない。例えば、プロセスＰ１〜Ｐ４の中に、要求スループットが優先的に与えられる「優先プロセス」と、要求スループットがベストエフォートで与えられる「ベストエフォートプロセス」が混在する場合を考える。この場合、優先プロセスかベストエフォートプロセスかの区分けが、要求スループットの重み付けに相当することになる。 <Third example>
The weighting of the required throughput is not limited to the weighting factor. For example, let us consider a case where a “priority process” in which the requested throughput is given preferentially and a “best effort process” in which the requested throughput is given with the best effort are mixed in the processes P1 to P4. In this case, the classification between the priority process and the best effort process corresponds to the weighting of the requested throughput.

図１２は、本例におけるプロセス特性情報ＰＲＯＣを示している。図１２において、プロセス特性情報ＰＲＯＣは、プロセスＰ１〜Ｐ４のそれぞれに関するアクセス対象ファイルＦ１〜Ｆ４に加えて、重み付けパラメータを示している。本例では、重み付けパラメータは、各プロセスが優先プロセスかベストエフォートプロセスかを示す「クラス」と、優先プロセスに関する「要求スループット」と、ベストエフォートプロセスに関する要求スループットの「重み係数」とを含んでいる。優先プロセスに関する要求スループットは、優先的に確保されるべきスループットである。重み係数は、ベストエフォートプロセスのそれぞれに要求されるスループット間の比率である。 FIG. 12 shows the process characteristic information PROC in this example. In FIG. 12, the process characteristic information PROC indicates a weighting parameter in addition to the access target files F1 to F4 related to the processes P1 to P4, respectively. In this example, the weighting parameters include a “class” indicating whether each process is a priority process or a best effort process, a “request throughput” regarding the priority process, and a “weighting factor” of the request throughput regarding the best effort process. . The requested throughput related to the priority process is a throughput that should be secured with priority. The weighting factor is the ratio between the throughput required for each of the best effort processes.

スケジューリング決定部３０は、この重み付けパラメータに応じて残り時間Ｔｒｓｔを分配し、得られるスループットにその重み付けパラメータが反映されるようにディスクアクセス要求の取得数Ａｉを決定する。図１３を参照して、本例におけるステップＳ３５０の処理を詳しく説明する。 The scheduling determination unit 30 distributes the remaining time Trst according to the weighting parameter, and determines the number Ai of disk access request acquisitions so that the weighting parameter is reflected in the obtained throughput. With reference to FIG. 13, the process of step S350 in this example will be described in detail.

（ステップＳ３５１）
スケジューリング決定部３０は、プロセス特性情報ＰＲＯＣを参照し、まず優先プロセスｋに関してディスクアクセス要求の取得数Ａｋを決定する。ここで、添え字ｋは、プロセスＰ１〜Ｐ４に含まれる優先プロセスを示す。図１２の例の場合、プロセスＰ１、Ｐ２が優先プロセスｋである。プロセス特性情報ＰＲＯＣにおいて、優先プロセスｋに指定された要求スループットはＢｋであるとする。また、優先プロセスｋが１回のディスクアクセス要求によってアクセス対象ファイルから読み出すデータサイズはＤａｃｓであるとする。この１回の読み出しデータサイズＤａｃｓは、ユーザによって与えられてもよいし、システムで予め決まっていてもよい。この場合、スケジューリング決定部３０は、次の式（３）に従って、優先プロセスｋに関する取得数Ａｋを算出する。 (Step S351)
The scheduling determining unit 30 refers to the process characteristic information PROC, and first determines the disk access request acquisition number Ak for the priority process k. Here, the subscript k indicates a priority process included in the processes P1 to P4. In the example of FIG. 12, processes P1 and P2 are priority processes k. In the process characteristic information PROC, it is assumed that the requested throughput specified for the priority process k is Bk. Further, it is assumed that the data size read from the access target file by the priority process k by one disk access request is Dacs. This one-time read data size Dacs may be given by the user or may be predetermined by the system. In this case, the scheduling determination unit 30 calculates the acquisition number Ak for the priority process k according to the following equation (3).

式（３）において、分子は、要求ピリオドＴｐｒｄ内に優先プロセスｋが要求するデータ量を表す。この要求データ量を１回の読み出しデータサイズＤａｃｓで割ることにより、要求スループットＢｋを実現するために必要な取得数Ａｋを算出することができる。つまり、スケジューリング決定部３０は、優先プロセスｋに関して、要求スループットＢｋが得られるようにそれぞれの待ちキューからの取得数Ａｋを決定する。尚、実際には取得数Ａｋは整数である必要があるため、式（３）により求まる値の小数点以下を切り捨てることにより取得数Ａｋが決定される。 In equation (3), the numerator represents the amount of data required by the priority process k within the required period Tprd. By dividing this required data amount by the read data size Dacs for one time, the number of acquisitions Ak necessary for realizing the required throughput Bk can be calculated. That is, the scheduling determination unit 30 determines the acquisition number Ak from each waiting queue so that the requested throughput Bk can be obtained for the priority process k. Actually, since the acquisition number Ak needs to be an integer, the acquisition number Ak is determined by rounding down the decimal point of the value obtained by the equation (3).

（ステップＳ３５２）
次に、スケジューリング決定部３０は、上記ステップＳ３５１で決定された取得数Ａｋに基づいて、その取得数Ａｋのディスクアクセス要求の処理に必要な時間を算出する。そして、スケジューリング決定部３０は、算出された時間を残り時間Ｔｒｓｔから引くことによって、新たな残り時間Ｔｒｓｔ’を算出する。具体的には、新たな残り時間Ｔｒｓｔ’は、次の式（４）で与えられる。 (Step S352)
Next, the scheduling determination unit 30 calculates the time required for processing the disk access request for the acquired number Ak based on the acquired number Ak determined in step S351. Then, the scheduling determination unit 30 calculates a new remaining time Trst ′ by subtracting the calculated time from the remaining time Trst. Specifically, the new remaining time Trst ′ is given by the following equation (4).

式（４）において、Ｔａｃｓ＿ｋ及びＴｓｅｋ＿ｋはそれぞれ、優先プロセスｋに関する１回あたりの読み出し時間及びファイル内シーク時間である。１回あたりの読み出し時間Ｔａｃｓ＿ｋ及びファイル内シーク時間Ｔｓｅｋ＿ｋは、既出の第１の例と同様に、アクセス時間情報ＡＣＳとシーク時間情報ＳＥＫを参照することによって算出可能である。 In Equation (4), Tacs_k and Tsek_k are a read time and a seek time within a file for the priority process k, respectively. The read time Tacs_k and in-file seek time Tsek_k per one time can be calculated by referring to the access time information ACS and seek time information SEK, as in the first example.

式（４）で示された処理によって、取得数Ａｋのディスクアクセス要求の処理に必要な時間があらかじめ確保される。すなわち、要求ピリオドＴｐｒｄにおいて優先プロセスｋの要求スループットＢｋを実現するために必要な時間があらかじめ確保される。スケジューリング決定部３０は、優先プロセスｋ用の時間をあらかじめ確保することにより、残り時間Ｔｒｓｔを更新していると言える。そして、更新後の残り時間Ｔｒｓｔ’が、ベストエフォートプロセスに割り当て可能な時間となる。 By the process shown in the equation (4), the time necessary for processing the disk access request for the number of acquired Ak is secured in advance. That is, the time necessary for realizing the required throughput Bk of the priority process k in the required period Tprd is secured in advance. It can be said that the scheduling determination unit 30 updates the remaining time Trst by securing the time for the priority process k in advance. The remaining time Trst 'after the update is a time that can be allocated to the best effort process.

（ステップＳ３５３）
後は、スケジューリング決定部３０が、プロセス特性情報ＰＲＯＣを参照して、更新後の残り時間Ｔｒｓｔ’をベストエフォートプロセスｊ間で分配すればよい。ここで、添え字ｊは、プロセスＰ１〜Ｐ４に含まれるベストエフォートプロセスを示す。図１２の例の場合、プロセスＰ３、Ｐ４がベストエフォートプロセスｊである。ベストエフォートプロセスＰ３、Ｐ４に分配される時間に基づいて、残り時間Ｔｒｓｔ’内にそれぞれの待ちキューＱ３、Ｑ４から取り出すことができるディスクアクセス要求の数Ａｊを決定することができる。 (Step S353)
Thereafter, the scheduling determination unit 30 may distribute the remaining time Trst ′ after the update among the best effort processes j with reference to the process characteristic information PROC. Here, the subscript j indicates the best effort process included in the processes P1 to P4. In the example of FIG. 12, the processes P3 and P4 are the best effort process j. Based on the time distributed to the best effort processes P3 and P4, the number Aj of disk access requests that can be taken out from the respective waiting queues Q3 and Q4 within the remaining time Trst ′ can be determined.

例えば図１２に示されるように、ベストエフォートプロセスに対して要求スループットの重み係数が指定されている場合を考える。この場合、スケジューリング決定部３０は、この重み係数に応じて残り時間Ｔｒｓｔ’を分配し、得られるスループットにその重み係数が反映されるように取得数Ａｊを決定する。具体的には、既出の第１の例と同様に、スケジューリング決定部３０は、１回あたりの読み出し時間Ｔａｃｓ＿ｊ及びファイル内シーク時間Ｔｓｅｋ＿ｊを算出する。また、スケジューリング決定部３０は、プロセス特性情報ＰＲＯＣを参照し、ベストエフォートプロセスｊに指定された重み係数Ｗｊを取得する。この場合、ベストエフォートプロセスｊに関する取得数Ａｊは、次の式（５）で与えられる。 For example, as shown in FIG. 12, consider a case where a weighting factor for requested throughput is specified for the best effort process. In this case, the scheduling determination unit 30 distributes the remaining time Trst ′ according to the weighting factor, and determines the acquisition number Aj so that the weighting factor is reflected in the obtained throughput. Specifically, as in the first example described above, the scheduling determination unit 30 calculates a read time Tacs_j and an in-file seek time Tsek_j per time. In addition, the scheduling determination unit 30 refers to the process characteristic information PROC and acquires the weighting factor Wj specified for the best effort process j. In this case, the acquisition number Aj regarding the best effort process j is given by the following equation (5).

式（５）は、既出の式（２）における残り時間Ｔｒｓｔを更新後の残り時間Ｔｒｓｔ’で置換し、対象をベストエフォートプロセスｊだけに限定したものに相当する。尚、実際には取得数Ａｊは整数である必要があるため、式（５）により求まる値の小数点以下を切り捨てることにより取得数Ａｊが決定される。 Equation (5) corresponds to a case where the remaining time Trst in the foregoing equation (2) is replaced with the updated remaining time Trst 'and the target is limited to only the best effort process j. Actually, since the acquisition number Aj needs to be an integer, the acquisition number Aj is determined by rounding down the decimal point of the value obtained by the equation (5).

このように、スケジューリング決定部３０は、残り時間Ｔｒｓｔを優先プロセスとベストエフォートプロセスのそれぞれに別々に分配し、取得数Ａｋ、Ａｊを別々に算出する。取得数Ａｋ、Ａｊに従えば、優先プロセスｋに要求されるスループットが実現され、ベストエフォートプロセスｊに指定された要求スループットの比率が実現される。すなわち、それぞれのプロセスに関して得られるスループットに重み付けが反映されるように、取得数Ａｋ、Ａｊが決定されている。 As described above, the scheduling determination unit 30 separately distributes the remaining time Trst to each of the priority process and the best effort process, and calculates the acquisition numbers Ak and Aj separately. According to the acquisition numbers Ak and Aj, the throughput required for the priority process k is realized, and the ratio of the requested throughput specified for the best effort process j is realized. That is, the acquisition numbers Ak and Aj are determined so that weighting is reflected in the throughput obtained for each process.

以上に説明されたように、ステップＳ３５０では、スケジューリング決定部３０は、残り時間ＴｒｓｔをプロセスＰ１〜Ｐ４間で適宜分配することによって、それぞれのプロセスＰ１〜Ｐ４に関するディスクアクセス要求の取得数Ａｉを決定することができる。 As described above, in step S350, the scheduling determination unit 30 determines the number of acquired disk access requests Ai for each process P1 to P4 by appropriately distributing the remaining time Trst among the processes P1 to P4. can do.

ステップＳ３１０で決定された「順番」とステップＳ３５０で決定された「取得数Ａｉ」が、要求ピリオドＴｐｒｄ内のスケジューリング規則を示す「スケジューリングパラメータ」となる。スケジューリング決定部３０は、要求ピリオドＴｐｒｄ内に全プロセスＰ１〜Ｐ４が少なくとも１回ディスクアクセスを行い、且つ、プロセスＰ１〜Ｐ４のそれぞれに関するスループットの合計ができるだけ大きくなるように、スケジューリングパラメータを決定することができている。重み付けパラメータが与えられている場合には、スケジューリング決定部３０は、それぞれのプロセスＰ１〜Ｐ４に関するスループットにその重み付けが反映されるようにスケジューリングパラメータを決定することができる。 The “order” determined in step S310 and the “number of acquisitions Ai” determined in step S350 become “scheduling parameters” indicating the scheduling rule in the request period Tprd. The scheduling determination unit 30 determines scheduling parameters so that all processes P1 to P4 perform disk access at least once within the requested period Tprd, and the total throughput for each of the processes P1 to P4 is as large as possible. Is done. When the weighting parameter is given, the scheduling determination unit 30 can determine the scheduling parameter so that the weight is reflected in the throughput regarding each of the processes P1 to P4.

決定されたスケジューリングパラメータに従えば、全てのプロセスは所定の要求ピリオドＴｐｒｄ内に少なくとも１回ディスクアクセスを実行することができ、且つ、与えられた条件下でスループットは最大限大きくなる。つまり、アプリケーションのタイムアウト等の発生を防止しつつ、スループットを向上させることが可能となる。 According to the determined scheduling parameters, all processes can perform disk access at least once within a given request period Tprd, and throughput is maximized under given conditions. That is, it is possible to improve throughput while preventing occurrence of an application timeout or the like.

以上、本発明の実施の形態が添付の図面を参照することにより説明された。但し、本発明は、上述の実施の形態に限定されず、要旨を逸脱しない範囲で当業者により適宜変更され得る。 The embodiments of the present invention have been described above with reference to the accompanying drawings. However, the present invention is not limited to the above-described embodiments, and can be appropriately changed by those skilled in the art without departing from the scope of the invention.

図１は、本発明の課題を説明するためのＩ／Ｏスケジューラの概念図である。FIG. 1 is a conceptual diagram of an I / O scheduler for explaining the problem of the present invention. 図２は、本発明の実施の形態に係る計算機システムの構成例を示すブロック図である。FIG. 2 is a block diagram showing a configuration example of the computer system according to the embodiment of the present invention. 図３は、本実施の形態におけるプロセス特性情報の一例を示すテーブルである。FIG. 3 is a table showing an example of process characteristic information in the present embodiment. 図４Ａは、本実施の形態におけるファイル位置情報の一例を示すテーブルである。FIG. 4A is a table showing an example of file position information in the present embodiment. 図４Ｂは、本実施の形態におけるファイル位置情報の他の例を示すテーブルである。FIG. 4B is a table showing another example of file position information in the present embodiment. 図５Ａは、本実施の形態におけるシーク時間情報の一例を示すグラフである。FIG. 5A is a graph showing an example of seek time information in the present embodiment. 図５Ｂは、本実施の形態におけるシーク時間情報の他の例を示すテーブルである。FIG. 5B is a table showing another example of seek time information in the present embodiment. 図６Ａは、本実施の形態におけるアクセス時間情報の一例を示すグラフである。FIG. 6A is a graph showing an example of access time information in the present embodiment. 図６Ｂは、本実施の形態におけるアクセス時間情報の他の例を示すテーブルである。FIG. 6B is a table showing another example of access time information in the present embodiment. 図７は、本実施の形態に係るＩ／Ｏスケジューラの機能を概念的に示す機能ブロック図である。FIG. 7 is a functional block diagram conceptually showing the functions of the I / O scheduler according to the present embodiment. 図８は、本実施の形態に係るスケジューリング処理を示すフローチャートである。FIG. 8 is a flowchart showing the scheduling process according to the present embodiment. 図９は、本実施の形態におけるスケジューリングパラメータの決定方法を示すフローチャートである。FIG. 9 is a flowchart showing a method for determining a scheduling parameter in the present embodiment. 図１０Ａは、アクセス対象ファイルに対するアクセス順序の一例を示す概念図である。FIG. 10A is a conceptual diagram illustrating an example of an access order for an access target file. 図１０Ｂは、アクセス対象ファイルに対するアクセス順序の他の例を示す概念図である。FIG. 10B is a conceptual diagram illustrating another example of an access order for an access target file. 図１１は、本実施の形態におけるプロセス特性情報の他の例を示すテーブルである。FIG. 11 is a table showing another example of process characteristic information in the present embodiment. 図１２は、本実施の形態におけるプロセス特性情報の更に他の例を示すテーブルである。FIG. 12 is a table showing still another example of the process characteristic information in the present embodiment. 図１３は、本実施の形態におけるディスクアクセス要求の取得数の決定方法の一例を示すフローチャートである。FIG. 13 is a flowchart illustrating an example of a method for determining the number of acquired disk access requests according to the present embodiment.

Explanation of symbols

１計算機システム
２処理装置
３記憶装置
４ディスク
５入力装置
６出力装置
１０Ｉ／Ｏスケジューラ
２０プロセスモニタ部
３０スケジューリング決定部
４０リクエスト選択部
ＰＲＯＣプロセス特性情報
ＬＯＣファイル位置情報
ＳＥＫシーク時間情報
ＡＣＳアクセス時間情報
Ａｉスケジューリングパラメータ
Ｔｐｒｄ要求ピリオド
Ｔｒｓｔ残り時間 DESCRIPTION OF SYMBOLS 1 Computer system 2 Processing apparatus 3 Storage apparatus 4 Disk 5 Input apparatus 6 Output apparatus 10 I / O scheduler 20 Process monitor part 30 Scheduling determination part 40 Request selection part PROC Process characteristic information LOC File position information SEK Seek time information ACS Access time information Ai Scheduling parameter Tprd Request period Trst Remaining time

Claims

An I / O scheduler that causes a computer to execute scheduling processing of a disk access request issued from each of the plurality of processes when a plurality of processes access the disk,
The scheduling process includes
(A) reading process characteristic information indicating an access target file accessed by each of the plurality of processes and file position information indicating a position of a file recorded on the disk from a storage device;
(B) storing disk access requests from the plurality of processes in each of a plurality of waiting queues;
(C) Referring to the process characteristic information and the file position information, the order of the plurality of wait queues from which disk access requests are taken out so that the access target file is sequentially accessed in the arrangement order on the disk. A step to determine;
(D) determining the number of disk access requests to be retrieved from each of the plurality of wait queues within the predetermined time period so that all of the plurality of processes perform disk access at least once within a predetermined time period; When,
(E) obtaining a disk access request from the plurality of waiting queues according to the determined order and number, and sending the obtained disk access request to the disk.

The I / O scheduler according to claim 1,
The step (A) further includes a step of reading seek time information indicating a correspondence relationship between a seek distance on the disk and a seek time from the storage device,
The step (D)
(D1) referring to the file position information, and calculating a seek distance between the access target files when the access target file is accessed in the determined order;
(D2) referring to the seek time information, calculating a seek time corresponding to the calculated seek distance;
(D3) calculating a remaining time by subtracting the calculated seek time from the predetermined time;
(D4) determining the number of disk access requests to be retrieved from each of the plurality of wait queues within the remaining time by distributing the remaining time among the plurality of processes.

The I / O scheduler according to claim 2, wherein
In the step (D4), the number of disk access requests is determined to be equal among the plurality of processes. I / O scheduler.

The I / O scheduler according to claim 2, wherein
The process characteristic information further indicates a weighting parameter that specifies a weight of throughput required for each of the plurality of processes,
In the step (D4), the number of disk access requests is determined so that the weighting is reflected in the throughput related to each of the plurality of processes.

The I / O scheduler according to claim 4, wherein
The weighting parameter includes a ratio between throughput required for each of the plurality of processes;
In the step (D4), the number of disk access requests is determined so that the ratio is reflected in the throughput related to each of the plurality of processes.

The I / O scheduler according to claim 4, wherein
The plurality of processes include a priority process in which a required throughput is preferentially given, and a best effort process in which a required throughput is given at best effort,
The weighting parameter includes a class indicating whether each of the plurality of processes is the priority process or the best effort process;
The step (D4) includes
(D41) determining the number of disk access requests to be retrieved from the corresponding one of the plurality of waiting queues so that the required throughput is obtained with respect to the priority process;
(D42) updating the remaining time by subtracting the time required to process the determined number of disk access requests from the remaining time;
(D43) determining the number of disk access requests to be retrieved from the corresponding one of the plurality of waiting queues for the best effort process by distributing the updated remaining time among the best effort processes; I / O scheduler including

The I / O scheduler according to claim 6, wherein
The weighting parameter further includes a ratio between throughput required for each of the best effort processes;
In the step (D43), the number of disk access requests is determined such that the ratio is reflected in the throughput for each of the best effort processes.

The I / O scheduler according to claim 1,
The process characteristic information further indicates a weighting parameter that specifies a weight of throughput required for each of the plurality of processes,
In the step (D), the predetermined process is performed so that all of the plurality of processes perform disk access at least once within the predetermined time period, and the weight is reflected in the throughput relating to each of the plurality of processes. The number of disk access requests to be fetched from each of the plurality of waiting queues within a predetermined time is determined. I / O scheduler.

An I / O scheduling method for performing a scheduling process of a disk access request issued from each of the plurality of processes when a plurality of processes access the disk,
(A) reading process characteristic information indicating an access target file accessed by each of the plurality of processes and file position information indicating a position of a file recorded on the disk from a storage device;
(B) storing disk access requests from the plurality of processes in each of a plurality of waiting queues;
(C) Referring to the process characteristic information and the file position information, the order of the plurality of wait queues from which disk access requests are taken out so that the access target file is sequentially accessed in the arrangement order on the disk. A step to determine;
(D) determining the number of disk access requests to be retrieved from each of the plurality of wait queues within the predetermined time period so that all of the plurality of processes perform disk access at least once within a predetermined time period; When,
(E) acquiring disk access requests from the plurality of waiting queues according to the determined order and number, and sending the acquired disk access requests to the disk. An I / O scheduling method.

A disc,
A processing device for executing a plurality of processes for accessing the disk;
An I / O scheduler that is executed by the processing device and performs scheduling processing of a disk access request issued from each of the plurality of processes;
A storage device for storing process characteristic information indicating an access target file to be accessed by each of the plurality of processes, and file position information indicating a position of a file recorded on the disk, and
The I / O scheduler is
Store disk access requests from the plurality of processes in a plurality of wait queues,
Referring to the process characteristic information and the file position information, determine the order of the plurality of wait queues from which disk access requests are taken out so that the access target file is accessed in order in the arrangement order on the disk;
Determining the number of disk access requests to be retrieved from each of the plurality of wait queues within the predetermined time period so that all of the plurality of processes perform disk access at least once within a predetermined time period;
A computer system that acquires a disk access request from the plurality of waiting queues according to the determined order and number, and sends the acquired disk access request to the disk.