JP2008146420A

JP2008146420A - Computer program for optimizing processing sequence of i/o request

Info

Publication number: JP2008146420A
Application number: JP2006333932A
Authority: JP
Inventors: Hiromi Uwada; 弘美宇和田
Original assignee: Nomura Research Institute Ltd
Current assignee: Nomura Research Institute Ltd
Priority date: 2006-12-12
Filing date: 2006-12-12
Publication date: 2008-06-26
Anticipated expiration: 2026-12-12
Also published as: JP4951326B2

Abstract

<P>PROBLEM TO BE SOLVED: To improve the throughput of an I/O control system for optimizing an I/O request processing sequence in the I/O control system. <P>SOLUTION: This computer program includes checking how many and how sequenced I/O requests are lined up in the queue of an I/O control system, and which storage region is pertinent to the I/O destination of each of the I/O requests, and calculating the number of I/O as the number of the I/O requests as the I/O destinations for each of two or more storage regions as the I/O destinations of a plurality of I/O requests lined up in the queue among a plurality of storage regions based on the check result, and rearranging the sequence of the plurality of I/O requests in the queue so that the I/O requests whose I/O destinations belong to the storage regions with the larger number of I/O can be sequenced more earlier. <P>COPYRIGHT: (C)2008,JPO&INPIT

Description

本発明は、Ｉ／Ｏ要求（入出力要求）の処理順序の最適化に関する。 The present invention relates to optimization of the processing order of I / O requests (input / output requests).

例えば、特許文献１には、アプリケーションからのアクセス要求を一元監視し、アプリケーションを追加した場合の予想負荷データを出力することが開示されている。また、特許文献２には、アクセスが局所的に集中する際に、高速書込みできるバッファに一時蓄積して、シーク／サーチ時間を考慮したスケジューリングを行うことが開示されている。また、特許文献３には、［登録時刻＊スケジュールパラメータ］なる評価式を使用してＩ／Ｏ要求の発行順序を決めることが開示されている。さらに、特許文献４には、トランザクション実効効率を最適化すべくＤＢＭＳのアクセスプランを生成することが開示されている。 For example, Patent Document 1 discloses that access requests from applications are centrally monitored and expected load data when an application is added is output. Further, Patent Document 2 discloses that when accesses are concentrated locally, scheduling is performed in consideration of seek / search time by temporarily accumulating in a buffer capable of high-speed writing. Patent Document 3 discloses that the order of issuing I / O requests is determined using an evaluation formula [registration time * schedule parameter]. Further, Patent Document 4 discloses that a DBMS access plan is generated to optimize transaction effective efficiency.

特開２００５−１４９２８３号公報JP 2005-149283 A 特開平１０−１１１７６２号公報Japanese Patent Laid-Open No. 10-111762 特開平１０−２８９１８９号公報Japanese Patent Laid-Open No. 10-289189 特開２００２−３２２４９号公報JP 2002-32249 A

ところで、Ｉ／Ｏ要求発行元からＩ／Ｏ要求を受信して一旦キューに蓄積し該キューから出力されたＩ／Ｏ要求について所定の処理が完了した場合に所定の応答をＩ／Ｏ要求発行元に返すＩ／Ｏ制御システムとして、例えば、データベースを管理するソフトウェアであるＤＢＭＳ（DataBase Management System）が知られている。ＤＢＭＳは、アプリケーションからのＩ／Ｏ要求に基づいて、データベースに関する操作（以下、「データベース操作」）を行う。データベース操作とは、例えば、データの書込み（更新）、データの読み出し（参照）、データの削除等である。ＤＢＭＳは、そのＩ／Ｏ要求について所定の処理が完了した場合に、所定の応答を、そのＩ／Ｏ要求の発行元であるアプリケーションに返す。所定の処理としては、例えば、後述する排他制御権の取得、データベース操作の一つとして記憶装置に対してＩ／Ｏコマンドを発行する処理がある。 By the way, when an I / O request is received from an I / O request issuer, is temporarily stored in a queue, and predetermined processing is completed for an I / O request output from the queue, a predetermined response is issued. As an I / O control system to be returned to the original, for example, a DBMS (DataBase Management System) which is software for managing a database is known. The DBMS performs an operation related to a database (hereinafter, “database operation”) based on an I / O request from an application. Database operations include, for example, writing (updating) data, reading (referring) data, and deleting data. When a predetermined process is completed for the I / O request, the DBMS returns a predetermined response to the application that issued the I / O request. As the predetermined processing, for example, there is processing for obtaining an exclusive control right described later and issuing an I / O command to the storage device as one of database operations.

ＤＢＭＳは、一般に、オペレーティングシステム（ＯＳ）上で稼動し、ＯＳから認識される複数のプロセスからなるプロセス群として構成される。以下、ＤＢＭＳにおける個々のプロセスを「ＤＢＭＳプロセス」と言う。複数のＤＢＭＳプロセスは、それぞれ、複数のアプリケーションプロセス（アプリケーションプログラムにおけるプロセス）と通信し、そのアプリケーションプロセスからのＩ／Ｏ要求を処理することになる。このため、複数のＤＢＭＳプロセスが、同一の記憶領域に対して同時にアクセスすることが起こりうる。 The DBMS is generally configured as a process group including a plurality of processes that are operated on an operating system (OS) and recognized by the OS. Hereinafter, each process in the DBMS is referred to as a “DBMS process”. Each of the plurality of DBMS processes communicates with a plurality of application processes (processes in the application program), and processes I / O requests from the application processes. For this reason, a plurality of DBMS processes may simultaneously access the same storage area.

これを防ぐために、ミューテックスなどを応用した排他制御が用いられている。この場合、ある記憶領域に対する排他制御権（言い換えればＩ／Ｏ実行権）を取得できたＤＢＭＳプロセスのみが、該記憶領域に対してＩ／Ｏを実行することができる一方で、同じ記憶領域にＩ／Ｏを実行したい他のＤＢＭＳプロセスは、記憶領域に対応した排他制御権を取得できるまで、その記憶領域に対するＩ／Ｏの実行が待たされることになる。 In order to prevent this, exclusive control using a mutex is used. In this case, only a DBMS process that has acquired an exclusive control right (in other words, an I / O execution right) for a certain storage area can execute I / O for the storage area, while in the same storage area. Another DBMS process that wishes to execute I / O waits for execution of I / O to the storage area until it can acquire the exclusive control right corresponding to the storage area.

一つのＩ／Ｏ要求で複数の記憶領域に対してそれぞれＩ／Ｏが発生する場合があるが、その場合は、それら複数の記憶領域の全てについて排他制御権を取得できたときに、それら複数の記憶領域の各々にＩ／Ｏを行うことが可能になる。このようなＩ／Ｏ要求についての処理の完了、言い換えれば、Ｉ／Ｏ要求を発行してから所定の応答をＤＢＭＳから受信するまでのアプリケーションにとっての待ち時間は、それら複数の記憶領域のうちの最も遅くに終わる処理に依存する。 There may be cases where I / O is generated for each of a plurality of storage areas with one I / O request. In this case, when the exclusive control right can be acquired for all of the plurality of storage areas, I / O can be performed on each of the storage areas. Completion of processing for such an I / O request, in other words, the waiting time for an application from issuing an I / O request to receiving a predetermined response from the DBMS is the number of storage areas. Depends on the process that ends most recently.

通常、複数のアプリケーションプロセスからそれぞれ発行されたＩ／Ｏ要求は、ＤＢＭＳで管理されているキュー（典型的にはＦＩＦＯ（First In First Out）形式のバッファ）に一旦蓄積され、そのキューから出力された順序で処理されるが、このような環境下で、Ｉ／Ｏ要求を受け付けた順番通りに処理していくと、ＤＢＭＳのスループットの劣化につながるおそれがある。 Normally, I / O requests issued from multiple application processes are temporarily stored in a queue managed by the DBMS (typically FIFO (First In First Out) buffer) and output from the queue. However, if the processing is performed in the order in which the I / O requests are received in such an environment, the throughput of the DBMS may be deteriorated.

以上のような問題点は、ＤＢＭＳに限らず、他種の前述したＩ／Ｏ制御システムにも同様に存在し得る。 The problems as described above can be present not only in the DBMS but also in other types of I / O control systems described above.

従って、本発明の目的は、Ｉ／Ｏ制御システムでのＩ／Ｏ要求処理順序を最適化することでそのＩ／Ｏ制御システムのスループットを向上させることである。 Therefore, an object of the present invention is to improve the throughput of the I / O control system by optimizing the I / O request processing order in the I / O control system.

Ｉ／Ｏ制御システムのキューにどんな順序でどれだけの数のＩ／Ｏ要求が並んでいて各Ｉ／Ｏ要求のＩ／Ｏ先はどの記憶領域であるかのチェックを行う。そして、そのチェックの結果を基に、複数の記憶領域のうちの、キュー内に並んでいる複数のＩ／Ｏ要求でＩ／Ｏ先とされる二以上の記憶領域の各々について、Ｉ／Ｏ先とされるＩ／Ｏ要求の数であるＩ／Ｏ数を算出し、Ｉ／Ｏ数が多い記憶領域をＩ／Ｏ先とするＩ／Ｏ要求ほど早い順番とするようキュー内での複数のＩ／Ｏ要求の順序を並び替える。 It is checked which number of I / O requests are arranged in what order in the queue of the I / O control system and which storage area is the I / O destination of each I / O request. Then, based on the result of the check, I / O is performed for each of two or more storage areas that are set as I / O destinations in a plurality of I / O requests arranged in the queue among the plurality of storage areas. The number of I / Os, which is the number of I / O requests that are the destination, is calculated. The order of the I / O requests is rearranged.

キューで蓄積されたまま残っている複数のＩ／Ｏ要求でＩ／Ｏ先とされる二以上の記憶領域のそれぞれについてＩ／Ｏ数を算出し、該算出されたＩ／Ｏ数に着目した並び替え、具体的には、Ｉ／Ｏ数が多い記憶領域をＩ／Ｏ先とするＩ／Ｏ要求ほど優先的に先に処理するような並び替えを行うことにより、Ｉ／Ｏ制御システムのスループットの向上が期待できる。 The number of I / Os is calculated for each of two or more storage areas that are the I / O destinations of a plurality of I / O requests that remain accumulated in the queue, and attention is paid to the calculated number of I / Os. Rearrangement, specifically, rearrangement is performed so that I / O requests that have a storage area with a large number of I / Os as the I / O destination are processed first. Increased throughput can be expected.

この技術は、Ｉ／Ｏ制御システムに適用することができる。ここで言う「Ｉ／Ｏ制御システム」は、ＤＢＭＳに代表されるコンピュータプログラムであっても良いし、該コンピュータプログラムを読み込んで実行することによりＩ／Ｏ制御機能（例えばＤＢＭＳとしての機能）を発揮するコンピュータマシン（例えばサーバマシン）であっても良い。 This technique can be applied to an I / O control system. The “I / O control system” mentioned here may be a computer program typified by a DBMS, and exhibits an I / O control function (for example, a function as a DBMS) by reading and executing the computer program. It may be a computer machine (for example, a server machine).

また、“Ｉ／Ｏ”とは、入出力の略である。故に、Ｉ／Ｏ要求は、入力要求であっても良いし、書込み要求であっても良い。また、Ｉ／Ｏ要求は、直接的に入力或いは出力を要求するものに限らず、入力或いは出力が行われることになる要求であってもよい。例えば、Ｉ／Ｏ制御システムがＤＢＭＳであって、該ＤＢＭＳが、prepare（ＳＱＬ文）を受信しそれに応答して所定の準備ができた後にcommitを受信するタイプのＤＢＭＳである場合には、prepare及びcommitのうちの少なくとも一方が上述したＩ／Ｏ要求となる。 “I / O” is an abbreviation for input / output. Therefore, the I / O request may be an input request or a write request. Further, the I / O request is not limited to a request for direct input or output, but may be a request for input or output. For example, when the I / O control system is a DBMS and the DBMS is a DBMS of a type that receives a prepare (SQL statement) and receives a commit after receiving a preparation (SQL statement) in response thereto, prepare. And at least one of the commits is the I / O request described above.

また、上記チェックは、定期的に或いは不定期的に行うことができる。Ｉ／Ｏ要求の順序の並び替えは、典型的には、チェックが行われる都度に行われることになるが、それに限らず、複数回のチェックが行われてから一回の並び替えが行われる方式であっても良い。 The check can be performed regularly or irregularly. The rearrangement of the order of I / O requests is typically performed each time a check is performed, but is not limited to this, and the rearrangement is performed once after a plurality of checks are performed. It may be a method.

＜第一の実施形態＞ <First embodiment>

以下、本発明の第一の実施形態に係るＩ／Ｏ要求の処理順序の最適化方式について、図面を参照して説明する。なお、以下の説明では、Ｉ／Ｏ制御システムは、ＤＢＭＳ（特に、リレーショナルデータベース管理システム（ＲＤＢＭＳ））であるとする。また、以下、説明を分かり易くするため、一つのＩ／Ｏ要求は、一つのトランザクションであるとし、一つのアプリケーションプロセスから送信され、オペレーティングシステム（ＯＳ）からＣＰＵ時間が割り当てられた一つのＤＢＭＳプロセスによってキューから読み出されて処理されるものとする。 Hereinafter, a method for optimizing the processing order of I / O requests according to the first embodiment of the present invention will be described with reference to the drawings. In the following description, it is assumed that the I / O control system is a DBMS (particularly, a relational database management system (RDBMS)). In addition, hereinafter, for the sake of easy understanding, it is assumed that one I / O request is one transaction, one DBMS process that is transmitted from one application process and assigned CPU time from the operating system (OS). Is read from the queue and processed.

図１は、本実施形態に係るシステム全体の構成を示す図である。 FIG. 1 is a diagram showing a configuration of the entire system according to the present embodiment.

サーバ１０４と、クライアント１０１（ここでは、クライアント（１）１０１ａ、クライアント（２）１０１ｂ、クライアント（３）１０１ｃ）とが、それぞれネットワーク１０３に接続されている。 A server 104 and a client 101 (here, client (1) 101a, client (2) 101b, client (3) 101c) are connected to the network 103, respectively.

サーバ１０４は、ＤＢＭＳ１０７を備えたデータベースサーバとして機能する。各クライアント１０１のアプリケーション１０２（ここでは、アプリケーション（１）１０２ａ、アプリケーション（２）１０２ｂ、アプリケーション（３）１０２ｃ）における一のプロセス（アプリケーションプロセス）は、ＤＢＭＳ１０７に対して、データベース操作の内容を例えばＳＱＬで記述した要求（以下、「Ｉ／Ｏ要求」）を送信する。Ｉ／Ｏ要求を受信したＤＢＭＳ１０７は、その受信したＩ／Ｏ要求を待ち受けキュー１０５（ここではＦＩＦＯ（First-In First-Out）型のバッファ）に一時蓄積する。ＤＢＭＳ１０７のプロセス群１０７１のうちの、ＯＳからＣＰＵ時間が割り当てられた一のプロセス（ＤＢＭＳプロセス）が、待ち受けキュー１０５の先頭にあるＩ／Ｏ要求を読み出して解釈して、そのＩ／Ｏ要求に対応したデータベース操作を行う。そのデータベース操作により、ストレージ１０８に対してＩ／Ｏコマンドが発行される。なお、ここで、ストレージ１０８は、例えば、一以上のディスク型記憶装置（以下、単に「ディスク」と言う）１０８１を備えた装置とすることができる。ストレージ１０８では、複数のディスク１０８１でＲＡＩＤ（Redundant Array of Independent (or Inexpensive) Disks）が構成されていても良いし、一以上のディスク１０８１の記憶空間を利用して論理ユニット（論理的な記憶装置）が用意されていても良い。ディスク１０８１や論理ユニットに代えて他種の二次記憶装置が採用されても良い（一次記憶装置はサーバ１０４内の図示しないメモリ）。つまり、二次記憶装置は、物理的な記憶装置であっても良いし論理的な記憶装置であっても良い。 The server 104 functions as a database server provided with the DBMS 107. One process (application process) in the application 102 (here, application (1) 102a, application (2) 102b, application (3) 102c) of each client 101 sends the contents of the database operation to the DBMS 107, for example, SQL. The request described below (hereinafter referred to as “I / O request”) is transmitted. Receiving the I / O request, the DBMS 107 temporarily stores the received I / O request in the waiting queue 105 (here, a FIFO (First-In First-Out) type buffer). Of the process group 1071 of the DBMS 107, one process (DBMS process) to which the CPU time is allocated from the OS reads and interprets the I / O request at the head of the waiting queue 105, and converts it to the I / O request. Perform the corresponding database operation. By this database operation, an I / O command is issued to the storage 108. Here, the storage 108 can be, for example, an apparatus including one or more disk storage devices (hereinafter simply referred to as “disks”) 1081. In the storage 108, a plurality of disks 1081 may form a RAID (Redundant Array of Independent (or Inexpensive) Disks), or a logical unit (logical storage device) using the storage space of one or more disks 1081. ) May be prepared. Instead of the disk 1081 or the logical unit, another type of secondary storage device may be employed (the primary storage device is a memory (not shown) in the server 104). That is, the secondary storage device may be a physical storage device or a logical storage device.

ＤＢＭＳ１０７では、複数の論理的な記憶領域、並びに、それら複数の論理的な記憶領域の各々とストレージ１０８における物理的な記憶領域との対応付けを表す情報が管理されている。ここでは、論理的な記憶領域を、ブロック１０７４とし、複数のブロック１０７４と複数の物理的な記憶領域との対応付けを表した情報を、ブロック索引１０７３とする。Ｉ／Ｏ要求には、例えば、Ｉ／Ｏ先を表す情報（以下、Ｉ／Ｏ先情報）として、物理的な記憶領域を表す情報（以下、物理領域情報、例えばアドレス）が記録されていて、ＤＢＭＳ１０７では、その物理領域情報を用いてブロック索引１０７３を参照することで、その物理領域情報が表す物理的な記憶領域に対応したブロック１０７４（つまり、Ｉ／Ｏ先となるブロック１０７４）を特定することができる。一つのブロック１０７４に対応付けられる一つの物理的な記憶領域は、ブロック索引１０７３においてどのように定義されても良い。具体的には、ストレージ１０８は、いくつかの物理的な記憶領域に分離され、その分離された記憶領域ごとにデータベース操作が行われる。この分離された記憶領域は、排他制御の単位となるものであり、以下、この領域を排他制御単位と呼ぶ。排他制御単位は、様々な態様をとることができ、例えば、ディスク１０８１、ファイル１０８２、レコード１０８３、及び論理ユニットのうちの少なくとも一つとしても良い（その際、例えば、複数のディスク１０８１をまたぐ形で定義されても良いし、ディスク１０８１の一部として定義されても良い）。このような排他制御単位に一つのブロック１０７４が対応付けられているため、ブロック１０７４も、排他制御単位となる。 The DBMS 107 manages a plurality of logical storage areas and information indicating the correspondence between each of the plurality of logical storage areas and a physical storage area in the storage 108. Here, the logical storage area is the block 1074, and the information representing the correspondence between the plurality of blocks 1074 and the plurality of physical storage areas is the block index 1073. In the I / O request, for example, information indicating a physical storage area (hereinafter, physical area information, for example, an address) is recorded as information indicating the I / O destination (hereinafter, I / O destination information). The DBMS 107 identifies the block 1074 corresponding to the physical storage area represented by the physical area information (that is, the block 1074 that is the I / O destination) by referring to the block index 1073 using the physical area information. can do. One physical storage area associated with one block 1074 may be defined in any way in the block index 1073. Specifically, the storage 108 is separated into several physical storage areas, and a database operation is performed for each of the separated storage areas. This separated storage area is a unit of exclusive control, and this area is hereinafter referred to as an exclusive control unit. The exclusive control unit may take various forms, and may be, for example, at least one of the disk 1081, the file 1082, the record 1083, and the logical unit (in this case, for example, a form that spans a plurality of disks 1081). Or may be defined as part of the disk 1081). Since one block 1074 is associated with such an exclusive control unit, the block 1074 is also an exclusive control unit.

この実施形態では、例えば以下の流れで、一つのＩ／Ｏ要求（以下、この説明において、「対象Ｉ／Ｏ要求」と言う）が完結する。
（ステップ１）アプリケーションプロセスがprepare文（ＳＱＬ）を発行する。
（ステップ２）ＤＢＭＳ１０７がアプリケーションプロセスからの該ＳＱＬを解釈して、対象Ｉ／Ｏ要求に変換した後、待ち受けキュー１０５に一旦蓄積する。
（ステップ３）アプリケーションプロセスがcommit文（ＳＱＬ）を発行する。
（ステップ４）待ち受けキュー１０５に蓄積された対象Ｉ／Ｏ要求のcommitフラグを有効にする。
（ステップ５）待ち受けキュー１０５の前方にあるＩ／Ｏ要求のなかで、commitフラグが有効になっているものがＤＢＭＳプロセスによって取り出される都度、それより後方に蓄積されている対象Ｉ／Ｏ要求の順番が一つ繰り上がる。このような処理が繰り返されることで、commitフラグが有効になっている対象Ｉ／Ｏ要求が取り出される可能性が高まり、やがて、ＤＢＭＳプロセスによって、対象Ｉ／Ｏ要求が待ち受けキュー１０５から取り出される。ＤＢＭＳプロセスが、対象Ｉ／Ｏ要求に含まれている情報とブロック索引１０７３とを用いて、Ｉ／Ｏ先のブロック１０７４（以下、対象ブロック１０７４）を特定し、対象ブロック１０７４に対応するミューテックス１０７２の取得を試みる。取得できなければ、待ちとなる。また、対象ブロック１０７４が複数個ある場合には、それら複数の対象ブロック１０７４にそれぞれ対応した複数のミューテックス１０７２を全て取得できるまで、待ちとなる。
（ステップ６）ミューテックス１０７２を取得できたならば、ＤＢＭＳプロセスが、ストレージ１０８内の、対象ブロック１０７４に対応した物理的な記憶領域を有する二次記憶装置に対し、該物理的な記憶領域を指定したＩ／Ｏコマンドを送信する。該Ｉ／Ｏコマンドに対して所定のコマンド応答（例えばＩ／Ｏ成功）を受信した場合、ＤＢＭＳプロセスは、ミューテックス１０７２を解放し、対象Ｉ／Ｏ要求に対する所定の要求応答（例えば成功）を、アプリケーションプロセスに通知する。
（ステップ７）アプリケーションプロセスが、成功を受信する。 In this embodiment, for example, one I / O request (hereinafter referred to as “target I / O request” in this description) is completed in the following flow.
(Step 1) The application process issues a prepare statement (SQL).
(Step 2) The DBMS 107 interprets the SQL from the application process, converts it into a target I / O request, and temporarily stores it in the standby queue 105.
(Step 3) The application process issues a commit statement (SQL).
(Step 4) The commit flag of the target I / O request accumulated in the waiting queue 105 is validated.
(Step 5) Every time an I / O request in the front of the waiting queue 105 with the commit flag being valid is taken out by the DBMS process, the target I / O request accumulated behind it is retrieved. The order goes up by one. By repeating such processing, the possibility that the target I / O request with the commit flag being valid is extracted is increased. Eventually, the target I / O request is extracted from the standby queue 105 by the DBMS process. The DBMS process uses the information included in the target I / O request and the block index 1073 to identify an I / O destination block 1074 (hereinafter referred to as the target block 1074), and a mutex 1072 corresponding to the target block 1074. Try to get. If it cannot be obtained, it will be waiting. If there are a plurality of target blocks 1074, the process waits until all of the plurality of mutexes 1072 respectively corresponding to the plurality of target blocks 1074 can be acquired.
(Step 6) If the mutex 1072 can be acquired, the DBMS process designates the physical storage area for the secondary storage device having the physical storage area corresponding to the target block 1074 in the storage 108. Sent the I / O command. When a predetermined command response (for example, I / O success) is received in response to the I / O command, the DBMS process releases the mutex 1072, and receives a predetermined request response (for example, success) for the target I / O request. Notify the application process.
(Step 7) The application process receives success.

以上の流れにおいて、（ステップ１）〜（ステップ７）までが、アプリケーションプログラム１０２にとっての待ち時間である。なお、前述したコマンド応答とは、ＤＢＭＳが、Ｉ／Ｏコマンドの送信先の二次記憶装置から受信する、該Ｉ／Ｏコマンドに対する応答である。一方、前述した要求応答とは、ＤＢＭＳが、Ｉ／Ｏ要求の送信元のアプリケーションに送信する、該Ｉ／Ｏ要求に対する応答である。 In the above flow, steps (step 1) to (step 7) are waiting times for the application program 102. The command response described above is a response to the I / O command that the DBMS receives from the secondary storage device that is the transmission destination of the I / O command. On the other hand, the above-described request response is a response to the I / O request that the DBMS transmits to the application that is the transmission source of the I / O request.

本実施形態では、待ち受けキュー１０５に蓄積されている複数のＩ／Ｏ要求についての全体的な待ち時間を短縮することを図るべく、最適化部１０６が備えられる。最適化部１０６は、待ち受けキュー１０５（典型的には、ＦＩＦＯ（First-In First-Out）型のバッファ）で並んでいる複数のＩ／Ｏ要求と、それら複数のＩ／Ｏ要求から特定される複数のブロックとを基にしたマトリクスを作成し、該マトリクス上での並び替えを実行することで、上記の全体的な待ち時間を短縮することができる。最適化部１０６は、ＤＢＭＳ１０７同様、サーバ１０４のＣＰＵ（図示せず）で実行されるコンピュータプログラムである。最適化部１０６は、ＤＢＭＳ１０７に組み込まれていても良いし、ＤＢＭＳ１０７と通信可能にＤＢＭＳ１０７の外に存在していても良い。 In the present embodiment, an optimization unit 106 is provided in order to reduce the overall waiting time for a plurality of I / O requests accumulated in the standby queue 105. The optimization unit 106 is specified from a plurality of I / O requests arranged in the standby queue 105 (typically, a FIFO (First-In First-Out) buffer) and the plurality of I / O requests. By creating a matrix based on a plurality of blocks and performing rearrangement on the matrix, the overall waiting time can be shortened. The optimization unit 106 is a computer program that is executed by the CPU (not shown) of the server 104, like the DBMS 107. The optimization unit 106 may be incorporated in the DBMS 107 or may exist outside the DBMS 107 so as to be communicable with the DBMS 107.

図２は、最適化部１０６の機能の説明図である。 FIG. 2 is an explanatory diagram of the function of the optimization unit 106.

最適化部１０６は、キュー監視蓄積部１０６１と、Ｉ／Ｏ性能蓄積部１０６２と、Ｉ／Ｏ行列置換部１０６３とを備える。 The optimization unit 106 includes a queue monitoring storage unit 1061, an I / O performance storage unit 1062, and an I / O matrix replacement unit 1063.

キュー監視蓄積部１０６１は、待ち受けキュー１０５の状況を定期的にチェックし、該状況を表す情報（以下、「キュー状況情報」）２６１１を、所定の記憶域２６１（例えばサーバ１０４における図示しないメモリ上の領域）に蓄積する。キュー状況情報２６１１としては、例えば、待ち受けキュー１０５に蓄えられているＩ／Ｏ要求の数、順序、各Ｉ／Ｏ要求の到着時刻、commitフラグ、要求内容等がある。要求内容には、書込み又は読込みといったデータベース操作の種別や、Ｉ／Ｏ先のブロック１０７４等の情報が含まれる。 The queue monitoring accumulation unit 1061 periodically checks the status of the standby queue 105, and stores information (hereinafter referred to as “queue status information”) 2611 indicating the status on a predetermined storage area 261 (for example, on a memory (not shown) in the server 104). Area). The queue status information 2611 includes, for example, the number and order of I / O requests stored in the standby queue 105, the arrival time of each I / O request, a commit flag, and request contents. The request content includes information such as the type of database operation such as writing or reading, the I / O destination block 1074, and the like.

Ｉ／Ｏ性能蓄積部１０６２は、二次記憶装置ごとに、その二次記憶装置に対応するＩ／Ｏ性能を取得する。ここで言うＩ／Ｏ性能とは、例えば、ＤＢＭＳプロセスによって二次記憶装置にＩ／Ｏコマンドが発行されてから所定の応答がその二次記憶装置から戻ってくるまでの時間長である。Ｉ／Ｏ性能蓄積部１０６２は、取得したＩ／Ｏ性能を表す数値（Ｉ／Ｏ性能値）を用いて、該Ｉ／Ｏ性能の二次記憶装置に対応したＩ／Ｏ性能情報２６１２を更新する。Ｉ／Ｏ性能情報２６１２は、例えば、所定の記憶域２６１に二次記憶装置毎に用意される。Ｉ／Ｏ性能情報２６１２には、各時点で取得されたＩ／Ｏ性能値と、第一の時点から第二の時点までの複数のＩ／Ｏ性能値の統計（Ｉ／Ｏ性能統計）とが含まれる。Ｉ／Ｏ性能統計は、例えば、Ｉ／Ｏ性能蓄積部１０６２によって求められる。Ｉ／Ｏ性能統計は、典型的には、標準正規分布に従うとみなすことができる。このＩ／Ｏ性能統計は、例えば、この第一実施形態に係るシステムで業務を開始する前のテスト稼動において、各ブロックサイズが適切かどうかを管理者が判断するのに有用である。例えば、管理者は、同種の二次記憶装置（例えば、データ転送速度やＩ／Ｏ速度が同程度である二次記憶装置）のＩ／Ｏ性能統計を参照することにより、同種の二次記憶装置であるにも関わらずＩ／Ｏ性能（例えば、平均、標準偏差）に想定誤差以上の差があるか否かを判断し、そのような差がある場合には、Ｉ／Ｏ性能の低い方の二次記憶装置に対応付けるブロックサイズを、Ｉ／Ｏ性能の高い方の二次記憶装置に対応付けるブロックサイズに比して、小さくする（言い換えれば、一つのブロックに割り当てる物理的な記憶領域のサイズを小さくする）といったチューニングを施すことができる。上記の判断及びチューニングは、コンピュータプログラム（例えば最適化部１０６に含まれるプログラム）によって自動的に行われても良い。 The I / O performance accumulating unit 1062 acquires the I / O performance corresponding to the secondary storage device for each secondary storage device. The I / O performance referred to here is, for example, the length of time from when an I / O command is issued to the secondary storage device by the DBMS process until a predetermined response is returned from the secondary storage device. The I / O performance storage unit 1062 updates the I / O performance information 2612 corresponding to the secondary storage device of the I / O performance, using the acquired numerical value (I / O performance value) representing the I / O performance. To do. The I / O performance information 2612 is prepared for each secondary storage device in a predetermined storage area 261, for example. The I / O performance information 2612 includes an I / O performance value acquired at each time point, a plurality of I / O performance value statistics (I / O performance statistics) from the first time point to the second time point, and Is included. The I / O performance statistics are obtained by the I / O performance accumulation unit 1062, for example. I / O performance statistics can typically be considered to follow a standard normal distribution. This I / O performance statistic is useful, for example, for an administrator to determine whether each block size is appropriate in a test operation before starting business in the system according to the first embodiment. For example, the administrator refers to the I / O performance statistics of the same type of secondary storage device (for example, a secondary storage device having the same data transfer rate and I / O rate), thereby allowing the same type of secondary storage device. It is determined whether the I / O performance (for example, average, standard deviation) has a difference greater than the expected error despite being a device, and if there is such a difference, the I / O performance is low. The block size associated with the secondary storage device is made smaller than the block size associated with the secondary storage device with higher I / O performance (in other words, the physical storage area allocated to one block Tuning, such as reducing the size). The above determination and tuning may be automatically performed by a computer program (for example, a program included in the optimization unit 106).

Ｉ／Ｏ行列置換部１０６３は、キュー監視蓄積部１０６１が蓄積したキュー状況情報に基づいて、待ち受けキュー１０５で並んでいるＩ／Ｏ要求の順序を、例えば図示のように並び替える。 Based on the queue status information accumulated by the queue monitoring accumulation unit 1061, the I / O matrix substitution unit 1063 rearranges the order of I / O requests arranged in the standby queue 105, for example, as illustrated.

以下、図３及び図４を参照して、この並び替え処理の詳細を説明する。なお、各図において、“Ｒ”は、Ｒｅｑｕｅｓｔ（要求）の頭文字を意味し、“Ｂ”は、Ｂｌｏｃｋ（ブロック）の頭文字を意味する。また、以下では簡単のために、待ち受けキュー１０５には、ある時点のスナップショットとして７つのＩ／Ｏ要求Ｒ１〜Ｒ７が蓄えられていて、Ｉ／Ｏ要求Ｒ１〜Ｒ７は、全て、commitフラグが有効になっており、各々のＩ／Ｏ要求は、ブロックＢ１〜Ｂ７の範囲にＩ／Ｏを発生せしめるという前提で説明する。 Details of the rearrangement process will be described below with reference to FIGS. 3 and 4. In each figure, “R” means an acronym for Request (request), and “B” means an acronym for Block (block). In the following, for the sake of simplicity, the standby queue 105 stores seven I / O requests R1 to R7 as snapshots at a certain point in time, and all the I / O requests R1 to R7 have a commit flag. Each I / O request is valid and will be described on the assumption that I / O is generated in the range of blocks B1 to B7.

Ｉ／Ｏ行列置換部１０６３が、例えば、キュー状況情報を解釈し、該解釈の結果を基に、図３（ａ）で示されるようなマトリクスを作成する。このマトリクスは、行方向に、待ち受けキュー１０５に蓄えられているＩ／Ｏ要求Ｒ１〜Ｒ７を配列し（つまり、キュー状況情報から特定される順序に従ってＩ／Ｏ要求Ｒ１〜Ｒ７を配列し）、列方向に、ブロックＢ１〜Ｂ７を、それぞれ配列したものである。各行において、その行に対応したＩ／Ｏ要求でＩ／Ｏ先となっているブロックに対応した位置には、所定の値（図では丸印で表している）がセットされる。従って、このマトリクスからは、例えば、Ｉ／Ｏ要求Ｒ１では、ブロックＢ１及びＢ５がＩ／Ｏ先とされていることがわかる。 For example, the I / O matrix replacement unit 1063 interprets the queue status information and creates a matrix as shown in FIG. 3A based on the result of the interpretation. This matrix arranges the I / O requests R1 to R7 stored in the standby queue 105 in the row direction (that is, arranges the I / O requests R1 to R7 according to the order specified from the queue status information), Blocks B1 to B7 are arranged in the column direction. In each row, a predetermined value (represented by a circle in the figure) is set at the position corresponding to the block that is the I / O destination in the I / O request corresponding to that row. Therefore, it can be seen from this matrix that, for example, in the I / O request R1, the blocks B1 and B5 are the I / O destinations.

このようなマトリクスを作成した後、列の置換処理と、行の置換処理とが行われる。 After creating such a matrix, column replacement processing and row replacement processing are performed.

列の置換処理は、以下の通りである。すなわち、Ｉ／Ｏ行列置換部１０６３が、各ブロック毎のＩ／Ｏ数（丸印の数）を計算する。そして、Ｉ／Ｏ行列置換部１０６３は、図３（ｂ）に示すように、Ｉ／Ｏ数の多いブロック（列）ほど左に寄せる。この結果、左から順に、Ｂ１、Ｂ６、Ｂ２、Ｂ５、Ｂ４、Ｂ７、Ｂ３となるように並び替えられる。 The column replacement process is as follows. That is, the I / O matrix replacement unit 1063 calculates the number of I / Os (number of circles) for each block. Then, as shown in FIG. 3B, the I / O matrix replacement unit 1063 moves the block (column) with the larger number of I / Os to the left. As a result, they are rearranged in order from the left to B1, B6, B2, B5, B4, B7, and B3.

行の置換処理では、Ｉ／Ｏ数の多いブロックに着目して、着目したブロックをＩ／Ｏ先とするＩ／Ｏ要求（行）の順番がなるべく先になるように並び替えられる。その際、その着目しているブロックよりもＩ／Ｏ数の多いブロックに着目した並び（上詰め）を崩さないようにし、崩れるようであれば、行の置換処理を終える。 In the row replacement processing, attention is paid to blocks having a large number of I / Os, and rearrangement is performed so that the order of I / O requests (rows) with the focused block as an I / O destination is as far as possible. At this time, the arrangement (uploading) focused on the block having a larger number of I / Os than the focused block is not broken, and if it is broken, the row replacement process is finished.

具体的には、例えば、まず、図４（ａ）に示すように、最もＩ／Ｏ数の多いブロックＢ１に着目して、ブロックＢ１がＩ／Ｏ先になっているＩ／Ｏ要求が上位にくるように（出力される順番が先になるように）並び替える。ここでは、Ｉ／Ｏ要求Ｒ２のみがブロックＢ１へのＩ／Ｏを行わないため、Ｉ／Ｏ要求Ｒ２に対応した行は最下位へ移動させられる。 Specifically, for example, as shown in FIG. 4A, first, focusing on the block B1 having the largest number of I / Os, the I / O request in which the block B1 is the I / O destination is higher. Rearrange so that it comes to (so that the output order comes first). Here, since only the I / O request R2 does not perform I / O to the block B1, the row corresponding to the I / O request R2 is moved to the lowest order.

次に、図４（ｂ）に示すように、２番目にＩ／Ｏ数の多いブロックＢ６に着目して、Ｂ１の上詰め状態を崩さないようにして、ブロックＢ６がＩ／Ｏ先になっているＩ／Ｏ要求（行）が上位にくるように並び替える。 Next, as shown in FIG. 4B, paying attention to the block B6 having the second largest number of I / Os, the block B6 becomes the I / O destination so as not to disturb the top-filled state of B1. Rearrange the I / O requests (rows) that are currently in the higher rank.

次に、図４（ｃ）に示すように、３番目にＩ／Ｏ数の多いブロックＢ２に着目して、Ｂ１及びＢ６の上詰め状態を崩さないようにして、ブロックＢ２がＩ／Ｏ先になっているＩ／Ｏ要求（行）が上位にくるように並び替える。 Next, as shown in FIG. 4C, paying attention to the block B2 having the third largest number of I / Os, the block B2 is connected to the I / O destination in such a way that the top-packed state of B1 and B6 is not destroyed. Rearrange the I / O requests (rows) at the top.

次に、４番目にＩ／Ｏ数の多いブロックＢ５に着目して、Ｂ１、Ｂ６及びＢ２の上詰め状態を崩さないようにして、ブロックＢ５がＩ／Ｏ先になっているＩ／Ｏ要求（行）が上位にくるように並び替えることを試みる。しかし、ここでは、ブロックＢ２に着目した上詰め後のマトリクスである図４（ｄ）のマトリクスからわかるように、ブロックＢ５をＩ／Ｏ先とするＩ／Ｏ要求Ｒ４、Ｒ１及びＲ７のいずれをより上位にしようとすると、Ｂ１、Ｂ６及びＢ２のいずれかの上詰め状態が崩れてしまうことになる。 Next, paying attention to the block B5 having the fourth largest number of I / Os, an I / O request in which the block B5 is an I / O destination without disturbing the top-filled state of B1, B6, and B2 Try rearranging so that (row) comes to the top. However, here, as can be seen from the matrix in FIG. 4D, which is the top-padded matrix focusing on the block B2, any of the I / O requests R4, R1, and R7 with the block B5 as the I / O destination is selected. If it is going to be higher, the top-packed state of any of B1, B6 and B2 will be destroyed.

従って、この段階で、行の置換処理を終了する。 Therefore, at this stage, the row replacement process is terminated.

以上の一連の処理によって、Ｉ／Ｏ要求の処理順序の最適化が完了する。以下、この最適化により期待できる効果を検証する。その際、一つのブロックについてのＩ／Ｏ（具体的には、該ブロックに対応した物理的な記憶領域を有する二次記憶装置に対するＩ／Ｏ）に関する処理で遅れる確率（Ｐ）を一律に0.05とする。遅れる確率（Ｐ）とは、例えば、その二次記憶装置のＩ／Ｏ性能の標準正規分布（Ｉ／Ｏ性能情報中の標準正規分布）において、標準偏差（σ）が±２σの範囲から外れる確率である。 Through the series of processes described above, the optimization of the processing order of I / O requests is completed. The effects that can be expected from this optimization will be verified below. At that time, the probability (P) of delay in processing related to I / O for one block (specifically, I / O for a secondary storage device having a physical storage area corresponding to the block) is uniformly 0.05. And The probability of delay (P) is, for example, that the standard deviation (σ) is out of the range of ± 2σ in the standard normal distribution of I / O performance of the secondary storage device (standard normal distribution in the I / O performance information). It is a probability.

（１）並び替え前［図３（ａ）参照］
・ブロックＢ１：Ｉ／Ｏ要求Ｒ３〜Ｒ７と５つ連続でＩ／Ｏが発生しているので、
遅れる確率：Ｐ_B1＝1−（1−0.05）⁵＝ 0.2262190625
・ブロックＢ５：最終のＩ／Ｏ要求Ｒ７でＩ／Ｏが発生するが、前のＩ／Ｏ要求Ｒ６ではブロックＢ５についてＩ／Ｏが発生しないので、
遅れる確率：Ｐ_B5＝0.05
・ブロックＢ７：最終のＩ／Ｏ要求Ｒ７でＩ／Ｏが発生するが、前のＩ／Ｏ要求Ｒ６ではブロックＢ7についてＩ／Ｏが発生しないので、
遅れる確率：Ｐ_B7＝0.05
他のブロックに関しては、最終Ｉ／Ｏ要求（Ｒ７）が送出されるタイミングでは新たなＩ／Ｏが発生しない。それ以前の遅れは、Ｒ７のタイムスロットで吸収される。 (1) Before rearrangement [see FIG. 3 (a)]
-Block B1: Since I / O requests R3 to R7 and I / O are continuously generated,
Probability of delay: P _B1 = 1− (1−0.05) ⁵ = 0.2262190625
Block B5: I / O is generated in the final I / O request R7, but I / O is not generated in block B5 in the previous I / O request R6.
Probability of delay: P _B5 = 0.05
Block B7: I / O occurs in the final I / O request R7, but no I / O occurs in block B7 in the previous I / O request R6.
Probability of delay: P _B7 = 0.05
For other blocks, no new I / O occurs at the timing when the final I / O request (R7) is sent. The earlier delay is absorbed in the R7 time slot.

それぞれの遅れは独立に発生すると考えてよい。そのため、Ｂ１、Ｂ５及びＢ７のうち、２つ以上で遅れが同時に発生する確率は、それぞれの確率の積となるため、それらは無視できるほど小さな値になる。故に、最終のＩ／Ｏ要求Ｒ７のタイミングで遅れが生じる確率Ｐは、
Ｐ≒Ｐ_B1＋Ｐ_B5＋Ｐ_B7＝0.226 + 0.05 + 0.05 ＝ 0.326
となる。 Each delay may be considered to occur independently. Therefore, the probability that the delay occurs simultaneously in two or more of B1, B5, and B7 is a product of the respective probabilities, and thus becomes a small value that can be ignored. Therefore, the probability P that the delay occurs at the timing of the final I / O request R7 is
P ≒ P _B1 + P _B5 + P _B7 = 0.226 + 0.05 + 0.05 = 0.326
It becomes.

（２）並べ替え後［図４（ｄ）参照］
・ブロックＢ６：最終のＩ／Ｏ要求Ｒ２でＩ／Ｏが発生するが、前のＩ／Ｏ要求Ｒ７ではブロックＢ６についてＩ／Ｏが発生しないので、
Ｐ_B6＝0.05
・ブロックＢ２：最終のＩ／Ｏ要求Ｒ２でＩ／Ｏが発生するが、前のＩ／Ｏ要求Ｒ７ではブロックＢ２についてＩ／Ｏが発生しないので、
Ｐ_B2＝0.05
それぞれの遅れは独立に発生すると考えてよい。そのため、Ｂ６及びＢ２の両方で同時に遅れが発生する確率は、それぞれの確率の積となるが、それらは無視できるほど小さな値になる。故に、最終のＩ／Ｏ要求Ｒ２で遅れが生じる確率Ｐは、
Ｐ≒Ｐ_B6＋Ｐ_B2＝0.05 + 0.05＝0.1
となる。 (2) After rearrangement [see FIG. 4 (d)]
Block B6: I / O occurs in the final I / O request R2, but no I / O occurs in block B6 in the previous I / O request R7.
P _B6 = 0.05
Block B2: I / O is generated in the final I / O request R2, but I / O is not generated for the block B2 in the previous I / O request R7.
P _B2 = 0.05
Each delay may be considered to occur independently. Therefore, the probability that the delay occurs simultaneously in both B6 and B2 is the product of the respective probabilities, but they are small enough to be ignored. Therefore, the probability P of delay in the final I / O request R2 is
P ≒ P _B6 + P _B2 = 0.05 + 0.05 = 0.1
It becomes.

以上、第一の実施形態によれば、待ち受けキュー１０５で蓄積されたまま残っている複数のＩ／Ｏ要求でＩ／Ｏ先とされる二以上のブロックのそれぞれについてＩ／Ｏ数を算出し、Ｉ／Ｏ数が多いブロックをＩ／Ｏ先とするＩ／Ｏ要求ほど優先的に先に処理するような並び替えが行われる。これにより、ＤＢＭＳ１０７のスループットの向上が期待できる。 As described above, according to the first embodiment, the number of I / Os is calculated for each of two or more blocks that are set as I / O destinations in a plurality of I / O requests that remain accumulated in the standby queue 105. Thus, rearrangement is performed such that an I / O request in which a block having a large number of I / Os is an I / O destination is processed first. Thereby, improvement of the throughput of the DBMS 107 can be expected.

＜第二の実施形態＞ <Second Embodiment>

以下、第二の実施形態について説明する。その際、第一の実施形態との相違点について主に説明し、第一の実施形態との共通点については説明を省略或いは簡略する。 The second embodiment will be described below. At that time, differences from the first embodiment will be mainly described, and description of common points with the first embodiment will be omitted or simplified.

Ｉ／Ｏ行列置換部１０６３が、行の合成処理を実行する。以下、それについて、図５（ａ）を例に採り説明する。Ｉ／Ｏ行列置換部１０６３は、行の置換処理後のマトリクス（図４（ｄ）参照）を基に、行ベクトルの内積が 0 になる２つ以上の行があるか否か（Ｉ／Ｏ先となるブロックが重なり合わないＩ／Ｏ要求）があるか否かを判断する。そのような２つ以上の行があれば、Ｉ／Ｏ行列置換部１０６３は、２つ以上の行を合成、すなわち、２つ以上のＩ／Ｏ要求（例えばＲ２及びＲ７）を一つのＩ／Ｏ要求にまとめる。これにより、ＤＢＭＳ１０７のスループットの更なる向上が期待できる。 The I / O matrix replacement unit 1063 executes row composition processing. Hereinafter, this will be described with reference to FIG. Based on the matrix after row replacement processing (see FIG. 4D), the I / O matrix replacement unit 1063 determines whether there are two or more rows in which the inner product of the row vectors is 0 (I / O It is determined whether or not there is an I / O request in which the previous block does not overlap. If there are two or more rows, the I / O matrix replacement unit 1063 combines two or more rows, that is, combines two or more I / O requests (eg, R2 and R7) into one I / O. Summarize in O request. Thereby, further improvement of the throughput of the DBMS 107 can be expected.

なお、このように２つのＩ／Ｏ要求を一つにまとめた場合、それら２つのＩ／Ｏ要求がそれぞれ２つのアプリケーション１０２から発行されたＩ／Ｏ要求であれば、一つにまとめられたＩ／Ｏ要求の処理が済んだ場合に、ＤＢＭＳ１０７によって、それら２つのアプリケーション１０２に、完了が通知される。 When two I / O requests are combined into one as described above, if the two I / O requests are I / O requests issued from two applications 102, they are combined into one. When the processing of the I / O request is completed, the DBMS 107 notifies the completion to the two applications 102.

また、例えばブロックの数が膨大であって、一つのマトリクスで行の置換処理などを行うことが困難な場合、Ｉ／Ｏ行列置換部１０６３は、マトリクスを多段で用意し、多段のマトリクスのうちの少なくとも一つで、列の置換処理、行の置換処理、行の合成処理を行っても良い。例えば、図５（ｂ）に例示するように、第一段のマトリクスにおいてＮ個（Ｎは２以上の整数）のブロックにそれぞれ対応するＮ個の位置を、第二段（第一段の一つ上の段）のマトリクスにおいて一つの位置に対応付けても良い。その場合、第二段のマトリクスにおいて、行列が交差する各位置には、その位置に対応するＮ個のブロックが全てＩ／Ｏ先である場合を１としたときの数値が設定される。図５（ｂ）の例では、第二段のマトリクスの或る位置には、８個のブロックのうちの４個のブロックがＩ／Ｏ先となっているので、０．５という値が設定される。この第二段のマトリクスで、列の置換処理、行の置換処理、行の合成処理を行っても良い。行の合成処理の際は、例えば、Ｉ／Ｏ行列置換部１０６３は、行ベクトルの内積が所定値以下になる２つ以上の行があるか否かを判断し（例えば、行の合成処理を行ってもＩ／Ｏ先が重なり合う可能性が０に極めて近い２つ以上の行があるかを判断し）、そのような２つ以上の行があれば、行の合成を行っても良い。 For example, when the number of blocks is enormous and it is difficult to perform row replacement processing in one matrix, the I / O matrix replacement unit 1063 prepares the matrix in multiple stages, At least one of the above, column replacement processing, row replacement processing, and row composition processing may be performed. For example, as illustrated in FIG. 5B, N positions corresponding to N (N is an integer of 2 or more) blocks in the first-stage matrix are assigned to the second stage (one of the first stages). It may be associated with one position in the upper matrix. In that case, in the second-stage matrix, a numerical value is set at each position where the matrix intersects when the number of N blocks corresponding to the position is all I / O destinations. In the example of FIG. 5B, a value of 0.5 is set at a certain position of the second-stage matrix because four of the eight blocks are I / O destinations. Is done. In this second-stage matrix, column replacement processing, row replacement processing, and row composition processing may be performed. In the row composition processing, for example, the I / O matrix replacement unit 1063 determines whether there are two or more rows in which the inner product of the row vectors is equal to or less than a predetermined value (for example, the row composition processing is performed). It is determined whether there are two or more rows that are very close to 0 with the possibility of overlapping I / O destinations even if they are executed). If there are two or more such rows, the rows may be combined.

＜第三の実施形態＞ <Third embodiment>

この第三の実施形態では、Ｉ／Ｏ性能統計がより有効に活用される。具体例としては、以下の第一〜第三の例とがある。 In this third embodiment, I / O performance statistics are utilized more effectively. Specific examples include the following first to third examples.

第一の例は、Ｉ／Ｏ性能の向上化である。例えば、Ｉ／Ｏ性能蓄積部１０６２が、定期的に又は不定期的に、図６（ａ）に例示する処理を実行する。具体的には、例えば、Ｉ／Ｏ性能蓄積部１０６２が、同種の二次記憶装置のＩ／Ｏ性能統計を参照することにより、同種の二次記憶装置間でＩ／Ｏ性能（例えば、平均、標準偏差）に想定誤差以上の差があるか否かを判断する（ステップＳ１０１）。そのような差がある場合には、Ｉ／Ｏ性能蓄積部１０６２は、同種の二次記憶装置のうちのＩ／Ｏ性能の低い方の二次記憶装置に対して最適化処理を実行させる（Ｓ１０２）。 The first example is an improvement in I / O performance. For example, the I / O performance accumulation unit 1062 executes the process illustrated in FIG. 6A periodically or irregularly. Specifically, for example, the I / O performance accumulation unit 1062 refers to the I / O performance statistics of the same type of secondary storage device, so that the I / O performance (for example, the average , Standard deviation) is determined whether or not there is a difference greater than the expected error (step S101). If there is such a difference, the I / O performance storage unit 1062 causes the secondary storage device with the lower I / O performance among the same type of secondary storage devices to execute the optimization process ( S102).

ここで、複数の二次記憶装置のうちのどれが同種の二次記憶装置であるかは、例えば、二次記憶装置に関する属性情報が書かれた管理テーブルを参照する等の方法により、特定することができる。 Here, which of the plurality of secondary storage devices is the same type of secondary storage device is identified by a method such as referring to a management table in which attribute information related to the secondary storage device is written. be able to.

また、二次記憶装置に対して実行させる最適化処理とは、例えば、デフラグメンテーション処理、すなわち、二次記憶装置内のデータの断片化を解消する処理である。Ｉ／Ｏ性能蓄積部１０６２は、例えば、ＯＳ２１１でサポートされているユーティリティ機能を呼び出すことで、所望の二次記憶装置に対する最適化処理を実行させることができる。 The optimization process executed on the secondary storage device is, for example, a defragmentation process, that is, a process for eliminating fragmentation of data in the secondary storage device. For example, the I / O performance storage unit 1062 can execute optimization processing for a desired secondary storage device by calling a utility function supported by the OS 211.

第二の例は、待ち受けキュー１０５からのＩ／Ｏ要求の出力タイミングの決定である。すなわち、この第三の実施形態では、ＯＳ２１１によって決定されたタイミングではなく、最適化部１０６のキュー制御部（図示せず）によって決定されたタイミングで待ち受けキュー１０５からＩ／Ｏ要求が出力される。具体例を図６（ｂ）に示す。 The second example is the determination of the output timing of the I / O request from the standby queue 105. In other words, in the third embodiment, the I / O request is output from the standby queue 105 not at the timing determined by the OS 211 but at the timing determined by the queue control unit (not shown) of the optimization unit 106. . A specific example is shown in FIG.

すなわち、キュー制御部は、Ｉ／Ｏ要求を読み出した場合に、該Ｉ／Ｏ要求の処理の際に発行されるＩ／Ｏコマンドの発行先二次記憶装置に対応したＩ／Ｏ性能統計を基に、該二次記憶装置についてのＩ／Ｏ性能を予測する（Ｓ２０１）。ここで予測されるＩ／Ｏ性能は、例えば、標準正規分布での平均であっても良いし、標準正規分布における所定の範囲内での或るＩ／Ｏ性能であっても良い。 That is, when the queue control unit reads an I / O request, the queue control unit displays I / O performance statistics corresponding to the secondary storage device to which the I / O command issued at the time of processing the I / O request. Based on this, the I / O performance for the secondary storage device is predicted (S201). The I / O performance predicted here may be, for example, an average in a standard normal distribution or a certain I / O performance within a predetermined range in the standard normal distribution.

キュー制御部は、予測されたＩ／Ｏ性能に基づいて、次のＩ／Ｏ要求を読み出すタイミング（例えば今回の読出しから何秒後に読み出すか）を決定する（Ｓ２０２）。そして、キュー制御部は、そのタイミングで次のＩ／Ｏ要求を読出し（Ｓ２０３）、その読み出したＩ／Ｏ要求について、再びＳ２０１を実行する。 Based on the predicted I / O performance, the queue control unit determines the timing for reading the next I / O request (for example, how many seconds after the current reading) (S202). Then, the queue control unit reads the next I / O request at that timing (S203), and executes S201 again for the read I / O request.

第三の例は、Ｉ／Ｏ行列置換部１０６３による行の置換処理において、どのようなブロックを優先してＩ／Ｏ要求を並び替えるかの決定である。すなわち、上述した第一の実施形態では、Ｉ／Ｏ数が多い順が、ブロックの優先順位の高い順であるが、この第三の例では、必ずしも、Ｉ／Ｏ数が多い順が、ブロックの優先順位の高い順とはならない。例えば、ブロックＢ１のＩ／Ｏ数が最も多いが、ブロックＢ２に対応した二次記憶装置のＩ／Ｏ性能がブロックＢ１に対応した二次記憶装置のＩ／Ｏ性能よりもかなり低い場合には、ブロックＢ２を最優先とした行の置換処理が行われても良い。 The third example is determination of what block is preferentially rearranged in the row replacement processing by the I / O matrix replacement unit 1063. That is, in the first embodiment described above, the order in which the number of I / Os is large is the order in which the block priority is high, but in this third example, the order in which the number of I / Os is large is not necessarily the order. The order of priority is not high. For example, when the number of I / Os in block B1 is the largest, the I / O performance of the secondary storage device corresponding to block B2 is considerably lower than the I / O performance of the secondary storage device corresponding to block B1. The row replacement process with the block B2 as the highest priority may be performed.

上述した本発明の幾つかの実施形態は、本発明の説明のための例示であり、本発明の範囲をそれらの実施形態にのみ限定する趣旨ではない。本発明は、その要旨を逸脱することなく、その他の様々な態様でも実施することができる。 The several embodiments of the present invention described above are examples for explaining the present invention, and are not intended to limit the scope of the present invention only to those embodiments. The present invention can be implemented in various other modes without departing from the gist thereof.

本発明の一実施形態に係るシステム全体の構成を示す。1 shows a configuration of an entire system according to an embodiment of the present invention. 最適化部の機能の説明図。Explanatory drawing of the function of an optimization part. 並び替え処理における列の置換処理の説明図。Explanatory drawing of the replacement process of the column in a rearrangement process. 並び買え処理における行の置換処理の説明図。Explanatory drawing of the replacement process of the line in a line-up purchase process. （ａ）は、並び替え処理における行の合成処理の説明図。（ｂ）は、マトリクスが多段になった場合の一例の説明図。(A) is explanatory drawing of the synthetic | combination process of the line in a rearrangement process. (B) is an explanatory view of an example when the matrix is multistage. （ａ）は、Ｉ／Ｏ性能統計の利用方法の第一の例を示す図。（ｂ）は、Ｉ／Ｏ性能統計の利用方法の第二の例を示す図。(A) is a figure which shows the 1st example of the utilization method of I / O performance statistics. (B) is a figure which shows the 2nd example of the utilization method of I / O performance statistics.

Explanation of symbols

１０１…クライアント１０２…アプリケーションプログラム１０５…待ち受けキュー１０６…最適化部１０７…ＤＢＭＳ１０６１…キュー監視蓄積部１０６２…Ｉ／Ｏ性能蓄積部１０６３…Ｉ／Ｏ行列置換部 DESCRIPTION OF SYMBOLS 101 ... Client 102 ... Application program 105 ... Standby queue 106 ... Optimization part 107 ... DBMS 1061 ... Queue monitoring storage part 1062 ... I / O performance storage part 1063 ... I / O matrix replacement part

Claims

A request response that is a response to an I / O request when an I / O request is received from an I / O request issuer, accumulated once in a queue, and predetermined processing is completed for the I / O request output from the queue A computer program for causing a computer to optimize the processing order of I / O requests that can be applied to an I / O control system that returns an I / O request to the I / O request issuer,
A queue status check step for checking how many I / O requests are arranged in what order in the queue and which storage area is the I / O destination of each I / O request;
Based on the result of the check, an I / O destination for each of two or more storage areas that are set as I / O destinations in a plurality of I / O requests arranged in the queue among the plurality of storage areas. The number of I / O requests, which is the number of I / O requests, is calculated, and a plurality of I / O requests in the queue are arranged so that the I / O requests that have a storage area with a large number of I / Os are in the earlier order. A computer program that causes a computer to execute an order rearranging step for rearranging the order of I / O requests.

The predetermined process is an I / O command issued to a storage device corresponding to a storage area that is an I / O destination in an I / O request output from the queue, and a response to the I / O command. Receiving a command response from the corresponding storage device;
For each I / O command issued by the I / O control system, the length of time from when the I / O control system issues an I / O command to the storage device until the command response is received from the storage device I / O performance accumulating step for acquiring I / O performance, and accumulating the acquired I / O performance in the storage means,
The computer program according to claim 1, further causing the computer to execute.

Based on the plurality of I / O performances stored in the storage means, I / O performance statistics are obtained for each storage device, and the difference in I / O performance between the same types of storage devices exceeds a predetermined value. A storage device optimization step for determining whether or not the storage device is large, and in the case of being large, performing an optimization process on a storage device having a lower I / O performance among the storage devices of the same type;
The computer program according to claim 2, further causing the computer to execute.

In the order rearranging step, the plurality of I / O performances stored in the storage unit and the two or more storages that are set as I / O destinations by the plurality of I / O requests arranged in the queue. And determining the I / O performance of each of the two or more storage areas based on the I / O performance of each of the two or more storage areas in addition to the obtained number of I / Os. The priority order of each of the two or more storage areas is determined, and the order of the plurality of I / O requests is arranged so that an I / O request having a storage area having a higher priority order as an I / O destination is earlier. Change
The computer program according to claim 2.

In the order rearranging step, from the result of the check, among the plurality of I / O requests arranged in the queue, two or more probabilities that the storage areas as I / O destinations overlap each other are not more than a predetermined value. I / O requests are specified, and two or more specified I / O requests are defined as one I / O with two or more storage areas specified by the two or more I / O requests as I / O destinations. Composite to request,
The computer program according to claim 1.

A request response that is a response to an I / O request when an I / O request is received from an I / O request issuer, accumulated once in a queue, and predetermined processing is completed for the I / O request output from the queue Is an I / O control system that returns to the I / O request issuer,
A queue status check unit for checking in which order and how many I / O requests are arranged in the queue and which storage area is the I / O destination of each I / O request;
Based on the result of the check, an I / O destination for each of two or more storage areas that are set as I / O destinations in a plurality of I / O requests arranged in the queue among the plurality of storage areas. The number of I / O requests, which is the number of I / O requests, is calculated, and a plurality of I / O requests in the queue are arranged so that the I / O requests that have a storage area with a large number of I / Os are in the earlier order. An I / O control system comprising an order rearranging unit for rearranging the order of I / O requests.