JP2006092503A

JP2006092503A - Multi-instance in-memory database

Info

Publication number: JP2006092503A
Application number: JP2004308853A
Authority: JP
Inventors: Shuhei Nishiyama; 修平西山
Original assignee: Nishiyama Shuhei
Current assignee: Nishiyama Shuhei
Priority date: 2004-09-27
Filing date: 2004-09-27
Publication date: 2006-04-06
Anticipated expiration: 2024-09-27
Also published as: JP4313845B2

Abstract

<P>PROBLEM TO BE SOLVED: To divide single data into sizes that can be loaded into main storage devices and handle the divisions for search, processing and the like in the same manner as the single undivided data so that an in-memory database is applied to a large scale database. <P>SOLUTION: Data having many tuples are grouped into data sets sized for handling on the main storage devices and are allocated with storage location information assigned. In search, a data set in a storage location is specified, and a data storage device is identified by the location information. If the target data set is on the main storage device, it is accessed direct. If not, the data set is accessed after it is checked in from a secondary storage device or the storage device on another networked electronic computer. The location information about the data sets is set according to characteristics such as a bias at the storage start of stored data and, with time series changes in the characteristics of the data, the relation between the data sets and the data storage devices is coordinated accordingly. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、インメモリ・データベースおよびそれを用いたグリッド・コンピューティング・システムに関する。 The present invention relates to an in-memory database and a grid computing system using the same.

データベース管理システムの基本技術は、階層型にはじまり、ネットワーク型、リレーショナル型を経てオブジェクト指向型に至っているが、未だリレーショナル型が主流である。 The basic technology of the database management system has started from the hierarchical type, has reached the object-oriented type through the network type and the relational type, but the relational type is still mainstream.

加えて、その主流であるリレーショナル型データベース管理システムは、データ格納媒体として、格納容量が大きく、電力の供給が断たれても、その記憶内容が比較的安定している磁気ディスクを主要部品とするハードディスク装置を二次記憶装置として使用することを前提としてきた。 In addition, the mainstream relational database management system has as its main component a magnetic disk as a data storage medium that has a large storage capacity and whose stored contents are relatively stable even when power supply is cut off. It has been assumed that a hard disk device is used as a secondary storage device.

二次記憶装置としてのハードディスク装置は、停電に対しては安定しているが、機械的摺動部分を多く持つという理由から、半導体記憶装置に対して故障発生率は桁違いに高く、読出し書込みのアクセス時間は桁違いに遅い。また、周辺の磁場の影響を受けやすい磁気媒体を記憶媒体としているため、磁石の接近に対して記憶保持力は不安定である。さらに、読出し書込みのアクセスは磁気ヘッダの接触によっておこなわれるため、磁気ヘッダは磁気記憶媒体に非常に近い場所に常に置かれている。そのため、衝撃に対しても磁気記憶媒体の記憶保持力が脆弱であることは周知のことである。 Hard disk devices as secondary storage devices are stable against power outages, but because they have many mechanical sliding parts, the failure occurrence rate is much higher than that of semiconductor storage devices. Access time is orders of magnitude slower. Further, since the magnetic medium that is easily affected by the surrounding magnetic field is used as the storage medium, the memory retention force is unstable with respect to the approach of the magnet. Furthermore, since read / write access is performed by contact of the magnetic header, the magnetic header is always placed in a location very close to the magnetic storage medium. For this reason, it is well known that the memory retention force of a magnetic storage medium is vulnerable to impact.

二次記憶装置として不揮発性メモリ等の半導体記憶媒体を使用する動きもあるが、単位記憶容量あたりの価格が未だに高価であるため、経済的な理由から、その普及はデータベース管理システムに利用するほどには普及していない。 Although there is a movement to use a semiconductor storage medium such as a non-volatile memory as a secondary storage device, the price per unit storage capacity is still expensive, and for economic reasons, its spread is used as a database management system. Is not popular.

最近の工業生産技術の向上に伴い、揮発性半導体記憶媒体は低価格大容量化の傾向にあり、それに伴いギガビットオーダーの大容量揮発性半導体記憶装置を主記憶装置とするパーソナル・コンピュータ（以下ＰＣ）も登場してきている。また、３２ビットＣＰＵではアドレッシングの理由から２の３２乗バイトすなわち４２億９千４百９６万７千２百９６バイト（約４ＧＢ）以上のメモリを主記憶装置として搭載しても無意味であったが、６４ビットＣＰＵの登場で２の６４乗バイトのメモリ空間を持つことが可能になり、理論上はテラ・バイト・オーダーのデータベースをインメモリで取扱うことも可能になってきている。実際には半導体メモリの現在の集積技術の制限から一個の６４ビットＣＰＵに搭載される主記憶装置としての半導体メモリはせいぜい数十ＧＢレベルと推定される。 With the recent improvement of industrial production technology, volatile semiconductor storage media has been trending toward lower prices and larger capacities. Along with this, personal computers (hereinafter referred to as PCs) with large-capacity volatile semiconductor storage devices on the order of gigabits as main storage devices. ) Has also appeared. On the other hand, in a 32-bit CPU, it is meaningless to install a memory of 2 32 bytes, that is, 4,294,962,722,96 bytes (about 4 GB) or more as a main storage device for addressing reasons. However, with the advent of 64-bit CPUs, it is possible to have a memory space of 2 64 bytes, and it is theoretically possible to handle a terabyte order database in-memory. Actually, it is estimated that the semiconductor memory as the main storage device mounted on one 64-bit CPU is at most several tens GB level due to the limitation of the current integration technology of the semiconductor memory.

前記揮発性半導体記憶媒体は低価格大容量化の傾向は、データベース管理システムの在り方を根底から覆すこととなり、主記憶装置をメインのデータ格納装置とし、二次記憶装置を障害時のリカバリのための永続化（パーシステント）用記憶媒体と位置づけるインメモリ・データベース管理システムが商用プロダクトとして複数製品が市場に登場してきている。 The trend toward lower prices and larger capacities of the volatile semiconductor storage medium is to completely overturn the database management system, the main storage device is the main data storage device, and the secondary storage device is used for recovery in the event of a failure. In-memory database management system, which is positioned as a persistent storage medium, has appeared on the market as a commercial product.

現在市場に登場してきているインメモリ・データベース管理システムは、マルチューザによる複雑な更新処理をともなうトランザクション用データベース管理システムではなく、ＯＬＡＰやデータマイニングを行うためのデータウェアハウス（以下ＤＷＨ）用のデータベース管理システムとしての位置付けが妥当であり、数十ＧＢレベルの大きさのデータベース管理システムのインメモリ化には成功しているといえる。 The in-memory database management system that has appeared on the market is not a database management system for transactions involving complicated update processing by a Maltuser, but a database management for a data warehouse (hereinafter referred to as DWH) for OLAP and data mining. Positioning as a system is reasonable, and it can be said that a database management system having a size of several tens of GB has been successfully implemented in memory.

特開２００４−２２７１６９号（ＰＣＴ／ＪＰ２００３／０１４３９０）公報JP-A-2004-227169 (PCT / JP2003 / 014390) 特開２００４−１４５６４０号公報JP 2004-145640 A 特開２０００−３３９３９０号公報JP 2000-339390 A

ＥＵの環境問題規制強化にからみ工業生産製品、農業生産物等の有害化学物質の不使用の証明や誤って使用された場合の消費者に対する使用禁止等のメッセージ通知や回収等のためのトレーサビリティの確保のために、数百テラ・バイト・オーダーの大容量超高速データベースの実用化が求められている。 In order to strengthen EU environmental problem regulations, traceability for proof of non-use of hazardous chemical substances such as industrial products and agricultural products, and notification and collection of messages such as prohibition of use to consumers when used incorrectly. In order to secure it, there is a demand for practical use of a large-capacity ultrahigh-speed database of the order of several hundred terabytes.

６４ビットＣＰＵ搭載ＰＣの登場により、大容量超高速データベースの実現に近づいたとはいえ、主記憶装置に用いられる半導体揮発性記憶装置の集積度は、一台のＰＣに数百テラ・バイト・オーダーの半導体揮発性記憶装置を搭載するほどには至っていない。従って、ネットワーク上に配置された複数個の電子計算機にデータベースを分散して処理ができることが求められている。そのため、同一属性を持つ大規模データの集合を、数ギガ・バイト・オーダーの主記憶装置しか持たない電子計算機上で主記憶装置のみで処理可能な大きさのデータ・セットに分割して、前記数ギガ・バイト・オーダーの主記憶装置しか持たない電子計算機を複数台ネットワークに接続することが考えられるが、データ処理を行うユーザの電子計算機からは前記分割された複数個のデータ・セットが、統合して仮想的に単一のデータ・セットとして取扱えるようにすることができないことが問題であった。 Although the introduction of a 64-bit CPU-equipped PC has made it close to the realization of a large-capacity ultra-high-speed database, the degree of integration of semiconductor volatile storage devices used in main storage devices is in the order of several hundred terabytes per PC. The semiconductor volatile memory device is not installed. Therefore, it is required that the database can be distributed to a plurality of electronic computers arranged on the network. Therefore, a set of large-scale data having the same attribute is divided into data sets of a size that can be processed only by the main storage device on an electronic computer having only a few gigabyte order main storage device, Although it is conceivable to connect a plurality of electronic computers having only a few gigabytes of main storage to a network, a plurality of divided data sets are obtained from a user's electronic computer that performs data processing. The problem was that they could not be integrated and handled virtually as a single data set.

また、全データ・セットがネットワーク上に配置された電子計算機上の主記憶装置上に展開されている状態が理想であるが、接続する電子計算機の台数の制限からハードディスク装置等の二次記憶媒体を効率的に利用することが求められる場合もあり、この二次記憶媒体を効率的に利用することができないことも問題であった。 In addition, it is ideal that all data sets are expanded on a main storage device on an electronic computer arranged on a network, but a secondary storage medium such as a hard disk device due to the limitation of the number of connected electronic computers. In some cases, it is required to efficiently use the secondary storage medium, and the secondary storage medium cannot be used efficiently.

また、ネットワーク上に接続される電子計算機は、ＣＰＵの性能が一様ではなく、搭載主記憶装置の容量も各様であり、時系列的にも、配置換えや新旧の入替えにより、ネットワーク上に接続される電子計算機の台数は変化し、前記電子計算機に搭載されるＣＰＵの性能は変化し、前記電子計算機に搭載される主記憶装置の容量も変化していく。その多様性や変化に対応すべく前記分割されたデータ・セットのロケーション情報がダイナミックに変更可能ではなく、加えて最適配置も困難であったことも問題であった。 In addition, the computer connected to the network does not have uniform CPU performance, and the capacity of the mounted main storage device is also various. Even in time series, it can be placed on the network by rearranging or replacing old and new. The number of connected electronic computers changes, the performance of the CPU mounted on the electronic computer changes, and the capacity of the main storage device mounted on the electronic computer also changes. Another problem is that the location information of the divided data sets cannot be dynamically changed to cope with the diversity and changes, and in addition, the optimal arrangement is difficult.

さらに、単純に大規模データ・セットが適正規模の複数データ・セットに分割されても、生産管理システムにおけるＢＯＭ（部品構成表）の正展開、逆展開を表現するためのように複数データ・セットに跨るセルフ・ジョインを横断的に効率的に実現することも困難であった。 Furthermore, even if a large-scale data set is simply divided into a plurality of appropriately-sized data sets, a plurality of data sets are used to express forward and backward development of BOM (parts configuration table) in the production management system. It was also difficult to efficiently implement self-joining across the two.

そこで、本発明は、本データベース・システムにおいて大規模データ・セットが小規模データ・セットに分割されていても、仮想的に統合して単一の大規模データ・セットとして取扱えるようにし、前記データベース・システムを運用しながら、前記データベース・システムを構成するネットワーク上に配置された一個以上の電子計算機の台数ないし前記電子計算機に搭載されているＣＰＵの性能ないし前記ＣＰＵの個数ないし前記電子計算機に搭載されている主記憶装置の容量等のリソースの変化に呼応して、前記分割されたデータ・セットのロケーションをダイナミックに最適配置していくことを目的とする。 Therefore, the present invention enables a large data set to be handled as a single large data set by virtually integrating even if a large data set is divided into small data sets in the database system. While operating the database system, the number of one or more electronic computers arranged on the network constituting the database system, the performance of the CPU mounted on the electronic computer, the number of CPUs, or the electronic computer The object is to dynamically and optimally arrange the locations of the divided data sets in response to changes in resources such as the capacity of the main storage device mounted.

上記の課題を解決するために、本発明においては、請求項１に示されるように、同一の属性を持つ大規模データの集合を、格納すべきデータの情報の一部または全部をハッシング等のアルゴリズムを特定のパラメータで用いて、ネットワーク上に配置された電子計算機の主記憶装置のみで処理可能な大きさに分割配置し、それぞれをデータ・セットとする。各データ・セットには識別記号とロケーション情報が与えられ、検索や加工の際には対象となるデータの情報の一部または全部を前記アルゴリズムと同一のアルゴリズムを同一のパラメータで用いて、格納先のデータ・セットを特定する。全データ・セットがネットワーク上に配置された電子計算機上の主記憶装置上に展開されている状態が理想であるが、データ・セットの、他の記憶媒体への待避（以下チェックアウト）、召還（以下チェックイン）機能を持つことによって、接続される電子計算機の台数の制限に対応する。 In order to solve the above-described problems, in the present invention, as shown in claim 1, a large-scale data set having the same attribute, a part or all of data information to be stored is hashed or the like. Using an algorithm with specific parameters, the data is divided and arranged in a size that can be processed only by the main storage device of the electronic computer arranged on the network, and each is used as a data set. Each data set is given an identification symbol and location information. When searching or processing, a part or all of the information of the target data is stored in the same parameter using the same algorithm as the above algorithm. Identify the data set. Ideally, all data sets are deployed on the main storage device on a computer located on the network, but the data sets can be saved to other storage media (hereinafter referred to as checkout) and summoned. By having a function (hereinafter referred to as “check-in”), it supports the limitation of the number of connected computers.

また、請求項２に示されるように、データ・セットのデータ・セット識別記号ロケーション情報変換部のロケーション情報を変更する機能を有することにより、各時点での各データ・セットの配置を再検討し、最適再配置を可能にする。 Further, as described in claim 2, by having a function of changing the location information of the data set identification symbol location information conversion unit of the data set, the arrangement of each data set at each time point is reviewed. Enable optimal relocation.

また、請求項３に示されるように、配置されたデータ・セットの大きさに対して、そのデータ・セットが配置されている電子計算機の主記憶装置の未使用記憶容量が、アクセス対象でチェックアウトされているデータ・セットを、現在チェックインしているデータ・セットをチェックアウトせずにチェックイン可能な場合、現在のデータ・セットのチェックアウトが行われずに、アクセス対象のデータ・セットがチェックインされる。これによって、二次記憶装置の使用を抑制し、性能向上を図る。 Further, as described in claim 3, for the size of the arranged data set, the unused storage capacity of the main storage device of the electronic computer in which the data set is arranged is checked in the access target. If a checked out data set can be checked in without checking out the currently checked in data set, the current data set is not checked out and the accessed data set is Check in. This suppresses the use of the secondary storage device and improves performance.

また、請求項４に示されるように、各データ・セットと前記ネットワーク上に配置された電子計算機のＣＰＵ稼働率や搭載主記憶装置の未使用記憶容量を常時ないし定期的に監視し、データ・セットのその時点での実際の大きさ、配置されていて使用可能な電子計算機のＣＰＵ稼働率や搭載主記憶装置の未使用記憶容量の変化に応じて、自動的にロケーション情報の変更、データ・セットの再配置を行うことにより、二次記憶装置の使用を抑制し、性能向上を図る。 Further, as shown in claim 4, the CPU utilization rate of each data set and the electronic computer arranged on the network and the unused storage capacity of the installed main storage device are constantly or regularly monitored, Depending on changes in the actual size of the set at that time, the CPU utilization rate of the computers that are installed and usable, and the unused storage capacity of the installed main storage, the location information is automatically changed, By rearranging the set, the use of the secondary storage device is suppressed and the performance is improved.

また、請求項５に示されるように、データ・セットが、自分自身のデータ・セットの他のタプルから参照されている時、参照しているタプルのアトリビュート・データを参照されているデータ・セットの前記データ・セット識別記号変換部への入力データとすることにより、セルフ・ジョインを再帰的におこない、各データ・セットの分割される前の元の大規模データ・セットに記述された親部品子部品構成表に基づく前記ＢＯＭの正展開、逆展開を行う。 Also, as set forth in claim 5, when a data set is referenced from another tuple of its own data set, the data set referenced by the attribute data of the referring tuple By using the data as input data to the data set identification symbol conversion unit, the self part is recursively performed and the parent part described in the original large-scale data set before each data set is divided The BOM is forwardly expanded and reversely expanded based on the child component configuration table.

これにより、数百テラ・バイト・オーダー・レベルの大規模データ・セットからの抽出ないしソートないしマージないしジョインないしプロジェクションも高速に実行することが可能となる。 As a result, extraction, sorting, merging, joining, and projection from a large-scale data set of several hundred terabyte order level can be executed at high speed.

また、数テラ件数オーダー・レベルの大規模ＢＯＭにおいても、その正展開、逆展開を高速で行うことが可能となる。 Further, even in a large-scale BOM of several tera number order level, it is possible to perform forward and reverse expansion at high speed.

使用開始時には数ギガ・バイト・オーダー・レベルの大きさの小規模データベースが、時間とともに大きくなり、数百テラ・バイト・オーダー・レベルの大規模データベースに成長しても、基本的構造を変更せずに、前記ネットワーク上に配置される電子計算機の台数の増加、若しくは前記ネットワーク上に配置された個々の電子計算機に搭載されたＣＰＵの性能の向上ないし個数の増加、若しくは前記ネットワーク上に配置された個々の電子計算機に搭載された主記憶装置の容量の増加、だけでその成長の度合に応じて、その時点での最適規模のデータベースに拡張しながら構築することが可能となる。 Even if a small database with a size of several gigabytes at the start of use grows over time and grows into a large database with a few hundred terabytes, the basic structure can be changed. Without increasing the number of computers arranged on the network, or improving the performance or increasing the number of CPUs mounted on individual computers arranged on the network, or arranged on the network. It is possible to construct the database while expanding the database to the optimum scale at that time according to the degree of growth only by increasing the capacity of the main storage device mounted on each individual computer.

以下、本発明を実施するための最良の形態について、図を用いて説明する。なお、本発明は、これら実施の形態に何ら限定されるものではなく、その要旨を逸脱しない範囲において、種々たる態様で実施し得る。 The best mode for carrying out the present invention will be described below with reference to the drawings. Note that the present invention is not limited to these embodiments, and can be implemented in various modes without departing from the scope of the present invention.

（発明の概念）
図１は、請求項１に基づく本発明の概念を示す。
データ入力装置１０１、
データ格納部特定装置１０２、
データ格納装置１０３、１０４、１０５、
データ検索加工装置１０６、
データ出力装置１０７が、
本発明に係わるマルチインスタンス・インメモリ・データベースの構成要素である。
データ・セット識別記号変換部１２１、
データ・セット識別記号ロケーション情報変換部１２２は、
前記マルチインスタンス・インメモリ・データベースの構成要素の一部であるデータ格納部特定装置１０２の構成要素である。
データ格納部１３１、１４１、１５１、
リソース・マネジメント部１３２、１４２、１５２、
チェックアウト・データ待避スペース１３３、１４３、１５３、１５４は、
前記マルチインスタンス・インメモリ・データベースの構成要素の一部であるデータ格納装置１０３、１０４，１０５の構成要素である。
データ検索部１６１、
データ加工部１６２、
ワークスペース部１６３、
リソース・マネジメント部１６４は、
前記マルチインスタンス・インメモリ・データベースの構成要素の一部であるデータ検索加工装置１０６の構成要素である。(Concept of invention)
FIG. 1 shows the concept of the invention according to claim 1.
Data input device 101,
Data storage specifying device 102,
Data storage devices 103, 104, 105,
Data search processing device 106,
The data output device 107 is
2 is a component of a multi-instance in-memory database according to the present invention.
Data set identification symbol converter 121,
The data set identification symbol location information conversion unit 122
It is a component of the data storage unit specifying device 102 that is a part of the components of the multi-instance in-memory database.
Data storage units 131, 141, 151,
Resource management unit 132, 142, 152,
Checkout / data saving spaces 133, 143, 153, 154
It is a component of the data storage devices 103, 104, 105 that are part of the components of the multi-instance in-memory database.
Data search unit 161,
Data processing unit 162,
Workspace part 163,
The resource management unit 164
It is a component of the data search processing device 106 that is a part of the components of the multi-instance in-memory database.

データ入力装置１０１は、ネットワーク上の他の電子計算機からデータをインポートしてもよい。 The data input device 101 may import data from other electronic computers on the network.

データ入力装置１０１は、ネットワーク上の電子計算機若しくはネットワークに接続されていない電子計算機がフロッピー・ディスク若しくはＣＤ若しくはＤＶＤ若しくはメモリ・ディスク等の取外し可能な記憶媒体に作成したデータを前記記憶媒体からインポートしてもよい。 The data input device 101 imports data created on a removable storage medium such as a floppy disk, a CD, a DVD, or a memory disk by a network computer or a computer not connected to the network from the storage medium. May be.

データ入力装置１０１は、前記データ入力装置１０１が実装されている電子計算機上に接続されたキーボートやマウス等の入力機器から直接入力されたデータをインポートしてもよい。 The data input device 101 may import data directly input from an input device such as a keyboard or a mouse connected to the computer on which the data input device 101 is mounted.

データ・セット識別記号変換部１２１は、データ入力装置１０１がインポートしたデータをタプルごとに読出し、前記タプルを構成するアトリビュートに記録されている情報の全部または一部を入力情報として、前記タプルが格納されるべきデータ・セットの識別記号（すなわちＩＤ）に変換する。 The data set identification symbol conversion unit 121 reads the data imported by the data input device 101 for each tuple, and stores the tuple as input information using all or part of the information recorded in the attributes constituting the tuple. Convert to the identification (ie, ID) of the data set to be done.

前記タプルを構成するアトリビュートに記録されている情報の全部または一部である入力情報を前記タプルが格納されるべきデータ・セットの識別記号への変換には、適当なハッシュ・アルゴリズムに適当なパラメータ値を与えて実施してもよい。ただし、本発明に係わるデータベース・システムが稼働している最中は、前記ハッシュ・アルゴリズムおよびパラメータ値は変更しないものとする。 In order to convert the input information, which is all or part of the information recorded in the attribute constituting the tuple, into the identification symbol of the data set in which the tuple is to be stored, an appropriate parameter for an appropriate hash algorithm is used. You may carry out by giving a value. However, the hash algorithm and parameter values are not changed while the database system according to the present invention is operating.

データ・セット識別記号ロケーション情報変換部１２２は、前記データ・セット識別記号変換部１２１が変換して取得した前記前記タプルが格納されるべきデータ・セットの識別記号と実際に物理的に前記データ・セットが格納されるデータ格納装置１０３、１０４、１０５が実装されている前記ネットワーク上に配置された電子計算機のネットワーク上でのロケーション情報を対応付けるテーブルである。 The data set identification symbol location information conversion unit 122 and the data set identification symbol to be stored in the data set identification symbol conversion unit 121 obtained by the data set identification symbol conversion unit 121 and the physical data It is a table which matches the location information on the network of the electronic computer arrange | positioned on the said network in which the data storage apparatus 103, 104, 105 in which a set is stored is mounted.

図２は、前記データ・セットの識別記号と前記ロケーション情報を対応付けるテーブルの構成の一例である。この例では、
識別記号欄２０１は、ロケーション情報欄２１１と、
識別記号欄２０２は、ロケーション情報欄２１２と、
識別記号欄２０３は、ロケーション情報欄２１３と、
識別記号欄２０４は、ロケーション情報欄２１４と、
識別記号欄２０５は、ロケーション情報欄２１５と、
識別記号欄２０６は、ロケーション情報欄２１６と、
識別記号欄２０７は、ロケーション情報欄２１７と、
は対応付けられている。
したがって、識別記号欄２０１に在る識別記号１のデータセットは、ロケーション情報欄２１１にあるロケーション情報１９２．１６８．１．１１で示されるネットワーク上の電子計算機上にデータ格納装置を持つことを意味している。他の識別記号欄２０２、２０３、２０４、２０５、２０６、２０７にある識別記号も他のロケーション情報欄２１２、２１３、２１４、２１５、２１６、２１７にあるロケーション情報と対応付けられる。複数の識別記号が同一のロケーション情報を持っていてもよい。前記複数の識別記号が同一のロケーション情報を持っている場合、請求項１に示すように前記データ格納装置の構成要素であるリソース・マネジメント部によって、データ・セットのチェックインないしチェックアウトが行われる。ただし、請求項３に示されるように、現在、前記データ格納部に格納中のデータセットをチェックアウトを要せずに、アクセス要求のあった別のデータセットをチェックインできる場合にはこの限りではない。
この図２の例を図１に適用してみると、１０３のデータ格納装置Ａのロケーション情報を「１９２．１６８．１．１１」、１０４のデータ格納装置Ｂのロケーション情報を「１９２．１６８．１．１２」、１０５のデータ格納装置Ｃのロケーション情報を「１９２．１６８．１．１３」としてもよい。FIG. 2 shows an example of the configuration of a table that associates the identification symbol of the data set with the location information. In this example,
The identification symbol column 201 includes a location information column 211,
The identification symbol column 202 includes a location information column 212,
The identification symbol column 203 includes a location information column 213,
The identification symbol column 204 includes a location information column 214,
The identification symbol column 205 includes a location information column 215,
The identification symbol column 206 includes a location information column 216,
The identification symbol column 207 includes a location information column 217,
Are associated.
Therefore, the data set of the identification symbol 1 in the identification symbol column 201 means having a data storage device on the electronic computer on the network indicated by the location information 192.168.1.11 in the location information column 211. is doing. The identification symbols in the other identification symbol columns 202, 203, 204, 205, 206, and 207 are also associated with the location information in the other location information columns 212, 213, 214, 215, 216, and 217. A plurality of identification symbols may have the same location information. When the plurality of identification symbols have the same location information, a data management check-in or check-out is performed by a resource management unit that is a component of the data storage device as shown in claim 1. . However, as described in claim 3, this is not necessary when another data set requested to be accessed can be checked in without requiring the data set currently stored in the data storage unit to be checked out. is not.
When the example of FIG. 2 is applied to FIG. 1, the location information of the data storage device A 103 is “192.168.1.11”, and the location information of the data storage device B 104 is “192.168..11”. The location information of the data storage device C of “1.12” and 105 may be “192.168.1.13”.

図１のデータ格納装置１０３、１０４、１０５を構成する要素の一部であるリソース・マネジメント部１３２、１４２、１５２は、それぞれ対応する前記データ格納装置１０３、１０４、１０５を構成する要素の一部であるデータ格納部１３１、１４１、１５１の未使用主記憶容量と新たにアクセス要求が起っているデータ・セットの大きさに基づいて、現在データ格納部に格納されているデータ・セットを、前記データ格納装置１０３、１０４、１０５を構成する要素の一部であるチェックアウト・データ待避スペース１３３、１４３、１５３、１５４に待避させる必要の是非を評価し、待避させる必要があれば待避し、新たにアクセス要求が発生しているデータ・セットが、新規のデータ・セットであれば新規作成し、チェックアウト・データ待避スペース１３３、１４３、１５３、１５４に待避されているデータ・セットであればチェックインして召還する。前記データ・セット識別記号変換部の機能により、新たにアクセス要求が起っているデータ・セットは、既存のデータ・セットであれば必ず、アクセス要求が起っているデータ・セットが所属するデータ格納装置に接続されたチェックアウト・データ待避スペースに待避させられていることが保証される。 The resource management units 132, 142, and 152, which are part of the elements constituting the data storage apparatuses 103, 104, and 105 in FIG. 1, are part of the elements that constitute the corresponding data storage apparatuses 103, 104, and 105, respectively. Based on the unused main storage capacity of the data storage units 131, 141, and 151 and the size of the data set for which a new access request has occurred, the data set currently stored in the data storage unit is Evaluate whether or not the checkout / data saving spaces 133, 143, 153, and 154 that are part of the elements constituting the data storage devices 103, 104, and 105 need to be saved, and if necessary, save them. If the data set for which a new access request has occurred is a new data set, a new data set is created and checked out. To summon to check-in if the data sets that are saved in the chromatography data saved space 133,143,153,154. If the data set for which an access request has newly occurred is an existing data set by the function of the data set identifier conversion unit, the data to which the data set for which the access request has occurred belongs to It is guaranteed that it is saved in a checkout data saving space connected to the storage device.

前記チェックアウト・データ待避スペースは、図１のデータ格納装置Ｃ１０５に示されるチェックアウト・データ待避スペース１５３、１５４のように複数個あってもよい。 There may be a plurality of checkout / data saving spaces such as checkout / data saving spaces 153 and 154 shown in the data storage device C105 of FIG.

分割される前の元データ・セットが十分小さく、またはネットワーク上に接続された電子計算機の主記憶装置容量が十分大きく、またはネットワーク上に接続された電子計算機の台数が十分多い場合には、チェックアウト・データ待避スペースとしてのハードディスク装置等の二次記憶装置を使用せず、すべてのデータ・セットを主記憶装置上に格納してもよい。 Check if the original data set before being divided is sufficiently small, or the main storage capacity of the computer connected to the network is sufficiently large, or the number of computers connected to the network is sufficiently large All data sets may be stored on the main storage device without using a secondary storage device such as a hard disk device as an out data saving space.

図１によって示されるように、データ検索加工装置１０６の一部を構成するデータ検索部１６１は、検索キーを前記データ格納部特定装置１２２によって検索対象となるデータ・セットを特定した後、該当するデータ格納装置中のデータ・セットが検索される。特定されたデータ・セットが、データ格納装置１０３に存在するとすると、データ格納装置１０３内のデータ格納部１３１に存在するときにはそのまま、チェックアウト・データ待避スペース１３３に存在する場合には、その待避された当該データ・セットをデータ格納部１３１にチェックインして、検索を行い、加工する必要があれば、前記データ検索加工装置１０６中のデータ加工部１６２によって、検索結果を加工対象として加工する。前記データ検索加工装置１０６中のワークスペース１６３は、前記データ加工部１６２が作業領域として使用する。また、前記データ検索加工装置１０６中のリソース・マネジメント部１６４は、前記データ加工部１６２の加工作業に伴い必要量が増減する前記ワークスペース部の容量のマネジメントをおこなってもよい。 As shown in FIG. 1, the data search unit 161 constituting a part of the data search processing device 106 applies a search key after specifying a data set to be searched by the data storage unit specifying device 122. A data set in the data store is retrieved. If the specified data set exists in the data storage device 103, the data set is saved when it exists in the data storage unit 131 in the data storage device 103, and is saved when it exists in the checkout data save space 133. If it is necessary to check the data set into the data storage unit 131 for searching and processing, the data processing unit 162 in the data search processing unit 106 processes the search result as a processing target. The work space 163 in the data search processing device 106 is used as a work area by the data processing unit 162. Further, the resource management unit 164 in the data search processing device 106 may manage the capacity of the work space unit in which the required amount increases or decreases with the processing operation of the data processing unit 162.

データ出力装置１０７は、ネットワーク上の他の電子計算機へデータをエクスポートしてもよい。 The data output device 107 may export the data to other electronic computers on the network.

データ出力装置１０７は、ネットワーク上の電子計算機若しくはネットワークに接続されていない電子計算機によってフロッピー・ディスク若しくはＣＤ若しくはＤＶＤ若しくはメモリ・ディスク等の取外し可能な記憶媒体に作成して、データを前記記憶媒体によってエクスポートしてもよい。 The data output device 107 creates a removable storage medium such as a floppy disk, a CD, a DVD, or a memory disk by using an electronic computer on the network or an electronic computer not connected to the network, and uses the storage medium to create data. You may export.

データ出力装置１０７は、前記データ出力装置１０７が実装されている電子計算機上に接続されたＣＲＴ装置やプリンタ装置等の出力機器へデータを直接エクスポートしてもよい。 The data output device 107 may directly export the data to an output device such as a CRT device or a printer device connected to the computer on which the data output device 107 is mounted.

請求項１に示される、データ入力装置およびデータ格納部特定装置およびデータ格納装置およびデータ検索加工装置およびデータ出力装置は、その全部ないし一部が同一の電子計算機上に在ってもよい。 The data input device, the data storage unit specifying device, the data storage device, the data search processing device, and the data output device shown in claim 1 may be wholly or partially on the same electronic computer.

図２は、識別記号が識別記号欄２０１、２０２、２０３、２０４、２０５、２０６、２０７に格納され、ロケーション情報がロケーション情報欄２１１、２１２、２１３、２１４、２１５、２１６、２１７に格納されている図であり、請求項２に示されるように、識別記号欄、ロケーション情報欄への記載内容を変更することによって、データ・セットの物理的な格納先を変更することが可能であることを示している。 In FIG. 2, the identification symbols are stored in the identification symbol columns 201, 202, 203, 204, 205, 206, and 207, and the location information is stored in the location information columns 211, 212, 213, 214, 215, 216, and 217. As shown in claim 2, it is possible to change the physical storage location of the data set by changing the contents described in the identification symbol field and the location information field. Show.

図３は、データ格納装置３０１において、現在データ格納部３０３内にはデータ・セット３０４が存在している状況で、データ格納装置３０１にアサインされていて、チェックアウト・データ待避スペース３０５内に待避されているデータ・セットに新たにアクセス要求が発生し、リソース・マネジメント部３０２によってデータ・セット３０４をチェックアウト・データ待避スペース３０５にチェックアウトする必要があると判断された場合を示しており、データ・セット３０４をチェックアウト・データ待避スペース３０５にチェックアウトして後、前記チェックアウト・データ待避スペース３０５から前記新たにアクセス要求が発生しているデータ・セットを、データ格納部３０３にチェックインしようとしている図である。これは、請求項１に示される、一個のデータ格納装置に複数のデータ・セットをアサインすることが可能であることを示している。 FIG. 3 shows a state in which the data set 304 exists in the data storage unit 303 in the data storage device 301 and is assigned to the data storage device 301 and saved in the checkout / data saving space 305. This shows a case where a new access request is generated for the data set that has been stored and the resource management unit 302 determines that the data set 304 needs to be checked out to the checkout data save space 305, After the data set 304 is checked out to the checkout data save space 305, the data set for which the access request is newly generated from the checkout data save space 305 is checked into the data storage unit 303. FIG. This indicates that it is possible to assign a plurality of data sets to one data storage device as shown in claim 1.

図４は、データ格納装置４０１において、現在データ格納部４０３内にはデータ・セット４０４が存在している状況で、データ格納装置４０１にアサインされていて、チェックアウト・データ待避スペース４０５内に待避されているデータ・セットに新たにアクセス要求が発生し、リソース・マネジメント部４０２によってデータ・セット４０４をチェックアウト・データ待避スペース４０５にチェックアウトする必要がないと判断された場合を示しており、データ・セット４０４をチェックアウト・データ待避スペース４０５にチェックアウトすることなく、前記チェックアウト・データ待避スペース４０５から前記新たにアクセス要求が発生しているデータ・セットを、データ格納部４０３にデータ・セット４０６として、チェックインしようとしている図である。これは、請求項３に示されている、新たにチェックインしようとするデータ・セットが十分小さいか、データ格納部が十分大きい場合若しくはその両方である場合、チェックアウト・データ待避スペースの様な二次記憶装置の使用を抑制し、一個のデータ格納装置のデータ格納部に複数のデータ・セットをアサインし高速にアクセスすることが可能であることを示している。 FIG. 4 shows that the data storage device 401 has a data set 404 currently in the data storage unit 403 and is assigned to the data storage device 401 and saved in the checkout data saving space 405. This shows a case where a new access request is generated for the data set that has been stored and the resource management unit 402 determines that the data set 404 does not need to be checked out to the checkout data save space 405. Without checking out the data set 404 to the checkout data save space 405, the data set for which the access request is newly generated from the checkout data save space 405 is transferred to the data storage unit 403. Check in as set 406 Diagrams are the cornerstone. This is the case when the data set to be newly checked in is sufficiently small and / or when the data storage is sufficiently large, or both, as shown in claim 3. This shows that the use of a secondary storage device can be suppressed, and a plurality of data sets can be assigned to the data storage unit of one data storage device and accessed at high speed.

図５は、データ格納装置５０１において、リソース・マネジメント部５０２内に所属データ・セット・リスト５０３とデータ格納部未使用主記憶装置容量レジスタ５０４を配置して、データ格納部に現在存在するデータ・セットのデータ量の増減を常時監視し、データ格納部未使用主記憶装置容量を計算し、前記データ格納部未使用主記憶装置容量レジスタ５０４に格納し、新たにアクセス要求が発生したデータ・セットの現在のデータ量を所属データ・セット・リスト５０３から取出し、前記データ格納部未使用主記憶装置容量レジスタ５０４に格納された前記データ格納部未使用主記憶装置容量と比較し、未使用主記憶装置容量がチェックインしようとするデータ・セットよりも大きい場合には、請求項３で示されたように、チェックアウトをしないでチェックインすることが可能であることを図４よりも詳しく示している。 FIG. 5 shows that in the data storage device 501, the belonging data set list 503 and the data storage unit unused main storage capacity register 504 are arranged in the resource management unit 502, and the data A data set for which an access request is newly generated is generated by constantly monitoring increase / decrease in the data amount of the set, calculating an unused main storage capacity of the data storage unit, storing it in the unused main storage capacity register 504 of the data storage unit Is retrieved from the belonging data set list 503, and is compared with the data storage unit unused main storage capacity stored in the data storage unit unused main storage capacity register 504, and used main memory If the device capacity is larger than the data set to be checked in, the check Shows detail than FIG. 4 that it is possible to check in without the door.

図６は、ネットワーク６２１に配置された電子計算機６０１と電子計算機６１１がデータ格納装置６０２、６１２をそれぞれ持ち、前記データ格納装置６０２、６１２の内部にリソース・マネジメント部６０３、６１３をそれぞれ持ち、前記リソース・マネジメント部６０３、６１３の内部に識別記号とデータ・サイズの対応表６０４、６１４と割当てられた主記憶装置容量をそれぞれ持ち、電子計算機６０１と電子計算機６１１との間で情報交換することにより、請求項４に示されている、所属データ・セットのデータ・サイズとデータ格納部主記憶装置容量との関係を自動的に最適化して、所属データ・セットの配置換えを行い、データ格納部６０６，６１６に格納されているデータ・セットおよびチェックアウト・データ待避スペース６０７、６１７に格納されているデータ・セットをそれぞれ配置換えして最適化することが可能であることを示している。ここで前記情報交換を行うネットワーク上に配置された電子計算機は２台以上であってもよい。 FIG. 6 shows that an electronic computer 601 and an electronic computer 611 arranged in a network 621 have data storage devices 602 and 612, respectively, and resource management units 603 and 613 inside the data storage devices 602 and 612, respectively. The resource management units 603 and 613 have identification symbols and data size correspondence tables 604 and 614 and allocated main storage capacity, respectively, and exchange information between the electronic computer 601 and the electronic computer 611. And automatically optimizing the relationship between the data size of the belonging data set and the capacity of the data storage unit main storage device, and relocating the belonging data set. Data set and checkout data saving space stored in 606,616 It indicates that the data set stored in 07,617 can be optimized relocated respectively. Here, two or more computers may be arranged on the network for exchanging information.

図７は、前記データ・セット識別記号変換部７０１において、生産管理システムにおける部品構成表（以下ＢＯＭ）の製品番号にあたるトップ・レベル（以下Ｌ０）の部品番号を入力値として、前記入力値が３の剰余系で前記識別記号に変換され、前記データ・セット識別記号ロケーション情報変換部７０２においてロケーション情報に変換され、それぞれのロケーション情報の指し示すネットワーク上に配置された電子計算機の前記データ格納装置のデータ格納部７０３、７０４、７０５に分割された部品構成表のデータ・セットが配置されていることを示している。 FIG. 7 shows that in the data set identification symbol conversion unit 701, a top level (hereinafter referred to as L0) part number corresponding to a product number in a parts configuration table (hereinafter referred to as BOM) in the production management system is used as an input value. Of the data storage device of the electronic computer arranged on the network indicated by each location information, converted into location information by the data set identification symbol location information conversion unit 702 It shows that the data set of the parts configuration table divided in the storage units 703, 704, and 705 is arranged.

図８は、図７に示された分割された部品構成表のデータ・セットから親部品番号３、１６、１８の親部品を製品として、ＢＯＭの正展開表８０１を示したものである。図７において、親部品番号３を与えられた製品は、前記データ・セット識別記号変換部７０１によって３の剰余系として識別番号０を得る。識別番号０は、前記データ・セット識別記号ロケーション情報変換部７０２によってロケーション情報１９２．１６８．１．１０を得る。１９２．１６８．１．１０のロケーション情報を与えられた電子計算機上のデータ格納装置内のデータ格納部７０３に格納されたデータ・セット内の部品構成表の一部から部品番号８、１０、１２を子部品としていることを得る。部品番号８を親番号とする部品が前記データ格納部７０３に格納されたデータ・セット内の部品構成表の一部内には無いため、部品番号８を前記データ・セット識別記号変換部７０１の入力値として与えることにより識別記号２を得る。識別記号２から前記データ・セット識別記号ロケーション情報変換部７０２によってロケーション情報１９２．１６８．１．３０を得る。１９２．１６８．１．３０のロケーション情報を与えられた電子計算機上のデータ格納装置内のデータ格納部７０５に格納されたデータ・セット内の部品構成表の一部から部品番号１１、１３、２０を子部品としていることを得る。部品番号１１を親番号とする部品構成表のデータはデータ格納部７０５に格納されたデータ・セット内に存在するため、そのまま検索し、子部品を持たない最末端部品であることを確認する。部品番号１３を親番号とする部品はデータ格納部７０５に格納されたデータ・セット内に存在しないため、再度、部品番号１３を前記データ・セット識別記号変換部７０１の入力値として与えることにより識別記号１を得る。識別記号１から前記データ・セット識別記号ロケーション情報変換部７０２によってロケーション情報１９２．１６８．１．２０を得る。１９２．１６８．１．２０のロケーション情報を与えられた電子計算機上のデータ格納装置内のデータ格納部７０４に格納されたデータ・セット内の部品構成表の一部から子部品を持たない最末端部品であることを確認する。部品番号２０を親番号とする部品構成表のデータはデータ格納部７０５に格納されたデータ・セット内に存在するため、そのまま検索し、部品番号１７を子部品としていることを得る。部品番号１７を親番号とする部品構成表のデータはデータ格納部７０５に格納されたデータ・セット内に存在するため、そのまま検索し、子部品を持たない最末端部品であることを確認する。同様のプロセスを部品番号１０、１２を持つものについて行い、請求項５で示されているように、図８で示されたＢＯＭの正展開表８０１の中のレベル０（以下Ｌ０）の値が３のもの、すなわち部品番号３をトップ・レベルにもつ製品のＢＯＭの正展開表をえることが可能であることを示している。 FIG. 8 shows a BOM forward development table 801 using the parent parts of the parent part numbers 3, 16, and 18 as products from the data set of the divided parts configuration table shown in FIG. In FIG. 7, the product given the parent part number 3 obtains the identification number 0 as the remainder system of 3 by the data set identification symbol conversion unit 701. For the identification number 0, the data set identification symbol location information conversion unit 702 obtains location information 192.168.1.10. Part numbers 8, 10, 12 from a part of the parts configuration table in the data set stored in the data storage unit 703 in the data storage device on the electronic computer given the location information of 192.168.1.10. Get the child parts. Since the part having the part number 8 as the parent number is not included in a part of the part configuration table in the data set stored in the data storage unit 703, the part number 8 is input to the data set identification symbol conversion unit 701. The identification symbol 2 is obtained by giving it as a value. The location information 192.168.1.30 is obtained from the identification symbol 2 by the data set identification symbol location information conversion unit 702. Part numbers 11, 13, 20 from a part of the parts configuration table in the data set stored in the data storage unit 705 in the data storage device on the electronic computer given the location information of 192.168.1.30 Get the child parts. Since the data of the part configuration table having the part number 11 as the parent number exists in the data set stored in the data storage unit 705, the data is searched as it is to confirm that it is the terminal part having no child parts. Since the part having the part number 13 as the parent number does not exist in the data set stored in the data storage unit 705, the part number 13 is identified by giving the part number 13 again as the input value of the data set identification symbol conversion unit 701. The symbol 1 is obtained. The location information 192.168.1.20 is obtained from the identification symbol 1 by the data set identification symbol location information conversion unit 702. A terminal that has no child parts from a part of the parts configuration table in the data set stored in the data storage unit 704 in the data storage device on the electronic computer given the location information of 192.168.1.20 Confirm that it is a part. Since the data of the part configuration table having the part number 20 as the parent number exists in the data set stored in the data storage unit 705, the data is searched as it is and the part number 17 is obtained as the child part. Since the data of the part configuration table having the part number 17 as the parent number exists in the data set stored in the data storage unit 705, the data is searched as it is to confirm that it is the terminal part having no child parts. The same process is performed for parts having part numbers 10 and 12, and as shown in claim 5, the value of level 0 (hereinafter referred to as L0) in the BOM regular expansion table 801 shown in FIG. 3, that is, it is possible to obtain a BOM positive development table of a product having a part number 3 at the top level.

図７におけるデータ格納部７０３、７０４、７０５には、市販されている若しくはシェアウェア化されている若しくはフリーウェア化されている若しくは独自開発されたインメモリ・データベース・エンジンを代りに充ててもよい。 The data storage units 703, 704, and 705 in FIG. 7 may be replaced with commercially available, shareware, freeware, or originally developed in-memory database engines. .

ＥＵの環境問題規制強化にからみ工業生産製品、農業生産物等の有害化学物質の不使用の証明や誤って使用された場合の消費者に対する使用禁止等のメッセージ通知や回収等のためのトレーサビリティの確保のために、大容量超高速データベースの実用化が求められている。 In order to strengthen EU environmental problem regulations, traceability for proof of non-use of hazardous chemical substances such as industrial products and agricultural products, and notification and collection of messages such as prohibition of use to consumers when used incorrectly. In order to secure it, there is a demand for practical use of a large-capacity ultra high-speed database.

本発明により、超高速データベース・エンジンであるインメモリ・データベースを搭載するＰＣサーバー複数台をネットワークで接続し、小規模データベースの連携によるグリッド・コンピューティングとして、スケーラビリティをもって、大容量化することを可能にし、来るＥＵの環境問題規制強化に対処しようとしている。 With the present invention, it is possible to connect multiple PC servers equipped with an in-memory database, which is an ultra-high-speed database engine, via a network, and to scale up and scale up as grid computing by linking small databases. In the meantime, the EU is trying to cope with tightening regulations on environmental issues in the EU.

全体構成図本発明の請求項１に基づく全体構成図Overall configuration diagram Overall configuration diagram according to claim 1 of the present invention データ・セット識別記号ロケーション情報変換部の一例図本発明の請求項１および請求項２に基づくデータ・セット識別記号ロケーション情報変換部の一例図Example of Data Set Identification Symbol Location Information Conversion Unit Example of Data Set Identification Symbol Location Information Conversion Unit Based on Claims 1 and 2 of the Present Invention データ・セットのチェックアウト、チェックイン概念図本発明の請求項１に基づく、一個のデータ格納装置に複数のデータ・セットをアサインすることが可能であることを示している概念図Conceptual diagram of check-out and check-in of data set Conceptual diagram showing that a plurality of data sets can be assigned to one data storage device according to claim 1 of the present invention. データ・セットのチェックアウトを伴わないチェックイン概念図本発明の請求項３に基づく、チェックアウトをしないでチェックインすることが可能であることを示している概念図Check-in conceptual diagram without data set check-out Conceptual diagram showing that it is possible to check-in without checking out, according to claim 3 of the present invention. データ・セットのチェックアウトを伴わないチェックイン詳細概念図本発明の請求項３に基づく、チェックアウトをしないでチェックインすることが可能であることを詳しく示している詳細説明図Detailed conceptual diagram of check-in without check-out of data set Detailed explanatory diagram showing in detail that it is possible to check-in without check-out according to claim 3 of the present invention データ・セットの格納ロケーションの自動最適化機構の構成図本発明の請求項４に基づく、所属データ・セットのデータ・サイズとデータ格納部主記憶装置容量との関係を自動的に最適化計算、配置換えによる最適化を示している構成図Configuration diagram of automatic optimization mechanism of storage location of data set Based on claim 4 of the present invention, automatically optimizes the relationship between the data size of the affiliated data set and the capacity of the data storage main storage device, Configuration diagram showing optimization by relocation 分割後複数データ・セットの横断統合セルフ・ジョイン機構説明図分割された部品構成表の正展開を例とした横断統合セルフ・ジョイン機構の説明図Cross-sectional integration self-join mechanism explanatory diagram of multiple data sets after splitting ＢＯＭの正展開の一例図図７に示された例示の情報を、本発明の請求項５に基づき、分割された部品構成表の正展開をシミュレートした結果表FIG. 7 is a table showing a result of simulating the normal development of the divided parts configuration table based on claim 5 of the present invention.

Claims

Placed on the network,
One or more data input devices for inputting external data;
One or more data storage unit specifying devices for specifying the storage destination of the input data;
One or more data storage devices for storing the input data on a main storage device;
One or more data search processing devices for searching and processing the stored data;
One or more data output devices for outputting the searched and processed data to the outside;
A multi-instance in-memory database system comprising:
The data storage unit specifying device includes:
Using part or all of the information of the input data as a data set identification symbol for specifying the data set where the input data is stored, using an algorithm such as hashing with specific parameters A data set identifier conversion unit for the input data to be converted;
A data set identification symbol location information conversion unit of the data set that associates data set location information specifying the location of the data set corresponding to the identification symbol and the data set identification symbol;
Have
The data storage device includes:
In the data storage device specified by the data storage unit location information, a data storage unit for storing the input data in a data set existing on a main storage device on the data storage device;
A secondary storage device or network that currently has the same data set location information as the data set present on the main storage device on the data storage device and is currently connected to the data storage device When an access request such as insertion or extraction occurs in another data set saved on the main storage device or the secondary storage device on the other electronic computer above, the current data storage device The data set residing on the main storage device is saved on a secondary storage device connected to the data storage device, on a main storage device on another electronic computer on the network, or on a secondary storage device. On a secondary storage device currently connected to the data storage device or on a main storage device or a secondary storage device on another electronic computer on the network And resource management portion in which the access request has been avoided to summon a data set is generated,
Have
The data search processing device includes:
Data search in which one or more data sets on one or more data storage devices are traversed while specifying the data set to be searched using the same algorithm with the same parameters as the algorithm And
The data specified and extracted by the data search unit is changed or deleted while specifying the data set to be processed using the same algorithm as the algorithm with the same parameters, or one or more of the data A data processing unit that performs data processing across data sets such as joins, projections, sorts, and merges of data groups across sets;
A work space part which is a main storage space provided for the processing of the data search part or the data processing part;
When resources of the main storage space provided to the data storage unit, the data search unit, the data processing unit, and the work space unit are insufficient, a secondary storage device such as a hard disk device or another electronic computer on the network A resource management unit that performs resource management by using resources of main storage devices and secondary storage devices of
Having
Multi-instance in-memory database system.

The data storage unit specifying device has a function of changing the location information of the data set identification symbol location information conversion unit of the data set, the main storage capacity of the data storage device and the size of the stored data set The multi-instance in-memory database system according to claim 1, wherein the relation can be optimized and rearranged.

The data storage device is attempting to newly create another data set having the same location information as the data set currently stored on the main storage device on the data storage device, or When an access request is made to a data set saved on a secondary storage device connected to the data storage device, on a main storage device on another electronic computer on the network, or on a secondary storage device The data set currently stored on the main storage device on the data storage device is stored on the secondary storage device currently connected to the data storage device or on the other electronic computer on the network. There is enough main storage capacity in the data storage device to create or recall without saving on the device or secondary storage device In this case, one or more data sets can be stored on the main storage device on the data storage device without saving the data set currently stored on the main storage device on the data storage device. The multi-instance in-memory database system of claim 1, which is possible.

Monitor the capacity of each main storage device and the size of each data set on one or more data storage devices placed on the network so that the number of evacuation or checkout, summoning or checkin is minimized. 4. The multi-instance in-memory database system according to claim 1, wherein the location information is automatically rearranged.

When the data set is referenced from another data set or another tuple of its own data set, the data set of the data set referenced by the attribute data of the referring tuple 4. The join result can be created as a new data set by using the input data to the set identification symbol conversion unit. 5. The multi-instance in-memory database system according to 4.