JP7174372B2

JP7174372B2 - Data management method, device and program in distributed storage network

Info

Publication number: JP7174372B2
Application number: JP2020027917A
Authority: JP
Inventors: 操片岡; 博史野口; 卓万磯田; 勉村瀬; 智稀伊藤
Original assignee: Nippon Telegraph and Telephone Corp; Tokai National Higher Education and Research System NUC
Current assignee: Nippon Telegraph and Telephone Corp; Tokai National Higher Education and Research System NUC
Priority date: 2020-02-21
Filing date: 2020-02-21
Publication date: 2022-11-17
Anticipated expiration: 2040-02-21
Also published as: JP2021131796A

Description

本発明は、データをネットワーク上の複数のストレージに分散して記憶する分散ストレージネットワークの技術に関する。 The present invention relates to distributed storage network technology for distributing and storing data in a plurality of storages on a network.

近年、ＩｏＴ（Internet of Things）技術の急速な発展によりＩｏＴ機器が時々刻々生成する大量のデータがネットワークに流入してきている。このような大量のデータの一例としては、１０ｆｐｓ監視カメラ映像のデータが挙げられる。これらのデータはリアルタイムで処理するだけでなくサーバのストレージに蓄積・保管することにより、例えばＡＩの学習データとして用いるなど有効な活用が期待されている。そこで、サービス充実を目的としてこれらの蓄積データをオープン化するＯｐｅｎＩｏＴというプラットフォームが提唱されている。 In recent years, due to the rapid development of IoT (Internet of Things) technology, a large amount of data generated by IoT devices is flowing into networks. An example of such a large amount of data is 10 fps surveillance camera video data. It is expected that these data will not only be processed in real time, but will also be accumulated and stored in the storage of servers for effective utilization, such as use as learning data for AI, for example. Therefore, a platform called OpenIoT has been proposed for making such accumulated data open for the purpose of enhancing services.

ところで、従来の通信アーキテクチャでは、ストレージに保管されているデータを取得するためには、データが保管されているサーバのアドレス情報を取得し、このサーバにアクセスする必要がある。これに対して、近年、データそのものを示す識別子（コンテンツ名など）を利用し、この識別子を指定することによりネットワークからデータを取得する情報指向ネットワーク技術（ＩＣＮ：Information-Centric Networking）というアーキテクチャが提案されている（非特許文献１参照）。 By the way, in the conventional communication architecture, in order to obtain the data stored in the storage, it is necessary to obtain the address information of the server in which the data is stored and to access this server. On the other hand, in recent years, an architecture called Information-Centric Networking (ICN) has been proposed, which uses an identifier (such as a content name) that indicates the data itself, and acquires data from the network by specifying this identifier. (See Non-Patent Document 1).

このようなアーキテクチャは、あらゆるデータを、ひとつの仮想的なストレージ（＝仮想ストレージネットワーク）で収容するという思想に基づくものである。非特許文献１ではＭｏｂｉｌｉｔｙＦｉｒｓｔと呼ばれる技術が提案されている。 Such an architecture is based on the idea of accommodating all data in a single virtual storage (that is, a virtual storage network). Non-Patent Document 1 proposes a technology called MobilityFirst.

ＭｏｂｉｌｉｔｙＦｉｒｓｔでは、ネットワークを構成する各ノードであるルータは、それぞれストレージを備えている。Ｐｕｂｌｉｓｈｅｒと呼ばれるデータ送信端末がデータをネットワークに投入すると、当該データはネットワークの何れかのノードのストレージに保存される。データを取得する際には、Ｓｕｂｓｃｒｉｂｅｒと呼ばれるデータ要求端末が、データの識別子を指定して管理サーバに当該データの位置情報を問い合わせ、取得した位置情報を用いてネットワーク内のノードからデータを取得する。ここで、ネットワーク内においてデータ要求端末に提供されるデータの通信経路上の各ノードは、データの中継処理を行うとともに自身のストレージに当該データをキャッシュとして記憶する。各ノードでは、ＬＲＵ（Least Recently Used）などのキャッシュ管理アルゴリズムにより、自身のストレージに記憶しているキャッシュを管理する。このような構成により、仮想ストレージネットワークには同一の多数のデータが分散して保管される。そして、あるデータについて利用頻度の高いデータ要求端末とネットワーク的距離が近いノードに当該データがキャッシュされている可能性が高いので、低遅延を実現できる。 In MobilityFirst, each router, which is a node of the network, has its own storage. When a data transmission terminal called a Publisher injects data into the network, the data is stored in the storage of one of the nodes of the network. To acquire data, a data requesting terminal called a Subscriber specifies the identifier of the data, queries a management server for the location information of the data, and uses the returned location information to acquire the data from a node in the network. Each node on the communication path of the data delivered to the data requesting terminal relays the data and also stores it as a cache in its own storage. Each node manages the cache held in its own storage with a cache management algorithm such as LRU (Least Recently Used). With this configuration, many identical copies of the data are stored in a distributed manner in the virtual storage network. Because a given piece of data is likely to be cached at a node that is close, in network distance, to the data requesting terminals that use it frequently, low latency can be achieved.

なお、同一データを複数のノードに分散して記憶するという技術に関しては、非特許文献２に記載されたＷｅｂプロキシキャッシュという技術も知られている。Ｗｅｂプロキシキャッシュでは、明示的あるいは暗示的に指定したストレージを経由する経路を構成し、経路上の全てのストレージにキャッシュを置く。これにより、データ要求端末は自身の近くのサーバからデータを取得することができる。 As for the technique of distributing and storing the same data in a plurality of nodes, a technique called Web proxy cache described in Non-Patent Document 2 is also known. A Web proxy cache constructs a route via explicitly or implicitly specified storages, and places caches in all the storages on the route. This allows the data requesting terminal to acquire data from a server near itself.

Non-Patent Document 1: Dipankar Raychaudhuri, et al., "MobilityFirst: a robust and trustworthy mobility-centric architecture for the future Internet", ACM SIGMOBILE Mobile Computing and Communications Review, 2012.
Non-Patent Document 2: Nikolaos Laoutaris, et al., "Meta Algorithms for Hierarchical Web Caches", [online], [retrieved on January 8, 2020], Internet <URL: http://cgi.di.uoa.gr/~laoutar/lcdt.pdf>

仮想ストレージネットワークに格納するデータの数及び容量は増加する一方であるが、従来の技術では、上述のようにデータの通信経路上にある全てのノードにおいてデータのキャッシュを記憶するので、ネットワーク全体でのストレージの利用効率が低いという問題がある。 The number and volume of data stored in virtual storage networks keep increasing. In the conventional technology, however, a cache of the data is stored in every node on the data communication path as described above, so storage utilization efficiency across the network as a whole is low.

すなわち、従来の技術では、人気がなく将来的なアクセスが期待されないデータであっても、データの通信経路上にある全てのノードにおいてデータのキャッシュが記憶される。このため、人気があり将来的なアクセスが期待されるデータを記憶するための領域が実質的に小さくなってしまう。また、各ノードにおけるキャッシュ管理では、あるデータについて仮想ストレージネットワーク全体での同一データの保管数は考慮されていない。この保管数が多いデータは実質的に記憶容量を無駄にしている無駄なデータであると考えられる。したがって、この無駄なデータが多数存在することにより、人気があり将来的なアクセスが期待されるデータを記憶するための領域が実質的に小さくなってしまう。そして、人気があり将来的なアクセスが期待されるデータについてキャッシュのヒット率を維持するには、より大きな容量のストレージが必要になるという問題がある。 That is, in the conventional technology, even data that is not popular and is not expected to be accessed in the future is cached in all nodes on the data communication path. This effectively reduces the area for storing data that is popular and expected to be accessed in the future. Moreover, cache management in each node does not consider the number of identical data stored in the entire virtual storage network for certain data. This data with a large number of storages is considered to be useless data that substantially wastes the storage capacity. Therefore, the presence of this large amount of wasted data effectively reduces the area for storing data that is popular and expected to be accessed in the future. Then there is the problem that a larger storage capacity is required to maintain a cache hit rate for data that is popular and expected to be accessed in the future.

ところで、仮想ストレージネットワークに格納するデータの数及び容量の増加に伴い、ストレージとして、応答速度は早いが容量が相対的に小さく費用コストも高いストレージと、応答速度は遅いが容量が相対的に大きく費用コストも低いストレージとを併用することが考えられている。前者は例えばＳＳＤ（Solid State Drive）や高速なＨＤＤ（Hard Disk Drive）が相当する。後者は例えば低速なＨＤＤやテープや光学ディスクなどが相当する。本明細書では、前者をホットストレージと呼び、後者をコールドストレージと呼ぶものとする。 Incidentally, as the number and volume of data stored in virtual storage networks increase, it is being considered to combine storage that responds quickly but has relatively small capacity and high cost with storage that responds slowly but has relatively large capacity and low cost. The former corresponds to, for example, SSDs (Solid State Drives) and high-speed HDDs (Hard Disk Drives). The latter corresponds to, for example, low-speed HDDs, tapes, and optical discs. In this specification, the former is called hot storage and the latter is called cold storage.

従来の技術においてこのようなストレージ構成を採用した場合、各ノードはホットストレージをキャッシュとして利用し、ＬＲＵなどのキャッシュ管理によりホットストレージから削除すべきデータはコールドストレージに保管する処理を行う。ここで、コールドストレージに保管されたデータへのアクセスは、ホットストレージに保管されたデータへのアクセスよりも応答時間が遅いものとなる。このため、データが何れかのノードのコールドストレージに保管されているが、何れのノードのホットストレージにも保管されていない場合には、コールドストレージにアクセスする必要がある。このため、応答時間が遅いものとなるだけでなく、許容時間内にデータ取得できないものとして、その補償として経済的損失が発生することが考えられる。 When such a storage configuration is adopted in the conventional technology, each node uses the hot storage as a cache and stores data to be deleted from the hot storage in the cold storage by cache management such as LRU. Here, access to data stored in cold storage has a slower response time than access to data stored in hot storage. Therefore, if data is stored in the cold storage of any node, but is not stored in the hot storage of any node, it is necessary to access the cold storage. For this reason, not only is the response time slow, but it is also possible that data cannot be acquired within the allowable time, resulting in financial loss as compensation.

本発明は上記事情に鑑みてなされたものであり、その目的とするところは、分散ストレージネットワークにおいて、ストレージの利用効率及び応答速度が良好なデータ管理方法、装置、プログラムを提供することにある。 The present invention has been made in view of the above circumstances, and its object is to provide a data management method, device, and program that achieve good storage utilization efficiency and response speed in a distributed storage network.

上記目的を達成するために、本願発明は、第１のストレージと、前記第１のストレージよりも応答速度が低い第２のストレージと、前記第１のストレージ及び前記第２のストレージに格納するデータの配置を制御するデータ配置制御手段とを備えた複数の情報処理装置を相互にネットワークで接続して仮想的なストレージを形成した分散ストレージネットワークにおけるデータ管理方法であって、分散ストレージネットワーク内のデータに対するユーザ端末からのアクセスに対して、前記ユーザ端末から前記データが格納された情報処理装置に至る通信経路上にある複数の情報処理装置のデータ配置制御手段が、前記通信経路上にある複数の情報処理装置を複数のクラスタに分割するとともに、各クラスタにおいて当該クラスタに属する情報処理装置の何れか１つの第１のストレージに前記データをキャッシュとして記憶するキャッシュ記憶ステップと、各情報処理装置のデータ配置制御手段が、第１のストレージに記憶されている各データについて所定の規則に基づき評価値を算出し、最低評価値のデータと同一のデータが分散ストレージネットワーク内の他の情報処理装置の第１のストレージに保存されている場合は前記最低評価値のデータを第１のストレージから削除し、前記最低評価値のデータと同一のデータが分散ストレージネットワーク内の他の情報処理装置の第１のストレージに保存されていない場合は前記最低評価値のデータを第１のストレージから第２のストレージ又は他の情報処理装置の第１のストレージに移動させるデータ配置ステップとを備えたことを特徴とする。 To achieve the above object, the present invention provides a data management method in a distributed storage network in which a plurality of information processing devices, each comprising a first storage, a second storage having a lower response speed than the first storage, and data placement control means for controlling the placement of data stored in the first storage and the second storage, are interconnected via a network to form a virtual storage. The method comprises a cache storage step and a data placement step. In the cache storage step, in response to an access from a user terminal to data in the distributed storage network, the data placement control means of the plurality of information processing devices on the communication path from the user terminal to the information processing device storing the data divide the information processing devices on the communication path into a plurality of clusters and, in each cluster, store the data as a cache in the first storage of any one of the information processing devices belonging to that cluster. In the data placement step, the data placement control means of each information processing device calculates an evaluation value for each piece of data stored in its first storage based on a predetermined rule; if data identical to the data with the lowest evaluation value is stored in the first storage of another information processing device in the distributed storage network, it deletes the data with the lowest evaluation value from the first storage; if no such identical data is stored in the first storage of any other information processing device in the distributed storage network, it moves the data with the lowest evaluation value from the first storage to the second storage or to the first storage of another information processing device.

また、本願発明は、第１のストレージと、前記第１のストレージよりも応答速度が低い第２のストレージと、前記第１のストレージ及び前記第２のストレージに格納するデータの配置を制御するデータ配置制御手段とを備えた複数の情報処理装置を相互にネットワークで接続して仮想的なストレージを形成した分散ストレージネットワークにおける情報処理装置であって、前記データ配置制御手段は、分散ストレージネットワーク内のデータに対するユーザ端末からのアクセスに対して、前記ユーザ端末から前記データが格納された情報処理装置に至る通信経路上にある複数の情報処理装置を複数のクラスタに分割する分割処理手段と、各クラスタについて当該クラスタに属する情報処理装置のうち前記データをキャッシュとして記憶する１つの情報処理装置を決定するとともに自身に前記データを記憶すると決定した場合には第１のストレージに前記データをキャッシュとして記憶する配置先決定処理手段と、第１のストレージに記憶されている各データについて所定の規則に基づき評価値を算出する評価値算出手段と、最低評価値のデータと同一のデータが分散ストレージネットワーク内の他の情報処理装置の第１のストレージに保存されている場合は前記最低評価値のデータを第１のストレージから削除し、前記最低評価値のデータと同一のデータが分散ストレージネットワーク内の他の情報処理装置の第１のストレージに保存されていない場合は前記最低評価値のデータを第１のストレージから第２のストレージ又は他の情報処理装置の第１のストレージに移動させるデータ配置処理手段とを備えたことを特徴とする。 The present invention also provides an information processing device in a distributed storage network in which a plurality of information processing devices, each comprising a first storage, a second storage having a lower response speed than the first storage, and data placement control means for controlling the placement of data stored in the first storage and the second storage, are interconnected via a network to form a virtual storage. The data placement control means comprises: division processing means that, in response to an access from a user terminal to data in the distributed storage network, divides the plurality of information processing devices on the communication path from the user terminal to the information processing device storing the data into a plurality of clusters; placement destination determination processing means that determines, for each cluster, the one information processing device among those belonging to the cluster that is to store the data as a cache, and that stores the data as a cache in the first storage when the device itself is determined to store the data; evaluation value calculation means that calculates an evaluation value for each piece of data stored in the first storage based on a predetermined rule; and data placement processing means that deletes the data with the lowest evaluation value from the first storage if identical data is stored in the first storage of another information processing device in the distributed storage network, and that moves the data with the lowest evaluation value from the first storage to the second storage or to the first storage of another information processing device if no identical data is stored in the first storage of any other information processing device in the distributed storage network.

本発明によれば、ユーザ端末がアクセスしたデータは、その通信経路上にある全ての情報処理装置ではなく、通信経路上にある情報処理装置が複数のクラスタに分割され、各クラスタ内の１つの情報処理装置のストレージにキャッシュとして記憶されるので、ストレージの利用効率が向上する。また、本発明によれば、分散ストレージネットワークの全ての第１のストレージに１つだけ存在するデータ（以下「ラストワンデータ」と呼ぶ）については、当該データを保管している情報処理装置においてキャッシュ管理の評価値が低いものであっても、所定の条件で他の情報処理装置の第１のストレージに移動する。これにより、応答速度の遅い第２のストレージにアクセスする機会が減り、全体として応答速度が向上する。 According to the present invention, data accessed by a user terminal is not cached in every information processing device on the communication path. Instead, the information processing devices on the path are divided into a plurality of clusters and the data is stored as a cache in the storage of one information processing device in each cluster, which improves storage utilization efficiency. Further, according to the present invention, data of which only one copy exists across all first storages of the distributed storage network (hereinafter "last-one data") is moved, under a predetermined condition, to the first storage of another information processing device even if its cache-management evaluation value is low in the device that holds it. This reduces the opportunities to access the slow second storage and improves the overall response speed.

FIG. 1: 前提となる仮想ストレージネットワークのシステム構成図 / System configuration diagram of the prerequisite virtual storage network
FIG. 2: 仮想ストレージネットワークにおけるデータ管理方法の概要を説明する図 / Diagram explaining the outline of the data management method in the virtual storage network
FIG. 3: キャッシュ数とコストの相関グラフ / Correlation graph of the number of caches and cost
FIG. 4: データ配置に係る動作を説明するフローチャート / Flowchart explaining operations related to data placement
FIG. 5: 経路上ノードクラスタリングを説明する図 / Diagram explaining on-path node clustering
FIG. 6: 変形ＬＲＵの動作を説明するフローチャート / Flowchart explaining the operation of the modified LRU
FIG. 7: 変形ＬＲＵを説明する図 / Diagram explaining the modified LRU
FIG. 8: 変形ＬＲＵを説明する図 / Diagram explaining the modified LRU
FIG. 9: 実施例に係る情報処理装置の機能ブロック図 / Functional block diagram of an information processing device according to an example
FIG. 10: 他の実施形態に係る仮想ストレージネットワークのシステム構成図 / System configuration diagram of a virtual storage network according to another embodiment
FIG. 11: 他の実施形態の実施例に係る情報処理装置の機能ブロック図 / Functional block diagram of an information processing device according to an example of another embodiment

本発明の一実施の形態に係る仮想ストレージネットワークについて説明する。本発明では、最適なコスト・利便性性能を提供するデータ保存先の決定を目標としており、以下の４点を性能指標とする（ただし後述するネットワークインパクトは制約とする）。 A virtual storage network according to one embodiment of the present invention will be described. In the present invention, the goal is to determine a data storage destination that provides optimal cost/convenience performance, and the following four points are used as performance indicators (however, network impact, which will be described later, is a constraint).

（１）ストレージ間のデータ移動の通信コスト（データ移動通信コスト）
(1) Communication cost of moving data between storages (data movement communication cost)
これは、後述するようにデータ配置を変更するとき、物理ストレージ間でデータを移動するために発生する通信距離・サイズに応じた通信コストである。なお、このデータ移動通信コストは電力コストの一種でもある。 This is the communication cost, which depends on communication distance and data size, incurred when data is moved between physical storages to change the data placement as described later. This data movement communication cost is also a kind of power cost.

（２）Ｓｕｂｓｃｒｉｂｅｒのデータ取得の通信コスト（データ取得通信コスト）
(2) Communication cost of data acquisition by the Subscriber (data acquisition communication cost)
これは、ホットストレージにアクセスしたとき、ホットストレージからＳｕｂｓｃｒｉｂｅｒにデータを転送するために発生する通信距離・サイズに応じた通信コストである。なお、このデータ取得通信コストは電力コストの一種でもある。 This is the communication cost, which depends on communication distance and data size, incurred when data is transferred from hot storage to the Subscriber upon an access to the hot storage. This data acquisition communication cost is also a kind of power cost.

（３）データを許容時間内に取得できないときの経済的損失（コールドペナルティ）
(3) Economic loss when data cannot be acquired within the allowable time (cold penalty)
これは、応答速度が遅いコールドストレージにアクセスすると、許容時間内にデータ取得できないものとし、その補償として発生するものと仮定した経済的損失である。なお、このコールドペナルティは運用コストの一種である。 This is an economic loss assumed to arise as compensation, on the premise that accessing slow cold storage means the data cannot be acquired within the allowable time. This cold penalty is a kind of operating cost.

（４）データ配置を変更時のトラヒックが引き起こすネットワークへの悪影響（ネットワークインパクト）
(4) Adverse effect on the network caused by traffic when the data placement is changed (network impact)
これは、他サービスとのネットワーク共用を前提とすると、ネットワークに対して及ぼす、背景トラヒックの通信品質の劣化や、ネットワークの回線容量の不足等の悪影響である。 Assuming that the network is shared with other services, this is the adverse effect exerted on the network, such as degraded communication quality of background traffic and insufficient network link capacity.

本実施の形態において前提となる仮想ストレージネットワークのシステム構成について図１を参照して説明する。図１は前提となる仮想ストレージネットワークのシステム構成図である。本実施の形態における仮想ストレージネットワークはＩＣＮのアーキテクチャを採用する。 The system configuration of the virtual storage network assumed in this embodiment will be described with reference to FIG. 1. FIG. 1 is a system configuration diagram of the assumed virtual storage network. The virtual storage network in this embodiment adopts the ICN architecture.

図１に示すように、ネットワーク１０はノードとして複数の情報処理装置１００を含む。各情報処理装置１００は、ホットストレージ２１０とコールドストレージ２２０とを備えている。本発明では複数の情報処理装置１００のうち何れか１つ又は複数の情報処理装置１００のホットストレージ２１０又はコールドストレージ２２０にデータが保管される。すなわち、ネットワーク１０が１つの仮想的なストレージとして機能する。なお、ネットワーク１０は、他サービスと共用することができる。 As shown in FIG. 1, network 10 includes a plurality of information processing devices 100 as nodes. Each information processing device 100 comprises a hot storage 210 and a cold storage 220 . In the present invention, data is stored in the hot storage 210 or the cold storage 220 of any one or a plurality of information processing apparatuses 100 among the plurality of information processing apparatuses 100 . That is, the network 10 functions as one virtual storage. Note that the network 10 can be shared with other services.

情報処理装置１００は、他の情報処理装置１００とネットワーク回線（リンク）を介して通信可能に接続する。ネットワーク１０における情報処理装置１００の接続形態に係るネットワークトポロジーは不問である。情報処理装置１００は、ネットワーク１０においてデータを中継するルータとして実装することができる。情報処理装置１００の具体的な構成例については実施例として後述する。 The information processing device 100 is communicably connected to other information processing devices 100 via network lines (links). The network topology related to the form of connection of the information processing apparatus 100 in the network 10 is irrelevant. The information processing device 100 can be implemented as a router that relays data in the network 10 . A specific configuration example of the information processing apparatus 100 will be described later as an embodiment.

Ｐｕｂｌｉｓｈｅｒと呼ばれるデータ送信端末１がデータをネットワーク１０に投入（ｓｔｏｒｅ）すると、当該データはネットワーク１０の何れかの情報処理装置１００のホットストレージ２１０に保存される。 When a data transmission terminal 1 called a publisher stores data in the network 10 , the data is stored in the hot storage 210 of any information processing device 100 on the network 10 .

仮想ストレージネットワーク内においてデータは、所定のデータ配置制御処理により複数の情報処理装置１００に保存される。ここで、データは、データ送信端末１が保存したオリジナルのデータと、データ配置制御処理においてキャッシュとして複製された１以上のデータ（レプリカとも言う）とは同一のものであり両者は区別されない。また、情報処理装置１００に保存されているデータは、所定のデータ配置制御処理により他の情報処理装置１００に移動することができる。ネットワーク１０内のどの情報処理装置１００にデータが保存されているかの対応情報は、ＧＮＲＳ（Global Name Resolution Service）と呼ばれる管理サーバ３により管理される。 Within the virtual storage network, data is stored in a plurality of information processing apparatuses 100 by predetermined data placement control processing. Here, the original data stored by the data transmission terminal 1 and the one or more pieces of data (also referred to as replicas) replicated as a cache in the data allocation control process are the same, and the two are indistinguishable. Data stored in the information processing apparatus 100 can be moved to another information processing apparatus 100 by a predetermined data arrangement control process. Correspondence information as to which information processing apparatus 100 in the network 10 stores data is managed by a management server 3 called GNRS (Global Name Resolution Service).

Ｓｕｂｓｃｒｉｂｅｒと呼ばれるデータ要求端末２がデータを取得するには、まず、データの識別子（ＩＤ）を指定して管理サーバ３に当該データの位置情報（ｌｏｃａｔｉｏｎｓ）を問い合わせる。管理サーバ３は当該データの位置情報を回答する。データ要求端末２は、当該位置情報を参照してネットワーク１０にアクセスすることにより所望のデータを取得（ｒｅｔｒｉｖｅ）する。ここで、データが仮想ストレージネットワーク内において複数の情報処理装置１００に保存されている場合、管理サーバ３は各データについての位置情報をデータ要求端末２に回答する。データ要求端末２は、複数の位置情報から任意の位置情報を選択して、当該位置情報を参照してデータを取得することができる。例えば、データ要求端末２は、ネットワーク距離が最も近い情報処理装置１００からデータを取得することができる。 In order for the data requesting terminal 2 called a Subscriber to acquire data, first, it specifies the identifier (ID) of the data and inquires of the management server 3 about the location information (locations) of the data. The management server 3 replies with the location information of the data. The data requesting terminal 2 retrieves desired data by accessing the network 10 with reference to the location information. Here, when data is stored in a plurality of information processing apparatuses 100 within a virtual storage network, the management server 3 replies to the data request terminal 2 with location information about each data. The data requesting terminal 2 can select arbitrary position information from a plurality of position information and obtain data by referring to the position information. For example, the data requesting terminal 2 can acquire data from the information processing device 100 with the closest network distance.
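
The retrieval flow described above (query the management server by data ID, receive one or more locations, pick one) can be illustrated with a minimal Python sketch. The table contents, node names, and the hop-count distances are hypothetical and chosen only for illustration; the specification does not define the GNRS interface or how network distance is measured.

```python
from typing import Dict, List

# Hypothetical stand-in for the management server (GNRS) mapping:
# data ID -> names of the information processing devices holding a copy.
GNRS_TABLE: Dict[str, List[str]] = {
    "video-001": ["node-a", "node-c", "node-e"],
}

# Hypothetical network distances (e.g. hop counts) seen from the requesting terminal.
DISTANCE_FROM_TERMINAL: Dict[str, int] = {"node-a": 5, "node-c": 3, "node-e": 1}


def resolve_locations(data_id: str) -> List[str]:
    """Return every location registered for data_id (empty list if unknown)."""
    return list(GNRS_TABLE.get(data_id, []))


def choose_nearest(locations: List[str], distance: Dict[str, int]) -> str:
    """Pick the location with the smallest network distance."""
    if not locations:
        raise LookupError("no location registered for the requested data")
    return min(locations, key=lambda node: distance[node])


# The terminal resolves the ID, then retrieves from the nearest holder.
nearest = choose_nearest(resolve_locations("video-001"), DISTANCE_FROM_TERMINAL)
```

Choosing the nearest holder is only one possible policy; the text states that the terminal may select any of the returned locations.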

次に、本願発明の概要について図２を参照して説明する。図２は本発明に係る仮想ストレージネットワークにおけるデータ管理方法の概要を説明する図である。なお各図において、例えば情報処理装置など、複数存在する同一の構成要素を個別に識別するために参照符号の末尾に添え字を付すものとして、複数存在する同一の構成要素を総称する際には添え字を取り除いた参照符号を用いるものとする。また、ホットストレージに格納されているデータのうちラストワンデータについてはハッチングを付している。 Next, an outline of the present invention will be described with reference to FIG. 2. FIG. 2 is a diagram explaining the outline of the data management method in the virtual storage network according to the present invention. In each figure, a suffix is appended to the reference numeral to individually identify identical constituent elements that exist in plurality, such as the information processing devices, and the reference numeral without the suffix is used when referring to such elements collectively. Of the data stored in the hot storage, last-one data is shown hatched.

従来の技術では遠くからアクセスされると経路上に大量のキャッシュが作成されることになるが、前述したように、「遠くからアクセスされたデータ＝ホットストレージに大量にキャッシュを残すべきデータ」ではない。本願発明はこの点に注目し、（１）経路上の全ノードのストレージにキャッシュを置くことはせず、経路上のノードをアクセス頻度等によってまびく、（２）ラストワンデータは仮想ストレージネットワーク内のホットストレージ全体で管理、その他のキャッシュは各ストレージ内で管理し、ラストワンデータがコールドストレージへ降格されにくくする、という２つの特徴点を有している。 In the conventional technology, an access from far away creates a large number of caches along the path. As noted above, however, it does not follow that "data accessed from far away" is "data of which many caches should be kept in hot storage". The present invention focuses on this point and has two features: (1) caches are not placed in the storage of every node on the path; instead, the nodes on the path are thinned out according to access frequency and the like; and (2) last-one data is managed across all the hot storage in the virtual storage network while other caches are managed within each storage, so that last-one data is less likely to be demoted to cold storage.

前述した特徴点（１）について図２を例として説明する。ここでは、データ要求端末２が、情報処理装置１００ａのホットストレージ２１０ａに保存されているデータを取得することを考える。取得するデータはラストワンデータであってもそれ以外のデータであってもよい。データ要求端末２と情報処理装置１００ａとの間の通信経路上には複数の情報処理装置１００ｂ～１００ｅが存在する。 The aforementioned characteristic point (1) will be described with reference to FIG. 2 as an example. Here, it is assumed that the data request terminal 2 acquires data stored in the hot storage 210a of the information processing device 100a. The data to be acquired may be last-one data or other data. A plurality of information processing devices 100b to 100e exist on the communication path between the data request terminal 2 and the information processing device 100a.

図２の例では、情報処理装置１００ａのホットストレージ２１０ａに格納されておりデータ要求端末２の所望するデータが情報処理装置１００ｂ～１００ｅにより中継される際に、経路上にある全てのホットストレージ２１０ｂ～２１０ｅを２つの仮想ストレージと捉え、各仮想ストレージに１つずつ当該データをキャッシュとして記憶する。換言すれば、通信経路上にある複数の情報処理装置１００ｂ～１００ｅを複数のクラスタ４００ａ，４００ｂに分割するとともに、各クラスタ４００ａ，４００ｂにおいて当該クラスタ４００ａ，４００ｂに属する情報処理装置１００ｂ～ｃ，１００ｄ～ｅの何れか１つのホットストレージ２１０ｂ又は２１０ｃ，ホットストレージ２１０ｄ又は２１０ｅに前記データをキャッシュとして記憶する。 In the example of FIG. 2, when the data requested by the data requesting terminal 2, which is stored in the hot storage 210a of the information processing device 100a, is relayed by the information processing devices 100b to 100e, all the hot storages 210b to 210e on the path are regarded as two virtual storages, and the data is stored as a cache once in each virtual storage. In other words, the information processing devices 100b to 100e on the communication path are divided into clusters 400a and 400b, and in each cluster the data is stored as a cache in the hot storage of any one of the devices belonging to it: hot storage 210b or 210c for cluster 400a (devices 100b and 100c), and hot storage 210d or 210e for cluster 400b (devices 100d and 100e).

前述した特徴点（２）について図２を例として説明する。各情報処理装置１００では、ホットストレージ２１０のすべての格納データは、有用性に関する後述する評価値に基づき降順に整列されたリスト構造で保持するものとする。そして、各情報処理装置１００では、ラストワンデータを他にも同一データがあるデータより優先する。また、ラストワンデータについては、自ホットストレージ２１０から追い出され他ホットストレージ２１０にもいれさせてもらえない場合はコールドストレージ２２０に降格する。 Feature (2) described above will be explained using FIG. 2 as an example. In each information processing device 100, all data stored in the hot storage 210 is held in a list structure sorted in descending order of an evaluation value of usefulness, which is described later. Each information processing device 100 gives last-one data priority over data of which other identical copies exist. Last-one data is demoted to the cold storage 220 when it is evicted from its own hot storage 210 and is not accepted by any other hot storage 210.
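
The handling of the lowest-ranked data under feature (2) can be summarized as a three-way decision. The following Python sketch is only an illustration under stated assumptions: the function name, the action labels, and the `accepting_nodes` argument (the set of other devices whose hot storage would accept the data) are hypothetical, since the conditions under which another hot storage accepts data are not specified in this passage.

```python
from typing import Dict, Set


def place_evicted(
    data_id: str,
    own_node: str,
    hot_holders: Dict[str, Set[str]],
    accepting_nodes: Set[str],
) -> str:
    """Decide what to do with the lowest-ranked data in own_node's hot storage.

    hot_holders maps a data ID to the devices holding it in hot storage.
    accepting_nodes is an assumed input: devices willing to take the data.
    """
    other_copies = hot_holders.get(data_id, set()) - {own_node}
    if other_copies:
        # An identical copy exists in another hot storage: simply delete.
        return "delete"
    if accepting_nodes - {own_node}:
        # Last-one data: keep it hot by moving it to another device.
        return "move_to_other_hot"
    # Last-one data that no other hot storage accepts: demote to cold storage.
    return "demote_to_cold"
```

For example, data held in hot storage by both `n1` and `n2` is deleted when `n1` evicts it, whereas data held only by `n1` is moved or demoted depending on whether some other device accepts it.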

以上のような本願発明の特徴点と従来技術の差違について説明する。前述のＭｏｂｉｌｉｔｙＦｉｒｓｔではホットストレージ／コールドストレージの違いはなく、「キャッシュがユーザからいかに近いか」を重要視している。一方、本願発明で前提としている仮想ストレージネットワークではホットストレージとコールドストレージを備えており、ラストワンデータが、コスト、利便性が大きく異なるどちらのストレージにデータが存在するかも重要視している。すなわち、ラストワンデータを気軽にコールドストレージに降格できないので従来技術ほど気軽に新規キャッシュを置くとコストがかかることに注目している。ホットストレージにおけるキャッシュ数とコストとの相関性はおおよそ図３に示すような曲線を示す。本願発明では図３の曲線の底部付近を狙うものである。なお、コストとは、費用コストだけでなく、通信コスト、電力コストなどを含む総合的なコストを意味する点に留意されたい。 The differences between the above features of the present invention and the conventional technology are as follows. MobilityFirst, described above, makes no distinction between hot storage and cold storage and attaches importance to "how close the cache is to the user". In contrast, the virtual storage network assumed by the present invention has both hot storage and cold storage, and also attaches importance to which of these two storages, which differ greatly in cost and convenience, the last-one data resides in. That is, the present invention focuses on the fact that, because last-one data cannot be casually demoted to cold storage, placing new caches as casually as in the conventional technology incurs cost. The correlation between the number of caches in hot storage and the cost roughly follows the curve shown in FIG. 3. The present invention aims at the vicinity of the bottom of the curve in FIG. 3. Note that "cost" here means not only monetary cost but the total cost including communication cost, power cost, and the like.

次に、上述した本願発明の特徴点についてより詳細に説明する。まず、上記特徴点（１）の具体例である経路上ノードクラスタリングについて図４及び図５を参照して説明する。図４はデータ配置に係る動作を説明するフローチャート、図５は経路上ノードクラスタリングを説明する図である。 Next, the features of the present invention described above will be explained in more detail. First, on-path node clustering, which is a specific example of feature (1), will be described with reference to FIGS. 4 and 5. FIG. 4 is a flowchart explaining operations related to data placement, and FIG. 5 is a diagram explaining on-path node clustering.

本願発明では、まず、あるデータｘ_ｎ（ｘ_１，…ｘ_ｎ，…ｘ_Ｎは仮想ストレージネットワーク内のホットストレージ２１０に格納された同一データ）へのアクセス頻度によって、クラスタ数を決定する（図４のステップＳ１）。クラスタ数の計算例としては次式（１）が挙げられる。 In the present invention, the number of clusters is first determined according to the frequency of access to certain data x_n (x_1, ..., x_n, ..., x_N are the identical copies of the data stored in the hot storages 210 in the virtual storage network) (step S1 in FIG. 4). The following Equation (1) is one example of how the number of clusters may be calculated.

１／（ω_１＊データｘ全体へのアクセス頻度＋ω_２＊データｘ_ｎへのアクセス頻度） …式（１）
$$\frac{1}{\omega_1 \cdot (\text{frequency of access to data } x \text{ as a whole}) + \omega_2 \cdot (\text{frequency of access to data } x_n)} \quad \text{... Equation (1)}$$
ここで、ω_１及びω_２は定数である。図５の例では、クラスタ数は２である。 Here, ω_1 and ω_2 are constants. In the example of FIG. 5, the number of clusters is two.
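
As a worked illustration of Equation (1), the Python sketch below evaluates the expression and turns it into a usable cluster count. The rounding to an integer and the clamping to the range from 1 to the number of nodes on the path are assumptions added here so that the result can be used directly; the specification gives only the expression itself. The function and parameter names are likewise hypothetical.

```python
def cluster_count(
    freq_all: float,   # frequency of access to data x as a whole
    freq_copy: float,  # frequency of access to the copy x_n
    w1: float,         # constant omega_1
    w2: float,         # constant omega_2
    path_len: int,     # number of nodes on the path (assumed upper bound)
) -> int:
    """Evaluate Equation (1), then round and clamp (both steps are assumptions)."""
    denominator = w1 * freq_all + w2 * freq_copy
    if denominator <= 0:
        raise ValueError("weighted access frequency must be positive")
    raw = 1.0 / denominator
    return max(1, min(path_len, round(raw)))
```

With `w1 = 0.125`, `w2 = 0.25`, `freq_all = 2.0`, and `freq_copy = 1.0`, the denominator is 0.5 and the raw value is 2, matching the two clusters of the FIG. 5 example for a four-node path.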

次に、情報処理装置１００間のリンク帯域が小さい順にクラスタの分割ポイントとし、決定したクラスタ数で分割する（図４のステップＳ２）。図５の例では、情報処理装置１００ｃと情報処理装置１００ｄの間のリンク帯域が、他の情報処理装置１００間のリンク帯域よりも小さいものとする。この場合、情報処理装置１００ｃと情報処理装置１００ｄの間が分割ポイントとなる。これにより、情報処理装置１００ｂ及び１００ｃからなるクラスタ４００ａと、情報処理装置１００ｄ及び１００ｅからなるクラスタ４００ｂが設定される。 Next, the links between the information processing devices 100 are taken as cluster division points in ascending order of link bandwidth, and the path is divided into the determined number of clusters (step S2 in FIG. 4). In the example of FIG. 5, the link bandwidth between the information processing device 100c and the information processing device 100d is assumed to be smaller than the link bandwidth between the other information processing devices 100. In this case, the division point lies between the information processing device 100c and the information processing device 100d. As a result, a cluster 400a consisting of the information processing devices 100b and 100c and a cluster 400b consisting of the information processing devices 100d and 100e are set.
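
Step S2 can be sketched as cutting the path at its narrowest links. The Python function below is a minimal illustration, assuming the path is given as an ordered list of node names and that `link_bandwidth[i]` is the bandwidth of the link between `nodes[i]` and `nodes[i + 1]`; these names and the bandwidth values in the example are hypothetical.

```python
from typing import List, Sequence


def split_path(
    nodes: Sequence[str],
    link_bandwidth: Sequence[float],
    n_clusters: int,
) -> List[List[str]]:
    """Split an ordered path into n_clusters groups at the narrowest links."""
    if len(link_bandwidth) != len(nodes) - 1:
        raise ValueError("need exactly one bandwidth value per link")
    if not 1 <= n_clusters <= len(nodes):
        raise ValueError("cluster count must be between 1 and the node count")
    # Take the (n_clusters - 1) links with the smallest bandwidth ...
    narrowest = sorted(range(len(link_bandwidth)), key=lambda i: link_bandwidth[i])
    # ... and put the chosen cut positions back into path order.
    cut_links = sorted(narrowest[: n_clusters - 1])
    clusters: List[List[str]] = []
    start = 0
    for i in cut_links:
        clusters.append(list(nodes[start : i + 1]))
        start = i + 1
    clusters.append(list(nodes[start:]))
    return clusters


# FIG. 5 style example: the 100c-100d link is the narrowest one.
example = split_path(["100b", "100c", "100d", "100e"], [10.0, 1.0, 10.0], 2)
```

In this example the single cut falls between 100c and 100d, giving the clusters [100b, 100c] and [100d, 100e].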

次に、各クラスタ４００の中においては、クラスタ４００に属する各情報処理装置１００について評価値Ｐ１（ｘ）を算出し、当該評価値Ｐ１（ｘ）を最大とする情報処理装置１００のホットストレージ２１０にデータを配置する（図４のステップＳ３）。評価値Ｐ１（ｘ）は、データの各種属性情報に基づき算出することができる。例えば、次式（２）により評価値Ｐ１（ｘ）を算出する。 Next, within each cluster 400, the evaluation value P1(x) is calculated for each information processing device 100 belonging to the cluster 400, and the data is placed in the hot storage 210 of the information processing device 100 that maximizes the evaluation value P1(x) (step S3 in FIG. 4). The evaluation value P1(x) can be calculated based on various attribute information of the data. For example, the evaluation value P1(x) is calculated by the following Equation (2).

\[
P_1(x) = \omega_3 \cdot \frac{\text{(access frequency)}}{\text{(average access distance from users)}} + \omega_4 \cdot \text{(remaining storage capacity)} \tag{2}
\]

where \omega_3 and \omega_4 are constants. In the example of FIG. 5, the data is stored in the hot storage 210c of information processing device 100c for cluster 400a and in the hot storage 210e of information processing device 100e for cluster 400b.
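The placement decision of step S3 can be sketched as below. The `NodeStats` record and the function names are hypothetical, and the per-device inputs (access frequency, average access distance, remaining capacity) are assumed to have been collected already.

```python
from dataclasses import dataclass


@dataclass
class NodeStats:
    name: str
    access_frequency: float
    avg_access_distance: float
    remaining_capacity: float


def p1(node, w3, w4):
    """Equation (2): placement score of one device for the data."""
    return (w3 * node.access_frequency / node.avg_access_distance
            + w4 * node.remaining_capacity)


def choose_cache_node(cluster, w3, w4):
    """Return the device in the cluster with the largest P1."""
    return max(cluster, key=lambda n: p1(n, w3, w4))
```

A device that is closer to its users or has more free hot storage scores higher, so it is preferred as the cache location within its cluster.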

Next, the modified LRU, which is a specific example of feature (2) above, will be described with reference to FIGS. 6 to 8. FIG. 6 is a flowchart explaining the operation of the modified LRU, and FIGS. 7 and 8 are diagrams explaining the modified LRU.

In the modified LRU according to the present invention, the data stored in the hot storage 210 is held in a list structure sorted in descending order of an evaluation value P2(x) that represents usefulness. P2(x) is updated continually and serves as the score of data x used to decide deletion, forced migration, and similar actions. It can be calculated from various attributes of the data, for example by Equation (3):

\[
P_2(x) = \frac{\omega_5}{\text{(number of identical copies)}} + \omega_6 \cdot \frac{\text{(access frequency)}}{\text{(average access distance from users)}} - \omega_7 \cdot \bigl(\text{(current access time)} - \text{(last access time)}\bigr) \tag{3}
\]

where \omega_5, \omega_6, and \omega_7 are constants.
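A direct transcription of Equation (3) might look like the following sketch. The function and parameter names are chosen for this example only, and the time values are assumed to be plain numbers in a common unit.

```python
def p2(num_copies, access_frequency, avg_access_distance,
       now, last_access, w5, w6, w7):
    """Equation (3): usefulness score of a data item in hot storage."""
    return (w5 / num_copies
            + w6 * access_frequency / avg_access_distance
            - w7 * (now - last_access))
```

Because the first term is inversely proportional to the number of identical copies, a data item with a single copy scores higher than an otherwise identical item with several copies, and the last term lowers the score as the item goes unaccessed.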

When an information processing device 100 stores data that is new to it in the hot storage 210, it calculates the evaluation value P2(x) and stores the data so that the list remains ordered by P2(x) (steps S11 and S13 in FIG. 6; FIG. 7(a)). In other words, to insert the new data into the middle of the modified LRU list, the pointers at the appropriate position in the list structure are relinked according to the usefulness of the new data. If the hot storage 210 does not have enough free space to store the new data, free space is secured by the following algorithm (steps S12 and S14 to S18 in FIG. 6).
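The ordered insertion can be sketched as follows. Note that this example swaps in a sorted array with binary search for the pointer-based list structure the description mentions, purely for brevity; the class name `ModifiedLRUList` and its methods are hypothetical.

```python
import bisect


class ModifiedLRUList:
    """Keeps data names sorted in descending order of P2."""

    def __init__(self):
        self._neg_scores = []  # ascending -P2, i.e. descending P2
        self._names = []

    def insert(self, name, score):
        """Insert at the position given by the score; return that index."""
        pos = bisect.bisect_right(self._neg_scores, -score)
        self._neg_scores.insert(pos, -score)
        self._names.insert(pos, name)
        return pos

    def names(self):
        return list(self._names)

    def lowest(self):
        """Name of the eviction candidate (smallest P2)."""
        return self._names[-1]
```

Unlike a classic LRU, which always places a new entry at the head, a new entry here may land in the middle of the list if its score is moderate.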

If the data with the lowest evaluation value P2(x) among the data stored in the hot storage 210 is not last-one data, the same data is also stored in the hot storage of another information processing device 100, so the information processing device 100 deletes it from its own hot storage 210 (steps S14 and S15 in FIG. 6; FIG. 7(b)).

If, on the other hand, the data with the lowest evaluation value P2(x) is last-one data, the information processing device 100 decides whether to forcibly move it, taking into account the hot storages 210 of the entire virtual storage network (step S16 in FIG. 6). The evaluation values P2(x) held by each information processing device 100 can be used for this decision. Specifically, if another information processing device 100 stores data whose evaluation value P2(x) is lower than that of the last-one data, the device decides to forcibly move the last-one data to that other information processing device 100. Having decided to forcibly move it, the information processing device 100 moves the last-one data to the other information processing device 100 (steps S16 and S18 in FIG. 6; FIG. 7(d)). The destination information processing device 100 then evicts the data whose evaluation value P2(x) is lower than that of the last-one data to its own cold storage 220. If the information processing device 100 decides not to forcibly move the last-one data, it moves the data to its own cold storage 220 (steps S16 and S17 in FIG. 6; FIG. 7(c)). Moving data from the hot storage 210 to the cold storage 220 is also referred to as "demoting" or "archiving" the data.
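The branch structure of steps S14 to S18 can be summarized in the sketch below. The names are hypothetical, and `lowest_p2_on_peers` is assumed to map each other device to the lowest P2 currently in its hot storage. When several devices qualify, the description does not say which one receives the data; this example picks the one holding the lowest score.

```python
from enum import Enum
from typing import Dict, Optional, Tuple


class Action(Enum):
    DELETE = "delete"            # step S15
    FORCED_MOVE = "forced_move"  # step S18
    ARCHIVE = "archive"          # demote to own cold storage


def decide_eviction(victim_p2: float,
                    copies_elsewhere: int,
                    lowest_p2_on_peers: Dict[str, float]
                    ) -> Tuple[Action, Optional[str]]:
    """Decide what to do with the lowest-scored item in hot storage."""
    if copies_elsewhere > 0:
        # Not last-one data: another hot storage still holds a copy.
        return Action.DELETE, None
    candidates = {peer: s for peer, s in lowest_p2_on_peers.items()
                  if s < victim_p2}
    if candidates:
        # Assumption: choose the peer whose lowest score is smallest.
        return Action.FORCED_MOVE, min(candidates, key=candidates.get)
    return Action.ARCHIVE, None
```

The destination device would then demote its own lower-scored data to cold storage to make room, which is not modeled here.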

The parameters needed to calculate the number of cluster divisions, the evaluation value P1(x) used to decide where to place data within a cluster, and the evaluation value P2(x) for the usefulness of data in the hot storage 210 are taken from the device itself whenever it holds them, that is, whenever they are local to its own node. Parameters that are not local to the node are obtained by exchanging information between nodes. The management server 3 manages which information processing device 100 stores each data item. Therefore, to obtain, for example, the access frequency of data in another hot storage 210, a device first obtains the location information of the data from the management server 3 and then queries the information processing device 100 that stores the data, based on that location information.

An embodiment of the information processing device 100 will be described with reference to FIG. 9, which is a functional block diagram of the information processing device according to the embodiment.

The information processing device 100 includes a data storage unit 200, which has a hot storage 210 and a cold storage 220, and a data placement control unit 300.

The data storage unit 200 has a function of relaying data received from one information processing device 100 to yet another information processing device 100. When the data request terminal 2 requests access to data stored in the unit's own hot storage 210 or cold storage 220, the data storage unit 200 transmits that data to the data request terminal 2. The data storage unit 200 also stores relayed data in the hot storage 210 when instructed to do so by the data placement control unit 300.

The hot storage 210 is a nonvolatile storage medium with a fast response speed but a relatively small capacity and a high cost. Examples of the hot storage 210 include SSDs and high-speed HDDs. The cold storage 220 is a nonvolatile storage medium with a slow response speed but a relatively large capacity and a low cost. Examples of the cold storage 220 include low-speed HDDs, tape, and optical disks. The cold storage 220 may be configured so that its media can be attached and detached automatically or manually.

The data placement control unit 300 includes an access unevenness evaluation unit 310, a modified-LRU-based usefulness evaluation unit 320, and a control operation execution unit 330. The control operation execution unit 330 includes a cache placement operation execution unit 331, a forced migration operation execution unit 332, a deletion operation execution unit 333, and an archive operation execution unit 334.

The access unevenness evaluation unit 310 acquires and evaluates access information on accesses from the data request terminal 2 to each data item stored in the virtual storage network. Access information on accesses to the unit's own information processing device 100 is collected locally, and access information on accesses to other information processing devices 100 is collected from those devices.

The modified-LRU-based usefulness evaluation unit 320 calculates the usefulness evaluation value P2(x) described above.

The cache placement operation execution unit 331 performs cache operations on, and cache management of, the data in the data storage unit 200, and provides the functions of features (1) and (2) described above. It includes a cluster division unit 331a and an intra-cluster placement destination storage determination unit 331b.

The cluster division unit 331a divides the information processing devices 100 on the path into a plurality of clusters 400, as described above. The intra-cluster placement destination storage determination unit 331b determines, within each of the resulting clusters 400, the information processing device 100 in which the data is to be placed, as described above.

The forced migration operation execution unit 332 forcibly moves data from the device's own hot storage 210 to the hot storage 210 of another information processing device 100. The deletion operation execution unit 333 deletes data from the device's own hot storage 210. The archive operation execution unit 334 moves data from the device's own hot storage 210 to its cold storage 220.

The information processing device 100 is a conventional information processing device equipped with a main processing unit, memory, a network interface, and the like. The form of implementation is not restricted. For example, the units of the information processing device 100 may be distributed across a plurality of devices. Each unit may be implemented by installing a program on a general-purpose computer, as dedicated hardware, or as a combination of the two.

As described above, according to the present invention, data accessed by the data request terminal 2 is not cached in every information processing device 100 on the communication path. Instead, the information processing devices 100 on the path are divided into a plurality of clusters 400, and the data is stored as a cache in the hot storage 210 of one information processing device 100 in each cluster 400, which improves storage utilization efficiency. Furthermore, last-one data is moved, under predetermined conditions, to the hot storage 210 of another information processing device 100 even if its cache-management evaluation value is low in the information processing device 100 that holds it. This reduces the number of occasions on which the slow cold storage 220 must be accessed and improves overall response speed.

Although one embodiment of the present invention has been described in detail above, the present invention is not limited to that embodiment, and various improvements and modifications may be made without departing from the gist of the invention.

For example, in the embodiment above, the information needed for on-path node clustering and for modified LRU processing is obtained from the device's own information processing device 100 or from other information processing devices 100. Alternatively, this information may be managed centrally by a separate information management device, and each information processing device 100 may obtain it from that information management device. Part of the on-path node clustering and modified LRU processing may also be performed by the separate information management device, in which case the information needed for that part of the processing is obtained from each information processing device 100. The information management device may be deployed anywhere in the network, provided it can communicate with each information processing device 100. An example of such an embodiment will be described with reference to FIGS. 10 and 11.

In this embodiment, as shown in FIG. 10, a forced migration candidate determination device 5, which corresponds to the information management device described above, is deployed so that it can communicate with each information processing device 100. The forced migration candidate determination device 5 centrally manages the ranking of forced migration candidates in the hot storage 210 of each information processing device 100. More specifically, it manages, for each node, the score of the data that is most likely to be demoted to cold storage or deleted from the cache. Each information processing device 100 can decide whether to forcibly move data to another information processing device 100 by querying the forced migration candidate determination device 5. An example of the information processing device 100 in this embodiment is shown in FIG. 11.
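One way to picture the role of the forced migration candidate determination device 5 is as a small registry of per-node lowest scores, as in the sketch below. The class, its methods, and the rule for choosing among several qualifying nodes are assumptions made for this example; the description specifies only that the device manages these scores and answers queries.

```python
from typing import Dict, Optional


class ForcedMigrationCandidateRegistry:
    """Tracks, per node, the lowest P2 currently held in hot storage."""

    def __init__(self) -> None:
        self._lowest: Dict[str, float] = {}

    def report(self, node: str, lowest_p2: float) -> None:
        """Called by a node whenever its lowest score changes."""
        self._lowest[node] = lowest_p2

    def find_target(self, requester: str,
                    last_one_p2: float) -> Optional[str]:
        """Return a node holding data scored below last_one_p2, if any."""
        candidates = {n: s for n, s in self._lowest.items()
                      if n != requester and s < last_one_p2}
        if not candidates:
            return None  # requester should archive to its cold storage
        # Assumption: prefer the node whose lowest score is smallest.
        return min(candidates, key=candidates.get)
```

A node holding last-one data would call `find_target` with its own name and the data's score, then either forcibly move the data to the returned node or archive it locally when `None` is returned.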

In each of the embodiments above, the formulas used to calculate the number of cluster divisions, the evaluation value P1(x) for deciding where to place data within a cluster, and the evaluation value P2(x) for the usefulness of data in the hot storage 210 are only examples, and other formulas and parameters may be used. Any data attributes describing link conditions, data characteristics, the storage conditions at the cache destination, or the network topology may be used as parameters. Examples include: link bandwidth; link utilization and QoS; the distance from the main path to the storage; storage capacity; the number of users at the storage location; past history, such as whether replicas were created before; the magnitude of the Age of Information (AoI); the rate of increase or decrease in the number of replicas; the number of Publishers connected to the storage; the throughput of original data stored in the storage; data size; the number of topology branches (number of adjacent nodes); the cumulative (past) number of accesses; the estimated (future) number of accesses; access frequency; the most recent access interval; the share of storage capacity occupied by redundantly placed data; whether the data is the latest data from its source Publisher (whether the AoI is below the data generation interval); and, when the data in use is redundantly placed data, where the same data is currently placed.

DESCRIPTION OF SYMBOLS
1… Data transmission terminal
2… Data request terminal
3… Management server
10… Network
100… Information processing device
210… Hot storage
220… Cold storage
300… Data placement control unit
400… Cluster

Claims

A data management method in a distributed storage network in which a plurality of information processing devices are interconnected via a network to form a virtual storage, each information processing device comprising a first storage, a second storage having a slower response speed than the first storage, and data arrangement control means for controlling the arrangement of data stored in the first storage and the second storage, the method comprising:
a cache storage step in which, in response to access from a user terminal to data in the distributed storage network, the data arrangement control means of the plurality of information processing devices on a communication path from the user terminal to the information processing device in which the data is stored divide the plurality of information processing devices on the communication path into a plurality of clusters and, in each cluster, store the data as a cache in the first storage of any one of the information processing devices belonging to that cluster; and
a data arrangement step in which the data arrangement control means of each information processing device calculates, based on a predetermined rule, an evaluation value for each data item stored in the first storage, deletes the data with the lowest evaluation value from the first storage if the same data is stored in the first storage of another information processing device in the distributed storage network, and moves the data with the lowest evaluation value from the first storage to the second storage or to the first storage of another information processing device if the same data is not stored in the first storage of any other information processing device in the distributed storage network.

The data management method in a distributed storage network according to claim 1, wherein the cache storage step comprises: a step of calculating the number of clusters into which the information processing devices are divided; and a step of calculating the division positions of the clusters based on that number and on link information between the plurality of information processing devices on the communication path.

The data management method in a distributed storage network according to claim 1 or 2, wherein, in the data arrangement step, if data with an evaluation value lower than said lowest evaluation value is stored in the first storage of another information processing device, the data with the lowest evaluation value is moved from the first storage to the first storage of that other information processing device, and if no data with an evaluation value lower than said lowest evaluation value is stored in the first storage of another information processing device, the data with the lowest evaluation value is moved from the first storage to the second storage.

The data management method in a distributed storage network according to claim 3, wherein, when data is moved in from another information processing device, the data arrangement control means moves data with an evaluation value lower than that of the moved-in data from the first storage to the second storage.

An information processing device in a distributed storage network in which a plurality of information processing devices are interconnected via a network to form a virtual storage, the information processing device comprising a first storage, a second storage having a slower response speed than the first storage, and data arrangement control means for controlling the arrangement of data stored in the first storage and the second storage,
wherein the data arrangement control means comprises:
division processing means for dividing, in response to access from a user terminal to data in the distributed storage network, the plurality of information processing devices on a communication path from the user terminal to the information processing device in which the data is stored into a plurality of clusters;
placement destination determination processing means for determining, for each cluster, one of the information processing devices belonging to the cluster in which the data is to be stored as a cache, and for storing the data as a cache in the first storage when the device itself is determined to be the destination;
evaluation value calculation means for calculating, based on a predetermined rule, an evaluation value for each data item stored in the first storage; and
data arrangement processing means for deleting the data with the lowest evaluation value from the first storage if the same data is stored in the first storage of another information processing device in the distributed storage network, and for moving the data with the lowest evaluation value from the first storage to the second storage or to the first storage of another information processing device if the same data is not stored in the first storage of any other information processing device in the distributed storage network.

A program that causes a computer to function as the data arrangement control means according to claim 5.