WO2021224960A1

WO2021224960A1 - Storage device, storage method, and program

Info

Publication number: WO2021224960A1
Application number: PCT/JP2020/018517
Authority: WO
Inventors: ホンジェジェイ; 圭大村
Original assignee: 日本電信電話株式会社
Priority date: 2020-05-07
Filing date: 2020-05-07
Publication date: 2021-11-11

Abstract

A data storage system according to the present embodiment divides a group of data into a plurality of blocks and stores the blocks. The data storage system comprises: storage 20 which divides data into blocks according to feature quantities and stores the blocks; a feature quantity calculation unit 11 which calculates feature quantities of input data; a storage unit 12 which temporarily retains data in data clusters according to feature quantities; and a management unit 13 which, for each data cluster, writes the data retained in the data cluster to a corresponding block with a prescribed timing.

Description

Preservation device, storage method, and program

The present invention relates to a storage device, a storage method, and a program.

With the evolution of mobile terminals and the spread of social networking services (SNS), a wide variety of large amounts of data have come to be collected. The wide variety and large amount of data collected is called big data. By analyzing big data, it can be useful for people's lives and businesses.

In big data processing, data is searched and extracted according to the analysis task for the data collected from various data sources. In big data processing, the data to be searched / extracted is often performed in units of attributes (columns). Therefore, for big data processing, column-oriented storage that can store data continuously in the column direction and acquire data in column units is suitable. As a column-oriented storage, Parquet, which implements the technique of Non-Patent Document 1, is known.

In column-oriented storage, the larger the column data itself, the lower the reading efficiency. Parquet divides the column data into a plurality of blocks (LowGroup) and holds the feature amount of each block. The feature quantity is, for example, the minimum and maximum values of the data in the block, the hash value of the binary, or the vector representation of the character string. By comparing the feature amount included in the data search / extraction query with the feature amount of the block, it can be determined whether or not the requested data exists in the block. If the requested data does not exist in the block, the entire block can be skipped, so improvement in reading efficiency can be expected. For example, the column data that stores the age is divided into three blocks and stored, and the minimum and maximum values of the data in the first block are 19 and 31, and the minimum value of the data in the second block. It is assumed that the maximum values are 10 and 25, and the minimum and maximum values of the data in the third block are 35 and 70. When acquiring the data of 31 years old or older from this column data, since the second block does not include the data of 31 years old or older, the second block may be skipped and the first and third blocks may be read.

However, if the blocks are simply divided into blocks of a certain size, the data properties tend to be uniform among the blocks, and there is a problem that the skip effect is low.

Therefore, it is considered effective to combine data with similar properties into the same block. However, if new data arrives, the data is distributed to the corresponding blocks and written, a large amount of disk I / O is generated, and the writing performance is deteriorated.

In addition, it is conceivable to keep new data in a temporary file and recreate blocks at regular intervals, but the latest data cannot be used until the data held in the temporary file is taken into the block (storage), so real-time analysis is performed. Can not do it.

The present invention has been made in view of the above, and an object of the present invention is to realize efficient reading and writing of data.

The storage device of one aspect of the present invention is a storage device that divides a data group into a plurality of blocks and stores the data, and stores the data by dividing the data into the blocks according to the feature amount, and the characteristics of the input data. A calculation unit for calculating the amount, a storage unit for temporarily holding the data in a cluster according to the feature amount, and a block corresponding to the data held by the cluster for each cluster at a predetermined timing. It has a management unit for writing.

One aspect of the storage method of the present invention is a storage method in which a data group is divided into a plurality of blocks and stored, and the storage divides the data into the blocks according to the feature amount and stores the data. The feature amount of the input data is calculated, the data is temporarily held in the cluster corresponding to the feature amount, and the data held by the cluster is written to the corresponding block for each of the clusters at a predetermined timing. ..

According to the present invention, efficient reading and writing of data can be realized.

FIG. 1 is a diagram showing an example of the configuration of the data storage system of the present embodiment. FIG. 2 is a diagram for explaining the flow of data. FIG. 3 is a diagram showing an example of a storage data structure. FIG. 4 is a flowchart showing an example of the processing flow when new data is input. FIG. 5 is a diagram showing a state in which a data cluster is added. FIG. 6 is a flowchart showing an example of a flow of processing for storing data in the storage. FIG. 7 is a diagram showing a state in which a new block is added to the storage. FIG. 8 is a diagram showing an example of the hardware configuration of the cache device.

Hereinafter, embodiments of the present invention will be described with reference to the drawings.

The configuration of the data storage system of the present embodiment will be described with reference to FIG. The data storage system of FIG. 1 includes a cache device 10 and a storage 20, and stores column data having the same attributes. Further, the data storage system returns the data stored in the storage 20 in response to the query.

The cache device 10 includes a feature amount calculation unit 11, a storage unit 12, and a management unit 13. The cache device 10 calculates the feature amount of the newly written input data, temporarily holds the feature amount in the data cluster according to the feature amount of the input data, and when a predetermined write condition is satisfied, for each data cluster, The data held by the data cluster is collectively written to the storage 20.

The feature amount calculation unit 11 has a function of calculating the feature amount of the input data and distributing the input data to the data cluster based on the feature amount. The feature amount is information for allocating data to a plurality of data clusters. The feature quantity is, for example, information indicating a range of numerical values when the data is a numerical value, a hash value of the binary data when the data is binary data, and a vector representation of the character string when the data is a character string. ..

The storage unit 12 has a plurality of areas for holding the cluster feature amount and the data cluster, and has a function of temporarily holding the input data. The cluster feature is the feature of the data held by the corresponding data cluster. The data cluster holds the data. The data cluster is made to correspond to the block held by the storage 20, and the cluster feature amount is made to correspond to the feature amount of the block. The storage unit 12 may use a memory in order to maintain the write performance. Any storage medium may be used as the storage unit 12.

The management unit 13 has a function of writing the data held by the storage unit 12 to the storage 20 based on the write conditions. The writing conditions can be arbitrarily set according to the desired processing. For example, when obtaining write performance, the write condition is the passage of a certain period of time or the size of the data cluster. When real-time processing is required, the writing condition is the occurrence of a data search / extraction request.

The storage unit 12 may adjust the data cluster according to the input data. For example, when the number of data held by a certain data cluster becomes unevenly large, the storage unit 12 may create a new data cluster for holding these data. When writing the data to the storage 20, the management unit 13 creates a block corresponding to the newly generated data cluster, and writes the data of the data cluster to the block.

The storage 20 includes a metadata unit 21 and a data unit 22. The metadata unit 21 and the data unit 22 may be recorded in the same area in the storage 20.

The data unit 22 holds data for each block. Data can be read from the data unit 22 in block units.

The metadata unit 21 holds the feature amount of each block. The feature amount is the same as the cluster feature amount used by the cache device 10. In order to maximize the reading efficiency, the features should be ordered.

The metadata unit 21 may hold an index of each data held by the data unit 22. The index is used for associating with the data of other data columns.

Here, the flow of input data will be described with reference to FIG. FIG. 2 illustrates the input data, the data cluster of the storage unit 12 of the cache device 10, and the metadata unit 21 and the data unit 22 of the storage 20. The data storage system shall store numerical data indicating age.

Input data is temporarily held in a data cluster according to the amount of data features. In the example of FIG. 2, the cluster features indicate a numerical range. Specifically, the cluster features are classified into five categories: 15-19, 20-26, 20-29, 30-39, and 40-79. The corresponding data cluster holds data within the range indicated by the cluster features. If the cluster feature is in the numerical range represented by the minimum and maximum values, even if the number of data values minus the minimum value is saved in the data cluster as the minimum number of bits obtained by subtracting the minimum value from the maximum value. good. For example, for data stored in a data cluster with cluster features of 15-19, 15 is 0 (0b000), 16 is 1 (0b001), 17 is 2 (0b010), 18 is 3 (0b011), and 19 is 4. Save as (0b100) in the data cluster.

The data in the data cluster is stored in the storage 20 at a predetermined timing.

The metadata unit 21 holds the minimum and maximum values of the data stored in each block as feature quantities. The minimum value and the maximum value correspond to the cluster feature amount of the storage unit 12.

The data unit 22 holds the data indicated by the feature amount of the metadata unit 21 for each block. The data temporarily held in the data cluster is collectively added to the corresponding block for each data cluster.

An example of the data structure of the storage 20 will be described with reference to FIG. When the feature amount obtained by dividing a block into numerical data is set to the minimum value and the maximum value of the numerical value, it is conceivable to use a binary search tree with each of the minimum value and the maximum value as a node and a block as an edge. For example, in the example of FIG. 3, a block in which data of 20 or more and less than 30 is stored is assigned to an edge connecting 30 nodes and 20 nodes. You can add new blocks by adding nodes and edges.

Next, the operation when the cache device 10 inputs new data will be described with reference to the flowchart of FIG. The process of FIG. 4 is executed when new data is input.

In step S11, the cache device 10 calculates the feature amount of the input data.

In step S12, the cache device 10 compares the data feature amount with the cluster feature amount, and identifies the data cluster according to the feature amount. For example, when the data to be input is a numerical value and the cluster feature amount is indicated by a numerical range, the data cluster in the numerical range including the numerical value of the data is set as the data cluster for storing the data. When there are a plurality of applicable data clusters, the data cluster that stores the data may be specified according to the number of data in the data cluster, the number of data in the corresponding block, and the like.

In step S13, the cache device 10 determines whether or not there is a data cluster that stores the input data. Further, in step S13, the cache device 10 may determine that a new data cluster is created according to the data stored in the data cluster. For example, as shown in FIG. 5, when a large amount of data having a value of 70 or more is input, a new data cluster for storing 70 or more data is created.

If the data cluster does not exist, the cache device 10 creates a new data cluster in step S14. If the data cluster does not exist, the cache device 10 may change the cluster feature amount in step S14. For example, in the example of FIG. 5, when data having a value of 14 is input, the cluster feature amount of 15-19 is changed to 14-19.

In step S15, the cache device 10 stores the data in the corresponding data cluster.

In step S16, the cache device 10 updates the cluster feature amount using the data added to the data cluster.

Next, the operation of saving data in the storage 20 will be described with reference to the flowchart of FIG. The process of FIG. 6 is executed at the timing of writing data to the storage 20.

In step S21, the cache device 10 writes the cluster feature amount to the metadata unit 21. When a new data cluster has been created, the cluster feature amount of the new data cluster is written in the metadata unit 21. If the data structure of the storage 20 is a binary search tree, add new nodes and edges as shown in FIG. In the example of FIG. 7, a node having a value of 70 was added, and an edge (block) for storing data having a minimum value of 70 and a maximum value of 79 was added.

In step S22, the cache device 10 adds data to the block of the data unit 22 for each data cluster.

In step S23, the cache device 10 deletes the data in the data cluster of the storage unit 12.

As described above, the data storage system of the present embodiment is a data storage system that divides a data group into a plurality of blocks and stores the data. The data storage system temporarily stores the data in a storage 20 that stores the data in blocks according to the feature amount, a feature amount calculation unit 11 that calculates the feature amount of the input data, and a data cluster according to the feature amount. A storage unit 12 to be held in the data cluster and a management unit 13 to write the data held by the data cluster to the corresponding block for each data cluster at a predetermined timing are provided. As a result, the input data is temporarily held in the data cluster according to the feature amount of the data, and the data is collectively written to the storage 20 for each data cluster at a predetermined timing, so that the data writing disk I / O is performed. Can be reduced.

In the data storage system of the present embodiment, the write performance can be obtained by writing data to the storage 20 by the cache device 10 after a certain period of time elapses or when the size of the data cluster becomes a predetermined value or more. When data is requested, the cache device 10 writes the data to the storage 20 to support real-time processing.

The cache device 10 described above includes, for example, a central processing unit (CPU) 901, a memory 902, a storage 903, a communication device 904, an input device 905, and an output device 906, as shown in FIG. A general-purpose computer system including the above can be used. In this computer system, the cache device 10 is realized by the CPU 901 executing a predetermined program loaded on the memory 902. This program can be recorded on a computer-readable recording medium such as a magnetic disk, an optical disk, or a semiconductor memory, or can be distributed via a network.

10 ... Cache device 11 ... Feature calculation unit 12 ... Storage unit 13 ... Management unit 20 ... Storage 21 ... Metadata unit 22 ... Data unit

Claims

A storage device that divides a data group into multiple blocks and stores it.
Storage that stores data by dividing it into the blocks according to the feature amount,
A calculation unit that calculates the features of the input data,
A storage unit that temporarily holds the data in a cluster according to the feature amount, and a storage unit.
A storage device including a management unit that writes data held by the cluster to the corresponding block at a predetermined timing for each cluster.
The storage device according to claim 1.
The management unit is a storage device that writes data held by the cluster to the corresponding block when a certain period of time elapses, the size of the cluster exceeds a predetermined value, or data is requested.
The storage device according to claim 1 or 2.
In the storage unit, a cluster corresponding to the feature amount is newly added.
The management unit is a storage device that writes data held by the cluster to a new block corresponding to the newly added cluster.
It is a saving method that divides a data group into multiple blocks and saves it.
The storage divides the data into the blocks according to the feature amount and stores it.
The computer
Calculate the features of the input data and
The data is temporarily held in a cluster according to the feature amount,
A storage method in which data held by the cluster is written to the corresponding block for each cluster at a predetermined timing.
A program that operates a computer as each part of the storage device according to any one of claims 1 to 3.