WO2019086016A1

WO2019086016A1 - Data storage method and device

Info

Publication number: WO2019086016A1
Application number: PCT/CN2018/113837
Authority: WO
Inventors: 赵旺; 陈飘; 张鹏
Original assignee: 华为技术有限公司
Priority date: 2017-11-03
Filing date: 2018-11-02
Publication date: 2019-05-09
Also published as: CN109753225A; CN111666043A; CN109753225B

Abstract

A data storage method and device, relating to the technical field of computers, and solving the problem that data is sent twice across nodes, resulting in an I/O bandwidth of a system being low. A particular solution is: a receiving node receiving a first writing request comprising data to be written and a logical address of the data to be written, storing, in its own buffer, the data to be written, determining a mirror node according to the logical address of the data to be written, and recording a first correlation, wherein the first correlation is a mapping relation between the logical address of the data to be written and the mirror node; and the receiving node sending the data to be written and the logical address of the data to be written to the mirror node, selecting N target data blocks from its own buffer according to the first correlation, with mirror nodes of the N target data blocks being different from one another, and respectively sending a notification message to N mirror nodes corresponding to the N target data blocks.

Description

Data storage method and device

Technical field

The embodiments of the present invention relate to the field of computer technologies, and in particular, to a data storage method and device.

Background technique

A distributed array of independent disk (RAID) system generally consists of multiple storage nodes interconnected by a network. In the case of storing data to a distributed RAID system, in order to ensure data reliability, redundant algorithms such as erasure code (EC) algorithm can be used to implement redundant storage of data, and in order to improve the system Input/output (I/O) performance enables data caching (cache) mechanisms in distributed RAID systems, allowing storage nodes to quickly read and write data from the cache.

In the prior art, the local storage node may send the write request to other storage nodes in the distributed RAID system after receiving a write request, so that the data in the write request can be separately stored to include the local storage node. At least two storage nodes are cached to ensure the reliability of the data, and then the local storage node can reply to the write request. After the local storage node receives multiple write requests, it may determine N data blocks from the data in the cache that has not been written to the hard disk, and calculate M check data blocks according to the EC algorithm, and then respectively go to N+M. The storage nodes send N data blocks and M check data blocks, so that each of the N+M storage nodes can store the data in the received data block into its own hard disk.

At least the following technical problems exist in the prior art: since the local storage node sends data to the node when sending the write request to other storage nodes in the process of storing the data to the hard disk, the local storage node sends N data to the N storage nodes. Blocks also need to send data across nodes, so data is sent twice across nodes, resulting in lower I/O bandwidth.

Summary of the invention

The embodiment of the present application provides a data storage method and device, which solves the problem that the I/O bandwidth of the system is low because the data is sent twice across the node.

To achieve the above objectives, the present application adopts the following technical solutions:

In a first aspect, the present application provides a data storage method, which is applied to a distributed RAID system, where the distributed RAID system includes K storage nodes, and K is an integer greater than 0. The method may include: receiving, receiving, including, to be written a first write request of the data and the logical address of the data to be written, and storing the data to be written in its own cache, and determining a mirror node according to the logical address of the data to be written, and recording the first correspondence, the first A correspondence relationship is a mapping relationship between the logical address of the data to be written and the mirror node, and the receiving node sends the logical address to be written and the data to be written to the mirror node, and from the cache according to the first correspondence. Selecting N target data blocks, the mirror nodes of the N target data blocks are different from each other, so that the receiving node sends a notification message to the N mirror nodes corresponding to the N target data blocks, and the notification message is included in each notification message. The logical address of the target data block corresponding to the mirrored node to be used to indicate that the mirror node caches itself and the logic in the notification message. Access to the corresponding data block stored in the hard disk itself. The receiving node is any one of K storage nodes, and N is an integer greater than 0 and less than K.

In the data storage method provided by the embodiment of the present application, the receiving node determines the mirroring node according to the logical address of the data to be written, and sends the logical address to be written and the data to be written to the mirroring node, so that the N target data blocks respectively Stored in the N mirror nodes, the receiving node can directly send a notification message including the logical address of the target data block to the N mirror nodes, to instruct the mirror node to write the data block corresponding to the logical address to the hard disk, so that Compared with the prior art, the storage node needs to send data to be written twice across the node, and the receiving node only needs to send data once across the node when sending the data to be written to the mirror node, and the logic included in the notification message The address has a smaller amount of data and consumes less bandwidth, thus improving system I/O bandwidth compared to the prior art.

With reference to the first aspect, in a possible implementation, the logical address of the data to be written includes at least one sub-logical address, and the receiving node determines the mirroring node according to the logical address of the data to be written, which may specifically include: adopting the receiving node Formula: X=Int (sub-logical address/length of data block), each sub-logical address in the logical address of the data to be written is rounded after dividing by the length of the data block to obtain a correspondence corresponding to each sub-logical address The integer X, and the receiving node performs a hash calculation on each integer X by using a pre-configured hash algorithm, and then the result of each hash calculation is used, and the number of the mirror node is obtained according to the result of the redundancy. The length of the data block is the number of sub-logical addresses included in each data block.

With reference to the first aspect and the foregoing possible implementation manners, in another possible implementation manner, when the receiving node performs hash calculation on each integer X by using a pre-configured hash algorithm, if the preset number of mirroring nodes is One, the integer X is hashed using a pre-configured hash algorithm.

With reference to the first aspect and the foregoing possible implementation manner, in another possible implementation manner, when the receiving node performs a hash calculation on each integer X by using a pre-configured hash algorithm, if the preset number of mirroring nodes is greater than One, the integer X is calculated using a different type of hash algorithm pre-configured.

With reference to the first aspect and the foregoing possible implementation manner, in another possible implementation manner, when the receiving node performs a hash calculation on each integer X by using a pre-configured hash algorithm, if the preset number of mirroring nodes is greater than One is to calculate the integer X by using a pre-configured hash algorithm, and then obtain a preset number of other results according to the calculation result.

With reference to the first aspect and the foregoing possible implementation manner, in another possible implementation manner, the logical address of the data to be written includes at least one sub-logical address, and the first correspondence relationship records the mirror node corresponding to each sub-logical address. The receiving node selects N target data blocks in the cache according to the first correspondence, and specifically includes: the receiving node determines, in the first correspondence, a child logical address having the same mirror node, and the child corresponding to the N different mirror nodes Among the logical addresses, the sub-logical addresses constituting one data block are selected, and the data corresponding to the selected sub-logical addresses constitutes N target data blocks.

With reference to the first aspect and the foregoing possible implementation manners, in another possible implementation manner, when the receiving node selects a sub-logical logical address constituting one data block from the sub-logical logical addresses corresponding to the N different mirroring nodes, determining the composition A sub-logical address missing in a data block, determining whether the missing sub-logical address is recorded in the second correspondence, the second correspondence being the logical address of the data written to the hard disk of the mirror node and the hard disk written to the mirror node Correspondence of physical addresses. If not, set the data corresponding to the missing sub-logical address to 0 to form the target data block, and record the missing sub-logical address to the first correspondence; if yes, according to the second correspondence, from the mirror node Obtaining the data corresponding to the missing sub-logical address constitutes the target data block, and adding the missing sub-logical address to the first correspondence.

With reference to the first aspect and the foregoing possible implementation manner, in another possible implementation manner, the method may further include: the receiving node calculates M check data blocks according to the N target data blocks, and sends the M check data blocks. Storage node storage except for N mirror nodes.

With reference to the first aspect and the foregoing possible implementation manner, in another possible implementation manner, after the receiving node sends the M check data blocks to the storage node storage other than the N mirror nodes, the method may further include: The receiving node deletes the correspondence between the sub-logical address of the N target data blocks and the mirror node from the first correspondence. And the receiving node deletes the N target data blocks from the cache, and sends an indication message to the N mirror nodes corresponding to the N target data blocks, where each indication message includes a mirror node corresponding to the indication message The logical address of the target data block, used to instruct the mirror node to delete the data block corresponding to the logical address in the indication message from the cache.

In a second aspect, the present application provides a data storage method, which is applied to a distributed RAID system, where the distributed RAID system includes K storage nodes, and K is an integer greater than 0. The method may include: the mirror node receives at least one receiving node. The first write request sent includes the logical address of the mirrored data and the mirrored data, and the mirrored data is written into its own cache, and the logical relationship between the logical address of the mirrored data and the receiving node is recorded. And the mirror node selects N target data blocks in its own cache according to the correspondence between the mirror data and the receiving node, and the receiving nodes of the N target data blocks are different from each other, and respectively receive N corresponding to the N target data blocks. The node sends a notification message, where each notification message includes a logical address of the target data block corresponding to the receiving node to which the notification message is sent, to instruct the receiving node to write the data block corresponding to the logical address in the cache to the hard disk. The receiving node is a node that receives a second write request from the host, and the second write request includes a logical address to be written and a data to be written, and the mirror node is determined by the receiving node according to the logical address of the data to be written. The image data is the data written to the mirror node determined by the receiving node, and N is an integer greater than 0 and less than K.

In the data storage method provided by the embodiment of the present application, the mirroring node writes the mirrored data in the received first write request to the cache, and records the correspondence between the logical address of the mirrored data and the receiving node, so that the N target data blocks respectively Stored in the N receiving nodes, the mirroring node can directly send a notification message including the logical address of the target data block to the N receiving nodes, to instruct the receiving node to write the data block corresponding to the logical address to the hard disk, so that Compared with the prior art, the storage node needs to send data twice to be written across the node, and only sends data once across the node, and the occupied bandwidth is small because the data amount of the logical address included in the notification message is small. Therefore, the system I/O bandwidth is improved compared with the prior art.

In a third aspect, the present application provides a receiving node, which may include a module capable of implementing the method in the above first aspect and its various embodiments.

In a fourth aspect, the application provides a mirroring node, which may include a module capable of implementing the method in the second aspect above.

In a fifth aspect, the application provides a storage node, where the storage node includes: at least one processor, a memory, a communication interface, and a communication bus. At least one processor is coupled to the memory and the communication interface via a communication bus, the memory is configured to store the computer execution instructions, and when the storage node is in operation, the processor executes the memory storage computer execution instructions to cause the storage node to perform the first aspect or the first A data storage method of any of the possible implementations of aspects, or a data storage method as in the second aspect.

In a sixth aspect, the present application provides a computer storage medium having stored thereon computer-executable instructions for implementing any of the possible implementations of the first aspect or the first aspect when the computer-executed instructions are executed by the processor A data storage method, or a data storage method as in the second aspect.

In a seventh aspect, the present application also provides a computer program product, which when executed on a computer, causes the computer to perform the method of the first aspect or the second aspect described above.

In an eighth aspect, the present application also provides a communication chip in which computer-executed instructions are stored, and when executed on a computer, cause the computer to perform the method of the first aspect or the second aspect described above.

It is to be understood that any of the devices or computer storage media or computer program products provided above are used to perform the corresponding methods provided above, and therefore, the beneficial effects that can be achieved can be referred to the beneficial effects in the corresponding methods. , will not repeat them here.

DRAWINGS

FIG. 1 is a schematic structural diagram of an embodiment of the present application;

2 is a schematic structural diagram of a storage node according to an embodiment of the present application;

FIG. 3 is a flowchart of a data storage method according to an embodiment of the present application;

FIG. 4 is a flowchart of another data storage method according to an embodiment of the present application;

FIG. 5 is a schematic structural diagram of another storage node according to an embodiment of the present disclosure;

FIG. 6 is a schematic structural diagram of another storage node according to an embodiment of the present disclosure;

FIG. 7 is a schematic structural diagram of another storage node according to an embodiment of the present disclosure;

FIG. 8 is a schematic structural diagram of another storage node according to an embodiment of the present application.

Detailed ways

FIG. 1 is a schematic structural diagram of an embodiment of the present application. As shown in FIG. 1 , the architecture may include: a host 11 and a distributed RAID system 12 .

The host 11 is configured to send a first write request to any one of the distributed RAID systems 12 when the host 11 needs to store data to the distributed RAID system 12, and also receive a response message returned by the storage node. The response message is used to notify the host 11 that the data to be stored in the first write request has been stored in the distributed RAID system 12.

The distributed RAID system 12 is composed of K storage nodes through network interconnection, and is used to provide a large amount of storage space, K is an integer greater than 1, and each storage node in the distributed RAID system 12 can be a server in a specific implementation.

It should be noted that, in the embodiment of the present application, in order to facilitate distinguishing different storage nodes in the distributed RAID system 12, different numbers may be used to represent different storage nodes.

FIG. 2 is a schematic diagram of a composition of a storage node according to an embodiment of the present disclosure. As shown in FIG. 2, the storage node may include: at least one processor 21, a memory 22, a communication interface 23, and a communication bus 24.

The following describes the components of the storage node in conjunction with Figure 2:

The processor 21 is a control center of the storage node, and may be a processor or a collective name of a plurality of processing elements. For example, the processor 21 is a central processing unit (CPU), may be an application specific integrated circuit (ASIC), or one or more integrated circuits configured to implement the embodiments of the present application. For example, one or more digital signal processors (DSPs), or one or more field programmable gate arrays (FPGAs).

In a particular implementation, as an embodiment, the storage node may include one or more CPUs, such as CPU0 and CPU1 shown in FIG. 2. Moreover, as an embodiment, the storage node may include a plurality of processors, such as processor 21 and processor 25 shown in FIG. Each of these processors can be a single core processor (CPU) or a multi-core processor (multi-CPU). A processor herein may refer to one or more devices, circuits, and/or processing cores for processing data, such as computer program instructions.

In a particular implementation, processor 21 may perform various functions of the storage node by running or executing a software program stored in memory 22, as well as invoking data stored in memory 22. For example, the processor 21 can execute the computer program code stored in the memory 22 to execute the data storage method provided by the present application, and save the data to be stored in the write request to the hard disk of the distributed RAID system.

The memory 22 can be a read-only memory (ROM) or other type of static storage device that can store static information and instructions, a random access memory (RAM) or other type that can store information and instructions. The dynamic storage device can also be an electrically erasable programmable read-only memory (EEPROM), a compact disc read-only memory (CD-ROM) or other optical disc storage, and a disc storage device. (including compact discs, laser discs, optical discs, digital versatile discs, Blu-ray discs, etc.), magnetic disk storage media or other magnetic storage devices, or can be used to carry or store desired program code in the form of instructions or data structures and can be Any other media accessed, but not limited to this. Memory 22 may be present independently and coupled to processor 21 via communication bus 24. The memory 22 can also be integrated with the processor 21.

In a particular implementation, the memory 22 is used to store data in the present application and to execute the software program of the present application. For example, the memory 22 can be used to store computer program code corresponding to the data storage method provided by the embodiment of the present application. In the embodiment of the present application, the memory 22 may include a cache and a hard disk for storing data to be stored in the write request.

The communication interface 23 uses devices such as any transceiver for communicating with other devices or communication networks, such as a host, a radio access network (RAN), a wireless local area networks (WLAN), and the like. The communication interface 23 may include a receiving unit that implements a receiving function, and a transmitting unit that implements a transmitting function.

The communication bus 24 may be an industry standard architecture (ISA) bus, a peripheral component interconnect (PCI) bus, or an extended industry standard architecture (EISA) bus. The bus can be divided into an address bus, a data bus, a control bus, and the like. For ease of representation, only one thick line is shown in Figure 2, but it does not mean that there is only one bus or one type of bus.

In the prior art, when the local storage node sends a write request to other storage nodes, it is required to send data across nodes, and when sending N data blocks to N storage nodes respectively, it is also required to send data across nodes, so that data is sent across nodes. Twice, resulting in a lower I/O bandwidth of the system. Therefore, in order to solve the problem that the system has a low I/O bandwidth, the data storage method shown in FIG. 3 may be used to store data. As shown in FIG. 3, the method may include:

It should be noted that, in the embodiment of the present application, the receiving node is taken as the first storage node as an example. The first storage node is a storage node that receives any first write request sent by the host in any one of the distributed RAID systems.

301. The first storage node receives a first write request sent by the host.

The first write request may include a logical address to be written and a data to be written, and the logical address of the data to be written includes at least one sub-logical address. In a specific implementation, the logical address of the data to be written generally includes a first address and a data length. For example, if the first address is 0 and the data length is 3, the logical address of the data to be written includes 3 sub-logical addresses. , that is, logical address 0, logical address 1, and logical address 2.

302. The first storage node stores the data to be written in the first write request in its own cache.

303. The first storage node determines the mirror node according to the logical address of the data to be written in the first write request.

The length of the data block is the number of sub-logical addresses included in each data block. After the first storage node stores the data to be written in its own cache, the first storage node may adopt the formula: X=Int (the length of the sub-logical address/data block), each of the logical addresses of the data to be written. The sub-logical address is rounded after dividing by the length of the data block to obtain an integer X corresponding to each sub-logical address. At this time, the first storage node may perform a hash calculation on each integer X by using a pre-configured hash algorithm, and then perform the result of each hash calculation after dividing by the number K of storage nodes in the distributed RAID system. Take the remainder and get the number of the mirrored node based on the result of the remainder.

When the first storage node performs hash calculation on each integer X by using a pre-configured hash algorithm, if the preset number of mirror nodes is only one, the first storage node may adopt a pre-configured hash algorithm. Calculate the integer X and then take the remainder to get the number of the mirror node. If the preset number of the mirrored nodes is greater than one, in one implementation manner, the first storage node may calculate the integer X by using a different type of hash algorithm configured in advance, and the type of the hash algorithm and the mirror node The number is the same. After the hash result is used, the number of different mirror nodes is obtained. In another implementation manner, the first storage node may perform hash calculation on the integer X first, and after obtaining the remainder, obtain the number of one mirror node, and then determine the number of the other mirror node according to the determined result of the number of the mirror node. For example, if the preset number of mirror nodes is three, the first storage node may first perform hash calculation on the integer X, and take the number of the mirror node obtained after the remainder, and then the number result of the mirror node with the determination The two mirror node number values adjacent to each other are used as the other two mirror nodes, or two hash calculations are performed on the calculation result to obtain the numbers of the other two mirror nodes.

It should be noted that, in the embodiment of the present application, the length S of the data block may be pre-configured in the first storage node, so that starting from the sub-logical address 0, each consecutive S sub-logical addresses is a logical address of one data block. . In this way, the first storage node performs a rounding operation on each sub-logical address by adopting the length S of the data block, so that the integer X obtained by rounding each sub-logical address in one data block is the same, thereby making it according to a data block. The mirror nodes calculated by each sub-logical address are the same, that is, one data block can exist in one mirror node.

Moreover, in the embodiment of the present application, the preset number of mirror nodes is determined by the reliability index of the distributed RAID system. Exemplarily, assuming that the reliability index of the distributed RAID system is that in the case where the caches of the two storage nodes respectively fail, the data in the caches of the two storage nodes can still be recovered, and the data in each storage node is It needs to be stored in the cache of at least three storage nodes respectively, that is, the preset number of mirror nodes in the first storage node is two or more.

Exemplarily, the sub-logical addresses included in the logical address of the data to be written are: logical address 2, logical address 3, logical address 4, and the length S of the pre-configured data block in the first storage node is 4, and There are two hash algorithms configured, the preset number of mirror nodes is two, and the number K of storage nodes in the distributed RAID system is 16. Then, the first storage node can perform rounding calculation on the logical addresses 2, 3, and 4 respectively, and obtain corresponding Xs of 0, 0, and 1, respectively. In this way, the first storage node may perform hash calculation on the integer 0 by using two hash algorithms respectively, and then perform the remainder after dividing the two hash calculation results by 16 to obtain the number of the mirror node, which is assumed to be 5 respectively. And 6, and use two hash algorithms to calculate the integer 1 respectively, and then separately calculate the results of the two hash calculations after dividing by 16, to obtain the number of the mirror node, which is assumed to be 6 and 7. At this time, the first storage node can determine that the mirror nodes corresponding to the logical address 2 are the storage nodes numbered 5 and 6, and the mirror nodes corresponding to the logical address 3 are the storage nodes numbered 5 and 6, and the logical address 4 The corresponding mirror nodes are storage nodes numbered 6 and 7.

304. The first storage node records the first correspondence.

After the first storage node determines the mirroring node according to the logical address of the data to be written, the first storage node may save the first correspondence, where the first correspondence is a mapping relationship between each sub-logical address and the number of the mirror node. .

Exemplarily, according to the example in step 303, assuming that the number of the first storage node is 1, the first storage node 1 can save the first correspondence as shown in Table 1.

Table 1

逻辑地址Logical address	接收节点的编号Receive node number	镜像节点的编号Mirror node number
2、32, 3	11	5、65,6
44	11	6、76,7

305. The first storage node sends the logical address to be written and the data to be written to the mirror node.

After the first storage node records the first correspondence, the first storage node may send the logical address of the data to be written and the data to be written to the mirror node. Specifically, the first storage node may send a second write request to each mirror node, where the second write request includes a logical address of the mirrored data and the mirrored data, where the logical address of the mirrored data is in the first correspondence relationship and the The logical address corresponding to the mirror node. The mirror node can save the mirror data contained in the received second write request in its own cache. And the mirroring node may send a response message to the first storage node, to notify the first storage node that the mirror data included in the second write request has been stored in the cache of the mirror node.

Exemplarily, according to the example in step 304, it is assumed that the data to be written included in the first write request is data A, data B, and data C, wherein data A corresponds to logical address 2, and data B corresponds to logical address 3. The data C corresponds to the logical address 4. Then, in the second write request sent by the first storage node 1 to the storage node 5, the logical address of the mirrored data includes: a logical address 2 corresponding to the mirror node 5, a logical address 3, and mirror data. Includes: Data A and Data B. In the second write request sent by the first storage node 1 to the storage node 6, the logical address of the mirrored data includes: a logical address 2 corresponding to the mirror node 6, a logical address 3, and a logical address 4. The mirrored data includes: data A, data B and data C. In the second write request sent by the first storage node 1 to the storage node 7, the logical address of the mirrored data includes: a logical address 4 corresponding to the mirror node 7, and the mirrored data includes the data C.

306. The first storage node sends a response message to the host.

After the first storage node receives the response message sent by each mirror node, the first storage node may send a response message to the host to notify the host that the data to be written in the first write request is stored in the distributed RAID system. in.

It should be noted that, in the embodiment of the present application, the host sends multiple write requests to the first storage node. At this time, the first storage node may process the received multiple write requests in parallel according to the processing capability, and the first storage Steps 301 - 306 can be performed when the node processes each write request. After the first storage node replies to the response message to the host, the data can be persisted in the background. Specifically, the following steps 307-308 can be performed:

307. The first storage node selects N target data blocks from its own cache according to the first correspondence, and the mirror nodes of the N target data blocks are different from each other.

The target data block includes data corresponding to each consecutive S logical addresses. The first storage node may first determine a sub-logical address having the same mirror node in the first correspondence, the same mirror node may be one or more, and if there are multiple mirror nodes, the first storage A node can first select a mirror node from multiple mirror nodes. In this way, the first storage node can select N mutually different mirror nodes, and select sub-logical addresses constituting one data block from the sub-logical addresses corresponding to the N different mirror nodes, and the selected sub-logical addresses correspond to The data is N target data blocks.

When the first storage node selects a sub-logical address constituting a data block, it may first determine a sub-logical address that is missing when the data block is formed, and then determine whether the missing logical address is recorded in the second correspondence, The second correspondence is a correspondence between a logical address of data written to the hard disk of the mirror node and a physical address of the hard disk written to the mirror node. If the missing sub-logical address does not exist in the second correspondence, it indicates that the first storage node does not write data corresponding to the missing sub-logical address to the hard disk, and the first storage node may set the missing sub-logic The data corresponding to the address is 0 to constitute a target data block, and the missing sub-logical address is recorded into the first correspondence. If the missing sub-logical address exists in the second correspondence, it indicates that the first storage node has stored data corresponding to the missing logical address to the hard disk of the mirror node, and at this time, the first storage node may be according to the second Corresponding relationship, the data corresponding to the missing sub-logical address is obtained from the mirror node to form a target data block, and the missing sub-logical address is recorded into the first correspondence.

Exemplarily, assuming that the length S of the data block is 3 and N is 3, the first correspondence stored by the first storage node 1 is as shown in Table 2.

Table 2

逻辑地址Logical address	接收节点Receiving node	镜像节点Mirror node
33	11	22
0、1、20, 1, 2	11	44
44	11	22
9、109,10	11	33
1111	11	33
1515	11	77

Then the first storage node 1 can be determined in Table 2, respectively, with the sub-logical addresses of the mirror nodes 2, 3, 4, 7. Then, the first storage node 1 can select three mirror nodes that are different from each other, such as mirror nodes numbered 2, 3, and 4. At this time, the first storage node 1 can obtain the sub-logical addresses corresponding to the three mirror nodes. Selecting a sub-logical address constituting a data block, and when selecting a sub-logical address constituting a data block, the first storage node may determine the missing sub-logical address 5 and add the corresponding logical address 5 corresponding to the missing Data is used to form a target data block, and the missing logical address 5 is recorded into the first correspondence. At this time, the three target data blocks selected by the first storage node 1 are: logical address 3, logical address 4, and logic. Data corresponding to address 5, data corresponding to logical address 9, logical address 10, and logical address 11, data corresponding to logical address 0, logical address 1, and logical address 2.

308. The first storage node sends a notification message to the N mirror nodes corresponding to the N target data blocks.

The notification message includes a logical address of the target data block corresponding to the mirror node to which the notification message is sent, and the notification message is used to indicate that the mirror node caches the data block in the cache corresponding to the logical address in the notification message. Store to your own hard drive.

After the first storage node determines the N target data blocks, the first storage node may send a notification message to the N mirror nodes corresponding to the N target data blocks, respectively. After receiving the notification message, each mirroring node may first determine whether data corresponding to the logical address in the notification message is stored in its own cache. If the data corresponding to the logical address in the notification message is stored in the self cache, the mirror node may store the data block corresponding to the logical address in the notification message to its own hard disk. If the data corresponding to a logical address in the notification message is not stored in the cache, the mirror node may determine whether the logical address of the missing data is recorded in the logical address of the data written to the hard disk and the physical address written to the hard disk. In the mapping relationship, if it does not exist, the mirror node may set the data corresponding to the logical address of the missing data to 0 to constitute the target data block. If yes, the mirror node can acquire data corresponding to the logical address to form a target data block according to a mapping relationship between a logical address of data written to the hard disk and a physical address written to the hard disk. The first storage node then stores the data block corresponding to the logical address in the notification message to its own hard disk.

Exemplarily, according to the example in step 307, the first storage node 1 may respectively send a notification message to the storage node numbered 2, 3, 4, wherein the first storage node 1 sends a notification to the storage node numbered 2 The logical address included in the message is 3, 4, and 5. The logical address included in the notification message sent by the first storage node 1 to the storage node numbered 3 is 9, 10, and 11, and the first storage node 1 is numbered 4 The logical address included in the notification message sent by the storage node is 0, 1, and 2.

309. The first storage node calculates M check data blocks according to the N target data blocks.

In order to ensure data reliability when writing data to the hard disk, after the first storage node determines N target data blocks in step 307, the first storage node may calculate the data of the N target data blocks by using the EC algorithm. , obtain M check data blocks.

310. The first storage node sends the M check data blocks to a storage node storage other than the N mirror nodes.

After the first storage node obtains M check data blocks, the first storage node may select M storage nodes except N mirror nodes, and send M check data blocks to the M storage nodes respectively. Storage, so that each storage node stores the received check data block into its own hard disk, so that N target data blocks stored in the hard disk of the N storage nodes and M checksums stored in the hard disks of the M storage nodes are stored. The data blocks may form a stripe such that, in the event that data of any of the N target data blocks of the strip is corrupted or lost, the remaining uncorrupted NY target data in the strip may be passed. The data of the block and the data of the M check data blocks recover the data of the damaged Y target data blocks.

311. The first storage node deletes the correspondence between the sub-logical address of the N target data blocks and the mirror node from the first correspondence, and deletes N target data blocks from the cache, and respectively goes to the N target data blocks. The corresponding N mirror nodes send an indication message.

The data of the N target data blocks is stored in the hard disks of the N storage nodes of the distributed RAID system, and the data of the M check data blocks is stored in the hard disks of the M storage nodes, and the N target data is indicated. The block's data has been reliably stored in a distributed RAID system. At this time, the first storage node may delete the correspondence between the sub-logical address of the N target data blocks and the mirror node from the first correspondence. The first storage node may delete the N target data blocks from the cache, and send an indication message to the N mirror nodes corresponding to the N target data blocks, where the indication message includes the mirror node corresponding to the indication message. The logical address of the target data block is used to instruct the mirror node to delete the data block corresponding to the logical address in the indication message from its own cache.

It should be noted that, in the embodiment of the present application, the logical address included in the indication message sent by the first storage node to any one of the N mirror nodes corresponding to the N target data blocks, and the first step in step 308 The logical address included in the notification message sent by the storage node to the mirror node is the same.

Moreover, the first storage node rounds each sub-logical address by adopting the length of the data block, so that one data block can exist in one mirror node. The result of each hash calculation is divided by K, and the number of the mirror node is obtained according to the result of the redundancy, so that the calculated mirror node is a storage node in the distributed RAID system.

FIG. 4 is a flowchart of another data storage method according to an embodiment of the present application. As shown in FIG. 4, the method may include:

It should be noted that, in the embodiment of the present application, the mirror node is used as the second storage node as an example for description.

401. The second storage node receives a first write request sent by at least one receiving node.

The first write request includes a logical address of the mirrored data and the mirrored data, the receiving node is a node that receives the second write request from the host, and the second write request includes the logical address to be written and the data to be written, and the mirror node The receiving node determines, according to the logical address of the data to be written, the mirrored data is data determined by the receiving node to be written to the mirror node. And the logical address of the mirrored data includes at least one sub-logical address.

402. The second storage node writes the mirrored data into its own cache, and records the correspondence between the logical address of the mirrored data and the receiving node.

After the second storage node receives the first write request, the second storage node may save the mirrored data in the received first write request in the own cache, and record each sub-logical address in the logical address of the mirrored data. Correspondence with the receiving node.

403. The second storage node selects N target data blocks from its own cache according to the correspondence between the logical address of the mirrored data and the receiving node, and the receiving nodes of the N data blocks are different from each other.

The second storage node may first determine a sub-logical address having the same receiving node in the correspondence between the logical address of the mirrored data and the receiving node. Then, the first storage node may select N different receiving nodes, and select sub-logical addresses constituting one data block from the sub-logical addresses corresponding to the N different receiving nodes, and the selected sub-logical addresses correspond to The data is N target data blocks.

It should be noted that, in the embodiment of the present application, a specific description of selecting a sub-logical address constituting a data block for the second storage node may refer to step 307 in FIG. 3, where the first storage node selects a data block. The description of the sub-logical address is not described here.

Step 404-Step 407, in the embodiment of the present application, the related description of Step 404-Step 407 is similar to the related description of Step 308-Step 311 in FIG. 3 of another embodiment of the present application, and the related description of Step 404-Step 407 is performed. For details, refer to the descriptions of the steps 308 to 311 in FIG. 3, which are not described in detail in the embodiments of the present application.

The solution provided by the embodiment of the present application is mainly introduced from the perspective of a storage node. It can be understood that, in order to implement the above functions, the storage node includes corresponding hardware structures and/or software modules for performing various functions. Those skilled in the art will readily appreciate that the present invention can be implemented in a combination of hardware or hardware and computer software in combination with the algorithm steps of the various examples described in the embodiments disclosed herein. Whether a function is implemented in hardware or computer software to drive hardware depends on the specific application and design constraints of the solution. A person skilled in the art can use different methods for implementing the described functions for each particular application, but such implementation should not be considered to be beyond the scope of the present invention.

The embodiment of the present application may perform the division of the function module on the storage node according to the foregoing method example. For example, each function module may be divided according to each function, or two or more functions may be integrated into one processing module. The above integrated modules can be implemented in the form of hardware or in the form of software functional modules. It should be noted that the division of the modules in the embodiment of the present application is schematic, and is only a logical function division, and the actual implementation may have another division manner.

FIG. 5 is a schematic diagram showing a possible composition of the storage node involved in the foregoing and the embodiment. As shown in FIG. 5, the storage node may include: a receiving unit 51. The storage unit 52, the determining unit 53, the transmitting unit 54, and the selecting unit 55.

The receiving unit 51 is configured to support the storage node to perform step 301 in the data storage method shown in FIG. 3.

The storage unit 52 is configured to support the storage node to perform step 302 and step 304 in the data storage method shown in FIG. 3.

The determining unit 53 is configured to support the storage node to perform step 303 in the data storage method shown in FIG. 3.

The sending unit 54 is configured to support the storage node to send the N mirror nodes corresponding to the N target data blocks respectively, as described in step 305, step 306, step 308, step 310, and step 311 in the data storage method shown in FIG. Indicate the message.

The selecting unit 55 is configured to support the storage node to perform step 307 in the data storage method shown in FIG. 3.

In the embodiment of the present application, further, as shown in FIG. 6, the storage node may further include: a calculating unit 56 and a deleting unit 57.

The calculating unit 56 is configured to support the storage node to perform step 309 in the data storage method shown in FIG. 3.

a deleting unit 57, configured to support the storage node to perform the correspondence between the sub-logical address and the mirror node of the N target data blocks deleted from the first correspondence relationship as described in step 311 in the data storage method shown in FIG. N target data blocks are deleted in their own cache.

It should be noted that all the related content of the steps involved in the foregoing method embodiments may be referred to the functional descriptions of the corresponding functional modules, and details are not described herein again.

The storage node provided by the embodiment of the present application is used to execute the data storage method in FIG. 3 above, so that the same effect as the above data storage method can be achieved.

FIG. 7 is a schematic diagram showing a possible composition of the storage node involved in the foregoing and the embodiment. As shown in FIG. 7, the storage node may include: a receiving unit 61. The storage unit 62, the selection unit 63, and the transmission unit 64.

The receiving unit 61 is configured to support the storage node to perform step 401 in the data storage method shown in FIG. 4.

The storage unit 62 is configured to support the storage node to perform step 402 in the data storage method shown in FIG. 4.

The selecting unit 63 is configured to support the storage node to perform step 403 in the data storage method shown in FIG. 4.

The sending unit 64 is configured to support the storage node to send the indication message to the N receiving nodes corresponding to the N target data blocks respectively, as described in step 404, step 406, and step 407 in the data storage method shown in FIG. 4 .

The storage node provided in the embodiment of the present application is used to execute the data storage method in FIG. 4 above, so that the same effect as the above data storage method can be achieved.

In the case of employing an integrated unit, FIG. 8 shows another possible composition diagram of the storage node involved in the above embodiment. As shown in FIG. 8, the storage node includes a processing module 71 and a communication module 72.

The processing module 71 is configured to control and manage the action of the storage node. For example, the processing module 71 is configured to support the storage node to perform the deletion from the first correspondence relationship as described in step 303, step 307, step 309, and step 311 in FIG. 3 . Corresponding relationship between the sub-logical address of the N target data blocks and the mirror node, and deleting N target data blocks from its own cache, the logic of the slave mirror data described in step 403, step 405, and step 407 shown in FIG. The correspondence between the address and the receiving node deletes the correspondence between the sub-logical address of the N target data blocks and the mirror node, and deletes N target data blocks from its own cache, and/or other techniques for the techniques described herein process. Communication module 72 is used to support communication of storage nodes with other network entities, such as hosts, other storage nodes in a distributed RAID system. For example, the communication module 72 is configured to support the storage node to perform the sending to the N mirror nodes corresponding to the N target data blocks, respectively, as described in step 301, step 305, step 306, step 308, step 310, and step 311 in FIG. The indication message, the step 401, the step 404, the step 406, and the step 407 shown in FIG. 4 respectively send an indication message to the N receiving nodes corresponding to the N target data blocks. The storage node may further include a storage module 73 for storing program code and data of the storage node. For example, the storage module 73 is configured to support the storage node to perform step 302, step 304 in FIG. 3, and step 402 shown in FIG.

The processing module 71 can be the processor or controller in FIG. 2. It is possible to implement or carry out the various illustrative logical blocks, modules and circuits described in connection with the present disclosure. The processor can also be a combination of computing functions, for example, including one or more microprocessor combinations, a combination of a DSP and a microprocessor, and the like. The communication module 72 can be the communication interface or the like in FIG. The storage module 73 can be the memory in FIG.

Through the description of the above embodiments, those skilled in the art can clearly understand that for the convenience and brevity of the description, only the division of the above functional modules is illustrated. In practical applications, the above functions can be allocated according to needs. It is completed by different functional modules, that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above.

In the several embodiments provided by the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be another division manner, for example, multiple units or components may be used. The combination may be integrated into another device, or some features may be ignored or not performed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or unit, and may be in an electrical, mechanical or other form.

The units described as separate components may or may not be physically separated, and the components displayed as units may be one physical unit or multiple physical units, that is, may be located in one place, or may be distributed to multiple different places. . Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of the embodiment.

In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.

The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a readable storage medium. Based on such understanding, the technical solution of the embodiments of the present application may be embodied in the form of a software product in the form of a software product in essence or in the form of a contribution to the prior art, and the software product is stored in a storage medium. A number of instructions are included to cause a device (which may be a microcontroller, chip, etc.) or a processor to perform all or part of the steps of the methods described in various embodiments of the present invention. The foregoing storage medium includes various media that can store program codes, such as a USB flash drive, a mobile hard disk, a ROM, a RAM, a magnetic disk, or an optical disk.

The above is only the specific embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions within the technical scope of the present invention should be covered by the scope of the present invention. . Therefore, the scope of the invention should be determined by the scope of the appended claims.

Claims

A data storage method, which is characterized in that it is applied to a distributed independent disk redundant array RAID system, the distributed RAID system includes K storage nodes, and K is an integer greater than 1, the method includes:

Receiving, by the receiving node, a first write request, where the first write request includes a logical address to be written data and a data to be written, and the receiving node is any one of the K storage nodes;

The receiving node stores the to-be-written data in its own cache;

The receiving node determines a mirroring node according to the logical address of the data to be written, and records a first correspondence, where the first correspondence is a mapping relationship between a logical address to be written data and a mirror node;

Sending, by the receiving node, the logical address of the data to be written and the data to be written to the mirror node;

The receiving node selects N target data blocks in the cache according to the first correspondence, and the mirror nodes of the N target data blocks are different from each other, where N is an integer greater than 0 and less than K;

The receiving node sends a notification message to the N mirror nodes corresponding to the N target data blocks, where each notification message includes a logical address of the target data block corresponding to the mirror node to which the notification message is sent, and the notification message It is used to instruct the mirror node to store the data block in its cache corresponding to the logical address in the notification message to its own hard disk.
The method according to claim 1, wherein the logical address of the data to be written includes at least one sub-logical address, and the receiving node determines the mirroring node according to the logical address of the data to be written, including:

The receiving node adopts a formula: X=Int (sub logical address/length of the data block), and performs rounding operation after dividing each sub-logical address in the logical address of the data to be written by the length of the data block. Obtaining an integer X corresponding to each sub-logical address, the length of the data block being the number of sub-logical addresses included in each data block;

The receiving node performs a hash calculation on each integer X by using a pre-configured hash algorithm, and then performs a remainder after dividing the result of each hash calculation by K, and obtains the number of the mirror node according to the result of the redundancy. .
The method according to claim 1 or 2, wherein the logical address of the data to be written includes at least one sub-logical address, and the first correspondence records a mirror node corresponding to each sub-logical address;

The receiving node selects N target data blocks in the cache according to the first correspondence, and includes:

The receiving node determines, in the first correspondence, a sub-logical address having the same mirror node;

The receiving node selects a sub-logical logical address constituting one data block from the sub-logical logical addresses corresponding to the N different mirroring nodes, and the data corresponding to the selected sub-logical logical address constitutes the N target data blocks.
The method of any of claims 1-3, wherein the method further comprises:

The receiving node calculates M check data blocks according to the N target data blocks;

The receiving node sends the M check data blocks to a storage node storage other than the N mirror nodes.
The method according to claim 4, further comprising: after the receiving node sends the M check data blocks to a storage node other than the N mirror nodes, further comprising:

Determining, by the receiving node, a correspondence between a sub-logical address of the N target data blocks and a mirror node from the first correspondence relationship;

The receiving node deletes the N target data blocks from its own cache, and sends an indication message to the N mirror nodes corresponding to the N target data blocks respectively; each indication message includes the indication message sent to The logical address of the target data block corresponding to the mirror node, the indication message is used to instruct the mirror node to delete the data block corresponding to the logical address in the indication message from the cache.
A data storage method, which is characterized in that it is applied to a distributed independent disk redundant array RAID system, the distributed RAID system includes K storage nodes, and K is an integer greater than 1, the method includes:

The mirroring node receives a first write request sent by the at least one receiving node, where the first write request includes a logical address of the mirrored data and the mirrored data, and the receiving node is a node that receives the second write request from the host, where the The second write request includes a logical address to be written and a data to be written, the mirror node is determined by the receiving node according to the logical address of the data to be written, and the mirror data is a write determined by the receiving node. Data into the mirror node;

The mirroring node writes the mirrored data into its own cache, and records a corresponding relationship between the logical address of the mirrored data and the receiving node;

The mirroring node selects N target data blocks in its own cache according to the corresponding relationship between the mirrored data and the receiving node, and the receiving nodes of the N target data blocks are different from each other, where N is greater than 0 and An integer less than K;

The mirroring node sends a notification message to the N receiving nodes corresponding to the N target data blocks, where each notification message includes a logical address of the target data block corresponding to the receiving node to which the notification message is sent, to indicate the The receiving node writes the data block corresponding to the logical address in its own cache to the hard disk.
A receiving node, characterized in that it is applied to a distributed independent disk redundant array RAID system, the distributed RAID system includes K storage nodes, K is an integer greater than 1, and the receiving node is the K storage Any one of the nodes, the receiving node includes: a receiving unit, a storage unit, a determining unit, a sending unit, and a selecting unit;

The receiving unit is configured to receive a first write request, where the first write request includes a logical address to be written data and data to be written;

The storage unit is configured to store the to-be-written data in its own cache;

The determining unit is configured to determine a mirror node according to the logical address of the data to be written;

The storage unit is further configured to record a first correspondence, where the first correspondence is a mapping relationship between a logical address to be written data and a mirror node;

The sending unit is configured to send the logical address of the data to be written and the data to be written to the mirror node;

The selecting unit is further configured to select, in the cache, N target data blocks according to the first correspondence relationship recorded by the storage unit, where mirror nodes of the N target data blocks are different from each other, where N is greater than 0 and an integer less than K;

The sending unit is further configured to send a notification message to the N mirror nodes corresponding to the N target data blocks, where each notification message includes a logical address of a target data block corresponding to the mirror node to which the notification message is sent. The notification message is used to instruct the mirroring node to store the data block in the cache that corresponds to the logical address in the notification message to its own hard disk.
The receiving node according to claim 7, wherein the logical address of the data to be written includes at least one sub-logical address, and the determining unit is specifically configured to:

Using the formula: X=Int (sub-logical address/length of the data block), each sub-logical address in the logical address of the data to be written is rounded and divided by the length of the data block to obtain a An integer X corresponding to each sub-logical address, the length of the data block being the number of sub-logical addresses included in each data block;

Each integer X is hashed by a pre-configured hash algorithm, and the result of each hash calculation is used, and the number of the mirror node is obtained according to the result of the remainder.
The receiving node according to claim 7 or 8, wherein the logical address of the data to be written includes at least one sub-logical address, and the first correspondence records a mirror node corresponding to each sub-logical address;

The selection unit is specifically configured to:

Determining a sub-logical address having the same mirror node in the first correspondence relationship;

The sub-logical addresses constituting one data block are selected from the sub-logical addresses corresponding to the N different mirror nodes, and the data corresponding to the selected sub-logical addresses constitutes N target data blocks.
The receiving node according to any one of claims 7-9, wherein the receiving node further comprises: a calculating unit;

The calculating unit is configured to calculate M check data blocks according to the N target data blocks;

The sending unit is further configured to send the M check data blocks to a storage node storage other than the N mirror nodes.
The receiving node according to claim 10, wherein the receiving unit further comprises: a deleting unit;

The deleting unit is configured to delete a correspondence between a sub-logical address of the N target data blocks and a mirror node from the first correspondence, and delete the N target data blocks from a cache thereof;

The sending unit is further configured to send an indication message to the N mirror nodes corresponding to the N target data blocks, where each indication message includes a logical address of the target data block corresponding to the mirror node to which the indication message is sent. The indication message is used to instruct the mirroring node to delete the data block corresponding to the logical address in the indication message from the cache.
A mirror node, which is applied to a distributed independent disk redundant array RAID system, the distributed RAID system includes K storage nodes, K is an integer greater than 1, and the mirror node includes: a receiving unit, and a storage Unit, selection unit, and transmission unit;

The receiving unit is configured to receive a first write request sent by the at least one receiving node, where the first write request includes a logical address of the mirrored data and the mirrored data, and the receiving node receives the second write request from the host. a node, the second write request includes a logical address to be written data and a data to be written, the mirror node is determined by the receiving node according to the logical address of the data to be written, and the mirrored data is the Receiving data determined by the node to be written to the mirror node;

The storage unit is configured to write the mirrored data into its own cache, and record a correspondence between a logical address of the mirrored data and the receiving node;

The selecting unit is configured to select N target data blocks in the cache according to the corresponding relationship between the mirror data and the receiving node, where the receiving nodes of the N target data blocks are different from each other, where N is An integer greater than 0 and less than K;

The sending unit is configured to send a notification message to the N receiving nodes corresponding to the N target data blocks, where each notification message includes a logical address of a target data block corresponding to the receiving node to which the notification message is sent, Instructing the receiving node to write a data block corresponding to the logical address in its own cache to the hard disk.