WO2017162177A1

WO2017162177A1 - Redundant storage system, redundant storage method and redundant storage device

Info

Publication number: WO2017162177A1
Application number: PCT/CN2017/077754
Authority: WO
Inventors: 王东临; 金友兵; 莫仲华
Original assignee: 北京书生国际信息技术有限公司; 书生云公司
Priority date: 2016-03-24
Filing date: 2017-03-22
Publication date: 2017-09-28
Also published as: CN105843557A; CN105843557B

Abstract

The embodiment of the invention provides a redundant storage system, a redundant storage method and a redundant storage device and solves the problem that a system structure based on traditional redundant storage is low in disaster tolerance treatment efficiency. The redundant storage system comprising a storage network, at least two storage nodes and at least two storage devices, wherein the at least two storage nodes are connected to the storage network; the at least two storage devices are connected to the storage network; each storage device comprises at least one storage medium; each storage node accessesthe at least two storage devices via the storage network; at least one storage block of each storage device in the at least two storage devices accessed by the same storage node stores data in a redundant storage manner, and the storage block is a complete storage medium or one part of a storage medium.

Description

Redundant storage systems, redundant storage methods, and redundant storage devices

Technical field

The present invention relates to the field of data storage technologies, and in particular, to a redundant storage system, a redundant storage method, and a redundant storage device.

Background technique

As computer applications become larger and larger, the demand for storage space is increasing. Correspondingly, integrating the storage resources of multiple devices (such as storage media) into one storage pool to provide storage services has become the mainstream. In a conventional redundant storage system, the redundant storage system is usually composed of a plurality of distributed storage nodes connected by a TCP/IP network. FIG. 1 shows a schematic diagram of the architecture of a prior art redundant storage system. As shown in FIG. 1, in a conventional redundant storage system, each storage node S is connected to a TCP/IP network (through a core switch) through an access network switch. Each storage node is a separate physical server, and each server has its own storage medium. Each storage node is connected by a storage network such as an IP network to form a storage pool. On the other side of the core switch, each compute node C is also connected to the TCP/IP network (through the core network switch) through the access network switch to access the entire storage pool over the TCP/IP network.

In the conventional redundant storage system, the storage node is located on the storage medium side, and the storage medium is a built-in disk of the physical machine where the storage node is located, and the storage node is equivalent to a control machine, a storage node, and a local physical machine of all storage media in the local physical machine. All storage media within it constitute a storage device. Although the disk mounted on each storage node S can be used for redundancy management through redundant storage, when a storage node S fails, the disk mounted under the storage node can no longer be used. Being read and written, and restoring the data in the disk mounted by the failed storage node S will seriously affect the working efficiency of the entire redundant storage system.

Summary of the invention

In view of this, the embodiments of the present invention provide a redundant storage system, a redundant storage method, and a redundant storage device, which solve the problem of low efficiency of disaster recovery processing based on the structure of the traditional redundant storage system.

An embodiment of the invention provides a redundant storage system, including:

Storage network

At least two storage nodes connected to the storage network;

At least two storage devices connected to the storage network, each of the storage devices including at least one storage medium;

Each of the storage nodes accesses at least two storage devices through the storage network, and is redundantly stored between at least one storage block of each of the at least two storage devices accessed by the same storage node. The data is saved, wherein the storage block is a complete storage medium or is part of a storage medium.

An embodiment of the present invention further provides a redundant storage method, where the redundant storage system includes: a storage network; at least two storage nodes connected to the storage network; and at least two storage devices connected to the a storage network, each of the storage devices including at least one storage medium; wherein each of the storage nodes accesses at least two storage devices through the storage network; the method includes:

Saving data in a redundant storage manner between at least one of each of at least two storage devices accessed by the same storage node, wherein the storage block is a complete storage medium or a storage Part of the media.

An embodiment of the present invention further provides a redundant storage device, where the redundant storage system includes: a storage network; at least two storage nodes connected to the storage network; and at least two storage devices connected to the a storage network, each of the storage devices including at least one storage medium; wherein each of the storage nodes accesses at least two storage devices through the storage network; the redundant storage device includes:

A redundant storage module configured to be in at least two storage devices accessed by the same storage node The data is stored in a redundant storage manner between at least one of the storage blocks of each storage device, wherein the storage block is a complete storage medium or a part of a storage medium.

An embodiment of the present invention also provides a computer program product embodied in a computer readable storage medium having computer readable program code portions stored therein, the computer readable program code portion being Configured to perform the redundant storage method as described previously.

The present invention provides a redundant storage system, a redundant storage method, and a redundant storage device. The storage node and the storage device are independently connected to the storage network, and each storage node can access multiple storage devices through the storage network. And is redundantly stored between multiple storage devices accessed by the same storage node. In this way, even if a storage device fails, the data in the storage device can be quickly recovered through other working storage devices, which greatly improves the disaster recovery processing efficiency of the entire redundant storage system.

DRAWINGS

Figure 1 shows the architecture of a traditional storage system.

FIG. 2 is a schematic structural diagram of a storage system according to an embodiment of the invention.

FIG. 3 is a schematic structural diagram of a storage system according to another embodiment of the present invention.

FIG. 4 is a schematic structural diagram of a storage pool using redundant storage according to an embodiment of the present invention.

FIG. 5 is a schematic structural diagram of a storage pool using redundant storage according to another embodiment of the present invention.

detailed description

The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. It is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.

FIG. 2 is a schematic structural diagram of a storage system according to an embodiment of the invention. As shown 2 is shown. The storage system includes: a storage network; at least two storage nodes connected to the storage network; and at least two storage devices connected to the storage network, each of the storage devices including at least one storage medium. In the embodiment of the present invention, the storage node is a software module that provides a storage service, instead of a hardware server including a storage medium in a general sense. The storage nodes in the description of the subsequent embodiments also refer to the same concepts, and therefore will not be described again.

In an embodiment of the present invention, each storage node accesses at least two storage devices through a storage network, and is redundantly stored between at least one storage block of each of at least two storage devices accessed by the same storage node. The way to save data, where the storage block is a complete storage medium or part of a storage medium. It can be seen that since the data is stored in the storage block of different storage devices in a redundant storage manner, the storage system is a redundant storage system.

In a conventional storage system architecture, the storage node is located on the storage medium side, or strictly speaking, the storage medium is a built-in disk of the physical machine where the storage node is located. In the embodiment of the present invention, the physical machine where the storage node is located is independent of the storage device, and the storage device is more used as a channel connecting the storage medium and the storage network, and the storage node and the storage device are independently connected to the storage network, and each of the storage nodes and the storage device are independently connected to the storage network. The storage node can access multiple storage devices through the storage network, and is redundantly stored between multiple storage devices accessed by the same storage node, thereby implementing redundant storage across the storage device under the same storage node. In this way, even if a storage device fails, the data in the storage device can be quickly restored through other working storage devices, which greatly improves the disaster recovery processing efficiency of the entire storage system.

In this way, when dynamic balancing is required, physical data does not need to be migrated in different storage media, and only storage media managed by different storage nodes need to be balanced.

In an embodiment of the invention, the storage network is configured such that each storage node can access all storage media without the aid of other storage nodes. Thus, all storage media of the present invention can be shared by all storage nodes, and all storage media in the storage system actually constitute a global storage pool accessible by all storage nodes.

In another embodiment of the present invention, the storage node side further includes a compute node, and the calculation section The point and storage nodes are set up in a physical server that is connected to the storage device through the storage network. The aggregated storage system in which the computing node and the storage node are located in the same physical machine constructed by using the embodiment of the present invention can reduce the number of physical devices required, thereby reducing the cost. At the same time, the compute node can also access the storage resources it wishes to access locally. In addition, since the compute nodes and storage nodes are aggregated on the same physical server, the data exchange between the two can be as simple as shared memory, and the performance is particularly excellent.

In the storage system provided by the embodiment of the present invention, the length of the I/O data path between the computing node and the storage medium includes: (1) the storage medium to the storage node; and (2) the storage node to the computing node aggregated in the same physical server. (CPU bus path). In contrast, in the prior art storage system shown in FIG. 1, the I/O data path length between the compute node and the storage medium includes: (1) storage medium to storage node; (2) storage node to storage Network access network switch; (3) storage network access network switch to core network switch; (4) core network switch to computing network access network switch; and (5) computing network access network switch to computing node. Obviously, the total data path of the storage system of the embodiment of the present invention is only close to item (1) of the conventional storage system. That is, the storage system provided by the embodiment of the present invention can greatly improve the I/O channel performance of the storage system by extremely compressing the I/O data path length, and the actual running effect is very close to the I/O of the local hard disk. O channel.

In an embodiment of the invention, the storage node may be a virtual machine of the physical server, a container, a module running directly on the physical operating system of the server, or a combination thereof (for example, a firmware of a part of the storage node on the expansion card, The other part is a module in the physical operating system, and some are in the virtual machine); the computing node can also be a virtual machine of the same physical machine server, a container, and a module running directly on the physical operating system of the server. Or the combination above. In one embodiment, each storage node may correspond to one or more compute nodes.

Specifically, one physical server can be divided into multiple virtual machines, one of which is used as a storage node, and the other virtual machine is used as a computing node; or a module on a physical OS is utilized. Do storage nodes for better performance.

In an embodiment of the invention, the virtualization technology forming the virtual machine may be KVM or Zen or VMware or Hyper-V virtualization technology, and the container technology forming the container may be Docker or Rocket or Odin or Chef or LXC or Vagrant. Or Ansible or Zone or Jail or Hyper-V container technology.

In an embodiment of the present invention, each storage node is only responsible for managing a fixed storage medium at the same time, and one storage medium is not simultaneously written by multiple storage nodes to avoid data conflict, thereby enabling each storage node to be able to implement each storage node. The storage medium managed by it is accessed without resorting to other storage nodes, and the integrity of the data stored in the storage system can be guaranteed.

In an embodiment of the present invention, all the storage media in the system may be divided according to storage logic. Specifically, the storage pool of the entire system may be divided into a logical storage hierarchy structure such as a storage area, a storage group, and a storage block. Among them, the storage block is the smallest storage unit. In an embodiment of the invention, the storage pool may be divided into at least two storage areas.

In an embodiment of the invention, each storage area may be divided into at least one storage group. In a preferred embodiment, each storage area is divided into at least two storage groups.

In some embodiments, the storage area and the storage group can be merged such that one level can be omitted in the storage hierarchy.

In an embodiment of the invention, each storage area (or storage group) may be composed of at least one storage block, wherein the storage block may be a complete storage medium or a part of a storage medium. In order to construct redundant storage inside the storage area, each storage area (or storage group) may be composed of at least two storage blocks, and when any one of the storage blocks fails, the complete storage block may be calculated from the remaining storage blocks in the group. The data is stored. The redundant storage mode can be multi-copy mode, independent redundant disk array (RAID) mode, and erasure code mode. In an embodiment of the invention, the redundant storage mode can be established by the ZFS file system. In an embodiment of the present invention, in order to combat the hardware failure of the storage device/storage medium, the plurality of storage blocks included in each storage area (or storage group) are not located in the same storage medium, or even in the same Storage devices. In an embodiment of the invention, any two storage blocks included in each storage area (or storage group) are not located in the same storage medium/storage device. In another embodiment of the present invention, the number of storage blocks located in the same storage medium/storage device in the same storage area (or storage group) is preferably less than or equal to the redundancy of the redundant storage. For example, when the RAID 5 mode of storage redundancy is adopted, the redundancy of redundant storage is 1, and the number of storage blocks of the same storage group of the same storage device is at most 1; for RAID 6, the redundancy of redundant storage With a redundancy of 2, the number of memory blocks in the same storage group on the same storage device is up to 2.

Since the storage blocks in the storage group are actually from different storage devices, the fault tolerance level of the storage pool is related to the fault tolerance level of the redundant storage in the storage group. Therefore, in an embodiment of the invention, the storage system further includes a fault tolerance level. The adjustment module is configured to adjust the storage pool by adjusting the number of storage blocks in the storage group that allow simultaneous failures and/or selecting the number of storage blocks for aggregation into the same storage group from each of the at least two storage devices of the storage pool Fault tolerance level. Specifically, if the number of storage blocks in the storage group that allow simultaneous failures is represented by D, the storage for aggregation into the same storage group is selected from each of the at least two storage devices of the storage pool by N. The number of blocks, in M, represents the number of storage devices in the storage pool that are allowed to fail simultaneously. Then, the fault tolerance level of the storage pool determined by the fault tolerance level adjustment module is M=D/N, and D/N only takes integer bits. In this way, different fault-tolerant storage systems can be implemented according to actual needs.

In an embodiment of the invention, each storage node can only read and write its own managed storage area. Since the read operations of the same storage block by multiple storage nodes do not conflict with each other, and multiple storage nodes write one storage block at the same time, conflicts are easily generated. Therefore, in another embodiment, each storage node can only Write the storage area managed by yourself, but you can read the storage area managed by yourself and the storage area managed by other storage nodes, that is, the write operation is local, but the read operation can be global.

In one embodiment, the storage system may further include a storage control node coupled to the storage network for determining a storage area managed by each storage node. In another embodiment, each storage node may include a storage allocation module for determining a storage area managed by the storage node, This can be achieved by a communication and coordination processing algorithm between the various storage allocation modules included in each storage node, which algorithm can for example be based on load balancing between the various storage nodes.

In one embodiment, upon detecting a failure of a storage node, other or all of the storage nodes may be configured such that the storage nodes take over the storage area previously managed by the failed storage node. For example, one of the storage nodes may take over a storage area managed by the failed storage node, or may be taken over by at least two other storage nodes, wherein each storage node takes over a portion of the storage area managed by the failed storage node, For example, at least two other storage nodes respectively take over different storage groups in the storage area.

In one embodiment, the storage medium may include, but is not limited to, a hard disk, a flash memory, an SRAM, a DRAM, an NVME, or an NVRAM. The access interface of the storage medium may include, but is not limited to, a SAS interface, a SATA interface, a PCI/e interface, and a DIMM. Interface, NVMe interface, SCSI interface, AHCI interface.

In an embodiment of the invention, the storage network may include at least one storage switching device, and the storage node accesses the storage medium through data exchange between the storage switching devices included therein. Specifically, the storage node and the storage medium are respectively connected to the storage switching device through the storage channel.

In an embodiment of the invention, the storage switching device may be a SAS switch or a PCI/e switch. Correspondingly, the storage channel may be a SAS (Serial Attached SCSI) channel or a PCI/e channel.

Taking the SAS channel as an example, compared with the traditional IP-based storage solution, the SAS-based switching solution has the advantages of high performance, large bandwidth, and a large number of disks per device. When used in conjunction with a host adapter (HBA) or a SAS interface on a server board, the storage provided by the SAS system can be easily accessed by multiple servers connected simultaneously.

Specifically, the SAS switch is connected to the storage device through a SAS line, and the storage device and the storage medium are also connected by a SAS interface. For example, the storage device internally connects the SAS channel to each storage medium (may be in the storage device) Internally set a SAS switch chip). Since the bandwidth of a SAS network can reach 24Gb or 48Gb, it is dozens of times that of Gigabit Ethernet, and Several times the cost of 10 Gigabit Ethernet; at the same time, the link layer SAS has an order of magnitude improvement over the IP network. At the transport layer, due to the TCP handshake three times, the overhead is high and the TCP delay acknowledgement mechanism is slow. The startup sometimes causes a delay of 100 milliseconds. The delay of the SAS protocol is only a few tenths of that of TCP, and the performance is greatly improved. In summary, SAS networks offer significant advantages in terms of bandwidth and latency over Ethernet-based TCP/IP. Those skilled in the art will appreciate that the performance of the PCI/e channel can also be adapted to the needs of the system.

In an embodiment of the invention, the storage network may include at least two storage switching devices, each of which may be connected to any one of the storage devices through any one of the storage switching devices, thereby being connected to the storage medium. When any storage switching device or storage channel connected to a storage switching device fails, the storage node reads and writes data on the storage device through other storage switching devices.

Referring to Figure 3, there is shown a particular storage system 30 constructed in accordance with one embodiment of the present invention. The storage devices in the storage system 30 are constructed as a plurality of JBODs 307-310, which are respectively connected to the two SAS switches 305 and 306 through SAS data lines, which constitute the switching core of the storage network included in the storage system. The front end is at least two servers 301 and 302, each of which is connected to the two SAS switches 305 and 306 via an HBA device (not shown) or a SAS interface on the motherboard. There is a basic network connection between the servers for monitoring and communication. Each server has a storage node that manages some or all of the disks in all JBOD disks using information obtained from the SAS links. Specifically, the storage area, the storage group, and the storage block described above in the application file may be used to divide the JBOD disk into different storage groups. Each storage node manages one or more sets of such storage groups. When redundant storage is used inside each storage group, redundantly stored metadata can exist on the disk, so that redundant storage can be directly recognized from the disk by other storage nodes.

In the exemplary storage system 30 shown, the storage node can install a monitoring and management module that is responsible for monitoring the status of local storage and other servers. When a JBOD is abnormal overall or a disk on the JBOD is abnormal, data reliability is ensured by redundant storage. When a server fails, The management module in the storage node on another pre-configured server will locally identify and take over the disk managed by the storage node of the failed server according to the data on the disk. The storage node originally provided by the storage node of the faulty server will also be extended on the storage node on the new server. So far, a new highly available global storage pool structure has been implemented.

As can be seen, the exemplary storage system 30 is constructed to provide a multi-point, controllable, globally accessible storage pool. The hardware uses multiple servers to provide external services, and uses JBOD to store disks. Multiple JBODs are connected to two SAS switches, and the two switches are respectively connected to the server's HBA cards, thereby ensuring that all disks on the JBOD can be accessed by all servers. The SAS redundant link also ensures high availability on the link.

Locally, each server uses redundant storage technology to select redundant disks from each JBOD to avoid redundant data loss. When one server fails, the module that monitors the overall state will schedule another server to access the disks managed by the storage node of the failed server through the SAS channel, and quickly take over the disks that the other party is responsible for, achieving high-available global storage.

Although the JBOD storage disk is illustrated in FIG. 3 as an example, it should be understood that the embodiment of the present invention as shown in FIG. 3 also supports a storage device other than JBOD. In addition, the above is an example in which one storage medium (entire) is used as one storage block, and the same applies to a case where a part of one storage medium is used as one storage block.

An embodiment of the present invention further provides a redundant storage method, where the applicable storage system includes: a storage network; at least two storage nodes connected to the storage network; and at least two storage devices connected to the storage network, each storage The device includes at least one storage medium; wherein each storage node accesses at least two storage devices through the storage network; the method includes:

Saving data in a redundant storage manner between at least one of each of at least two storage devices accessed by the same storage node, wherein the storage block is a complete storage medium or a storage medium portion.

In an embodiment of the invention, all storage media in the storage system form a storage pool. The storage pool is a global storage pool as described above, that is, all storage media in the storage pool can be shared by all storage nodes in the storage system, and each storage node can access the storage pool without using other storage nodes. All storage media.

Specifically, the redundant storage method based on the global storage pool may be implemented by first selecting multiple storage devices from the storage pool, and then selecting each of the selected multiple storage devices. At least one storage block aggregates all the storage blocks selected by the above process into a storage group. Thus, in the storage group, data is stored in all storage blocks of the storage group in a redundant manner. When a certain storage block in the storage group fails, data in the other storage block in the storage group can be used to acquire data in the failed storage block.

It should be understood that the storage blocks in one storage group do not necessarily come from all storage devices in the storage pool, and the storage devices in the storage pool are not necessarily all used for redundant storage, and are not selected for redundant storage. Storage devices and storage blocks can be used as hot spare devices that are not normally used.

It should be understood that the manner of redundant storage between the storage blocks in the storage group may be specifically implemented by a multiple copy mode, a RAID mode, or an erasure code mode. The specific manner of the redundant storage between the storage blocks in the storage group is not Make a limit.

In an embodiment of the present invention, in order to satisfy more flexible storage settings according to specific content stored, a plurality of storage groups may also be aggregated into a storage area.

As mentioned earlier, since the storage blocks in the storage group are actually from different storage devices, the fault tolerance level of the storage pool is related to the fault tolerance level of the redundant storage in the storage group, so the fault tolerance level of the storage pool can be adjusted by adjusting the storage group. The number of memory blocks that are allowed to fail at the same time and/or the number of memory blocks for aggregation into the same memory group are each selected from at least two storage devices of the storage pool. The specific adjustment manner may be the same as the method performed by the fault tolerance level adjustment module in the foregoing storage system, and details are not described herein again.

It can be seen that by adopting the redundant redundant storage method applied to the storage system provided by the embodiment of the present invention, different fault tolerance levels of the storage pool can be realized by adjusting the fault tolerance level of the storage group and the selection policy of the storage block in the storage group. To adapt to different levels of actual storage needs.

FIG. 4 is a schematic structural diagram of a storage pool using redundant storage according to an embodiment of the present invention. As shown in FIG. 4, the storage pool 40 includes five storage devices JBOD1 to JBOD5, and each storage device includes five storage blocks. The five storage devices JBOD1 to JBOD5 in the storage pool 40 are used for redundant storage, and each storage device selects one storage block to be aggregated into a storage group in an erasure code. For example, the memory blocks D1 to D5 are aggregated into one memory group P1, and D11 to D15 can be aggregated into another memory group. In the storage group P1, the data is stored in the storage blocks D1 to D5 in an erasure code, and the check level of the erasure code is 2, that is, the number of storage blocks allowed to simultaneously fail in the storage group P1 is 2, then the storage is The number of storage devices allowed to fail simultaneously in pool 40 is also two.

FIG. 5 is a schematic structural diagram of a storage pool using redundant storage according to another embodiment of the present invention. As shown in FIG. 5, the five storage devices JBOD1 to JBOD5 in the storage pool 50 are also used for redundant storage, but each storage device selects two storage blocks and is aggregated in an erasure code. Storage group. For example, the memory blocks D1 to D15 are aggregated into one memory group P2, and the memory blocks D21 to D35 can be aggregated into another memory group. In the storage group P2, the check level of the erasure code is 3, that is, the number of storage blocks allowed to be simultaneously faulty in the storage group P2 is three, and the number of storage devices that allow simultaneous failure in the storage pool 50 is 3/2. The integer bit = 1, that is, the number of storage devices in the storage pool 50 that allow simultaneous failure is only one.

An embodiment of the present invention further provides a redundant storage device, where the storage system includes: a storage network; at least two storage nodes connected to the storage network; and at least two storage devices connected to the storage network Each of the storage devices includes at least one storage medium; wherein each of the storage nodes accesses at least two storage devices through the storage network; the redundant storage device includes:

a redundant storage module configured to store data in a redundant manner between at least one of each of the at least two storage devices accessed by the same storage node, wherein the storage block is a complete The storage medium is either part of a storage medium. It should be understood that the method performed by the redundant storage module is the same as the foregoing redundant storage method, and the functional effects that can be achieved are also the same, and details are not described herein again.

An embodiment of the invention further provides a computer program product of a computer readable storage medium, comprising computer program code, which when executed by a processor, enables the processor to be implemented according to the method of the embodiments of the invention A redundant storage method of an embodiment. The computer storage medium can be any tangible medium such as a floppy disk, CD-ROM, DVD, hard drive, or even network media.

It should be understood that although an implementation form of the embodiments of the present invention described above may be a computer program product, the method or apparatus of the embodiments of the present invention may be implemented in software, hardware, or a combination of software and hardware. The hardware portion can be implemented using dedicated logic; the software portion can be stored in memory and executed by a suitable instruction execution system, such as a microprocessor or dedicated design hardware. One of ordinary skill in the art will appreciate that the methods and apparatus described above can be implemented using computer-executable instructions and/or embodied in processor control code, such as a carrier medium such as a magnetic disk, CD or DVD-ROM, such as a read only memory. Such code is provided on a programmable memory (firmware) or on a data carrier such as an optical or electronic signal carrier. The method and apparatus of the present invention may be implemented by hardware circuits such as very large scale integrated circuits or gate arrays, semiconductors such as logic chips, transistors, etc., or programmable hardware devices such as field programmable gate arrays, programmable logic devices, etc., also It can be implemented by software executed by various types of processors, or by a combination of the above-described hardware circuits and software such as firmware.

It should also be understood that the descriptions of the present invention are merely illustrative of some key, non-essential techniques and features, and may not be described in a manner that can be realized by those skilled in the art.

The above is only the preferred embodiment of the present invention, and is not intended to limit the present invention. Any modifications, equivalent substitutions, etc., which are within the spirit and principles of the present invention, should be included in the scope of the present invention. within.

Claims

A redundant storage system, comprising:

Storage network

At least two storage nodes connected to the storage network;

At least two storage devices connected to the storage network, each of the storage devices including at least one storage medium;

Each of the storage nodes accesses at least two storage devices through the storage network, and is redundantly stored between at least one storage block of each of the at least two storage devices accessed by the same storage node. The data is saved, wherein the storage block is a complete storage medium or is part of a storage medium.
The redundant storage system of claim 1 wherein said storage network is configured such that each storage node is capable of accessing all storage media without the aid of other storage nodes.
The redundant storage system according to claim 2, wherein all storage media included in the redundant storage system constitute a storage pool, and the storage pool is divided into at least two storage areas, each storage node Responsible for managing zero to multiple storage areas.
The redundant storage system according to claim 3, wherein each of said storage areas comprises at least two storage blocks, and at least two storage blocks constituting said each storage area are divided into one or more storages. Groups, which store data in redundant storage between storage blocks within each storage group.
The redundant storage system according to claim 4, wherein the number of storage blocks located in the same storage device in one of the storage groups is less than or equal to the redundancy of the redundant storage.
The redundant storage system of claim 4, further comprising:

a fault tolerance level adjustment module configured to adjust a number of storage blocks in the storage group that allow simultaneous failures and/or to select a number of storage blocks to be aggregated into the same storage group from each of the at least two storage devices of the storage pool To adjust the fault tolerance level of the storage pool.
The redundant storage system according to claim 6, wherein D indicates a number of storage blocks in the storage group that allow simultaneous failure, and N indicates that each of the storage pools is selected from at least two storage devices. The number of storage blocks that are aggregated into the same storage group, and M indicates the number of storage devices in the storage pool that are allowed to fail simultaneously; then the fault tolerance level of the storage pool determined by the fault tolerance level adjustment module is M=D/ N, D/N takes only integer bits.
The redundant storage system of claim 4 wherein one storage group has at most one storage block in a storage device.
The redundant storage system according to any one of claims 1 to 8, wherein the redundant storage mode is RAID, erasure code or multiple copy mode; or

The storage device is a JBOD; and/or the storage medium is a hard disk, a flash memory, a DRAM or an NVRAM; and/or the interface of the storage medium is a SAS interface, a SATA interface, a PCI/e interface, a DIMM interface, and an NVMe interface. , SCSI interface or AHCI interface.
A redundant storage system according to any one of claims 1 to 8, wherein said storage node is a combination of one or more of the following: a virtual machine of said server, a container and A module that runs directly on the physical operating system of the server.
A redundant storage method, characterized in that the applicable redundant storage system comprises: a storage network;

At least two storage nodes connected to the storage network; and at least two storage devices connected to the storage network, each of the storage devices including at least one storage medium; wherein each of the storage nodes passes the The storage network accesses at least two storage devices; the method includes:

Saving data in a redundant storage manner between at least one of each of at least two storage devices accessed by the same storage node, wherein the storage block is a complete storage medium or a storage Part of the media.
The method according to claim 11, wherein storing data in a redundant manner between at least one of the at least two storage devices of the at least two storage devices accessed by the same storage node comprises:

At least one storage block of each of the at least two storage devices accessed by the same storage node is aggregated into a storage group in a redundant storage manner.
The method of claim 12 wherein said storage network is configured such that each storage node is capable of accessing all storage media without the aid of other storage nodes, all storage media included in said redundant storage system Forming a storage pool, wherein the method further comprises:

Adjusting the storage pool by adjusting the number of storage blocks in the storage group that allow simultaneous failures and/or selecting the number of storage blocks for aggregation into the same storage group from each of the at least two storage devices of the storage pool Fault tolerance level.
The method according to claim 13, wherein the number of storage blocks allowed to simultaneously fail in the storage group is adjusted and/or selected from at least two storage devices of the storage pool for aggregation into the same The number of storage blocks of the storage group to adjust the fault tolerance level of the storage pool includes:

The number of storage blocks in the storage group that allow simultaneous failures is represented by D, and the number of storage blocks selected from the at least two storage devices of the storage pool for aggregation into the same storage group is represented by N. Indicates the number of storage devices in the storage pool that are allowed to fail at the same time; then M=D/N, D/N takes only integer bits.
The method according to any one of claims 11 to 14, further comprising: a plurality of said storage groups being aggregated into a storage area.
A redundant storage device, characterized in that the applicable redundant storage system comprises: a storage network; at least two storage nodes connected to the storage network; and at least two storage devices connected to the storage network, Each of the storage devices includes at least one storage medium; wherein each of the storage nodes accesses at least two storage devices through the storage network; the redundant storage device includes:

a redundant storage module configured to store data in a redundant manner between at least one of each of the at least two storage devices accessed by the same storage node, wherein The storage block is a complete storage medium or a part of a storage medium.