RU2018122861A

RU2018122861A - Method for building high-performance fault-tolerant storage systems based on distributed file systems and NVMe over Fabrics technology

Info

Publication number: RU2018122861A
Application number: RU2018122861A
Authority: RU
Inventors: Егор Александрович Дружинин; Антон Владимирович Катенев; Павел Александрович Лавренко; Константин Алексеевич Пономарев; Александр Александрович Московский
Original assignee: Общество с ограниченной ответственностью "РСК Лабс" (ООО "РСК Лабс")
Priority date: 2018-06-22
Filing date: 2018-06-22
Publication date: 2019-12-23
Also published as: RU2716040C2; RU2018122861A3

Claims

1. A method of constructing a high-performance fault-tolerant storage system based on a distributed file system and NVMe over Fabrics technology in hyperconverged infrastructures (systems), which consists in constructing a system including: computing nodes of a server farm (servers), which include standard components, such as CPUs, RAM, full-duplex data network with support for RDMA technology, power, cooling, control subsystems, data storage devices in the form of SSD drives, plug data to the server farm computing nodes (servers) using the NVMe protocol and a full-duplex data transmission network with RDMA technology support, while the server farm computing nodes (servers), SSD drives and a full-duplex data transmission network supporting RDMA technology are combined in a hyperconverged infrastructure using software funds, and their management occurs through a common administration console, characterized in that

using storage devices provided by NVMe over Fabrics technology from all hyper-converged infrastructure, connected by a data transmission network with support for RDMA technology;

all network components are duplicated;

As nodes providing access to the entire hyperconverged infrastructure using NMVe over Fabrics technology, all servers of the hyperconverged infrastructure, as well as specialized shelves with NVMe disks, are used;

some nodes containing NVMe storage devices used in the storage system receive the Target role and provide remote access to the storage devices in their composition, and the remaining nodes containing NVMe storage devices used in the storage system receive the Host role, remote storage devices are connected to them, which in turn are assembled into software RAID arrays with a certain level of data redundancy, these RAID arrays act as disk space for data (OSS), Distributed File System (RFU), also run on the nodes of the storage system, where one RAID array is connected to one OSS, running on the same storage node,

in this case, all storage devices included in one RAID array must be physically located on different Target, that is, on different servers, for each active RAID array there must be at least one inactive copy located on another storage node, for each active OSS There must be at least one inactive copy located on another data storage node.

2. The method of constructing a high-performance fault-tolerant data storage system according to claim 1, characterized in that the storage system can have a number of spare NVMe storage devices connected to one or another server in the role of Target and not included in any of RAID arrays.