CN112783669A - Distributed storage management method and system

Info

Publication number: CN112783669A
Application number: CN202110014809.XA
Authority: CN
Inventors: 杨宏兵; 杨恒; 宣旭迎; 张承冰; 李洪新; 周石开; 周天睿; 袁鹏飞; 文中领; 周泽湘
Original assignee: Beijing Toyou Feiji Electronics Co ltd
Current assignee: Beijing Toyou Feiji Electronics Co ltd
Priority date: 2021-01-06
Filing date: 2021-01-06
Publication date: 2021-05-11

Abstract

The invention provides a distributed storage management method and system. The method comprises: deploying message queues among different storage nodes; and synchronizing data updates on a storage node to the other storage nodes in the storage system through the deployed message queues. The method and system can improve system availability and avoid single points of failure and cascading crashes.

Description

Distributed storage management method and system

Technical Field

The present invention relates to the field of distributed system technologies, and in particular, to a distributed storage management method and system.

Background

With the rapid development of the mobile internet and the explosive growth of business data, enterprises increasingly depend on accurate data, and the storage system has therefore become increasingly important within the enterprise information system. At present, storage systems store data in one of two main ways: centralized storage and distributed storage. Correspondingly, storage management software is divided into centralized and distributed storage management software, which provides users with all necessary interface operations for storage resource management, monitoring, reporting, tiered storage, performance management, disaster recovery, and business continuity. Distributed storage management software runs on multiple device nodes of a distributed storage system. To ensure business continuity, various High Availability (HA) schemes are commonly adopted. An HA scheme aims to build reliable, secure, scalable, and easily maintained storage management software, so as to keep the system running stably and avoid single points of failure.

At present, high availability schemes for distributed storage management software mainly rely on migrating stateless services: when a node fails, the services running on it are transferred to other normally running nodes in the distributed storage cluster. Such schemes typically include two parts: IP address drift and synchronous backup of service data. IP address drift either transfers the IP address of the failed node to a normally running node, or configures a virtual IP that drifts among the nodes of the distributed storage according to their health state. The virtual IP always points to a normally running node, which keeps the service running continuously. Synchronous backup of service data is done in one of two modes: software synchronization and hardware synchronization. Software synchronization uses the synchronization function of the database to synchronize service data to different nodes in real time. Hardware synchronization uses the backup function of the storage itself, storing the data across different physical disks.

High availability schemes based on stateless service migration have the following disadvantages: node idleness and cascading crashes. Distributed storage management software runs on multiple nodes of the distributed storage, and when one node fails, the services running on it are transferred to other normally running nodes. Before the failure, the node that takes over the services is always idle, so its resources are not fully utilized. Moreover, if the failed node crashed because its workload was too heavy, the standby node will crash for the same reason once the services are transferred to it.

Disclosure of Invention

The technical problem to be solved by the invention is to provide a distributed storage management method and system that can improve system availability and avoid single points of failure and cascading crashes.

In order to solve the above technical problem, the present invention provides a distributed storage management method, where the method includes: deploying message queues among different storage nodes; and synchronizing the data update of the storage node to other storage nodes in the storage system through the deployed message queue.

In some embodiments, the synchronization of the data updates comprises: real-time synchronization and timing synchronization.

In some embodiments, the method further comprises: when some of the storage nodes in the storage system fail, setting the service node to a storage node in the storage system that has not failed.

In some embodiments, the method further comprises: distributing the access load across the storage nodes in the storage system through a load balancing mechanism.

In some embodiments, deploying message queues between different storage nodes includes: deploying message queues between adjacent storage nodes of a ring storage system.

In some embodiments, deploying message queues between different storage nodes includes: deploying message queues between adjacent storage nodes of a star storage system.

In some embodiments, deploying message queues between different storage nodes includes: deploying a message queue between any two storage nodes in the storage system.

In addition, the present invention also provides a distributed storage management system, including: a plurality of different storage nodes, with message queues deployed between the plurality of different storage nodes, the distributed storage management system being configured to perform the distributed storage management method according to the foregoing.

After adopting such design, the invention has at least the following advantages:

1. Improved device utilization, avoiding idle nodes:

Users' operation tasks are distributed across multiple nodes by means of the message queue mechanism, which improves device utilization.

2. Improved system availability, avoiding single points of failure and cascading crashes:

The services on a failed node are transferred evenly to the other running nodes by means of the message queue mechanism.

Drawings

The foregoing is only an overview of the technical solutions of the present invention, and in order to make the technical solutions of the present invention more clearly understood, the present invention is further described in detail below with reference to the accompanying drawings and the detailed description.

Fig. 1 is a system architecture diagram of distributed storage management software provided by an embodiment of the present invention.

Detailed Description

The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.

Referring to Fig. 1, a distributed storage system according to an embodiment of the present invention includes a plurality of different storage nodes. These storage nodes may be, for example, the IP A, IP B, and IP C nodes shown in Fig. 1. The data stored in the distributed storage system is distributed over these different storage nodes.

In the embodiment of the invention, message queues are deployed among different storage nodes. These message queues may be, for example, Redis message queues. The function of these message queues is to synchronize data updates on the storage nodes to other storage nodes in the storage system.
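The synchronization described above can be illustrated with a minimal in-process sketch. It is not the patented implementation: Python's standard `queue.Queue` stands in for a real message queue such as Redis, and the `StorageNode` class, its methods, and the key `volume-1` are hypothetical names introduced only for illustration.

```python
import queue


class StorageNode:
    """Hypothetical storage node holding a local key-value copy of the data."""

    def __init__(self, name):
        self.name = name
        self.data = {}
        self.inbox = queue.Queue()  # stands in for the message queue feeding this node
        self.peers = []             # nodes that receive this node's updates

    def connect(self, other):
        """Deploy a (simulated) queue link from this node to another node."""
        self.peers.append(other)

    def update(self, key, value):
        """Apply an update locally, then publish it to every peer's queue."""
        self.data[key] = value
        for peer in self.peers:
            peer.inbox.put((key, value))

    def drain(self):
        """Consume all pending messages and apply them to the local copy."""
        while True:
            try:
                key, value = self.inbox.get_nowait()
            except queue.Empty:
                break
            self.data[key] = value


# Three nodes, with a queue link between every pair (assumed layout).
nodes = [StorageNode(name) for name in ("IP A", "IP B", "IP C")]
for src in nodes:
    for dst in nodes:
        if src is not dst:
            src.connect(dst)

nodes[0].update("volume-1", "online")  # update arrives at one node
for node in nodes:
    node.drain()                       # the other nodes consume their queues

print([node.data for node in nodes])
```

After the queues are drained, every node holds the same copy of the update, which is what allows any of them to serve requests later.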

The message queues are deployed between different storage nodes. If the storage nodes in the storage system form a star structure, the message queues may be deployed between the different nodes of the overall star architecture. If the storage nodes form a ring structure, the message queues are deployed between adjacent nodes of the ring.

There is also a special case where the above-mentioned message queue is deployed between any two storage nodes of the storage system.
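The three deployment layouts can be compared by listing which node pairs would receive a queue. The sketch below is only an illustration: the function names are hypothetical, and the star layout is assumed to be a hub-and-spoke arrangement with one central node, which the text does not spell out.

```python
from itertools import combinations


def ring_links(nodes):
    """Queue links between adjacent nodes of a ring (the last node wraps to the first)."""
    n = len(nodes)
    if n < 2:
        return set()
    return {frozenset((nodes[i], nodes[(i + 1) % n])) for i in range(n)}


def star_links(nodes, hub):
    """Queue links between an assumed central hub and every other node."""
    return {frozenset((hub, node)) for node in nodes if node != hub}


def mesh_links(nodes):
    """Queue links between any two nodes (the special case in the text)."""
    return {frozenset(pair) for pair in combinations(nodes, 2)}


nodes = ["A", "B", "C", "D"]
print(len(ring_links(nodes)), len(star_links(nodes, hub="A")), len(mesh_links(nodes)))
# prints: 4 3 6
```

For n nodes, a ring needs n links (for n of at least 3), a star needs n - 1, and the any-two-nodes layout needs n(n - 1)/2, so the last option trades more queues for direct propagation between every pair.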

An advantage of deploying message queues between different storage nodes is that the overall availability of the storage system is increased. Because every data update is synchronized to the other storage nodes in the system, if some nodes fail, the system can continue to provide service externally by relying on the nodes that have not failed, and it does not need to be shut down for maintenance.
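Selecting a service node that has not failed might look like the following sketch. The function `pick_service_node` and the `is_healthy` callback are hypothetical; the text does not describe how node health is detected, so that part is left to the caller.

```python
def pick_service_node(nodes, is_healthy, current=None):
    """Keep the current service node if it is healthy, else pick the first healthy node."""
    if current is not None and is_healthy(current):
        return current
    for node in nodes:
        if is_healthy(node):
            return node
    raise RuntimeError("no healthy storage node available")


nodes = ["IP A", "IP B", "IP C"]
failed = {"IP A"}  # assumed failure state, for illustration only
service = pick_service_node(nodes, lambda n: n not in failed, current="IP A")
print(service)  # prints: IP B
```

Because every node already holds the synchronized data, switching the service node requires no data transfer at failover time in this sketch.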

In addition, because every data update is synchronized in real time, any node in the system can serve users independently. The service load actually borne by each node is therefore greatly reduced, the utilization of each individual node is improved, and idle nodes are avoided.

Of course, when combined with a suitable load balancing strategy, the improvement in device utilization is even more pronounced.
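The text does not name a particular load balancing strategy. As one possible example, a simple round-robin policy could spread requests evenly over the nodes. The class `RoundRobinBalancer` below is a hypothetical sketch under that assumption.

```python
from collections import Counter
from itertools import cycle


class RoundRobinBalancer:
    """Hand out storage nodes in rotation so each receives an equal share of requests."""

    def __init__(self, nodes):
        if not nodes:
            raise ValueError("at least one storage node is required")
        self._cycle = cycle(list(nodes))

    def next_node(self):
        """Return the node that should serve the next request."""
        return next(self._cycle)


balancer = RoundRobinBalancer(["IP A", "IP B", "IP C"])
counts = Counter(balancer.next_node() for _ in range(9))
print(dict(counts))  # each of the three nodes serves 3 of the 9 requests
```

Other policies, such as least-connections or weighted distribution, would fit the same interface. Round-robin is used here only because it is the simplest to verify.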

In addition, the data synchronization mode through the message queue may be: real-time synchronization, or timing synchronization.

Real-time synchronization means that a data update is synchronized to the other storage nodes as soon as it occurs locally. Timing synchronization (that is, scheduled synchronization) means that a local data update is not propagated to the other storage nodes immediately. Instead, all data updates that occur within a period of time are synchronized in a batch at a fixed time point.
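The difference between the two modes can be sketched as follows. The `SyncPublisher` class, its `mode` values, and the `send` callback are hypothetical names. In a real deployment `send` would push a batch onto the message queue, and `flush` would be triggered by a scheduler at the fixed time point.

```python
class SyncPublisher:
    """Forward local updates either immediately or in batches at a fixed time point."""

    def __init__(self, send, mode="realtime"):
        if mode not in ("realtime", "timed"):
            raise ValueError("mode must be 'realtime' or 'timed'")
        self.send = send      # callable that delivers a list of (key, value) updates
        self.mode = mode
        self.pending = []     # updates buffered while waiting for the next flush

    def on_update(self, key, value):
        """Handle a local data update according to the configured mode."""
        if self.mode == "realtime":
            self.send([(key, value)])          # propagate as soon as it occurs
        else:
            self.pending.append((key, value))  # hold until the fixed time point

    def flush(self):
        """Send all buffered updates as one batch (called at the fixed time point)."""
        if self.pending:
            batch, self.pending = self.pending, []
            self.send(batch)


realtime_sent, timed_sent = [], []
realtime = SyncPublisher(realtime_sent.append, mode="realtime")
timed = SyncPublisher(timed_sent.append, mode="timed")
for publisher in (realtime, timed):
    publisher.on_update("a", 1)
    publisher.on_update("b", 2)

print(realtime_sent)  # two single-update messages, sent immediately
print(timed_sent)     # still empty: nothing has been sent yet
timed.flush()
print(timed_sent)     # one batch containing both updates
```

Real-time mode minimizes the window in which nodes disagree, while timed mode reduces message traffic at the cost of a longer window.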

The invention also provides a distributed storage management method. Specifically, the method comprises the following steps: deploying message queues among different storage nodes; and synchronizing the data update of the storage node to other storage nodes in the storage system through the deployed message queue.

The above description only illustrates preferred embodiments of the present invention and is not to be construed as limiting the invention in any way. Those skilled in the art may make various modifications, equivalent variations, or adaptations based on the above description without departing from the spirit and scope of the present invention.

Claims

1. A distributed storage management method, comprising:

deploying message queues among different storage nodes;

and synchronizing the data update of the storage node to other storage nodes in the storage system through the deployed message queue.

2. The distributed storage management method of claim 1, wherein synchronizing data updates comprises: real-time synchronization and timing synchronization.

3. The distributed storage management method according to claim 1, further comprising:

when some of the storage nodes in the storage system fail, setting the service node to a storage node in the storage system that has not failed.

4. The distributed storage management method according to claim 1, further comprising:

distributing the access load across the storage nodes in the storage system through a load balancing mechanism.

5. The distributed storage management method of claim 1, wherein deploying message queues among different storage nodes comprises:

deploying message queues between adjacent storage nodes of a ring storage system.

6. The distributed storage management method of claim 1, wherein deploying message queues among different storage nodes comprises:

deploying message queues between adjacent storage nodes of a star storage system.

7. The distributed storage management method of claim 1, wherein deploying message queues among different storage nodes comprises:

deploying a message queue between any two storage nodes in the storage system.

8. A distributed storage management system, comprising:

a plurality of different storage nodes having message queues deployed therebetween, the distributed storage management system being configured to perform the distributed storage management method according to any one of claims 1 to 7.