CN115118779A

CN115118779A - Method, system, device and medium for building cluster based on centralized storage

Info

Publication number: CN115118779A
Application number: CN202210732771.4A
Authority: CN
Inventors: 袁东海; 胡玉鹏; 李红卫
Original assignee: Inspur Jinan data Technology Co ltd
Current assignee: Inspur Jinan data Technology Co ltd
Priority date: 2022-06-24
Filing date: 2022-06-24
Publication date: 2022-09-27
Anticipated expiration: 2042-06-24
Also published as: CN115118779B

Abstract

The invention provides a method, system, device and medium for building a cluster based on centralized storage. The method comprises: allocating memory space for the data written by each node in the cluster, controlling the writing of node data into the memory, and merging the written data for centralized storage; and, when the centralized storage is performed, adding a second cache under data read-write control, the cache using a local disk to store incremental data. When a node in the cluster needs to delete data, the data is deleted directly from the centralized storage. A system, device and medium for building a cluster based on centralized storage are also provided on the basis of the method. By configuring all nodes of the cluster to use the same centralized storage, the invention keeps the data used by each node consistent and thereby solves the data security problem, while multiple nodes sharing the same centralized storage preserve the high availability and throughput of the cluster service.

Description

Method, system, equipment and medium for building cluster based on centralized storage

Technical Field

The invention belongs to the technical field of cluster building, and particularly relates to a method, a system, equipment and a medium for building a cluster based on centralized storage.

Background

RabbitMQ is message middleware used for producing and consuming messages. After a client connects to the RabbitMQ service, it can send messages to the RabbitMQ server and receive messages from it; sending a message is production, and receiving a message is consumption. As mature, widely used message middleware, RabbitMQ is used in many products. To provide high availability and throughput, RabbitMQ services are typically deployed as clusters.

RabbitMQ as currently used has a distributed storage architecture, with data stored on multiple nodes. Fig. 1 shows the data storage of a typical RabbitMQ cluster. Each node stores data locally (on a local disk), and data must be synchronized among the nodes of the cluster. Each node in the cluster can provide service. The disadvantage of this approach is that when a node becomes abnormal, the data held by the nodes in the cluster may become inconsistent. If the inconsistency cannot be repaired through data synchronization within the cluster, the RabbitMQ service cannot be started. This is the data security problem of RabbitMQ.

Disclosure of Invention

To solve the above technical problem, the invention provides a method, system, device and medium for building a cluster based on centralized storage, which ensure the security of cluster data while retaining the high availability of the cluster and without significantly reducing its throughput.

In order to achieve the purpose, the invention adopts the following technical scheme:

a cluster building method based on centralized storage comprises the following steps:

allocating memory space for writing of each node data in the cluster, controlling the writing of the node data into the memory and combining the written data for centralized storage;

when the centralized storage is carried out, a second cache for controlling data reading and writing is added; the cache adopts a local disk for storing incremental data.

Further, the method also includes deleting data directly from the centralized storage when a node in the cluster needs to delete data.

Further, the data of each node in the cluster is located in the first cache, and the data of each node in the cluster adopts memory cache data.

Further, global control is added in the process of controlling the writing of node data into the memory, so that only one node writes at any given time, and the next node can write only after the current node has finished writing.

Further, the method for merging the written data for centralized storage includes:

only one copy of the same part in the data of each node is stored in a centralized storage;

different parts of the data of each node are respectively stored in a centralized storage.

Further, after the data is stored incrementally, the data read-write control periodically merges the data and then writes it into the centralized storage.

The invention also provides a system for building a cluster based on centralized storage, which comprises a write module and a storage module;

the write-in module is used for distributing memory space for writing in of each node data in the cluster, controlling the writing in of the node data into the memory and combining the written data for centralized storage;

the storage module is used for adding a second cache for data read-write control when performing centralized storage; the cache adopts a local disk for storing incremental data.

Further, the system also comprises a deleting module;

and the deleting module is used for directly deleting the data from the centralized storage when the nodes in the cluster need to delete the data.

The invention also proposes a device comprising:

a memory for storing a computer program;

a processor for implementing the method steps as described when executing the computer program.

The invention also proposes a readable storage medium having stored thereon a computer program which, when being executed by a processor, carries out the method steps.

The effects described in this summary are only those of the embodiments, not all effects of the invention. One of the above technical solutions has the following advantages or beneficial effects:

The invention provides a method and a system for building a cluster based on centralized storage. The method comprises: allocating memory space for the data written by each node in the cluster, controlling the writing of node data into the memory, and merging the written data for centralized storage; and, when the centralized storage is performed, adding a second cache under data read-write control, the cache using a local disk to store incremental data. The method also includes deleting data directly from the centralized storage when a node in the cluster needs to delete data. A system, device and medium for building a cluster based on centralized storage are also provided on the basis of the method. By configuring all nodes of the cluster to use the same centralized storage, the invention ensures the consistency of data and solves the data security problem. The centralized storage keeps the data used by each node consistent, and multiple nodes sharing the same centralized storage preserve the high availability and throughput of the cluster service as well as data security.

In the invention, the RabbitMQ service does not use local storage but caches data in memory, and data is no longer synchronized among the nodes of the cluster, so that the security of cluster data is ensured while the high availability of the cluster is retained and without significantly reducing cluster throughput.

After the data is stored incrementally, the data read-write control part periodically merges the data and writes it into the centralized storage. Because the merged data is written after a delay, the number of writes to the storage is reduced.

Drawings

FIG. 1 is a schematic diagram of a rabbitmq cluster structure commonly used in the prior art;

fig. 2 is a schematic diagram of a structure of a centrally stored rabbitmq cluster according to embodiment 1 of the present invention;

fig. 3 is a flowchart of a method for building a cluster based on centralized storage according to embodiment 1 of the present invention;

fig. 4 is a schematic structural diagram of writing and merging control node data into a memory according to embodiment 1 of the present invention;

fig. 5 is a schematic structural diagram of incremental saving in embodiment 1 of the present invention;

fig. 6 is a schematic diagram of a system for building a cluster based on centralized storage according to embodiment 2 of the present invention.

Detailed Description

In order to clearly explain the technical features of the present invention, the following detailed description of the present invention is provided with reference to the accompanying drawings. The following disclosure provides many different embodiments, or examples, for implementing different features of the invention. To simplify the disclosure of the present invention, specific example components and arrangements are described below. Furthermore, the present invention may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed. It should be noted that the components illustrated in the figures are not necessarily drawn to scale. Descriptions of well-known components and processing techniques and procedures are omitted so as to not unnecessarily limit the invention.

Example 1

The embodiment 1 of the invention provides a method for building a cluster based on centralized storage, which ensures the consistency of data by configuring all nodes of the cluster and using the same centralized storage, and solves the problem of data security in the prior art.

Fig. 2 is a schematic diagram of a structure of a centrally stored rabbitmq cluster according to embodiment 1 of the present invention; the centralized storage can ensure that the data used by each node is consistent, and the multiple nodes using the same centralized storage can ensure high availability and throughput of cluster services and data security.

Fig. 3 is a flowchart of a method for building a cluster based on centralized storage according to embodiment 1 of the present invention.

In step S300, allocating a memory space for writing of each node data in the cluster, controlling writing of the node data into the memory and merging the written data for centralized storage;

In this application, the RabbitMQ service does not use local storage but caches data in memory, and data is no longer synchronized among the nodes of the cluster.

In an original rabbitmq cluster architecture, each node uses local storage data (local disk), and data synchronization needs to be performed among the nodes of the cluster. Each node within the cluster may provide a service.

In the design of the invention, each node uses the centralized storage, so that no node needs to store data on a local disk and the nodes do not need to synchronize data with each other (all data is written into the centralized storage). To cache data, a region can be allocated in memory, and the RabbitMQ service reads and writes data from that memory.
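As a minimal sketch of the memory region described above, the following Python fragment loads a node's in-memory cache from a shared store and serves reads and writes from memory. The class names `CentralStore` and `NodeMemoryCache` and their methods are hypothetical illustrations, not part of the patent or of RabbitMQ.

```python
class CentralStore:
    """Hypothetical stand-in for the shared centralized storage."""

    def __init__(self):
        self._records = {}

    def load_all(self):
        # Return a copy so callers cannot mutate the store directly.
        return dict(self._records)

    def put(self, key, value):
        self._records[key] = value


class NodeMemoryCache:
    """Memory region a node's service reads and writes instead of a local disk."""

    def __init__(self, store):
        # Populate the memory region from centralized storage at start-up.
        self._region = store.load_all()

    def read(self, key):
        return self._region.get(key)

    def write(self, key, value):
        # The write stays in memory until it is merged into centralized storage.
        self._region[key] = value


store = CentralStore()
store.put("m1", "hello")

node1 = NodeMemoryCache(store)
node2 = NodeMemoryCache(store)
node1.write("m2", "world")
```

In this sketch both nodes start from the same centralized data, and a node's new write is visible only in its own memory region until a later merge step persists it.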

Fig. 4 is a schematic structural diagram of writing and merging of control node data into a memory according to embodiment 1 of the present invention.

In the process of controlling the writing of node data into the memory, global control is added to ensure that only one node writes at any given time, and the next node can write only after the current node has finished writing.

The RabbitMQ data of each node in the cluster needs to be written into the centralized storage, and a data read-write control part is required to control the reading and writing of data. An obvious problem with centralized storage is that data written by different nodes may conflict: for example, if node 1 writes ABC and node 2 writes BCD, writing directly to the storage may produce errors. The more common approach is to add a global control: while node 1 is writing, no other node can write; after node 1 finishes, another node can write, so that only one write operation is in progress at any time.
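The global control described above can be sketched with a single lock that serializes writers. This is only an illustration under assumed names (`GlobalWriteControl`, `write`); threads stand in for cluster nodes, and a real cluster would need a lock that works across machines.

```python
import threading


class GlobalWriteControl:
    """Allows only one node to write at a time (hypothetical illustration)."""

    def __init__(self):
        self._lock = threading.Lock()
        self._active = 0          # writers currently inside the critical section
        self.max_concurrent = 0   # highest number of simultaneous writers observed
        self.storage = []         # stands in for the shared write target

    def write(self, node_id, items):
        with self._lock:          # the next node waits until this one finishes
            self._active += 1
            self.max_concurrent = max(self.max_concurrent, self._active)
            for item in items:
                self.storage.append((node_id, item))
            self._active -= 1


control = GlobalWriteControl()
threads = [
    threading.Thread(target=control.write, args=(node_id, data))
    for node_id, data in [(1, "ABC"), (2, "BCD")]
]
for t in threads:
    t.start()
for t in threads:
    t.join()
```

Because each write holds the lock for its whole duration, the items of one node are never interleaved with those of another, whichever node happens to acquire the lock first.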

The method for solving the problem in the invention is to control the data writing through the data reading and writing control part. The data control part allocates the space (memory) written by each node, and after the data of each node is written, the data are merged.

The method for merging the written data for centralized storage comprises the following steps: only one copy of the same part in the data of each node is stored in a centralized storage; different parts of the data of each node are respectively stored in a centralized storage.

In fig. 4, data written in the node 1 is ABC, data written in the node 2 is BCD, and data written in the node 3 is ACD.

The data read-write control part allocates memory space in advance, stores the write-in data of 3 nodes, and then merges the data according to the write-in data of the nodes, wherein the merged data is ABCD.
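The merge in the fig. 4 example can be sketched as follows. The function name `merge_node_writes` and the dictionary of per-node buffers are assumptions made for illustration; the patent does not prescribe a data structure.

```python
def merge_node_writes(node_buffers):
    """Merge per-node write buffers, keeping one copy of each shared item."""
    merged = []
    seen = set()
    for node_id in sorted(node_buffers):       # visit nodes in a fixed order
        for item in node_buffers[node_id]:
            if item not in seen:               # shared parts are stored only once
                seen.add(item)
                merged.append(item)            # differing parts are each stored
    return merged


# Pre-allocated space holding the writes of three nodes, as in fig. 4.
buffers = {1: list("ABC"), 2: list("BCD"), 3: list("ACD")}
merged = merge_node_writes(buffers)
print("".join(merged))  # ABCD
```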

Because RabbitMQ is message middleware, each message consists of two parts: a message header and message content. The header contains characteristics and attributes of the message, and the content is the message's actual data. Messages can be classified and merged according to the information in their headers.
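One possible reading of header-based classification and merging is sketched below. The message layout (a dictionary with `header` and `body` keys) and the header field `type` are hypothetical; the patent does not specify which header attributes are used.

```python
from collections import defaultdict


def classify_and_merge(messages, header_field):
    """Group messages by one header field and drop exact duplicates in each group."""
    groups = defaultdict(list)
    for msg in messages:
        key = msg["header"].get(header_field)
        if msg not in groups[key]:     # identical messages are kept only once
            groups[key].append(msg)
    return dict(groups)


messages = [
    {"header": {"type": "order", "id": "1"}, "body": "A"},
    {"header": {"type": "order", "id": "1"}, "body": "A"},  # same message from another node
    {"header": {"type": "log", "id": "2"}, "body": "B"},
]
grouped = classify_and_merge(messages, "type")
```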

In step S310, when the centralized storage is performed, a second cache under data read-write control is added; the cache uses a local disk to store incremental data.

Because RabbitMQ messages are continuously produced and consumed, the data changes in real time. To reduce the number of times data is written to the centralized storage, a cache is introduced in the data read-write control part, and the data in the cache is written into the centralized storage after a certain delay.

The cache of the data read-write control part stores its data on a local disk, and the data is stored incrementally.

Fig. 5 is a schematic structural diagram of incremental saving in embodiment 1 of the present invention; after the data increment is saved, the data read-write control part periodically combines the data and writes the data into the centralized storage.
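The delayed, merged write can be sketched as an append-only increment file on local disk that is flushed in one batch. The class `IncrementalDiskCache`, the JSON-lines file format, and the use of a Python set as the centralized storage are all assumptions for illustration; in practice `flush` would be called by a periodic timer.

```python
import json
import os
import tempfile


class IncrementalDiskCache:
    """Second cache: appends increments to a local file, flushes them in batches."""

    def __init__(self, path, central):
        self._path = path
        self._central = central   # a set stands in for centralized storage
        self.flush_count = 0      # number of batched writes to centralized storage

    def record(self, item):
        # Store only the new item (the increment), not the whole data set.
        with open(self._path, "a", encoding="utf-8") as f:
            f.write(json.dumps({"item": item}) + "\n")

    def flush(self):
        """Merge pending increments and write them to centralized storage once."""
        if not os.path.exists(self._path):
            return 0
        with open(self._path, encoding="utf-8") as f:
            pending = {json.loads(line)["item"] for line in f if line.strip()}
        self._central.update(pending)   # one batched write instead of many
        os.remove(self._path)
        self.flush_count += 1
        return len(pending)


central = set()
with tempfile.TemporaryDirectory() as tmp:
    cache = IncrementalDiskCache(os.path.join(tmp, "increments.log"), central)
    for item in "ABCDB":                 # five increments, one of them repeated
        cache.record(item)
    before_flush = set(central)          # nothing has reached centralized storage yet
    written = cache.flush()              # merged into a single write of four items
```

Five recorded increments result in one write to the centralized storage, which is the reduction in write count that the delay is meant to achieve.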

The invention also includes deleting directly from the centralized storage when the nodes in the cluster need to delete data.

In fig. 5, if node 3 needs to delete A and D, they are deleted from the centralized storage, and BCE remains in the centralized storage.
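The deletion example can be sketched as below. The starting contents ABCDE are an assumption inferred from the stated result (BCE remaining after A and D are removed); the function name `delete_from_central` is hypothetical.

```python
def delete_from_central(central_items, to_delete):
    """Delete items directly from centralized storage, bypassing the caches."""
    doomed = set(to_delete)
    # Rebuild the list in place so every holder of the list sees the deletion.
    central_items[:] = [item for item in central_items if item not in doomed]


central = list("ABCDE")            # assumed contents before deletion
delete_from_central(central, "AD")
print("".join(central))  # BCE
```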

RabbitMQ ordinarily uses a local disk to store data. In this method, the RabbitMQ service no longer stores its data on the local disk and instead uses data cached in memory, which is read from the centralized storage.

In the method for building the cluster based on the centralized storage provided by embodiment 1 of the present invention, all nodes of the cluster are configured to use the same centralized storage to ensure the consistency of data, so as to solve the problem of data security. The centralized storage can ensure that the data used by each node is consistent, and the multiple nodes using the same centralized storage can ensure high availability and throughput of cluster services and data security.

In the method for building a cluster based on centralized storage provided by embodiment 1 of the invention, the RabbitMQ service does not use local storage but caches data in memory, and data is no longer synchronized among the nodes of the cluster, so that the security of cluster data is ensured while the high availability of the cluster is retained and without significantly reducing cluster throughput.

In the method for building a cluster based on centralized storage provided by embodiment 1 of the invention, after the data is stored incrementally, the data read-write control part periodically merges the data and writes it into the centralized storage. Because the merged data is written after a delay, the number of writes to the storage is reduced.

Example 2

Based on the method for building a cluster based on centralized storage provided by embodiment 1 of the present invention, embodiment 2 of the present invention provides a system for building a cluster based on centralized storage. Fig. 6 is a schematic diagram of this system, which includes a write module and a storage module;

the storage module is used for increasing a second cache for data read-write control when centralized storage is carried out; the cache adopts a local disk for storing incremental data.

The system also includes a deletion module; and the deleting module is used for directly deleting the data from the centralized storage when the nodes in the cluster need to delete the data.

The data of each node in the cluster in the write-in module is located in the first cache, and the data of each node in the cluster adopts memory cache data.

The method for merging the written data for centralized storage comprises the following steps: only one copy of the same part in the data of each node is stored in a centralized storage; the different parts of the data of each node are respectively stored in a centralized storage.

In the storage module, because RabbitMQ messages are continuously produced and consumed, the data changes in real time. To reduce the number of times data is written into the centralized storage, a cache is introduced in the data read-write control part, and the data in the cache is written into the centralized storage after a certain delay.

The cache of the data read-write control part stores its data on a local disk, and the data is stored incrementally. After the incremental data is saved, the data read-write control part periodically merges the data and writes it into the centralized storage.
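How the write, storage and deletion modules might fit together is sketched below. The class names mirror the module names in this embodiment, but their methods and the in-memory lists are hypothetical simplifications; in particular, the `pending` list stands in for the local-disk increment cache.

```python
class WriteModule:
    """Holds per-node memory buffers (first cache) and merges them."""

    def __init__(self):
        self.buffers = {}

    def write(self, node_id, items):
        self.buffers.setdefault(node_id, []).extend(items)

    def merge(self):
        merged = []
        for node_id in sorted(self.buffers):
            for item in self.buffers[node_id]:
                if item not in merged:      # keep a single copy of shared items
                    merged.append(item)
        self.buffers.clear()
        return merged


class StorageModule:
    """Stages increments (second cache) and writes them out in batches."""

    def __init__(self, central):
        self.central = central
        self.pending = []                   # stands in for the local-disk cache

    def stage(self, items):
        self.pending.extend(items)

    def flush(self):
        for item in self.pending:
            if item not in self.central:
                self.central.append(item)
        self.pending.clear()


class DeleteModule:
    """Deletes directly from centralized storage."""

    def __init__(self, central):
        self.central = central

    def delete(self, items):
        doomed = set(items)
        self.central[:] = [i for i in self.central if i not in doomed]


central = []                                # stands in for centralized storage
writer = WriteModule()
storage = StorageModule(central)
deleter = DeleteModule(central)

writer.write(1, "ABC")
writer.write(2, "BCD")
storage.stage(writer.merge())               # merged data ABCD enters the second cache
storage.flush()                             # delayed batch write to centralized storage
deleter.delete("A")                         # deletion bypasses both caches
```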

The system for building the cluster based on the centralized storage, which is provided by the embodiment 2 of the invention, ensures the consistency of data by configuring all nodes of the cluster to use the same centralized storage, and solves the problem of data security. The centralized storage can ensure that the data used by each node is consistent, and the multiple nodes using the same centralized storage can ensure high availability and throughput of cluster services and data security.

In the system for building a cluster based on centralized storage provided by embodiment 2 of the present invention, the RabbitMQ service does not use local storage but caches data in memory, and data is no longer synchronized among the nodes of the cluster, so that the security of cluster data is ensured while the high availability of the cluster is retained and without significantly reducing cluster throughput.

In the system for building a cluster based on centralized storage provided by embodiment 2 of the invention, after the data is stored incrementally, the data read-write control part periodically merges the data and writes it into the centralized storage. Because the merged data is written after a delay, the number of writes to the storage is reduced.

Example 3

The invention also proposes a device comprising:

a memory for storing a computer program;

a processor for implementing the method steps when executing the computer program as follows:

The RabbitMQ data of each node in the cluster needs to be written into the centralized storage, and a data read-write control part is required to control the reading and writing of data. An obvious problem with centralized storage is that data written by different nodes may conflict: for example, if node 1 writes ABC and node 2 writes BCD, writing directly to the storage may produce errors. The more common approach is to add a global control: while node 1 is writing, no other node can write; after node 1 finishes, another node can write, so that only one write operation is in progress at any time.

In fig. 4, data written in node 1 is ABC, data written in node 2 is BCD, and data written in node 3 is ACD.

Fig. 5 is a schematic structural diagram of incremental saving in embodiment 1 of the present invention; after the data increment is stored, the data read-write control part periodically merges the data and writes the data into the centralized storage.

In fig. 5, if node 3 needs to delete A and D, they are deleted from the centralized storage, and BCE remains in the centralized storage.

The device provided in embodiment 3 of the present invention ensures data consistency by configuring all nodes of a cluster to use the same centralized storage, thereby solving the problem of data security. The centralized storage can ensure that the data used by each node is consistent, and the multiple nodes using the same centralized storage can ensure high availability and throughput of cluster services and data security.

In the device provided in embodiment 3 of the present invention, the RabbitMQ service does not use local storage but caches data in memory, and data is no longer synchronized among the nodes of the cluster, so that the security of cluster data is ensured while the high availability of the cluster is retained and without significantly reducing cluster throughput.

In the device provided in embodiment 3 of the present invention, after the data is stored incrementally, the data read-write control part periodically merges the data and writes it into the centralized storage. Because the merged data is written after a delay, the number of writes to the storage is reduced.

It should be noted that the technical solution of the present invention also provides an electronic device, including: a communication interface capable of exchanging information with other devices such as network devices; and a processor connected to the communication interface to exchange information with other devices and configured, when running a computer program stored in the memory, to execute the method for building a cluster based on centralized storage provided by one or more of the above technical solutions. In practice, the components of the electronic device are coupled together by a bus system, which enables communication among these components. In addition to a data bus, the bus system includes a power bus, a control bus, and a status signal bus. The memory in the embodiments of the present application stores various types of data to support the operation of the electronic device, for example any computer program that runs on the electronic device. The memory may be volatile memory or nonvolatile memory, or may include both. The nonvolatile memory may be a Read-Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable Programmable Read-Only Memory (EPROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), a ferromagnetic random access memory (FRAM), a Flash Memory, a magnetic surface memory, an optical disc, or a Compact Disc Read-Only Memory (CD-ROM); the magnetic surface memory may be disk storage or tape storage. The volatile memory may be a Random Access Memory (RAM), which serves as an external cache.

By way of illustration and not limitation, many forms of RAM are available, such as Static Random Access Memory (SRAM), Synchronous Static Random Access Memory (SSRAM), Dynamic Random Access Memory (DRAM), Synchronous Dynamic Random Access Memory (SDRAM), Double Data Rate Synchronous Dynamic Random Access Memory (DDR SDRAM), Enhanced Synchronous Dynamic Random Access Memory (ESDRAM), SyncLink Dynamic Random Access Memory (SLDRAM), and Direct Rambus Random Access Memory (DRRAM). The memories described in the embodiments of the present application are intended to include, without being limited to, these and any other suitable types of memory. The method disclosed in the embodiments of the present application may be applied to a processor, or may be implemented by a processor. The processor may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in the processor or by instructions in the form of software. The processor may be a general-purpose processor, a Digital Signal Processor (DSP, a chip capable of implementing digital signal processing), or another programmable logic device, discrete gate or transistor logic device, discrete hardware component, etc. The processor may implement or perform the methods, steps, and logic blocks disclosed in the embodiments of the present application. A general-purpose processor may be a microprocessor or any conventional processor. The steps of the method disclosed in the embodiments of the present application may be performed directly by a hardware decoding processor, or by a combination of hardware and software modules in a decoding processor.

The software modules may be located in a storage medium in the memory; the processor reads the program in the memory and, in combination with its hardware, performs the steps of the method described above. When the processor executes the program, the corresponding processes of the methods of the embodiments of the present application are implemented; for brevity, they are not described again here.

Example 4

The invention also proposes a readable storage medium on which a computer program is stored, which, when executed by a processor, implements the method steps of:

Because RabbitMQ messages are continuously produced and consumed, the data changes in real time. To reduce the number of times data is written to the centralized storage, a cache is introduced in the data read-write control part, and the data in the cache is written into the centralized storage after a certain delay.

RabbitMQ ordinarily uses a local disk to store data. In this method, the RabbitMQ service no longer stores its data on the local disk and instead uses data cached in memory, which is read from the centralized storage.

The storage medium provided in embodiment 4 of the present invention ensures data consistency by configuring all nodes of a cluster to use the same centralized storage, thereby solving the problem of data security. The centralized storage can ensure that the data used by each node is consistent, and the multiple nodes using the same centralized storage can ensure high availability and throughput of cluster services and data security.

In the storage medium provided in embodiment 4 of the present invention, the RabbitMQ service does not use local storage but caches data in memory, and data is no longer synchronized among the nodes of the cluster, so that the security of cluster data is ensured while the high availability of the cluster is retained and without significantly reducing cluster throughput.

In the storage medium provided in embodiment 4 of the present invention, after the data is stored incrementally, the data read-write control part periodically merges the data and writes it into the centralized storage. Because the merged data is written after a delay, the number of writes to the storage is reduced.

Embodiments of the present application further provide a storage medium, that is, a computer storage medium, specifically, a computer-readable storage medium, for example, a memory storing a computer program, where the computer program is executable by a processor to perform the steps of the foregoing method. The computer readable storage medium may be Memory such as FRAM, ROM, PROM, EPROM, EEPROM, Flash Memory, magnetic surface Memory, optical disk, or CD-ROM.

Those of ordinary skill in the art will understand that all or part of the steps of the method embodiments may be implemented by program instructions directing the relevant hardware. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the method embodiments. Alternatively, the integrated units described above in the present application may be stored in a computer-readable storage medium if they are implemented in the form of software functional modules and sold or used as independent products. Based on such understanding, the technical solutions of the embodiments of the present application, or the portions thereof that contribute over the prior art, may be embodied in the form of a software product, which is stored in a storage medium and includes several instructions for enabling an electronic device (which may be a personal computer, a server, or a network device) to execute all or part of the methods described in the embodiments of the present application. The aforementioned storage medium includes: a removable storage device, a ROM, a RAM, a magnetic or optical disk, or various other media that can store program code.

For a description of a relevant part in a processing device and a storage medium for building a cluster based on centralized storage provided in an embodiment of the present application, reference may be made to a detailed description of a corresponding part in a method for building a cluster based on centralized storage provided in embodiment 1 of the present application, and details are not described here again.

It is noted that, herein, relational terms such as first and second may be used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Furthermore, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element. In addition, parts of the above technical solutions provided in the embodiments of the present application that are consistent with the implementation principles of corresponding technical solutions in the prior art are not described in detail, so as to avoid redundant description.

Although the embodiments of the present invention have been described with reference to the accompanying drawings, the scope of the present invention is not limited thereto. Various modifications and alterations will occur to those skilled in the art based on the foregoing description, and it is neither necessary nor possible to list all embodiments exhaustively here. Modifications or changes that a person skilled in the art can make on the basis of the technical solution of the invention without creative effort remain within the protection scope of the invention.

Claims

1. A cluster building method based on centralized storage is characterized by comprising the following steps:

2. The method for building the cluster based on the centralized storage according to claim 1, wherein the method further comprises deleting the data directly from the centralized storage when the nodes in the cluster need to delete the data.

3. The method for building the cluster based on the centralized storage according to claim 1, wherein the data of each node in the cluster is located in a first cache, and the data of each node in the cluster adopts memory cache data.

4. The method for building the cluster based on the centralized storage is characterized in that global control is added in the process of controlling the data of the nodes to be written into the memory, so that only one node can be written in one time, and the next node can be written in after the current node is written in.

5. The method for building a cluster based on centralized storage according to claim 1, wherein the method for merging written data for centralized storage comprises:

6. The method for building a cluster based on centralized storage according to claim 1, wherein after the data is stored incrementally, the data read-write control periodically merges the data and then writes it into the centralized storage.

7. A system for building a cluster based on centralized storage, comprising: a write module and a storage module;

the storage module is used for adding a second cache for data read-write control when carrying out centralized storage; the cache adopts a local disk for storing incremental data.

8. The system for building a cluster based on centralized storage according to claim 7, further comprising a deletion module;

9. An apparatus, comprising:

a memory for storing a computer program;

a processor for implementing the method steps of any one of claims 1 to 6 when executing the computer program.

10. A readable storage medium, characterized in that the readable storage medium has stored thereon a computer program which, when being executed by a processor, carries out the method steps of any one of claims 1 to 6.