WO2023000534A1 - Communication method and apparatus between cluster nodes

Info

Publication number: WO2023000534A1
Application number: PCT/CN2021/127507
Authority: WO
Inventors: 李宏伟; 颜秉珩
Original assignee: 苏州浪潮智能科技有限公司
Priority date: 2021-07-20
Filing date: 2021-10-29
Publication date: 2023-01-26
Also published as: CN113676515A

Abstract

The present application discloses a communication method and apparatus between cluster nodes. The method comprises: dividing a message communication area for each cluster node on a shared disk of a cluster, a plurality of channels corresponding to each cluster node, and a plurality of buffers corresponding to the depths of the channels; switching the communication mode of a first cluster node from a socket communication mode to a network communication mode, and continuously monitoring the message communication area of the first cluster node on the shared disk; writing first distributed lock manager information into the channel corresponding to the first cluster node in the message communication area of a second cluster node on the shared disk; and invoking an information processing function to process second distributed lock manager information, popping up the buffer where the second distributed lock manager information is located, and sending a message reply to the second cluster node. The present application can avoid the phenomenon of contention of DLM communication, reduce the read-write overhead and delay of data in a large-scale cluster, and improve the practicability and availability of a shared disk communication system in the cluster.

Description

A communication method and device between cluster nodes

This application claims priority to Chinese patent application No. 202110820023.7, filed with the China Patent Office on July 20, 2021 and entitled "A communication method and device between cluster nodes", the entire content of which is incorporated herein by reference.

Technical field

The present application relates to the field of network transmission, and more specifically, to a communication method and device among cluster nodes.

Background art

In the field of server virtualization, a cluster file system can be mounted and shared by multiple servers at the same time, so it is often used as a bridge between multiple computing nodes and centralized storage. A cluster file system provides concurrent file access control, integrity assurance, high availability, and redundancy, and is used by virtualization systems to store virtual machine images and to share storage pools. The distributed lock manager (DLM) is a key component of a cluster file system and manages concurrent access to shared resources; it mainly solves the problem of disk cache consistency between cluster nodes and improves the efficiency of shared file access. Common cluster file systems such as GFS, VMFS, OpenVMS Files, and ocfs2 each implement their own DLM.

While working, the DLM relies on the network for inter-node communication to synchronize lock information, including operations such as lock information query, remote lock acquisition, and lock downgrade. The reliability of the network therefore directly affects the efficiency and stability of the DLM. In a common DLM implementation, persistent socket connections are established between the nodes of the cluster on a specified port, and lock messages are encapsulated and exchanged over TCP/IP. However, network fluctuations and delays affect the transmission of DLM messages, directly affect the operation of the cluster file system, and can even trigger the protection mechanism (fence) of the file system, paralyzing some nodes in the cluster. In server virtualization scenarios the reliability of the TCP/IP network is low, so this design greatly reduces the overall reliability of the system.

To address these problems, the prior art provides a DLM implementation method based on shared disk parallel communication (publication number 109376014B), so that the operation of the cluster file system does not depend on the TCP/IP network, which greatly improves the reliability and high availability of the system. In that scheme, however, when multiple nodes send messages to the faulty node, they rely on the disk paxos algorithm to compete for the message sending area. The execution of the disk paxos algorithm is itself relatively complicated and time-consuming: it goes through steps such as multiple nodes initiating proposals, waiting for proposals to be received, and random delays to avoid conflicts after a failed election, which significantly increases IO (read and write) overhead and delay. In large-scale clusters in particular, the collision probability of disk paxos increases, which further increases IO overhead and delay, degrades the performance of the cluster, and limits its scale.

For the problem that the contention mechanism of inter-node DLM communication in the prior art causes high data read and write overhead and long delays in large-scale clusters, there is currently no effective solution.

Summary of the invention

In view of this, the purpose of the embodiments of the present application is to propose a communication method and device between cluster nodes that can avoid contention in DLM communication, reduce the read and write overhead and delay of data in large-scale clusters, and improve the practicability and availability of the shared disk communication system in the cluster.

Based on the above purpose, the first aspect of the embodiment of the present application provides a communication method between cluster nodes, including performing the following steps:

Dividing a message communication area for each cluster node on the shared disk of the cluster, dividing each message communication area into a plurality of channels corresponding to each cluster node, and generating in each channel a plurality of buffers corresponding to the channel depth;

In response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, switching the communication mode of the first cluster node from the socket communication mode to the network communication mode, and continuously monitoring the message communication area of the first cluster node on the shared disk;

In response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, writing the first distributed lock manager information into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk;

In response to the first cluster node detecting that second distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, calling the information processing function to process the second distributed lock manager information, popping the buffer where the second distributed lock manager information is located, and sending a message reply to the second cluster node.

In some implementations, after the first distributed lock manager information is written into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the first cluster node continues to monitor the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk.

In some implementations, in response to the second cluster node detecting that the first distributed lock manager information has been received in the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the second cluster node calls the information processing function to process the first distributed lock manager information, pops the buffer where the first distributed lock manager information is located, and writes a first message reply to the first distributed lock manager information into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk.

In some implementations, in response to the first cluster node detecting that the first message reply to the first distributed lock manager information has been received in the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the first cluster node reports that processing of the first distributed lock manager information is complete and stops listening to the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk.

In some implementations, in response to the second cluster node detecting that the first distributed lock manager information has been received in the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the second cluster node calls the information processing function to process the first distributed lock manager information, pops the buffer where the first distributed lock manager information is located, and writes a first message reply to the first distributed lock manager information into the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk.

In some implementations, in response to the first cluster node detecting that the first message reply to the first distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, the first cluster node reports that processing of the first distributed lock manager information is complete.

In some implementations, in response to the communication mode of the first cluster node being switched from the network communication mode back to the socket communication mode, the first cluster node stops listening to any area on the shared disk, wherein the listening includes periodic polling of the shared disk.

In some implementations, generating a plurality of buffers corresponding to the channel depth in each channel includes: acquiring a predetermined channel depth based on the required concurrent message processing capability, and generating in each channel a number of buffers positively related to the channel depth, wherein each buffer is configured with disk space for storing one piece of first distributed lock manager information, one piece of second distributed lock manager information, or one message reply.

In some implementations, in response to a third cluster node being added to the cluster, a message communication area for the third cluster node is divided on the shared disk, the message communication area of the third cluster node is divided into a plurality of channels corresponding to each cluster node, and a plurality of buffers corresponding to the channel depth are generated in each channel; at the same time, a channel corresponding to the third cluster node is divided in the message communication area of each existing cluster node.

The second aspect of the embodiments of the present application provides a communication device between cluster nodes, including:

A processor;

A controller storing program code executable by the processor, wherein the processor performs the following steps when running the program code:

The present application has the following beneficial technical effects. In the communication method and device between cluster nodes provided by the embodiments of the present application, a message communication area is divided for each cluster node on the shared disk of the cluster, each message communication area is divided into a plurality of channels corresponding to each cluster node, and a plurality of buffers corresponding to the channel depth are generated in each channel; in response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, the communication mode of the first cluster node is switched from the socket communication mode to the network communication mode, and the message communication area of the first cluster node on the shared disk is continuously monitored; in response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, the first distributed lock manager information is written into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk; and in response to the first cluster node detecting that second distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, the information processing function is called to process the second distributed lock manager information, the buffer where the second distributed lock manager information is located is popped, and a message reply is sent to the second cluster node. This technical solution can avoid contention in DLM communication, reduce the read and write overhead and delay of data in large-scale clusters, and improve the practicability and availability of the shared disk communication system in the cluster.

Description of drawings

In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present application, and those skilled in the art can obtain other drawings from them without creative work.

FIG. 1 is a schematic flow diagram of a communication method between cluster nodes provided by the present application;

FIG. 2 is a space division diagram of the shared disk in the communication method between cluster nodes provided by the present application;

FIG. 3 is a communication flowchart of the communication method between cluster nodes provided by the present application.

Detailed description

In order to make the purpose, technical solution and advantages of the present application clearer, the embodiments of the present application will be further described in detail below in combination with specific embodiments and with reference to the accompanying drawings.

It should be noted that all uses of "first" and "second" in the embodiments of this application serve to distinguish two entities or parameters that share a name but are not the same. "First" and "second" are used only for convenience of expression and should not be construed as limiting the embodiments of the present application; this will not be repeated in the subsequent embodiments.

Based on the above purpose, the first aspect of the embodiments of the present application proposes an embodiment of a communication method between cluster nodes that avoids contention in DLM communication, reduces the read and write overhead and delay of data in large-scale clusters, and improves the practicability and availability of the shared disk communication system in the cluster. FIG. 1 shows a schematic flowchart of the communication method between cluster nodes provided by the present application.

The communication method between cluster nodes, as shown in Figure 1, includes the following steps:

Step S101: divide a message communication area for each cluster node on the shared disk of the cluster, divide each message communication area into a plurality of channels corresponding to each cluster node, and generate in each channel a plurality of buffers corresponding to the channel depth;

Step S103: in response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, switch the communication mode of the first cluster node from the socket communication mode to the network communication mode, and continuously monitor the message communication area of the first cluster node on the shared disk;

Step S105: in response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, write the first distributed lock manager information into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk;

Step S107: in response to the first cluster node detecting that second distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, call the information processing function to process the second distributed lock manager information, pop the buffer where the second distributed lock manager information is located, and send a message reply to the second cluster node.

Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by a computer program instructing relevant hardware. The program can be stored in a computer-readable storage medium, and when the program is executed, it may include the processes of the embodiments of the above methods. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), or the like. The computer program embodiments can achieve the same or similar effects as any of the corresponding foregoing method embodiments.

The various illustrative logical blocks, modules, circuits, and algorithm steps described in connection with the disclosure herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described generally in terms of their functionality. Whether such functionality is implemented as software or as hardware depends upon the particular application and design constraints imposed on the overall system. Those skilled in the art can implement the described functions in various ways for each specific application, but such implementation decisions should not be interpreted as causing a departure from the scope disclosed in the embodiments of the present application.

The specific implementation of the present application is further described below with reference to the specific embodiments shown in FIGS. 2 and 3.

This solution presents a distributed lock manager implementation method based on shared disk multi-channel communication. First, an area is specified on the shared disk, and an address space is reserved in it for each node in the cluster as that node's message communication area. Each communication area is composed of as many channels as the maximum number of nodes supported by the cluster (denoted M), and each channel is composed of buffers with a depth of N. Second, when node A senses that its network connection with node B is disconnected, it switches from the socket communication mode to the network communication mode and writes its DLM messages to channel A in the communication area of node B. Because the socket connection is bidirectional, B also senses that its connection with A is disconnected, so B listens to channel A in its own message area. The listening is implemented by polling. When B detects a valid message, it processes the message and writes the reply message back to the same area. Similarly, when B wants to send a message to A, it writes the message into channel B in the message area of A, thus realizing two-way message communication. In addition, to reduce the IO pressure of disk polling, a node adds the channel corresponding to another node to its polling channel list only after it senses that it is disconnected from that node, avoiding unnecessary IO overhead.

The DLM scheme based on shared disk multi-channel communication optimizes the selection of the communication area in disk communication. By introducing a multi-channel message mechanism, it avoids contention for the message sending area, greatly improves the efficiency of the disk communication scheme, reduces the IO pressure on the disk, and further broadens the application range and practicability of the disk communication scheme. This solution is applicable to IP-SAN storage and FC-SAN storage.

Specifically, an area is first specified on the shared disk, and an address space is reserved in it for each node in the cluster as that node's message communication area. Each communication area is composed of as many channels as the maximum number of nodes supported by the cluster (denoted M), and each channel is composed of slots (buffers) with a depth of N. In the communication space disk layout example, the cluster consists of 5 nodes, so communication areas for 5 nodes are reserved in the formatting stage; each communication area consists of M channels, and each channel consists of slots (buffers) with a depth of N. This division is shown in detail in Figure 2. It should be noted that the diagonal area is not used, since nodes do not send messages to themselves. In addition, the cluster file system supports dynamically adding nodes to the cluster; when a node is added, a message communication area for that node is added correspondingly.
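The layout described above implies that the disk address of any buffer follows from three indices: the receiving node, the sending node, and the slot. The following is a minimal Python sketch of that address calculation, assuming zero-based node indices, a fixed slot size in bytes, and a contiguous region starting at a base offset. The names `Layout`, `base`, and `slot_size` are illustrative assumptions and are not taken from the application.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layout:
    base: int       # assumed byte offset where the communication region starts
    max_nodes: int  # M: maximum number of nodes supported by the cluster
    depth: int      # N: number of slots (buffers) per channel
    slot_size: int  # assumed size of one slot in bytes

    @property
    def channel_size(self):
        return self.depth * self.slot_size

    @property
    def area_size(self):
        return self.max_nodes * self.channel_size

    def slot_offset(self, receiver, sender, slot):
        """Byte offset of [node receiver, chan sender, slot]."""
        if receiver == sender:
            raise ValueError("diagonal channel is unused: a node does not message itself")
        if not (0 <= receiver < self.max_nodes and 0 <= sender < self.max_nodes):
            raise ValueError("node index out of range")
        if not 0 <= slot < self.depth:
            raise ValueError("slot index out of range")
        return (self.base
                + receiver * self.area_size
                + sender * self.channel_size
                + slot * self.slot_size)


# Hypothetical 5-node cluster with 4 slots of 512 bytes per channel.
layout = Layout(base=0, max_nodes=5, depth=4, slot_size=512)
offset = layout.slot_offset(receiver=1, sender=3, slot=2)
```

Because each sender owns a distinct channel inside the receiver's area, two senders never compute the same offset, which is why no election for the sending area is needed.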

When node A senses that its network connection with node B is disconnected, it switches from the socket communication mode to the network communication mode and writes its DLM messages to channel A in the communication area of node B. Because the socket connection is bidirectional, B also senses that its connection with A is disconnected, so B listens to channel A in its own message area. The listening is implemented by polling. When B detects a valid message, it processes the message and writes the reply message back to the same area. Similarly, when B wants to send a message to A, it writes the message into channel B in the message area of A, thus realizing two-way message communication. In addition, to reduce the IO pressure of disk polling, a node adds the channel corresponding to another node to its polling channel list only after it senses that it is disconnected from that node, avoiding unnecessary IO overhead.
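The per-peer mode switch and the on-demand polling list can be sketched as follows. This is a simplified Python illustration, not the application's implementation: the class `NodeComm` and its method names are hypothetical, the mode that the text calls the network communication mode is labelled `"disk"` here, and socket events are assumed to arrive as plain method calls.

```python
class NodeComm:
    """Tracks, per peer, whether messages go over the socket or the shared disk."""

    def __init__(self, node_id):
        self.node_id = node_id
        self.disk_peers = set()  # peers whose socket connection is currently down

    def on_socket_down(self, peer):
        # Switch to disk communication for this peer only.
        self.disk_peers.add(peer)

    def on_socket_up(self, peer):
        # Back on the socket: stop polling the disk for this peer.
        self.disk_peers.discard(peer)

    def mode_for(self, peer):
        return "disk" if peer in self.disk_peers else "socket"

    def receive_poll_list(self):
        # (area owner, channel) pairs this node polls for incoming messages.
        return sorted((self.node_id, peer) for peer in self.disk_peers)

    def send_target(self, peer):
        # The channel named after the sender, inside the receiver's area.
        return (peer, self.node_id)


a, b = NodeComm("A"), NodeComm("B")
a.on_socket_down("B")  # both ends observe the broken connection
b.on_socket_down("A")
```

In this sketch the channel that A writes to, `a.send_target("B")`, is exactly the entry that B has added to its polling list, and a node with no broken connections polls nothing.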

Again taking Figure 2 as an example, when node 3 sends data to node 1, the message is placed in channel 3 of the communication area of node 1, which is written as [node 1, chan 3, slot x]. Here x is the index of the buffer within the message channel; the message sender and receiver each increment it according to an agreed rule, which allows messages to be sent and received concurrently.
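The text does not spell out the agreed rule for incrementing x. One plausible reading, shown here purely as an assumption, is that sender and receiver each keep a counter for the channel and use it modulo the channel depth N, so that both sides visit the slots in the same order. The class name `SlotCursor` is hypothetical.

```python
class SlotCursor:
    """Per-channel slot index that advances by one and wraps at the channel depth."""

    def __init__(self, depth):
        if depth <= 0:
            raise ValueError("channel depth must be positive")
        self.depth = depth
        self.count = 0  # number of slots handed out so far

    def next_slot(self):
        slot = self.count % self.depth  # wrap-around is an assumption of this sketch
        self.count += 1
        return slot


# Sender and receiver keep independent cursors for the same channel.
sender_cursor = SlotCursor(depth=3)
receiver_cursor = SlotCursor(depth=3)
sent = [sender_cursor.next_slot() for _ in range(5)]
received = [receiver_cursor.next_slot() for _ in range(5)]
```

With a depth of N, up to N messages can be outstanding in one channel before a slot has to be reused, which matches the idea that the depth is chosen from the required concurrent message processing capability.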

Referring to the numbers in Figure 3, when node 3 sends a message to node 1, it mainly includes the following steps:

(1) Node 3 writes the DLM message to [node 1, chan 3, slot x];

(2) Node 3 adds chan 3 to the message sending monitoring list and periodically polls it to check for the message reply;

(3) Because node 1 has received the event that its socket connection with node 3 is disconnected, it puts its message receiving channel for node 3, [node 1, chan 3, slot x], into the message receiving monitoring list and periodically polls it for messages sent by the other node;

(4) Node 1 receives the message from node 3 by polling;

(5) Node 1 calls the information processing function to process the message;

(6) Node 1 finishes processing the message and writes an ACK message back to the message channel [node 1, chan 3, slot x], indicating that the message has been processed;

(7) Node 3 receives the message reply from node 1 by polling and removes [node 1, chan 3, slot x] from the message sending monitoring list, completing one message sending process.

Similarly, when node 1 sends a message to node 3, it will select the channel [node 3, chan 1, slot y] for message communication, and the process is the same as above.
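The seven steps can be walked through with a small in-memory model. The sketch below is illustrative only: a Python dictionary stands in for the shared disk, the slot states `EMPTY`, `REQUEST`, and `ACK` and all function names are assumptions, and clearing the slot when the sender sees the ACK is a simplification chosen for this example.

```python
EMPTY, REQUEST, ACK = "empty", "request", "ack"


class SharedDisk:
    """In-memory stand-in for the shared disk, keyed by (receiver, sender, slot)."""

    def __init__(self):
        self.slots = {}

    def read(self, key):
        return self.slots.get(key, (EMPTY, None))

    def write(self, key, state, payload=None):
        self.slots[key] = (state, payload)


def send(disk, sender, receiver, slot, payload, watch_list):
    key = (receiver, sender, slot)
    disk.write(key, REQUEST, payload)  # step (1): write the DLM message
    watch_list.add(key)                # step (2): watch the slot for the reply
    return key


def poll_receiver(disk, receive_list, handler):
    handled = []
    for key in sorted(receive_list):
        state, payload = disk.read(key)
        if state == REQUEST:                  # step (4): message found by polling
            handled.append(handler(payload))  # step (5): process the message
            disk.write(key, ACK)              # step (6): write the ACK back
    return handled


def poll_sender(disk, watch_list):
    done = [key for key in sorted(watch_list) if disk.read(key)[0] == ACK]
    for key in done:
        disk.write(key, EMPTY)   # free the slot (assumption of this sketch)
        watch_list.discard(key)  # step (7): stop watching the slot
    return done


disk = SharedDisk()
node3_watch = set()
node1_receive = {(1, 3, 0)}  # step (3): added after the disconnect event
key = send(disk, sender=3, receiver=1, slot=0, payload="lock-query", watch_list=node3_watch)
handled = poll_receiver(disk, node1_receive, str.upper)
completed = poll_sender(disk, node3_watch)
```

The reverse direction would use the key `(3, 1, y)` in the same way, so the two directions never touch the same slot.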

It should be noted, however, that in this approach node 3 must poll [node 3, chan x, slot x] to receive messages that other nodes may send during normal operation, and must additionally poll [node 1, chan 3, slot x]. In another embodiment, when node 1 finishes processing the message, it does not write the ACK message back to the message channel [node 1, chan 3, slot x] but instead writes it to [node 3, chan 1, slot x]. Node 3 then only ever needs to poll [node 3, chan x, slot x], which further reduces the polling pressure that node 3 places on the disk while still avoiding contention in DLM communication.
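The difference between the two reply placements can be made concrete by listing what a node has to poll in each case. The following Python sketch is a simplified model under stated assumptions: channels are identified by (area owner, channel) pairs, slots are ignored, and the function name `poll_targets` and its parameters are hypothetical.

```python
def poll_targets(node, disconnected_peers, pending_peers, ack_in_sender_area):
    """Return the (area owner, channel) pairs that `node` must poll.

    disconnected_peers: peers reached through the shared disk
    pending_peers: peers to which `node` has sent a message awaiting a reply
    ack_in_sender_area: True if replies are written into the sender's own area
    """
    # Incoming messages always arrive in the node's own area.
    targets = {(node, peer) for peer in disconnected_peers}
    if not ack_in_sender_area:
        # Replies come back in the channel the request was written to,
        # which lies inside the receiver's area.
        targets |= {(peer, node) for peer in pending_peers}
    return targets


same_channel = poll_targets(3, {1}, {1}, ack_in_sender_area=False)
own_area = poll_targets(3, {1}, {1}, ack_in_sender_area=True)
```

In the second variant every channel node 3 polls lies inside its own communication area, so the polling set does not grow with the number of outstanding requests to other nodes.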

It should be added that, although this application focuses on avoiding contention in DLM communication, the technical solution of this application is not limited to DLM data. It can be applied to any data transmission that aims to avoid communication contention, with the same or similar technical effects.

In addition, the method disclosed according to the embodiment of the present application may also be implemented as a computer program executed by a CPU, and the computer program may be stored in a computer-readable storage medium. When the computer program is executed by the CPU, the above functions defined in the methods disclosed in the embodiments of the present application are executed. The above-mentioned method steps and system units can also be implemented by using a controller and a computer-readable storage medium for storing a computer program that enables the controller to realize the functions of the above-mentioned steps or units.

It can be seen from the above embodiments that, in the communication method between cluster nodes provided by the embodiments of the present application, a message communication area is divided for each cluster node on the shared disk of the cluster, each message communication area is divided into a plurality of channels corresponding to each cluster node, and a plurality of buffers corresponding to the channel depth are generated in each channel; in response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, the communication mode of the first cluster node is switched from the socket communication mode to the network communication mode, and the message communication area of the first cluster node on the shared disk is continuously monitored; in response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, the first distributed lock manager information is written into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk; and in response to the first cluster node detecting that second distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, the information processing function is called to process the second distributed lock manager information, the buffer where the second distributed lock manager information is located is popped, and a message reply is sent to the second cluster node. This technical solution can avoid contention in DLM communication, reduce the read and write overhead and delay of data in large-scale clusters, and improve the practicability and availability of the shared disk communication system in the cluster.

It should be pointed out that the steps in the embodiments of the communication method between cluster nodes can be interleaved, replaced, added, or deleted. These reasonable permutations and combinations of the communication method between cluster nodes therefore also fall within the protection scope of the present application, and the protection scope of the present application should not be limited to the described embodiments.

Based on the above purpose, the second aspect of the embodiments of the present application proposes an embodiment of a communication device between cluster nodes that avoids contention in DLM communication, reduces the read and write overhead and delay of data in large-scale clusters, and improves the practicability and availability of the shared disk communication system in the cluster. The device includes:

processor;

The devices and equipment disclosed in the embodiments of this application can be various electronic terminal devices, such as mobile phones, personal digital assistants (PDA), tablet computers (PAD), and smart TVs, or large terminal equipment such as servers. Therefore, the scope of protection disclosed in the embodiments of the present application should not be limited to a specific type of device or equipment. The client disclosed in the embodiments of the present application may be applied to any of the above-mentioned electronic terminal devices in the form of electronic hardware, computer software, or a combination of the two.

It can be seen from the above embodiments that, with the communication device between cluster nodes provided by the embodiments of the present application, a message communication area is divided for each cluster node on the shared disk of the cluster, each message communication area is divided into a plurality of channels corresponding to each cluster node, and a plurality of buffers corresponding to the channel depth are generated in each channel; in response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, the communication mode of the first cluster node is switched from the socket communication mode to the network communication mode, and the message communication area of the first cluster node on the shared disk is continuously monitored; in response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, the first distributed lock manager information is written into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk; and in response to the first cluster node detecting that second distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, the information processing function is called to process the second distributed lock manager information, the buffer where the second distributed lock manager information is located is popped, and a message reply is sent to the second cluster node. This technical solution can avoid contention in DLM communication, reduce the read and write overhead and delay of data in large-scale clusters, and improve the practicability and availability of the shared disk communication system in the cluster.

It should be pointed out that the above device embodiment uses the embodiments of the communication method between cluster nodes to specifically illustrate the working process of each module, and those skilled in the art can readily conceive of applying these modules to other embodiments of the communication method between cluster nodes. Of course, since the steps in the embodiments of the communication method between cluster nodes can be interleaved, replaced, added, or deleted, such reasonable permutations and combinations of the device also fall within the protection scope of the present application, and the protection scope of the present application should not be limited to the described embodiments.

Finally, it should be noted that those skilled in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium, and when executed it may include the processes of the above method embodiments. The storage medium may be a magnetic disk, an optical disk, a read-only memory (ROM), a random access memory (RAM), or the like. The computer program embodiments can achieve the same or similar effects as any of the corresponding foregoing method embodiments.

The above are the exemplary embodiments disclosed in the present application, but it should be noted that various changes and modifications can be made without departing from the scope of the embodiments disclosed in the present application as defined by the claims. The functions, steps and/or actions of the method claims in accordance with the disclosed embodiments described herein need not be performed in any particular order. In addition, although elements disclosed in the embodiments of the present application may be described or claimed in the singular, they may also be understood as plural unless explicitly limited to the singular.

Those of ordinary skill in the art should understand that the discussion of any of the above embodiments is exemplary only and is not intended to imply that the scope disclosed by the embodiments of the present application (including the claims) is limited to these examples. Under the idea of the embodiments of the present application, the technical features in the above embodiments or in different embodiments can also be combined, and there are many other variations of the different aspects of the embodiments of the present application as described above, which are not provided in detail for the sake of brevity. Therefore, any omissions, modifications, equivalent replacements, improvements, and the like made within the spirit and principle of the embodiments of the present application shall be included in the protection scope of the embodiments of the present application.

Claims

A communication method between cluster nodes, characterized in that it comprises the following steps:

dividing a message communication area for each cluster node on the shared disk of the cluster, dividing, in each of the message communication areas, a plurality of channels corresponding to each of the cluster nodes, and generating, in each of the channels, a plurality of buffers corresponding to the channel depth;

in response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, switching the communication mode of the first cluster node from the socket communication mode to the network communication mode, and continuously monitoring the message communication area of the first cluster node on the shared disk;

in response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, writing the first distributed lock manager information into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk; and

in response to the first cluster node receiving second distributed lock manager information in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, calling an information processing function to process the second distributed lock manager information, popping the buffer in which the second distributed lock manager information is located, and sending a message reply to the second cluster node.
The method according to claim 1, wherein, after the first distributed lock manager information is written into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk is continuously monitored.
The method according to claim 2, wherein, in response to the second cluster node detecting, by monitoring, that the first distributed lock manager information has been received in the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the information processing function is called to process the first distributed lock manager information, the buffer in which the first distributed lock manager information is located is popped, and a first message reply for the first distributed lock manager information is written into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk.
The method according to claim 3, wherein, in response to the first cluster node detecting, by monitoring, that the first message reply for the first distributed lock manager information has been received in the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the first cluster node feeds back that processing of the first distributed lock manager information is complete and stops monitoring the channel corresponding to the first cluster node in the message communication area of the second cluster node.
The method according to claim 1, wherein, in response to the second cluster node detecting, by monitoring, that the first distributed lock manager information has been received in the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk, the information processing function is called to process the first distributed lock manager information, the buffer in which the first distributed lock manager information is located is popped, and a first message reply for the first distributed lock manager information is written into the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk.
The method according to claim 5, wherein, in response to the first cluster node detecting, by monitoring, that the first message reply for the first distributed lock manager information has been received in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, the first cluster node feeds back that processing of the first distributed lock manager information is complete.
The method according to claim 1, wherein, in response to the communication mode of the first cluster node being switched from the network communication mode to the socket communication mode, monitoring by the first cluster node of any area on the shared disk is stopped; wherein the monitoring includes periodic polling of the shared disk.
The method according to claim 1, wherein generating a plurality of buffers corresponding to the channel depth in each of the channels comprises: obtaining the channel depth predetermined based on the required concurrent message processing capability, and generating, in each of the channels, a number of buffers positively correlated with the channel depth, wherein each of the buffers is configured as disk space for storing one piece of the first distributed lock manager information, one piece of the second distributed lock manager information, or one message reply.
The method according to claim 1, wherein, in response to a third cluster node being added to the cluster, a message communication area is divided for the third cluster node on the shared disk, a plurality of channels corresponding to each of the cluster nodes are divided in the message communication area of the third cluster node, and a plurality of buffers corresponding to the channel depth are generated in each of the channels; and a channel corresponding to the third cluster node is divided in the message communication area of each of the cluster nodes.
A communication device between cluster nodes, characterized in that it includes:

a processor; and

a controller storing program code executable by the processor, wherein the processor performs the following steps when running the program code:

dividing a message communication area for each cluster node on the shared disk of the cluster, dividing, in each of the message communication areas, a plurality of channels corresponding to each of the cluster nodes, and generating, in each of the channels, a plurality of buffers corresponding to the channel depth;

in response to a first cluster node of the cluster detecting that the socket connection between the first cluster node and a second cluster node in the same cluster is interrupted, switching the communication mode of the first cluster node from the socket communication mode to the network communication mode, and continuously monitoring the message communication area of the first cluster node on the shared disk;

in response to the first cluster node sending first distributed lock manager information to the second cluster node in the network communication mode, writing the first distributed lock manager information into the channel corresponding to the first cluster node in the message communication area of the second cluster node on the shared disk; and

in response to the first cluster node receiving second distributed lock manager information in the channel corresponding to the second cluster node in the message communication area of the first cluster node on the shared disk, calling an information processing function to process the second distributed lock manager information, popping the buffer in which the second distributed lock manager information is located, and sending a message reply to the second cluster node.
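
For illustration only, the mode switching and periodic polling recited in the claims above can be sketched as follows. This is a minimal sketch under stated assumptions and does not limit the claims: the class `NodeComm`, its method names, and the `poll_fn` callback (standing in for a read of the node's message communication area on the shared disk) are hypothetical, and each polling period is triggered manually rather than by a timer.

```python
import enum


class Mode(enum.Enum):
    SOCKET = "socket"
    NETWORK = "network"  # mode name as used in the claims


class NodeComm:
    """Hypothetical per-node state: current mode plus a polling callback."""

    def __init__(self, poll_fn):
        self.mode = Mode.SOCKET
        self.poll_fn = poll_fn  # assumed to return the pending messages
        self.handled = []

    def on_socket_interrupted(self):
        # Socket connection to the peer was lost: start using the shared disk.
        self.mode = Mode.NETWORK

    def switch_to_socket(self):
        # Back in socket mode: monitoring of the shared disk stops.
        self.mode = Mode.SOCKET

    def tick(self):
        """One polling period; the shared disk is read only in network mode."""
        if self.mode is not Mode.NETWORK:
            return 0
        messages = self.poll_fn()
        self.handled.extend(messages)
        return len(messages)


# A plain list stands in for the node's message communication area.
pending = [b"dlm-1", b"dlm-2"]


def drain():
    messages = list(pending)
    pending.clear()
    return messages


node = NodeComm(drain)
polled_in_socket_mode = node.tick()
node.on_socket_interrupted()
polled_in_network_mode = node.tick()
node.switch_to_socket()
pending.append(b"dlm-3")
polled_after_switch_back = node.tick()
```

In this sketch the message left in the list after switching back is never read, reflecting that monitoring of the shared disk stops once the node returns to the socket communication mode.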