WO2023124663A1 - Procédé de mise en œuvre de quorum de grappe, dispositif de réseau et système - Google Patents

Procédé de mise en œuvre de quorum de grappe, dispositif de réseau et système Download PDF

Info

Publication number
WO2023124663A1
WO2023124663A1 PCT/CN2022/133885 CN2022133885W WO2023124663A1 WO 2023124663 A1 WO2023124663 A1 WO 2023124663A1 CN 2022133885 W CN2022133885 W CN 2022133885W WO 2023124663 A1 WO2023124663 A1 WO 2023124663A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
cluster
network device
master
master node
Prior art date
Application number
PCT/CN2022/133885
Other languages
English (en)
Chinese (zh)
Inventor
张翠敏
王巧灵
张君逸
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2023124663A1 publication Critical patent/WO2023124663A1/fr

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W84/00Network topologies
    • H04W84/02Hierarchically pre-organised networks, e.g. paging networks, cellular networks, WLAN [Wireless Local Area Network] or WLL [Wireless Local Loop]
    • H04W84/04Large scale networks; Deep hierarchical networks
    • H04W84/08Trunked mobile radio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W24/00Supervisory, monitoring or testing arrangements
    • H04W24/04Arrangements for maintaining operational condition

Definitions

  • the present application relates to the computer field, in particular to a cluster arbitration method, network equipment and system.
  • a distributed cluster system includes multiple cluster nodes. Among them, distributed refers to the distribution of important operating information of the cluster on multiple nodes. When one or more cluster nodes in the distributed cluster system fail, the distributed cluster system can automatically adapt, and the normal cluster Nodes are responsible for business, improving the reliability and availability of the distributed cluster system as a whole.
  • a distributed cluster system needs to ensure data consistency among cluster nodes.
  • consensus algorithms such as Raft and Paxos can be used to achieve data consistency between cluster nodes.
  • This type of consensus algorithm needs to determine a master node among the cluster nodes of the distributed cluster system.
  • the master node can coordinate and manage other cluster nodes and is responsible for data synchronization.
  • the master node can be negotiated and determined by the cluster nodes in the cluster, or it can be arbitrated by a third party that all cluster nodes can access.
  • the method of determining the master node by a third party is also called quorum.
  • the current quorum-based master selection algorithm needs to additionally set up a shared storage space that each cluster node can access in the distributed cluster system. Shared storage space is third party.
  • the shared storage space may be, for example, devices such as servers and hard disks.
  • the cluster nodes participating in the master election send a master grab request to the shared storage space.
  • the shared storage space determines the master node according to certain arbitration rules.
  • the embodiment of the present application provides a cluster arbitration method, network equipment and system, which can use the network equipment in the cluster to determine the master node of the cluster, without additionally setting up other shared storage spaces in the cluster, and can ensure a certain degree of reliability On the basis of the distributed cluster system, the cost of the distributed cluster system is reduced.
  • the present application provides a cluster arbitration method, which can be applied to a first network device.
  • the method specifically includes the following steps: the first network device acquires a master grab request message of the first node, and determines the master node of the first cluster.
  • the first node is a cluster node participating in the master node election in the first cluster.
  • the master grabbing request message includes the first cluster identifier.
  • the first cluster identifier is used to identify the first cluster.
  • the first network device can determine that it is the first node in the first cluster that requests to elect the master node, and then determine the master node of the first cluster.
  • the primary node of the first cluster is determined by the first network device, and the cluster arbitration of the first cluster is realized. There is no need to additionally set up a shared storage space for arbitration in the first cluster, which can reduce the establishment cost of the distributed cluster system to a certain extent. and maintenance costs.
  • the first network device determines the first node as the master node of the first cluster.
  • the first node may be a cluster node that complies with the arbitration rule of the first network device among the cluster nodes requesting election.
  • the arbitration rule is to use the node that first obtains the master-grab request message as the master node
  • the master-grab request message sent by the first node may be the first network device obtained during the process of selecting the master node. A grab master request message.
  • the arbitration rule is to use the cluster node with the highest priority as the master node
  • the first node may be the cluster node with the highest priority in the first cluster.
  • the first master node information is used to indicate that the master node of the first cluster does not exist.
  • the first network device can determine based on the first master node information that the master node of the first cluster does not exist, and needs to determine the master node of the first cluster. Further, after the first network device sets the first node as the master node of the first cluster, there is a master node, that is, the first node, in the first cluster.
  • the first network device updates the first master node information, so that the updated first master node information indicates the existence of the master node of the first cluster, so that the first network device can subsequently determine the master node based on the updated first master node information, to achieve Timely update of master node information.
  • the first network device may obtain information about the second master node. Wherein, the second master node information is used to indicate the existence status of the master node of the first cluster. If the first network device does not obtain the second master node information, the first network device determines that the master node of the first cluster does not exist, and needs to determine the master node of the first cluster.
  • the first network device can acquire the third master node information.
  • the third master node information is used to indicate that the master node of the first cluster exists.
  • the first network device can determine the master node of the first cluster based on the third master node information.
  • the first network device determines whether the first node is the first candidate master node of the first cluster after obtaining the master grab request message of the first node.
  • the first candidate master node of the first cluster is a cluster node that can be set as the master node of the first cluster. If the first node is the first candidate master node of the first cluster, the first network device may determine the first node as the master node of the first cluster.
  • the first network device can determine whether the first node is the first candidate master node of the first cluster based on the first node information set.
  • the first node information set includes node information of a first candidate master node in the first cluster.
  • the first network device may obtain node information of the first node included in the master grab request message.
  • the first node information set includes node information of the first node, it indicates that the first node is the first candidate master node of the first cluster.
  • the first network device can determine that the first node is the first candidate master node of the first cluster.
  • the candidate master nodes that can serve as the cluster master nodes in the first cluster will change.
  • the first network device may delete the first node information set, so as to avoid subsequent selection of a wrong master node based on the first node information set.
  • the first network device may acquire the node information of the second candidate master node of the first cluster, Obtain the second node information set.
  • the second node information set includes node information of a second candidate master node of the first cluster.
  • the first network device may acquire node information of the second candidate master node of the first cluster from the first node.
  • the second node information set obtained by the first network device is established based on node information of cluster nodes in the first cluster that can serve as candidate master nodes after the first node serves as the master node. The first network device can more accurately perform the next master node election based on the second node information set.
  • the first network device can generate a synchronization packet, and send the synchronization packet to the second network device.
  • the synchronization message includes node information of the master node of the first cluster.
  • the synchronization message is used to instruct the second network device to back up the node information of the master node of the first cluster.
  • the second network device is a backup network device of the first network device. Based on the acquired synchronization message, the second network device can back up the node information of the master node of the first cluster included in the synchronization message. In this way, when there are an active network node and a backup network node, the node information of the master node stored in the active network node and the backup network node can be synchronized.
  • the first network device can obtain the synchronization success message sent by the second network device.
  • the synchronization success message is used to indicate that the backup of the second network device is successful.
  • the first network device can determine that the second network device has successfully backed up the node information of the master node of the first cluster in the synchronization message.
  • the synchronization message further includes node information of the first node.
  • the synchronization message is also used to instruct the second network device to send the first master grabbing response message to the first node.
  • the first master grabbing response message includes the master election result of the first node.
  • the second network device After obtaining the synchronization message, not only backs up the node information of the master node of the first cluster included in the synchronization message, but also sends a first master grabbing response message to the first node.
  • the master election result of the first node is used to identify whether the first node succeeds in preempting the master.
  • the first network device may generate a second master grabbing response message, and send the second master grabbing response message to the first node.
  • the second master snatching response message includes the master election result of the first node. The master election result of the first node is used to identify whether the first node succeeds in preempting the master.
  • the first network device also obtains the query request message of the second node.
  • the second node is a cluster node of the second cluster.
  • the query request packet includes the second cluster identifier.
  • the second cluster identification can identify the second cluster.
  • the first network device may generate a query response message.
  • the query response message includes a query result of the first network device querying the master node of the second cluster.
  • the first network device sends a query response packet to the second node. Based on the query result in the query response message, the second node can determine the existence status of the master node of the second cluster.
  • the present application provides a network device, where the network device includes: a first acquiring unit and a first processing unit.
  • the first acquiring unit is configured to acquire a master grab request message of a first node, where the first node is a cluster node participating in master node election in the first cluster, and the master grab request message includes a first cluster identifier , the first cluster identifier is used to identify the first cluster;
  • the first processing unit is configured to determine the master node of the first cluster.
  • the first processing unit is specifically configured to, in response to determining that the master node of the first cluster does not exist, determine the first node as the master node of the first cluster.
  • the first processing unit is specifically configured to determine that the master node of the first cluster does not exist based on the first master node information, where the first master node information is used to indicate that the first cluster The master node for does not exist.
  • the network device further includes: a second processing unit.
  • the second processing unit is configured to update the first master node information, and the updated first master node information is used to indicate that the master node of the first cluster exists.
  • the first processing unit is specifically configured to, in response to not obtaining the second master node information, determine that the master node of the first cluster does not exist, and the second master node information uses Indicates the existence status of the master node of the first cluster.
  • the first processing unit is specifically configured to determine the master node of the first cluster based on third master node information, where the third master node information is used to indicate that the first cluster The master node exists.
  • the first processing unit is specifically configured to, in response to determining that the first node is the first candidate master node of the first cluster, determine the first node as the first The master node of the cluster.
  • the master grabbing request message includes node information of the first node
  • the first processing unit is specifically configured to determine that the first node information set includes node information of the first node
  • the first node information set includes node information of a first candidate master node of the first cluster.
  • the network device further includes: a third processing unit.
  • the third processing unit is configured to delete the first node information set.
  • the network device further includes: a second acquiring unit.
  • the second acquiring unit is configured to acquire the node information of the second candidate master node of the first cluster, and obtain a second node information set, the second node information set including the nodes of the second candidate master node of the first cluster information.
  • the network device further includes: a third processing unit and a first sending unit.
  • the third processing unit is configured to generate a synchronization message, where the synchronization message includes node information of the master node of the first cluster;
  • the first sending unit is configured to send the synchronization message to the second network device, the synchronization message is used to instruct the second network device to back up the node information of the master node of the first cluster, and the second The network device is a backup network device of the first network device.
  • the network device further includes: a third acquiring unit.
  • the third obtaining unit is configured for the first network device to obtain a synchronization success message sent by the second network device, where the synchronization success message is used to indicate that the backup of the second network device is successful.
  • the synchronization message further includes node information of the first node, and the synchronization message is further used to instruct the second network device to send The first node sends a first master grabbing response message, where the first master grabbing response message includes a master election result of the first node.
  • the network device further includes: a fourth processing unit and a second sending unit.
  • a fourth processing unit configured to generate a second master grabbing response message, where the second master grabbing response message includes the master election result of the first node.
  • a second sending unit configured to send the second master grabbing response message to the first node.
  • the network device further includes: a fourth acquiring unit, a fifth processing unit, and a third sending unit.
  • the fourth obtaining unit is configured to obtain a query request message of a second node, the query request message includes a second cluster identifier, the second cluster identifier is used to identify a second cluster, and the second node is the Cluster nodes for the second cluster.
  • a fifth processing unit configured to generate a query response message, where the query response message includes a query result, and the query result is used to enable the second node to determine the master node of the second cluster according to the query result state of being.
  • a third sending unit configured to send the query response message to the second node.
  • the present application provides a network device, the network device includes a processor chip and a memory, the memory is used to store instructions or program codes, and the processor chip is used to call and run the instructions or program codes from the memory , so as to execute the method in the first aspect or any implementation manner of the first aspect.
  • the present application provides a network system, where the network system includes a network device and a first node, and the network device is configured to execute the method in the first aspect or any implementation manner of the first aspect.
  • the present application provides a computer-readable storage medium, which is characterized in that it includes instructions, programs or codes, and when it is executed on a computer, the computer executes any one of the first aspect or the first aspect. The method described in the implementation mode.
  • the present application provides a chip, including a memory and a processor.
  • Memory is used to store instructions or program codes.
  • the processor is configured to call and run the instruction or program code from the memory, so as to execute the above first aspect or the method in any possible implementation manner of the first aspect.
  • the above-mentioned chip only includes a processor, and the processor is used to read and execute instructions or program codes stored in the memory. When the instructions or program codes are executed, the processor performs the first aspect or the first aspect. A method in any of the possible implementations of .
  • FIG. 1 is a schematic diagram of the architecture of a traditional distributed cluster system
  • FIG. 2 is a schematic diagram of the architecture of a distributed cluster system provided by an embodiment of the present application
  • FIG. 3 is a schematic diagram of the architecture of another distributed cluster system provided by the embodiment of the present application.
  • FIG. 4 is a schematic flow diagram of a cluster arbitration method provided in an embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of a network device provided in an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a network system provided by an embodiment of the present application.
  • FIG. 7 is a schematic structural diagram of the device provided by the embodiment of the present application.
  • FIG. 8 is a schematic structural diagram of a device provided by an embodiment of the present application.
  • the master node of the distributed cluster system is used to coordinate and manage other cluster nodes.
  • voting, default configuration, or competitive arbitration can be used to determine the master node of the cluster.
  • the following is a brief introduction to the selection method of competitive arbitration.
  • the master selection method of competitive arbitration requires additional configuration of shared storage space in the distributed cluster system.
  • the shared storage space needs to establish a connection with each cluster node in the distributed cluster system.
  • each cluster node participating in the master election sends a master grab request to the shared storage space.
  • the shared storage space obtains the master grab request, and based on the arbitration rules, determines the master node among the cluster nodes.
  • the master selection method of competitive arbitration relies on the shared storage space in the distributed cluster system.
  • the distributed cluster system 100 includes cluster nodes 101 - 105 , a switch 106 and a server 107 .
  • the cluster nodes 101-105 are connected to the switch 106, and are connected to the cluster system through the switch 106.
  • the server 107 is an additional shared storage space configured in the distributed cluster system 100 .
  • Server 107 is connected to cluster nodes 101-105, respectively.
  • the server 107 is used to obtain master grabbing requests of the cluster nodes 101-105, and determine the master node of the cluster.
  • the master selection method of competition arbitration will not be affected by the parity of the number of cluster nodes, and can flexibly determine the master node based on the operation of the distributed cluster system.
  • setting additional shared storage space in the distributed cluster system results in higher hardware cost and maintenance cost.
  • a corresponding shared storage space will be configured for every two cluster nodes, which greatly increases the cost of distributed clusters.
  • the embodiment of the present application provides a cluster arbitration method, which implements arbitration by the network devices in the cluster, determines the master node in the cluster, and does not need to set additional shared storage space in the cluster, reducing the cost of building a cluster and maintaining the cluster the cost of.
  • FIG. 2 is a schematic diagram of a distributed cluster system architecture provided by an embodiment of the present application.
  • the distributed cluster system 200 includes cluster nodes 201 - 205 and a network device 206 .
  • Cluster nodes 201 - 205 are connected to network device 206 .
  • Cluster nodes 201 - 205 access the network based on network device 206 .
  • the network device 206 may be a switch, a router or other devices with a forwarding function in the distributed cluster system.
  • the network device 206 is capable of processing arbitration packets to implement arbitration election.
  • the network device 206 may have a storage function, or be able to access storage space.
  • the network device 206 may implement the arbitration master selection function based on the switch chip.
  • a virtualization technology is used to provide an isolated virtualized open space on the network device 206 to run the arbitration master selection function.
  • FIG. 3 the figure is a schematic diagram of another distributed cluster system architecture provided by the embodiment of the present application.
  • the distributed cluster system 300 includes cluster nodes 301-305 and network devices 306-308.
  • Cluster nodes 301-305 are connected to network devices 306-308, respectively.
  • Cluster nodes 301-305 access the network based on network devices 306-308.
  • the network devices 306-308 may be switches, routers or other devices with a forwarding function in the distributed cluster system.
  • Network device 306 may be an active network device, and network devices 307-308 may be backup network devices.
  • Network devices 306-308 are capable of processing arbitration packets to implement arbitration election.
  • the network devices 306-308 can implement the arbitration master selection function based on the switch chip.
  • virtualization technology is used to provide an isolated virtualized open space on the network devices 306-308 to run the arbitration master selection function.
  • network devices 306-308 have storage capabilities, or can access storage space on other devices.
  • FIG. 4 the figure is a schematic flowchart of a cluster arbitration method provided by an embodiment of the present application.
  • the cluster arbitration method provided in the embodiment of the present application includes S401-S402.
  • the first network device acquires a master grab request message of the first node.
  • the first node is a cluster node in the first cluster. As shown in FIG. 2 , the first node may be any cluster node in the cluster nodes 201-205.
  • the first node participates in the master node election, it generates a master grab request message, and sends the master grab request message to the first network device.
  • the master grabbing request message includes the first cluster identifier.
  • the first cluster identifier is used to identify the first cluster to which the first node belongs.
  • the first network device is connected to the cluster nodes in the first cluster to which the first node belongs.
  • the first network device may be directly connected to the first node, or may be indirectly connected through other network devices.
  • the first network device is a network device that connects the cluster nodes to the network.
  • the first network device is located at the center of the distributed cluster system, and is convenient for interacting with the cluster nodes.
  • the first network device is a network device with storage function and programmable function.
  • the first network device is capable of accessing the storage space, and is capable of implementing arbitration to elect the master.
  • the first network device may directly obtain the master grab request message sent by the first node, or may obtain it through other devices.
  • the first network device may be the network device 206 in FIG. 2 , and the first node is the cluster node 201 in FIG. 2 .
  • the network device 206 may obtain the master grab request message sent by the cluster node 201 .
  • the first network device may be a network device 306 , and the first node is the cluster node 301 in FIG. 3 .
  • the network device 306 may obtain the master grab request message sent by the cluster node 301 through the network device 307 .
  • the first network device obtains the master grabbing request message sent by the first node, and based on the first cluster identifier in the master grabbing request message, can determine the first cluster for which the master node needs to be determined.
  • the first network device is connected to cluster nodes of multiple clusters. Based on the first cluster identifier in the master grab request message, the first network device can distinguish clusters and determine the first cluster for which the first node requests master election.
  • a possible arbitration message is provided.
  • the first network device can identify and process the arbitration packet. See Table 1, which is the fields included in the arbitration message.
  • the selection (option, op) field is used to identify the operation type.
  • the value of the cluster (cluster) identification (identity, ID) field is the cluster identification.
  • the sequence field is used to identify the response message.
  • the verification (verify) field is used to confirm whether it is an out-of-sequence packet.
  • a synchronization (sync, sync) identification field is used to determine the validity of the synchronization content.
  • the status (status) field is used to identify the existence status of the master node of the cluster.
  • the value of the master node (owner) field is the Internet Protocol (IP) address of the master node.
  • the error code field carries information.
  • the arbitration message when it is specifically a master grab request message, for example, it may include an op field, a cluster ID field, a sequence field, a verify field, and an owner field.
  • the value of the op field is preempt (preemption), indicating that the operation type is preemption.
  • the cluster ID field carries the first cluster identifier.
  • the owner field carries the IP address of the first node, indicating that the first node is running for the master node.
  • the value of the sequence field identifies the request message.
  • the value of the verify field is used to verify whether the packet is out of sequence.
  • the value of the sequence field can be a predefined character string such as letters and numbers.
  • the value of the verify field can be a predefined string of letters, numbers, etc.
  • the first network device determines a master node of the first cluster.
  • the first network device can obtain the master grabbing request message of the first message, and can process the master grabbing request message to determine the master node of the first cluster.
  • the first network device is connected to the cluster nodes in the first cluster to which the first node belongs, and can interact with the cluster nodes to achieve arbitration and master selection. In this way, there is no need to set up additional shared storage space in the cluster, greatly reducing the cost of third-party arbitration in the distributed cluster system, and improving the availability of the distributed cluster system. In addition, based on the first network device, arbitration is used to select the leader. Compared with the majority algorithm, there is no need for negotiation between cluster nodes to choose the leader, and the election time is shorter.
  • the embodiment of the present application provides two possible implementation manners of determining the master node of the first cluster, which are only used to illustrate the technical solution of the present application, but not to limit the manner of determining the master node of the first cluster.
  • the first method In a possible way of determining the master node of the first cluster, the first network device first determines whether the master node of the cluster exists, and determines the master node of the first cluster according to the existence status of the master node of the first cluster .
  • Manner 1 In response to determining that the master node of the first cluster does not exist, the first network device determines the first node as the master node of the first cluster.
  • the first node that sends the master grab request message may be determined as the master node of the first cluster.
  • the embodiment of the present application does not limit the master selection rule for the first network device to determine the master node of the first cluster.
  • the first network device may be based on a master selection rule that determines the node that requests first to be the master node.
  • the first master grab request message received by the first network device is sent by the first node.
  • the first network device determines the first node as the master node of the first cluster based on the master selection rule.
  • the first network device may determine the master node based on the priorities of the cluster nodes participating in master grabbing. The first node is the node with the highest priority among the cluster nodes participating in the master grab. Then, the first network device determines the first node as the master node.
  • the absence of the master node can include the following two situations:
  • the first network device may acquire the first master node information.
  • the first master node information is used to indicate that the master node of the first cluster does not exist. Among them, if the master node does not exist, it may be that the previous master node failed, or disconnected, etc., and the master node needs to be re-determined.
  • the first master node information may include a first state identifier.
  • the first status identifier is a status identifier indicating that the master node does not exist.
  • the first state identifier may be, for example, "idle (idle)".
  • the first master node information may be generated by the first network device based on the running conditions of the master nodes in the first cluster.
  • the first master node information may be stored in an information table.
  • the embodiment of the present application does not limit the storage location of the first master node information. If the first network device has a storage function, the first network device can store the first master node information in an information table in the local storage space. If the first network device can access other storage spaces, the first network device may store the first master node information in an information table in other storage spaces. For example, the first network device can access the storage space provided in other devices. The first network device may store the first master node information in an information table in the storage space of other devices. When it is necessary to obtain the first master node information, the first network device accesses the information table in the storage space of other devices, and obtains the first master node information from the information table.
  • the first master node information may correspond to the first cluster identifier.
  • the first network device may store the correspondence between the first master node information and the first cluster identifier in an information table in a local storage space, or in an information table in another storage space.
  • the first network device can query and obtain the first master node information based on the first cluster identifier in the master grab request message and the corresponding relationship between the first master node information and the first cluster identifier in the information table.
  • the first network device can correspondingly update the information of the first master node.
  • the updated first master node information is used to indicate that the master node of the first cluster exists.
  • the updated first master node information may include a second state identifier.
  • the second status identifier is a status identifier indicating the existence of the master node.
  • the second status flag can be, for example, "in use (in use)".
  • the updated first master node information may also include node information of the first node.
  • the node information of the first node may be acquired by the first network device.
  • the node information of the first node may be the IP address of the first node.
  • Case 2 In another implementation, when the first network device does not obtain the second master node information, the first network device determines that the master node of the first cluster does not exist.
  • the second master node information is used to indicate the existence status of the master node of the first cluster.
  • the existence state includes two states of existence and non-existence.
  • the first network device cannot obtain the information of the second master node. For example, it may be that the cluster has not started to elect the master node, or the first network device has not obtained the information related to the master node.
  • the second master node information may be stored in an information table.
  • the first network device can query the information table in the local storage space, or the information table in other storage spaces. If the information of the second master node cannot be obtained from the query, the first network device cannot determine whether the master node of the first cluster exists. In order for the first cluster to operate normally, the first network device determines that the master node of the first cluster does not exist, and needs to determine the master node.
  • the first network device determines the master node of the first cluster according to the master election rule. Similarly, the first network device can determine the first node as the master node of the first cluster based on the master election rule. Correspondingly, the first network device generates second master node information. The generated second master node information indicates that the master node of the first cluster exists, and may include a second status identifier. The first network device writes the second master node information into the information table.
  • the following introduces a possible implementation manner in which the first network device determines that the master node of the first cluster does not exist and determines that the first node is the master node of the first cluster in combination with the above master grab request message.
  • the first network device obtains the master grabbing request message.
  • the preemption operation is determined based on the value of the op field.
  • the first network device first obtains the value pkt.verify of verify in the preemption request message. Compare pkt.verify with switch0.verify stored by the first network device.
  • the first network device acquires the first cluster identifier in the cluster ID field in the master grab request message.
  • the first network device queries the information table based on the first cluster identifier to obtain corresponding first master node information.
  • the first master node information includes a first state identifier, that is, "idle”.
  • the first network device determines the first node as the master node of the first cluster.
  • the first network device updates the first master node information.
  • the updated first master node information includes the second status identifier, that is, "inuse", and includes the IP address of the first node.
  • the IP address of the first node is obtained from the master grabbing request message.
  • the first network device fails to find the second master node information corresponding to the first cluster identifier in the information table.
  • the first network device determines that the master node of the first cluster does not exist, and determines the first node as the master node of the first cluster.
  • the first network device writes the second master node information corresponding to the first cluster identifier in the information table.
  • the second master node information includes the second state identifier, that is, inuse, and includes the IP address of the first node.
  • the first network device also updates switch0.verify to pkt.verify.
  • the first network device may generate a corresponding response message.
  • the error code field is carried in the response message.
  • the error code field carries the information that the check code is wrong.
  • Manner 2 In response to determining that the master node of the first cluster exists, the first network device determines the master node of the first cluster.
  • the first network device can determine the master node of the first cluster. When the master node of the cluster exists, the first node fails to run for the master node.
  • the first network device may obtain information about the third master node.
  • the third master node information is used to indicate that the master node of the first cluster exists.
  • the third master node information may indicate that the existence status of the master node of the first cluster is existing.
  • the third master node information includes the second state identifier.
  • the third master node information may include node information of the master node of the first cluster.
  • the first network device can determine the master node of the first cluster based on the node information of the master node of the first cluster.
  • the third master node information may be generated by the first network device based on the running conditions of the master nodes in the cluster.
  • the third master node information may be stored in an information table.
  • the first network device can store the third master node information in an information table in the local storage space.
  • the first network device may store the third master node information in an information table in other storage spaces.
  • the first network device can access the storage space provided in other devices.
  • the first network device may store the third master node information in an information table in the storage space of other devices.
  • the first network device accesses the information table in the storage space of other devices, and obtains the third master node information from the information table.
  • the third master node information may correspond to the first cluster identifier.
  • the first network device may store the correspondence between the third master node information and the first cluster identifier in an information table in a local storage space, or in an information table in a storage space of other devices.
  • the first network device can query and obtain the third master node information based on the first cluster identifier in the master grab request message and the correspondence between the third master node information and the cluster identifier in the information table.
  • the following introduces a possible implementation manner in which the master node of the first cluster exists and the first network device determines the master node of the first cluster in combination with the above master grab request message.
  • the first network device queries the information table based on the first cluster identifier to obtain the corresponding third master node information.
  • the third master node information includes the second status identifier, that is, "inuse”.
  • the third master node information also includes the IP address of the master node.
  • the first network device can determine that the master node of the first cluster exists based on the third master node information, and can determine the master node of the first cluster based on the IP address of the master node.
  • Second method In another possible manner of determining the master node of the first cluster, the first network device determines whether the first node is a first candidate master node. If the first node is the first candidate master node, the first network device determines the first node as the master node of the first cluster after receiving the master grab request message from the first node.
  • the first network device determines the first node as the master node of the first cluster.
  • the first candidate master node of the first cluster is predetermined by the first network device.
  • the first candidate master node of the first cluster is a cluster node that can be elected as the master node when the master node fails or cannot be connected.
  • the first candidate master node of the first cluster is related to the master node of the first cluster.
  • the first candidate master node of the first cluster may be a cluster node capable of providing external services other than the master node of the first cluster.
  • the first network device may acquire node information of the first candidate master node of the first cluster, and store it in the first node information set.
  • the first node information set includes node information of a first candidate master node.
  • the node information of the first candidate master node may be an IP address.
  • the master snatching request message of the first node acquired by the first network device includes node information of the first node.
  • the first network device can obtain the node information of the first node based on the master grab request message of the first node, and then determine whether the first node information set includes the node information of the first node. If so, it may be determined that the first node is the first candidate master node, and then the first node is used as the master node of the first cluster.
  • the first network device may acquire node information of the first candidate master node through the master node.
  • cluster nodes in the first cluster may negotiate to determine the master node, or the first network device may determine the master node.
  • the first network device may acquire node information of the cluster nodes of the first cluster.
  • the first network device determines the master node among the cluster nodes participating in the master election based on the obtained node information and the master election rule.
  • the master selection rule may be, for example, to determine the master node based on the priority of the cluster nodes.
  • the primary node may be determined according to the time sequence of the acquired node information of the cluster nodes.
  • the master node After determining the master node of the first cluster, the master node interacts with other cluster nodes in the first cluster to acquire node information of other cluster nodes.
  • the master node then sends the acquired node information of other cluster nodes to the first network device.
  • the master node may send the node information of the cluster nodes capable of providing services to the first network device.
  • the first network device uses the cluster node corresponding to the acquired node information as a first candidate master node, and generates a first node information set.
  • the first network device can store the first node information set in a local storage space. If the first network device can access other storage spaces, the first network device may store the first node information set in other storage spaces. For example, the first network device can access the storage space provided in other devices. The first network device may store the first node information set in the storage space of other devices. When it is necessary to query the first node information set, the first network device accesses the storage space of other devices, and queries whether the first node information set includes the node information of the first node.
  • the first node information set may correspond to the first cluster identifier.
  • the first network device may store the correspondence between the first node information set and the first cluster identifier in a local storage space or in other storage spaces.
  • the first network device can obtain the first node information set based on the first cluster identifier in the master grab request message and the corresponding relationship between the first node information set and the first cluster identifier, and then determine whether the first node information set contains Including node information of the first node.
  • the candidate master nodes of the first cluster need to be changed accordingly.
  • the first network device may delete the first node information set.
  • the first network device may also acquire node information of a second candidate master node of the first cluster, and generate a second node information set.
  • the first node that is, the re-determined primary node of the first cluster, may obtain node information of other cluster nodes in the cluster, and send the node information of other cluster nodes to the first network device.
  • the first network device uses the cluster node from which the node information has been obtained as a second candidate master node, and generates a second node information set based on the node information of the second candidate master node.
  • the following introduces a possible implementation manner in which the first network device determines the first node as the first candidate master node and uses the first node as the master node of the first cluster in combination with the above master grab request message.
  • the first network device queries the storage space to obtain the corresponding first node information set based on the first cluster identifier carried in the "cluster id" field in the master grab request message. The first network device determines whether the IP address of the first node is included in the first node information set. The IP address of the first node is obtained from the "owner" field in the master grabbing request message.
  • the cluster When the cluster includes multiple network devices, information needs to be synchronized between the multiple network devices.
  • the embodiment of the present application does not limit the way of triggering the synchronization of information between network devices, for example, it may be that the information stored in the network devices changes, or it may be that the synchronization time is reached.
  • the embodiment of the present application provides a possible specific implementation manner of synchronizing data between network devices.
  • the first cluster includes the first network device and the second network device.
  • the second network device is a backup device of the first network device. Both the first network device and the second network device are connected to the cluster nodes of the first cluster.
  • the second network device also has the function of cluster arbitration and can access the storage space.
  • the first network device may be a network device 306
  • the second network device may be a network device 307 and a network device 308 .
  • the first network device After determining the first node as the master node of the first cluster, the first network device needs to synchronize with the second network device relevant information that the master node of the first cluster is the first node.
  • the first network device generates a synchronization packet.
  • the synchronization message includes node information of the master node of the first cluster. Specifically, after the first is determined as the master node of the first cluster, the synchronization message generated by the first network device includes node information of the first node.
  • the node information of the first node may be, for example, the IP address of the first node.
  • the first network device sends a synchronization packet to the second network device.
  • the second network device can back up the node information of the master node of the first cluster in the synchronization message, so as to realize data synchronization with the first network device.
  • a plurality may mean at least two.
  • some second network devices are not directly connected to the first network device.
  • the synchronization message may be forwarded through other second network devices.
  • a first network device that is, network device 306
  • a second network device that is, network device 307.
  • a second network device ie, network device 307
  • the network device 306 generates and sends a synchronization message to the network device 307 .
  • the network device 307 then forwards the synchronization packet to the network device 308 .
  • this embodiment of the present application provides a synchronization message.
  • the synchronization message may include, for example, an op field, a cluster ID field, a sequence field, a sync ID field, a verify field, a status field, and an owner field.
  • the value of the op field is sync, indicating that the operation type is synchronization.
  • the cluster ID field carries the first cluster identifier.
  • the owner field carries the IP address of the first node and represents the node information of the master node.
  • the sync ID field is used to verify whether the query synchronization message is valid.
  • the verify field is used to verify whether the synchronization packet is an out-of-sequence packet.
  • the first network device sends the synchronization packet to the second network device.
  • the second network device determines that a synchronization operation needs to be performed based on the value of the op field as sync.
  • the second network device first obtains the value pkt.sync ID of the sync ID field in the synchronization message. Compare the pkt.sync ID with the stored switch1.sync ID of the second network device. If the pkt.sync ID is greater than the switch1.sync ID, it is determined that the content carried by the synchronization message is valid for the second network device.
  • the second network device obtains the information carried in the cluster ID field, sync ID field, status field and owner field, and stores the information in the storage space corresponding to the second network device.
  • the second network device If the second network device stores the information carried in the above fields, the second network device updates the stored information correspondingly. Moreover, if the second network device is not the last device in the forwarding path of the synchronization message, continue to forward the synchronization message to other second network devices.
  • the pkt.sync ID is less than or equal to the switch1.sync ID, it means that the content carried by the synchronization message is invalid for the second network device, that is, the second network device has already synchronized the content carried by the synchronization message. If the second network device is not the last device in the forwarding path of the synchronization message, continue to forward the synchronization message to other second network devices.
  • the second network device may generate a synchronization success message, and send the synchronization success message to the first network device. Based on the acquired synchronization success message, the first network device can determine that the synchronization of the second network device in the forwarding path of the synchronization message is successful.
  • the synchronization message is repeatedly sent until the first network device receives the synchronization success message.
  • the network device 308 is the last network device in the forwarding path of the synchronization message. After receiving the synchronization message and synchronizing the content carried in the synchronization message, the network device 308 generates a synchronization success message. The network device 308 sends a synchronization success message to the network device 306 .
  • this embodiment of the present application provides a synchronization success message.
  • the synchronization packet may include, for example, an op field, a cluster ID field, a sequence field, and a sync ID field.
  • the value of the op field is sync-ack, indicating that the operation is a successful synchronization.
  • the cluster ID field carries the first cluster identifier.
  • the content carried in the sync ID field is consistent with the content carried in the sync ID field in the synchronization message.
  • the second network device sends a synchronization success message to the first network device. Based on the value of the op field in the synchronization success message as sync-ack, the first network device can determine that the message is a synchronization success message. Moreover, based on the cluster ID field, the first network device can determine that the successful synchronization is the node information of the master node of the first cluster corresponding to the first cluster ID. Based on the sync ID field, the first network device can determine the content of the sync packet that has been successfully synced.
  • the first network device After the first network device determines the master node of the first cluster, it also needs to send a master grabbing response message to the first node, so that the first node obtains a master election result.
  • the first network device may send the second master grabbing response message to the first node.
  • the second network device may send the first master grab response message to the first node, or the first network device may send the first network device to the first node.
  • the node sends the second master grabbing response message.
  • the first master grabbing response message and the second master grabbing response message include the master election result of the first node. If the first network device determines the first node as the master node of the first cluster, or the master node of the first cluster exists and the master node is the first node, the result of the master election is that the master election is successful. If the master node of the first cluster exists, and the master node is not the first node, the master election result is master election failure.
  • the first network device determines that the master node of the first cluster exists, or the master node of the first cluster is the first node, the information stored in the first network device has not been updated, and there is no need to communicate with The second network device is synchronized.
  • the first network device generates a second master grabbing response message and sends the second master grabbing response message to the first node.
  • the first network device may send the second master-grabbing response message to the first node, or the second network device may send the first master-grabbing response message to the first node.
  • the synchronization message sent by the first network device includes node information of the first node.
  • the second network device can determine the master selection result of the first node based on the node information of the master node of the first cluster and the node information of the first node in the synchronization message.
  • the second network device that sends the first master grab response message may be the last second network device in the forwarding path of the synchronization message.
  • the cluster node 301 is the first node
  • the network device 306 is the first network device
  • the network device 307 and the network device 308 are the second network devices.
  • the forwarding path of the synchronization message includes the network device 306 , the network device 307 and the network device 308 .
  • Network device 308 is the last network device in the forwarding path.
  • the network device 306 determines that the master node of the first cluster exists and is not the cluster node 301 , it generates a master grab response message including the master election result of master election failure, and sends the master grab response message to the cluster node 301 .
  • the network device 306 determines that the master node of the first cluster exists and is the cluster node 301 , it generates a master grabbing response message including a master election result of successful master election, and sends the master grabbing response message to the cluster node 301 .
  • the network device 306 determines the cluster node 301 as the master node of the first cluster, it generates a synchronization message and sends the synchronization message to the network device 307 .
  • the network device 306 generates a master-preemption response message including a master-selection result of successful master election, and sends the master-preemption response message to the cluster node 301 .
  • the network device 308 obtains the synchronization message through the network device 307, based on the node information of the first node in the synchronization message, it generates a master grabbing response message including the result of master election success , and send a master grab response message to the cluster node 301.
  • this embodiment of the present application provides a preemption response message.
  • the master grabbing response message may include, for example, an op field, a cluster ID field, a sequence field, a sync ID field, a verify field, a status field, an owner field, and an error code field.
  • the value of the op field is preempt, indicating that the operation type is preempting the master.
  • the cluster ID field carries the first cluster identifier.
  • the content carried in the sequence field may be consistent with the content carried in the preemption request message, indicating that it is a preemption response message corresponding to the preemption request message.
  • the sync ID field is used to verify whether the master grabbing response message is valid.
  • the verify field is used to verify whether the master grabbing response packet is an out-of-sequence packet.
  • the status field carries a second status identifier, indicating that the master node of the first cluster exists.
  • the owner field carries the IP address of the first node, indicating that the master node is the first node.
  • the error code field carries the master selection result.
  • the first network device can also be used to query the master node.
  • using the first network device to query the master node of the cluster may include the following three steps:
  • Step 1 The first network device obtains the query request message of the second node.
  • the first network device can be connected to cluster nodes in multiple clusters.
  • the first network device is directly or indirectly connected to the cluster nodes of the second cluster.
  • the second node is a cluster node of the second cluster.
  • the second cluster may be a cluster different from the first cluster, or may be the first cluster.
  • the second node may be a cluster node different from the first node, or may be the first node.
  • the first network device may directly obtain the query request message sent by the second node, or may obtain the query request message through other devices, such as the second network device.
  • the query request packet includes the second cluster identifier.
  • the second cluster identifier corresponds to the second cluster. Based on the second cluster identifier in the query request message, the first network device can determine that node information of the master node of the second cluster needs to be queried.
  • this embodiment of the present application provides a query request message.
  • the query request message may include, for example, an op field, a cluster ID field and a sequence field.
  • the cluster ID field carries the second cluster identifier.
  • the first network device may query the node information of the master node of the second cluster in the storage space, and generate a corresponding query result.
  • the query result may be a query failure. If the first network device inquires that the master node of the second cluster is not determined, the query result may include a status identifier indicating that the master node of the second cluster does not exist. For example, it may be a first state identifier. If the first network device queries to determine the master node of the second cluster, the query result may include a status identifier indicating the existence of the master node of the second cluster, for example, it may be the second status identifier, and/or the status identifier of the master node of the second cluster Node information. The node information of the master node of the second cluster may be the IP address of the master node of the second cluster.
  • the first network device may store the node information of the master node of the second cluster and/or the existence state information of the master node of the second cluster in the storage space. Specifically, the first network device may pre-acquire the node information of the master node of the second cluster and/or the existence state information of the master node of the second cluster, and store them locally or in a storage space of other devices.
  • Step 2 The first network device generates a query response message.
  • the first network device generates a query response message based on the query result.
  • the query response message includes the query result.
  • this embodiment of the present application provides an arbitration message whose specific operation type is query.
  • the query result includes a status identifier indicating the presence of the master node of the second cluster and node information of the master node of the second cluster.
  • the query response message may include, for example, an op field, a cluster ID field, a sequence field, a verify field, a status field, an owner field, and an error code field .
  • the value of the op field is read, indicating that the operation type is query.
  • the cluster ID field carries the second cluster identifier.
  • the content carried in the sequence field may be consistent with the content carried in the sequence field in the query request message, indicating that it is a query response message.
  • the sync ID field is used to verify whether the query response message is valid.
  • the verify field is used to verify whether the query response message is an out-of-sequence message, and the status field carries the second status identifier, indicating that the master node of the second cluster exists.
  • the owner field carries the IP address of the master node of the second cluster.
  • the error code field carries a success identifier, indicating that the query is successful.
  • the successful identifier may be, for example, "succeed".
  • Step 3 the first network device sends a query response message to the second node.
  • the first network device sends the generated query response message to the second node.
  • the second node obtains the query response message.
  • the second node can determine whether the master node of the second cluster exists according to the query result carried in the query response message. In this way, it is convenient for the second node to apply for the master or interact with the master node.
  • FIG. 5 shows a schematic structural diagram of a possible network device involved in the above embodiment, and the network device 500 can implement the function of the first network device in the example shown in FIG. 4 .
  • the network device 500 includes: a first acquiring unit 501 and a first processing unit 502 .
  • the first acquiring unit 501 is configured to support the network device 500 to execute S401 in FIG. 4; the first processing unit 502 is configured to support the network device 500 to execute S402 in FIG. 4; and/or the first network device in the technology described herein other processes performed.
  • the first acquisition unit 501 is configured to perform various acquisition operations performed by the first network device in the above method embodiments; the first processing unit 502 is configured to perform various processing operations of the first network device in the above method embodiments .
  • the first obtaining unit 501 is configured to obtain a master grab request message of the first node; the first processing unit 502 is configured to determine the master node of the first cluster.
  • the specific execution process please refer to the detailed description of the corresponding steps in the above embodiment shown in FIG. 4 , which will not be repeated here.
  • each functional unit in the embodiment of the present application may be integrated into one processing unit, or each unit may physically exist separately, or two or more units may be integrated into one unit.
  • the acquisition unit and the processing unit may be the same unit or different units.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
  • an embodiment of the invention provides a cluster arbitration system 600, which is used to implement the cluster arbitration method in the foregoing method embodiments.
  • the system 600 includes a network device 601 and a first node 602 .
  • the network device 601 may implement the function of the first network device in the embodiment shown in FIG. 4 .
  • For the specific execution process please refer to the detailed description of the corresponding steps in the above embodiment shown in FIG. 4 , which will not be repeated here.
  • FIG. 7 is a schematic structural diagram of a device 700 provided in an embodiment of the present application.
  • the network device 500 in FIG. 5 can be implemented by the device shown in FIG. 7 .
  • the device 700 includes at least one processor 701 , a communication bus 702 and at least one network interface 704 , and optionally, the device 700 may further include a memory 703 .
  • the processor 701 may be a general-purpose central processing unit (central processing unit, CPU), a specific application integrated circuit (application-specific integrated circuit, ASIC) or one or more integrated circuits (integrated circuit) for controlling the program execution of the application program , IC).
  • the processor can be used to process the message, so as to implement the cluster arbitration method provided in the embodiment of the present application.
  • the processor can be used to obtain the master grab request message of the first node and determine the master node of the first cluster.
  • the specific function For implementation, reference may be made to the processing part corresponding to the first network device in the method embodiment.
  • Communication bus 702 is used to transfer information between processor 701 , network interface 704 and memory 703 .
  • the memory 703 can be a read-only memory (read-only memory, ROM) or other types of static storage devices that can store static information and instructions, and the memory 703 can also be a random access memory (random access memory, RAM) or can store information and other types of dynamic storage devices for instructions, and can also be compact disc read-only Memory (CD-ROM) or other optical disc storage, optical disc storage (including compact disc, laser disc, optical disc, digital versatile disc, Blu-ray optical discs, etc.), magnetic disk storage media or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited thereto.
  • the memory 703 may exist independently, and is connected to the processor 701 through the communication bus 702 .
  • the memory 703 can also be integrated with the processor 701.
  • the memory 703 is used to store program codes or instructions for implementing the solutions of the present application, and the execution is controlled by the processor 701 .
  • the processor 701 is used to execute program codes or instructions stored in the memory 703 .
  • One or more software modules may be included in the program code.
  • the processor 701 may also store program codes or instructions for executing the solutions of the present application. In this case, the processor 701 does not need to read the program codes or instructions from the memory 703 .
  • the network interface 704 can be a device such as a transceiver for communicating with other devices or a communication network, and the communication network can be Ethernet, radio access network (RAN) or wireless local area networks (wireless local area networks, WLAN). In the embodiment of the present application, the network interface 704 may be used to receive messages sent by other nodes in the segment routing network, and may also send messages to other nodes in the segment routing network.
  • the network interface 704 may be an Ethernet interface (ethernet) interface, a fast ethernet (fast ethernet, FE) interface or a gigabit ethernet (gigabit ethernet, GE) interface, etc.
  • the device 700 may include multiple processors, for example, the processor 701 and the processor 405 shown in FIG. 7 .
  • processors may be a single-core (single-CPU) processor or a multi-core (multi-CPU) processor.
  • a processor herein may refer to one or more devices, circuits, and/or processing cores for processing data (eg, computer program instructions).
  • FIG. 8 is a schematic structural diagram of a device 800 provided in an embodiment of the present application.
  • the first network device in FIG. 4 may be implemented by the device shown in FIG. 8 .
  • the device 800 includes a main control board and one or more interface boards.
  • the main control board is communicatively connected with the interface board.
  • the main control board is also called the main processing unit (main processing unit, MPU) or route processing card (route processor card). Route calculation, device management and maintenance functions.
  • the interface board is also called a line processing unit (line processing unit, LPU) or a line card (line card), and is used to receive and send packets.
  • line processing unit line processing unit
  • LPU line processing unit
  • line card line card
  • the communication between the main control board and the interface board or between the interface board and the interface board is through a bus.
  • the interface boards communicate through a switching fabric board.
  • the device 800 also includes a switching fabric board.
  • the switching fabric board communicates with the main control board and the interface board.
  • the switching fabric board is used to forward the interface board.
  • the SFU can also be called a switch fabric unit (SFU).
  • the interface board includes a CPU, a memory, a forwarding engine, and an interface card (interface card, IC), where the interface card may include one or more network interfaces.
  • the network interface may be an Ethernet interface, an FE interface, or a GE interface.
  • the CPU communicates with the memory, the forwarding engine and the interface card respectively.
  • the memory is used to store the forwarding table.
  • the forwarding engine is used to forward the received message based on the forwarding table stored in the memory. If the destination address of the received message is the IP address of the device 800, the message is sent to the CPU of the main control board or the interface board for further processing. Processing; if the destination address of the received message is not the IP address of the device 800, the forwarding table is checked according to the destination, if the next hop and the outgoing interface corresponding to the destination address are found from the forwarding table, the message is Forward to the outbound interface corresponding to the destination address.
  • the forwarding engine may be a network processor (network processor, NP).
  • the interface card is also called a daughter card, which can be installed on the interface board, and is responsible for converting the photoelectric signal into a data frame, and checking the validity of the data frame before forwarding it to the forwarding engine for processing or the CPU of the interface board.
  • the CPU can also perform the function of the forwarding engine, such as implementing soft forwarding based on a general-purpose CPU, so that no forwarding engine is needed in the interface board.
  • the forwarding engine may be implemented by an ASIC or a field programmable gate array (field programmable gate array, FPGA).
  • the memory storing the forwarding table can also be integrated into the forwarding engine as a part of the forwarding engine.
  • the embodiment of the present application also provides a chip system, including: a processor, the processor is coupled with a memory, and the memory is used to store programs or instructions, and when the programs or instructions are executed by the processor, the The chip system implements the method executed by the first network device in the embodiment shown in FIG. 4 above.
  • processors in the chip system there may be one or more processors in the chip system.
  • the processor can be realized by hardware or by software.
  • the processor may be a logic circuit, an integrated circuit, or the like.
  • the processor may be a general-purpose processor implemented by reading software codes stored in a memory.
  • the memory can be integrated with the processor, or can be set separately from the processor, which is not limited in this application.
  • the memory can be a non-transitory processor, such as a read-only memory ROM, which can be integrated with the processor on the same chip, or can be respectively arranged on different chips.
  • the setting method of the processor is not specifically limited.
  • the system-on-a-chip can be an FPGA, an ASIC, a system on chip (SoC), a CPU, an NP, or a digital signal processing circuit (digital signal processor, DSP), can also be a microcontroller (micro controller unit, MCU), can also be a programmable controller (programmable logic device, PLD) or other integrated chips.
  • SoC system on chip
  • DSP digital signal processing circuit
  • MCU microcontroller
  • PLD programmable controller
  • each step in the foregoing method embodiments may be implemented by an integrated logic circuit of hardware in a processor or instructions in the form of software.
  • the method steps disclosed in connection with the embodiments of the present application may be directly implemented by a hardware processor, or implemented by a combination of hardware and software modules in the processor.
  • the embodiment of the present application also provides a computer-readable storage medium, including instructions, which, when run on a computer, cause the computer to execute the method in the foregoing embodiments.
  • At least one (one) means one or more, and “multiple” means two or more.
  • At least one of the following" or similar expressions refer to any combination of these items, including any combination of single or plural items.
  • at least one item (piece) of a, b, or c can represent: a, b, c, a-b, a-c, b-c, or a-b-c, where a, b, c can be single or multiple .
  • “A and/or B” is considered to include A alone, B alone, and A+B.
  • the disclosed systems, devices and methods can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical module division. In actual implementation, there may be other division methods.
  • multiple units or components can be combined or May be integrated into another system, or some features may be ignored, or not implemented.
  • the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be obtained according to actual needs to achieve the purpose of the solution of this embodiment.
  • each module unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software module units.
  • the integrated unit is implemented in the form of a software module unit and sold or used as an independent product, it can be stored in a computer-readable storage medium.
  • the technical solution of the present application is essentially or part of the contribution to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disc, etc., which can store program codes. .
  • the functions described in the present invention may be implemented by hardware, software, firmware or any combination thereof.
  • the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium.
  • Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another.
  • a storage media may be any available media that can be accessed by a general purpose or special purpose computer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Hardware Redundancy (AREA)

Abstract

La présente invention concerne un procédé de mise en œuvre de quorum de grappe, un dispositif de réseau et un système. Le procédé consiste à : acquérir un message de demande de préemption d'un premier nœud au moyen d'un premier dispositif de réseau dans une grappe ; et déterminer un nœud maître d'une première grappe. De cette manière, un dispositif de réseau dans une grappe est utilisé pour réaliser une sélection de nœud maître basée sur quorum sans avoir besoin de configurer en plus un autre espace de stockage partagé dans la grappe, ce qui permet de réduire les coûts du système de grappe distribuée tout en garantissant la fiabilité de celui-ci dans une certaine mesure.
PCT/CN2022/133885 2021-12-28 2022-11-24 Procédé de mise en œuvre de quorum de grappe, dispositif de réseau et système WO2023124663A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111633349.5 2021-12-28
CN202111633349.5A CN116367202A (zh) 2021-12-28 2021-12-28 一种集群仲裁方法、网络设备及系统

Publications (1)

Publication Number Publication Date
WO2023124663A1 true WO2023124663A1 (fr) 2023-07-06

Family

ID=86940455

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/133885 WO2023124663A1 (fr) 2021-12-28 2022-11-24 Procédé de mise en œuvre de quorum de grappe, dispositif de réseau et système

Country Status (2)

Country Link
CN (1) CN116367202A (fr)
WO (1) WO2023124663A1 (fr)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1512729A (zh) * 2002-12-31 2004-07-14 联想(北京)有限公司 网络设备自适应负载均衡的方法
EP1869868A2 (fr) * 2005-04-13 2007-12-26 Nokia Corporation Systeme, dispositif reseau, procede et progiciel assurant un equilibrage actif de la charge au moyen de noeuds groupes faisant office de serveurs de noms de domaine autorises
CN106331046A (zh) * 2015-07-02 2017-01-11 中兴通讯股份有限公司 集群主节点选举方法及装置
CN106878382A (zh) * 2016-12-29 2017-06-20 北京华为数字技术有限公司 一种分布式仲裁集群中动态改变集群规模的方法及装置
CN111355600A (zh) * 2018-12-21 2020-06-30 杭州海康威视数字技术股份有限公司 一种主节点确定方法和装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1512729A (zh) * 2002-12-31 2004-07-14 联想(北京)有限公司 网络设备自适应负载均衡的方法
EP1869868A2 (fr) * 2005-04-13 2007-12-26 Nokia Corporation Systeme, dispositif reseau, procede et progiciel assurant un equilibrage actif de la charge au moyen de noeuds groupes faisant office de serveurs de noms de domaine autorises
CN106331046A (zh) * 2015-07-02 2017-01-11 中兴通讯股份有限公司 集群主节点选举方法及装置
CN106878382A (zh) * 2016-12-29 2017-06-20 北京华为数字技术有限公司 一种分布式仲裁集群中动态改变集群规模的方法及装置
CN111355600A (zh) * 2018-12-21 2020-06-30 杭州海康威视数字技术股份有限公司 一种主节点确定方法和装置

Also Published As

Publication number Publication date
CN116367202A (zh) 2023-06-30

Similar Documents

Publication Publication Date Title
US11734138B2 (en) Hot standby method, apparatus, and system
US7859992B2 (en) Router redundancy in data communication networks
US9054958B2 (en) System and method for reducing information loss in an aggregated information handling system
US7225356B2 (en) System for managing operational failure occurrences in processing devices
EP2993838B1 (fr) Procédés de configuration d'identité de membre de dispositif formant passerelle, et dispositifs formant passerelle de gestion correspondants
EP3316555B1 (fr) Procédé, dispositif et système de synchronisation d'adresse mac
US10560550B1 (en) Automatic configuration of a replacement network device in a high-availability cluster
US8320281B2 (en) Methods, systems and computer readable media for utilizing a standard spanning tree protocol to indicate an active switching fabric in a network node
JP2016520904A (ja) 分散型ロードバランサにおける非対称パケットフロー
US10447652B2 (en) High availability bridging between layer 2 networks
US20160205033A1 (en) Pool element status information synchronization method, pool register, and pool element
WO2020057445A1 (fr) Système, procédé et dispositif de communication
US10819628B1 (en) Virtual link trunking control of virtual router redundancy protocol master designation
WO2018161632A1 (fr) Procédé et appareil de mise à jour de capacité pour un dispositif distribué
US20140241374A1 (en) System and method for ingress port identification in aggregate switches
US11012301B2 (en) Notification and transfer of link aggregation group control in anticipation of a primary node reboot
WO2023124663A1 (fr) Procédé de mise en œuvre de quorum de grappe, dispositif de réseau et système
US8984634B2 (en) Quarantine network system, server apparatus, and program
JP2012209625A (ja) クラスタシステムの結線作業の煩雑さを軽減するシステム及び方法
WO2020181733A1 (fr) Procédé d'intercommunication de centre de données multiples basé sur vpc et dispositif associé
WO2022213783A1 (fr) Procédé et appareil d'acheminement de trafic
US11418382B2 (en) Method of cooperative active-standby failover between logical routers based on health of attached services
US11528222B2 (en) Decentralized control plane
WO2021244588A1 (fr) Procédé de traitement d'un message de routage, dispositif de communication, support d'enregistrement et système
CN115152192B (zh) Pce受控网络可靠性

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22913921

Country of ref document: EP

Kind code of ref document: A1