CN114050960B - Distributed server cluster node management method and device - Google Patents

Distributed server cluster node management method and device Download PDF

Info

Publication number
CN114050960B
CN114050960B CN202111218604.XA CN202111218604A CN114050960B CN 114050960 B CN114050960 B CN 114050960B CN 202111218604 A CN202111218604 A CN 202111218604A CN 114050960 B CN114050960 B CN 114050960B
Authority
CN
China
Prior art keywords
node
service
server
management
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111218604.XA
Other languages
Chinese (zh)
Other versions
CN114050960A (en
Inventor
赵鹏松
钟金超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Yealink Network Technology Co Ltd
Original Assignee
Xiamen Yealink Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Yealink Network Technology Co Ltd filed Critical Xiamen Yealink Network Technology Co Ltd
Priority to CN202111218604.XA priority Critical patent/CN114050960B/en
Publication of CN114050960A publication Critical patent/CN114050960A/en
Application granted granted Critical
Publication of CN114050960B publication Critical patent/CN114050960B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04Network management architectures or arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses a distributed server cluster node management method and a distributed server cluster node management device, which are characterized in that communication connection is established among a service management terminal, a node management terminal, a service management server and a node management server in a server cluster, so that each function service module information of server nodes corresponding to a plurality of service management servers is acquired, a function service module list is generated, corresponding server node information of a plurality of node management terminals is acquired, a node information list is generated, the separation of function service module management and server node information management is realized, and a database is updated according to the node information list and the function service module list; compared with the prior art, the invention realizes low coupling by adopting the separated design of service management and node management, and increases the expansibility of the server cluster.

Description

Distributed server cluster node management method and device
Technical Field
The present invention relates to the field of embedded device cluster technologies, and in particular, to a method and an apparatus for managing distributed server cluster nodes.
Background
In the server cluster, the high availability of the server cluster is influenced by the state of the server node, and the server node is online and offline, so that the high availability of the server cluster is reduced; in order to improve the high availability of the server cluster, it is required that a part of the server nodes are offline due to faults, and the server nodes joining the cluster have no influence on the cluster service, so that the server nodes of the server cluster need to be managed, so that the server nodes are as simple as possible and have no state. Meanwhile, when a service function module is newly added, the adaptation compatibility of the bottom layer is good, and the complexity of a layer architecture needs to be reduced as much as possible, so that the large modification and design of the upper layer service are caused.
Disclosure of Invention
The invention aims to solve the technical problems that: the method and the device for managing the distributed server cluster nodes are provided, and low coupling is realized by adopting a separated design of service management and node management, so that the expansibility of the server cluster is increased.
In order to solve the technical problems, the invention provides a distributed server cluster node management method which is applied to a node management server, wherein the node management server is in communication connection with a plurality of node management terminals, the plurality of node management terminals are in communication connection with the service management server, the service management server is in communication connection with a plurality of service management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals; the distributed server cluster node management method comprises the following steps:
Acquiring corresponding server node information of a plurality of node management terminals, and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service ends to generate a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management terminal and written into the service management service ends;
Acquiring and updating the node information list and the function service module list according to node configuration information of a database, and writing the updated node information list and the updated function service module list into the database to update the database;
And uploading the updated node information list and the updated function service module list to the service management terminal at regular time through the node management terminal so as to keep alive with the node management terminal and the service management server.
Further, the method for managing the distributed server cluster nodes provided by the invention further comprises the following steps:
When a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction, feeds back a modification state and simultaneously sends node modification information to the service management server;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
Further, after updating the node information list and the function service module list, the method further includes:
if the node management terminal does not upload the updated node information list to the service management terminal at regular time, marking the server node corresponding to the node management terminal as abnormal;
if the service management server cannot acquire node information in the updated database from the service management terminal within the preset time, the information is considered to be overtime;
And if the node information in the updated database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
Further, after the information of each functional service module of the server node is obtained by the corresponding service management terminal and written into the service management server, the method further includes:
A functional service module in the server node sends a query instruction through the corresponding service management terminal, wherein the query instruction comprises functional service module information to be called, and the functional service module information comprises IP port configuration and interface information;
And sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection of the functional service module.
The invention further provides a distributed server cluster node management device which is applied to a node management server, wherein the node management server is in communication connection with a plurality of node management terminals, the plurality of node management terminals are in communication connection with the service management server, the service management server is in communication connection with a plurality of service management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals; the distributed server cluster node management apparatus includes: the device comprises an acquisition unit, an updating unit and a timing uploading unit;
The acquisition unit is used for acquiring corresponding server node information of a plurality of node management terminals and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service ends to generate a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management terminal and written into the service management service ends;
the updating unit is used for acquiring and updating the node information list and the function service module list according to node configuration information of a database, and writing the updated node information list and the updated function service module list into the database so as to update the database;
The timing uploading unit is used for uploading the updated node information list and the updated function service module list to the service management terminal through the node management terminal at regular time so as to keep alive with each other between the node management terminal and the service management server.
Further, the present invention provides a distributed server cluster node management device, which further includes: a node modifying unit;
The node modification module is used for transmitting the node modification instruction to the node management terminal when detecting the node modification instruction sent by the administrator, so that the node management terminal executes the node modification instruction and feeds back the modification state, and meanwhile, node modification information is sent to the service management server;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
Further, the present invention provides a distributed server cluster node management device, which further includes: a service abnormality unit;
The service abnormality module is used for marking the server node corresponding to the node management terminal as abnormal if the node management terminal does not upload the updated node information list to the service management terminal at regular time;
if the service management server cannot acquire node information in the updated database from the service management terminal within the preset time, the information is considered to be overtime;
And if the node information in the updated database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
Further, the present invention provides a distributed server cluster node management device, which further includes: a module calling unit;
The module calling unit is used for sending a query instruction through a corresponding service management terminal by one functional service module in the server node, wherein the query instruction comprises functional service module information to be called, and the functional service module information comprises IP port configuration and interface information;
And sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection of the functional service module.
Further, the invention also provides a terminal device, which comprises a processor, a memory and a computer program stored in the memory and configured to be executed by the processor, wherein the distributed server cluster node management method according to any one of the above is realized when the processor executes the computer program.
Further, the invention also provides a computer readable storage medium, which comprises a stored computer program, wherein when the computer program runs, equipment where the computer readable storage medium is controlled to execute the distributed server cluster node management method according to any one of the above.
Compared with the prior art, the method and the device for managing the distributed server cluster nodes have the following beneficial effects:
The invention relates to a method and a device for managing distributed server cluster nodes, which are used for establishing communication connection for a service management terminal, a node management terminal, a service management server and a node management server in a server cluster, so as to obtain the information of each functional service module of server nodes corresponding to a plurality of service management servers, generate a functional service module list, and simultaneously obtain the information of corresponding server nodes of a plurality of node management terminals, generate a node information list, and realize the separation of the management of the functional service modules and the management of the server node information; compared with the prior art, the invention realizes low coupling by adopting the separated design of service management and node management, and increases the expansibility of the server cluster.
Drawings
FIG. 1 is a flow chart of an embodiment of a distributed server cluster node management method provided by the present invention;
fig. 2 is a schematic structural diagram of an embodiment of a distributed server cluster node management apparatus according to the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and fully with reference to the accompanying drawings, in which it is evident that the embodiments described are only some, but not all embodiments of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Example 1
Referring to fig. 1, fig. 1 is a flow chart of an embodiment of a distributed server cluster node management method provided by the present invention, as shown in fig. 1, the method includes steps 101 to 103, specifically as follows:
Step 101: acquiring corresponding server node information of a plurality of node management terminals, and generating a node information list; and simultaneously acquiring the information of each functional service module of the server node corresponding to the service management server side to generate a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management terminal and written into the service management server side.
In this embodiment, for deployment of a server cluster, a node management terminal and a service management terminal are deployed on all cluster server nodes, and at least two node management servers and at least two service management servers are deployed in the entire server cluster; the node management server is respectively in communication connection with the plurality of node management terminals, the service management server is respectively in communication connection with the plurality of service management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals.
In this embodiment, the service management terminal provides an interface to each functional service module of the server node, and obtains and writes information of all functional service modules of the server node to the service management server through the service management terminal. The service management server side is used for collecting the functional service module information on a plurality of server nodes, information synchronization is realized among different service management server sides, and a single service management server side can collect the functional service module information corresponding to all server nodes in the whole server cluster.
In this embodiment, based on the designs of the service management terminal and the service management server, the function service modules of each server node are easily invoked. As an example in this embodiment, for example, a first service function module on a first server node registers service information of the first service function module to any one service management server through a service management terminal on its own server node, that is, tells the service management server about information of the first service function module, for example, an interface, an IP port configuration, port information, etc. of the first service function module, because of cluster deployment between service management servers, information synchronization between deployed service management servers is caused, that is, information of the first service function module of the first server node exists on any one service management server in the server cluster; and repeating the steps by all the service function modules in other server nodes so that the service function module information on all the server nodes in the server cluster is concentrated on any service management server.
In this embodiment, a functional service module in a server node sends a query instruction through a corresponding service management terminal, where the query instruction includes functional service module information to be called, and the functional service module information includes IP port configuration and interface information; and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection of the functional service module. As an example in this embodiment, when the second service function module of the second server node wants to call the first service function module on the first server node, a query instruction is sent to the service management terminal on the second server node, where the query instruction includes the IP port configuration and interface information of the first service function module on the first server node, so that the service management terminal on the second server node sends the query instruction to the service management server on the second server node, and waits for a reply of the service management server on the second server node, and after the service management server on the second server node replies, the service management terminal on the second server node returns the service management server on the second server node to tell the second service function module of the second server node about the first service function module on the first server node, so that a communication connection can be established between the second service function module of the second server node and the first service function module on the first server node, and call function modules between different server nodes are implemented.
In this embodiment, the node management server is mainly configured to collect system information of all server nodes in the server cluster, and may issue an instruction to the node management terminal, for example, modify IP configuration; simultaneously, web call is provided, and the web can write network configuration information to the server; the node management terminal is used for reporting own server node information, such as information of own node IP address, port, firewall configuration, NAT configuration and the like, to the node management server in real time. In this embodiment, when a plurality of node management terminals report current node information to a single node management service point, due to cluster deployment between node management service ends, information synchronization between deployed node management service ends is caused, that is, node information of any server node exists on any node management service end in a server cluster.
In this embodiment, the node management terminal of any one server node in the server cluster sends registration information to any one service management server through the service management terminal of the server node of the node management terminal, so that the IP and port information of the server node are sent to any one service management server.
In this embodiment, because of the communication connection between the service management server and the plurality of node management terminals, the service management server obtains the corresponding server node information of the plurality of node management terminals, and generates the node information list, and when the ip information of the node management terminal of any one server node in the server cluster changes, the service management server can quickly obtain the ip information of the corresponding node management terminal after the change, and update the node information list.
In this embodiment, the service management server side simultaneously obtains information of each functional service module of the server nodes corresponding to the service management server sides, and generates a functional service information list, so that when an administrator accesses the system through the web side, the module information can be displayed to the administrator through the web side. Meanwhile, the displayed module information can be partially filtered according to the requirements, such as the packet loss rate of the current system, and if the web end has no related requirements, the packet loss rate information of the system can be filtered.
Step 102: and acquiring and updating the node information list and the function service module list according to node configuration information of a database, and writing the updated node information list and the updated function service module list into the database to update the database.
In this embodiment, the node management center of the node management server acquires server node configuration information in the database from the database, updates the node information list and the function service module list acquired in step 101 according to the server node configuration information in the database, and writes the updated node information list and the updated function service module list into the database at the same time, and updates data in the database so as to make the data in the database persistent. In this embodiment, the database type may be Redis or TiDB.
Step 103: and uploading the updated node information list and the updated function service module list to the service management terminal at regular time through the node management terminal so as to keep alive with the node management terminal and the service management server.
In this embodiment, since the update of the database needs to rely on the writing of information by the node management server, there is information synchronization between the node management server and the database, and at this time, the node management server obtains server node information of the entire server cluster and service function module information of the entire server cluster; the node management server uploads the updated node information list and the updated function service module list to the service management terminal of the corresponding server node at regular time through the node management terminals of different server nodes, so that the node management server is mutually kept alive with the node management terminal and the service management server respectively, and meanwhile the service management server and the service management terminal are mutually kept alive, but the node management server is not kept alive with the service management server.
In this embodiment, the node management server may obtain server node information of the entire server cluster and service function module information of the entire server cluster, and when the server node information is changed, the node management server may timely obtain the changed server node information from other server components, and modify the corresponding server node information stored in the node management server.
In this embodiment, for modification of the node information list, when each service component operates normally and keep alive is normal, when a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction and feeds back a modification state, and meanwhile, node modification information is sent to the service management server; and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator. As an example in this embodiment, when an administrator modifies a server node information through a web terminal operation, for example, when modifying an IP port configuration, a node management server receives a node modification instruction of the administrator, and issues a node modification instruction to a service management terminal of a corresponding server node, when the service management terminal executes the node modification instruction to modify the IP port configuration, the service management terminal sends information of successful modification to the service management server, and simultaneously sends the server node information after successful modification to the service management server through the service management terminal on the server node, so that the server node information is synchronously modified on the service management server, at this time, the node management server notifies the database to write the updated server node information, and simultaneously notifies the web terminal that the operation is completed, so that the web terminal prompts the administrator that the operation is successful.
In this embodiment, when an abnormality occurs to a certain server node in the server cluster, the node management terminal of the server node may be abnormal, so that the node management terminal does not upload the updated node information list to the service management terminal at regular time, so that the node management terminal communicates with the node management server, and the server node corresponding to the node management terminal is marked as abnormal, for example, a yellow alarm; if the service management server cannot acquire node information in the updated database from the service management terminal within the preset time, the information is considered to be overtime; meanwhile, if the node management server cannot acquire the updated node information in the database from the service management server, the information is considered to be overtime, and when the node management server and the service management server both consider that the information is overtime, the server node corresponding to the node management terminal is marked as offline, such as a red alarm. In this embodiment, the node management server and the service management server determine twice, and after the confirmation information is overtime, the server node is defined as offline, so that accuracy can be further improved.
Referring to fig. 2, fig. 2 is a schematic structural diagram of an embodiment of a distributed server cluster node management apparatus provided by the present invention, as shown in fig. 2, where the apparatus includes an obtaining unit 201, an updating unit 202, and a timing uploading unit 203, specifically as follows:
The obtaining unit 201 is configured to obtain corresponding server node information of a plurality of node management terminals, and generate a node information list; and simultaneously acquiring the information of each functional service module of the server node corresponding to the service management server side to generate a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management terminal and written into the service management server side.
In this embodiment, for deployment of a server cluster, a node management terminal and a service management terminal are deployed on all cluster server nodes, and at least two node management servers and at least two service management servers are deployed in the entire server cluster; the node management server is respectively in communication connection with the plurality of node management terminals, the service management server is respectively in communication connection with the plurality of service management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals.
In this embodiment, the service management terminal provides an interface to each functional service module of the server node, and obtains and writes information of all functional service modules of the server node to the service management server through the service management terminal. The service management server side is used for collecting the functional service module information on a plurality of server nodes, information synchronization is realized among different service management server sides, and a single service management server side can collect the functional service module information corresponding to all server nodes in the whole server cluster.
In this embodiment, based on the designs of the service management terminal and the service management server, the function service modules of each server node are easily invoked. As an example in this embodiment, for example, a first service function module on a first server node registers service information of the first service function module to any one service management server through a service management terminal on its own server node, that is, tells the service management server about information of the first service function module, for example, an interface, an IP port configuration, port information, etc. of the first service function module, because of cluster deployment between service management servers, information synchronization between deployed service management servers is caused, that is, information of the first service function module of the first server node exists on any one service management server in the server cluster; and repeating the steps by all the service function modules in other server nodes so that the service function module information on all the server nodes in the server cluster is concentrated on any service management server.
In this embodiment, a functional service module in a server node sends a query instruction through a corresponding service management terminal, where the query instruction includes functional service module information to be called, and the functional service module information includes IP port configuration and interface information; and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection of the functional service module. As an example in this embodiment, when the second service function module of the second server node wants to call the first service function module on the first server node, a query instruction is sent to the service management terminal on the second server node, where the query instruction includes the IP port configuration and interface information of the first service function module on the first server node, so that the service management terminal on the second server node sends the query instruction to the service management server on the second server node, and waits for a reply of the service management server on the second server node, and after the service management server on the second server node replies, the service management terminal on the second server node returns the service management server on the second server node to tell the second service function module of the second server node about the first service function module on the first server node, so that a communication connection can be established between the second service function module of the second server node and the first service function module on the first server node, and call function modules between different server nodes are implemented.
In this embodiment, the node management server is mainly configured to collect system information of all server nodes in the server cluster, and may issue an instruction to the node management terminal, for example, modify IP configuration; simultaneously, web call is provided, and the web can write network configuration information to the server; the node management terminal is used for reporting own server node information, such as information of own node IP address, port, firewall configuration, NAT configuration and the like, to the node management server in real time. In this embodiment, when a plurality of node management terminals report current node information to a single node management service point, due to cluster deployment between node management service ends, information synchronization between deployed node management service ends is caused, that is, node information of any server node exists on any node management service end in a server cluster.
In this embodiment, the node management terminal of any one server node in the server cluster sends registration information to any one service management server through the service management terminal of the server node of the node management terminal, so that the IP and port information of the server node are sent to any one service management server.
In this embodiment, because of the communication connection between the service management server and the plurality of node management terminals, the service management server obtains the corresponding server node information of the plurality of node management terminals, and generates the node information list, and when the ip information of the node management terminal of any one server node in the server cluster changes, the service management server can quickly obtain the ip information of the corresponding node management terminal after the change, and update the node information list. In this embodiment, the service management server side simultaneously obtains information of each functional service module of the server nodes corresponding to the service management server sides, and generates a functional service information list, so that when an administrator accesses the system through the web side, the module information can be displayed to the administrator through the web side. Meanwhile, the displayed module information can be partially filtered according to the requirements, such as the packet loss rate of the current system, and if the web end has no related requirements, the packet loss rate information of the system can be filtered.
The updating unit 202 is configured to obtain and update the node information list and the function service module list according to node configuration information of a database, and write the updated node information list and function service module list into the database to update the database.
In this embodiment, the node management center of the node management server acquires server node configuration information in the database from the database, updates the node information list and the function service module list acquired in the acquisition unit 201 according to the server node configuration information in the database, and writes the updated node information list and the updated function service module list into the database, and updates data in the database so as to make the data in the database persistent. In this embodiment, the database type may be Redis or TiDB.
The timing uploading unit 203 is configured to upload the updated node information list and the updated function service module list to the service management terminal at regular time through the node management terminal, so as to keep alive with each other between the node management terminal and the service management server.
In this embodiment, since the update of the database needs to rely on the writing of information by the node management server, there is information synchronization between the node management server and the database, and at this time, the node management server obtains server node information of the entire server cluster and service function module information of the entire server cluster; the node management server uploads the updated node information list and the updated function service module list to the service management terminal of the corresponding server node at regular time through the node management terminals of different server nodes, so that the node management server is mutually kept alive with the node management terminal and the service management server respectively, and meanwhile the service management server and the service management terminal are mutually kept alive, but the node management server is not kept alive with the service management server.
In this embodiment, the node management server may obtain server node information of the entire server cluster and service function module information of the entire server cluster, and when the server node information is changed, the node management server may timely obtain the changed server node information from other server components, and modify the corresponding server node information stored in the node management server.
In this embodiment, for modification of the node information list, when each service component operates normally and keep alive is normal, when a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction and feeds back a modification state, and meanwhile, node modification information is sent to the service management server; and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator. As an example in this embodiment, when an administrator modifies a server node information through a web terminal operation, for example, when modifying an IP port configuration, a node management server receives a node modification instruction of the administrator, and issues a node modification instruction to a service management terminal of a corresponding server node, when the service management terminal executes the node modification instruction to modify the IP port configuration, the service management terminal sends information of successful modification to the service management server, and simultaneously sends the server node information after successful modification to the service management server through the service management terminal on the server node, so that the server node information is synchronously modified on the service management server, at this time, the node management server notifies the database to write the updated server node information, and simultaneously notifies the web terminal that the operation is completed, so that the web terminal prompts the administrator that the operation is successful.
In this embodiment, when an abnormality occurs to a certain server node in the server cluster, the node management terminal of the server node may be abnormal, so that the node management terminal does not upload the updated node information list to the service management terminal at regular time, so that the node management terminal communicates with the node management server, and the server node corresponding to the node management terminal is marked as abnormal, for example, a yellow alarm; if the service management server cannot acquire node information in the updated database from the service management terminal within the preset time, the information is considered to be overtime; meanwhile, if the node management server cannot acquire the updated node information in the database from the service management server, the information is considered to be overtime, and when the node management server and the service management server both consider that the information is overtime, the server node corresponding to the node management terminal is marked as offline, such as a red alarm. In this embodiment, through two judgments of the node management server and the service management server, after the confirmation information is overtime, the server node is defined as offline, so that accuracy can be further improved, and meanwhile, for abnormal offline of the node, the server node can be rapidly displayed on the web side for prompting an administrator, and meanwhile, the server node can be rapidly operated.
In summary, the method and the device for managing the distributed server cluster nodes establish communication connection with a service management terminal, a node management terminal, a service management server and a node management server in a server cluster, so as to obtain the information of each functional service module of the server nodes corresponding to a plurality of service management servers, generate a functional service module list, and simultaneously obtain the information of the corresponding server nodes of a plurality of node management terminals, generate a node information list, realize the separation of the management of the functional service modules and the management of the server nodes, and update a database according to the node information list and the functional service module list; compared with the prior art, the invention realizes low coupling by adopting the separated design of service management and node management, and increases the expansibility of the server cluster.
The foregoing is merely a preferred embodiment of the present invention, and it should be noted that modifications and substitutions can be made by those skilled in the art without departing from the technical principles of the present invention, and these modifications and substitutions should also be considered as being within the scope of the present invention.

Claims (10)

1. The distributed server cluster node management method is characterized by being applied to node management service terminals, wherein one node management terminal and one service management terminal are deployed on all cluster server nodes, at least two node management service terminals and at least two service management service terminals are deployed in the whole server cluster, the node management service terminals are in communication connection with a plurality of node management terminals, the plurality of node management terminals are in communication connection with the service management service terminals, the service management service terminals are in communication connection with a plurality of service management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals; the distributed server cluster node management method comprises the following steps:
Acquiring corresponding server node information of a plurality of node management terminals, and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service ends to generate a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management terminal and written into the service management service ends;
Acquiring and updating the node information list and the function service module list according to node configuration information of a database, and writing the updated node information list and the updated function service module list into the database to update the database;
And uploading the updated node information list and the updated function service module list to the service management terminal at regular time through the node management terminal so as to keep alive with the node management terminal and the service management server.
2. The method for managing distributed server cluster nodes according to claim 1, further comprising:
When a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction, feeds back a modification state and simultaneously sends node modification information to the service management server;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
3. The method for managing nodes of a distributed server cluster according to claim 1, wherein after updating the node information list and the function service module list, further comprising:
if the node management terminal does not upload the updated node information list to the service management terminal at regular time, marking the server node corresponding to the node management terminal as abnormal;
if the service management server cannot acquire node information in the updated database from the service management terminal within the preset time, the information is considered to be overtime;
And if the node information in the updated database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
4. The method for managing distributed server cluster nodes according to claim 1, wherein after each piece of functional service module information of said server node is obtained by the corresponding service management terminal and written into the service management server, further comprising:
A functional service module in the server node sends a query instruction through the corresponding service management terminal, wherein the query instruction comprises functional service module information to be called, and the functional service module information comprises IP port configuration and interface information;
And sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection of the functional service module.
5. The distributed server cluster node management device is characterized by being applied to node management service terminals, wherein one node management terminal and one service management terminal are deployed on all cluster server nodes, at least two node management service terminals and at least two service management service terminals are deployed in the whole server cluster, the node management service terminals are in communication connection with a plurality of node management terminals, the plurality of node management terminals are in communication connection with the service management service terminals, the service management service terminals are in communication connection with a plurality of service management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals; the distributed server cluster node management apparatus includes: the device comprises an acquisition unit, an updating unit and a timing uploading unit;
The acquisition unit is used for acquiring corresponding server node information of a plurality of node management terminals and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service ends to generate a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management terminal and written into the service management service ends;
the updating unit is used for acquiring and updating the node information list and the function service module list according to node configuration information of a database, and writing the updated node information list and the updated function service module list into the database so as to update the database;
The timing uploading unit is used for uploading the updated node information list and the updated function service module list to the service management terminal through the node management terminal at regular time so as to keep alive with each other between the node management terminal and the service management server.
6. The distributed server cluster node management apparatus of claim 5, further comprising: a node modifying unit;
The node modification module is used for transmitting the node modification instruction to the node management terminal when detecting the node modification instruction sent by the administrator, so that the node management terminal executes the node modification instruction and feeds back the modification state, and meanwhile, node modification information is sent to the service management server;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
7. The distributed server cluster node management apparatus of claim 5, further comprising: a service abnormality unit;
The service abnormality module is used for marking the server node corresponding to the node management terminal as abnormal if the node management terminal does not upload the updated node information list to the service management terminal at regular time;
if the service management server cannot acquire node information in the updated database from the service management terminal within the preset time, the information is considered to be overtime;
And if the node information in the updated database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
8. The distributed server cluster node management apparatus of claim 5, further comprising: a module calling unit;
The module calling unit is used for sending a query instruction through a corresponding service management terminal by one functional service module in the server node, wherein the query instruction comprises functional service module information to be called, and the functional service module information comprises IP port configuration and interface information;
And sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection of the functional service module.
9. A terminal device comprising a processor, a memory and a computer program stored in the memory and configured to be executed by the processor, the processor implementing the distributed server cluster node management method according to any one of claims 1 to 4 when executing the computer program.
10. A computer readable storage medium, characterized in that the computer readable storage medium comprises a stored computer program, wherein the computer program when run controls a device in which the computer readable storage medium is located to perform the distributed server cluster node management method according to any one of claims 1 to 4.
CN202111218604.XA 2021-10-19 2021-10-19 Distributed server cluster node management method and device Active CN114050960B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111218604.XA CN114050960B (en) 2021-10-19 2021-10-19 Distributed server cluster node management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111218604.XA CN114050960B (en) 2021-10-19 2021-10-19 Distributed server cluster node management method and device

Publications (2)

Publication Number Publication Date
CN114050960A CN114050960A (en) 2022-02-15
CN114050960B true CN114050960B (en) 2024-05-14

Family

ID=80205555

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111218604.XA Active CN114050960B (en) 2021-10-19 2021-10-19 Distributed server cluster node management method and device

Country Status (1)

Country Link
CN (1) CN114050960B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115361439B (en) * 2022-07-12 2024-03-15 北京奇艺世纪科技有限公司 Node management method, node management device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106789202A (en) * 2016-12-08 2017-05-31 努比亚技术有限公司 service management device and method
CN107493199A (en) * 2017-08-31 2017-12-19 郑州云海信息技术有限公司 A kind of distributed type assemblies management method and system
CN109327509A (en) * 2018-09-11 2019-02-12 武汉魅瞳科技有限公司 A kind of distributive type Computational frame of the lower coupling of master/slave framework
CN111291101A (en) * 2019-12-17 2020-06-16 航天信息股份有限公司 Cluster management method and system
CN112162857A (en) * 2020-09-24 2021-01-01 珠海格力电器股份有限公司 Cluster server node management system
CN112272209A (en) * 2020-09-28 2021-01-26 苏州浪潮智能科技有限公司 Gateway separation method, device, equipment and storage medium in distributed object storage

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9071631B2 (en) * 2012-08-09 2015-06-30 International Business Machines Corporation Service management roles of processor nodes in distributed node service management
US20160043892A1 (en) * 2014-07-22 2016-02-11 Intigua, Inc. System and method for cloud based provisioning, configuring, and operating management tools
US10911342B2 (en) * 2018-11-30 2021-02-02 Sap Se Distributed monitoring in clusters with self-healing

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106789202A (en) * 2016-12-08 2017-05-31 努比亚技术有限公司 service management device and method
CN107493199A (en) * 2017-08-31 2017-12-19 郑州云海信息技术有限公司 A kind of distributed type assemblies management method and system
CN109327509A (en) * 2018-09-11 2019-02-12 武汉魅瞳科技有限公司 A kind of distributive type Computational frame of the lower coupling of master/slave framework
CN111291101A (en) * 2019-12-17 2020-06-16 航天信息股份有限公司 Cluster management method and system
CN112162857A (en) * 2020-09-24 2021-01-01 珠海格力电器股份有限公司 Cluster server node management system
CN112272209A (en) * 2020-09-28 2021-01-26 苏州浪潮智能科技有限公司 Gateway separation method, device, equipment and storage medium in distributed object storage

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Hierarchical distributed management clustering protocol for wirelss sensor networks;Amin Shahraki et al.;Telecommunication Systems;20160901;全文 *
基于分布式协调服务的集群管理的设计与实现;李文韵;《中国优秀硕士学位论文全文数据库》;20150415;全文 *

Also Published As

Publication number Publication date
CN114050960A (en) 2022-02-15

Similar Documents

Publication Publication Date Title
US8086721B2 (en) Network resource management in a network device
US7961594B2 (en) Methods and systems for history analysis for access paths in networks
US8775589B2 (en) Distributed network management system and method
US7860016B1 (en) Method and apparatus for configuration and analysis of network routing protocols
US20070260721A1 (en) Physical server discovery and correlation
CN110445697B (en) Video big data cloud platform equipment access service method
KR20090085098A (en) Method for managing a communication between a server device and a customer device
CN113852673B (en) Signaling tracking method and device of core network element and storage medium
CN101621548A (en) Method and system for realizing terminal resource sharing based on peer connection system
JP5617304B2 (en) Switching device, information processing device, and fault notification control program
CN114050960B (en) Distributed server cluster node management method and device
CN111417037A (en) Management and control system of optical transport network
CN111130833B (en) Train network automatic configuration method and computer readable storage medium
CN110740355A (en) Equipment monitoring method and device, electronic equipment and storage medium
CN114553867A (en) Cloud-native cross-cloud network monitoring method and device and storage medium
US7792045B1 (en) Method and apparatus for configuration and analysis of internal network routing protocols
CN110290163A (en) A kind of data processing method and device
CN102082652A (en) Method, device and system for acquiring network clock topological structure
CN108900603A (en) A kind of server discovery methods, devices and systems
CN115426258B (en) Information configuration method, device, switch and readable storage medium
CN105049259A (en) Network equipment, and server, system and method used for network equipment
JP2003507976A (en) Comprehensive alignment process in multi-manager environment
CN108683540B (en) Cross-platform lightweight implementation method and system for network management protocol channel
CN113194115A (en) Method for automatically deploying client, network equipment and storage medium
CN110520838B (en) Method for configuring at least one device in a network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant