CN114050960A - Distributed server cluster node management method and device - Google Patents

Distributed server cluster node management method and device Download PDF

Info

Publication number
CN114050960A
CN114050960A CN202111218604.XA CN202111218604A CN114050960A CN 114050960 A CN114050960 A CN 114050960A CN 202111218604 A CN202111218604 A CN 202111218604A CN 114050960 A CN114050960 A CN 114050960A
Authority
CN
China
Prior art keywords
node
service
server
management
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111218604.XA
Other languages
Chinese (zh)
Other versions
CN114050960B (en
Inventor
赵鹏松
钟金超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Yealink Network Technology Co Ltd
Original Assignee
Xiamen Yealink Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Yealink Network Technology Co Ltd filed Critical Xiamen Yealink Network Technology Co Ltd
Priority to CN202111218604.XA priority Critical patent/CN114050960B/en
Publication of CN114050960A publication Critical patent/CN114050960A/en
Application granted granted Critical
Publication of CN114050960B publication Critical patent/CN114050960B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/04Network management architectures or arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention discloses a distributed server cluster node management method and a device, which are used for establishing communication connection with a service management terminal, a node management terminal, a service management server and a node management server in a server cluster to acquire information of each functional service module of a plurality of server nodes corresponding to the service management server, generate a functional service module list, acquire information of the server nodes corresponding to the node management terminals at the same time, generate a node information list, realize the separation of the functional service module management and the server node information management, and update a database according to the node information list and the functional service module list; compared with the prior art, the invention realizes low coupling and increases the expansibility of the server cluster by adopting the separated design of service management and node management.

Description

Distributed server cluster node management method and device
Technical Field
The invention relates to the technical field of embedded equipment clusters, in particular to a distributed server cluster node management method and device.
Background
In the server cluster, the high availability of the server cluster is influenced by the state of the server node, and the high availability of the server cluster is reduced by online and offline of the server node; in order to improve the high availability of the server cluster, it is required that some nodes of the server are offline due to failure, and the server nodes joining the cluster have no influence on the cluster service, so that the server nodes of the server cluster need to be managed, and the server nodes are as simple and stateless as possible. Meanwhile, when a new service function module is added, the bottom layer is good in adaptation compatibility, and the problem that the upper layer service is greatly modified and designed due to the complexity of a layer architecture needs to be reduced as much as possible.
Disclosure of Invention
The technical problem to be solved by the invention is as follows: the distributed server cluster node management method and device are provided, low coupling is achieved by adopting a separated design of service management and node management, and expansibility of a server cluster is increased.
In order to solve the technical problem, the invention provides a distributed server cluster node management method, which is applied to a node management server, wherein the node management server is in communication connection with a plurality of node management terminals, the plurality of node management terminals are in communication connection with a service management server, the service management server is in communication connection with a plurality of service node management terminals, and the plurality of node management terminals are in communication connection with the plurality of service management terminals correspondingly; the distributed server cluster node management method comprises the following steps:
acquiring corresponding server node information of a plurality of node management terminals, and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service terminals, and generating a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management service terminal and written into the service management service terminal;
acquiring and updating the node information list and the functional service module list according to node configuration information of a database, and writing the updated node information list and the updated functional service module list into the database to update the database;
and uploading the updated node information list and the updated functional service module list to the service management terminal through the node management terminal so as to keep the node management terminal and the service management server terminal mutually alive.
Further, the method for managing distributed server cluster nodes provided by the present invention further includes:
when a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction, feeds back a modification state and sends node modification information to the service management server side;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
Further, after updating the node information list and the functional service module list, the method further includes:
if the node management terminal does not upload the updated node information list to the service management terminal at regular time, marking the server node corresponding to the node management terminal as abnormal;
if the service management server side cannot acquire the updated node information in the database from the service management terminal within the preset time, the information is considered to be overtime;
and meanwhile, if the updated node information in the database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
Further, after the information of each functional service module of the server node is obtained by the corresponding service management terminal and written into the service management server, the method further includes:
a functional service module in the server node sends a query instruction through the corresponding service management terminal, wherein the query instruction comprises information of the functional service module to be called, and the information of the functional service module comprises IP port configuration and interface information;
and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection between the functional service modules.
The invention further provides a distributed server cluster node management device, which is applied to a node management server, wherein the node management server is in communication connection with a plurality of node management terminals, the plurality of node management terminals are in communication connection with the service management server, the service management server is in communication connection with a plurality of service node management terminals, and the plurality of node management terminals are in communication connection with the plurality of service management terminals correspondingly; the distributed server cluster node management device comprises: the device comprises an acquisition unit, an updating unit and a timing uploading unit;
the acquisition unit is used for acquiring corresponding server node information of a plurality of node management terminals and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service terminals, and generating a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management service terminal and written into the service management service terminal;
the updating unit is used for acquiring and updating the node information list and the functional service module list according to node configuration information of a database, and writing the updated node information list and the updated functional service module list into the database so as to update the database;
the timing uploading unit is used for uploading the updated node information list and the updated functional service module list to the service management terminal through the node management terminal so as to keep the node management terminal and the service management server terminal mutually alive.
Further, the distributed server cluster node management apparatus provided by the present invention further includes: a node modification unit;
the node modification module is used for transmitting a node modification instruction to the node management terminal when the node modification instruction sent by an administrator is detected, so that the node management terminal executes the node modification instruction, feeds back a modification state and sends node modification information to the service management server side;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
Further, the distributed server cluster node management apparatus provided by the present invention further includes: a service exception unit;
the service exception module is used for marking the server node corresponding to the node management terminal as exception if the node management terminal does not upload the updated node information list to the service management terminal at regular time;
if the service management server side cannot acquire the updated node information in the database from the service management terminal within the preset time, the information is considered to be overtime;
and meanwhile, if the updated node information in the database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
Further, the distributed server cluster node management apparatus provided by the present invention further includes: a module calling unit;
the module calling unit is used for sending a query instruction by one functional service module in the server node through the corresponding service management terminal, wherein the query instruction comprises information of the functional service module to be called, and the information of the functional service module comprises IP port configuration and interface information;
and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection between the functional service modules.
Further, the present invention also provides a terminal device, which includes a processor, a memory, and a computer program stored in the memory and configured to be executed by the processor, and when the processor executes the computer program, the distributed server cluster node management method according to any one of the above items is implemented.
Further, the present invention also provides a computer-readable storage medium, where the computer-readable storage medium includes a stored computer program, where when the computer program runs, the apparatus where the computer-readable storage medium is located is controlled to execute the distributed server cluster node management method according to any one of the above items.
Compared with the prior art, the distributed server cluster node management method and the distributed server cluster node management device provided by the embodiment of the invention have the following beneficial effects:
the invention relates to a distributed server cluster node management method and a device, which are used for establishing communication connection with a service management terminal, a node management terminal, a service management server and a node management server in a server cluster to acquire information of each functional service module of a plurality of server nodes corresponding to the service management server, generate a functional service module list, acquire information of the server nodes corresponding to the node management terminals at the same time, generate a node information list and realize the separation of functional service module management and server node information management; compared with the prior art, the invention realizes low coupling and increases the expansibility of the server cluster by adopting the separated design of service management and node management.
Drawings
Fig. 1 is a schematic flowchart of an embodiment of a distributed server cluster node management method provided in the present invention;
fig. 2 is a schematic structural diagram of an embodiment of a distributed server cluster node management apparatus provided in the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the present application, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example 1
Referring to fig. 1, fig. 1 is a schematic flowchart of an embodiment of a distributed server cluster node management method provided by the present invention, and as shown in fig. 1, the method includes steps 101 to 103, which are specifically as follows:
step 101: acquiring corresponding server node information of a plurality of node management terminals, and generating a node information list; and simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service terminals, and generating a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management service terminal and written into the service management service terminal.
In this embodiment, for the deployment of a server cluster, a node management terminal and a service management terminal are deployed on all cluster server nodes, and at least two node management server sides and at least two service management server sides are deployed in the whole server cluster; the node management server is respectively in communication connection with the plurality of node management terminals, the service management server is respectively in communication connection with the plurality of service node management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals.
In this embodiment, the service management terminal provides an interface to each functional service module of the server node, and acquires and writes information of all functional service modules of the server node to the service management server through the service management terminal. The service management server is used for collecting the functional service module information on the plurality of server nodes, information synchronization is realized among different service management servers, and a single service management server can collect the functional service module information corresponding to all the server nodes in the whole server cluster.
In this embodiment, based on the design of the service management terminal and the service management server, mutual calling between the functional service modules of each server node is easily implemented. As an example in this embodiment, for example, a first service function module on a first server node registers service information of the first service function module to any one service management server through a service management terminal on the server node, that is, the first service function module informs the service management server of the information of the first service function module, such as an interface, an IP port configuration, port information, and the like of the first service function module, due to cluster deployment among the service management servers, information synchronization among the deployed service management servers is caused, that is, information of the first service function module of the first server node exists on any one service management server in a server cluster; and repeating the steps by all the service function modules in other server nodes so as to centralize the information of the service function modules on all the server nodes in the server cluster on any service management server side.
In this embodiment, a functional service module in a server node sends a query instruction through a corresponding service management terminal, where the query instruction includes information of the functional service module to be called, and the information of the functional service module includes IP port configuration and interface information; and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection between the functional service modules. As an example in this embodiment, when a second service function module of a second server node wants to call a first service function module on a first server node, a query instruction is sent to a service management terminal on the second server node, where the query instruction includes IP port configuration and interface information of the first service function module on the first server node, so that the service management terminal on the second server node sends the query instruction to the service management server on the second server node and waits for a reply from the service management server on the second server node, and after the service management server on the second server node replies, the service management terminal on the second server node informs the second service function module of the second server node of information about the first service function module on the first server node replied by the service management server on the second server node, therefore, communication connection can be established between the second service function module of the second server node and the first service function module on the first server node, and calling between service function modules of different server nodes is realized.
In this embodiment, the node management server is mainly used to collect system information of all server nodes in the server cluster, and may issue an instruction to the node management terminal, such as modifying IP configuration; meanwhile, the network configuration information is provided for web calling, and the web can write the network configuration information to the server; the node management terminal is used for reporting server node information of the node management terminal to the node management server in real time, such as IP address, port, firewall configuration, NAT configuration and the like of the node management server. In this embodiment, when a plurality of node management terminals report current node information to a single node management service point, due to cluster deployment among node management service terminals, information synchronization among the deployed node management service terminals is caused, that is, node information of any server node exists on any node management service terminal in a server cluster.
In this embodiment, the node management terminal of any server node in the server cluster sends the registration information to any service management server through the service management terminal of its own server node, so that the IP and port information of its own server node are sent to any service management server.
In this embodiment, due to the communication connection between the service management server and the plurality of node management terminals, the service management server obtains the server node information corresponding to the plurality of node management terminals and generates the node information list, and when ip information of a node management terminal of any one server node in the server cluster changes, the service management server can quickly obtain the changed ip information of the corresponding node management terminal and update the node information list.
In this embodiment, the service management server obtains information of each functional service module of the server node corresponding to a plurality of service management servers at the same time, and generates a functional service information list, so that when an administrator accesses the system through the web server, the administrator can display the module information through the web server. Meanwhile, partial filtering can be performed on the displayed module information according to requirements, for example, the packet loss rate of the current system is what, if the web end has no relevant requirements, the packet loss rate information of the system can be filtered.
Step 102: and acquiring and updating the node information list and the functional service module list according to the node configuration information of the database, and writing the updated node information list and the updated functional service module list into the database so as to update the database.
In this embodiment, the node management center of the node management server obtains server node configuration information in the database from the database, updates the node information list and the functional service module list obtained in step 101 according to the server node configuration information in the database, writes the updated node information list and the updated functional service module list into the database, and updates data in the database, so as to make the data in the database persistent. In this embodiment, the database type may be Redis or TiDB.
Step 103: and uploading the updated node information list and the updated functional service module list to the service management terminal through the node management terminal so as to keep the node management terminal and the service management server terminal mutually alive.
In this embodiment, since the update of the database needs to rely on the writing of the node management server to the information, information synchronization exists between the node management server and the database, and at this time, the node management server obtains the server node information of the whole server cluster and the service function module information of the whole server cluster; the node management server uploads the updated node information list and the updated functional service module list to the service management terminal of the corresponding server node at regular time through the node management terminals of different server nodes, so that the node management server is kept alive with the node management terminal and the service management server respectively, and meanwhile, the service management server and the service management terminal are kept alive with each other, but the node management server is not kept alive with the service management server.
In this embodiment, the node management server may obtain server node information of the entire server cluster and service function module information of the entire server cluster, and when the server node information changes, the node management server may timely obtain the changed server node information from other server components, and modify corresponding server node information stored in the node management server.
In this embodiment, for modification of the node information list, when each service component normally operates and keeps alive normally, when a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction, feeds back a modification state, and sends node modification information to the service management server; and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator. As an example in this embodiment, when an administrator modifies a server node information through a web-side operation, if the IP port configuration is modified, the node management server side sends the node modification instruction to the service management terminal of the corresponding server node when receiving the node modification instruction of the administrator, when the service management terminal executes the node modification instruction to modify the IP port configuration, the information of successful modification is sent to the service management server, meanwhile, the server node information after being successfully modified is sent to a service management server through a service management terminal on the server node, so that the service management server side can synchronously modify the server node information, at this time, the node management server can inform the database to write the updated server node information, and meanwhile, the web end is informed that the operation is finished so as to prompt an administrator to operate successfully on the web end.
In this embodiment, when a server node in a server cluster is abnormal, a node management terminal of the server node is abnormal, which causes that the node management terminal does not upload an updated node information list to a service management terminal at regular time, so that the node management terminal and a node management server communicate with each other, and the server node corresponding to the node management terminal is marked as abnormal, for example, a yellow alarm; if the service management server side cannot acquire the updated node information in the database from the service management terminal within the preset time, the information is considered to be overtime; meanwhile, if the node management server cannot acquire the updated node information in the database from the service management server, the information is considered to be overtime, and when both the node management server and the service management server consider the information to be overtime, the server node corresponding to the node management terminal is marked as offline, for example, a red alarm. In this embodiment, the node management server and the service management server perform two judgments, and after the confirmation information is overtime, the server node is defined as offline, so that the accuracy can be further improved.
Referring to fig. 2, fig. 2 is a schematic structural diagram of an embodiment of a distributed server cluster node management apparatus provided in the present invention, as shown in fig. 2, the apparatus includes an obtaining unit 201, an updating unit 202, and a timing uploading unit 203, which are specifically as follows:
the acquiring unit 201 is configured to acquire server node information corresponding to a plurality of node management terminals, and generate a node information list; and simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service terminals, and generating a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management service terminal and written into the service management service terminal.
In this embodiment, for the deployment of a server cluster, a node management terminal and a service management terminal are deployed on all cluster server nodes, and at least two node management server sides and at least two service management server sides are deployed in the whole server cluster; the node management server is respectively in communication connection with the plurality of node management terminals, the service management server is respectively in communication connection with the plurality of service node management terminals, and the plurality of node management terminals are correspondingly in communication connection with the plurality of service management terminals.
In this embodiment, the service management terminal provides an interface to each functional service module of the server node, and acquires and writes information of all functional service modules of the server node to the service management server through the service management terminal. The service management server is used for collecting the functional service module information on the plurality of server nodes, information synchronization is realized among different service management servers, and a single service management server can collect the functional service module information corresponding to all the server nodes in the whole server cluster.
In this embodiment, based on the design of the service management terminal and the service management server, mutual calling between the functional service modules of each server node is easily implemented. As an example in this embodiment, for example, a first service function module on a first server node registers service information of the first service function module to any one service management server through a service management terminal on the server node, that is, the first service function module informs the service management server of the information of the first service function module, such as an interface, an IP port configuration, port information, and the like of the first service function module, due to cluster deployment among the service management servers, information synchronization among the deployed service management servers is caused, that is, information of the first service function module of the first server node exists on any one service management server in a server cluster; and repeating the steps by all the service function modules in other server nodes so as to centralize the information of the service function modules on all the server nodes in the server cluster on any service management server side.
In this embodiment, a functional service module in a server node sends a query instruction through a corresponding service management terminal, where the query instruction includes information of the functional service module to be called, and the information of the functional service module includes IP port configuration and interface information; and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection between the functional service modules. As an example in this embodiment, when a second service function module of a second server node wants to call a first service function module on a first server node, a query instruction is sent to a service management terminal on the second server node, where the query instruction includes IP port configuration and interface information of the first service function module on the first server node, so that the service management terminal on the second server node sends the query instruction to the service management server on the second server node and waits for a reply from the service management server on the second server node, and after the service management server on the second server node replies, the service management terminal on the second server node informs the second service function module of the second server node of information about the first service function module on the first server node replied by the service management server on the second server node, therefore, communication connection can be established between the second service function module of the second server node and the first service function module on the first server node, and calling between service function modules of different server nodes is realized.
In this embodiment, the node management server is mainly used to collect system information of all server nodes in the server cluster, and may issue an instruction to the node management terminal, such as modifying IP configuration; meanwhile, the network configuration information is provided for web calling, and the web can write the network configuration information to the server; the node management terminal is used for reporting server node information of the node management terminal to the node management server in real time, such as IP address, port, firewall configuration, NAT configuration and the like of the node management server. In this embodiment, when a plurality of node management terminals report current node information to a single node management service point, due to cluster deployment among node management service terminals, information synchronization among the deployed node management service terminals is caused, that is, node information of any server node exists on any node management service terminal in a server cluster.
In this embodiment, the node management terminal of any server node in the server cluster sends the registration information to any service management server through the service management terminal of its own server node, so that the IP and port information of its own server node are sent to any service management server.
In this embodiment, due to the communication connection between the service management server and the plurality of node management terminals, the service management server obtains the server node information corresponding to the plurality of node management terminals and generates the node information list, and when ip information of a node management terminal of any one server node in the server cluster changes, the service management server can quickly obtain the changed ip information of the corresponding node management terminal and update the node information list. In this embodiment, the service management server obtains information of each functional service module of the server node corresponding to a plurality of service management servers at the same time, and generates a functional service information list, so that when an administrator accesses the system through the web server, the administrator can display the module information through the web server. Meanwhile, partial filtering can be performed on the displayed module information according to requirements, for example, the packet loss rate of the current system is what, if the web end has no relevant requirements, the packet loss rate information of the system can be filtered.
The updating unit 202 is configured to acquire and update the node information list and the functional service module list according to node configuration information of a database, and write the updated node information list and functional service module list into the database to update the database.
In this embodiment, the node management center of the node management server acquires server node configuration information in the database from the database, updates the node information list and the functional service module list acquired in the acquisition unit 201 according to the server node configuration information in the database, writes the updated node information list and the updated functional service module list into the database, and updates data in the database, so as to make the data in the database persistent. In this embodiment, the database type may be Redis or TiDB.
The timing uploading unit 203 is configured to upload the updated node information list and the updated functional service module list to the service management terminal at regular time through the node management terminal, so that the node management terminal and the service management server are kept alive with each other.
In this embodiment, since the update of the database needs to rely on the writing of the node management server to the information, information synchronization exists between the node management server and the database, and at this time, the node management server obtains the server node information of the whole server cluster and the service function module information of the whole server cluster; the node management server uploads the updated node information list and the updated functional service module list to the service management terminal of the corresponding server node at regular time through the node management terminals of different server nodes, so that the node management server is kept alive with the node management terminal and the service management server respectively, and meanwhile, the service management server and the service management terminal are kept alive with each other, but the node management server is not kept alive with the service management server.
In this embodiment, the node management server may obtain server node information of the entire server cluster and service function module information of the entire server cluster, and when the server node information changes, the node management server may timely obtain the changed server node information from other server components, and modify corresponding server node information stored in the node management server.
In this embodiment, for modification of the node information list, when each service component normally operates and keeps alive normally, when a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction, feeds back a modification state, and sends node modification information to the service management server; and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator. As an example in this embodiment, when an administrator modifies a server node information through a web-side operation, if the IP port configuration is modified, the node management server side sends the node modification instruction to the service management terminal of the corresponding server node when receiving the node modification instruction of the administrator, when the service management terminal executes the node modification instruction to modify the IP port configuration, the information of successful modification is sent to the service management server, meanwhile, the server node information after being successfully modified is sent to a service management server through a service management terminal on the server node, so that the service management server side can synchronously modify the server node information, at this time, the node management server can inform the database to write the updated server node information, and meanwhile, the web end is informed that the operation is finished so as to prompt an administrator to operate successfully on the web end.
In this embodiment, when a server node in a server cluster is abnormal, a node management terminal of the server node is abnormal, which causes that the node management terminal does not upload an updated node information list to a service management terminal at regular time, so that the node management terminal and a node management server communicate with each other, and the server node corresponding to the node management terminal is marked as abnormal, for example, a yellow alarm; if the service management server side cannot acquire the updated node information in the database from the service management terminal within the preset time, the information is considered to be overtime; meanwhile, if the node management server cannot acquire the updated node information in the database from the service management server, the information is considered to be overtime, and when both the node management server and the service management server consider the information to be overtime, the server node corresponding to the node management terminal is marked as offline, for example, a red alarm. In this embodiment, the server node is defined as offline only after the confirmation information is overtime through two judgments by the node management server and the service management server, so that the accuracy can be further improved, and meanwhile, the abnormal offline of the node can be quickly displayed on the web side for prompting a manager and helping to operate the server node quickly.
In summary, the present invention provides a distributed server cluster node management method and apparatus, wherein a communication connection is established between a service management terminal, a node management terminal, a service management server and a node management server in a server cluster, so as to obtain information of each functional service module of a plurality of server nodes corresponding to the service management server, generate a functional service module list, obtain information of server nodes corresponding to a plurality of node management terminals, generate a node information list, implement separation of functional service module management and server node information management, and update a database according to the node information list and the functional service module list; compared with the prior art, the invention realizes low coupling and increases the expansibility of the server cluster by adopting the separated design of service management and node management.
The above description is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and substitutions can be made without departing from the technical principle of the present invention, and these modifications and substitutions should also be regarded as the protection scope of the present invention.

Claims (10)

1. A distributed server cluster node management method is characterized by being applied to a node management server, wherein the node management server is in communication connection with a plurality of node management terminals, the node management terminals are in communication connection with a service management server, the service management server is in communication connection with a plurality of service node management terminals, and the node management terminals are correspondingly in communication connection with the service management terminals; the distributed server cluster node management method comprises the following steps:
acquiring corresponding server node information of a plurality of node management terminals, and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service terminals, and generating a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management service terminal and written into the service management service terminal;
acquiring and updating the node information list and the functional service module list according to node configuration information of a database, and writing the updated node information list and the updated functional service module list into the database to update the database;
and uploading the updated node information list and the updated functional service module list to the service management terminal through the node management terminal so as to keep the node management terminal and the service management server terminal mutually alive.
2. The distributed server cluster node management method of claim 1, further comprising:
when a node modification instruction sent by an administrator is detected, the node modification instruction is transmitted to the node management terminal, so that the node management terminal executes the node modification instruction, feeds back a modification state and sends node modification information to the service management server side;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
3. The method for managing distributed server cluster nodes according to claim 1, wherein after updating the node information list and the functional service module list, the method further comprises:
if the node management terminal does not upload the updated node information list to the service management terminal at regular time, marking the server node corresponding to the node management terminal as abnormal;
if the service management server side cannot acquire the updated node information in the database from the service management terminal within the preset time, the information is considered to be overtime;
and meanwhile, if the updated node information in the database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
4. The method according to claim 1, wherein after the information of each functional service module of the server node is obtained by the corresponding service management terminal and written into the service management server, the method further comprises:
a functional service module in the server node sends a query instruction through the corresponding service management terminal, wherein the query instruction comprises information of the functional service module to be called, and the information of the functional service module comprises IP port configuration and interface information;
and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection between the functional service modules.
5. The distributed server cluster node management device is applied to a node management server, wherein the node management server is in communication connection with a plurality of node management terminals, the node management terminals are in communication connection with a service management server, the service management server is in communication connection with a plurality of service node management terminals, and the node management terminals are correspondingly in communication connection with the service management terminals; the distributed server cluster node management device comprises: the device comprises an acquisition unit, an updating unit and a timing uploading unit;
the acquisition unit is used for acquiring corresponding server node information of a plurality of node management terminals and generating a node information list; simultaneously acquiring information of each functional service module of a server node corresponding to a plurality of service management service terminals, and generating a functional service module list, wherein the information of each functional service module of the server node is acquired by the corresponding service management service terminal and written into the service management service terminal;
the updating unit is used for acquiring and updating the node information list and the functional service module list according to node configuration information of a database, and writing the updated node information list and the updated functional service module list into the database so as to update the database;
the timing uploading unit is used for uploading the updated node information list and the updated functional service module list to the service management terminal through the node management terminal so as to keep the node management terminal and the service management server terminal mutually alive.
6. The distributed server cluster node management apparatus of claim 5, further comprising: a node modification unit;
the node modification module is used for transmitting a node modification instruction to the node management terminal when the node modification instruction sent by an administrator is detected, so that the node management terminal executes the node modification instruction, feeds back a modification state and sends node modification information to the service management server side;
and when the fed-back modification state is successful, writing the modified node information into the database, and sending an operation state prompt to the administrator.
7. The distributed server cluster node management apparatus of claim 5, further comprising: a service exception unit;
the service exception module is used for marking the server node corresponding to the node management terminal as exception if the node management terminal does not upload the updated node information list to the service management terminal at regular time;
if the service management server side cannot acquire the updated node information in the database from the service management terminal within the preset time, the information is considered to be overtime;
and meanwhile, if the updated node information in the database cannot be acquired from the service management server, marking the server node corresponding to the node management terminal as offline.
8. The distributed server cluster node management apparatus of claim 5, further comprising: a module calling unit;
the module calling unit is used for sending a query instruction by one functional service module in the server node through the corresponding service management terminal, wherein the query instruction comprises information of the functional service module to be called, and the information of the functional service module comprises IP port configuration and interface information;
and sending the query instruction to the service management server through the service management terminal, and acquiring the information of the functional service module to be called from the service management server so as to establish communication connection between the functional service modules.
9. A terminal device comprising a processor, a memory, and a computer program stored in the memory and configured to be executed by the processor, the processor implementing the distributed server cluster node management method of any one of claims 1 to 4 when executing the computer program.
10. A computer-readable storage medium, comprising a stored computer program, wherein the computer program, when executed, controls an apparatus in which the computer-readable storage medium is located to perform the distributed server cluster node management method according to any one of claims 1 to 4.
CN202111218604.XA 2021-10-19 2021-10-19 Distributed server cluster node management method and device Active CN114050960B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111218604.XA CN114050960B (en) 2021-10-19 2021-10-19 Distributed server cluster node management method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111218604.XA CN114050960B (en) 2021-10-19 2021-10-19 Distributed server cluster node management method and device

Publications (2)

Publication Number Publication Date
CN114050960A true CN114050960A (en) 2022-02-15
CN114050960B CN114050960B (en) 2024-05-14

Family

ID=80205555

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111218604.XA Active CN114050960B (en) 2021-10-19 2021-10-19 Distributed server cluster node management method and device

Country Status (1)

Country Link
CN (1) CN114050960B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115361439A (en) * 2022-07-12 2022-11-18 北京奇艺世纪科技有限公司 Node management method, node management device, electronic equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140047088A1 (en) * 2012-08-09 2014-02-13 International Business Machines Corporation Service management roles of processor nodes in distributed node service management
US20160043892A1 (en) * 2014-07-22 2016-02-11 Intigua, Inc. System and method for cloud based provisioning, configuring, and operating management tools
CN106789202A (en) * 2016-12-08 2017-05-31 努比亚技术有限公司 service management device and method
CN107493199A (en) * 2017-08-31 2017-12-19 郑州云海信息技术有限公司 A kind of distributed type assemblies management method and system
CN109327509A (en) * 2018-09-11 2019-02-12 武汉魅瞳科技有限公司 A kind of distributive type Computational frame of the lower coupling of master/slave framework
US20200177481A1 (en) * 2018-11-30 2020-06-04 Sap Se Distributed monitoring in clusters with self-healing
CN111291101A (en) * 2019-12-17 2020-06-16 航天信息股份有限公司 Cluster management method and system
CN112162857A (en) * 2020-09-24 2021-01-01 珠海格力电器股份有限公司 Cluster server node management system
CN112272209A (en) * 2020-09-28 2021-01-26 苏州浪潮智能科技有限公司 Gateway separation method, device, equipment and storage medium in distributed object storage

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140047088A1 (en) * 2012-08-09 2014-02-13 International Business Machines Corporation Service management roles of processor nodes in distributed node service management
US20160043892A1 (en) * 2014-07-22 2016-02-11 Intigua, Inc. System and method for cloud based provisioning, configuring, and operating management tools
CN106789202A (en) * 2016-12-08 2017-05-31 努比亚技术有限公司 service management device and method
CN107493199A (en) * 2017-08-31 2017-12-19 郑州云海信息技术有限公司 A kind of distributed type assemblies management method and system
CN109327509A (en) * 2018-09-11 2019-02-12 武汉魅瞳科技有限公司 A kind of distributive type Computational frame of the lower coupling of master/slave framework
US20200177481A1 (en) * 2018-11-30 2020-06-04 Sap Se Distributed monitoring in clusters with self-healing
CN111291101A (en) * 2019-12-17 2020-06-16 航天信息股份有限公司 Cluster management method and system
CN112162857A (en) * 2020-09-24 2021-01-01 珠海格力电器股份有限公司 Cluster server node management system
CN112272209A (en) * 2020-09-28 2021-01-26 苏州浪潮智能科技有限公司 Gateway separation method, device, equipment and storage medium in distributed object storage

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
AMIN SHAHRAKI ET AL.: "Hierarchical distributed management clustering protocol for wirelss sensor networks", TELECOMMUNICATION SYSTEMS, 1 September 2016 (2016-09-01) *
李文韵: "基于分布式协调服务的集群管理的设计与实现", 《中国优秀硕士学位论文全文数据库》, 15 April 2015 (2015-04-15) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115361439A (en) * 2022-07-12 2022-11-18 北京奇艺世纪科技有限公司 Node management method, node management device, electronic equipment and storage medium
CN115361439B (en) * 2022-07-12 2024-03-15 北京奇艺世纪科技有限公司 Node management method, node management device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN114050960B (en) 2024-05-14

Similar Documents

Publication Publication Date Title
US8086721B2 (en) Network resource management in a network device
US20080104221A1 (en) Method and system for simple network management protocol based data management
CN113852673B (en) Signaling tracking method and device of core network element and storage medium
CN110445697B (en) Video big data cloud platform equipment access service method
CN111641521B (en) Method for deploying big data clusters on cloud based on Ambari
CN114050960A (en) Distributed server cluster node management method and device
CN110875857A (en) Method, device and system for reporting disconnected network state
CN111787031A (en) Method for sending control instruction of Internet of things equipment based on distributed technology
CN110740355A (en) Equipment monitoring method and device, electronic equipment and storage medium
CN114553867A (en) Cloud-native cross-cloud network monitoring method and device and storage medium
CN110290163A (en) A kind of data processing method and device
CN111901245B (en) Iscsi multi-path management system, method, equipment and storage medium
JP2000066978A (en) Network management information collection system, network management device to be used for the system and node to be managed
CN108900603A (en) A kind of server discovery methods, devices and systems
JP4673532B2 (en) Comprehensive alignment process in a multi-manager environment
CN113824595B (en) Link switching control method and device and gateway equipment
EP1654653B1 (en) Active storage area network discovery system and method
CN110520838B (en) Method for configuring at least one device in a network
CN116915837B (en) Communication method and communication system based on software defined network
KR101740799B1 (en) Method of failover for network service in software defined networking environment
CN112583622A (en) Method and system for reporting fault event information
CN109005117B (en) Network topology aggregation method and device and SDN
CN105323088A (en) Springboard processing method and springboard processing device
CN113794580B (en) Management method and device of network equipment
CN112887450B (en) Communication address configuration method and device of Internet of things equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant