CN109861842A - O&M method, apparatus, electronic equipment and the storage medium of server cluster - Google Patents

O&M method, apparatus, electronic equipment and the storage medium of server cluster

Info

Publication number
CN109861842A
Authority
CN
China
Prior art keywords
executed
work order
maintenance
server
task
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811410234.8A
Other languages
Chinese (zh)
Inventor
苏赛
陈逸凡
侯利曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201811410234.8A priority Critical patent/CN109861842A/en
Publication of CN109861842A publication Critical patent/CN109861842A/en
Pending legal-status Critical Current

Landscapes

  • Stored Programmes (AREA)

Abstract

The embodiments of the present disclosure disclose an operation and maintenance (O&M) method and apparatus for a server cluster, an electronic device, and a storage medium. The operation and maintenance method of the server cluster includes: in response to an operation and maintenance event initiated by a user, acquiring operation and maintenance data provided by the user; generating a to-be-executed work order according to the operation and maintenance data, where the to-be-executed work order includes at least configuration information, a to-be-executed instruction set, and/or a to-be-executed task set; and distributing the to-be-executed work order to one or more target servers in the server cluster, so that the target servers execute corresponding operations according to the to-be-executed work order. Through cooperation between the operation and maintenance center and the execution units on the target servers in the server cluster, the embodiments of the present disclosure enable batch issuing of operation and maintenance events, enable safe and reliable dynamic updating of configuration information on the servers, and improve the user's operation and maintenance efficiency for one or more target servers in the server cluster.

Description

Operation and maintenance method and device of server cluster, electronic equipment and storage medium
Technical Field
The present disclosure relates to the field of computer technologies, and in particular, to an operation and maintenance method and apparatus for a server cluster, an electronic device, and a storage medium.
Background
In a server cluster system, most application operation and maintenance work grows as the number of servers and the number of products increase. Because operation and maintenance staff are in short supply, each individual carries a heavy workload; at the same time, non-standardized and non-automated operation and maintenance work makes application operation and maintenance tasks very complicated and consumes a large amount of personnel, time, and communication cost. Service configuration updates in operation and maintenance management are not only very frequent but also risky and error-prone. For large service clusters in particular, configuration updates involve a heavy workload, take a long time, and produce errors that are hard to detect.
Disclosure of Invention
The embodiment of the disclosure provides an operation and maintenance method and device for a server cluster, electronic equipment and a computer-readable storage medium.
In a first aspect, an embodiment of the present disclosure provides an operation and maintenance method for a server cluster.
Specifically, the operation and maintenance method for the server cluster includes:
responding to an operation and maintenance event initiated by a user, and acquiring operation and maintenance data provided by the user;
generating a work order to be executed according to the operation and maintenance data; the to-be-executed work order at least comprises configuration information, an instruction set to be executed and/or a task set to be executed;
and distributing the to-be-executed work order to one or more target servers in a server cluster so that the target servers execute corresponding operations according to the to-be-executed work order.
Further, the set of instructions to be executed includes at least one of: network instructions, file instructions, directory instructions, script instructions;
and/or the set of tasks to be performed comprises at least one of the following:
a configuration inspection task, a health check task, and a timing task.
Further, the operation and maintenance method of the server cluster further includes:
and receiving and displaying feedback data returned by the target server.
In a second aspect, an operation and maintenance method for a server cluster is provided in the embodiments of the present disclosure.
Specifically, the operation and maintenance method for the server cluster includes:
receiving a work order to be executed distributed by an operation and maintenance center server;
analyzing the work order to be executed, and extracting configuration information, an instruction set to be executed and/or a task set to be executed in the work order to be executed;
and automatically executing corresponding operation according to the work order to be executed.
Further, the automatically executing corresponding operations according to the work order to be executed includes at least one of the following:
transmitting the configuration information to corresponding application service so as to update the configuration information in the application service memory;
and generating a configuration file according to the configuration information, and storing the configuration file into a directory corresponding to the configuration file of the application service.
Further, the automatically executing corresponding operations according to the work order to be executed includes at least one of the following:
periodically checking the configuration file on the target server according to the task in the to-be-executed task set, and returning a checking result to the operation and maintenance center;
and periodically checking a preset interface of the application service on the target server according to the task in the to-be-executed task set, and returning a checking result to the operation and maintenance center.
In a third aspect, an operation and maintenance device for a server cluster is provided in the embodiments of the present disclosure.
Specifically, the operation and maintenance device of the server cluster includes:
the response module is configured to respond to an operation and maintenance event initiated by a user and acquire operation and maintenance data provided by the user;
the generating module is configured to generate a work order to be executed according to the operation and maintenance data; the to-be-executed work order at least comprises configuration information, an instruction set to be executed and/or a task set to be executed;
the distribution module is configured to distribute the to-be-executed work order to one or more target servers in a server cluster, so that the target servers execute corresponding operations according to the to-be-executed work order.
Further, the set of instructions to be executed includes at least one of: network instructions, file instructions, directory instructions, script instructions;
and/or the set of tasks to be performed comprises at least one of the following:
a configuration inspection task, a health check task, and a timing task.
Further, the operation and maintenance device of the server cluster further includes:
and the receiving module is configured to receive and display the feedback data returned by the target server.
In one possible design, the structure of the operation and maintenance device of the server cluster includes a memory and a processor, where the memory is used to store one or more computer instructions for supporting the operation and maintenance device of the server cluster to execute the operation and maintenance method of the server cluster in the first aspect, and the processor is configured to execute the computer instructions stored in the memory. The operation and maintenance device of the server cluster may further include a communication interface, which is used for the operation and maintenance device of the server cluster to communicate with other equipment or a communication network.
In a fourth aspect, an operation and maintenance device for a server cluster is provided in the embodiments of the present disclosure.
Specifically, the operation and maintenance device of the server cluster includes:
the receiving module is configured to receive the work order to be executed distributed by the operation and maintenance center server;
the analysis module is configured to analyze the work order to be executed and extract configuration information, an instruction set to be executed and/or a task set to be executed in the work order to be executed;
and the execution module is configured to automatically execute corresponding operation according to the work order to be executed.
Further, the execution module includes at least one of:
the transmission submodule is configured to transmit the configuration information to the corresponding application service so as to update the configuration information in the application service memory;
and the generation submodule is configured to generate a configuration file according to the configuration information, and store the configuration file into a directory corresponding to the configuration file of the application service.
Further, the execution module includes at least one of:
the first checking submodule is configured to periodically check the configuration file on the target server according to the tasks in the to-be-executed task set and return a checking result to the operation and maintenance center;
and the second checking submodule is configured to regularly check a preset interface of the application service on the target server according to the task in the to-be-executed task set, and return a checking result to the operation and maintenance center.
The functions can be realized by hardware, and the functions can also be realized by executing corresponding software by hardware. The hardware or software includes one or more modules corresponding to the above-described functions.
In one possible design, the structure of the operation and maintenance device of the server cluster includes a memory and a processor, the memory is used for storing one or more computer instructions for supporting the operation and maintenance device of the server cluster to execute the operation and maintenance method of the server cluster in the second aspect, and the processor is configured to execute the computer instructions stored in the memory. The operation and maintenance device of the server cluster may further include a communication interface, which is used for the operation and maintenance device of the server cluster to communicate with other equipment or a communication network.
In a fifth aspect, an embodiment of the present disclosure provides an operation and maintenance system for a server cluster, including:
an operation and maintenance center server for executing the method of the first aspect; and/or
a target server for executing the method of the second aspect.
In a sixth aspect, an embodiment of the present disclosure provides an electronic device, including a memory and a processor; wherein the memory is configured to store one or more computer instructions, wherein the one or more computer instructions are executed by the processor to implement the method steps of the first aspect.
In a seventh aspect, an embodiment of the present disclosure provides a computer-readable storage medium for storing computer instructions for an operation and maintenance device of a server cluster, where the computer instructions include computer instructions for executing the operation and maintenance method of the server cluster in the first aspect.
The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects:
when an event that a user carries out operation and maintenance on a server in a server cluster is received, a to-be-executed work order at least comprising configuration information, an instruction set to be executed and/or a task to be executed is generated according to operation and maintenance data provided by the user, the generated to-be-executed work order is distributed to a target server in the server cluster, and the target server automatically executes corresponding operation according to the to-be-executed work order. According to the embodiment of the disclosure, the operation and maintenance center is matched with the execution units on the target servers in the server cluster, so that batch issuing of operation and maintenance events can be realized, safe and reliable dynamic updating of configuration information on the servers can be realized, and the operation and maintenance efficiency of a user on one or more target servers in the server cluster can be improved.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.
Drawings
Other features, objects, and advantages of the present disclosure will become more apparent from the following detailed description of non-limiting embodiments when taken in conjunction with the accompanying drawings. In the drawings:
Fig. 1 shows a flowchart of an operation and maintenance method of a server cluster according to an embodiment of the present disclosure;
Fig. 2 shows a flowchart of an operation and maintenance method of a server cluster according to another embodiment of the present disclosure;
Fig. 3 shows a block diagram of an operation and maintenance device of a server cluster according to an embodiment of the present disclosure;
Fig. 4 shows a block diagram of an operation and maintenance device of a server cluster according to another embodiment of the present disclosure;
Fig. 5 is a schematic structural diagram of an electronic device suitable for implementing an operation and maintenance method of a server cluster according to an embodiment of the present disclosure.
Detailed Description
Hereinafter, exemplary embodiments of the present disclosure will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily implement them. Also, for the sake of clarity, parts not relevant to the description of the exemplary embodiments are omitted in the drawings.
In the present disclosure, it is to be understood that terms such as "including" or "having," etc., are intended to indicate the presence of the disclosed features, numbers, steps, behaviors, components, parts, or combinations thereof, and are not intended to preclude the possibility that one or more other features, numbers, steps, behaviors, components, parts, or combinations thereof may be present or added.
It should be further noted that the embodiments and features of the embodiments in the present disclosure may be combined with each other without conflict. The present disclosure will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
Fig. 1 shows a flowchart of an operation and maintenance method of a server cluster according to an embodiment of the present disclosure. The operation and maintenance method of the server cluster runs on the operation and maintenance center server. As shown in fig. 1, the operation and maintenance method of the server cluster includes the following steps S101 to S103:
in step S101, in response to an operation and maintenance event initiated by a user, operation and maintenance data provided by the user is acquired;
in step S102, a work order to be executed is generated according to the operation and maintenance data; the to-be-executed work order at least comprises configuration information, an instruction set to be executed and/or a task set to be executed;
in step S103, the work order to be executed is distributed to one or more target servers in a server cluster, so that the target servers execute corresponding operations according to the work order to be executed.
In this embodiment, the operation and maintenance center server performs operation and maintenance management and control on one or more target servers in the server cluster, for example, a user sends an instruction to one or more target servers in the server cluster through the operation and maintenance center server to determine whether an application service running on the target server is normal or not.
The operation and maintenance events may include an inspection of the health condition of a target server in the server cluster by a user of the operation and maintenance center, an information configuration of an application system running on the target server, and the like. The operation and maintenance data may include, but is not limited to, one or more of configuration information, an instruction set, a task set, and a target server identification. The configuration information may be configuration update data for an application system running on the target server, and the instruction set may be one or more instructions to be executed by the target server, such as an update instruction of the configuration information; the set of tasks may be one or more tasks to be performed on the target server, such as health check tasks and the like.
After the operation and maintenance center server obtains the operation and maintenance data provided by the user, it generates a work order to be executed according to the operation and maintenance data; the work order to be executed at least comprises configuration information, an instruction set to be executed and/or a task set to be executed. The operation and maintenance center server then distributes the work order to be executed to one or more target servers in the server cluster. The target servers may be specified by the user and may be all servers in the server cluster or one or more servers therein, as determined by the user's requirements. An execution unit may run on the target server; it receives the work order to be executed, parses its content, and then executes the corresponding operations. For example, if the work order to be executed specifies that the configuration information of one or more application services is to be updated, the execution unit may transmit the configuration information to the corresponding application service, which updates the configuration information in its memory; the execution unit also stores the original configuration file in a specified directory and writes the updated configuration information to the configuration file for subsequent use. For another example, if the work order to be executed specifies that a health check task of the application service is to be executed, the execution unit may verify whether the updated configuration file is correct, and so on.
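Steps S102 and S103 can be illustrated with a minimal Python sketch. The disclosure does not define a work order format or a transport, so the `WorkOrder` fields, the JSON serialization, and the `send` callback below are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field, asdict
from typing import Callable, Dict, List
import json
import uuid


@dataclass
class WorkOrder:
    """A to-be-executed work order; the field names are hypothetical."""
    order_id: str
    config: Dict[str, str] = field(default_factory=dict)
    instructions: List[dict] = field(default_factory=list)
    tasks: List[dict] = field(default_factory=list)


def generate_work_order(om_data: dict) -> WorkOrder:
    """Step S102: build a work order from the user-provided O&M data."""
    order = WorkOrder(
        order_id=uuid.uuid4().hex,
        config=dict(om_data.get("config", {})),
        instructions=list(om_data.get("instructions", [])),
        tasks=list(om_data.get("tasks", [])),
    )
    # The work order must carry at least one of the three parts.
    if not (order.config or order.instructions or order.tasks):
        raise ValueError("work order needs config, instructions, or tasks")
    return order


def distribute(order: WorkOrder, targets: List[str],
               send: Callable[[str, str], bool]) -> Dict[str, bool]:
    """Step S103: send the serialized order to every target server.

    `send(target, payload)` is a hypothetical transport callback that
    returns True when the target accepted the payload.
    """
    payload = json.dumps(asdict(order), sort_keys=True)
    return {target: send(target, payload) for target in targets}
```

Passing the transport in as a callback keeps the sketch independent of any particular network protocol, which the disclosure leaves open.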
The configuration information, the instruction set to be executed, and the task to be executed that can be specified in the work order to be executed are only exemplified above, but the actual application is not limited to the content exemplified above, and other settings may be performed according to the actual situation, so as to implement the corresponding operation and maintenance event.
When an event that a user carries out operation and maintenance on a server in a server cluster is received, a to-be-executed work order at least comprising configuration information, an instruction set to be executed and/or a task to be executed is generated according to operation and maintenance data provided by the user, the generated to-be-executed work order is distributed to a target server in the server cluster, and the target server automatically executes corresponding operation according to the to-be-executed work order. According to the embodiment of the disclosure, the operation and maintenance center is matched with the execution units on the target servers in the server cluster, so that batch issuing of operation and maintenance events can be realized, safe and reliable dynamic updating of configuration information on the servers can be realized, and the operation and maintenance efficiency of a user on one or more target servers in the server cluster can be improved.
In an optional implementation manner of this embodiment, the instruction set to be executed includes at least one of: network instructions, file instructions, directory instructions, script instructions;
and/or the set of tasks to be performed comprises at least one of the following:
a configuration inspection task, a health check task, and a timing task.
In this optional implementation manner, the network instruction may be a local network request initiated by the execution unit on the target server; it is used to transmit the configuration information in the work order to be executed to an application service running on the target server, so as to update the configuration information in memory. File instructions may include, but are not limited to: a file generation instruction, used for generating a new configuration file according to the configuration information in the work order to be executed and setting the file attributes; a file download instruction, used for downloading a file from a specified URL to a specified directory; a file deletion instruction, used for deleting the file at a specified path; a file polling instruction, used for returning the name and the MD5 value of a specified file; and a file reading instruction, used for returning the content, the MD5 value, and the last modification time of a file. Directory instructions may include, but are not limited to: a directory reading instruction, used for returning the files in a directory and their last modification dates; and a directory moving instruction, used for moving the files in a directory to a specified directory. The script instruction may be an instruction to execute a specific script file.
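As an illustration of the read-only instructions described above, the following Python sketch implements a file polling instruction, a file reading instruction, and a directory reading instruction behind a small dispatch table. The instruction names (`file_poll`, `file_read`, `dir_read`) and the dictionary layout of an instruction are assumptions; the disclosure does not specify an encoding.

```python
import hashlib
import os
from pathlib import Path


def file_md5(path: Path) -> str:
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def file_poll(path: str) -> dict:
    """File polling instruction: return the file name and MD5 value."""
    p = Path(path)
    return {"name": p.name, "md5": file_md5(p)}


def file_read(path: str) -> dict:
    """File reading instruction: return content, MD5 value, and mtime."""
    p = Path(path)
    return {
        "content": p.read_text(encoding="utf-8"),
        "md5": file_md5(p),
        "mtime": p.stat().st_mtime,
    }


def dir_read(path: str) -> dict:
    """Directory reading instruction: map each file to its mtime."""
    return {e.name: e.stat().st_mtime for e in os.scandir(path) if e.is_file()}


# Hypothetical instruction names mapped to their handlers.
HANDLERS = {"file_poll": file_poll, "file_read": file_read, "dir_read": dir_read}


def execute_instruction(instruction: dict) -> dict:
    """Dispatch one instruction of the form {"type": ..., "path": ...}."""
    return HANDLERS[instruction["type"]](instruction["path"])
```

Instructions with side effects (file generation, download, deletion, directory moving, script execution) could be added to the same table, though a real execution unit would need to restrict the paths and scripts it accepts.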
According to the configuration inspection task, the execution unit on the target server can check whether the MD5 value of the configuration file on the target server is consistent with the expected MD5 value, record the check result, and return it to the operation and maintenance center. According to the health check task, the execution unit on the target server can request a specific interface of the application service, verify whether the response is consistent with the expected response value, record the verification result, and return it to the operation and maintenance center. The timing task may be any other timed task beyond the configuration inspection task and the health check task.
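The two checks can be sketched in Python as follows. This is an illustration under assumptions: the task dictionaries, the result fields, and the use of an HTTP GET for the "specific interface" are not specified by the disclosure. The `fetch` parameter is injectable so that the comparison logic can be exercised without a running service.

```python
import hashlib
from typing import Callable
from urllib.request import urlopen


def check_config_md5(path: str, expected_md5: str) -> dict:
    """Configuration inspection: compare the file's MD5 with the expected one."""
    with open(path, "rb") as f:
        actual = hashlib.md5(f.read()).hexdigest()
    return {"task": "config_inspection", "path": path,
            "ok": actual == expected_md5, "actual_md5": actual}


def http_get(url: str, timeout: float = 3.0) -> str:
    """Default fetcher: plain HTTP GET (an assumed interface type)."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8")


def run_health_check(task: dict, fetch: Callable[[str], str] = http_get) -> dict:
    """Health check: request the interface and compare with the expected value.

    `task` is a hypothetical dict with "url" and "expected" keys.
    """
    try:
        actual = fetch(task["url"])
        healthy = actual.strip() == task["expected"]
        error = None
    except Exception as exc:  # e.g. connection refused or timeout
        actual, healthy, error = None, False, str(exc)
    return {"task": "health_check", "url": task["url"],
            "healthy": healthy, "actual": actual, "error": error}
```

Both functions return a plain record rather than raising, so the execution unit can log the result locally and forward the same record to the operation and maintenance center.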
In an optional implementation manner of this embodiment, the method further includes the following steps:
and receiving and displaying feedback data returned by the target server.
In this optional implementation, after the target server updates the configuration information, completes the instruction execution, and/or completes the task execution, it may return the result to the operation and maintenance center so that the user can view the operation and maintenance result for the target server. In addition, when the feedback data indicates that the target server did not successfully complete the operation corresponding to the work order to be executed, the work order to be executed can be sent to the target server again; and when execution of a task reveals a problem with the health condition of the target server, alarm information can be automatically sent to the user.
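The feedback handling described above (resend on failure, alarm on a health problem) might look like the following Python sketch. The feedback record layout, the `resend` and `alarm` callbacks, and the retry limit are assumptions added for illustration; the disclosure only states that the work order can be sent again and that an alarm can be raised.

```python
from typing import Callable, Dict, List


def handle_feedback(feedback: List[dict],
                    resend: Callable[[str], None],
                    alarm: Callable[[str], None],
                    attempts: Dict[str, int],
                    max_attempts: int = 3) -> None:
    """Process feedback records returned by target servers.

    Each record is a hypothetical dict such as
    {"server": "srv-1", "success": True, "healthy": True}.
    `attempts` tracks failures per server across calls.
    """
    for item in feedback:
        server = item["server"]
        if not item.get("success", False):
            attempts[server] = attempts.get(server, 0) + 1
            if attempts[server] <= max_attempts:
                # Operation failed: send the work order to this server again.
                resend(server)
            else:
                # Assumed policy: stop retrying and escalate to the user.
                alarm(f"{server}: work order failed after {max_attempts} retries")
        elif item.get("healthy") is False:
            # A task ran but revealed a health problem on the target server.
            alarm(f"{server}: health problem detected")
```

Capping the number of resends is a design choice of this sketch, not of the disclosure; without some limit a permanently failing server would be retried indefinitely.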
Fig. 2 shows a flowchart of an operation and maintenance method of a server cluster according to another embodiment of the present disclosure. The operation and maintenance method of the server cluster runs on a target server in the server cluster. As shown in fig. 2, the operation and maintenance method of the server cluster includes the following steps S201 to S203:
in step S201, a work order to be executed distributed by the operation and maintenance center server is received;
in step S202, the to-be-executed work order is analyzed, and configuration information, a to-be-executed instruction set, and/or a to-be-executed task set in the to-be-executed work order are extracted;
in step S203, corresponding operations are automatically executed according to the work order to be executed.
In this embodiment, the execution unit on the target server may listen for messages from the operation and maintenance center at all times. After receiving a to-be-executed work order distributed by the operation and maintenance center, the execution unit first verifies the validity of the request, then parses the content of the to-be-executed work order, extracts the configuration information, the to-be-executed instruction set, and/or the to-be-executed task set, and executes the corresponding operations according to that content. For other details, refer to the description of the operation and maintenance method of the server cluster in the embodiment shown in fig. 1; they are not repeated here.
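The verify-then-parse order of steps S201 and S202 can be sketched in Python. The disclosure does not say how the validity of a request is verified; the HMAC-SHA256 signature over a shared secret used here is purely an assumed mechanism, as are the JSON payload and its key names.

```python
import hashlib
import hmac
import json
from typing import Tuple


def sign(payload: bytes, secret: bytes) -> str:
    """Compute an HMAC-SHA256 signature (assumed validity mechanism)."""
    return hmac.new(secret, payload, hashlib.sha256).hexdigest()


def parse_work_order(payload: bytes, signature: str,
                     secret: bytes) -> Tuple[dict, list, list]:
    """Verify the request, then extract the three parts of the work order.

    Returns (config, instructions, tasks); missing parts default to empty.
    """
    # Validity check comes first; nothing is parsed from an unverified request.
    if not hmac.compare_digest(sign(payload, secret), signature):
        raise PermissionError("invalid work order signature")
    order = json.loads(payload)
    return (order.get("config", {}),
            order.get("instructions", []),
            order.get("tasks", []))
```

`hmac.compare_digest` is used instead of `==` so that the comparison time does not depend on how many leading characters of the signature match.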
In an optional implementation manner of this embodiment, automatically executing a corresponding operation according to the to-be-executed work order includes at least one of the following:
transmitting the configuration information to corresponding application service so as to update the configuration information in the application service memory;
and generating a configuration file according to the configuration information, and storing the configuration file into a directory corresponding to the configuration file of the application service.
In this optional implementation manner, when the to-be-executed work order includes the configuration information and the corresponding configuration information update instruction, the execution unit on the target server initiates a local network request to transmit the configuration information to the application service, so as to update the configuration information in the application service memory. After the memory configuration is updated successfully, the execution unit may further create a new directory, move the old configuration file to the new directory, generate a new configuration file according to the configuration information in the work order to be executed, and move the new configuration file to the specified directory, that is, the directory corresponding to the configuration file of the application service.
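The update sequence in this implementation (update memory first, then back up the old file and install the new one) is sketched below in Python. The `push_to_app` callback stands in for the local network request, and the `key=value` file format and the backup directory location are assumptions; none of these details are fixed by the disclosure.

```python
import os
import shutil
import tempfile
from typing import Callable, Dict


def apply_config(config: Dict[str, str], config_path: str,
                 push_to_app: Callable[[Dict[str, str]], bool]) -> str:
    """Update the application's in-memory config, then its config file.

    Returns the path of the newly created backup directory.
    """
    # 1. Local request to the application service (hypothetical callback).
    if not push_to_app(config):
        raise RuntimeError("application rejected the in-memory update")

    # 2. Create a new directory and move the old configuration file into it.
    config_dir = os.path.dirname(os.path.abspath(config_path))
    backup_dir = tempfile.mkdtemp(prefix="backup-", dir=config_dir)
    if os.path.exists(config_path):
        shutil.move(config_path,
                    os.path.join(backup_dir, os.path.basename(config_path)))

    # 3. Generate the new file elsewhere, then move it into place so the
    #    application never sees a partially written file.
    fd, tmp_path = tempfile.mkstemp(dir=backup_dir)
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        for key in sorted(config):
            f.write(f"{key}={config[key]}\n")  # assumed key=value format
    os.replace(tmp_path, config_path)
    return backup_dir
```

Keeping the old file in the backup directory means a failed rollout can be reverted by moving it back, which fits the "safe and reliable" update goal stated in the disclosure.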
In an optional implementation manner of this embodiment, automatically executing a corresponding operation according to the to-be-executed work order includes at least one of the following:
periodically checking the configuration file on the target server according to the task in the to-be-executed task set, and returning a checking result to the operation and maintenance center;
and periodically checking a preset interface of the application service on the target server according to the task in the to-be-executed task set, and returning a checking result to the operation and maintenance center.
In this optional implementation manner, when the to-be-executed work order includes a configuration inspection task for checking the configuration file on the target server and/or a health check task for the corresponding application service, the execution unit may verify the health of the updated configuration information according to the to-be-executed work order. For example, the execution unit starts a task that polls the configuration file and checks whether the MD5 value of the configuration file has changed; if so, it records the time point at which the configuration file changed in a local monitoring file and returns the time point to the operation and maintenance center for display or alarm. In addition, the execution unit can start a timing task that requests a preset interface of the application service at regular intervals, compares the result returned by the preset interface with the expected response value, records the result, and returns it to the operation and maintenance center.
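The change-detection loop for the configuration file can be sketched in Python as follows. The `ConfigWatcher` class, the JSON-lines monitoring file, and the `report` callback that stands in for returning data to the operation and maintenance center are all hypothetical; the disclosure only describes the behaviour.

```python
import hashlib
import json
import time
from typing import Callable, Optional


class ConfigWatcher:
    """Poll a configuration file and record when its MD5 value changes."""

    def __init__(self, config_path: str, monitor_path: str,
                 report: Callable[[dict], None]) -> None:
        self.config_path = config_path
        self.monitor_path = monitor_path  # local monitoring file
        self.report = report              # hypothetical O&M center callback
        self._last_md5: Optional[str] = None

    def _md5(self) -> str:
        with open(self.config_path, "rb") as f:
            return hashlib.md5(f.read()).hexdigest()

    def poll_once(self, now: Optional[float] = None) -> bool:
        """Run one poll; return True if a change was detected."""
        current = self._md5()
        # The first poll only establishes the baseline.
        changed = self._last_md5 is not None and current != self._last_md5
        self._last_md5 = current
        if changed:
            event = {"path": self.config_path, "md5": current,
                     "changed_at": time.time() if now is None else now}
            with open(self.monitor_path, "a", encoding="utf-8") as f:
                f.write(json.dumps(event) + "\n")
            self.report(event)
        return changed

    def run(self, interval_s: float, rounds: int) -> None:
        """Timing task: poll `rounds` times, sleeping between polls."""
        for _ in range(rounds):
            self.poll_once()
            time.sleep(interval_s)
```

The recorded time is the moment the change was observed, so its accuracy is bounded by the polling interval; a shorter interval narrows that window at the cost of more file reads.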
The following are embodiments of the disclosed apparatus that may be used to perform embodiments of the disclosed methods.
Fig. 3 shows a block diagram of an operation and maintenance device of a server cluster according to an embodiment of the present disclosure, which may be implemented as part of or all of an electronic device through software, hardware, or a combination of the two. The operation and maintenance device of the server cluster operates on the operation and maintenance center server, as shown in fig. 3, the operation and maintenance device of the server cluster includes:
the response module 301 is configured to respond to an operation and maintenance event initiated by a user, and acquire operation and maintenance data provided by the user;
a generating module 302 configured to generate a work order to be executed according to the operation and maintenance data; the to-be-executed work order at least comprises configuration information, an instruction set to be executed and/or a task set to be executed;
the distributing module 303 is configured to distribute the to-be-executed work order to one or more target servers in a server cluster, so that the target servers execute corresponding operations according to the to-be-executed work order.
In this embodiment, the operation and maintenance center server performs operation and maintenance management and control on one or more target servers in the server cluster, for example, a user sends an instruction to one or more target servers in the server cluster through the operation and maintenance center server to determine whether an application service running on the target server is normal or not.
The operation and maintenance events may include an inspection of the health condition of a target server in the server cluster by a user of the operation and maintenance center, an information configuration of an application system running on the target server, and the like. The operation and maintenance data may include, but is not limited to, one or more of configuration information, an instruction set, a task set, and a target server identification. The configuration information may be configuration update data for an application system running on the target server, and the instruction set may be one or more instructions to be executed by the target server, such as an update instruction of the configuration information; the set of tasks may be one or more tasks to be performed on the target server, such as health check tasks and the like.
After the operation and maintenance center server obtains the operation and maintenance data provided by the user, it generates a work order to be executed according to that data. The work order to be executed at least comprises configuration information, an instruction set to be executed and/or a task set to be executed. The operation and maintenance center server then distributes the work order to be executed to one or more target servers in the server cluster. The target servers may be specified by the user and may be all servers in the server cluster or one or more of them, as determined by the user's requirements. The target server may run an execution unit, which receives the work order to be executed, analyzes its content, and executes the corresponding operations. For example, if the to-be-executed work order specifies that the configuration information of one or more application services is to be updated, the execution unit may transmit the configuration information to the corresponding application service, which updates the configuration information in its memory; the execution unit also stores the original configuration file in a specified directory and stores the updated configuration information in the configuration file for subsequent use. For another example, if the to-be-executed work order specifies that a health check task of the application service is to be executed, the execution unit may verify whether the updated configuration file is correct, and the like.
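The generation and distribution steps described above can be sketched as follows. This is a minimal illustration rather than the disclosed implementation: the `WorkOrder` field names, the JSON encoding, and the injected `send` callable are all assumptions, since the disclosure does not specify a wire format or transport.

```python
import json
import uuid
from dataclasses import asdict, dataclass, field


@dataclass
class WorkOrder:
    """Hypothetical work-order layout; the field names are assumptions."""
    order_id: str
    config: dict = field(default_factory=dict)        # configuration information
    instructions: list = field(default_factory=list)  # instruction set to be executed
    tasks: list = field(default_factory=list)         # task set to be executed


def generate_work_order(om_data: dict) -> WorkOrder:
    """Build a work order from the operation and maintenance data supplied by the user."""
    order = WorkOrder(
        order_id=uuid.uuid4().hex,
        config=dict(om_data.get("config", {})),
        instructions=list(om_data.get("instructions", [])),
        tasks=list(om_data.get("tasks", [])),
    )
    # A work order must carry at least one of the three kinds of content.
    if not (order.config or order.instructions or order.tasks):
        raise ValueError("operation and maintenance data contains nothing to execute")
    return order


def distribute(order: WorkOrder, targets: list, send) -> dict:
    """Send the same serialized work order to every target server.

    `send(target, payload)` is an assumed transport callback that returns
    True when the target accepted the payload.
    """
    payload = json.dumps(asdict(order))
    return {target: send(target, payload) for target in targets}
```

Passing the transport in as a callable keeps the sketch independent of any particular network protocol; a batch issue to the whole cluster is then just a longer `targets` list.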
The configuration information, the instruction set to be executed, and the task set to be executed that can be specified in the work order to be executed are described above only as examples. Actual applications are not limited to these examples, and other settings may be made according to the actual situation in order to implement the corresponding operation and maintenance event.
When an operation and maintenance event initiated by a user for a server in a server cluster is received, a to-be-executed work order at least comprising configuration information, an instruction set to be executed and/or a task set to be executed is generated according to the operation and maintenance data provided by the user. The generated work order is distributed to a target server in the server cluster, and the target server automatically executes the corresponding operation according to it. In the embodiment of the disclosure, the operation and maintenance center cooperates with the execution units on the target servers in the server cluster, so that operation and maintenance events can be issued in batches, configuration information on the servers can be updated dynamically in a safe and reliable manner, and the efficiency with which a user operates and maintains one or more target servers in the server cluster is improved.
In an optional implementation manner of this embodiment, the instruction set to be executed includes at least one of: network instructions, file instructions, directory instructions, script instructions;
and/or the set of tasks to be performed comprises at least one of the following:
a configuration inspection task, a health check task and a timed task.
In this optional implementation manner, the network instruction may be a local network request initiated by an execution unit on the target server, used to transmit the configuration information in the work order to be executed to an application service running on the target server so as to update the configuration information in memory. File instructions may include, but are not limited to: a file generation instruction for generating a new configuration file according to the configuration information in the work order to be executed and setting the file attributes; a file download instruction for downloading a file from a specified URL to a specified directory; a file deletion instruction for deleting the file at a specified path; a file inspection instruction for returning the name and the MD5 value of a specified file; and a file reading instruction for returning the content of a file, its MD5 value, and its last modification time. Directory instructions may include, but are not limited to: a directory reading instruction for returning the files in a directory and their last modification dates; and a directory moving instruction for moving the files in a directory to a specified directory. The script instruction may be an instruction to execute a particular script file.
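A few of the file and directory instructions listed above could be dispatched by the execution unit roughly as follows. The instruction type names (`file_inspect`, `file_read`, `file_delete`, `directory_read`) and the dictionary layout of an instruction are hypothetical; the disclosure names the instructions but does not define their encoding.

```python
import hashlib
import os


def _md5(path):
    """Return the hex MD5 digest of the file at `path`."""
    with open(path, "rb") as f:
        return hashlib.md5(f.read()).hexdigest()


def file_inspect(path):
    """File inspection instruction: return the file name and its MD5 value."""
    return {"name": os.path.basename(path), "md5": _md5(path)}


def file_read(path):
    """File reading instruction: return content, MD5 value and last modification time."""
    with open(path, "r", encoding="utf-8") as f:
        content = f.read()
    return {"content": content, "md5": _md5(path), "mtime": os.path.getmtime(path)}


def file_delete(path):
    """File deletion instruction: delete the file at the specified path."""
    os.remove(path)
    return {"deleted": path}


def directory_read(path):
    """Directory reading instruction: map each entry to its last modification time."""
    return {name: os.path.getmtime(os.path.join(path, name))
            for name in sorted(os.listdir(path))}


# Assumed mapping from instruction type to handler.
HANDLERS = {
    "file_inspect": file_inspect,
    "file_read": file_read,
    "file_delete": file_delete,
    "directory_read": directory_read,
}


def execute_instruction(instruction):
    """Run one instruction of the assumed form {"type": ..., "path": ...}."""
    handler = HANDLERS.get(instruction["type"])
    if handler is None:
        raise ValueError(f"unsupported instruction type: {instruction['type']}")
    return handler(instruction["path"])
```

A table of handlers makes it straightforward to add the remaining instructions (generation, download, directory move, script execution) without changing the dispatcher.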
According to the configuration inspection task, the execution unit on the target server can check whether the MD5 value of the configuration file on the target server is consistent with the expected MD5 value, record the check result, and return it to the operation and maintenance center. According to the health check task, the execution unit on the target server can request a specific interface of the application service, check whether the response is consistent with the expected response value, record the verification result, and return it to the operation and maintenance center. The timed tasks may be other timed tasks beyond the configuration inspection task and the health check task.
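The configuration inspection task can be illustrated with a short sketch. The result fields and the `report` callback standing in for "return to the operation and maintenance center" are assumptions made for illustration.

```python
import hashlib
import time


def inspect_config(path, expected_md5, report, now=time.time):
    """Compare the MD5 value of a configuration file with the expected value.

    `report` is an assumed callback that forwards the result to the
    operation and maintenance center; `now` is injectable for testing.
    """
    try:
        with open(path, "rb") as f:
            actual = hashlib.md5(f.read()).hexdigest()
    except OSError as exc:
        # A missing or unreadable file is reported as inconsistent.
        result = {"path": path, "consistent": False,
                  "actual_md5": None, "error": str(exc)}
    else:
        result = {"path": path, "consistent": actual == expected_md5,
                  "actual_md5": actual, "error": None}
    result["checked_at"] = now()  # record when the check was made
    report(result)
    return result
```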
In an optional implementation manner of this embodiment, the apparatus further includes:
and the receiving module is configured to receive and display the feedback data returned by the target server.
In this optional implementation, after the target server updates the configuration information, completes the instruction execution, and/or completes the task execution, it may return the result to the operation and maintenance center, so that the user can view the operation and maintenance result for the target server. In addition, if the feedback data indicates that the target server did not successfully complete the operation corresponding to the work order to be executed, the work order to be executed can be sent to the target server again; or, if executing a task reveals a problem with the health condition of the target server, alarm information can be sent to the user automatically.
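One way the operation and maintenance center might act on feedback data is sketched below. The feedback fields (`success`, `healthy`, `attempt`), the `resend` and `alert` callbacks, and the retry limit are all assumptions; the disclosure only states that a failed work order can be sent again and that an alarm can be raised.

```python
def handle_feedback(feedback, resend, alert, max_attempts=3):
    """Decide what to do with feedback returned by a target server.

    Returns the list of actions taken, for display to the user.
    The `max_attempts` cap is an assumption added to avoid endless retries.
    """
    actions = []
    server = feedback["server"]
    order_id = feedback["order_id"]
    if not feedback.get("success", False):
        if feedback.get("attempt", 1) < max_attempts:
            # The operation did not complete: send the work order again.
            resend(server, order_id)
            actions.append("resent")
        else:
            alert(f"work order {order_id} failed on {server}")
            actions.append("alerted")
    elif feedback.get("healthy") is False:
        # The task ran, but it found a problem with the server's health.
        alert(f"health problem detected on {server}")
        actions.append("alerted")
    return actions
```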
Fig. 4 shows a block diagram of an operation and maintenance device of a server cluster according to an embodiment of the present disclosure. This operation and maintenance device is installed and runs on a target server in the server cluster. As shown in Fig. 4, the operation and maintenance device of the server cluster includes:
the receiving module 401 is configured to receive a work order to be executed distributed by the operation and maintenance center server;
an analysis module 402 configured to analyze the to-be-executed work order and extract configuration information, an instruction set to be executed, and/or a task set to be executed in the to-be-executed work order;
and the execution module 403 is configured to automatically execute corresponding operations according to the work order to be executed.
In this embodiment, the execution unit on the target server may listen for messages from the operation and maintenance center at all times. After receiving a to-be-executed work order distributed by the operation and maintenance center, the execution unit first verifies the validity of the request, then analyzes the content of the work order, extracts the configuration information, the instruction set to be executed, and/or the task set to be executed, and executes the corresponding operation according to that content. For other details, refer to the description of the operation and maintenance method of the server cluster in the embodiment shown in Fig. 1; they are not repeated here.
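The receive, verify, and extract steps can be sketched as follows. The disclosure does not say how validity is verified, so the HMAC-SHA256 signature over the payload with a shared secret used here is purely an assumption, as are the JSON payload and its field names.

```python
import hashlib
import hmac
import json


def sign(payload: bytes, secret: bytes) -> str:
    """Compute the assumed HMAC-SHA256 signature of a work-order payload."""
    return hmac.new(secret, payload, hashlib.sha256).hexdigest()


def parse_work_order(payload: bytes, signature: str, secret: bytes) -> dict:
    """Verify the request, then extract the three parts of the work order."""
    # Constant-time comparison avoids leaking how many characters matched.
    if not hmac.compare_digest(sign(payload, secret), signature):
        raise PermissionError("invalid work order signature")
    order = json.loads(payload)
    return {
        "config": order.get("config", {}),              # configuration information
        "instructions": order.get("instructions", []),  # instruction set to be executed
        "tasks": order.get("tasks", []),                # task set to be executed
    }
```

Rejecting the request before parsing means that an unauthenticated payload is never interpreted by the execution unit.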
In an optional implementation manner of this embodiment, the executing module 403 includes at least one of:
the transmission submodule is configured to transmit the configuration information to the corresponding application service so as to update the configuration information in the application service memory;
and the generation submodule is configured to generate a configuration file according to the configuration information, and store the configuration file into a directory corresponding to the configuration file of the application service.
In this optional implementation manner, when the to-be-executed work order includes the configuration information and the corresponding configuration information update instruction, the execution unit on the target server initiates a local network request to transmit the configuration information to the application service, so as to update the configuration information in the application service memory. After the memory configuration is updated successfully, the execution unit may further create a new directory, move the old configuration file to the new directory, generate a new configuration file according to the configuration information in the work order to be executed, and move the new configuration file to the specified directory, that is, the directory corresponding to the configuration file of the application service.
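The update sequence described above (update memory first, then back up the old file and write the new one) might look like the following sketch. The `push_to_app` callable stands in for the local network request to the application service, and the JSON file format and timestamped backup directory name are assumptions.

```python
import json
import os
import shutil
import tempfile
import time


def apply_config(config, config_path, backup_root, push_to_app):
    """Update the in-memory configuration, then persist it to disk.

    `push_to_app(config)` is an assumed callable that performs the local
    network request and returns True once the application service has
    updated its in-memory configuration.
    """
    if not push_to_app(config):
        return False  # leave the files untouched if the memory update failed

    # Create a new directory and move the old configuration file into it.
    backup_dir = os.path.join(backup_root, time.strftime("%Y%m%d%H%M%S"))
    os.makedirs(backup_dir, exist_ok=True)
    if os.path.exists(config_path):
        shutil.move(config_path,
                    os.path.join(backup_dir, os.path.basename(config_path)))

    # Generate the new file next to its destination, then move it into place.
    target_dir = os.path.dirname(config_path) or "."
    fd, tmp_path = tempfile.mkstemp(dir=target_dir)
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        json.dump(config, f)
    os.replace(tmp_path, config_path)
    return True
```

Writing to a temporary file and renaming it means the application service never observes a half-written configuration file.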
In an optional implementation manner of this embodiment, the executing module 403 includes at least one of:
the first checking submodule is configured to check the configuration file on the target server at regular time according to the tasks in the to-be-executed task set and return a checking result to the operation and maintenance center;
and the second checking submodule is configured to regularly check a preset interface of the application service on the target server according to the task in the to-be-executed task set, and return a checking result to the operation and maintenance center.
In this optional implementation manner, when the to-be-executed work order includes a configuration inspection task for checking the configuration file on the target server and/or a health check task for the corresponding application service, the execution unit may verify the health of the updated configuration information according to the to-be-executed work order. For example, the execution unit starts a task that polls the configuration file and checks whether its MD5 value has changed; if so, it records the time at which the configuration file changed in a local monitoring file and returns that time to the operation and maintenance center for display or alarm. In addition, the execution unit can start a timed task that periodically requests a preset interface of the application service, compares the result returned by the preset interface with an expected response value, records the result, and returns it to the operation and maintenance center.
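The timed health check can be sketched as below. The `fetch` callable stands in for a request to the preset interface of the application service, and `report` for returning the result to the operation and maintenance center; both are assumptions, as is the fixed number of rounds used to keep the example finite.

```python
import time


def health_check(fetch, expected):
    """Request the preset interface once and compare with the expected value."""
    try:
        actual = fetch()
    except Exception as exc:
        # An unreachable interface counts as unhealthy.
        return {"healthy": False, "actual": None, "error": str(exc)}
    return {"healthy": actual == expected, "actual": actual, "error": None}


def run_timed(task, report, interval_s, rounds, sleep=time.sleep):
    """Run `task` a fixed number of times, reporting each result.

    `sleep` is injectable so the schedule can be tested without waiting.
    """
    for i in range(rounds):
        report(task())
        if i < rounds - 1:
            sleep(interval_s)
```

In a long-running execution unit the loop would more likely run until the task is cancelled; the bounded `rounds` parameter is only a simplification.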
The embodiment of the present disclosure provides an operation and maintenance system of a server cluster, including: the operation and maintenance center server and/or the target server; wherein,
the operation and maintenance center server is used for operating the operation and maintenance method of the server cluster shown in fig. 1 and the related embodiments;
the target server is used for operating the operation and maintenance method of the server cluster shown in fig. 2 and the related embodiments.
Fig. 5 is a schematic structural diagram of an electronic device suitable for implementing an operation and maintenance method of a server cluster according to an embodiment of the present disclosure.
As shown in Fig. 5, the electronic device 500 includes a Central Processing Unit (CPU) 501 that can execute the various processes in the embodiment shown in Fig. 1 described above according to a program stored in a Read Only Memory (ROM) 502 or a program loaded from a storage section 508 into a Random Access Memory (RAM) 503. The RAM 503 also stores various programs and data necessary for the operation of the electronic device 500. The CPU 501, ROM 502, and RAM 503 are connected to each other via a bus 504. An input/output (I/O) interface 505 is also connected to the bus 504.
The following components are connected to the I/O interface 505: an input section 506 including a keyboard, a mouse, and the like; an output section 507 including a display such as a Cathode Ray Tube (CRT) or a Liquid Crystal Display (LCD), and a speaker; a storage section 508 including a hard disk and the like; and a communication section 509 including a network interface card such as a LAN card or a modem. The communication section 509 performs communication processing via a network such as the Internet. A drive 510 is also connected to the I/O interface 505 as necessary. A removable medium 511 such as a magnetic disk, an optical disk, a magneto-optical disk, or a semiconductor memory is mounted on the drive 510 as necessary, so that a computer program read from it can be installed into the storage section 508 as necessary.
In particular, according to embodiments of the present disclosure, the method described above with reference to Fig. 1 may be implemented as a computer software program. For example, embodiments of the present disclosure include a computer program product comprising a computer program tangibly embodied on a machine-readable medium, the computer program comprising program code for performing the method illustrated in Fig. 1. In such an embodiment, the computer program may be downloaded and installed from a network through the communication section 509, and/or installed from the removable medium 511.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowcharts or block diagrams may represent a module, a program segment, or a portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The units or modules described in the embodiments of the present disclosure may be implemented by software or hardware. The units or modules described may also be provided in a processor, and the names of the units or modules do not in some cases constitute a limitation of the units or modules themselves.
As another aspect, the present disclosure also provides a computer-readable storage medium, which may be the computer-readable storage medium included in the apparatus in the above-described embodiment; or it may be a separate computer readable storage medium not incorporated into the device. The computer readable storage medium stores one or more programs for use by one or more processors in performing the methods described in the present disclosure.
The foregoing description covers only the preferred embodiments of the disclosure and illustrates the principles of the technology employed. Those skilled in the art will appreciate that the scope of the invention in the present disclosure is not limited to the specific combination of the above-mentioned features, but also encompasses other embodiments formed by any combination of the above-mentioned features or their equivalents without departing from the inventive concept. For example, a technical solution may be formed by replacing the above features with features of similar function disclosed in (but not limited to) the present disclosure.

Claims (15)

1. An operation and maintenance method for a server cluster is characterized by comprising the following steps:
responding to an operation and maintenance event initiated by a user, and acquiring operation and maintenance data provided by the user;
generating a work order to be executed according to the operation and maintenance data; the to-be-executed work order at least comprises configuration information, an instruction set to be executed and/or a task set to be executed;
and distributing the to-be-executed work order to one or more target servers in a server cluster so that the target servers execute corresponding operations according to the to-be-executed work order.
2. The method of claim 1, wherein the set of instructions to be executed comprises at least one of: network instructions, file instructions, directory instructions, script instructions; and/or the set of tasks to be performed comprises at least one of the following:
a configuration inspection task, a health check task and a timed task.
3. The method of claim 1 or 2, further comprising:
and receiving and displaying feedback data returned by the target server.
4. An operation and maintenance method for a server cluster is characterized by comprising the following steps:
receiving a work order to be executed distributed by an operation and maintenance center server;
analyzing the work order to be executed, and extracting configuration information, an instruction set to be executed and/or a task set to be executed in the work order to be executed;
and automatically executing corresponding operation according to the work order to be executed.
5. The method of claim 4, wherein automatically performing the corresponding operation according to the work order to be performed comprises at least one of:
transmitting the configuration information to corresponding application service so as to update the configuration information in the application service memory;
and generating a configuration file according to the configuration information, and storing the configuration file into a directory corresponding to the configuration file of the application service.
6. The method according to claim 4 or 5, wherein automatically performing the corresponding operation according to the work order to be performed comprises at least one of:
checking the configuration file on the target server at regular time according to the task in the task set to be executed, and returning a checking result to the operation and maintenance center;
and checking a preset interface of the application service on the target server at regular time according to the task in the to-be-executed task set, and returning a checking result to the operation and maintenance center.
7. An operation and maintenance device for a server cluster, comprising:
the response module is configured to respond to an operation and maintenance event initiated by a user and acquire operation and maintenance data provided by the user;
the generating module is configured to generate a work order to be executed according to the operation and maintenance data; the to-be-executed work order at least comprises configuration information, an instruction set to be executed and/or a task set to be executed;
the distribution module is configured to distribute the to-be-executed work order to one or more target servers in a server cluster, so that the target servers execute corresponding operations according to the to-be-executed work order.
8. The apparatus of claim 7, wherein the set of instructions to be executed comprises at least one of: network instructions, file instructions, directory instructions, script instructions;
and/or the set of tasks to be performed comprises at least one of the following:
a configuration inspection task, a health check task and a timed task.
9. The apparatus of claim 7 or 8, further comprising:
and the receiving module is configured to receive and display the feedback data returned by the target server.
10. An operation and maintenance device for a server cluster, comprising:
the receiving module is configured to receive the work order to be executed distributed by the operation and maintenance center server;
the analysis module is configured to analyze the work order to be executed and extract configuration information, an instruction set to be executed and/or a task set to be executed in the work order to be executed;
and the execution module is configured to automatically execute corresponding operation according to the work order to be executed.
11. The apparatus of claim 10, wherein the execution module comprises at least one of:
the transmission submodule is configured to transmit the configuration information to the corresponding application service so as to update the configuration information in the application service memory;
and the generation submodule is configured to generate a configuration file according to the configuration information, and store the configuration file into a directory corresponding to the configuration file of the application service.
12. The apparatus of claim 10 or 11, wherein the execution module comprises at least one of:
the first checking submodule is configured to check the configuration file on the target server at regular time according to the tasks in the to-be-executed task set and return a checking result to the operation and maintenance center;
and the second checking submodule is configured to regularly check a preset interface of the application service on the target server according to the task in the to-be-executed task set, and return a checking result to the operation and maintenance center.
13. An operation and maintenance system of a server cluster, comprising:
an operation and maintenance center server for executing the method of any one of claims 1-3; and/or
a target server for performing the method of any one of claims 4-6.
14. An electronic device comprising a memory and a processor; wherein,
the memory is configured to store one or more computer instructions, wherein the one or more computer instructions are executed by the processor to implement the method steps of any of claims 1-6.
15. A computer-readable storage medium having stored thereon computer instructions, characterized in that the computer instructions, when executed by a processor, carry out the method steps of any of claims 1-6.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811410234.8A CN109861842A (en) 2018-11-23 2018-11-23 O&M method, apparatus, electronic equipment and the storage medium of server cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811410234.8A CN109861842A (en) 2018-11-23 2018-11-23 O&M method, apparatus, electronic equipment and the storage medium of server cluster

Publications (1)

Publication Number Publication Date
CN109861842A true CN109861842A (en) 2019-06-07

Family

ID=66890229

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811410234.8A Pending CN109861842A (en) 2018-11-23 2018-11-23 O&M method, apparatus, electronic equipment and the storage medium of server cluster

Country Status (1)

Country Link
CN (1) CN109861842A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102075358A (en) * 2010-12-31 2011-05-25 网宿科技股份有限公司 System and method for distributing and deploying content of large-scale server cluster
CN103051467A (en) * 2011-10-17 2013-04-17 中兴通讯股份有限公司 Automatic detecting method and device of northbound interface
CN105099786A (en) * 2015-08-31 2015-11-25 华为技术有限公司 Method and apparatus of service configuration in network cutover
WO2017148268A1 (en) * 2016-02-29 2017-09-08 阿里巴巴集团控股有限公司 Task processing method, apparatus and system based on distributed system
CN108076125A (en) * 2016-11-11 2018-05-25 北京京东尚科信息技术有限公司 Interface allocation method and system
CN108768683A (en) * 2018-03-15 2018-11-06 北京奇艺世纪科技有限公司 A kind of automation O&M methods, devices and systems

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110719190A (en) * 2019-08-28 2020-01-21 北京大学(天津滨海)新一代信息技术研究院 Method and system for designing operation and maintenance instruction
CN112445549A (en) * 2019-09-03 2021-03-05 网联清算有限公司 Operation and maintenance method, operation and maintenance device, electronic equipment and medium
CN110825420A (en) * 2019-11-26 2020-02-21 中国银行股份有限公司 Configuration parameter updating method, device, equipment and storage medium for distributed cluster
CN110825420B (en) * 2019-11-26 2023-09-22 中国银行股份有限公司 Method, device, equipment and storage medium for updating configuration parameters of distributed cluster
CN111371598A (en) * 2020-02-26 2020-07-03 普信恒业科技发展(北京)有限公司 Configuration method and system
CN111459631A (en) * 2020-03-27 2020-07-28 厦门梦加网络科技股份有限公司 Automatic batch processing method and system for server
CN112134950A (en) * 2020-09-22 2020-12-25 平安科技(深圳)有限公司 Method, device, maintenance server and medium for maintaining Linux cluster
CN112134950B (en) * 2020-09-22 2023-08-29 平安科技(深圳)有限公司 Method and device for maintaining Linux cluster, maintenance server and medium
CN113900672A (en) * 2021-02-09 2022-01-07 京东科技控股股份有限公司 Operation control method and device of server cluster, server and storage medium
CN113032054B (en) * 2021-04-21 2023-07-14 杭州华橙软件技术有限公司 Service execution method and device, storage medium and electronic device
CN113032054A (en) * 2021-04-21 2021-06-25 杭州华橙软件技术有限公司 Service execution method, device, storage medium and electronic device
CN112990890B (en) * 2021-05-10 2021-08-27 鹏城实验室 Notification message sending method, server and computer readable storage medium
CN112990890A (en) * 2021-05-10 2021-06-18 鹏城实验室 Notification message sending method, server and computer readable storage medium
CN113778798A (en) * 2021-09-06 2021-12-10 北京沃东天骏信息技术有限公司 Server control method, server control device, electronic device, and storage medium
CN113849244A (en) * 2021-09-13 2021-12-28 广州咨元信息科技有限公司 Configuration method for realizing custom service scene based on layout engine
CN113849244B (en) * 2021-09-13 2024-01-12 广州咨元信息科技有限公司 Configuration method for realizing custom service scene based on orchestration engine
CN113783731A (en) * 2021-09-15 2021-12-10 云茂互联智能科技(厦门)有限公司 Edge server control method, device, storage medium and electronic device
CN113904924A (en) * 2021-10-19 2022-01-07 吴枚兰 Efficient operation and maintenance method for local area network system of Internet of things
CN114161410A (en) * 2021-11-16 2022-03-11 中国电信集团系统集成有限责任公司 Operation and maintenance method and device, electronic equipment and storage medium
CN114161410B (en) * 2021-11-16 2024-01-09 中电信数智科技有限公司 Operation and maintenance method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200918

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Applicant before: Alibaba Group Holding Ltd.

Effective date of registration: 20200918

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Applicant before: Advanced innovation technology Co.,Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20190607