CN117873673A - Automatic operation and maintenance method for server and related products - Google Patents

Automatic operation and maintenance method for server and related products Download PDF

Info

Publication number
CN117873673A
CN117873673A CN202311824727.7A CN202311824727A CN117873673A CN 117873673 A CN117873673 A CN 117873673A CN 202311824727 A CN202311824727 A CN 202311824727A CN 117873673 A CN117873673 A CN 117873673A
Authority
CN
China
Prior art keywords
maintenance
server
flow
atomic
control
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202311824727.7A
Other languages
Chinese (zh)
Inventor
崔宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan Kunlun Technology Co ltd
Original Assignee
Henan Kunlun Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan Kunlun Technology Co ltd filed Critical Henan Kunlun Technology Co ltd
Priority to CN202311824727.7A priority Critical patent/CN117873673A/en
Publication of CN117873673A publication Critical patent/CN117873673A/en
Pending legal-status Critical Current

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application provides a server automation operation and maintenance method and related products, which can be applied to the technical field of server operation and maintenance, and the method comprises the following steps: first, an operation and maintenance flow package of a server is determined based on an operation and maintenance flow rule base. The operation and maintenance flow rule base stores operation and maintenance flows corresponding to the server and command controls required to be called for executing the operation and maintenance flows. Then determining an execution strategy of the operation and maintenance flow packet; and finally, executing the operation and maintenance flow package based on the execution strategy. Therefore, the automatic operation and maintenance work of the server is replaced by the automatic technology, for example, the automatic operation and maintenance work of the server is replaced by the RRA technology, and the operation and maintenance flow of the server is executed on the basis of the DBUS interface with certain universality, so that the efficiency and the universality of the operation and maintenance of the server are improved.

Description

Automatic operation and maintenance method for server and related products
Technical Field
The application relates to the technical field of server operation and maintenance, in particular to a server automation operation and maintenance method and related products.
Background
The various industries are continuously digitalized, and meanwhile, the scale of the information technology (Internet Technology, IT) digital center is becoming larger and larger, and the functions are becoming more and more obvious. Therefore, the complexity and difficulty of the operation and maintenance of the server are greatly increased, and how to better realize the operation and maintenance of the server is of great importance.
The conventional server operation and maintenance work is generally performed manually, and the method relies on technicians to develop each server operation and maintenance work in a targeted manner, so that the method has certain universality, but the problem that the universality and the efficiency of the server operation and maintenance cannot be considered in the prior art due to the great increase of the complexity and the difficulty of the server operation and maintenance work.
Therefore, how to improve the efficiency of server operation and maintenance while ensuring the versatility of server operation and maintenance is a problem that needs to be solved by those skilled in the art.
Disclosure of Invention
Based on the above problems, the embodiments of the present application provide a server automation operation and maintenance method and related products, which replace manual server operation and maintenance work based on an automation means, and can consider the universality and efficiency of server operation and maintenance.
In a first aspect, an embodiment of the present application provides a server automation operation and maintenance method, including:
determining an operation and maintenance flow package of the server based on the operation and maintenance flow rule base; the operation and maintenance flow rule base stores an operation and maintenance flow corresponding to the server and a command control which needs to be called when the operation and maintenance flow is executed; determining an execution strategy of the operation and maintenance flow packet; and executing the operation and maintenance flow package based on the execution strategy. Therefore, the RPA technology is used for replacing the manual thought, an operation and maintenance flow rule base is constructed, an operation and maintenance flow package of the server is determined based on the operation and maintenance flow rule base, and finally the operation and maintenance flow package is executed based on a corresponding execution strategy, so that the automatic operation and maintenance of the server are realized. The RPA robot replaces manual work, so that the operation and maintenance efficiency of the server is improved, and in addition, the DBUS interface has certain universality.
In one possible implementation manner, before determining the operation and maintenance flow package of the server based on the operation and maintenance flow rule base, the method further includes: constructing operation and maintenance flow standards based on the execution steps corresponding to the operation and maintenance work of each server; constructing operation and maintenance action standards based on operation methods corresponding to the operation and maintenance work of each server; constructing an operation control library based on the operation flow standard and the operation action standard; the operation control library comprises the following components: DBUS command controls and operation class controls; and constructing an operation and maintenance flow rule base based on the operation and maintenance flow standard and the operation and maintenance operation control base. For this reason, the embodiment of the application introduces the operation control library for storing DBUS command controls and operation class controls required by executing operation work of each server, and then constructs the operation flow rule library based on the operation control library and execution steps corresponding to the operation work of each server, and improves the universality of the operation and the maintenance of the server by relating to the breadth of the operation and the maintenance work of the server and the universality of the DBUS.
In one possible implementation manner, the constructing an operation and maintenance action standard based on the operation method corresponding to the operation and maintenance work of each server includes: splitting operation methods corresponding to the operation and maintenance work of each server into associated first atomic operations respectively; a normalized execution method is formulated for the first atomic operation, and the execution method is set as an operation and maintenance action standard corresponding to the first atomic operation; the execution method comprises the following steps: the input, output, corresponding DBUS commands, and standardized enforcement rules of the first atomic operation. According to the embodiment of the application, the operation method corresponding to each server operation and maintenance work is split into the minimum actions which can be independently executed, and a normalized execution method is formulated for the minimum actions, namely, the server operation and maintenance action standard is defined, so that an operation and maintenance flow rule base constructed later is finer and more accurate, and the accuracy of automatic operation and maintenance of the server is improved.
In one possible implementation manner, the building an operation control library based on the operation flow standard and the operation action standard includes: developing a corresponding DBUS command control for the first atomic operation based on the operation flow standard and the operation action standard; based on the operation flow standard and the operation action standard, developing operation class controls for repeated class operations in operation and maintenance work of each server; and constructing an operation control library based on the DBUS command control and the operation control class control. According to the embodiment of the application, the DBUS command control is developed based on the refined operation flow standard and operation action standard, and meanwhile, the server operation control is developed for the repeated operation, so that an operation control library constructed based on the DBUS command control and the operation control is more accurate and careful, and the accuracy and the universality of the automatic operation of the server are improved.
In one possible implementation manner, the developing a corresponding DBUS command control for the first atomic operation based on the operation flow standard and the operation action standard includes: searching a connection name corresponding to the first atomic operation from a DBUS interface definition table based on the operation flow standard and the operation action standard; splicing the first atomic operation with the corresponding DBUS command based on the connection name; and based on an RPA design platform, carrying out standardized encapsulation on the first atomic operation spliced with the DBUS command to form a DBUS command control corresponding to the first atomic operation. According to the embodiment of the application, the DBUS interface definition table is introduced, DBUS commands corresponding to the first atomic operation are searched from the DBUS interface definition table and spliced and packaged, the operation control library is constructed based on the DBUS commands, and the flow package of the RPA robot is developed, so that the operation and maintenance efficiency and the universality of the server are improved.
In one possible implementation manner, the developing an operation class control for the repetitive class operation in the operation and maintenance work of each server based on the operation and maintenance flow standard and the operation and maintenance action standard includes: based on the operation and maintenance flow standard and the operation and maintenance action standard, converting repeated class operation in operation and maintenance work of each server into second atomic operation; and based on an RPA design platform, carrying out standardized encapsulation on the second atomic operation to form an operation and maintenance operation class control corresponding to the second atomic operation. According to the embodiment of the application, the server operation and maintenance operation type controls are packaged for the repeated type operations in the operation and maintenance work of each server, an operation and maintenance operation control library is constructed based on the operation and maintenance operation control, and the flow package of the RPA robot is developed, so that the efficiency and the universality of the operation and maintenance of the server are improved.
In one possible implementation manner, the constructing an operation and maintenance flow rule base based on the operation and maintenance flow standard and the operation and maintenance control base includes: based on the operation and maintenance flow standard, exploring controls required by operation and maintenance work of each server from the operation and maintenance operation control library; arranging the control according to the operation and maintenance flow standard to form an RPA business flow; and constructing an operation and maintenance flow rule base based on the RPA business flow. According to the embodiment of the application, the comprehensive operation and maintenance operation control library covering the control is introduced, and when the operation and maintenance flow rule library is constructed, the control is explored from the operation and maintenance flow rule library according to operation and maintenance flow standards and is arranged, so that the construction of the operation and maintenance flow rule library is more accurate and efficient, and meanwhile, the accuracy and the high efficiency of the automatic operation and maintenance of the server are improved.
In one possible implementation manner, the executing the operation and maintenance flow packet based on the execution policy includes: operating the operation and maintenance flow package based on a scheduling strategy; if no abnormality occurs in the whole operation process, realizing the automatic operation and maintenance of the server; if abnormality occurs in the running process, the running process is adjusted according to the failure strategy, and the automatic running and maintenance of the server are realized. According to the embodiment of the application, the execution strategy comprising the scheduling strategy and the failure strategy is introduced, the situation that the RPA robot possibly happens when running the flow package is assumed, and different coping methods are set for different situations, so that the flexibility and the high efficiency of the automatic operation and maintenance of the server are improved.
In a second aspect, the present application provides a server automation operation and maintenance device, including: the first determining module is used for determining an operation and maintenance flow packet of the server based on the operation and maintenance flow rule base; the operation and maintenance flow rule base stores an operation and maintenance flow corresponding to the server and a command control which needs to be called when the operation and maintenance flow is executed; the second determining module is used for determining an execution strategy of the operation and maintenance flow packet; and the execution module is used for executing the operation and maintenance flow packet based on the execution strategy.
In a third aspect, the present application provides a server comprising: a memory for storing a computer program; and the processor is used for realizing the steps of the server automation operation and maintenance method when executing the computer program.
From the above technical solution, compared with the prior art, the present application has the following advantages:
the method comprises the steps of firstly determining an operation and maintenance flow package of a server based on an operation and maintenance flow rule base. The operation and maintenance flow rule base stores operation and maintenance flows corresponding to the server and command controls required to be called for executing the operation and maintenance flows. Then determining an execution strategy of the operation and maintenance flow packet; and finally, executing the operation and maintenance flow package based on the execution strategy. Therefore, the automatic operation and maintenance work of the server is performed based on an automatic means instead of a manual work, for example, the PRA technology is used instead of a manual work, and the efficiency and the universality of the operation and maintenance of the server are improved by widely relating to the operation and maintenance work of the server and executing the operation and maintenance process of the server based on a DBUS interface with certain universality.
Drawings
Fig. 1 is a flowchart of a server automation operation and maintenance method provided in an embodiment of the present application;
fig. 2 is a flowchart of an RPA robot execution failure policy provided in an embodiment of the present application;
fig. 3 is a schematic structural diagram of an automatic operation and maintenance device for a server according to an embodiment of the present application;
fig. 4 is a schematic structural diagram of a server according to an embodiment of the present application.
Detailed Description
As described above, the existing server operation and maintenance method cannot simultaneously consider the versatility and efficiency of server operation and maintenance. In particular, various industries are continuously digitalized and transformed, meanwhile, the scale of an IT digital center becomes larger and larger, the function of the IT digital center is more and more obvious, and the IT digital center becomes the core competitiveness affecting the sustainable development of enterprises. However, the existing server operation and maintenance work is generally performed manually, and as the complexity and difficulty of the server operation and maintenance work are increased, the manual operation and maintenance cost and the error rate of the manual operation and maintenance become higher. Although the method for artificial operation and maintenance can depend on technicians to develop operation and maintenance work of each server in a targeted manner, the method has certain universality, and the problem that the universality and the efficiency of the operation and the maintenance of the server cannot be considered in the prior art due to the great increase of the complexity and the difficulty of the operation and the maintenance work of the server.
In order to solve the above problems, the present application provides a server automation operation and maintenance method, including: first, an operation and maintenance flow package of a server is determined based on an operation and maintenance flow rule base. The operation and maintenance flow rule base stores operation and maintenance flows corresponding to the server and command controls required to be called for executing the operation and maintenance flows. Then determining an execution strategy of the operation and maintenance flow packet; and finally, executing the operation and maintenance flow package based on the execution strategy.
Thus, the automatic operation and maintenance work of the server is performed based on an automation technology instead of manual work, for example, a robot process automation (Robotic Process Automation, RPA) technology instead of manual work, and the efficiency and the universality of the operation and maintenance of the server are improved by the process of executing the operation and maintenance of the server based on a DBUS interface with a certain universality and widely related to the operation and maintenance work of the server.
It should be noted that the server automation operation and maintenance method and the related products provided by the application can be applied to the technical field of server operation and maintenance. The foregoing is merely an example, and is not intended to limit the application fields of a server automation operation and maintenance method and related products provided in the present application.
For the purpose of making the objects, technical solutions and advantages of the embodiments of the present application more clear, the technical solutions of the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is apparent that the described embodiments are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by one of ordinary skill in the art without undue burden from the present disclosure, are within the scope of the present disclosure.
In practical application, the problem that the universality and the efficiency of server operation and maintenance cannot be considered in the prior art is solved. Therefore, the embodiment of the application provides a server automation operation and maintenance method, which replaces manual operation based on an automation technology, replaces manual operation based on a PRA technology, and executes a flow of server operation and maintenance through a DBUS interface which relates to the wide range of the server operation and maintenance operation and has a certain universality, thereby improving the efficiency and the universality of the server operation and maintenance.
The RPA is a software technology, and can enable a software robot to perform operations such as data input, extraction, verification and the like among different systems like a person. The RPA is a program for replacing human staff to implement highly repetitive work based on rules, and is a process processing machine which is not physically present, and is also called as a 'digital staff', and the RPA robot has the characteristics of quick and flexible deployment, timely response to various business demands, great improvement of the operation efficiency of a business process and no need of manual participation.
DBUS is an inter-process communication (interprocess communication, IPC) mechanism optimized for desktop environments for inter-process communication or process-to-kernel communication. The most basic D-Bus protocol is a one-to-one communication protocol.
Fig. 1 is a flowchart of a server automation operation and maintenance method provided in an embodiment of the present application. Referring to fig. 1, a method for automatically operating and maintaining a server according to an embodiment of the present application may include:
s101: determining an operation and maintenance flow package of the server based on the operation and maintenance flow rule base; and the operation and maintenance flow rule base stores an operation and maintenance flow corresponding to the server and a command control which needs to be called when the operation and maintenance flow is executed.
Specifically, the embodiment of the application constructs an operation and maintenance flow rule base, and the operation and maintenance flow of operation and maintenance work of each server and a specific implementation mode are stored in the operation and maintenance flow rule base, wherein the specific implementation mode is a DBUS command control required by executing the operation and maintenance flow. The server automation operation and maintenance device provided by the embodiment of the application can be an RPA robot, and for the RPA robot, the corresponding operation and maintenance flow and DBUS command control are required to be obtained from the operation and maintenance flow rule base in advance according to the difference of operation and maintenance work of the server, and then are combined into an operation and maintenance flow packet (flow packet+version number form) to be stored. Therefore, when the server is operated, the corresponding operation and maintenance flow package can be determined according to the specific operation and maintenance work of the server.
The server operation and maintenance work comprises: power-on and power-off operations, network configuration delivery, component management, version upgrade, user management, and the like. For example, in the form of a flow packet+version number, the operation and maintenance flow packet may be a version upgrade flow packet with a version number of 001.
In some embodiments, in order to better control the RPA robot to run the corresponding operation and maintenance flow package, some policies may also need to be formulated for the RPA robot in combination with situations that may occur during the running process. Specifically, an execution policy including a scheduling policy and a failure policy is formulated for the RPA robot. The scheduling strategy controls the RPA robot to schedule according to which plan, for example, on-time execution, periodic execution, cyclic execution, and the like. The failure strategy is to control how to process the RPA robot when the RPA robot encounters an abnormal situation in the running process, such as the number of times of repeated execution of failure, log information related to failure feedback, and the like. Taking version upgrading as an example, a periodically executed scheduling strategy is adopted, and the failure strategy is set as a control (DBUS command control or operation and maintenance operation type control) which is in error when the RPA robot fails to execute, until the execution is successful or the execution times are more than 10 times. And finally, controlling the RPA robot operation flow packet based on the execution strategy to realize the automatic operation and maintenance of the operation and maintenance work of the current server.
The ways of constructing the operation and maintenance flow rule base are different, and a possible construction way is described below.
In one case, before determining the operation and maintenance flow package of the server based on the operation and maintenance flow rule base, the method further comprises:
constructing operation and maintenance flow standards based on the execution steps corresponding to the operation and maintenance work of each server; constructing operation and maintenance action standards based on operation methods corresponding to the operation and maintenance work of each server;
constructing an operation control library based on the operation flow standard and the operation action standard; the operation control library comprises the following components: DBUS command controls and operation class controls;
and constructing an operation and maintenance flow rule base based on the operation and maintenance flow standard and the operation and maintenance operation control base.
In practical application, the embodiment of the application utilizes the architecture of the executing steps and the operating methods of the existing manual operation and maintenance work of each server, adjusts the manual execution to be executed by the RPA robot, and constructs the operation and maintenance flow standard and the operation and maintenance action standard which can be understood by the RPA robot based on the manual execution. Taking the operation and maintenance work of a server with version upgrading as an example, if manual execution steps in the prior art are starting, logging in a network manager, confirming a version, starting upgrading and confirming a result in sequence, taking the execution steps as operation and maintenance flow standards corresponding to the version upgrading of the RPA robot; if the operation method corresponding to the starting step is "according to a preset scheduling scheme, the upgrade program is started periodically", and the preset scheduling scheme may be once a week, etc., the operation and maintenance action standard is constructed according to the operation method. Thus, the operation method and the execution steps of the common operation flow in the operation and maintenance work of each server are all constructed into operation and maintenance action standards and operation and maintenance flow standards which can be understood and used by the RPA robot. After that, based on the operation and maintenance action standard and the operation and maintenance flow standard, constructing corresponding DBUS control and operation and maintenance control class control, and storing in an operation and maintenance control library. The operation control library can be thought of as a database containing all the execution controls required for server operation. And finally, according to the operation and maintenance work of each server, searching corresponding controls from an operation and maintenance control library, and packaging and storing the controls in sequence, so that an operation and maintenance flow rule library is formed, and an operation and maintenance flow corresponding to each operation and maintenance work of the server and a command control which needs to be called for executing the operation and maintenance flow are stored in the operation and maintenance flow rule library.
In addition, the operation and maintenance operation standards are constructed differently, and a possible construction method is described below.
In one case, constructing an operation and maintenance action standard based on the operation methods corresponding to the operation and maintenance works of the servers includes:
splitting operation methods corresponding to the server operation and maintenance work into associated first atomic operations respectively;
a normalized execution method is formulated for the first atomic operation, and the execution method is set as an operation and maintenance action standard corresponding to the first atomic operation;
the execution method comprises the following steps: the input, output, corresponding DBUS commands, and standardized enforcement rules of the first atomic operation.
In practical application, in order to make the operation and maintenance action standard finer, in the embodiment of the present application, a standard operation method commonly used in operation and maintenance work of each server is split into associated atomic operations, i.e. first atomic operations, it can be understood that each atomic operation is a minimum action that can be independently executed. The first atomic operation is only referred to herein to distinguish that the atomic operation corresponds to the execution method. Then, in order to make the RPA robot read and understand a specific operation method, a normalized execution method needs to be formulated for all the atomic operations, and specific input and output, corresponding DBUS commands and standardized implementation rules are set for each atomic operation. And finally, taking the normalized execution method of each atomic operation as the operation and maintenance action standard corresponding to the atomic operation, thereby realizing the construction of the operation and maintenance action standard. Taking version upgrading as an example, the operation method corresponding to the starting upgrading step may be "call DBUS command control to confirm and download version package, automatically download matched update package", split the operation method to obtain atomic operations such as calling DBUS, version package confirmation and version package downloading. An enumeration of DBUS commands may be that "DBUS-daemon" is used to initiate and manage all D-Bus sessions in the system; "qdbus" is used to query or invoke methods and properties, etc., on the D-Bus object.
In addition, since the manners of constructing the operation control library are different, one possible construction manner is described below.
In one case, constructing an operation control library based on the operation flow criteria and the operation action criteria includes:
developing a corresponding DBUS command control for the first atomic operation based on the operation flow standard and the operation action standard;
based on the operation flow standard and the operation action standard, developing operation class controls for repeated class operations in operation and maintenance work of each server;
and constructing an operation control library based on the DBUS command control and the operation control class control.
In practical application, the constructed operation control library is used for storing execution controls required by all server operation and maintenance works in an automatic operation process. The execution control is divided into a DBUS command control and a server operation class control. Among other things, DBUS is an inter-process communication (interprocess communication, IPC) mechanism optimized for desktop environments for inter-process communication or process-to-kernel communication. DBUS is generally divided into two types, system bus (system bus) for communication and messaging between systems (Linux) and user programs; session bus (session bus) for communication between user programs such as desktop. Specifically, in order to make the operation and maintenance action standard finer, the corresponding DBUS command control and operation and maintenance operation class control are developed based on RPA on the basis of the operation and maintenance flow standard and operation and maintenance action standard corresponding to the atomic operation. And finally, storing all the controls in an operation control library, thereby realizing the construction of the operation control library.
In addition, since the manners of developing DBUS command controls are different, one possible development manner is described below.
In one case, it is directed to how to develop a DBUS command control. Correspondingly, the developing a corresponding DBUS command control for the first atomic operation based on the operation flow standard and the operation action standard includes:
searching a connection name corresponding to the first atomic operation from a DBUS interface definition table based on the operation flow standard and the operation action standard;
splicing the first atomic operation with the corresponding DBUS command based on the connection name;
and based on an RPA design platform, carrying out standardized encapsulation on the first atomic operation spliced with the DBUS command to form a DBUS command control corresponding to the first atomic operation.
In practical application, in combination with the concepts of the operation flow standard and the operation action standard explained in the above examples, when the application develops a DBUS command control corresponding to an atomic operation, the application first determines the atomic operation needed to construct the DBUS command control. And then searching the connection names corresponding to the atomic operations from the DBUS interface definition table based on the DBUS interface definition table. Taking version upgrade as an example, two atomic operations of "network time protocol (Network Time Protocol, NTP) enabled status on" and "set NTP server" need to define corresponding DBUS command controls, first look up the connection Name (Bus Name) corresponding to "NTP enabled status on" and "set NTP server" in the DBUS interface definition table, including a public Name (well-knownnames) and a unique Name (Unique Connection Name), for determining a connection applied to the message Bus. Thus, the corresponding unique path and interface methods of the two atomic operations of the NTP enabling state opening and the NTP setting server are determined, and the DBUS commands corresponding to the two atomic operations of the NTP enabling state opening and the NTP setting server are spliced based on the unique path and the interface methods. And finally, standardized packaging is carried out on the spliced DBUS commands based on the RPA design platform, so that two atomic operations, namely 'NTP enabled state on' and 'NTP server setting', are formed to correspond to DBUS command controls. And so on until all the atomic operations involved in the server operation and maintenance work that require the construction of the DBUS command control are constructed in this way to complete the corresponding DBUS command control.
In addition, since the manners of developing the server operation class control are different, one possible development manner will be described below.
In one case, developing an operation class control for repeated class operations in operation and maintenance work of each server based on the operation flow standard and the operation action standard includes:
converting repeated class operation in operation and maintenance work of each server in use into second atomic operation based on the operation and maintenance flow standard and the operation and maintenance action standard;
and based on an RPA design platform, carrying out standardized encapsulation on the second atomic operation to form an operation and maintenance operation class control corresponding to the second atomic operation.
In practical applications, the operation method involved in the business process of each server operation and maintenance work can be split into a plurality of atomic operations. In order to facilitate the RPA robot to read and understand the atomic operations, a normalized execution method needs to be formulated for the atomic operations, that is, an operation and maintenance flow standard and an operation and maintenance action standard corresponding to the atomic operations are constructed. However, in order for the RPA robot to actually perform these atomic operations, corresponding execution controls need to be formulated for these atomic operations. Specifically, for the atomic operation needed to construct the DBUS command control, the corresponding DBUS command control is constructed through the method given above. And for some other repetitive classes of operation methods we convert and define them as second atomic operations. The second atomic operation is only referred to herein to distinguish that the atomic operation corresponds to the execution method. And then, carrying out standardized encapsulation on the second atomic operation based on the RPA design platform so as to form a corresponding server operation and maintenance operation class control. It should be noted that the operation type controls include, but are not limited to, various types of peripheral (mouse, keyboard, etc.) input, information capture, feature alignment, and data transmission. Taking version upgrade as an example, two steps of 'login network management' and 'version determination' are considered as repeated class operation, and then the specific operation method corresponding to 'login network management' is as follows: the method comprises the steps of setting an automatic execution period to read a user name and a password to log in a network management system, converting the user name and the password into a second atomic operation, and then packaging the second atomic operation according to the method, so that a corresponding login network management control is obtained; the specific operation method corresponding to the version determination is as follows: and (3) according to the current version number of the environment, confirming whether the target version needing upgrading exists or not, converting the target version into a second atomic operation, and then packaging according to the method, so that a corresponding confirmed version control is obtained. It can be appreciated that both the "login network management control" and the "confirm version control" belong to the server operation class control.
In addition, since the ways of constructing the operation and maintenance flow rule base are different, one possible construction way is described below.
In one case, constructing an operation and maintenance flow rule base based on the operation and maintenance flow standard and the operation and maintenance operation control base includes:
based on the operation and maintenance flow standard, exploring controls required by operation and maintenance work of each server from the operation and maintenance operation control library;
arranging the control according to the operation and maintenance flow standard to form an RPA business flow;
and constructing an operation and maintenance flow rule base based on the RPA business flow.
In practical application, the operation and maintenance flow standard prescribes the execution sequence of all execution actions corresponding to the operation and maintenance work of each server. And the operation control library stores DBUS command controls and operation control types required by executing operation and maintenance work of each server. When the operation and maintenance flow rule base is constructed, firstly, the operation flow of the operation and maintenance work of the current server is determined according to operation and maintenance flow standards, and then controls corresponding to all execution actions are explored from the operation and maintenance operation control base based on all execution actions involved in the operation flow. And then arranging the controls according to the operation and maintenance flow standard, thereby forming a service flow based on the RPA robot, and constructing an operation and maintenance flow rule base based on the RPA service flow. For example, there is a business process a required by a certain usage scenario, that is, the operation and maintenance process standard is ordered into login baseboard management controller (Baseboard Management Controller, BMC), collection log, feature comparison and result transmission according to the execution sequence, then the server automation operation and maintenance device based on the DBUS interface searches the controls required by the execution actions of login BMC, collection log, feature comparison and result transmission from the operation and maintenance operation control library when constructing the operation and maintenance process rule library, and then the controls are organized according to the execution sequence of the business process a, so as to form the RPA business process corresponding to the RPA robot. Thus, the RPA business processes corresponding to the RPA robots are generated by the method for the business processes corresponding to all the use scenes, and then all the RPA business processes are sorted and stored, so that the construction of the operation and maintenance process rule base is realized.
S102: and determining an execution strategy of the operation and maintenance flow packet.
In practical application, the construction of the operation and maintenance flow rule base is the basis for realizing the automatic operation and maintenance of the server by utilizing the RPA robot, wherein the business flows corresponding to the operation and maintenance work of each server and all inter-process communication mechanisms DBUS command controls and operation and maintenance operation type controls required by executing the business flows are stored. When the server operates and maintains, when a specific scene of the RPA robot is given, the RPA robot can search from an operation and maintenance flow rule base, and then a set of RPA business flow corresponding to the current scene is extracted and executed, so that the automatic operation and maintenance of the server are realized. The operation and maintenance flow packages of the RPA robots corresponding to the operation and maintenance work of different servers can be all generated and pre-stored according to the method, so that the corresponding operation and maintenance flow packages can be directly searched when the operation and maintenance work of the RPA robots in the operation and maintenance flow packages of the servers in the corresponding scenes is realized, each generation is not needed, and the operation and maintenance efficiency of the servers is improved. The method and the device for executing the flow packet corresponding to the scene also need to be preset, specifically, in the embodiment of the application, an execution strategy is formulated for the RPA robot in advance, and different execution strategies aim at different operation and maintenance scenes. The execution policy includes a scheduling policy and a failure policy, where the scheduling policy refers to a schedule according to which the RPA robot performs scheduling, for example, on-time execution, periodic execution, loop execution, and the like. The failure strategy refers to a processing scheme under abnormal conditions in the running process of the RPA robot, such as the number of repeated execution times of failure, log information related to failure feedback and the like. Taking a version-up server operation and maintenance scenario as an example, a scheduling policy which is executed periodically is adopted.
S103: and executing the operation and maintenance flow package based on the execution strategy.
In practical application, an execution strategy corresponding to the RPA robot is preset for each different server operation and maintenance working scene. When the server operation and maintenance is carried out, firstly, specific server operation and maintenance work is determined, then the RPA robot is controlled to operate a flow packet which is previously developed and corresponds to the server operation and maintenance work according to a corresponding execution strategy, and therefore the automatic operation and maintenance of the current server operation and maintenance work is achieved.
In addition, since the ways of controlling the RPA robot to run the operation and maintenance flow packet are different, a possible control manner will be described below.
In one case, based on the execution policy, executing the operation and maintenance flow packet specifically includes:
operating the operation and maintenance flow package based on a scheduling strategy;
if no abnormality occurs in the whole operation process, realizing the automatic operation and maintenance of the operation and maintenance work of the server;
if abnormality occurs in the running process, the running process is adjusted according to the failure strategy, and the automatic running and maintenance of the server are realized.
In practical application, each different server operation and maintenance scene is developed with a specific RPA robot flow package and a corresponding version number. When a specific server operation and maintenance working scene is determined, the RPA robot operates an operation and maintenance flow packet which is previously developed and corresponds to the server operation and maintenance working according to a corresponding execution strategy, so that the automatic operation and maintenance of the current server operation and maintenance working is realized. Taking the version-upgraded server operation and maintenance scene as an example, a scheduling strategy which is executed periodically is adopted, and if no abnormal operation condition occurs in the operation process of the whole operation and maintenance flow packet, the operation of the flow packet is completed, and the automatic operation and maintenance of the operation and maintenance work of the current server is realized. If an abnormal operation occurs in the operation process of the one-time operation and maintenance flow packet, the operation is adjusted based on a failure strategy. Taking the version-up server operation and maintenance scenario as an example, the failure policy may be set such that after the RPA robot fails to execute, the RPA robot may re-execute the control with the error until the execution is successful, or the execution number exceeds 10 times. Fig. 2 is a flowchart of an RPA robot execution failure policy provided in an embodiment of the present application. In connection with fig. 2, taking a version-up server operation and maintenance scenario as an example, when an RPA automatic upgrade robot has abnormal operation, that is, when the execution of a flow packet fails, an error is reported and a specific failure control is located, and the control is re-executed according to preset times. If the execution result of the control is successful within ten times, the subsequent control in the service flow is continuously executed. If the failure times are more than ten times, ending the flow, and running the log of the mobile phone RPA robot.
In summary, the present application first determines an operation and maintenance flow package of a server based on an operation and maintenance flow rule base. The operation and maintenance flow rule base stores operation and maintenance flows corresponding to the server and command controls required to be called for executing the operation and maintenance flows. Then determining an execution strategy of the operation and maintenance flow packet; and finally, executing the operation and maintenance flow package based on the execution strategy. Therefore, the RPA technology is used for replacing the manual automatic operation and maintenance work of the server, and the efficiency and the universality of the operation and maintenance of the server are improved by the wide operation and maintenance work of the server and the execution of the operation and maintenance process of the server based on the DBUS interface with certain universality.
Based on the server automation operation and maintenance method provided by the embodiment, the embodiment of the application also provides a server automation operation and maintenance device. The server automation operation and maintenance device will be described with reference to the embodiments and drawings, respectively.
Fig. 3 is a schematic structural diagram of an automatic operation and maintenance device for a server according to an embodiment of the present application. Referring to fig. 3, a server automation operation and maintenance device 200 provided in an embodiment of the present application includes:
a first determining module 201, configured to determine an operation and maintenance flow packet of the server based on the operation and maintenance flow rule base; the operation and maintenance flow rule base stores an operation and maintenance flow corresponding to the server and a command control which needs to be called when the operation and maintenance flow is executed;
A second determining module 202, configured to determine an execution policy of the operation and maintenance flow packet;
and the execution module 203 is configured to execute the operation and maintenance flow packet based on the execution policy. As an embodiment, the server automation operation and maintenance device 200 further includes: the device comprises a first building module, a second building module, a third building module and a fourth building module;
the first construction module is used for constructing operation and maintenance flow standards based on execution steps corresponding to operation and maintenance work of each server;
the second construction module is used for constructing operation and maintenance action standards based on operation methods corresponding to the operation and maintenance work of each server;
the third construction module is used for constructing an operation control library based on the operation flow standard and the operation action standard; the operation control library comprises the following components: DBUS command controls and operation class controls;
and the fourth construction module is used for constructing an operation and maintenance flow rule base based on the operation and maintenance flow standard and the operation and maintenance operation control base.
As an embodiment, the second building module is specifically configured to:
splitting operation methods corresponding to the operation and maintenance work of each server into associated first atomic operations respectively;
A normalized execution method is formulated for the first atomic operation, and the execution method is set as an operation and maintenance action standard corresponding to the first atomic operation;
the execution method comprises the following steps: the input, output, corresponding DBUS commands, and standardized enforcement rules of the first atomic operation.
As an embodiment, the third building module, for how to build the operation control library, includes: the system comprises a first development sub-module, a second development sub-module and a third construction sub-module;
the first development submodule is used for developing a corresponding DBUS command control for the first atomic operation based on the operation flow standard and the operation action standard;
the second development sub-module is used for developing server operation and maintenance operation class controls for repeated class operations in the operation and maintenance work of each server in use based on the operation and maintenance flow standard and the operation and maintenance action standard;
and the third construction sub-module is used for constructing an operation control library based on the DBUS command control and the operation control class control.
As an embodiment, the first development submodule is specifically configured to develop a corresponding DBUS command control for a first atomic operation:
Searching a connection name corresponding to the first atomic operation from a DBUS interface definition table based on the operation flow standard and the operation action standard;
splicing the first atomic operation with the corresponding DBUS command based on the connection name;
and based on an RPA design platform, carrying out standardized encapsulation on the first atomic operation spliced with the DBUS command to form a DBUS command control corresponding to the first atomic operation.
As an implementation manner, the second development sub-module is specifically configured to develop a server operation class control for a repetitive class operation in each server operation job:
based on the operation and maintenance flow standard and the operation and maintenance action standard, converting repeated class operation in operation and maintenance work of each server into second atomic operation;
and based on an RPA design platform, carrying out standardized encapsulation on the second atomic operation to form a server operation class control corresponding to the second atomic operation.
As an implementation manner, the fourth building module is specifically configured to:
based on the operation and maintenance flow standard, exploring controls required by operation and maintenance work of each server from the operation and maintenance operation control library;
Arranging the control according to the operation and maintenance flow standard to form an RPA business flow corresponding to the RPA operation and maintenance robot;
and constructing an operation and maintenance flow rule base based on the RPA business flow.
As an embodiment, the execution module 203 is specifically configured to:
operating the operated and maintained flow package based on a scheduling strategy;
if no abnormality occurs in the whole operation process, realizing the automatic operation and maintenance of the server;
if abnormality occurs in the running process, the running process is adjusted according to the failure strategy, and the automatic running and maintenance of the server are realized.
In summary, the present application first determines an operation and maintenance flow package of a server based on an operation and maintenance flow rule base. The operation and maintenance flow rule base stores operation and maintenance flows corresponding to the server and command controls required to be called for executing the operation and maintenance flows. Then determining an execution strategy of the operation and maintenance flow packet; and finally, executing the operation and maintenance flow package based on the execution strategy. Therefore, the server operation and maintenance work based on the RPA technology replaces manual operation, and the efficiency and the universality of the server operation and maintenance are improved by the process that the server operation and maintenance work is widely related and the server operation and maintenance are executed based on the DBUS interface with certain universality.
In addition, the application also provides a server, which comprises: a memory for storing a computer program; and the processor is used for realizing the steps of the server automation operation and maintenance method when executing the computer program.
In addition, the application further provides a readable storage medium, and the readable storage medium stores a computer program, and the computer program realizes the steps of the server automation operation and maintenance method when being executed by a processor.
Fig. 4 is a schematic diagram of a server structure provided in an embodiment of the present application, and, with reference to fig. 4, a server 400 includes a processor 410 and a memory 420; wherein the memory 420 stores computer instructions, the processor 410 is configured to execute the computer instructions, so that the server 400 performs the server automation operation and maintenance method based on the DBUS interface as shown above.
In some embodiments, the processor 410 may be a CPU, but may also be other general purpose processors, digital signal processors (digital signal processor, DSPs), application specific integrated circuits (application specific integrated circuit, ASICs), field programmable gate arrays (fieldprogrammable gate array, FPGAs), or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. The general purpose processor may be a microprocessor, but in the alternative, it may be any conventional processor.
In some embodiments, memory 420 may be a volatile memory (volatile memory) or a non-volatile memory, such as a register or the like. Specifically, the volatile memory refers to a memory in which data stored therein is lost when power supply is interrupted. Among them, volatile memories are mainly random access memories (random access memory, RAM), including Static RAM (SRAM) and Dynamic RAM (DRAM). The nonvolatile memory is a memory in which data stored therein is not lost even if power supply is interrupted. Common nonvolatile memories include Read Only Memory (ROM), optical discs, magnetic disks, solid state disks, and various memory cards based on flash memory technology.
In some embodiments, memory 420 has executable code that memory 420 executes to implement a server automated operation and maintenance method based on a DBUS interface.
The communication interface 430 is used for communicating with the outside, for example, the communication interface 430 is used as an external interface, the external interface receives a request for creating a virtual machine, and various parameters required for creating the virtual machine can also be received through the communication interface 430.
The bus may be a peripheral component interconnect (Peripheral Component Interconnect, PCI) bus, or an extended industry standard architecture (extended industry standard architecture, eisa) bus, among others. The buses may be divided into address buses, data buses, control buses, etc. For ease of understanding, fig. 4 is represented by only one bold line, but does not represent only one bus or one type of bus.
It should be understood that the above servers are only schematic representations, and in actual use, the above servers may be independent physical servers, or may be server clusters or distributed systems formed by a plurality of physical servers, or may be cloud servers that provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, cloud communication, middleware services, domain name services, security services, content delivery networks (Content Delivery Network, CDN), and basic cloud computing services such as big databases and artificial intelligence platforms. When the server is a server cluster formed by a plurality of physical servers, or a distributed system, the plurality of physical servers may form a blockchain, and each physical server is a node on the blockchain. The physical type of the service area can be a rack server, a high-density server, a GPU server, a tower server, a blade server, a whole cabinet server and the like, and the application is not particularly limited.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the application. Thus, the present application is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (10)

1. A method for server automation operations, the method comprising:
determining an operation and maintenance flow package of the server based on the operation and maintenance flow rule base; the operation and maintenance flow rule base stores an operation and maintenance flow corresponding to the server and a command control which needs to be called when the operation and maintenance flow is executed;
determining an execution strategy of the operation and maintenance flow packet;
and executing the operation and maintenance flow package based on the execution strategy.
2. The method of claim 1, wherein prior to determining the operation and maintenance flow package for the server based on the operation and maintenance flow rule base, further comprising:
constructing operation and maintenance flow standards based on the execution steps corresponding to the operation and maintenance work of each server; constructing operation and maintenance action standards based on operation methods corresponding to the operation and maintenance work of each server;
Constructing an operation control library based on the operation flow standard and the operation action standard; the operation control library comprises the following components: DBUS command controls and operation class controls;
and constructing an operation and maintenance flow rule base based on the operation and maintenance flow standard and the operation and maintenance operation control base.
3. The method according to claim 2, wherein the constructing operation and maintenance action criteria based on the operation methods corresponding to the operation and maintenance works of the respective servers includes:
splitting operation methods corresponding to the operation and maintenance work of each server into associated first atomic operations respectively;
a normalized execution method is formulated for the first atomic operation, and the execution method is set as an operation and maintenance action standard corresponding to the first atomic operation;
the execution method comprises the following steps: the input, output, corresponding DBUS commands, and standardized enforcement rules of the first atomic operation.
4. A method according to claim 3, wherein said building an operation control library based on said operation flow criteria and said operation action criteria comprises:
developing a corresponding DBUS command control for the first atomic operation based on the operation flow standard and the operation action standard;
Based on the operation flow standard and the operation action standard, developing operation class controls for repeated class operations in operation and maintenance work of each server;
and constructing an operation control library based on the DBUS command control and the operation control class control.
5. The method of claim 4, wherein developing a corresponding DBUS command control for the first atomic operation based on the operation flow criteria and the operation action criteria comprises:
searching a connection name corresponding to the first atomic operation from a DBUS interface definition table based on the operation flow standard and the operation action standard;
splicing the first atomic operation with the corresponding DBUS command based on the connection name;
and based on an RPA design platform, carrying out standardized encapsulation on the first atomic operation spliced with the DBUS command to form a DBUS command control corresponding to the first atomic operation.
6. The method of claim 4, wherein the developing an operation class control for the repetitive class operation in each server operation based on the operation flow criteria and the operation action criteria comprises:
Based on the operation and maintenance flow standard and the operation and maintenance action standard, converting repeated class operation in operation and maintenance work of each server into second atomic operation;
and based on an RPA design platform, carrying out standardized encapsulation on the second atomic operation to form an operation and maintenance operation class control corresponding to the second atomic operation.
7. The method of claim 2, wherein the constructing an operation flow rule base based on the operation flow criteria and the operation control base comprises:
based on the operation and maintenance flow standard, exploring controls required by operation and maintenance work of each server from the operation and maintenance operation control library;
arranging the control according to the operation and maintenance flow standard to form an RPA business flow;
and constructing an operation and maintenance flow rule base based on the RPA business flow.
8. The method of claim 1, wherein executing the operation and maintenance flow package based on the execution policy comprises:
operating the operation and maintenance flow package based on a scheduling strategy;
if no abnormality occurs in the whole operation process, realizing the automatic operation and maintenance of the server;
if abnormality occurs in the running process, the running process is adjusted according to the failure strategy, and the automatic running and maintenance of the server are realized.
9. A server automated operation and maintenance device, comprising:
the first determining module is used for determining an operation and maintenance flow packet of the server based on the operation and maintenance flow rule base; the operation and maintenance flow rule base stores an operation and maintenance flow corresponding to the server and a command control which needs to be called when the operation and maintenance flow is executed;
the second determining module is used for determining an execution strategy of the operation and maintenance flow packet;
and the execution module is used for executing the operation and maintenance flow packet based on the execution strategy.
10. A server, comprising:
a memory for storing a computer program;
a processor for implementing the steps of the server automation operation and maintenance method according to any one of claims 1 to 8 when executing the computer program.
CN202311824727.7A 2023-12-27 2023-12-27 Automatic operation and maintenance method for server and related products Pending CN117873673A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311824727.7A CN117873673A (en) 2023-12-27 2023-12-27 Automatic operation and maintenance method for server and related products

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311824727.7A CN117873673A (en) 2023-12-27 2023-12-27 Automatic operation and maintenance method for server and related products

Publications (1)

Publication Number Publication Date
CN117873673A true CN117873673A (en) 2024-04-12

Family

ID=90576464

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311824727.7A Pending CN117873673A (en) 2023-12-27 2023-12-27 Automatic operation and maintenance method for server and related products

Country Status (1)

Country Link
CN (1) CN117873673A (en)

Similar Documents

Publication Publication Date Title
CN110716748B (en) Service processing method, device, computer readable medium and electronic equipment
US5528503A (en) Integrated automation development system and method
JP2022008781A (en) Decentralized training method, system, device, storage medium and program
CN111625316A (en) Environment deployment method and device, electronic equipment and storage medium
WO2021135584A1 (en) Front-end project framework construction method and apparatus, computer device, and storage medium
EP3361331B1 (en) Method and apparatus for controlling and managing an industry process
CN111176801B (en) Multi-process management method, device, equipment and storage medium
CN112799786A (en) Exit method, device, equipment and storage medium of micro-service instance
CN110162344B (en) Isolation current limiting method and device, computer equipment and readable storage medium
CN102968303A (en) Program design system and program design method
US11005974B2 (en) Method and apparatus for accessing data or functions of a programmable controller via a network protocol
CN113094125B (en) Business process processing method, device, server and storage medium
CN117873673A (en) Automatic operation and maintenance method for server and related products
CN116974689A (en) Cluster container scheduling method, device, equipment and computer readable storage medium
US20220276901A1 (en) Batch processing management
CN112231231B (en) Cloud service debugging method, system and device
CN114610413A (en) Method, device, equipment and storage medium for executing synchronous and asynchronous tasks based on Java
CN114493493A (en) Decision engine and decision engine implementation method
CN113485692A (en) Big data componentization development method and device based on component store
Farhat et al. Safe Dynamic Reconfiguration of Concurrent Component-based Applications
CN117391202B (en) Method and system for obtaining analysis decision result based on central control knowledge graph
US20240114051A1 (en) Protecting a device against a cyber incident
US11474845B2 (en) System and method for versioned script management
US20230188627A1 (en) Differentiating controllers and reconcilers for software operators in a distributed computing environment
US11233747B2 (en) Systems and methods for acquiring server resources at schedule time

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination