CN113657702A - Automatic operation and maintenance method and device for internet data center and readable storage medium - Google Patents

Info

Publication number
CN113657702A
Authority
CN
China
Prior art keywords
information
server
online
configuration information
configuration
Prior art date
Legal status
Pending
Application number
CN202110748707.0A
Other languages
Chinese (zh)
Inventor
张莹光
刘占一
Current Assignee
Sina Technology China Co Ltd
Original Assignee
Sina Technology China Co Ltd
Priority date
Filing date
Publication date
Application filed by Sina Technology China Co Ltd
Priority to CN202110748707.0A
Publication of CN113657702A

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00: Administration; Management
    • G06Q10/06: Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063: Operations research, analysis or management
    • G06Q10/0631: Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06311: Scheduling, planning or task assignment for a person or group
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00: Administration; Management
    • G06Q10/20: Administration of product repair or maintenance

Landscapes

  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Game Theory and Decision Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The embodiment of the invention provides an automatic operation and maintenance method, an automatic operation and maintenance device and a readable storage medium for an internet data center.

Description

Automatic operation and maintenance method and device for internet data center and readable storage medium
Technical Field
The invention relates to the field of internet data center installation automation, and in particular to an automatic operation and maintenance method and device for an internet data center and a readable storage medium.
Background
In the prior art, when a new hardware server is added to an internet data center, an operation and maintenance engineer manually receives a resource-allocation work order, manually runs a script to push the installation task according to the online requirement, and forwards the work order to a field engineer. The field engineer racks the server as the work order requires, connects the network and starts the installation, manually records the states and problems that arise during installation, and feeds them back to the operation and maintenance engineer. After the server goes online successfully, the operation and maintenance engineer sends a delivery work order by mail. This process requires manual coordination among several roles and a large number of manual operations at every step: the operation and maintenance engineer must manually analyze the work order, extract the useful information and then manually run a script to push the installation task, which is error-prone and is time-consuming and laborious for large orders; the installation process is complex, tedious and error-prone, and when an abnormality occurs the field engineer must connect an external mouse, keyboard and display to the server to locate the problem, which takes about 30 minutes per device; after installation, the operation and maintenance engineer must manually run a script to perform a pressure test on the machine, a process that is hard to control and limited in function; after the server is online, the operation and maintenance engineer must manually update the configuration management database; and there is no audit function, so accidents cannot be traced.
In the process of implementing the invention, the applicant found that the prior art has at least the following problems:
Manual coordination and communication among multiple roles is prone to misunderstanding, which lowers accuracy and efficiency, and the large number of manual operations is prone to operator error during execution; the end result is low production efficiency and a high accident rate.
Disclosure of Invention
The embodiments of the invention provide an automatic operation and maintenance method and device for an internet data center and a readable storage medium, which address the problems of low production efficiency and high accident rate by automating installation in the internet data center.
To achieve the above object, in one aspect, an embodiment of the present invention provides an automatic operation and maintenance method for an internet data center, including:
receiving a task work order from a message queue, and analyzing according to the task work order to obtain a work order type and configuration information;
after the work order type is confirmed to be an on-line type, verifying the legality of each configuration item in the configuration information according to the on-line type;
after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal, executing, for the server specified in the configuration information, the online operation of the reinstallation system according to the configuration information;
receiving and saving online state data during the execution of the online operation of the reinstallation system; the online state data is obtained by a pre-installed state collection program during the online operation of the reinstallation system;
and after the online operation of the reinstallation system is completed, starting a pressure test for the specified server.
Further, the receiving the task work order from the message queue and analyzing the task work order to obtain the work order type and the configuration information includes:
receiving the task work order from the message queue, and acquiring the work order type from the task work order;
and inquiring a related work order of the task work order, and obtaining the configuration information according to the task work order and the related work order.
Further, when the online type is an online type of a reinstallation system, the configuration items in the configuration information include one or any combination of the following items: server delivery information, operating system information, server hardware information, network information and user information;
the verifying the validity of each configuration item in the configuration information according to the online type includes:
if it is verified, according to the server delivery information, that the server supports the online operation of the reinstallation system, the server delivery information is legal; otherwise, the server delivery information is illegal;
if it is verified that the operating system library supports the operating system corresponding to the operating system information, the operating system information is legal; otherwise, the operating system information is illegal; the operating system library is used for recording the operating systems that can be used for the online operation of the reinstallation system;
if it is verified that the server supports the server hardware information, the server hardware information is legal; otherwise, the server hardware information is illegal;
if it is verified that the user information matches a record in the user information base, the user information is legal; otherwise, the user information is illegal; the user information is used for remotely logging in to the server; the user information base is used for recording user login information;
and if it is verified that the network information corresponding to the server is complete and valid, the network information is legal; otherwise, the network information is illegal.
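The per-item checks above can be sketched as a table of rules, one per configuration item. This is only an illustrative sketch: the item names, the contents of the operating system library and user information base, and the required network fields are all hypothetical, and the server delivery and hardware checks are omitted because the text does not say what data they rely on.

```python
from typing import Callable, Dict, List

# Hypothetical stand-ins for the operating system library and user information base.
SUPPORTED_OS = {"centos-7.9", "ubuntu-20.04"}
USER_BASE = {"ops_admin"}
# Assumed definition of "complete" network information.
REQUIRED_NETWORK_FIELDS = ("ip", "netmask", "gateway")


def network_is_complete(net: dict) -> bool:
    """Network information is legal only if every required field is non-empty."""
    return all(net.get(key) for key in REQUIRED_NETWORK_FIELDS)


# One rule per configuration item; each rule returns True when the item is legal.
CHECKS: Dict[str, Callable[[object], bool]] = {
    "operating_system": lambda value: value in SUPPORTED_OS,
    "user": lambda value: value in USER_BASE,
    "network": network_is_complete,
}


def illegal_items(config: dict) -> List[str]:
    """Return the sorted names of the configuration items that fail their rule."""
    return sorted(
        name for name, value in config.items()
        if name in CHECKS and not CHECKS[name](value)
    )


if __name__ == "__main__":
    config = {
        "operating_system": "windows-xp",
        "user": "ops_admin",
        "network": {"ip": "10.0.0.5", "netmask": "255.255.255.0"},
    }
    print(illegal_items(config))
```

Returning the list of failing items, rather than a single boolean, gives the later prompt step the information it needs to say which items were rejected.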
Further, after the verifying the validity of each configuration item in the configuration information according to the online type, the method further includes:
after confirming that at least one configuration item in the configuration information is illegal, generating verification result prompt information according to the illegal configuration item or items, and sending the verification result prompt information to a first receiving end specified in the configuration information through a first prompt mode specified in the configuration information.
Further, the online type of the reinstallation system includes: a new server online type, an old server online type, a reinstallation system online type or a reinstallation system migration online type;
correspondingly, the executing, for the server specified in the configuration information, the online operation of the reinstallation system according to the configuration information, after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal, comprises:
analyzing the configuration information to generate installation information;
controlling, through a first remote command, the specified server to restart and enter a pre-boot execution environment (PXE);
issuing the installation information to the specified server through the pre-boot execution environment, so that the specified server completes system installation according to the installation information;
synchronizing the configuration information to a designated configuration management database.
Further, the installation information includes: the state collection program;
the issuing of the installation information to the specified server through the pre-boot execution environment, so that the specified server completes system installation according to the installation information, comprises:
issuing the state collection program to the specified server through the pre-boot execution environment and installing it there, wherein the state collection program is used for collecting online state data of the specified server during the execution of the online operation of the reinstallation system;
the receiving and saving online state data during the execution of the online operation of the reinstallation system includes:
receiving the on-line state data uploaded by the state collection program, and writing the on-line state data into a first database;
if abnormal information exists in the online state data, automatically repairing the fault corresponding to the abnormal information according to the abnormal information;
and if the fault corresponding to the abnormal information cannot be automatically repaired, generating operation and maintenance prompt information according to the abnormal information, and sending the operation and maintenance prompt information to a second receiving end appointed in the configuration information through a second prompt mode appointed in the configuration information.
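The store, repair and escalate flow described above can be sketched as follows. This is a minimal sketch under stated assumptions: the record fields (`server`, `step`, `error`), the error names and the `repairers` mapping are hypothetical, a plain list stands in for the first database, and `notify` stands in for the second prompt mode.

```python
from typing import Callable, Dict, List


def handle_status(
    record: dict,
    store: List[dict],
    repairers: Dict[str, Callable[[dict], bool]],
    notify: Callable[[str], None],
) -> str:
    """Save one online state record, then repair or escalate any abnormality."""
    store.append(record)  # stands in for writing to the first database
    error = record.get("error")
    if error is None:
        return "ok"
    repair = repairers.get(error)
    # A repairer returns True when the fault was fixed automatically.
    if repair is not None and repair(record):
        return "repaired"
    # No repairer, or the repair failed: prompt the operation and maintenance staff.
    notify(f"server {record.get('server')} needs attention: {error}")
    return "escalated"


if __name__ == "__main__":
    store: List[dict] = []
    sent: List[str] = []
    repairers = {"dhcp_timeout": lambda rec: True}
    print(handle_status({"server": "srv-01", "step": "pxe_boot", "error": "dhcp_timeout"},
                        store, repairers, sent.append))
    print(handle_status({"server": "srv-02", "step": "partition", "error": "disk_missing"},
                        store, repairers, sent.append))
```

Every record is stored before any repair is attempted, so the saved data also covers faults that were repaired automatically.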
Further, the starting a pressure test for the specified server after the online operation of the reinstallation system is completed includes:
generating a pressure test task according to a preset pressure test rule;
issuing the pressure test task to the specified server through a third remote command, so that the specified server starts a pressure test according to the pressure test task;
and receiving a pressure test result from the specified server, and comparing the pressure test result with a preset standard result library to generate a pressure test report.
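The final comparison step can be sketched as below. The text does not specify the form of the standard result library, so this sketch assumes it maps each metric name to a minimum acceptable value (higher is better); the metric names and numbers are hypothetical.

```python
from typing import Dict, Optional


def build_pressure_report(results: Dict[str, float], standards: Dict[str, float]) -> dict:
    """Compare measured values with assumed minimum thresholds, metric by metric."""
    metrics = {}
    for name, minimum in standards.items():
        measured: Optional[float] = results.get(name)
        # A metric missing from the results counts as a failure.
        passed = measured is not None and measured >= minimum
        metrics[name] = {"measured": measured, "minimum": minimum, "passed": passed}
    overall = all(entry["passed"] for entry in metrics.values())
    return {"passed": overall, "metrics": metrics}


if __name__ == "__main__":
    standards = {"cpu_score": 900, "disk_mb_s": 200}
    results = {"cpu_score": 950, "disk_mb_s": 180}
    print(build_pressure_report(results, standards))
```

Iterating over the standards rather than the results means a server that silently skips a test cannot pass the report.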
Further, the online type further includes: a reserved system migration online type;
correspondingly, the configuration items in the configuration information include one or any combination of the following items: a server IP address and a server MAC address;
the verifying the validity of each configuration item in the configuration information according to the online type includes:
if a regular-expression check verifies that the server IP address conforms to the IP address format, the server IP address is legal; otherwise, the server IP address is illegal; and
if a regular-expression check verifies that the server MAC address conforms to the MAC address format, the server MAC address is legal; otherwise, the server MAC address is illegal;
the method further comprises the following steps:
and after the online type is confirmed to be a reserved system migration online type and all configuration items in the configuration information are legal, synchronizing the configuration information to a specified configuration management database.
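The two format checks can be sketched with regular expressions as follows. The text does not give the actual patterns, so these are assumptions: the IP pattern accepts dotted-decimal IPv4 only, and the MAC pattern accepts six colon-separated hexadecimal pairs.

```python
import re

# One IPv4 octet: 0-255 without leading zeros.
_OCTET = r"(?:25[0-5]|2[0-4][0-9]|1[0-9][0-9]|[1-9]?[0-9])"
IPV4_RE = re.compile(rf"{_OCTET}(?:\.{_OCTET}){{3}}")
# Six hexadecimal pairs separated by colons (assumed notation).
MAC_RE = re.compile(r"[0-9A-Fa-f]{2}(?::[0-9A-Fa-f]{2}){5}")


def is_legal_ip(address: str) -> bool:
    """True if the whole string is a dotted-decimal IPv4 address."""
    return IPV4_RE.fullmatch(address) is not None


def is_legal_mac(address: str) -> bool:
    """True if the whole string is a colon-separated MAC address."""
    return MAC_RE.fullmatch(address) is not None


if __name__ == "__main__":
    print(is_legal_ip("10.0.0.21"), is_legal_ip("256.1.1.1"))
    print(is_legal_mac("00:1A:2B:3C:4D:5E"), is_legal_mac("00:1A:2B:3C:4D"))
```

`fullmatch` is used instead of `match` so that trailing characters, including a trailing newline, cause the check to fail.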
On the other hand, an embodiment of the present invention further provides an automatic operation and maintenance device for an internet data center, including:
the task work order obtaining unit is used for receiving the task work order from the message queue and analyzing the task work order to obtain the work order type and the configuration information;
the configuration information validity checking unit is used for checking the validity of each configuration item in the configuration information according to the on-line type after the work order type is determined to be the on-line type;
the first execution unit is used for executing online operation of the reinstallation system according to the configuration information aiming at the server specified in the configuration information after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal;
the state collection unit is used for receiving and saving on-line state data during the execution of the on-line operation of the reinstallation system; the online state data is obtained by a pre-installed state collection program in the online operation process of the reinstallation system;
and the pressure testing unit is used for starting the pressure test aiming at the specified server after the online operation of the reinstallation system is finished.
Further, the task work order obtaining unit includes:
the work order type acquisition module is used for receiving the task work orders from the message queue and acquiring the work order types from the task work orders;
and the configuration information acquisition module is used for inquiring the associated work order of the task work order and acquiring the configuration information according to the task work order and the associated work order.
Further, when the online type is an online type of a reinstallation system, the configuration items in the configuration information include one or any combination of the following items: server delivery information, operating system information, server hardware information, network information and user information;
the configuration information validity checking unit comprises:
a factory information checking module, configured to check, according to the server delivery information (that is, the factory information of the server), whether the server supports the online operation of the reinstallation system; if so, the server delivery information is legal; otherwise, the server delivery information is illegal;
the operating system checking module is used for checking whether the operating system library supports the operating system corresponding to the operating system information; if so, the operating system information is legal; otherwise, the operating system information is illegal; the operating system library is used for recording the operating systems that can be used for the online operation of the reinstallation system;
the hardware information checking module is used for checking whether the server supports the server hardware information; if so, the server hardware information is legal; otherwise, the server hardware information is illegal;
the user information checking module is used for checking whether the user information matches a record in the user information base; if so, the user information is legal; otherwise, the user information is illegal; the user information is used for remotely logging in to the server; the user information base is used for recording user login information;
and the network information checking module is used for checking whether the network information corresponding to the server is complete and valid; if so, the network information is legal; otherwise, the network information is illegal.
Further, the device further includes, after the configuration information validity checking unit:
and the configuration information prompting unit is used for generating, after confirming that at least one configuration item in the configuration information is illegal, verification result prompt information according to the illegal configuration item or items, and sending the verification result prompt information to a first receiving end specified in the configuration information through a first prompt mode specified in the configuration information.
Further, the online type of the reinstallation system includes: a new server online type, an old server online type, a reinstallation system online type or a reinstallation system migration online type;
accordingly, the first execution unit includes:
the installation information acquisition module is used for analyzing the configuration information to generate installation information;
the pre-starting module is used for controlling, through a first remote command, the specified server to restart and enter a pre-boot execution environment (PXE);
the installation information issuing module is used for issuing the installation information to the specified server through the pre-boot execution environment, so that the specified server completes system installation according to the installation information;
a first configuration update module to synchronize the configuration information to a specified configuration management database.
Further, the installation information includes: the state collection program;
the installation information issuing module is further configured to issue and install the state collection program to the specified server through the pre-boot execution environment, and the state collection program is configured to collect online state data of the specified server in an online operation execution process of the reinstallation system.
The state collection unit includes:
the state collection module is used for receiving the online state data uploaded by the state collection program and writing the online state data into a first database;
the fault processing module is used for automatically repairing the fault corresponding to the abnormal information according to the abnormal information if the abnormal information exists in the online state data;
and the fault prompt module is used for generating operation and maintenance prompt information according to the abnormal information if the fault corresponding to the abnormal information cannot be automatically repaired, and sending the operation and maintenance prompt information to a second receiving end appointed in the configuration information through a second prompt mode appointed in the configuration information.
Further, the pressure testing unit includes:
the pressure test task generation module is used for generating a pressure test task according to a preset pressure test rule;
the pressure test starting module is used for issuing the pressure test task to the specified server through a third remote command, so that the specified server starts a pressure test according to the pressure test task;
and the pressure test result receiving module is used for receiving the pressure test result from the specified server and comparing the pressure test result with a preset standard result library to generate a pressure test report.
Further, the online type further includes: a reserved system migration online type;
correspondingly, the configuration items in the configuration information include one or any combination of the following items: a server IP address and a server MAC address;
the configuration information validity checking unit comprises:
the IP address format checking unit is used for checking, through a regular-expression check, whether the server IP address conforms to the IP address format; if so, the server IP address is legal; otherwise, the server IP address is illegal; and
the MAC address format checking unit is used for checking, through a regular-expression check, whether the server MAC address conforms to the MAC address format; if so, the server MAC address is legal; otherwise, the server MAC address is illegal;
the device further comprises:
and the second execution unit is used for synchronizing the configuration information to a specified configuration management database after the online type is determined to be the reserved system migration online type and all configuration items in the configuration information are legal.
In another aspect, an embodiment of the present invention further provides a readable storage medium, on which a computer program is stored, where the computer program is executed by a processor to implement any one of the methods described above.
The technical solution has the following beneficial effects: by automatically monitoring the message queue, the work order is obtained, the configuration information is extracted and the legality of the configuration information is verified automatically, so neither the work order nor the configuration information needs to be extracted manually, and the operation and maintenance engineer only needs to deal afterwards with machines that had problems; a pressure test task can be generated automatically according to the pressure test rule preset by the operation and maintenance engineer and issued automatically to the server, so that the pressure test is completed and its result fed back automatically. An operation and maintenance process that originally required extensive manual participation is thus automated, which improves production efficiency and reduces the accident rate of manual operation.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a flowchart of an automatic operation and maintenance method for an internet data center according to an embodiment of the present invention;
Fig. 2 is a schematic structural diagram of an automatic operation and maintenance device of an internet data center according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In one aspect, as shown in Fig. 1, an embodiment of the present invention provides an automatic operation and maintenance method for an internet data center. The method is applied to an automatic operation and maintenance device for an internet data center, which may be any terminal or server with a computing function and which is used to maintain the servers in the internet data center. The method includes:
S100, receiving a task work order from a message queue, and analyzing according to the task work order to obtain a work order type and configuration information;
S101, after the work order type is confirmed to be an online type, verifying the legality of each configuration item in the configuration information according to the online type;
S102, after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal, executing, for the server specified in the configuration information, the online operation of the reinstallation system according to the configuration information;
S103, receiving and saving online state data during the execution of the online operation of the reinstallation system; the online state data is obtained by a pre-installed state collection program during the online operation of the reinstallation system;
and S104, after the online operation of the reinstallation system is completed, starting a pressure test for the specified server.
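Steps S100 to S104 can be sketched end to end as below. This is an illustrative sketch rather than the device's actual implementation: the JSON message layout, the type strings `"online"` and `"reinstall"`, and the injected `validate`, `reinstall` and `pressure_test` callables are all hypothetical, and an in-process `queue.Queue` stands in for the message queue.

```python
import json
import queue
from dataclasses import dataclass, field
from typing import Callable, Iterable, List


@dataclass
class WorkOrder:
    order_type: str                      # work order type, e.g. "online"
    online_type: str                     # online type, e.g. "reinstall"
    config: dict = field(default_factory=dict)


def receive_work_order(mq: "queue.Queue[str]") -> WorkOrder:
    """S100: take one message off the queue and parse type and configuration."""
    msg = json.loads(mq.get())
    return WorkOrder(msg["type"], msg.get("online_type", ""), msg.get("config", {}))


def process_work_order(
    order: WorkOrder,
    validate: Callable[[str, dict], List[str]],
    reinstall: Callable[[dict], Iterable[str]],
    pressure_test: Callable[[str], str],
    log: list,
) -> str:
    """S101-S104 for one parsed work order; returns the final state name."""
    if order.order_type != "online":
        return "ignored"
    illegal = validate(order.online_type, order.config)            # S101
    if illegal:
        log.append(("illegal_config", illegal))
        return "rejected"
    if order.online_type != "reinstall":
        return "unsupported"
    for status in reinstall(order.config):                         # S102
        log.append(("status", status))                             # S103
    log.append(("pressure_test", pressure_test(order.config["server"])))  # S104
    return "done"


if __name__ == "__main__":
    mq: "queue.Queue[str]" = queue.Queue()
    mq.put(json.dumps({"type": "online", "online_type": "reinstall",
                       "config": {"server": "srv-01"}}))
    log: list = []
    print(process_work_order(receive_work_order(mq),
                             validate=lambda t, c: [],
                             reinstall=lambda c: ["pxe_boot", "os_installed"],
                             pressure_test=lambda s: "pass",
                             log=log))
```

Passing the checks and operations in as callables keeps the control flow of S100 to S104 visible without committing to any particular installation or test tooling.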
In one embodiment, a task issuer, such as an operation and maintenance engineer, submits a task work order to a message queue through a front-end Web UI. The automatic operation and maintenance device of the internet data center monitors the message queue, receives the task work order from it, and parses the task work order to obtain the work order type and the configuration information. If the work order type is found to be an online type, the legality (that is, validity) of each configuration item in the configuration information is checked according to the online type. A prompt can be issued for an illegal configuration item, for example as a pop-up in the front-end Web UI, or by generating configuration-invalid prompt information from the illegal configuration item and sending it to a specified mailbox by mail. If all the configuration items in the configuration information are legal, the online operation is performed, according to the configuration information, on the server specified by the configuration information. The specified server may be a single server, in which case the online operation is performed on that server, or a batch of servers, in which case the online operation is performed on the whole batch. When the online type is determined to be the online type of the reinstallation system, the specific online operation is the online operation of the reinstallation system, executed according to the configuration information for the server specified in the configuration information.
To monitor unexpected problems during the online operation of the reinstallation system, a state collection program pre-installed on the server undergoing the operation collects that server's online state data while the operation runs and sends the data back to the automatic operation and maintenance device of the internet data center, which stores the data in a specified database on receipt. After the online operation of the reinstallation system completes successfully, the automatic operation and maintenance device of the internet data center remotely starts a pressure test (stress test) for the server on which the operation was performed.
In this embodiment, the automatic operation and maintenance device of the internet data center provides a user-friendly Web UI. The operation and maintenance engineer does not need to extract work order information manually: the device monitors the message queue, automatically analyzes the content of the work order and pushes the installation task, and the engineer only needs to deal afterwards with machines that had problems. When analyzing the content of a resource-allocation work order, the device automatically judges the validity of the data. The operation and maintenance engineer can preset different standards and durations for the pressure test of the server, and the device collects the results, compares them with a standard library and generates a pressure test report. After the server goes online successfully, the device automatically synchronizes the change information to the CMDB (Configuration Management Database). A behavior recording function is integrated, so that the cause of an accident can be traced quickly. An HTTP REST-based API for the system functions is provided between the front-end Web UI and the device, so that secondary development can be done in any programming language. Through reasonable rule formulation, the functional sections are kept separate, which facilitates security control.
The embodiment of the invention has the following technical effects: by automatically monitoring the message queue, the work order is obtained, the configuration information is extracted, and the validity of the configuration information is verified without manual work, so an operation and maintenance engineer only needs to handle problematic machines afterwards; a pressure test task can be generated automatically according to the pressure test rule preset by the operation and maintenance engineer and issued automatically to the server, so that the pressure test is completed and its result is fed back automatically. An operation and maintenance process that originally required extensive manual participation is automated, which improves production efficiency and reduces the rate of accidents caused by manual operation.
Further, the receiving the task work order from the message queue and analyzing the task work order to obtain the work order type and the configuration information includes:
receiving the task work order from the message queue, and acquiring the work order type from the task work order;
and inquiring a related work order of the task work order, and obtaining the configuration information according to the task work order and the related work order.
In one embodiment, after the task work order is received, the work order type and work order number are obtained, and the associated work orders of the task work order are queried by work order number. For example, when the task work order is of a specified online type whose purpose is to bring one or more specified servers online, the associated work orders may include, but are not limited to, the purchase work order of the server, from which the hardware configuration of the purchased server can be queried, and the deployment work order of the server, from which information such as the machine room and cabinet in which the server is deployed and the IP address of the management card corresponding to the server can be queried. After the associated work orders are obtained, the configuration information necessary to complete the specified online type is extracted from the task work order and the associated work orders.
The embodiment of the invention has the following technical effects: the task work order is managed through the message queue, the associated work orders are queried according to the task work order, and the work order type and configuration information are obtained automatically. An operation and maintenance engineer does not need to extract the work order information manually, which saves labor, avoids the errors that manual operation often introduces, improves production efficiency, and reduces the error rate.
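The work order handling described above can be sketched as follows. This is a minimal illustration only: the JSON message layout, the `WORK_ORDERS` and `ASSOCIATIONS` stores, and all field names are assumptions made for the example and are not specified in the source.

```python
import json
from typing import Dict, List, Tuple

# Hypothetical stores: work orders keyed by work order number, and the
# associated work orders of each task work order.
WORK_ORDERS: Dict[str, dict] = {
    "WO-1": {"type": "purchase",
             "config": {"cpu": "2 x 16 cores", "raid_level": "RAID10"}},
    "WO-2": {"type": "deployment",
             "config": {"room": "R1", "cabinet": "C07", "mgmt_ip": "10.0.0.21"}},
}
ASSOCIATIONS: Dict[str, List[str]] = {"WO-3": ["WO-1", "WO-2"]}


def parse_task_work_order(message: str) -> Tuple[str, dict]:
    """Return (work order type, merged configuration information)."""
    order = json.loads(message)
    config: dict = {}
    # Look up associated work orders by the task work order number.
    for number in ASSOCIATIONS.get(order["number"], []):
        config.update(WORK_ORDERS[number]["config"])
    # Fields in the task work order itself take precedence (an assumption).
    config.update(order.get("config", {}))
    return order["type"], config


# A message as it might arrive from the message queue (hypothetical layout).
message = json.dumps({
    "number": "WO-3",
    "type": "online_reinstall",
    "config": {"os": "CentOS 7.9", "ip": "10.0.1.15"},
})
order_type, config = parse_task_work_order(message)
```

Here the purchase work order contributes the hardware configuration and the deployment work order contributes the location and management card address, mirroring the example in the text.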
Further, when the online type is an online type of a reinstallation system, the configuration items in the configuration information include one or any combination of the following items: server factory information, operating system information, server hardware information, network information and user information;
In one embodiment, before the online operation of the reinstallation system is performed on a server in the internet data center, the specified server needs to be prepared. The preparation includes preparing the hardware configuration of the specified server, deploying the server in a specified cabinet in a specified machine room, deploying the management card corresponding to the server, connecting the power lines and network cables, allocating an IP address, adding the information of the server to the configuration management database, and the like. From the configuration information extracted from the task work order, the factory information of the server, the operating system information of the operating system to be installed, the server hardware information supported by the server, the network information of the server, and the user information are obtained, where the user information is used for logging in to the server. The configuration information is then verified: if any configuration item is illegal, the configuration information is considered illegal, and only when all configuration items are verified to be legal is the configuration information considered legal. If illegal configuration items exist, the user is prompted through the front-end Web UI, or a prompt is sent by other means of communication such as mail to a specified receiving address for handling by specified personnel. When all configuration items of the configuration information are legal, the online operation continues.
The verifying the validity of each configuration item in the configuration information according to the online type includes:
if it is verified from the server factory information that the server supports the online operation of the reinstallation system, the server factory information is legal; otherwise, the server factory information is illegal;
In some embodiments, the factory information of the server may include the factory date of the server. Servers manufactured in different periods may have different software and hardware configurations; for example, a server manufactured early on may not support the remote online operation of the reinstallation system. Support can therefore be judged from the factory date: if the factory date of the server is after a specified factory date, the server supports the online operation of the reinstallation system and its factory information is legal; otherwise, the server does not support the online operation of the reinstallation system and its factory information is illegal.
if it is verified that the operating system library supports the operating system corresponding to the operating system information, the operating system information is legal; otherwise, the operating system information is illegal; the operating system library is used for recording the operating systems which can be used for the online operation of the reinstallation system;
in some embodiments, the information of the operating system supporting remote installation may be centrally stored in an operating system library (i.e., an OS system library), and after the operating system information in the configuration information is acquired, the operating system library is queried, and if the operating system information in the configuration information can be queried in the operating system library, the operating system information in the configuration information is considered to be legal, otherwise, the operating system information is considered to be illegal.
If the server supports the server hardware information, the server hardware information is legal, otherwise, the server hardware information is illegal;
In some embodiments, the server hardware information may include one or any combination of the following: RAID level, whether hyper-threading is supported, whether virtualization is supported, and whether BIOS setting is supported. Specifically, the configuration management database (CMDB) may be queried online through a JavaScript script to determine whether the server specified in the current configuration information supports the requested RAID level; if it does, the configuration item is legal, and otherwise the configuration item is illegal. The factory information table of the server specified by the configuration information can likewise be obtained by querying the configuration management database, from which it can be judged whether the server supports hyper-threading, virtualization, BIOS setting, and the like. The configuration management database stores, among other things, the resources occupied by the various devices in the internet data center, including servers, their configurations, and the dependencies among them.
If the user information is verified to be matched with the record in the user information base, the user information is legal, otherwise, the user information is illegal; the user information is used for remotely logging in the server; the user information base is used for recording user login information;
in some embodiments, the user information may include Shell login user information of the server; verifying whether the user information in the configuration information is a legal user by calling an interface provided by an Enterprise Resource Planning (ERP) system; the ERP records the information of each legal user including the information of the Shell login user;
and if the network information corresponding to the server is verified to be complete and valid, the network information is legal, otherwise, the network information is illegal.
In some embodiments, the network information includes, but is not limited to, one or any combination of the following: an IP address and a MAC address. The validity of the IP address is judged automatically by means such as regular expression matching, and gateway information is generated when the IP address is legal. Whether a default MAC address has been recorded for the corresponding server is determined by querying the configuration management database; if not, an operation and maintenance engineer can be reminded, by mail or another means of communication, to enter the default MAC address into the configuration management database manually. The IP address, MAC address, and gateway information are used to access the server specified in the configuration information through the network interface.
The embodiment of the invention has the following technical effects: by checking whether the configuration information is legal or not, an operation and maintenance engineer does not need to manually check the content of the configuration information, and the checking efficiency and the accuracy are improved.
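The rule that the configuration information is legal only when every configuration item is legal can be sketched as a table of independent checks. The threshold date, the contents of `OS_LIBRARY` and `SUPPORTED_RAID`, and the field names are hypothetical; in the described system they would come from the operating system library and the configuration management database.

```python
from datetime import date

# Hypothetical reference data for the example.
MIN_FACTORY_DATE = date(2015, 1, 1)
OS_LIBRARY = {"CentOS 7.9", "Ubuntu 20.04"}
SUPPORTED_RAID = {"srv-01": {"RAID1", "RAID10"}}


def check_factory_date(cfg):
    # Legal only if the server left the factory after the specified date.
    return cfg["factory_date"] > MIN_FACTORY_DATE


def check_operating_system(cfg):
    # Legal only if the requested OS is recorded in the operating system library.
    return cfg["os"] in OS_LIBRARY


def check_raid_level(cfg):
    # Legal only if the server supports the requested RAID level.
    return cfg["raid_level"] in SUPPORTED_RAID.get(cfg["server"], set())


CHECKS = {
    "server factory information": check_factory_date,
    "operating system information": check_operating_system,
    "server hardware information": check_raid_level,
}


def illegal_items(cfg):
    """Return the names of illegal configuration items (empty if all are legal)."""
    return [name for name, check in CHECKS.items() if not check(cfg)]


good = {"server": "srv-01", "factory_date": date(2019, 6, 1),
        "os": "CentOS 7.9", "raid_level": "RAID10"}
bad = dict(good, os="Windows 98", factory_date=date(2010, 3, 1))
```

Returning the list of failing items, rather than a single boolean, keeps enough detail to build the prompt information that is sent when verification fails.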
Further, after the verifying the validity of each configuration item in the configuration information according to the online type, the method further includes:
after confirming that at least one configuration item in the configuration information is illegal, generating verification result prompt information according to the configuration item of the illegal configuration information, and sending the verification result prompt information to a first receiving end appointed in the configuration information through a first prompt mode appointed in the configuration information.
In one embodiment, if the configuration information is illegal, verification result prompt information is generated according to the illegal configuration items and sent through the first prompt mode specified in the configuration information, such as mail, WeChat, or short message, to the specified first receiving end, such as a specified mail address, WeChat ID, or mobile phone number, where it is handled by the corresponding designated person, such as an operation and maintenance engineer or the task issuer. The verification result prompt information can also be fed back to the front-end Web UI so that the task issuer can correct the configuration in time. Specifically, communication with the front-end Web UI can be based on an HTTP REST system function API, which allows secondary development in any programming language.
The embodiment of the invention has the following technical effects: after the task work order is received and the configuration information is automatically extracted, the validity of the configuration information can be automatically checked, feedback is given to illegal configuration information in time, the time for correcting errors is shortened, and the efficiency is improved.
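A minimal sketch of generating the verification result prompt information and routing it by the first prompt mode is given below. The `senders` mapping, the configuration field names, and the message wording are assumptions for illustration; a real deployment would plug in actual mail, WeChat, or short message clients.

```python
def build_prompt(server, illegal_items):
    """Build verification result prompt information from the illegal items."""
    lines = ["Configuration check failed for " + server + ":"]
    lines.extend("- " + item + " is illegal" for item in illegal_items)
    return "\n".join(lines)


def send_prompt(cfg, illegal_items, senders):
    """Send the prompt via the first prompt mode to the first receiving end."""
    message = build_prompt(cfg["server"], illegal_items)
    send = senders[cfg["first_prompt_mode"]]  # e.g. "mail", "wechat", "sms"
    send(cfg["first_receiver"], message)
    return message


# Stand-in sender that records messages instead of sending them.
outbox = []
senders = {"mail": lambda to, body: outbox.append(("mail", to, body))}
cfg = {"server": "srv-01",
       "first_prompt_mode": "mail",
       "first_receiver": "ops@example.com"}
send_prompt(cfg, ["operating system information"], senders)
```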
Further, the online type of the reinstallation system includes: the online type of the new server, the online type of the old server, the online reinstallation system type or the reinstallation system migration online type;
In one embodiment, the online type of the reinstallation system may be one or more of the new server online type, the old server online type, the online reinstallation system type, or the reinstallation system migration online type. The new server online type is an online operation for a newly purchased server; the old server online type is an online operation for a server that has been used before; the online reinstallation system type is an online operation in which the system is reinstalled, as required, on a server that is currently in use; the reinstallation system migration online type is an online operation in which the service content is migrated from an old server to a newly deployed server.
Correspondingly, after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal, the server specified in the configuration information executes the online operation of the reinstallation system according to the configuration information, and the method comprises the following steps:
analyzing the configuration information to generate installation information;
controlling the appointed server to execute restarting through a first remote command, and entering a pre-starting execution environment;
the installation information is issued to the appointed server through the pre-starting execution environment, so that the appointed server completes system installation according to the installation information;
synchronizing the configuration information to a designated configuration management database.
In one embodiment, after the configuration information is verified to be legal, installation information is generated according to the configuration information and issued to the server corresponding to the configuration information. Restarting the server through the first remote command may be implemented as follows: the IP address of the management card corresponding to the server specified in the configuration information is obtained from the configuration information, and IPMI command communication with the management card is established through that IP address. The management card controls the power-on, shutdown, and restart of the server; it is independent of the server's operating system and is not limited by the server's current running state. A remotely issued IPMI command sets the default boot mode of the server to network boot through the management card and restarts the server, which enters the pre-starting execution environment (PXE, Preboot Execution Environment) after starting.
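The source only states that IPMI commands are sent to the management card. As one possible realization, assuming the widely used `ipmitool` command-line client is available, the two commands could be assembled as below; the helper name and the credentials are hypothetical, and the sketch only builds the argument lists without executing them.

```python
def ipmi_pxe_reboot_commands(mgmt_ip, user, password):
    """Build ipmitool argument lists that select PXE boot and reset the server."""
    base = ["ipmitool", "-I", "lanplus", "-H", mgmt_ip, "-U", user, "-P", password]
    return [
        base + ["chassis", "bootdev", "pxe"],    # boot from the network next
        base + ["chassis", "power", "reset"],    # restart via the management card
    ]


commands = ipmi_pxe_reboot_commands("10.0.0.21", "admin", "secret")
```

Each list could then be passed to `subprocess.run(cmd, check=True)`. Note that `chassis bootdev pxe` affects the next boot; whether the network boot setting should persist across later reboots is a deployment choice the source does not specify.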
The installation information is then issued to the server and the system installation is completed through the following steps: the server broadcasts a DHCP (Dynamic Host Configuration Protocol) request in the PXE environment; after receiving the broadcast, the automatic operation and maintenance device of the internet data center allocates an IP address to the server, which the server uses during the online operation of the reinstallation system; the device sends part of the installation information, such as, but not limited to, a kernel boot file and a kernel image, to the server through TFTP, and the server loads and starts the kernel; the device then sends the remaining installation information, such as, but not limited to, an image of the operating system to be installed, to the server by communicating with the kernel; after receiving the remaining installation information, the server starts the system installation; after the system is installed, the device synchronizes the content of the configuration information related to the server to the specified configuration management database. The embodiment of the invention issues the installation information automatically according to the configuration information and completes the installation, thereby improving installation efficiency and reducing the error rate.
Further, the installation information includes: the state collection program;
the issuing of the installation information to the designated server through the pre-starting execution environment so as to enable the designated server to complete system installation according to the installation information comprises the following steps:
issuing and installing the state collection program to the specified server through the pre-starting execution environment, wherein the state collection program is used for collecting online state data of the specified server in the online operation execution process of the reinstallation system;
the receiving and saving online state data during the execution of the online operation of the reinstallation system includes:
receiving the on-line state data uploaded by the state collection program, and writing the on-line state data into a first database;
if abnormal information exists in the online state data, automatically repairing the fault corresponding to the abnormal information according to the abnormal information;
and if the fault corresponding to the abnormal information cannot be automatically repaired, generating operation and maintenance prompt information according to the abnormal information, and sending the operation and maintenance prompt information to a second receiving end appointed in the configuration information through a second prompt mode appointed in the configuration information.
In some embodiments, a second prompt mode and a second receiving end are preset in the configuration information. The second prompt mode may be communication with the front-end interface or a means of communication such as mail, WeChat, or short message; the second receiving end is the receiving end corresponding to the second prompt mode, such as a task display interface of the front end, a mailbox address, a WeChat ID, or a mobile phone number; and the person responsible for receiving and handling information at the second receiving end may be an operation and maintenance engineer, a field engineer, a task initiator, or the like. The original operating system on the server may be the same as or different from the operating system reinstalled in the current online operation. A state collection program is pre-installed on the server and collects, in real time, the online state data during the online operation of the reinstallation system, and the online state data is written into a first database. When abnormal information is found in the online state data, an attempt is made to automatically repair the corresponding fault according to the abnormal information. For example, when a step in the online operation of the reinstallation system times out and cannot proceed, the abnormal step is identified from the abnormal information, the online operation is stopped and restarted once, and it then continues from the step at which the abnormality occurred in the previous attempt.
For a fault whose repair has been attempted more than the specified number of times, or that cannot be automatically repaired according to the abnormal information, the online state data in the first database is modified to record the fault condition, and operation and maintenance prompt information is generated according to the abnormal information and sent through the second prompt mode, such as, but not limited to, mail, WeChat, or short message, to the corresponding second receiving end, reminding an operation and maintenance engineer, field engineer, task initiator, or the like to intervene. The online state data collected in real time during the online process can also be pushed by the back-end processing program to the front-end Web UI through WebSocket and displayed to the task initiator in real time.
The embodiment of the invention has the following technical effects: the pre-installed state collection program collects information throughout the online operation of the reinstallation system, so that the operation is monitored; abnormalities during the operation are found and repaired automatically and in time, and manual intervention is requested promptly when an abnormality cannot be repaired. This provides traceable information for the online operation of the reinstallation system and basic data for continuously improving the process and handling faults in time. The online progress is displayed in real time, so that the task initiator can follow it and reasonably schedule subsequent tasks.
Further, after the online operation of the reinstallation system is completed, starting a pressure test for the specified server, including:
generating a pressure measurement task according to a preset pressure measurement rule;
issuing the pressure testing task to the appointed server through a third remote command so that the appointed server starts a pressure test according to the pressure testing task;
and receiving a pressure measurement result from the specified server, and comparing the pressure measurement result with a preset standard result library to generate a pressure measurement report.
In one embodiment, after the system installation of the server is completed, a pressure test of the server may also be performed. The pressure test rule (i.e., the pressure measurement rule) is preset by an operation and maintenance engineer, and the automatic operation and maintenance device of the internet data center automatically generates a pressure test task according to it. The third remote command may be executed over communication established between the device and the server in the PXE environment, or over remote communication established between the device and the operating system already installed on the server. The pressure test task is issued to the server through the third remote command, and the pressure test of the server is started. The pressure test result is received automatically from the server and a pressure test report is generated; the report can be fed back to the front-end Web UI and can also be sent through a mail system to a mail address specified in the configuration information.
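Comparing a pressure test result with a preset standard result library might look like the following sketch. The metric names and threshold values are invented for the example; the source does not state what the standard library contains.

```python
# Hypothetical standard result library: per-metric lower or upper limits.
STANDARD_LIBRARY = {
    "cpu_score": {"min": 1000.0},
    "disk_write_mb_s": {"min": 400.0},
    "memory_errors": {"max": 0},
}


def build_report(results):
    """Compare results with the standard library and return a per-metric verdict."""
    report = {}
    for metric, limits in STANDARD_LIBRARY.items():
        value = results.get(metric)
        if value is None:
            report[metric] = "missing"
        elif "min" in limits and value < limits["min"]:
            report[metric] = "fail"
        elif "max" in limits and value > limits["max"]:
            report[metric] = "fail"
        else:
            report[metric] = "pass"
    return report


report = build_report({"cpu_score": 1250.0, "disk_write_mb_s": 310.5})
```

In this example the CPU score passes, the disk write throughput falls below its limit, and the memory error count is reported as missing because the server did not return it.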
The embodiment of the invention has the following technical effects: receiving the work order, checking the configuration information, executing the online operation to install the system, and executing the pressure test are integrated into one automated procedure, and state information is collected throughout its execution. Every step is executed automatically as far as possible, faults are resolved automatically, and operation results and unresolvable faults are fed back in time for manual handling. This automates the complete pipeline from the creation of the task work order, through installation and going online, to the final pressure test, improving production efficiency, reducing labor cost, and improving accuracy.
Further, the online type further includes: reserving the system migration online type;
correspondingly, the configuration items in the configuration information include one or any combination of the following items: a server IP address and a server MAC address;
the verifying the validity of each configuration item in the configuration information according to the online type includes:
if it is verified by regular expression matching that the server IP address conforms to the IP address format, the server IP address is legal; otherwise, the server IP address is illegal; and,
if it is verified by regular expression matching that the server MAC address conforms to the MAC address format, the server MAC address is legal; otherwise, the server MAC address is illegal;
the method further comprises the following steps:
and after the online type is confirmed to be a reserved system migration online type and all configuration items in the configuration information are legal, synchronizing the configuration information to a specified configuration management database.
In one embodiment, for the reserved system migration online type, which does not require reinstalling the system, once the configuration information is confirmed to be legal it can be updated into the entry of the corresponding server in the configuration management database. Managing the configuration information of every server uniformly in the configuration management database makes it easy to see, from a global view, how each resource is allocated and what the complete resource dependencies of each server are, which improves the efficiency of equipment maintenance in the internet data center.
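For the reserved system migration online type, the regular expression checks and the subsequent synchronization can be sketched as follows. The patterns assume dotted-decimal IPv4 and colon-separated MAC notation, and the dictionary standing in for the configuration management database is hypothetical.

```python
import re

# One IPv4 octet (0-255, no leading zeros), repeated four times with dots.
OCTET = r"(?:25[0-5]|2[0-4][0-9]|1[0-9][0-9]|[1-9]?[0-9])"
IPV4_RE = re.compile(OCTET + r"(?:\." + OCTET + r"){3}")
# Six colon-separated pairs of hexadecimal digits.
MAC_RE = re.compile(r"[0-9A-Fa-f]{2}(?::[0-9A-Fa-f]{2}){5}")


def sync_reserved_migration(cfg, cmdb):
    """Validate IP and MAC; update the CMDB entry only if both are legal.

    Returns the list of illegal configuration items (empty on success).
    """
    illegal = []
    if not IPV4_RE.fullmatch(cfg.get("ip", "")):
        illegal.append("server IP address")
    if not MAC_RE.fullmatch(cfg.get("mac", "")):
        illegal.append("server MAC address")
    if not illegal:
        cmdb.setdefault(cfg["server"], {}).update(ip=cfg["ip"], mac=cfg["mac"])
    return illegal


cmdb = {"srv-01": {"cabinet": "C07"}}
result_ok = sync_reserved_migration(
    {"server": "srv-01", "ip": "10.0.1.15", "mac": "3c:fd:fe:aa:bb:cc"}, cmdb)
result_bad = sync_reserved_migration(
    {"server": "srv-02", "ip": "10.0.1.256", "mac": "3c:fd:fe"}, cmdb)
```

Using `fullmatch` rather than `search` ensures that trailing characters, such as the extra digit in `10.0.1.256`, make the value illegal.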
It should be specially noted that, in each embodiment of the present specification, the first receiving end specified in the configuration information and the second receiving end specified in the configuration information may be the same receiving end or different receiving ends; each receiving end may refer to a device or a person, for example, may refer to a display device, or may refer to an operation and maintenance engineer, a field engineer, or a task issuing person.
On the other hand, as shown in fig. 2, an embodiment of the present invention further provides an automatic operation and maintenance device for an internet data center, including:
the task work order obtaining unit 200 is used for receiving the task work order from the message queue and analyzing the task work order to obtain the work order type and the configuration information;
a configuration information validity checking unit 201, configured to check validity of each configuration item in the configuration information according to the on-line type after confirming that the work order type is the on-line type;
an executing unit 202, configured to, after it is determined that the online type is an online type of a reinstallation system and all configuration items in the configuration information are legal, execute an online operation of the reinstallation system according to the configuration information for a server specified in the configuration information;
a state collection unit 203, configured to receive and save online state data during performing online operation of the reinstallation system; the online state data is obtained by a pre-installed state collection program in the online operation process of the reinstallation system;
and the pressure test unit 204 is used for starting a pressure test for the specified server after the online operation of the reinstallation system is completed.
Further, the task work order obtaining unit 200 includes:
the work order type acquisition module is used for receiving the task work orders from the message queue and acquiring the work order types from the task work orders;
and the configuration information acquisition module is used for inquiring the associated work order of the task work order and acquiring the configuration information according to the task work order and the associated work order.
Further, when the online type is an online type of a reinstallation system, the configuration items in the configuration information include one or any combination of the following items: server factory information, operating system information, server hardware information, network information and user information;
the configuration information validity checking unit 201 includes:
a factory information checking module, configured to determine that the factory information of the server is legal if it is verified from the factory information that the server supports the online operation of the reinstallation system, and that the factory information of the server is illegal otherwise;
an operating system checking module, configured to determine that the operating system information is legal if it is verified that the operating system library supports the operating system corresponding to the operating system information, and that the operating system information is illegal otherwise; the operating system library is used for recording the operating systems which can be used for the online operation of the reinstallation system;
a hardware information checking module, configured to determine that the server hardware information is legal if it is verified that the server supports the server hardware information, and that the server hardware information is illegal otherwise;
a user information checking module, configured to determine that the user information is legal if it is verified that the user information matches a record in a user information base, and that the user information is illegal otherwise; the user information is used for remotely logging in to the server; the user information base is used for recording user login information;
and a network information checking module, configured to determine that the network information is legal if it is verified that the network information corresponding to the server is complete and valid, and that the network information is illegal otherwise.
Further, after the configuration information validity checking unit 201, the device further includes:
and the configuration information prompting unit is used for generating verification result prompting information according to the configuration items of the configuration information which are illegal after confirming that at least one configuration item in the configuration information is illegal, and sending the verification result prompting information to a first receiving end appointed in the configuration information through a first prompting mode appointed in the configuration information.
Further, the online type of the reinstallation system includes: the online type of the new server, the online type of the old server, the online reinstallation system type or the reinstallation system migration online type;
accordingly, the executing unit 202 includes:
the installation information acquisition module is used for analyzing the configuration information to generate installation information;
the pre-boot module is used for controlling the specified server to restart through a first remote command and enter a pre-boot execution environment (PXE);
the installation information issuing module is used for issuing the installation information to the specified server through the pre-boot execution environment so that the specified server can complete system installation according to the installation information;
a first configuration update module to synchronize the configuration information to a specified configuration management database.
Further, the installation information includes: the state collection program;
the installation information issuing module is further configured to issue and install the state collection program to the specified server through the pre-boot execution environment, and the state collection program is configured to collect online state data of the specified server in an online operation execution process of the reinstallation system.
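The order of operations in the first execution unit can be illustrated with the sketch below. The remote command, the issuing step through the pre-boot execution environment, and the configuration management database client are all injected as callables, because this disclosure does not name concrete tools; the command string `reboot-into-pxe` and the program name `state-collector` are made up for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class InstallInfo:
    server: str
    os: str
    # The state collection program travels with the installation information.
    programs: List[str] = field(default_factory=lambda: ["state-collector"])


def build_install_info(cfg: dict) -> InstallInfo:
    """Parse the configuration information into installation information."""
    return InstallInfo(server=cfg["server"], os=cfg["os"])


def run_reinstall(
    cfg: dict,
    remote_command: Callable[[str, str], None],  # (server, command)
    pxe_issue: Callable[[str, InstallInfo], None],
    cmdb_sync: Callable[[dict], None],
) -> InstallInfo:
    info = build_install_info(cfg)
    remote_command(info.server, "reboot-into-pxe")  # first remote command (name is hypothetical)
    pxe_issue(info.server, info)                    # issue the installation information
    cmdb_sync(cfg)                                  # synchronize configuration to the database
    return info
```

Injecting the three side-effecting steps keeps the sequencing testable without touching real servers.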
The state collection unit 203 includes:
the state collection module is used for receiving the online state data uploaded by the state collection program and writing the online state data into a first database;
the fault processing module is used for automatically repairing the fault corresponding to the abnormal information according to the abnormal information if the abnormal information exists in the online state data;
and the fault prompt module is used for generating operation and maintenance prompt information according to the abnormal information if the fault corresponding to the abnormal information cannot be automatically repaired, and sending the operation and maintenance prompt information to a second receiving end appointed in the configuration information through a second prompt mode appointed in the configuration information.
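A hedged sketch of this store, repair, then escalate flow is given below. The record field `abnormal`, the in-memory list standing in for the first database, and the repair handlers are assumptions chosen for illustration only.

```python
from typing import Callable, Dict, List


def handle_state(
    record: dict,
    database: List[dict],
    repairers: Dict[str, Callable[[dict], bool]],
    notify: Callable[[str], None],
) -> str:
    """Store one online state record and react to any abnormal information in it."""
    database.append(record)         # write the online state data to the first database
    fault = record.get("abnormal")  # hypothetical field carrying the abnormal information
    if fault is None:
        return "ok"
    repair = repairers.get(fault)
    if repair is not None and repair(record):
        return "repaired"           # the fault was repaired automatically
    # No handler, or the handler failed: prompt the operation and maintenance engineer.
    notify(f"{record['server']}: fault '{fault}' needs manual handling")
    return "escalated"
```

Every record is stored before any fault handling, so the database keeps a full history even when a repair fails.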
Further, the pressure measurement unit 204 includes:
the pressure measurement task generation module is used for generating a pressure measurement task according to a preset pressure measurement rule;
the pressure measurement starting module is used for issuing the pressure measurement task to the appointed server through a third remote command so as to enable the appointed server to start a pressure test according to the pressure measurement task;
and the pressure measurement result receiving module is used for receiving the pressure measurement result from the specified server and comparing the pressure measurement result with a preset standard result library to generate a pressure measurement report.
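The task generation and result comparison steps might look like the following sketch. The rule format, the metric names, and the idea that the standard result library stores a minimum acceptable value per metric are assumptions; issuing the task through the third remote command is left out.

```python
from typing import Dict, List


def make_task(server: str, rules: List[dict]) -> dict:
    """Turn preset pressure test rules into a task for one server."""
    return {
        "server": server,
        "tests": [
            {"name": rule["name"], "duration_s": rule.get("duration_s", 60)}
            for rule in rules
        ],
    }


def build_report(results: Dict[str, float], standard: Dict[str, float]) -> dict:
    """Compare measured values with the standard result library."""
    metrics = {}
    for name, minimum in standard.items():
        value = results.get(name)
        metrics[name] = {
            "value": value,
            "minimum": minimum,
            # A missing measurement counts as a failure.
            "passed": value is not None and value >= minimum,
        }
    return {"metrics": metrics, "passed": all(m["passed"] for m in metrics.values())}
```

The report keeps the per-metric detail alongside the overall verdict, so a failed server can be diagnosed without rerunning the test.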
Further, the online type further includes: a reserved system migration online type;
correspondingly, the configuration items in the configuration information include one or any combination of the following items: a server IP address and a server MAC address;
the configuration information validity checking unit 201 includes:
the IP address format checking unit is used for checking, through regular expression verification, whether the server IP address conforms to the IP address format; if so, the server IP address is legal, otherwise the server IP address is illegal; and
the MAC address format checking unit is used for checking, through regular expression verification, whether the server MAC address conforms to the MAC address format; if so, the server MAC address is legal, otherwise the server MAC address is illegal;
the device further comprises:
and the second execution unit is used for synchronizing the configuration information to a specified configuration management database after the online type is determined to be the reserved system migration online type and all configuration items in the configuration information are legal.
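The regular expression checks and the follow-up synchronization can be sketched as below. The patterns cover dotted-decimal IPv4 and colon-separated MAC addresses only, which is an assumption, since this disclosure does not fix the address formats. The keys `ip` and `mac` and the `cmdb_sync` callable are likewise hypothetical.

```python
import re
from typing import Callable, List

# One IPv4 octet: 0-255 without leading zeros.
_OCTET = r"(?:25[0-5]|2[0-4][0-9]|1[0-9][0-9]|[1-9]?[0-9])"
IPV4_RE = re.compile(rf"{_OCTET}(?:\.{_OCTET}){{3}}")
# Six colon-separated pairs of hexadecimal digits.
MAC_RE = re.compile(r"[0-9A-Fa-f]{2}(?::[0-9A-Fa-f]{2}){5}")


def illegal_address_items(cfg: dict) -> List[str]:
    """Check only the address items that are present in the configuration."""
    illegal = []
    for key, pattern in (("ip", IPV4_RE), ("mac", MAC_RE)):
        # fullmatch rejects partial matches and trailing characters.
        if key in cfg and not pattern.fullmatch(str(cfg[key])):
            illegal.append(key)
    return illegal


def handle_reserved_migration(cfg: dict, cmdb_sync: Callable[[dict], None]) -> bool:
    """Synchronize to the configuration management database only if all items are legal."""
    if illegal_address_items(cfg):
        return False
    cmdb_sync(cfg)
    return True
```

Because no reinstallation takes place for this online type, the format check is the only gate before the configuration is recorded.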
The automatic operation and maintenance device for the internet data center provided in the embodiment of the present invention is a product-type implementation corresponding to the aforementioned automatic operation and maintenance method for the internet data center. A person skilled in the art can understand the device without ambiguity from the corresponding embodiments of the method, and details are not described herein again.
In another aspect, an embodiment of the present invention further provides a readable storage medium, on which a computer program is stored, where the computer program is executed by a processor to implement any one of the methods described above.
The technical scheme has the following beneficial effects: by automatically monitoring the message queue, the work order is obtained, the configuration information is extracted and the legality of the configuration information is verified automatically, so the work order and the configuration information do not need to be extracted manually, and the operation and maintenance engineer only needs to handle machines that have problems afterwards; the pressure test task can be generated automatically according to the pressure test rule preset by the operation and maintenance engineer and issued automatically to the server, so that the pressure test is completed and the pressure test result is fed back automatically. An operation and maintenance process that originally required a large amount of manual participation is thereby automated, which improves production efficiency and reduces the rate of manual operation accidents.
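To make the automated flow concrete, the sketch below processes one work order end to end. It uses Python's in-process `queue.Queue` as a stand-in for the message queue and assumes a JSON work order with the hypothetical fields `type`, `config`, `online_type` and `server`; validation, reinstallation and the pressure test are injected callables rather than real implementations.

```python
import json
import queue
from typing import Callable, List


def process_next(
    mq: "queue.Queue[str]",
    validate: Callable[[dict], List[str]],  # returns the illegal configuration items
    reinstall: Callable[[dict], None],
    stress_test: Callable[[str], None],
) -> str:
    """Take one task work order from the queue and drive it through the flow."""
    order = json.loads(mq.get_nowait())
    if order.get("type") != "online":
        return "ignored"        # not an online-type work order
    cfg = order["config"]
    if validate(cfg):
        return "rejected"       # at least one configuration item is illegal
    if cfg.get("online_type") == "reinstall":
        reinstall(cfg)          # online operation of the reinstallation system
        stress_test(cfg["server"])  # pressure test once the operation has completed
        return "completed"
    return "unsupported"
```

The function returns a status string for each order, which makes it easy to see which orders still need an engineer's attention.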
It should be understood that the specific order or hierarchy of steps in the processes disclosed is an example of exemplary approaches. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the processes may be rearranged without departing from the scope of the present disclosure. The accompanying method claims present elements of the various steps in a sample order, and are not intended to be limited to the specific order or hierarchy presented.
In the foregoing detailed description, various features are grouped together in a single embodiment for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments of the subject matter require more features than are expressly recited in each claim. Rather, as the following claims reflect, invention lies in less than all features of a single disclosed embodiment. Thus, the following claims are hereby expressly incorporated into the detailed description, with each claim standing on its own as a separate preferred embodiment of the invention.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the disclosure. Thus, the present disclosure is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
What has been described above includes examples of one or more embodiments. It is, of course, not possible to describe every conceivable combination of components or methodologies for purposes of describing the aforementioned embodiments, but one of ordinary skill in the art may recognize that many further combinations and permutations of various embodiments are possible. Accordingly, the embodiments described herein are intended to embrace all such alterations, modifications and variations that fall within the scope of the appended claims. Furthermore, to the extent that the term "includes" is used in either the detailed description or the claims, such term is intended to be inclusive in a manner similar to the term "comprising" as "comprising" is interpreted when employed as a transitional word in a claim. Furthermore, any use of the term "or" in the specification or the claims is intended to mean a "non-exclusive or".
Those of skill in the art will further appreciate that the various illustrative logical blocks, units, and steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware, computer software, or combinations of both. To clearly illustrate the interchangeability of hardware and software, various illustrative components, elements, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends upon the particular application and design requirements of the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present embodiments.
The various illustrative logical blocks, or elements, described in connection with the embodiments disclosed herein may be implemented or performed with a general purpose processor, a digital signal processor, an Application Specific Integrated Circuit (ASIC), a field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a digital signal processor and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a digital signal processor core, or any other similar configuration.
The steps of a method or algorithm described in connection with the embodiments disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may be stored in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the art. For example, a storage medium may be coupled to the processor such that the processor can read information from, and write information to, the storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC, which may be located in a user terminal. In the alternative, the processor and the storage medium may reside in different components in a user terminal.
In one or more exemplary designs, the functions described above in connection with the embodiments of the invention may be implemented in hardware, software, firmware, or any combination of the three. If implemented in software, the functions may be stored on, or transmitted over, a computer-readable medium as one or more instructions or code. Computer-readable media include both computer storage media and communication media that facilitate transfer of a computer program from one place to another. Storage media may be any available media that can be accessed by a general-purpose or special-purpose computer. For example, such computer-readable media can include, but are not limited to, RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to carry or store program code in the form of instructions or data structures and which can be read by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Additionally, any connection is properly termed a computer-readable medium, and is thus included if the software is transmitted from a website, server, or other remote source via a coaxial cable, fiber optic cable, twisted pair, Digital Subscriber Line (DSL), or wirelessly, e.g., by infrared, radio, or microwave. Disk and disc, as used herein, include compact discs, laser discs, optical discs, DVDs, floppy disks and Blu-ray discs, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above may also be included in the computer-readable medium.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are merely exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (10)

1. An automatic operation and maintenance method for an internet data center is characterized by comprising the following steps:
receiving a task work order from a message queue, and parsing the task work order to obtain a work order type and configuration information;
after the work order type is confirmed to be an online type, verifying the legality of each configuration item in the configuration information according to the online type;
after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal, executing, for the server specified in the configuration information, the online operation of the reinstallation system according to the configuration information;
receiving and saving online state data during execution of the online operation of the reinstallation system; the online state data is obtained by a pre-installed state collection program during the online operation of the reinstallation system;
and after the online operation of the reinstallation system is completed, starting a pressure test for the specified server.
2. The method as claimed in claim 1, wherein the receiving the task work order from the message queue and analyzing the task work order to obtain the work order type and configuration information comprises:
receiving the task work order from the message queue, and acquiring the work order type from the task work order;
and inquiring a related work order of the task work order, and obtaining the configuration information according to the task work order and the related work order.
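One way to picture the combination of a task work order with its related work orders is the sketch below. The fields `related_ids` and `config`, the `lookup` callable, and the policy that the task work order overrides the related work orders on conflicting keys are all assumptions; the claim itself does not say how the two sources are merged.

```python
from typing import Callable, Dict, Iterable, Tuple


def merge_config(task_order: dict, related_orders: Iterable[dict]) -> Dict[str, object]:
    """Combine configuration from related work orders with that of the task work order."""
    config: Dict[str, object] = {}
    for order in related_orders:
        config.update(order.get("config", {}))
    # Assumed policy: values in the task work order win on conflicts.
    config.update(task_order.get("config", {}))
    return config


def extract(task_order: dict, lookup: Callable[[str], dict]) -> Tuple[str, Dict[str, object]]:
    """Return the work order type and the merged configuration information."""
    related = [lookup(order_id) for order_id in task_order.get("related_ids", [])]
    return task_order["type"], merge_config(task_order, related)
```

Here `lookup` would query whatever system stores the work orders; a dictionary lookup is enough to try the sketch out.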
3. The automatic operation and maintenance method for the internet data center according to claim 1, wherein when the online type is an online type of a reinstallation system, the configuration items in the configuration information include one or any combination of the following items: server delivery information, operating system information, server hardware information, network information and user information;
the verifying the validity of each configuration item in the configuration information according to the online type includes:
verifying, according to the server delivery information, whether the server supports the online operation of the reinstallation system; if so, the server delivery information is legal, otherwise the server delivery information is illegal;
verifying whether the operating system library supports the operating system corresponding to the operating system information; if so, the operating system information is legal, otherwise the operating system information is illegal; the operating system library is used for recording the operating systems that can be used for the online operation of the reinstallation system;
verifying whether the server supports the server hardware information; if so, the server hardware information is legal, otherwise the server hardware information is illegal;
verifying whether the user information matches a record in the user information base; if so, the user information is legal, otherwise the user information is illegal; the user information is used for remotely logging in to the server; the user information base is used for recording user login information;
and verifying whether the network information corresponding to the server is complete and valid; if so, the network information is legal, otherwise the network information is illegal.
4. The method as claimed in claim 1, wherein after the checking the validity of each configuration item in the configuration information according to the online type, the method further comprises:
after confirming that at least one configuration item in the configuration information is illegal, generating verification result prompt information according to the illegal configuration items, and sending the verification result prompt information to a first receiving end specified in the configuration information in a first prompt mode specified in the configuration information.
5. The method as claimed in claim 1, wherein the online type of the reinstallation system comprises: the online type of the new server, the online type of the old server, the online reinstallation system type or the reinstallation system migration online type;
correspondingly, the executing, for the server specified in the configuration information, of the online operation of the reinstallation system according to the configuration information, after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal, comprises:
analyzing the configuration information to generate installation information;
controlling the specified server to restart through a first remote command and enter a pre-boot execution environment;
issuing the installation information to the specified server through the pre-boot execution environment, so that the specified server completes system installation according to the installation information;
synchronizing the configuration information to a designated configuration management database.
6. The internet data center automatic operation and maintenance method of claim 5, wherein the installation information comprises: the state collection program;
the issuing of the installation information to the specified server through the pre-boot execution environment, so that the specified server completes system installation according to the installation information, comprises:
issuing the state collection program to the specified server and installing it through the pre-boot execution environment, wherein the state collection program is used for collecting online state data of the specified server during execution of the online operation of the reinstallation system;
the receiving and saving online state data during the execution of the online operation of the reinstallation system includes:
receiving the on-line state data uploaded by the state collection program, and writing the on-line state data into a first database;
if abnormal information exists in the online state data, automatically repairing the fault corresponding to the abnormal information according to the abnormal information;
and if the fault corresponding to the abnormal information cannot be automatically repaired, generating operation and maintenance prompt information according to the abnormal information, and sending the operation and maintenance prompt information to a second receiving end appointed in the configuration information through a second prompt mode appointed in the configuration information.
7. The method for automatically operating and maintaining the internet data center according to claim 1, wherein the starting of the pressure test for the specified server after the online operation of the reinstallation system is completed comprises:
generating a pressure measurement task according to a preset pressure measurement rule;
issuing the pressure measurement task to the specified server through a third remote command so that the specified server starts a pressure test according to the pressure measurement task;
and receiving a pressure measurement result from the specified server, and comparing the pressure measurement result with a preset standard result library to generate a pressure measurement report.
8. The method for automatically operating and maintaining the internet data center according to claim 1, wherein the online type further comprises: a reserved system migration online type;
correspondingly, the configuration items in the configuration information include one or any combination of the following items: a server IP address and a server MAC address;
the verifying the validity of each configuration item in the configuration information according to the online type includes:
checking, through regular expression verification, whether the server IP address conforms to the IP address format; if so, the server IP address is legal, otherwise the server IP address is illegal; and
checking, through regular expression verification, whether the server MAC address conforms to the MAC address format; if so, the server MAC address is legal, otherwise the server MAC address is illegal;
the method further comprises the following steps:
and after the online type is confirmed to be a reserved system migration online type and all configuration items in the configuration information are legal, synchronizing the configuration information to a specified configuration management database.
9. An automatic operation and maintenance device of an internet data center is characterized by comprising:
the task work order obtaining unit is used for receiving the task work order from the message queue and analyzing the task work order to obtain the work order type and the configuration information;
the configuration information validity checking unit is used for checking the validity of each configuration item in the configuration information according to the online type after the work order type is determined to be the online type;
the first execution unit is used for executing, for the server specified in the configuration information, the online operation of the reinstallation system according to the configuration information after the online type is confirmed to be the online type of the reinstallation system and all the configuration items in the configuration information are legal;
the state collection unit is used for receiving and saving online state data during the execution of the online operation of the reinstallation system; the online state data is obtained by a pre-installed state collection program during the online operation of the reinstallation system;
and the pressure testing unit is used for starting the pressure test for the specified server after the online operation of the reinstallation system is finished.
10. A readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method according to any one of claims 1-8.
CN202110748707.0A 2021-07-02 2021-07-02 Automatic operation and maintenance method and device for internet data center and readable storage medium Pending CN113657702A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110748707.0A CN113657702A (en) 2021-07-02 2021-07-02 Automatic operation and maintenance method and device for internet data center and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110748707.0A CN113657702A (en) 2021-07-02 2021-07-02 Automatic operation and maintenance method and device for internet data center and readable storage medium

Publications (1)

Publication Number Publication Date
CN113657702A true CN113657702A (en) 2021-11-16

Family

ID=78477875

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110748707.0A Pending CN113657702A (en) 2021-07-02 2021-07-02 Automatic operation and maintenance method and device for internet data center and readable storage medium

Country Status (1)

Country Link
CN (1) CN113657702A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114095350A (en) * 2021-11-22 2022-02-25 中国电信股份有限公司 Equipment configuration method, service configuration method, device, equipment and medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104484253A (en) * 2014-12-29 2015-04-01 浪潮电子信息产业股份有限公司 Automatic testing method for human-computer interaction Intel MIC (Many Integrated Core) card
CN106681854A (en) * 2015-11-11 2017-05-17 北京国双科技有限公司 Information checking method, device and system
CN110188021A (en) * 2019-04-11 2019-08-30 深圳市同泰怡信息技术有限公司 A kind of automated testing method of server

Similar Documents

Publication Publication Date Title
CN110879712B (en) Cloud data center physical host installation method and related device
EP1978672A1 (en) Method for implementing management software, hardware with pre-configured software and implementing method thereof
CN105260208A (en) Method for automatically refreshing RAID card drive in batches by server
CN105183520A (en) Automatic remote installing and debugging method and system for computer software
CN111414169B (en) BMC (baseboard management controller) image upgrading method and related components
CN113504932B (en) Firmware data updating method and device
CN113312064A (en) Installation configuration method and device of physical machine and computer readable medium
CN112256505A (en) Server stability testing method and device and related components
CN115567392A (en) Automatic deployment and upgrade method for customer internal business system
CN113657702A (en) Automatic operation and maintenance method and device for internet data center and readable storage medium
CN111273932A (en) Component refreshing method, system and computer readable storage medium
CN110297749B (en) Method and terminal for testing new function
CN107992420B (en) Management method and system for test item
CN113656088B (en) Self-service management method, device and storage medium for internet data center server
CN112711575A (en) Deployment method, system and related device of database cluster
CN116483416A (en) Firmware online upgrading method, server and storage medium
CN115454851A (en) Interface regression testing method and device, storage medium and electronic device
CN103995776A (en) Automatic version verification method based on B/S structure system
CN115292175A (en) Regression testing method, device, equipment and storage medium
CN115033258A (en) Automatic upgrading and pressure testing method for SD card firmware of camera
CN113918162A (en) Front-end code automatic checking method based on centralized management mode
CN110134558B (en) Method and device for detecting server
CN113179181A (en) Data acquisition method, device and system, data processing device and electronic equipment
CN113903368B (en) Automatic test method, device and equipment for disc and storage medium
CN114817042A (en) Server testing method and device, testing platform and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20230506

Address after: Room 501-502, 5/F, Sina Headquarters Scientific Research Building, Block N-1 and N-2, Zhongguancun Software Park, Dongbei Wangxi Road, Haidian District, Beijing, 100193

Applicant after: Sina Technology (China) Co.,Ltd.

Address before: 7th Floor, Sina Headquarters Scientific Research Building, Plot N-1 and N-2, Zhongguancun Software Park, Dongbei Wangxi Road, Haidian District, Beijing, 100193

Applicant before: Sina.com Technology (China) Co.,Ltd.