WO2021203979A1 - 运维处理方法、装置及计算机设备 - Google Patents

运维处理方法、装置及计算机设备 Download PDF

Info

Publication number
WO2021203979A1
WO2021203979A1 PCT/CN2021/083003 CN2021083003W WO2021203979A1 WO 2021203979 A1 WO2021203979 A1 WO 2021203979A1 CN 2021083003 W CN2021083003 W CN 2021083003W WO 2021203979 A1 WO2021203979 A1 WO 2021203979A1
Authority
WO
WIPO (PCT)
Prior art keywords
application
information
target application
target
maintenance
Prior art date
Application number
PCT/CN2021/083003
Other languages
English (en)
French (fr)
Inventor
司媛媛
吴咏梅
赵冬
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021203979A1 publication Critical patent/WO2021203979A1/zh

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3055Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3051Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3089Monitoring arrangements determined by the means or processing involved in sensing the monitored data, e.g. interfaces, connectors, sensors, probes, agents
    • G06F11/3093Configuration details thereof, e.g. installation, enabling, spatial arrangement of the probes

Definitions

  • This application relates to the field of operation and maintenance technology, and in particular to an operation and maintenance processing method, device, and computer equipment.
  • IT Information Technology
  • this application provides an operation and maintenance processing method, device, and computer equipment, the main purpose of which is to improve the current traditional IT operation and maintenance methods that affect the IT operation and maintenance efficiency and increase the IT operation and maintenance cost technical problems.
  • an operation and maintenance processing method which includes: collecting status information of each application; configuring operation and maintenance operation information for each application according to the status information of each application; and responding to the The operation and maintenance instruction of the target application in each application sends an application service request corresponding to the operation and maintenance instruction to the server of the target application according to the operation information of the target application; and the server returns according to the target application To determine whether the target application is abnormal.
  • an operation and maintenance processing device which includes: a collection module for collecting status information of each application; a configuration module for reporting the status information of each application to each Application configuration operation and maintenance operation information; a sending module for responding to the operation and maintenance instructions of the target application in each application, according to the operation and maintenance operation information of the target application, to send information to the server of the target application
  • the application service request corresponding to the dimension instruction; the determining module is used to determine whether the target application is abnormal according to the request response information returned by the server of the target application.
  • a storage medium having computer-readable instructions stored thereon, and when the computer-readable instructions are executed by a processor, the following method is implemented: collecting status information of each application; The status information of the application configures operation and maintenance operation information for each application; in response to the operation and maintenance instructions of the target application in each application, according to the operation and maintenance operation information of the target application, the server of the target application sends and The application service request corresponding to the operation and maintenance instruction; according to the request response information returned by the server of the target application, it is determined whether the target application is abnormal.
  • a computer device including a storage medium, a processor, and computer-readable instructions stored on the storage medium and executable on the processor, and the processor executes the computer-readable instructions.
  • the following methods are implemented when instructing: collecting status information of each application; configuring operation and maintenance operation information for each application according to the status information of each application; responding to the operation and maintenance instruction of the target application in each application, according to the The operation and maintenance operation information of the target application sends an application service request corresponding to the operation and maintenance instruction to the server of the target application; according to the request response information returned by the server of the target application, it is determined whether the target application is abnormal.
  • this application can truly realize the one-click management and operation of the middleware component application process, and then can realize the one-click automated IT operation and maintenance processing.
  • IT operation and maintenance automation Through the IT operation and maintenance automation, it can help improve the operation.
  • the operation efficiency of maintenance personnel reduces the repetitive tasks of operation and maintenance, which can improve IT operation and maintenance efficiency and save IT operation and maintenance costs.
  • FIG. 1 shows a schematic flowchart of an operation and maintenance processing method provided by an embodiment of the present application.
  • FIG. 2 shows a schematic flowchart of another operation and maintenance processing method provided by an embodiment of the present application.
  • FIG. 3 shows a schematic structural diagram of an operation and maintenance processing apparatus provided by an embodiment of the present application.
  • the technical solution of this application may involve the field of blockchain technology.
  • the data involved in this application such as status information and/or abnormality determination results, can be stored in a database, or can be stored in a blockchain, such as distributed storage through a blockchain, which is not limited in this application .
  • this embodiment provides an operation and maintenance processing method. As shown in FIG. 1, the method includes the following steps.
  • the execution subject of this embodiment may be a device or equipment used for IT operation and maintenance processing, and may be configured on the side of the operation and maintenance management system.
  • Application status information may include: subsystem name, Chinese name, cluster information, instance information, application type (such as Java, Docker, Kafka, Zookeeper, Spark, Hadoop, etc.), environment information, host IP address, etc.
  • the application data synchronization tool can be used to call the corresponding functional interface to synchronize the status information of each application that needs to be monitored by IT operation and maintenance to the side of the operation and maintenance management system, so as to facilitate subsequent automated IT operation and maintenance processing.
  • the process shown in steps 102 to 104 can be performed.
  • the operation and maintenance operation information can include the automatic operation and maintenance operation content for a single application, such as writing corresponding scripts or command line data, etc., each application can have its own corresponding operation and maintenance operation information, and operation and maintenance operation information for different applications Can be the same or different.
  • the operation and maintenance personnel can click the edit button on the front end of the operation and maintenance management system to jump to the editing page, and then edit Configure the automated operation and maintenance scripts or command line data of a single application on the page.
  • the whole process can be visualized, which facilitates operation and maintenance management, and improves the efficiency of automated IT operation and maintenance.
  • the target application can be any of the various applications that need to be monitored for IT operation and maintenance, or a specific application among them.
  • Operation and maintenance instructions can be automatically triggered by the system at regular intervals or input by operation and maintenance personnel.
  • the operation and maintenance personnel can use the button click event in the front end of the operation and maintenance management system to perform the operation of the target application (such as application start, stop, restart, etc.), thereby automatically sending the application corresponding to the operation and maintenance instruction to the server of the target application Service request, this method can implement the operation and maintenance operations of the target application in an automated manner.
  • the operation and maintenance instruction when receiving the operation and maintenance instruction of the target application, the operation and maintenance instruction can be parsed to determine the target application and the application service that the target application needs to request; according to the operation and maintenance operation information corresponding to the target application configuration, obtain the required application service The executed script or command line data is executed, and then the corresponding application service request is sent to the server of the target application.
  • the user and password of the target application server have been edited in the operation and maintenance operation information in advance. Therefore, in the IT automation operation During the maintenance process, automatic application user login can be realized, reducing the risk of operation and maintenance personnel logging in to the application server, and improving the operation efficiency of operation and maintenance personnel.
  • the request response information may include the request result information of the application request corresponding to the operation and maintenance instruction. For example, if the application service returned by the server of the target application is the same as the requested application service, and it is received within the standard time without delay or data loss, then it can be determined that the target application is not abnormal; and if The application service returned by the server of the target application is different from the requested application service, or there is a delay in reception, data loss, etc., it can be determined that the target application is abnormal.
  • the operation and maintenance operation information corresponding to each application can be pre-configured according to the collected status information of each application, and subsequently, when the operation and maintenance instruction of the target application is received, the corresponding operation and maintenance operation information can be determined according to the target application.
  • the configured operation and maintenance operation information sends the application service request corresponding to the operation and maintenance instruction to the server of the target application, and then can automatically determine whether the target application is abnormal according to the request response information returned by the server.
  • this embodiment can truly realize the one-click management and operation of the middleware component application process, and then can realize the one-click automatic IT operation and maintenance processing.
  • IT operation and maintenance automation it can help improve The operation efficiency of operation and maintenance personnel reduces the repetitive work of operation and maintenance, which can improve IT operation and maintenance efficiency and save IT operation and maintenance costs.
  • the method includes the following steps.
  • Kettel data synchronization tool uses the Kettel data synchronization tool to call the UCMDB interface to synchronize application status information to the operation and maintenance management system side, such as application status information including subsystem name, Chinese name, cluster information, instance information, application type, environment information, host IP address, etc. .
  • the status information of these applications can be pre-stored in the blockchain, such as the status information of each application. Stored in one or more nodes of the blockchain.
  • Blockchain essentially a decentralized database, is a series of data blocks associated with cryptographic methods. Each data block contains a batch of network transaction information for verification. The validity of the information (anti-counterfeiting) and the generation of the next block.
  • the blockchain can include the underlying platform of the blockchain, the platform product service layer, and the application service layer.
  • the state information of the corresponding application can be obtained from the node of the blockchain, and then the operation and maintenance operation information corresponding to the application can be configured according to the obtained application state information.
  • configure operation and maintenance operation information for each application according to the status information of each application which may specifically include: firstly based on the application identification of each application (such as application name, ID number, etc.), application instance information, and application type (such as Java, Docker, Kafka, Zookeeper, Spark, Hadoop, etc.), application environment information, system identification of the application subsystem (such as system name, system number, etc.), application host IP address, host cluster information (such as cluster name, cluster Number of nodes, master and slave nodes, etc.), configure simulated user login information for each application (such as the application user and application user password of the application, etc., the application user can be a simulated user, used for IT operation and maintenance testing, the simulated user's operation The behavior is consistent with the real user's operation behavior) and simulated user operation event information (such as simulated user's operation command on the application, including application start command, application stop command, application restart command, command to call a certain function of the application, etc.); then send the simulated user Operation event information configure
  • the operation and maintenance personnel can click the edit button on the front end of the operation and maintenance management system to jump to the editing page, and then configure a single application in the editing page Automatic operation and maintenance scripts or command line data, etc., and then realize the daily operation commands of configuration and maintenance applications, such as application user, application user password, start command, stop command, restart command, log path information, and operation and maintenance personnel information.
  • configuration and maintenance applications such as application user, application user password, start command, stop command, restart command, log path information, and operation and maintenance personnel information.
  • the script editing tool can be used to edit the preset script information corresponding to the simulated user operation event information.
  • the preset script information When executed, it will be used to request the application service corresponding to the simulated user operation event information. Sent to the server corresponding to application A to obtain the application service.
  • simulated user login information for each application based on the application identification of each application, application instance information, application type, application environment information, system identification of the subsystem to which the application belongs, IP address of the host to which the application belongs, and cluster information of the host to which the host belongs
  • simulated user operation event information which can specifically include: application identification based on each application, application type, application instance information, application environment information, system identification of the subsystem to which the application belongs, IP address of the host to which the application belongs, and information about the cluster to which the host belongs to each application Configure simulated user login information with non-ROOT account login permissions, and configure simulated user operation event information.
  • the application operation commands contained in the simulated user operation event information are operated in a single command line, and the file path in the application operation command is absolute Configure by path.
  • the simulated user login information can run and log in with a non-ROOT account
  • the application operation command contained in the simulated user operation event information is operated in a single command line mode
  • the file path in the application operation command is configured in an absolute path mode.
  • the application process is usually run with a non-ROOT account
  • the application user and password correspond to the startup user and login password of the application.
  • the start and stop commands of the application are operated in a single command line mode.
  • the file path in the command needs to be configured in an absolute path mode.
  • step 204 may specifically include: first, according to the operation and maintenance instructions of the target application, determine the target operation event information of the simulated user on the target application (such as operation events such as starting, stopping, restarting, calling a certain function of the target application) ; Then obtain the target preset script information corresponding to the target operation event information (you can obtain the target application corresponding to the target application from the pre-configured preset script information and start, stop, or restart, or call a function of the target application And other corresponding target preset script information); then execute the target preset script information, and send an application service request corresponding to the target operation event information to the server of the target application.
  • the target operation event information of the simulated user on the target application such as operation events such as starting, stopping, restarting, calling a certain function of the target application
  • the operation and maintenance personnel can perform application operations (such as start, stop, restart, etc.) through button click events on the front of the system.
  • application operations such as start, stop, restart, etc.
  • This method will automate the application operation and reduce the operation and maintenance personnel.
  • the risk of logging in to the server improves the operational efficiency of operation and maintenance personnel.
  • the background log of the operation command will be displayed on the front page of the operation and maintenance management system to understand the dynamic information of the application in real time.
  • step 205 may specifically include: judging whether the target application service is requested according to the request response information; if the target application service is not requested, then determining that the target application is abnormal; Whether the target application service is consistent with the target application service required for the operation and maintenance instruction; if it is determined that the requested target application service is inconsistent with the target application service required for the operation and maintenance instruction, it is determined that the target application is abnormal; if it is determined that the requested target application service is inconsistent If the target application service is consistent with the target application service required by the operation and maintenance instruction, it is determined whether the dynamic information of the target application after the target application service meets the preset standard dynamic change condition; if it is determined that the dynamic information meets the preset standard dynamic change condition, It is determined that there is no abnormality in the target application; if it is determined that the dynamic information does not meet the preset standard dynamic change condition, it is determined that the target application is abnormal.
  • the target is determined There is no abnormality in the application; if it is determined that the application service is not requested, or the requested target application service is inconsistent with the target application service required by the operation and maintenance instructions, or the dynamic information of the target application after the target application service is obtained does not meet the preset standard Dynamically changing conditions, it is determined that the target application is abnormal.
  • it can accurately determine whether the target application is abnormal, and one-click automated IT operation and maintenance can be achieved, which improves the efficiency of IT operation and maintenance and saves the cost of IT operation and maintenance.
  • the preset standard dynamic change conditions can be pre-set according to actual needs. For example, it is determined whether the dynamic information of the target application after the target application service meets the preset standard dynamic change conditions, including: if the target application service is enabled For services with preset functions, it is determined whether the application data generated by the target application after obtaining the target application service contains the scheduled application data that should be generated after the service with the preset function is started. If the scheduled application data is not included, it is determined that the dynamic information is not included.
  • the target application service is judged to be a service that shuts down the target application, it is judged whether the preset application data of the target application after the target application service is deleted within the preset time period, if the preset application data If it is not deleted within the preset time period, it is determined that the dynamic information does not meet the preset standard dynamic change condition.
  • the application data generated by application 1 after receiving application service A should contain some specific data. If these specific data are not included, it means that application service A has not been obtained successfully, and then it is determined that the dynamic information of application 1 does not meet the expectations.
  • Set the standard dynamic change conditions for another example, after application 2 receives application service B, the specific application data should be deleted within the preset time period. If the specific application data is not deleted within the preset time period, it means that an abnormality has occurred. It is determined that the dynamic information of Application 2 does not meet the preset standard dynamic change conditions.
  • the application in response to a stop request sent by application 3 to obtain an application service that is required to stop application 3, if the application service is requested and the running data of application 3 has been deleted within a preset period of time after the application service is executed, the application is determined 3 No abnormality; if the application service is not requested, or the other application services requested, or the application service is requested, and the running data of application 3 is not deleted within the preset period of time after the application service is executed, then confirm Application 3 is abnormal.
  • the method of this embodiment may further include: if it is determined that the target application is abnormal, saving operation log information corresponding to the operation and maintenance instruction; and outputting alarm information corresponding to the target application in the operation and maintenance management system.
  • any operation record can be written into the database in the form of a data table as the operation history (operation log information), and the history record is used as the production operation record of the operation and maintenance personnel. Trace any operation of production.
  • the alarm information of the application can be output on the front end of the operation and maintenance management system, and the warning can be given in the form of text, picture, audio, video, light, vibration, etc., so as to facilitate the operation and maintenance personnel in the first time Know the abnormal application, which is convenient for IT operation and maintenance. If there is an abnormality, you can also record the data and trace the cause of the abnormality to facilitate finding the solution information in time.
  • This embodiment provides an automated and interactive IT operation and maintenance management system and its corresponding application method.
  • the operation and maintenance management system can dynamically collect application status information and obtain application status information.
  • one-click management and operation of middleware component application processes such as Java, Docker, Kafka, Zookeeper, Spark, Hadoop, etc.; at the same time; Keep track of the operation time, operation content, and operator logs, and use the historical record as the production operation record of the operation and maintenance personnel to trace any production operations.
  • middleware component application processes such as Java, Docker, Kafka, Zookeeper, Spark, Hadoop, etc.
  • IT operation and maintenance automation it can help improve the operation efficiency of operation and maintenance personnel, reduce operation and maintenance and solve repetitive tasks.
  • this embodiment provides an operation and maintenance processing device.
  • the device includes: a collection module 31, a configuration module 32, a sending module 33, Determine module 34.
  • the collection module 31 is used to collect status information of each application;
  • the configuration module 32 is used to configure operation and maintenance operation information for each application according to the status information of each application;
  • the sending module 33 is used to respond to each application
  • the operation and maintenance instruction of the target application in the target application sends an application service request corresponding to the operation and maintenance instruction to the server of the target application according to the operation and maintenance operation information of the target application;
  • the request response information returned by the application server determines whether the target application is abnormal.
  • the configuration module 32 is specifically configured to be based on the application identification of each application, application instance information, application type, application environment information, system identification of the subsystem to which the application belongs, IP address of the host to which the application belongs, and the host to which the host belongs
  • the cluster information configures simulated user login information and simulated user operation event information to the respective applications; configures preset script information to the simulated user operation event information, wherein the preset script information is used to use the simulated
  • the application service request corresponding to the user operation event information is sent to the server corresponding to each application.
  • the configuration module 32 is specifically used for the application identification, application type, application instance information, application environment information, system identification of the subsystem to which the application belongs, the IP address of the host to which the application belongs, and the host
  • the belonging cluster information configures the simulated user login information with non-ROOT account login authority to the respective applications, and configures simulated user operation event information, wherein the application operation commands contained in the simulated user operation event information are operated in a single command line mode ,
  • the file path in the application operation command is configured in an absolute path mode.
  • the sending module 33 is specifically configured to determine the target operation event information of the simulated user on the target application according to the operation and maintenance instruction; obtain target preset script information corresponding to the target operation event information; The target preset script information is executed, and an application service request corresponding to the target operation event information is sent to the server.
  • the determining module 34 is specifically configured to determine whether the target application service is requested according to the request response information; if the target application service is not requested, determine that the target application is abnormal; if the request is requested to the target Application request, it is determined whether the requested target application service is consistent with the target application service required corresponding to the operation and maintenance instruction; if it is determined that the requested target application service is inconsistent with the target application service required corresponding to the operation and maintenance instruction, It is determined that the target application is abnormal; if it is determined that the requested target application service is consistent with the target application service required by the operation and maintenance instruction, it is determined whether the dynamic information of the target application after the target application service is obtained Meet the preset standard dynamic change condition; if it is determined that the dynamic information meets the preset standard dynamic change condition, it is determined that the target application does not appear abnormal; if it is determined that the dynamic information does not meet the preset standard dynamic change condition, it is determined that all The target application is abnormal.
  • the determining module 34 is specifically further configured to determine whether the target application includes the application data generated after obtaining the target application service if the target application service is a service with a preset function enabled. If the predetermined application data that should be generated after the service with the preset function is started, if the predetermined application data is not included, it is determined that the dynamic information does not meet the predetermined standard dynamic change condition; or, it is determined that the target application service is To close the service of the target application, it is determined whether the preset application data of the target application after obtaining the target application service is deleted within a preset time period, if the preset application data is not deleted within the preset time period, Then it is determined that the dynamic information does not meet the preset standard dynamic change condition.
  • the device further includes: a first storage module and an output module; the first storage module is configured to determine whether the target application is abnormal according to the request response information returned by the server, If it is determined that the target application is abnormal, the operation log information corresponding to the operation and maintenance instruction is saved; an output module is used to output the alarm information corresponding to the target application in the operation and maintenance management system.
  • the device further includes: a second storage module; a second storage module for saving the collected state information of each application in a block after the state information of each application is collected
  • the configuration module 32 is specifically used to obtain the status information of each application from the blockchain; configure the status information of the various applications obtained from the blockchain Operation and maintenance operation information corresponding to each application.
  • this embodiment also provides a storage medium on which computer-readable instructions are stored.
  • the computer-readable instructions are executed by a processor, the above-mentioned Figure 1 And the operation and maintenance processing method shown in Figure 2.
  • the storage medium involved in this application may be a readable storage medium, or may be referred to as a computer-readable storage medium.
  • the storage medium such as a readable storage medium, may be non-volatile, such as a non-volatile readable storage medium; or, may also be volatile, such as a volatile readable storage medium.
  • the technical solution of this application can be embodied in the form of a software product.
  • the software product can be stored in a non-volatile storage medium (which can be a CD-ROM, U disk, mobile hard disk, etc.), including several
  • the instructions are used to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute the methods in each implementation scenario of the present application.
  • this embodiment also provides a computer device, which may specifically be a personal computer, a notebook computer, or a server.
  • a computer device which may specifically be a personal computer, a notebook computer, or a server.
  • the physical equipment includes a storage medium and a processor; the storage medium is used to store computer-readable instructions; the processor is used to execute computer-readable instructions to implement the above-mentioned operation and maintenance as shown in Figure 1 and Figure 2 Approach.
  • the computer device may also include a user interface, a network interface, a camera, a radio frequency (RF) circuit, a sensor, an audio circuit, a WI-FI module, and so on.
  • the user interface may include a display screen (Display), an input unit such as a keyboard (Keyboard), etc., and the optional user interface may also include a USB interface, a card reader interface, and the like.
  • the optional network interface can include standard wired interface, wireless interface (such as Bluetooth interface, WI-FI interface), etc.
  • the computer device structure provided in this embodiment does not constitute a limitation on the physical device, and may include more or fewer components, or combine certain components, or arrange different components.
  • the storage medium may also include an operating system and a network communication module.
  • the operating system is a program that manages the hardware and software resources of the aforementioned physical devices, and supports the operation of information processing programs and other software and/or programs.
  • the network communication module is used to realize the communication between the various components in the storage medium and the communication with other hardware and software in the physical device.
  • the operation and maintenance operation information corresponding to each application can be pre-configured according to the collected status information of each application, and subsequently when the operation and maintenance instruction of the target application is received, the corresponding configuration according to the target application
  • the operation and maintenance operation information sends an application service request corresponding to the operation and maintenance instruction to the server of the target application, and then can automatically determine whether the target application is abnormal according to the request response information returned by the server.
  • this embodiment can truly realize the one-click management and operation of the middleware component application process, and then can realize the one-click automatic IT operation and maintenance processing.
  • IT operation and maintenance automation it can help improve The operation efficiency of operation and maintenance personnel reduces the repetitive work of operation and maintenance, which can improve IT operation and maintenance efficiency and save IT operation and maintenance costs.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Debugging And Monitoring (AREA)

Abstract

一种运维处理方法、装置及计算机设备,涉及运维技术领域。其中方法包括:首先采集各个应用的状态信息(101);再根据所述各个应用的状态信息向各个应用配置运维操作信息(102);响应于各个应用中的目标应用的运维指令,根据目标应用的运维操作信息,向目标应用的服务器发送与运维指令相应的应用服务请求(103);然后依据目标应用的服务器返回的请求响应信息,确定目标应用是否出现异常(104)。可一键式实现自动化IT运维处理,可提高IT运维效率和节省IT运维成本。此外,还涉及区块链技术,应用状态数据可存储于区块链中,以保证数据私密和安全性。

Description

运维处理方法、装置及计算机设备
本申请要求于2020年11月16日提交中国专利局、申请号为202011278764.9,发明名称为“运维处理方法、装置及计算机设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。
技术领域
本申请涉及运维技术领域,尤其是涉及到一种运维处理方法、装置及计算机设备。
背景技术
随着互联网应用的发展与普及,信息技术(Information Technology,IT)项目的端到端交付,项目周期中的易变性和不确定性,对项目预算及成本投入影响不可预估,目标用户需求的模糊性、网络架构与应用的复杂性所带来的项目营运痛点日渐突出。而运维团队作为端到端交付流水线中至关重要的角色,直面生产多样化的用户及需求,IT运维团队作为生产系统的屏障持续保障着系统的稳定性、可用性、安全性。而在某些领域对于问题处理时效有着严格的要求,系统架构复杂度与业务复杂度不断提升,组件解耦、分布式、容器等技术发展,单靠人工进行生产问题排查、定位与解决,已经无法满足工作时效要求,无论是从用户、管理者、运维工作者等多角度考虑,运维工作机制与技术的提升逐步得到各方的重视,标准化管理、自动化技术、架构优化、过程改进等较多方面推动持续改进。其中自动化技术作为代替人工操作提升运维工作效率的考虑点被广泛研究和应用。
目前,针对IT运维人工处理到自动化模式的升级转换,已作为专题在许多企业的IT部门展开。然而,本申请创造的发明人在研究中发现,传统IT运维很多仍需人工维护操作管理应用后台服务,而这种模式经常被业内人士戏称为“半自动化”的运维模式,运维团队成员在这种模式下虽优于原有传统纯手工式的工作,但被问题推着走的“救火”式被动机制与工具约束下的效率低,导致运维团队工作压力骤增。进而不但影响了IT运维的效率,而且还增加了IT运维的成本。
技术问题
有鉴于此,本申请提供了一种运维处理方法、装置及计算机设备,主要目的在于改善目前传统的IT运维方式会影响IT运维效率和增加IT运维成本的技术问题。
技术解决方案
根据本申请的一个方面,提供了一种运维处理方法,该方法包括:采集各个应用的状态信息;根据所述各个应用的状态信息向所述各个应用配置运维操作信息;响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
根据本申请的另一个方面,提供了一种运维处理装置,该装置包括:采集模块,用于采集各个应用的状态信息;配置模块,用于根据所述各个应用的状态信息向所述各个应用配置运维操作信息;发送模块,用于响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;确定模块,用于依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
根据本申请的又一个方面,提供了一种存储介质,其上存储有计算机可读指令,所述计算机可读指令被处理器执行时实现以下方法:采集各个应用的状态信息;根据所述各个应用的状态信息向所述各个应用配置运维操作信息;响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
根据本申请的再一个方面,提供了一种计算机设备,包括存储介质、处理器及存储在存储介质上并可在处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现以下方法:采集各个应用的状态信息;根据所述各个应用的状态信息向所述各个应用配置运维操作信息;响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
有益效果
与目前传统的IT运维方式相比,本申请可真正实现一键式管理与操作中间件组件应用进程,进而可一键式实现自动化IT运维处理,通过IT运维自动化,能够帮忙提高运维人员的操作效率,降低运维解决重复性工作,从而可提高IT运维效率和节省IT运维成本。
附图说明
图1示出了本申请实施例提供的一种运维处理方法的流程示意图。
图2示出了本申请实施例提供的另一种运维处理方法的流程示意图。
图3示出了本申请实施例提供的一种运维处理装置的结构示意图。
本发明的实施方式
下文中将参考附图并结合实施例来详细说明本申请。需要说明的是,在不冲突的情况下,本申请中的实施例及实施例中的特征可以相互结合。
本申请的技术方案可涉及区块链技术领域。可选的,本申请涉及的数据如状态信息和/或是否异常的确定结果等可存储于数据库中,或者可以存储于区块链中,比如通过区块链分布式存储,本申请不做限定。
针对改善目前传统的IT运维方式会影响IT运维效率和增加IT运维成本的技术问题,本实施例提供了一种运维处理方法,如图1所示,该方法包括以下步骤。
101、采集各个应用的状态信息。
对于本实施例的执行主体可为用于IT运维处理的装置或设备,可配置在运维管理系统侧。应用的状态信息可包括:子系统名、中文名、集群信息、实例信息、应用类型(如Java、Docker、Kafka、Zookeeper、Spark、Hadoop等)、环境信息、主机IP地址等。
在本实施例中,可通过应用数据同步工具,调用相应的功能接口,将IT运维需要监控的各个应用的状态信息,同步到运维管理系统侧,便于后续实现自动化的IT运维处理,具体可执行步骤102至104所示的过程。
102、根据所述各个应用的状态信息向各个应用配置运维操作信息。
其中,运维操作信息可包含针对单个应用的自动化地运维操作内容,如编写相应脚本或命令行数据等,每个应用都可有自己对应的运维操作信息,不同应用的运维操作信息可相同或者不同。
例如,在将IT运维需要监控的各个应用的状态信息,完成同步到运维管理系统侧后,运维人员可在运维管理系统的前端点击编辑按钮,跳转至编辑页面,然后在编辑页面中配置单个应用的自动化运维脚本或命令行数据等。整个过程可做到可视化,便于运维管理,提高了自动化IT运维的效率。
103、响应于各个应用中的目标应用的运维指令,根据目标应用的运维操作信息,向目标应用的服务器发送与运维指令相应的应用服务请求。
目标应用可为IT运维需要监控的各个应用中的任一个,或者是其中特定的应用等。运维指令可由系统定时自动触发输入,或者由运维人员主动输入。例如,运维人员可在运维管理系统前端通过按钮点击事件,进而进行目标应用的操作(如应用启动、停止、重启等),从而自动地向目标应用的服务器发送与运维指令相应的应用服务请求,该方式可将目标应用的运维操作以自动化地形式进行实施。
例如,在接收到目标应用的运维指令时,可解析该运维指令,确定目标应用以及目标应用需要请求的应用服务;根据目标应用对应配置的运维操作信息,获取请求该应用服务所需执行的脚本或命令行数据等来执行,进而向目标应用的服务器发送相应的应用服务请求,其中登录目标应用服务器的用户和密码等已经事先在运维操作信息中编辑完成,因此在IT自动化运维的过程中,可实现自动化地应用用户登录,减少运维人员登录应用服务器操作的风险性,提高运维人员的操作效率。
104、依据目标应用的服务器返回的请求响应信息,确定目标应用是否出现异常。
请求响应信息中可包含与运维指令相应应用请求的请求结果信息。例如,如果目标应用的服务器返回的应用服务与请求的应用服务相同,且是在标准时长内接收到的,并没有出现延时、数据丢失等情况,那么可确定目标应用没有出现异常;而如果目标应用的服务器返回的应用服务与请求的应用服务不同,或出现延时接收、数据丢失等情况,可确定目标应用出现异常。
通过本实施例中的运维处理方法,可根据采集到的各个应用的状态信息,预先配置各个应用对应的运维操作信息,后续可在接收到目标应用的运维指令时,根据目标应用对应配置的运维操作信息,向目标应用的服务器发送与运维指令相应的应用服务请求,进而可依据服务器返回的请求响应信息,自动化确定目标应用是否出现异常。与目前传统的IT运维方式相比,本实施例可真正实现一键式管理与操作中间件组件应用进程,进而可一键式实现自动化IT运维处理,通过IT运维自动化,能够帮忙提高运维人员的操作效率,降低运维解决重复性工作,从而可提高IT运维效率和节省IT运维成本。
进一步的,作为上述实施例具体实施方式的细化和扩展,为了完整说明本实施例中的具体实施过程,提供了另一种运维处理方法,如图2所示,该方法包括以下步骤。
201、采集各个应用的状态信息。
例如,通过Kettel数据同步工具调用UCMDB接口,同步应用状态信息至运维管理系统侧,如包括子系统名、中文名、集群信息、实例信息、应用类型、环境信息、主机IP地址等应用状态信息。
202、将采集到的各个应用的状态信息保存在区块链中。
在根据应用的状态信息进行配置应用对应的运维操作信息之前,为了保证应用状态信息的安全性和私密性,这些应用的状态信息可预先保存在区块链中,如各个应用的状态信息可保存在区块链的一个或多个节点中。
需要说明的是,本实施例所指区块链是分布式数据存储、点对点传输、共识机制、加密算法等计算机技术的新型应用模式。区块链(Blockchain),本质上是一个去中心化的数据库,是一串使用密码学方法相关联产生的数据块,每一个数据块中包含了一批次网络交易的信息,用于验证其信息的有效性(防伪)和生成下一个区块。区块链可以包括区块链底层平台、平台产品服务层以及应用服务层等。
203、从区块链中获取各个应用的状态信息,并根据从区块链中获取到的各个应用的状态信息向各个应用配置运维操作信息。
在接收到运维人员配置应用对应运维操作信息的指令时,可从区块链的节点中获取相应应用的状态信息,然后根据获取到的应用状态信息,配置应用对应的运维操作信息。
可选的,根据各个应用的状态信息向各个应用配置运维操作信息,具体可包括:首先基于各个应用的应用标识(如应用名称、ID号等)、应用实例信息、应用类型(如Java、Docker、Kafka、Zookeeper、Spark、Hadoop等类型)、应用环境信息、应用所属子系统的系统标识(如系统名称、系统编号等)、应用所属主机IP地址、主机所属集群信息(如集群名称、集群节点数量、主从节点情况等),向各个应用配置模拟用户登录信息(如应用的应用用户和应用用户密码等,该应用用户可为模拟用户,用于IT运维测试,该模拟用户的操作行为与真实用户操作行为一致)和模拟用户操作事件信息(如模拟用户对应用的操作命令,包括应用启动命令、应用停止命令、应用重启命令、调用应用某功能的命令等);然后向模拟用户操作事件信息配置预设脚本信息(该预设脚本信息可包含相应的脚本程序、和/或命令行数据等),其中,预设脚本信息被执行时用于将模拟用户操作事件信息相应的应用服务请求发送给各个应用对应的服务器。
例如,在通过Kettel数据同步工具调用UCMDB接口,同步各个应用状态信息数据完成后,运维人员可在运维管理系统的前端点击编辑按钮,跳转至编辑页面,然后在编辑页面中配置单个应用的自动化运维脚本或命令行数据等,进而实现配置维护应用的日常操作命令,如应用的应用用户、应用用户密码、启动命令、停止命令、重启命令、日志路径信息以及运维人员信息等。通过这种可视化的运维操作,便于运维管理,可根据实际运维需求准确进行IT运维工作,提高了自动化IT运维的效率。
以各个应用中的应用A为例进行说明具体的配置过程。在配置应用A对应的模拟用户登录信息时,依据应用A所属主机IP地址和主机所属集群信息,在该集群的节点主机目录中找到该主机IP地址对应的主机文件夹,然后依据应用A所属子系统的系统标识,在该主机文件夹中找到该子系统下的系统文件夹,然后依据应用A的应用标识和应用类型,在系统文件夹中找到该应用类型的应用A的用户管理文件数据,在管理文件数据中创建模拟用户的登录信息,并且应用A的事件管理文件数据中创建该模拟用户有权限执行的操作事件信息,即模拟用户操作事件信息,如模拟用户在应用A中有权限调用的应用实例信息、应用环境信息、和在集群节点中有权限请求的应用服务等。在模拟用户操作事件信息配置完成后,可利用脚本编辑工具,编辑模拟用户操作事件信息对应的预设脚本信息,该预设脚本信息被执行时用于将模拟用户操作事件信息相应的应用服务请求发送给应用A对应的服务器,以获取该应用服务。
进一步的可选的,基于各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向各个应用配置模拟用户登录信息和模拟用户操作事件信息,具体可包括:基于各个应用的应用标识、应用类型、应用实例信息、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向各个应用配置具备非ROOT账号登录权限的模拟用户登录信息,以及配置模拟用户操作事件信息,其中,模拟用户操作事件信息包含的应用操作命令以单命令行方式进行操作,应用操作命令中的文件路径以绝对路径方式进行配置。
例如,模拟用户登录信息中可以非ROOT账号运行登录,模拟用户操作事件信息包含的应用操作命令以单命令行方式进行操作,该应用操作命令中的文件路径以绝对路径方式进行配置。在实际运维场景中,为了保证用户权限最小化,应用进程通常以非ROOT账号运行,应用用户与密码对应应用的启动用户与登录密码。应用的启停命令以单命令行方式进行操作,为了保证命令的可执行率,命令中的文件路径需以绝对路径方式配置。
204、响应于各个应用中的目标应用的运维指令,根据目标应用的运维操作信息,向目标应用的服务器发送与运维指令相应的应用服务请求。
可选的,步骤204具体可包括:首先依据目标应用的运维指令,确定模拟用户对目标应用的目标操作事件信息(如对目标应用的启动、停止、重启、调用应用某功能等操作事件);再获取目标操作事件信息对应的目标预设脚本信息(可从预先配置的预设脚本信息中获取与目标应用对应的、且与目标应用的启动、或停止、或重启、或调用应用某功能等对应的目标预设脚本信息);然后执行目标预设脚本信息,向目标应用的服务器发送与目标操作事件信息相应的应用服务请求。
例如,配置完应用的操作命令后,运维人员可在系统前端通过按钮点击事件从而进行应用的操作(如启动、停止、重启等),该方式将应用操作以自动化的形式,减少运维人员登录服务器操作的风险性,提高运维人员的操作效率。
205、依据服务器返回的请求响应信息,确定目标应用是否出现异常。
例如,当触发应用的如启动、停止、重启等操作后,将在运维管理系统的前端页面显示操作命令的后台日志,实时了解应用的动态信息。
可选的,步骤205具体可包括:根据请求响应信息,判断是否请求到目标应用服务;若未请求到目标应用服务,则确定目标应用出现异常;若请求到目标应用请求,则判断请求到的目标应用服务与运维指令对应所需的目标应用服务是否一致;若判定请求到的目标应用服务与运维指令对应所需的目标应用服务不一致,则确定目标应用出现异常;若判定请求到的目标应用服务与运维指令对应所需的目标应用服务一致,则判断得到目标应用服务后的目标应用的动态信息是否符合预设标准动态变化条件;若判定动态信息符合预设标准动态变化条件,则确定目标应用未出现异常;若判定动态信息不符合预设标准动态变化条件,则确定目标应用出现异常。
例如,如果根据请求响应信息判定请求到的目标应用服务与运维指令对应所需的目标应用服务一致、且得到目标应用服务后的目标应用的动态信息符合预设标准动态变化条件,则确定目标应用未出现异常;若判定未请求到应用服务、或请求到的目标应用服务与运维指令对应所需的目标应用服务不一致、或得到目标应用服务后的目标应用的动态信息不符合预设标准动态变化条件,则确定目标应用出现异常。通过这种可选方式,可准确判别出目标应用是否出现异常,可做到一键式的自动化IT运维,提高IT运维的效率和节省IT运维的成本。
预设标准动态变化条件可根据实际需求预先设定,实例性的,判断得到所述目标应用服务后的目标应用的动态信息是否符合预设标准动态变化条件,具体包括:若目标应用服务为开启预设功能的服务,则判断目标应用在得到目标应用服务后生成的应用数据中是否包含与预设功能的服务开启后应生成的预定应用数据,若不包含预定应用数据,则判定动态信息不符合预设标准动态变化条件;或,判断目标应用服务为关闭目标应用的服务,则判断目标应用在得到目标应用服务后的预置应用数据是否在预设时长内被删除,若预置应用数据在预设时长内未被删除,则判定动态信息不符合预设标准动态变化条件。
例如,应用1在接收到应用服务A后生成的应用数据应该包含一些特定的数据,若未包含这些特定的数据,说明应用服务A实质并未获取成功,进而确定应用1的动态信息不符合预设标准动态变化条件;再例如,应用2在接收到应用服务B后特定的应用数据应该在预设时长内被删除,若特定的应用数据在预设时长内没有被删除,则说明出现异常,确定应用2的动态信息不符合预设标准动态变化条件。
例如,针对应用3发送的停止请求以获取执行停止应用3需求的应用服务,如果请求得到该应用服务、且在执行该应用服务后的预设时长内应用3的运行数据已删除,则确定应用3未出现异常;如果未请求到该应用服务、或请求到的其他应用服务、或请求得到该应用服务,并在执行该应用服务后的预设时长内应用3的运行数据未删除,则确定应用3出现异常。
进一步的,在步骤205之后,本实施例方法还可包括:若确定目标应用出现异常,则保存与运维指令相应的操作日志信息;在运维管理系统中输出目标应用相应的告警信息。
例如,自动化地IT运维过程中,任何的操作记录可将以数据表的形式写入数据库,作为操作的历史记录(操作日志信息),并将历史记录作为运维人员的生产操作记录,以追溯生产的任何操作。在确定某应用出现异常时,可在运维管理系统的前端输出该应用的告警信息,具体可以文字、图片、音频、视频、灯光、振动等形式进行告警提示,以便于运维人员第一时间获知出现异常的应用,便于IT运维。如果出现异常,还可通过记录数据,追溯异常原因等,便于及时找到解决方案信息。
本实施例提供一种自动化可交互地IT运维管理系统,以及其相应的应用方法。该运维管理系统能动态采集应用状态信息,获取应用状态信息,同时根据子系统维度,一键式管理与操作中间件组件应用进程,如Java、Docker、Kafka、Zookeeper、Spark、Hadoop等;同时将操作时间,操作内容,操作人员进行日志留痕,将历史记录作为运维人员的生产操作记录,以追溯生产的任何操作。通过IT运维自动化,能够帮忙提高运维人员的操作效率,降低运维解决重复性工作。
进一步的,作为图1和图2所示方法的具体实现,本实施例提供了一种运维处理装置,如图3所示,该装置包括:采集模块31、配置模块32、发送模块33、确定模块34。采集模块31,用于采集各个应用的状态信息;配置模块32,用于根据所述各个应用的状态信息向所述各个应用配置运维操作信息;发送模块33,用于响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;确定模块34,用于依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
在具体的应用场景中,配置模块32,具体用于基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息;向所述模拟用户操作事件信息配置预设脚本信息,其中,所述预设脚本信息被执行时用于将所述模拟用户操作事件信息相应的应用服务请求发送给所述各个应用对应的服务器。
在具体的应用场景中,配置模块32,具体还用于基于所述各个应用的应用标识、应用类型、应用实例信息、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置具备非ROOT账号登录权限的模拟用户登录信息,以及配置模拟用户操作事件信息,其中,所述模拟用户操作事件信息包含的应用操作命令以单命令行方式进行操作,所述应用操作命令中的文件路径以绝对路径方式进行配置。
在具体的应用场景中,发送模块33,具体用于依据所述运维指令,确定模拟用户对所述目标应用的目标操作事件信息;获取所述目标操作事件信息对应的目标预设脚本信息;执行所述目标预设脚本信息,向所述服务器发送与所述目标操作事件信息相应的应用服务请求。
在具体的应用场景中,确定模块34,具体用于根据所述请求响应信息,判断是否请求到目标应用服务;若未请求到目标应用服务,则确定所述目标应用出现异常;若请求到目标应用请求,则判断请求到的目标应用服务与所述运维指令对应所需的目标应用服务是否一致;若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务不一致,则确定所述目标应用出现异常;若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务一致,则判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件;若判定所述动态信息符合预设标准动态变化条件,则确定所述目标应用未出现异常;若判定所述动态信息不符合预设标准动态变化条件,则确定所述目标应用出现异常。
在具体的应用场景中,确定模块34,具体还用于若所述目标应用服务为开启预设功能的服务,则判断所述目标应用在得到所述目标应用服务后生成的应用数据中是否包含与所述预设功能的服务开启后应生成的预定应用数据,若不包含所述预定应用数据,则判定所述动态信息不符合预设标准动态变化条件;或,判断所述目标应用服务为关闭目标应用的服务,则判断所述目标应用在得到所述目标应用服务后的预置应用数据是否在预设时长内被删除,若所述预置应用数据在预设时长内未被删除,则判定所述动态信息不符合预设标准动态变化条件。
在具体的应用场景中,本装置还包括:第一保存模块和输出模块;第一保存模块,用于在所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常之后,若确定所述目标应用出现异常,则保存与所述运维指令相应的操作日志信息;输出模块,用于在运维管理系统中输出所述目标应用相应的告警信息。
在具体的应用场景中,本装置还包括:第二保存模块;第二保存模块,用于在所述采集各个应用的状态信息之后,将采集到的所述各个应用的状态信息保存在区块链中;相应的,配置模块32,具体还用于从所述区块链中获取所述各个应用的状态信息;根据从所述区块链中获取到的所述各个应用的状态信息,配置所述各个应用对应的运维操作信息。
需要说明的是,本实施例提供的一种运维处理装置所涉及各功能单元的其它相应描述,可以参考图1和图2中的对应描述,在此不再赘述。
基于上述如图1和图2所示方法,相应的,本实施例还提供了一种存储介质,其上存储有计算机可读指令,该计算机可读指令被处理器执行时实现上述如图1和图2所示的运维处理方法。
可选的,本申请涉及的存储介质可以是可读存储介质,或者可以称为计算机可读存储介质。该存储介质如可读存储介质可以是非易失性的,如非易失性可读存储介质;或者,也可以是易失性的,如易失性可读存储介质。
基于这样的理解,本申请的技术方案可以以软件产品的形式体现出来,该软件产品可以存储在一个非易失性存储介质(可以是CD-ROM,U盘,移动硬盘等)中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施场景的方法。
基于上述如图1、图2所示的方法,以及图3所示的虚拟装置实施例,为了实现上述目的,本实施例还提供了一种计算机设备,具体可以为个人计算机、笔记本电脑、服务器、网络设备等,该实体设备包括存储介质和处理器;存储介质,用于存储计算机可读指令;处理器,用于执行计算机可读指令以实现上述如图1和图2所示的运维处理方法。
可选的,该计算机设备还可以包括用户接口、网络接口、摄像头、射频(Radio Frequency,RF)电路,传感器、音频电路、WI-FI模块等等。用户接口可以包括显示屏(Display)、输入单元比如键盘(Keyboard)等,可选用户接口还可以包括USB接口、读卡器接口等。网络接口可选的可以包括标准的有线接口、无线接口(如蓝牙接口、WI-FI接口)等。
本领域技术人员可以理解,本实施例提供的计算机设备结构并不构成对该实体设备的限定,可以包括更多或更少的部件,或者组合某些部件,或者不同的部件布置。
存储介质中还可以包括操作系统、网络通信模块。操作系统是管理上述实体设备硬件和软件资源的程序,支持信息处理程序以及其它软件和/或程序的运行。网络通信模块用于实现存储介质内部各组件之间的通信,以及与该实体设备中其它硬件和软件之间通信。
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到本申请可以借助软件加必要的通用硬件平台的方式来实现,也可以通过硬件实现。通过应用本实施例的技术方案,可根据采集到的各个应用的状态信息,预先配置各个应用对应的运维操作信息,后续可在接收到目标应用的运维指令时,根据目标应用对应配置的运维操作信息,向目标应用的服务器发送与运维指令相应的应用服务请求,进而可依据服务器返回的请求响应信息,自动化确定目标应用是否出现异常。与目前传统的IT运维方式相比,本实施例可真正实现一键式管理与操作中间件组件应用进程,进而可一键式实现自动化IT运维处理,通过IT运维自动化,能够帮忙提高运维人员的操作效率,降低运维解决重复性工作,从而可提高IT运维效率和节省IT运维成本。
本领域技术人员可以理解附图只是一个优选实施场景的示意图,附图中的模块或流程并不一定是实施本申请所必须的。本领域技术人员可以理解实施场景中的装置中的模块可以按照实施场景描述进行分布于实施场景的装置中,也可以进行相应变化位于不同于本实施场景的一个或多个装置中。上述实施场景的模块可以合并为一个模块,也可以进一步拆分成多个子模块。
上述本申请序号仅仅为了描述,不代表实施场景的优劣。以上公开的仅为本申请的几个具体实施场景,但是,本申请并非局限于此,任何本领域的技术人员能思之的变化都应落入本申请的保护范围。

Claims (20)

  1. 一种运维处理方法,包括:
    采集各个应用的状态信息;
    根据所述各个应用的状态信息向所述各个应用配置运维操作信息;
    响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;
    依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
  2. 根据权利要求1所述的方法,其中,所述根据所述各个应用的状态信息向所述各个应用配置运维操作信息,具体包括:
    基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息;
    向所述模拟用户操作事件信息配置预设脚本信息,其中,所述预设脚本信息被执行时用于将所述模拟用户操作事件信息相应的应用服务请求发送给所述各个应用对应的服务器。
  3. 根据权利要求2所述的方法,其中,所述基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息,具体包括:
    基于所述各个应用的应用标识、应用类型、应用实例信息、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置具备非ROOT账号登录权限的模拟用户登录信息,以及配置模拟用户操作事件信息,其中,所述模拟用户操作事件信息包含的应用操作命令以单命令行方式进行操作,所述应用操作命令中的文件路径以绝对路径方式进行配置。
  4. 根据权利要求3所述的方法,其中,所述响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求,具体包括:
    依据所述运维指令,确定模拟用户对所述目标应用的目标操作事件信息;
    获取所述目标操作事件信息对应的目标预设脚本信息;
    执行所述目标预设脚本信息,向所述服务器发送与所述目标操作事件信息相应的应用服务请求。
  5. 根据权利要求4所述的方法,其中,所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常,具体包括:
    根据所述请求响应信息,判断是否请求到目标应用服务;
    若未请求到目标应用服务,则确定所述目标应用出现异常;
    若请求到目标应用请求,则判断请求到的目标应用服务与所述运维指令对应所需的目标应用服务是否一致;
    若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务不一致,则确定所述目标应用出现异常;
    若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务一致,则判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件;
    若判定所述动态信息符合预设标准动态变化条件,则确定所述目标应用未出现异常;
    若判定所述动态信息不符合预设标准动态变化条件,则确定所述目标应用出现异常。
  6. 根据权利要求5所述的方法,其中,所述判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件,具体包括:
    若所述目标应用服务为开启预设功能的服务,则判断所述目标应用在得到所述目标应用服务后生成的应用数据中是否包含与所述预设功能的服务开启后应生成的预定应用数据,若不包含所述预定应用数据,则判定所述动态信息不符合预设标准动态变化条件;或,
    判断所述目标应用服务为关闭目标应用的服务,则判断所述目标应用在得到所述目标应用服务后的预置应用数据是否在预设时长内被删除,若所述预置应用数据在预设时长内未被删除,则判定所述动态信息不符合预设标准动态变化条件;
    在所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常之后,所述方法还包括:
    若确定所述目标应用出现异常,则保存与所述运维指令相应的操作日志信息;
    在运维管理系统中输出所述目标应用相应的告警信息。
  7. 根据权利要求1所述的方法,其中,在所述采集各个应用的状态信息之后,所述方法还包括:
    将采集到的所述各个应用的状态信息保存在区块链中;
    所述根据所述各个应用的状态信息向所述各个应用配置运维操作信息,具体包括:
    从所述区块链中获取所述各个应用的状态信息;
    根据从所述区块链中获取到的所述各个应用的状态信息向所述各个应用配置运维操作信息。
  8. 一种运维处理装置,包括:
    采集模块,用于采集各个应用的状态信息;
    配置模块,用于根据所述各个应用的状态信息向所述各个应用配置运维操作信息;
    发送模块,用于响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;
    确定模块,用于依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
  9. 一种存储介质,其上存储有计算机可读指令,其中,所述计算机可读指令被处理器执行时实现以下方法:
    采集各个应用的状态信息;
    根据所述各个应用的状态信息向所述各个应用配置运维操作信息;
    响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;
    依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
  10. 根据权利要求9所述的存储介质,其中,执行所述根据所述各个应用的状态信息向所述各个应用配置运维操作信息,包括:
    基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息;
    向所述模拟用户操作事件信息配置预设脚本信息,其中,所述预设脚本信息被执行时用于将所述模拟用户操作事件信息相应的应用服务请求发送给所述各个应用对应的服务器。
  11. 根据权利要求10所述的存储介质,其中,执行所述基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息,包括:
    基于所述各个应用的应用标识、应用类型、应用实例信息、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置具备非ROOT账号登录权限的模拟用户登录信息,以及配置模拟用户操作事件信息,其中,所述模拟用户操作事件信息包含的应用操作命令以单命令行方式进行操作,所述应用操作命令中的文件路径以绝对路径方式进行配置。
  12. 根据权利要求11所述的存储介质,其中,执行所述响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求,包括:
    依据所述运维指令,确定模拟用户对所述目标应用的目标操作事件信息;
    获取所述目标操作事件信息对应的目标预设脚本信息;
    执行所述目标预设脚本信息,向所述服务器发送与所述目标操作事件信息相应的应用服务请求。
  13. 根据权利要求12所述的存储介质,其中,执行所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常,包括:
    根据所述请求响应信息,判断是否请求到目标应用服务;
    若未请求到目标应用服务,则确定所述目标应用出现异常;
    若请求到目标应用请求,则判断请求到的目标应用服务与所述运维指令对应所需的目标应用服务是否一致;
    若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务不一致,则确定所述目标应用出现异常;
    若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务一致,则判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件;
    若判定所述动态信息符合预设标准动态变化条件,则确定所述目标应用未出现异常;
    若判定所述动态信息不符合预设标准动态变化条件,则确定所述目标应用出现异常。
  14. 根据权利要求13所述的存储介质,其中,执行所述判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件,包括:
    若所述目标应用服务为开启预设功能的服务,则判断所述目标应用在得到所述目标应用服务后生成的应用数据中是否包含与所述预设功能的服务开启后应生成的预定应用数据,若不包含所述预定应用数据,则判定所述动态信息不符合预设标准动态变化条件;或,
    判断所述目标应用服务为关闭目标应用的服务,则判断所述目标应用在得到所述目标应用服务后的预置应用数据是否在预设时长内被删除,若所述预置应用数据在预设时长内未被删除,则判定所述动态信息不符合预设标准动态变化条件;
    在所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常之后,所述计算机可读指令被处理器执行时还用于实现:
    若确定所述目标应用出现异常,则保存与所述运维指令相应的操作日志信息;
    在运维管理系统中输出所述目标应用相应的告警信息。
  15. 一种计算机设备,包括存储介质、处理器及存储在存储介质上并可在处理器上运行的计算机可读指令,其中,所述处理器执行所述计算机可读指令时实现以下方法:
    采集各个应用的状态信息;
    根据所述各个应用的状态信息向所述各个应用配置运维操作信息;
    响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求;
    依据所述目标应用的服务器返回的请求响应信息,确定所述目标应用是否出现异常。
  16. 根据权利要求15所述的计算机设备,其中,执行所述根据所述各个应用的状态信息向所述各个应用配置运维操作信息,包括:
    基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息;
    向所述模拟用户操作事件信息配置预设脚本信息,其中,所述预设脚本信息被执行时用于将所述模拟用户操作事件信息相应的应用服务请求发送给所述各个应用对应的服务器。
  17. 根据权利要求16所述的计算机设备,其中,执行所述基于所述各个应用的应用标识、应用实例信息、应用类型、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置模拟用户登录信息和模拟用户操作事件信息,包括:
    基于所述各个应用的应用标识、应用类型、应用实例信息、应用环境信息、应用所属子系统的系统标识、应用所属主机IP地址、主机所属集群信息向所述各个应用配置具备非ROOT账号登录权限的模拟用户登录信息,以及配置模拟用户操作事件信息,其中,所述模拟用户操作事件信息包含的应用操作命令以单命令行方式进行操作,所述应用操作命令中的文件路径以绝对路径方式进行配置。
  18. 根据权利要求17所述的计算机设备,其中,执行所述响应于所述各个应用中的目标应用的运维指令,根据所述目标应用的运维操作信息,向所述目标应用的服务器发送与所述运维指令相应的应用服务请求,包括:
    依据所述运维指令,确定模拟用户对所述目标应用的目标操作事件信息;
    获取所述目标操作事件信息对应的目标预设脚本信息;
    执行所述目标预设脚本信息,向所述服务器发送与所述目标操作事件信息相应的应用服务请求。
  19. 根据权利要求18所述的计算机设备,其中,执行所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常,包括:
    根据所述请求响应信息,判断是否请求到目标应用服务;
    若未请求到目标应用服务,则确定所述目标应用出现异常;
    若请求到目标应用请求,则判断请求到的目标应用服务与所述运维指令对应所需的目标应用服务是否一致;
    若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务不一致,则确定所述目标应用出现异常;
    若判定请求到的目标应用服务与所述运维指令对应所需的目标应用服务一致,则判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件;
    若判定所述动态信息符合预设标准动态变化条件,则确定所述目标应用未出现异常;
    若判定所述动态信息不符合预设标准动态变化条件,则确定所述目标应用出现异常。
  20. 根据权利要求19所述的计算机设备,其中,执行所述判断得到所述目标应用服务后的所述目标应用的动态信息是否符合预设标准动态变化条件,包括:
    若所述目标应用服务为开启预设功能的服务,则判断所述目标应用在得到所述目标应用服务后生成的应用数据中是否包含与所述预设功能的服务开启后应生成的预定应用数据,若不包含所述预定应用数据,则判定所述动态信息不符合预设标准动态变化条件;或,
    判断所述目标应用服务为关闭目标应用的服务,则判断所述目标应用在得到所述目标应用服务后的预置应用数据是否在预设时长内被删除,若所述预置应用数据在预设时长内未被删除,则判定所述动态信息不符合预设标准动态变化条件;
    在所述依据所述服务器返回的请求响应信息,确定所述目标应用是否出现异常之后,所述处理器还用于执行:
    若确定所述目标应用出现异常,则保存与所述运维指令相应的操作日志信息;
    在运维管理系统中输出所述目标应用相应的告警信息。
PCT/CN2021/083003 2020-11-16 2021-03-25 运维处理方法、装置及计算机设备 WO2021203979A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011278764.9 2020-11-16
CN202011278764.9A CN112380093A (zh) 2020-11-16 2020-11-16 运维处理方法、装置及计算机设备

Publications (1)

Publication Number Publication Date
WO2021203979A1 true WO2021203979A1 (zh) 2021-10-14

Family

ID=74584668

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/083003 WO2021203979A1 (zh) 2020-11-16 2021-03-25 运维处理方法、装置及计算机设备

Country Status (2)

Country Link
CN (1) CN112380093A (zh)
WO (1) WO2021203979A1 (zh)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113867301A (zh) * 2021-10-20 2021-12-31 成都大宏立机器股份有限公司 一种砂石生产线的一键启停控制方法
CN113920767A (zh) * 2021-10-22 2022-01-11 南京智慧交通信息股份有限公司 运维报警的方法、系统、装置以及计算机可读存储介质
CN114143092A (zh) * 2021-12-01 2022-03-04 江苏亨通工控安全研究院有限公司 一种运维功能集中管理平台、用户终端、系统及搭建方法
CN114154591A (zh) * 2021-12-14 2022-03-08 南方电网深圳数字电网研究院有限公司 基于多源信息的设备状态智能预警方法及装置
CN114338407A (zh) * 2022-03-09 2022-04-12 深圳市蔚壹科技有限公司 一种用于企业信息安全的运维管理方法
CN114615254A (zh) * 2022-03-25 2022-06-10 医渡云(北京)技术有限公司 远程连接方法、装置及系统、存储介质、电子设备
CN115686907A (zh) * 2022-10-31 2023-02-03 超聚变数字技术有限公司 一种信息的配置方法及计算装置
CN115766862A (zh) * 2022-11-16 2023-03-07 中国工商银行股份有限公司 容器运维方法、装置、计算机设备和存储介质
CN117312042A (zh) * 2023-12-01 2023-12-29 之江实验室 计算机集群的运维方法和计算机集群的运维系统

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112380093A (zh) * 2020-11-16 2021-02-19 平安科技(深圳)有限公司 运维处理方法、装置及计算机设备
CN113992491B (zh) * 2021-09-29 2024-04-02 中通服科信信息技术有限公司 应用程序服务器群运维管理系统、方法及装置
CN114168663A (zh) * 2021-11-12 2022-03-11 珠海大横琴科技发展有限公司 一种基于运维平台的数据处理方法
CN114186570A (zh) * 2021-12-16 2022-03-15 中国工商银行股份有限公司 集成读卡器设备运维方法、装置、计算机设备和存储介质

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104572416A (zh) * 2014-12-29 2015-04-29 北京锐安科技有限公司 一种运维数据的处理方法及装置
CN107220100A (zh) * 2016-03-22 2017-09-29 中国移动(深圳)有限公司 一种开发运维方法、装置及云计算PaaS平台
CN110196731A (zh) * 2018-10-29 2019-09-03 腾讯科技(深圳)有限公司 一种运维系统、方法及存储介质
US20190296960A1 (en) * 2018-03-22 2019-09-26 Servicenow, Inc. System and method for event processing order guarantee
CN111338646A (zh) * 2020-05-20 2020-06-26 腾讯科技(深圳)有限公司 一种微服务架构的管理方法以及相关装置
CN112380093A (zh) * 2020-11-16 2021-02-19 平安科技(深圳)有限公司 运维处理方法、装置及计算机设备

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104572416A (zh) * 2014-12-29 2015-04-29 北京锐安科技有限公司 一种运维数据的处理方法及装置
CN107220100A (zh) * 2016-03-22 2017-09-29 中国移动(深圳)有限公司 一种开发运维方法、装置及云计算PaaS平台
US20190296960A1 (en) * 2018-03-22 2019-09-26 Servicenow, Inc. System and method for event processing order guarantee
CN110196731A (zh) * 2018-10-29 2019-09-03 腾讯科技(深圳)有限公司 一种运维系统、方法及存储介质
CN111338646A (zh) * 2020-05-20 2020-06-26 腾讯科技(深圳)有限公司 一种微服务架构的管理方法以及相关装置
CN112380093A (zh) * 2020-11-16 2021-02-19 平安科技(深圳)有限公司 运维处理方法、装置及计算机设备

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113867301B (zh) * 2021-10-20 2024-03-01 成都大宏立机器股份有限公司 一种砂石生产线的一键启停控制方法
CN113867301A (zh) * 2021-10-20 2021-12-31 成都大宏立机器股份有限公司 一种砂石生产线的一键启停控制方法
CN113920767B (zh) * 2021-10-22 2023-02-24 南京智慧交通信息股份有限公司 运维报警的方法、系统、装置以及计算机可读存储介质
CN113920767A (zh) * 2021-10-22 2022-01-11 南京智慧交通信息股份有限公司 运维报警的方法、系统、装置以及计算机可读存储介质
CN114143092A (zh) * 2021-12-01 2022-03-04 江苏亨通工控安全研究院有限公司 一种运维功能集中管理平台、用户终端、系统及搭建方法
CN114154591A (zh) * 2021-12-14 2022-03-08 南方电网深圳数字电网研究院有限公司 基于多源信息的设备状态智能预警方法及装置
CN114338407A (zh) * 2022-03-09 2022-04-12 深圳市蔚壹科技有限公司 一种用于企业信息安全的运维管理方法
CN114615254B (zh) * 2022-03-25 2023-09-29 医渡云(北京)技术有限公司 远程连接方法、装置及系统、存储介质、电子设备
CN114615254A (zh) * 2022-03-25 2022-06-10 医渡云(北京)技术有限公司 远程连接方法、装置及系统、存储介质、电子设备
CN115686907A (zh) * 2022-10-31 2023-02-03 超聚变数字技术有限公司 一种信息的配置方法及计算装置
CN115686907B (zh) * 2022-10-31 2023-10-10 超聚变数字技术有限公司 一种信息的配置方法及计算装置
CN115766862A (zh) * 2022-11-16 2023-03-07 中国工商银行股份有限公司 容器运维方法、装置、计算机设备和存储介质
CN117312042A (zh) * 2023-12-01 2023-12-29 之江实验室 计算机集群的运维方法和计算机集群的运维系统

Also Published As

Publication number Publication date
CN112380093A (zh) 2021-02-19

Similar Documents

Publication Publication Date Title
WO2021203979A1 (zh) 运维处理方法、装置及计算机设备
CN109495308B (zh) 一种基于管理信息系统的自动化运维系统
CN108600029B (zh) 一种配置文件更新方法、装置、终端设备及存储介质
US11036598B2 (en) Notification mechanism for disaster recovery events
WO2016127756A1 (zh) 集群弹性部署的方法和管理系统
CN111277432B (zh) 配置信息更新方法、装置、电子设备及存储介质
KR101327477B1 (ko) 통합 관제 및 제어 관리 시스템
US10911299B2 (en) Multiuser device staging
KR100865015B1 (ko) 실시간 통합 관리정보 데이터 변환 및 모니터링 장치 및 그방법
US20200351190A1 (en) Virtual Probes
US20220239735A1 (en) State management for device-driven management workflows
WO2012088905A1 (zh) 一种通讯网络系统及通讯设备的巡检子系统和巡检方法
US11057464B1 (en) Synchronization of data between local and remote computing environment buffers
CN107247648B (zh) 基于Docker实现远程项目系统监管的方法、装置及系统
KR20160136489A (ko) 클라우드 서비스를 위한 가상화 기반 자원 관리 방법
CN106911648B (zh) 一种环境隔离方法及设备
WO2019051948A1 (zh) 监控数据的处理方法、设备、服务器及存储介质
KR101357135B1 (ko) 로그 정보 수집 장치
WO2021167659A1 (en) Systems and methods of monitoring and controlling remote assets
CN113037545A (zh) 网络仿真方法、装置、设备和存储介质
US10122602B1 (en) Distributed system infrastructure testing
CN110661851A (zh) 数据交换方法和装置
CN114465867A (zh) 服务器的维护方法、装置、存储介质及处理器
WO2018188607A1 (zh) 流处理方法及装置
US11513823B2 (en) Chat interface for resource management

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21784375

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21784375

Country of ref document: EP

Kind code of ref document: A1