WO2020024369A1 - Method and device for configuring operation and maintenance alarm template based on private cloud - Google Patents

Method and device for configuring operation and maintenance alarm template based on private cloud Download PDF

Info

Publication number
WO2020024369A1
WO2020024369A1 PCT/CN2018/104975 CN2018104975W WO2020024369A1 WO 2020024369 A1 WO2020024369 A1 WO 2020024369A1 CN 2018104975 W CN2018104975 W CN 2018104975W WO 2020024369 A1 WO2020024369 A1 WO 2020024369A1
Authority
WO
WIPO (PCT)
Prior art keywords
template
alarm
parameters
public
alert
Prior art date
Application number
PCT/CN2018/104975
Other languages
French (fr)
Chinese (zh)
Inventor
林水明
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020024369A1 publication Critical patent/WO2020024369A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14Network analysis or design
    • H04L41/145Network analysis or design involving simulating, designing, planning or modelling of a network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0805Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
    • H04L43/0817Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning

Definitions

  • the present application belongs to the field of computer technology, and particularly relates to a method and device for configuring an operation and maintenance alarm template based on a private cloud.
  • the monitoring system is the most important part of the entire operation and maintenance link, and even the entire product life cycle, and plays a very important role.
  • the monitoring system can comprehensively monitor and alarm the server, operating system, middleware, and applications. It can promptly detect failures in advance and provide informative data afterwards to track and locate problems.
  • Open-Falcon and Zabbix are commonly used open source operation and maintenance monitoring tools. Because of Open-Falcon's powerful and flexible data collection, humanized alarm settings, efficient alarm policy management, and high availability, it is favored, but the existing Open-Falcon data model's alarm template is universal and installed. Machines that use the operation and maintenance monitoring tool Open-Falcon may not be able to provide early warning when the system is monitored using a common alarm template.
  • a method and a device for configuring an O & M alarm template based on a private cloud are used to solve the problem that in the prior art, a machine installed with an O & M monitoring tool may fail to monitor a system using a general alarm template. Early warning issues.
  • a first aspect of the embodiments of the present application provides a method for configuring an O & M alarm template based on a private cloud, including:
  • attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
  • a sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
  • a second aspect of the embodiments of the present application provides a device, including:
  • a request information obtaining unit configured to obtain request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
  • a parameter obtaining unit configured to obtain attribute parameters and obtain early-warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template;
  • a configuration unit is configured to create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
  • a third aspect of the embodiments of the present application provides a device, including a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, where the processor executes the computer-readable instructions. The following steps are implemented when the instruction is read:
  • attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
  • a sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
  • a fourth aspect of the embodiments of the present application provides a computer-readable storage medium.
  • the computer-readable storage medium stores computer-readable instructions. When the computer-readable instructions are executed by a processor, the following steps are implemented:
  • attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
  • a sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
  • a sub-template of a public alarm template is created according to the attribute parameters by obtaining attribute parameters and early-warning parameters to be configured, and a sub-template of the common alarm template is configured based on the obtained early-warning parameters, thereby obtaining a personalized alarm template.
  • Monitoring your own running status through a personalized alarm template can improve the real-time nature of the warning and the accuracy of the alarm.
  • FIG. 1 is an implementation flowchart of a method for configuring an O & M alarm template based on a private cloud provided by an embodiment of the present application
  • FIG. 2 is a specific implementation flowchart of S103 in a method for configuring an O & M alarm template based on a private cloud according to an embodiment of the present application;
  • FIG. 3 is a structural block diagram of a device according to an embodiment of the present application.
  • FIG. 4 is a schematic diagram of a device according to another embodiment of the present application.
  • the method for configuring an alarm template in the embodiment of the present application is implemented based on a private cloud.
  • the private cloud (Private Clouds) are built for a single customer use, thus providing the most effective control over data, security, and quality of service.
  • FIG. 1 is an implementation flowchart of a method for configuring an O & M alarm template based on a private cloud provided by an embodiment of the present application.
  • a method for configuring an alarm template based on a private cloud is performed by a device that requires operation and maintenance monitoring.
  • An operation and maintenance monitoring tool is installed in the device.
  • the device that requires operation and maintenance monitoring includes, but is not limited to, servers and network devices.
  • Network devices include, but are not limited to, switches, firewall devices, and load balancing devices.
  • the method for configuring an alarm template based on a private cloud as shown in the figure may include:
  • S101 Acquire request information for requesting configuration of an alarm template.
  • the request information includes a type identifier of a public alarm template to be retrieved.
  • the device can obtain the request information for requesting the configuration of the alarm template when detecting that the user selects the type of the public alarm template to be called and activates the function of configuring the alarm template through the interactive interface for configuring the alarm template.
  • the device may also obtain request information for requesting the configuration of the alarm template when it is detected that the user triggers an operation or instruction for requesting the configuration of the alarm template. Wherein, when it is detected that the user manipulates a key for requesting the configuration of the alarm template, it is recognized that the user triggers an operation for requesting the configuration of the alarm template.
  • a public alarm template refers to a general alarm template.
  • the type identifier of the public alarm template is used to identify the type of the public alarm template to be called.
  • the public alarm template to be retrieved refers to the preset public alarm template that the execution subject needs to use in the operation and maintenance process.
  • the public alarm template may include at least two of the public alarm template of the host class, the public alarm template of the network device class, and the public alarm template of the application class.
  • the host refers to a server.
  • the public alarm template of the host class is used to collect the running data of the host, so as to monitor the running status of the host, so that an alarm can be issued when a host abnormal operation is detected.
  • the public alarm template of the network device class is used to collect the operating data of the network device, so as to monitor the operation status of the network device, so that an alarm can be issued when the abnormal operation of the network device is detected.
  • the public alarm template of the application class is used to collect the running data of the installed application in the server, so as to monitor the running status of the application, so that an alarm can be generated when an abnormal operation of the application is detected.
  • S102 Obtain attribute parameters and warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template.
  • the attribute parameters corresponding to each device are different, and the early-warning parameters to be configured for each device may be the same or different, which is not limited here.
  • the device may obtain the attribute parameter matching the type identifier in the request information according to the association relationship between the type identifier and the attribute parameter to be obtained, and obtain the alert parameter to be configured.
  • the attribute parameters are attribute parameters of the execution body itself.
  • the warning parameters to be configured can be set according to the type identification of the public alarm template that is called in advance and stored in the local database, or the device can obtain it from the configuration requirement information, which can be included in the request information sent by the terminal. There is no restriction here.
  • S102 may specifically be: obtaining a unique identifier and the type identifier according to the type identifier in the request information. Information about the type of the operating system, and obtain the warning parameters to be configured.
  • the attribute parameter includes a unique identifier of the execution body itself and type information of the operating system.
  • the unique identifier may be a device name.
  • Warning parameters include one or any combination of the following: Central Processing Unit (Central Processing Unit / Processor (CPU) utilization, CPU IO port waiting time, disk utilization, inode utilization, and memory utilization.
  • S102 may specifically be obtained according to the type identifier in the request information.
  • the attribute parameters include manufacturer information, device type, and device model.
  • the device types of network equipment include controllers, switches, firewalls, and load balancers.
  • the same network equipment produced by different manufacturers may have different names for the same functional indicators, and the operating parameters may also be different; the same network equipment may have different operating parameters, depending on its model. Therefore, configuring the alarm template according to the network device manufacturer information, device type, and device model can accurately monitor the operating status of each early-warning parameter.
  • the warning parameters to be configured include one or any combination of the following: the alarm level of the system log, the danger level of the system log, the emergency level of the system log, and the Error level.
  • the alert parameters to be configured include one or any combination of the following: CPU usage, number of sessions, fan status, high availability status, inbound traffic from the firewall, outbound traffic from the firewall, Xinhui And memory usage.
  • inbound and outbound traffic can be set with different alarm conditions according to the capacity of the firewall.
  • the warning parameters to be configured may further include: an alarm level of the system log, a danger level of the system log, an emergency event level of the system log, and an error level of the system log.
  • the warning parameters to be configured include one or any combination of the following: active connections, CPU usage, memory usage, new sessions, pings, networking requests per second, The number of virtual hosts connected, the number of virtual hosts connected, the alarm level of the system log, the danger level of the system log, the emergency level of the system log, and the error level of the system log.
  • the warning parameters to be configured include one or any combination of the following: CPU usage, inbound traffic of the switch, outbound traffic of the switch, number of ports, memory usage, ping, system log Alarm level, hazard level of the system log, incident level of the system log, and error level of the system log.
  • the early warning parameters to be configured may also include one or any combination of the following: high memory utilization, low memory utilization, MAC address drift, memory utilization, power State, temperature.
  • the type identifier when used to identify a public alarm template of an application, the running status of itself can be accurately monitored according to the installed application in the device, and S102 may specifically be obtained according to the type identifier in the request information.
  • the classification identifier to which the installed application belongs, the identifier of the instance included in the application, and the alert parameter to be configured are obtained.
  • the attribute parameter includes the classification identifier to which the installed application belongs and the identifier of the instance included in the application. In other embodiments, it may also be the identifier of the instance included in the installed application.
  • the alert parameters to be configured may include the number of threads.
  • the warning parameters to be configured may include one or any combination of the following: the number of connections, CPU usage, disk utilization, host name, IO thread status, memory utilization, read-only thread status, master-slave synchronization interval, database thread status.
  • MySQL can be used as a separate application in a client-server network environment, or it can be embedded into other software as a library.
  • S103 Create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
  • the device retrieves the public alarm template corresponding to the type identifier contained in the request information according to the obtained attribute parameters, creates a sub-template of the retrieved public alarm template according to the attribute parameters, and according to the acquired alarm parameters that need to be configured for the created sub-templates Template for configuration.
  • the public template to be called is the parent template.
  • the configuration of the created sub-template according to the early-warning parameters to be configured may be specifically: configuring alarm indicators and / or alarm thresholds of each early-warning parameter in the created sub-template.
  • the alarm indicators may include, but are not limited to, the maximum number of alarms, the alarm level, and the effective time.
  • Each alarm level can be set to notify the user.
  • the method of notifying the user may include, but is not limited to, email and SMS. The way to notify users of different alarm levels can be the same or different.
  • the attribute parameter includes a unique identifier of the execution subject itself and type information of the operating system.
  • the device retrieves a public alarm template that matches the type of its operating system according to the type of the operating system, creates a sub-template of the public alarm template to be retrieved according to the unique identifier, and configures the created sub-template according to the alert parameters to be configured.
  • Warning parameters include one or any combination of the following: Central Processing Unit (Central Processing Unit / Processor (CPU) utilization, CPU IO port waiting time, disk utilization, inode utilization, and memory utilization.
  • the host name (unique identifier) of the execution body itself is CNLF011026, and its operating system is linux.
  • the public alarm template (parent template) to be called is common_LINUX
  • the child template created is default_tpl_LINUX_CNLF011026.
  • the default_tpl_LINUX_CNLF011026 does not match the alarm policy.
  • the alarm policy that takes effect on CNLF011026 is the alarm policy in common_LINUX. If the policy with the same alarm parameters (or monitoring items) is set in default_tpl_LINUX_CNLF011026, the alarm policy in default_tpl_LINUX_CNLF011026 will override the alarm policy in common_LINUX. Not only can achieve universal configuration, each host can also have personalized configuration.
  • the alarm index or alarm threshold of each early-warning parameter of the host can be set according to the type of the file system.
  • the category to which the installed application class belongs is: webloic
  • the instance name is instance name 24money-commonStg5SF2707 @ cnsh231149
  • the public alert template (parent template) to be called is common_Weblogic, and then according to the instance contained in the application
  • the child template created by the name is default_tpl_Weblogic_24money-commonStg5SF2707 @ cnsh231149.
  • the installed application is MySQL
  • the host name (unique identifier) is CNLF011026.
  • the public alarm template (parent template) to be called is common_MYSQL
  • the child template created based on the host name is common_default_tpl_MYSQL_CNSZ044501.
  • FIG. 2 is a specific implementation flowchart of S103 in a method for configuring an O & M alarm template based on a private cloud provided by an embodiment of the present application.
  • S103 may include S1031 to S1033. details as follows:
  • S1031 Retrieve the public alarm template and create a sub-template of the public alarm template according to the attribute parameters.
  • the device retrieves the public alarm template corresponding to the type identifier contained in the request information according to the obtained attribute parameters, and creates a sub-template of the retrieved public alarm template according to the attribute parameters.
  • S1032 Determine an alarm indicator corresponding to each of the early-warning parameters.
  • the attribute parameters include the unique identifier of the execution subject itself and the type information of the operating system. Warning parameters include one or any combination of the following: Central Processing Unit (Central Processing Unit / Processor (CPU) utilization, CPU IO port waiting time, disk utilization, inode utilization, and memory utilization.
  • CPU Central Processing Unit
  • the alarm strategy of one type of host is as follows:
  • the attribute parameters include manufacturer information, device type, and device model.
  • the warning parameters to be configured include one or any combination of the following: the alert level of the system log, the danger level of the system log, the incident level of the system log, and the error level of the system log .
  • the alert parameters to be configured include one or any combination of the following: CPU usage, number of sessions, fan status, high availability status, inbound traffic from the firewall, outbound traffic from the firewall, Xinhui And memory usage.
  • the alarm policy of one type of firewall is as follows:
  • inbound and outbound traffic can be set with different alarm conditions according to the capacity of the firewall.
  • the warning parameters to be configured may further include: an alarm level of the system log, a danger level of the system log, an emergency event level of the system log, and an error level of the system log.
  • the warning parameters to be configured include one or any combination of the following: active connections, CPU usage, memory usage, new sessions, pings, networking requests per second, The number of virtual hosts connected, the number of virtual hosts connected, the alarm level of the system log, the danger level of the system log, the emergency level of the system log, and the error level of the system log.
  • the alarm strategy of one type of load balancer is as follows:
  • the warning parameters to be configured include one or any combination of the following: CPU usage, inbound traffic of the switch, outbound traffic of the switch, number of ports, memory usage, ping, system log Alarm level, hazard level of the system log, incident level of the system log, and error level of the system log.
  • the alarm policy of one type of switch is as follows:
  • the early warning parameters to be configured may also include one or any combination of the following: high memory utilization, low memory utilization, MAC address drift, memory utilization, power State, temperature.
  • the device When the type identifier is used to identify a public alarm template of an application, the device may be a server, a network device, or the like.
  • the attribute parameter includes the classification identifier to which the installed application belongs and the identifier of the instance included in the application. In other embodiments, it may also be the identifier of the instance included in the installed application.
  • the alert parameters to be configured may include one or any combination of the following: the number of connections, CPU usage, disk utilization, host name, IO thread status, memory utilization, read-only thread status, master-slave synchronization interval, database thread status, etc.
  • MySQL alarm strategy is as follows:
  • S1033 Configure a sub-template of the common alarm template according to the early-warning parameter and the alarm indicator corresponding to each of the early-warning parameters.
  • a sub-template of a public alarm template is created according to the attribute parameters by obtaining attribute parameters and early-warning parameters to be configured, and a sub-template of the common alarm template is configured based on the obtained early-warning parameters, thereby obtaining a personalized alarm template.
  • Monitoring your own running status through a personalized alarm template can improve the real-time nature of the warning and the accuracy of the alarm.
  • FIG. 3 is a structural block diagram of a device provided by an embodiment of the present application.
  • the device includes, but is not limited to, a server and a network device.
  • the network device includes, but is not limited to, a switch, a firewall device, a load balancing device, and the like.
  • Each unit included in the device is configured to execute steps in the embodiments corresponding to FIG. 1 to FIG. 2.
  • the device 3 includes:
  • the request information obtaining unit 310 is configured to obtain request information for requesting configuration of an alarm template, where the request information includes a type identifier of a public alarm template to be called;
  • a parameter obtaining unit 320 configured to obtain attribute parameters and obtain early-warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template;
  • a configuration unit 330 is configured to create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
  • configuration unit 330 specifically includes:
  • a creating unit configured to retrieve the public alert template and create a sub-template of the public alert template according to the attribute parameter
  • a determining unit configured to determine an alarm indicator corresponding to each of the early-warning parameters
  • a sub-template configuration unit is configured to configure a sub-template of the public alarm template according to the early-warning parameter and the alarm indicator corresponding to each of the early-warning parameters.
  • the parameter obtaining unit 320 is specifically configured to: obtain the unique identifier and the type information of the operating system according to the type identifier in the request information, and obtain the type identifier. Pre-alarm parameters to be configured.
  • the parameter obtaining unit 320 is specifically configured to: obtain the manufacturer information and device model according to the type identifier in the request information, and obtain all Describe the pre-alarm parameters to be configured.
  • the parameter obtaining unit 320 is specifically configured to obtain, according to the type identifier in the request information, the classification identifier to which the installed application belongs and the application contains Identification of the instance, and obtaining the warning parameter to be configured; wherein the instance is used to provide a service.
  • FIG. 4 is a schematic diagram of a device according to another embodiment of the present application.
  • the device 4 of this embodiment includes a processor 40, a memory 41, and computer-readable instructions 42 stored in the memory 41 and executable on the processor 40, such as a control program of the device. .
  • the processor 40 executes the computer-readable instructions 42
  • the steps in the embodiment of the method for configuring the operation and maintenance alarm template based on the private cloud of each device are implemented, for example, S101 to S103 shown in FIG. 1.
  • the processor 40 executes the computer-readable instructions 42
  • the functions of the units in the foregoing device embodiments are implemented, for example, the functions of the units 310 to 330 shown in FIG. 3.
  • the computer-readable instructions 42 may be divided into one or more units, and the one or more units are stored in the memory 41 and executed by the processor 40 to complete the present application.
  • the one or more units may be instruction segments of a series of computer-readable instructions capable of performing a specific function, and the instruction segments are used to describe an execution process of the computer-readable instructions 42 in the device 4.
  • the computer-readable instructions 42 may be divided into a request information acquisition unit, a parameter acquisition unit, and a configuration unit, and the specific functions of each unit are as described above.
  • the device may include, but is not limited to, a processor 40 and a memory 41.
  • FIG. 4 is only an example of the device 4, and does not constitute a limitation on the device 4. It may include more or fewer parts than shown in the figure, or combine some parts, or different parts, such as
  • the device may further include an input-output device, a network access device, a bus, and the like.
  • the processor 40 may be a central processing unit (Central Processing Unit (CPU), or other general-purpose processors, digital signal processors (DSPs), and application-specific integrated circuits (Applications) Specific Integrated Circuit (ASIC), off-the-shelf Programmable Gate Array (FPGA), or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • CPU Central Processing Unit
  • DSP digital signal processor
  • ASIC application-specific integrated circuits
  • FPGA off-the-shelf Programmable Gate Array
  • a general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
  • the memory 41 may be an internal storage unit of the device 4, such as a hard disk or a memory of the device 4.
  • the memory 41 may also be an external storage device of the device 4, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card provided on the device 4, Flash card, etc. Further, the memory 41 may include both an internal storage unit of the device 4 and an external storage device.
  • the memory 41 is configured to store the computer-readable instructions and other programs and data required by the device.
  • the memory 41 may also be used to temporarily store data that has been output or is to be output.

Abstract

A method and device for configuring an operation and maintenance alarm template based on a private cloud. The method comprises steps of: obtaining request information for requesting configuration of an alarm template (S101); obtaining an attribute parameter according to the request information and obtaining an early warning parameter to be configured (S102); and creating a sub-template of a public alarm template according to the attribute parameter, and configuring the sub-template of the public alarm template on the basis of the early warning parameter (S103). According to the method, personalized alarm templates can be created, and operation states are monitored by means of the personalized alarm templates; the early warning timeliness and alarm accuracy are improved.

Description

一种基于私有云的配置运维告警模板的方法及设备Method and equipment for configuring operation and maintenance alarm template based on private cloud
本申请要求于2018年08月01日提交中国专利局、申请号为201810863343.9、发明名称为“一种基于私有云的配置运维告警模板的方法及设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of all Chinese patent applications filed on August 01, 2018 with the Chinese Patent Office, application number 201810863343.9, and invention name "A method and device for configuring an O & M alert template based on a private cloud", all of which The contents are incorporated herein by reference.
技术领域Technical field
本申请属于计算机技术领域,尤其涉及一种基于私有云的配置运维告警模板的方法及设备。The present application belongs to the field of computer technology, and particularly relates to a method and device for configuring an operation and maintenance alarm template based on a private cloud.
背景技术Background technique
监控系统是整个运维环节,乃至整个产品生命周期中最重要的一环,起着非常重要的作用。监控系统可以对服务器、操作系统、中间件、应用进行全面的监控及报警,可以在事前及时预警发现故障,事后提供翔实的数据用于追查定位问题。The monitoring system is the most important part of the entire operation and maintenance link, and even the entire product life cycle, and plays a very important role. The monitoring system can comprehensively monitor and alarm the server, operating system, middleware, and applications. It can promptly detect failures in advance and provide informative data afterwards to track and locate problems.
现有技术中,常用的开源运维监控工具有Open-Falcon、Zabbix。由于Open-Falcon强大灵活的数据采集、人性化的告警设置、高效率的告警策略管理、高可用等特点备受青睐,但是现有的Open-Falcon的数据模型中的告警模板是通用的,安装了运维监控工具Open-Falcon的机器在使用通用的告警模板对系统进行监控时,可能出现无法及时预警。In the prior art, commonly used open source operation and maintenance monitoring tools include Open-Falcon and Zabbix. Because of Open-Falcon's powerful and flexible data collection, humanized alarm settings, efficient alarm policy management, and high availability, it is favored, but the existing Open-Falcon data model's alarm template is universal and installed. Machines that use the operation and maintenance monitoring tool Open-Falcon may not be able to provide early warning when the system is monitored using a common alarm template.
技术问题technical problem
本申请实施例一种基于私有云的配置运维告警模板的方法及设备,以解决现有技术中,安装了运维监控工具的机器在使用通用的告警模板对系统进行监控时,可能出现无法及时预警的问题。In the embodiment of the present application, a method and a device for configuring an O & M alarm template based on a private cloud are used to solve the problem that in the prior art, a machine installed with an O & M monitoring tool may fail to monitor a system using a general alarm template. Early warning issues.
技术解决方案Technical solutions
本申请实施例的第一方面提供了一种基于私有云的配置运维告警模板的方法,包括:A first aspect of the embodiments of the present application provides a method for configuring an O & M alarm template based on a private cloud, including:
获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;Obtaining request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;Obtaining attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
本申请实施例的第二方面提供了一种设备,包括:A second aspect of the embodiments of the present application provides a device, including:
请求信息获取单元,用于获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;A request information obtaining unit, configured to obtain request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
参数获取单元,用于根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;A parameter obtaining unit, configured to obtain attribute parameters and obtain early-warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template;
配置单元,用于根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A configuration unit is configured to create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
本申请实施例的第三方面提供了一种设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现以下步骤:A third aspect of the embodiments of the present application provides a device, including a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, where the processor executes the computer-readable instructions. The following steps are implemented when the instruction is read:
获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;Obtaining request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;Obtaining attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
本申请实施例的第四方面提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被处理器执行时实现以下步骤:A fourth aspect of the embodiments of the present application provides a computer-readable storage medium. The computer-readable storage medium stores computer-readable instructions. When the computer-readable instructions are executed by a processor, the following steps are implemented:
获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;Obtaining request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;Obtaining attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
有益效果Beneficial effect
本申请实施例,通过获取属性参数以及待配置的预警参数,从而根据属性参数创建公共告警模板的子模板,基于获取到的预警参数配置公共告警模板的子模板,进而得到个性化的告警模板,通过个性化的告警模板监控自身运行状态,能够提高预警的实时性以及告警的准确性。In the embodiment of the present application, a sub-template of a public alarm template is created according to the attribute parameters by obtaining attribute parameters and early-warning parameters to be configured, and a sub-template of the common alarm template is configured based on the obtained early-warning parameters, thereby obtaining a personalized alarm template. Monitoring your own running status through a personalized alarm template can improve the real-time nature of the warning and the accuracy of the alarm.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本申请一实施例提供的一种基于私有云的配置运维告警模板的方法的实现流程图;FIG. 1 is an implementation flowchart of a method for configuring an O & M alarm template based on a private cloud provided by an embodiment of the present application; FIG.
图2是本申请实施例提供的一种基于私有云的配置运维告警模板的方法中S103的具体实现流程图;FIG. 2 is a specific implementation flowchart of S103 in a method for configuring an O & M alarm template based on a private cloud according to an embodiment of the present application; FIG.
图3是本申请一实施例提供的一种设备的结构框图;3 is a structural block diagram of a device according to an embodiment of the present application;
图4是本申请另一实施例提供的一种设备的示意图。FIG. 4 is a schematic diagram of a device according to another embodiment of the present application.
本发明的实施方式Embodiments of the invention
以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本申请实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本申请。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本申请的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as a specific system structure and technology are provided in order to thoroughly understand the embodiments of the present application. However, it should be clear to a person skilled in the art that the present application can also be implemented in other embodiments without these specific details. In other cases, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary details.
本申请实施例中的配置告警模板的方法基于私有云实现,私有云(Private Clouds)是为一个客户单独使用而构建的,因而提供对数据、安全性和服务质量的最有效控制。The method for configuring an alarm template in the embodiment of the present application is implemented based on a private cloud. The private cloud (Private Clouds) are built for a single customer use, thus providing the most effective control over data, security, and quality of service.
请参见图1,图1是本申请实施例提供的一种基于私有云的配置运维告警模板的方法的实现流程图。本实施例中基于私有云的配置告警模板的方法的执行主体为需要进行运维监控的设备,该设备内安装有运维监控工具,需要进行运维监控的设备包括但不限于服务器、网络设备,网络设备包括但不限于交换机、防火墙设备、负载均衡设备等。如图所示的基于私有云的配置告警模板的方法可包括:Please refer to FIG. 1, which is an implementation flowchart of a method for configuring an O & M alarm template based on a private cloud provided by an embodiment of the present application. In this embodiment, a method for configuring an alarm template based on a private cloud is performed by a device that requires operation and maintenance monitoring. An operation and maintenance monitoring tool is installed in the device. The device that requires operation and maintenance monitoring includes, but is not limited to, servers and network devices. Network devices include, but are not limited to, switches, firewall devices, and load balancing devices. The method for configuring an alarm template based on a private cloud as shown in the figure may include:
S101:获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识。S101: Acquire request information for requesting configuration of an alarm template. The request information includes a type identifier of a public alarm template to be retrieved.
设备可以在检测到用户通过配置告警模板的交互界面,选择待调用的公共告警模板的类型且启动配置告警模板的功能时,获取用于请求配置告警模板的请求信息。公共告警模板的类型为至少两类。The device can obtain the request information for requesting the configuration of the alarm template when detecting that the user selects the type of the public alarm template to be called and activates the function of configuring the alarm template through the interactive interface for configuring the alarm template. There are at least two types of public alarm templates.
设备也可以在检测到用户触发用于请求配置告警模板的操作或指令时,获取用于请求配置告警模板的请求信息。其中,在检测到用户操控用于请求配置告警模板的按键时,识别为已检测到用户触发用于请求配置告警模板的操作。The device may also obtain request information for requesting the configuration of the alarm template when it is detected that the user triggers an operation or instruction for requesting the configuration of the alarm template. Wherein, when it is detected that the user manipulates a key for requesting the configuration of the alarm template, it is recognized that the user triggers an operation for requesting the configuration of the alarm template.
公共告警模板是指通用的告警模板。公共告警模板的类型标识用于标识待调用的公共告警模板的类型,待调取的公共告警模板是指执行主体在运维过程中需要用到的预设的公共告警模板。公共告警模板可以包括主机类的公共告警模板、网络设备类的公共告警模板以及应用类的公共告警模板中的至少两个。A public alarm template refers to a general alarm template. The type identifier of the public alarm template is used to identify the type of the public alarm template to be called. The public alarm template to be retrieved refers to the preset public alarm template that the execution subject needs to use in the operation and maintenance process. The public alarm template may include at least two of the public alarm template of the host class, the public alarm template of the network device class, and the public alarm template of the application class.
其中,在本实施例中,主机是指服务器。主机类的公共告警模板用于采集主机的运行数据,从而监控主机的运行情况,从而能够在检测到主机运行异常时进行告警。In this embodiment, the host refers to a server. The public alarm template of the host class is used to collect the running data of the host, so as to monitor the running status of the host, so that an alarm can be issued when a host abnormal operation is detected.
网络设备类的公共告警模板用于采集网络设备的运行数据,从而监控网络设备的运行情况,从而能够在检测到网络设备运行异常时进行告警。The public alarm template of the network device class is used to collect the operating data of the network device, so as to monitor the operation status of the network device, so that an alarm can be issued when the abnormal operation of the network device is detected.
应用类的公共告警模板用于采集服务器中已安装的应用的运行数据,从而监控该应用的运行情况,从而能够在检测到应用运行异常时进行告警。The public alarm template of the application class is used to collect the running data of the installed application in the server, so as to monitor the running status of the application, so that an alarm can be generated when an abnormal operation of the application is detected.
S102:根据所述请求信息获取属性参数以及待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板。S102: Obtain attribute parameters and warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template.
其中,每个设备对应的属性参数各不相同,每个设备待配置的预警参数可以相同,也可以不同,此处不做限制。The attribute parameters corresponding to each device are different, and the early-warning parameters to be configured for each device may be the same or different, which is not limited here.
设备可以根据类型标识与待获取的属性参数之间的关联关系,获取与请求信息中的类型标识匹配的属性参数,以及获取待配置的预警参数。属性参数为执行主体自身的属性参数。待配置的预警参数可以是预先根据需要调取的公共告警模板的类型标识设置的并存储在本地数据库,也可以是设备从配置需求信息中获取,配置需求信息可以包含在终端发送的请求信息中,此处不做限制。The device may obtain the attribute parameter matching the type identifier in the request information according to the association relationship between the type identifier and the attribute parameter to be obtained, and obtain the alert parameter to be configured. The attribute parameters are attribute parameters of the execution body itself. The warning parameters to be configured can be set according to the type identification of the public alarm template that is called in advance and stored in the local database, or the device can obtain it from the configuration requirement information, which can be included in the request information sent by the terminal. There is no restriction here.
进一步地,当设备为服务器,所述类型标识用于标识主机的公共告警模板时,为了准确监控服务器的运行状态,S102可以具体为:根据所述请求信息中的所述类型标识获取唯一标识和操作系统的类型信息,以及获取待配置的预警参数。Further, when the device is a server and the type identifier is used to identify a public alarm template of the host, in order to accurately monitor the running status of the server, S102 may specifically be: obtaining a unique identifier and the type identifier according to the type identifier in the request information. Information about the type of the operating system, and obtain the warning parameters to be configured.
此时,属性参数包括执行主体自身的唯一标识和操作系统的类型信息。其中,唯一标识可以是设备名称。预警参数包括以下一种或至少两种的任意组合:中央处理器(Central Processing Unit / Processor,CPU)使用率、CPU的IO端口等待时间、磁盘使用率、索引节点(inode)使用率以及内存使用率。At this time, the attribute parameter includes a unique identifier of the execution body itself and type information of the operating system. The unique identifier may be a device name. Warning parameters include one or any combination of the following: Central Processing Unit (Central Processing Unit / Processor (CPU) utilization, CPU IO port waiting time, disk utilization, inode utilization, and memory utilization.
进一步地,当设备为网络设备,所述类型标识用于标识网络设备的公共告警模板时,为了准确监控网络设备的运行状态,S102可以具体为:根据所述请求信息中的所述类型标识获取所述生产厂商信息、设备类型以及设备型号,以及获取所述待配置的预警参数。Further, when the device is a network device and the type identifier is used to identify a public alarm template of the network device, in order to accurately monitor the running status of the network device, S102 may specifically be obtained according to the type identifier in the request information. The manufacturer information, the device type, and the device model, and obtaining the warning parameters to be configured.
此时,属性参数包括生产厂商信息、设备类型以及设备型号。网络设备的设备类型包括控制器、交换机、防火墙、负载均衡器。对于同一种网络设备而言,不同的生产厂商生产的同一种网络设备,其同一功能指标的名称可能不相同,工作参数也可能不同;同一种网络设备其型号不同,工作参数也可能有差异,因此,根据网络设备的生产厂商信息、设备类型以及设备型号配置告警模板能够对各预警参数进行准确监控其运行状态。At this time, the attribute parameters include manufacturer information, device type, and device model. The device types of network equipment include controllers, switches, firewalls, and load balancers. For the same type of network equipment, the same network equipment produced by different manufacturers may have different names for the same functional indicators, and the operating parameters may also be different; the same network equipment may have different operating parameters, depending on its model. Therefore, configuring the alarm template according to the network device manufacturer information, device type, and device model can accurately monitor the operating status of each early-warning parameter.
其中,当网络设备为控制器时,待配置的预警参数包括以下一种或至少两种的任意组合:系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。When the network device is a controller, the warning parameters to be configured include one or any combination of the following: the alarm level of the system log, the danger level of the system log, the emergency level of the system log, and the Error level.
当网络设备为防火墙时,待配置的预警参数包括以下一种或至少两种的任意组合: CPU使用率、会话数、风扇状态、高可用状态、防火墙的入流量、防火墙的出流量、新会化率以及内存使用率。When the network device is a firewall, the alert parameters to be configured include one or any combination of the following: CPU usage, number of sessions, fan status, high availability status, inbound traffic from the firewall, outbound traffic from the firewall, Xinhui And memory usage.
其中,入流量和出流量可根据防火墙的容量的不同设置不同的报警条件。Among them, inbound and outbound traffic can be set with different alarm conditions according to the capacity of the firewall.
可以理解的是,当网络设备为防火墙时,待配置的预警参数还可以包括:系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。It can be understood that when the network device is a firewall, the warning parameters to be configured may further include: an alarm level of the system log, a danger level of the system log, an emergency event level of the system log, and an error level of the system log.
当网络设备为负载均衡器时,待配置的预警参数包括以下一种或至少两种的任意组合:活跃连接数、 CPU使用率、内存使用率、新会话数、ping、每秒联网请求数、接入的虚拟主机数、接出的虚拟主机数、系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。When the network device is a load balancer, the warning parameters to be configured include one or any combination of the following: active connections, CPU usage, memory usage, new sessions, pings, networking requests per second, The number of virtual hosts connected, the number of virtual hosts connected, the alarm level of the system log, the danger level of the system log, the emergency level of the system log, and the error level of the system log.
当网络设备为交换机时,待配置的预警参数包括以下一种或至少两种的任意组合: CPU使用率、交换机的入流量、交换机的出流量、端口数、内存使用率、ping、系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。When the network device is a switch, the warning parameters to be configured include one or any combination of the following: CPU usage, inbound traffic of the switch, outbound traffic of the switch, number of ports, memory usage, ping, system log Alarm level, hazard level of the system log, incident level of the system log, and error level of the system log.
可以理解的是,针对不同类型的交换机,待配置的预警参数还可以包括以下一种或至少两种的任意组合:高内存利用率、低内存利用率、MAC地址漂移量、内存利用率、电源状态、温度。It can be understood that for different types of switches, the early warning parameters to be configured may also include one or any combination of the following: high memory utilization, low memory utilization, MAC address drift, memory utilization, power State, temperature.
进一步地,当所述类型标识用于标识应用的公共告警模板时,可根据设备内已安装的应用准确监控自身的运行状态,S102可以具体为:根据所述请求信息中的所述类型标识获取已安装的应用所属的分类标识和所述应用包含的实例的标识,以及获取所述待配置的预警参数。Further, when the type identifier is used to identify a public alarm template of an application, the running status of itself can be accurately monitored according to the installed application in the device, and S102 may specifically be obtained according to the type identifier in the request information. The classification identifier to which the installed application belongs, the identifier of the instance included in the application, and the alert parameter to be configured are obtained.
此时,属性参数包括已安装的应用所属的分类标识和所述应用包含的实例的标识,在其他实施例中,还可以是已安装的应用所包含的实例的标识。待配置的预警参数可以包括线程数。At this time, the attribute parameter includes the classification identifier to which the installed application belongs and the identifier of the instance included in the application. In other embodiments, it may also be the identifier of the instance included in the installed application. The alert parameters to be configured may include the number of threads.
假设,已安装的应用的名称为关系型数据管理系统MySQL或者已安装的应用包含的实例的标识对应MySQL时,待配置的预警参数可以包括以下一种或至少两种的任意组合:连接数量、CPU使用率、磁盘利用率、主机名称、IO线程状态、内存利用率、只读线程状态、主从同步时间间隔、数据库线程状态。Assume that when the name of the installed application is relational data management system MySQL or the identifier of the instance included in the installed application corresponds to MySQL, the warning parameters to be configured may include one or any combination of the following: the number of connections, CPU usage, disk utilization, host name, IO thread status, memory utilization, read-only thread status, master-slave synchronization interval, database thread status.
其中,MySQL既能够作为一个单独的应用程序应用在客户端服务器网络环境中,也能够作为一个库而嵌入到其他的软件中。Among them, MySQL can be used as a separate application in a client-server network environment, or it can be embedded into other software as a library.
S103:根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。S103: Create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
设备根据获取到的属性参数调取与请求信息包含的类型标识对应的公共告警模板,根据属性参数创建调取的公共告警模板的子模板,并根据获取到的需要配置的预警参数对创建的子模板进行配置。待调取的公共模板为父模板。The device retrieves the public alarm template corresponding to the type identifier contained in the request information according to the obtained attribute parameters, creates a sub-template of the retrieved public alarm template according to the attribute parameters, and according to the acquired alarm parameters that need to be configured for the created sub-templates Template for configuration. The public template to be called is the parent template.
根据待配置的预警参数对创建的子模板进行配置可以具体为:配置创建的子模板中的各预警参数的告警指标和/或告警阈值。告警指标可以包括但不限于最大报警次数、报警级别、生效时间,每种报警级别可设置通知用户的方式,通知用户的方式可以包括但不限于邮件、短信。不同的报警级别所对应的通知用户的方式可以相同,也可以不同。The configuration of the created sub-template according to the early-warning parameters to be configured may be specifically: configuring alarm indicators and / or alarm thresholds of each early-warning parameter in the created sub-template. The alarm indicators may include, but are not limited to, the maximum number of alarms, the alarm level, and the effective time. Each alarm level can be set to notify the user. The method of notifying the user may include, but is not limited to, email and SMS. The way to notify users of different alarm levels can be the same or different.
其中,当设备为服务器,所述类型标识用于标识主机的公共告警模板时,此时,属性参数包括执行主体自身的唯一标识和操作系统的类型信息。设备根据操作系统类型调取与其操作系统的类型相匹配的公共告警模板,并根据唯一标识创建待调取的公共告警模板的子模板,根据待配置的预警参数对创建的子模板进行配置。预警参数包括以下一种或至少两种的任意组合:中央处理器(Central Processing Unit / Processor,CPU)使用率、CPU的IO端口等待时间、磁盘使用率、索引节点(inode)使用率以及内存使用率。When the device is a server and the type identifier is used to identify a public alarm template of the host, at this time, the attribute parameter includes a unique identifier of the execution subject itself and type information of the operating system. The device retrieves a public alarm template that matches the type of its operating system according to the type of the operating system, creates a sub-template of the public alarm template to be retrieved according to the unique identifier, and configures the created sub-template according to the alert parameters to be configured. Warning parameters include one or any combination of the following: Central Processing Unit (Central Processing Unit / Processor (CPU) utilization, CPU IO port waiting time, disk utilization, inode utilization, and memory utilization.
例如,执行主体自身的主机名(唯一标识)是CNLF011026,其操作系统为linux,那么,待调取的公共告警模板(父模板)为common_LINUX,创建的子模板为default_tpl_LINUX_CNLF011026。默认在default_tpl_LINUX_CNLF011026不配告警策略,对CNLF011026生效的告警策略就是common_LINUX里的告警策略;如果在default_tpl_LINUX_CNLF011026中配相同预警参数(或监控项)的策略,default_tpl_LINUX_CNLF011026的告警策略就会覆盖common_LINUX中的告警策略,这样既能实现通用配置,每台主机又可以有个性化的配置。For example, the host name (unique identifier) of the execution body itself is CNLF011026, and its operating system is linux. Then, the public alarm template (parent template) to be called is common_LINUX, and the child template created is default_tpl_LINUX_CNLF011026. By default, the default_tpl_LINUX_CNLF011026 does not match the alarm policy. The alarm policy that takes effect on CNLF011026 is the alarm policy in common_LINUX. If the policy with the same alarm parameters (or monitoring items) is set in default_tpl_LINUX_CNLF011026, the alarm policy in default_tpl_LINUX_CNLF011026 will override the alarm policy in common_LINUX. Not only can achieve universal configuration, each host can also have personalized configuration.
其中,由于每个文件系统的监控需求可能不尽相同,主机的各预警参数的告警指标或告警阈值可以根据文件系统的类型进行设置。Among them, since the monitoring requirements of each file system may be different, the alarm index or alarm threshold of each early-warning parameter of the host can be set according to the type of the file system.
再例如,已安装的应用类所属的分类为:webloic,实例名称为实例名为24money-commonStg5SF2707@cnsh231149,那么,待调取的公共告警模板(父模板)为common_Weblogic,然后根据该应用包含的实例的名称创建的子模板为default_tpl_Weblogic_24money-commonStg5SF2707@cnsh231149。For another example, the category to which the installed application class belongs is: webloic, the instance name is instance name 24money-commonStg5SF2707 @ cnsh231149, then the public alert template (parent template) to be called is common_Weblogic, and then according to the instance contained in the application The child template created by the name is default_tpl_Weblogic_24money-commonStg5SF2707 @ cnsh231149.
再例如,已安装的应用为MySQL,主机名(唯一标识)是CNLF011026,那么,待调取的公共告警模板(父模板)为common_MYSQL,根据主机名称创建的子模板为common_default_tpl_MYSQL_CNSZ044501。For another example, the installed application is MySQL, and the host name (unique identifier) is CNLF011026. Then, the public alarm template (parent template) to be called is common_MYSQL, and the child template created based on the host name is common_default_tpl_MYSQL_CNSZ044501.
进一步地,为了提高告警的准确性,可以为不同类型的设备的每个预警参数设置告警指标,以便在运维过程中使用配置好的运维告警模板对设备进行监控时,如果出现告警,可通过告警的指标对应的告警参数准确定位故障。请一并参阅图2,图2是本申请实施例提供的一种基于私有云的配置运维告警模板的方法中S103的具体实现流程图。S103可以包括S1031~S1033。具体如下:Further, in order to improve the accuracy of the alarm, an alarm indicator can be set for each early-warning parameter of a different type of equipment, so that when an equipment maintenance alarm template is used to monitor the equipment during the operation and maintenance process, if an alarm occurs, Accurately locate the fault through the alarm parameters corresponding to the alarm indicators. Please refer to FIG. 2 together. FIG. 2 is a specific implementation flowchart of S103 in a method for configuring an O & M alarm template based on a private cloud provided by an embodiment of the present application. S103 may include S1031 to S1033. details as follows:
S1031:根据所述属性参数调取所述公共告警模板并创建所述公共告警模板的子模板。S1031: Retrieve the public alarm template and create a sub-template of the public alarm template according to the attribute parameters.
设备根据获取到的属性参数调取与请求信息包含的类型标识对应的公共告警模板,根据属性参数创建调取的公共告警模板的子模板。The device retrieves the public alarm template corresponding to the type identifier contained in the request information according to the obtained attribute parameters, and creates a sub-template of the retrieved public alarm template according to the attribute parameters.
S1032:确定每个所述预警参数各自对应的告警指标。S1032: Determine an alarm indicator corresponding to each of the early-warning parameters.
当设备为服务器,所述类型标识用于标识主机的公共告警模板时,此时,属性参数包括执行主体自身的唯一标识和操作系统的类型信息。预警参数包括以下一种或至少两种的任意组合:中央处理器(Central Processing Unit / Processor,CPU)使用率、CPU的IO端口等待时间、磁盘使用率、索引节点(inode)使用率以及内存使用率。When the device is a server and the type identifier is used to identify the public alarm template of the host, at this time, the attribute parameters include the unique identifier of the execution subject itself and the type information of the operating system. Warning parameters include one or any combination of the following: Central Processing Unit (Central Processing Unit / Processor (CPU) utilization, CPU IO port waiting time, disk utilization, inode utilization, and memory utilization.
示例性的,一种类型的主机的报警策略如下:Exemplarily, the alarm strategy of one type of host is as follows:
预警参数 Early warning parameters 报警条件 Alarm conditions 最大报警次数 Maximum number of alarms 报警级别 Alarm level 生效时间 Effective time
CPU使用率 CPU usage >80 > 80 1 1 4 4 全天 All day
CPU的IO端口等待时间 CPU IO port wait time >50 > 50 1 1 4 4 全天 All day
磁盘使用率 Disk usage ≥90 ≥90 1 1 3 3 全天 All day
索引节点使用率 Inode usage ≥90 ≥90 1 1 3 3 全天 All day
内存使用率 Memory usage >90 > 90 1 1 3 3 全天 All day
当设备为网络设备,所述类型标识用于标识网络设备的公共告警模板时,此时,属性参数包括生产厂商信息、设备类型以及设备型号。When the device is a network device and the type identifier is used to identify a public alarm template of the network device, at this time, the attribute parameters include manufacturer information, device type, and device model.
当网络设备为控制器时,待配置的预警参数包括以下一种或至少两种的任意组合:系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。When the network device is a controller, the warning parameters to be configured include one or any combination of the following: the alert level of the system log, the danger level of the system log, the incident level of the system log, and the error level of the system log .
当网络设备为防火墙时,待配置的预警参数包括以下一种或至少两种的任意组合: CPU使用率、会话数、风扇状态、高可用状态、防火墙的入流量、防火墙的出流量、新会化率以及内存使用率。When the network device is a firewall, the alert parameters to be configured include one or any combination of the following: CPU usage, number of sessions, fan status, high availability status, inbound traffic from the firewall, outbound traffic from the firewall, Xinhui And memory usage.
示例性的,一种类型的防火墙的报警策略如下:Exemplarily, the alarm policy of one type of firewall is as follows:
预警参数 Early warning parameters 报警条件 Alarm conditions 最大报警次数 Maximum number of alarms 报警级别 Alarm level 生效时间 Effective time
CPU使用率 CPU usage ≥70 ≥70 1 1 2 2 全天 All day
会话数 Sessions ≥500000 ≥500000 1 1 3 3 全天 All day
风扇状态 Fan status !=0 !! = 0 1 1 3 3 全天 All day
高可用状态 High availability =15 = 15 1 1 3 3 全天 All day
防火墙的入流量 Inbound traffic from the firewall ≥800000000 ≥800000000 1 1 3 3 全天 All day
防火墙的出流量 Outbound traffic from the firewall ≥8000000 ≥8000000 1 1 3 3 全天 All day
新会化率 New meeting rate ≥80 ≥80 1 1 3 3 全天 All day
内存使用率 Memory usage ≥80 ≥80 1 1 3 3 全天 All day
其中,入流量和出流量可根据防火墙的容量的不同设置不同的报警条件。Among them, inbound and outbound traffic can be set with different alarm conditions according to the capacity of the firewall.
可以理解的是,当网络设备为防火墙时,待配置的预警参数还可以包括:系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。It can be understood that when the network device is a firewall, the warning parameters to be configured may further include: an alarm level of the system log, a danger level of the system log, an emergency event level of the system log, and an error level of the system log.
当网络设备为负载均衡器时,待配置的预警参数包括以下一种或至少两种的任意组合:活跃连接数、 CPU使用率、内存使用率、新会话数、ping、每秒联网请求数、接入的虚拟主机数、接出的虚拟主机数、系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。When the network device is a load balancer, the warning parameters to be configured include one or any combination of the following: active connections, CPU usage, memory usage, new sessions, pings, networking requests per second, The number of virtual hosts connected, the number of virtual hosts connected, the alarm level of the system log, the danger level of the system log, the emergency level of the system log, and the error level of the system log.
示例性的,一种类型的负载均衡器的报警策略如下:Exemplarily, the alarm strategy of one type of load balancer is as follows:
预警参数 Early warning parameters 报警条件 Alarm conditions 最大报警次数 Maximum number of alarms 报警级别 Alarm level 生效时间 Effective time
活跃连接数 Active connections ≥3000000 ≥3000000 1 1 4 4 全天 All day
CPU使用率 CPU usage ≥80 ≥80 1 1 3 3 全天 All day
内存使用率 Memory usage ≥80 ≥80 1 1 3 3 全天 All day
新会话数 New sessions ≥45000 ≥45000 1 1 4 4 全天 All day
ping ping ≥5000 ≥5000 1 1 3 3 全天 All day
每秒联网请求数 Networking requests per second ≥1600 ≥1600 1 1 2 2 全天 All day
接入的虚拟主机数 Number of connected virtual hosts ≥600000000 ≥600000000 1 1 4 4 全天 All day
接出的虚拟主机数 Number of virtual hosts connected ≥600000000 ≥600000000 1 1 4 4 全天 All day
系统日志的警报等级 Alert level for syslog   Zh -1 -1 3 3 全天 All day
系统日志的危险等级 Danger level of the system log   Zh -1 -1 3 3 全天 All day
系统日志的突发事件等级 Incident Level of System Log   Zh -1 -1 2 2 全天 All day
系统日志的错误等级 System log error levels   Zh -1 -1 5 5 全天 All day
当网络设备为交换机时,待配置的预警参数包括以下一种或至少两种的任意组合: CPU使用率、交换机的入流量、交换机的出流量、端口数、内存使用率、ping、系统日志的警报等级、系统日志的危险等级、系统日志的突发事件等级以及系统日志的错误等级。When the network device is a switch, the warning parameters to be configured include one or any combination of the following: CPU usage, inbound traffic of the switch, outbound traffic of the switch, number of ports, memory usage, ping, system log Alarm level, hazard level of the system log, incident level of the system log, and error level of the system log.
示例性的,一种类型的交换机的报警策略如下:Exemplarily, the alarm policy of one type of switch is as follows:
预警参数 Early warning parameters 报警条件 Alarm conditions 最大报警次数 Maximum number of alarms 报警级别 Alarm level 生效时间 Effective time
CPU使用率 CPU usage ≥80 ≥80 3 3 2 2 全天 All day
交换机的入流量 Inbound traffic to the switch ≥70000000 ≥70000000 3 3 5 5 全天 All day
交换机的出流量 Outgoing traffic from the switch ≥70000000 ≥70000000 3 3 3 3 全天 All day
端口数 Number of ports ≥1000 ≥1000 3 3 4 4   Zh
内存使用率 Memory usage ≥95 ≥95 3 3 2 2 全天 All day
ping ping ≥5000 ≥5000 3 3 3 3 全天 All day
系统日志的警报等级 Alert level for syslog   Zh -1 -1 2 2 全天 All day
系统日志的危险等级 Danger level of the system log   Zh -1 -1 3 3 全天 All day
系统日志的突发事件等级 Incident Level of System Log   Zh -1 -1 2 2 全天 All day
系统日志的错误等级 System log error levels   Zh -1 -1 5 5 全天 All day
可以理解的是,针对不同类型的交换机,待配置的预警参数还可以包括以下一种或至少两种的任意组合:高内存利用率、低内存利用率、MAC地址漂移量、内存利用率、电源状态、温度。It can be understood that for different types of switches, the early warning parameters to be configured may also include one or any combination of the following: high memory utilization, low memory utilization, MAC address drift, memory utilization, power State, temperature.
当所述类型标识用于标识应用的公共告警模板时,设备可以是服务器或网络设备等。此时,属性参数包括已安装的应用所属的分类标识和所述应用包含的实例的标识,在其他实施例中,还可以是已安装的应用所包含的实例的标识。When the type identifier is used to identify a public alarm template of an application, the device may be a server, a network device, or the like. At this time, the attribute parameter includes the classification identifier to which the installed application belongs and the identifier of the instance included in the application. In other embodiments, it may also be the identifier of the instance included in the installed application.
例如,已安装的应用的名称为关系型数据管理系统MySQL或者已安装的应用包含的实例的标识对应MySQL时,待配置的预警参数可以包括以下一种或至少两种的任意组合:连接数量、CPU使用率、磁盘利用率、主机名称、IO线程状态、内存利用率、只读线程状态、主从同步时间间隔、数据库线程状态等。For example, when the name of the installed application is relational data management system MySQL or the identifier of the instance included in the installed application corresponds to MySQL, the alert parameters to be configured may include one or any combination of the following: the number of connections, CPU usage, disk utilization, host name, IO thread status, memory utilization, read-only thread status, master-slave synchronization interval, database thread status, etc.
示例性的,一种类型的MySQL的报警策略如下:Exemplarily, one type of MySQL alarm strategy is as follows:
预警参数 Early warning parameters 报警条件 Alarm conditions 最大报警次数 Maximum number of alarms 报警级别 Alarm level 生效时间 Effective time
连接数量 Number of connections >80 > 80 999 999 3 3 全天 All day
CPU使用率 CPU usage >80 > 80 999 999 3 3 全天 All day
磁盘使用率 Disk usage ≥90 ≥90 999 999 3 3 全天 All day
主机名称 Hostname ==1 == 1 999 999 3 3 全天 All day
内存使用率 Memory usage >80 > 80 999 999 3 3 全天 All day
只读线程状态 Read-only thread state ==1 == 1 999 999 3 3 全天 All day
主从同步时间间隔 Master-slave synchronization interval >600 > 600 999 999 3 3 全天 All day
数据库线程状态 Database thread status <1 <1 999 999 3 3 全天 All day
S1033:根据所述预警参数以及每个所述预警参数各自对应的所述告警指标,配置所述公共告警模板的子模板。S1033: Configure a sub-template of the common alarm template according to the early-warning parameter and the alarm indicator corresponding to each of the early-warning parameters.
本申请实施例,通过获取属性参数以及待配置的预警参数,从而根据属性参数创建公共告警模板的子模板,基于获取到的预警参数配置公共告警模板的子模板,进而得到个性化的告警模板,通过个性化的告警模板监控自身运行状态,能够提高预警的实时性以及告警的准确性。In the embodiment of the present application, a sub-template of a public alarm template is created according to the attribute parameters by obtaining attribute parameters and early-warning parameters to be configured, and a sub-template of the common alarm template is configured based on the obtained early-warning parameters, thereby obtaining a personalized alarm template. Monitoring your own running status through a personalized alarm template can improve the real-time nature of the warning and the accuracy of the alarm.
请参阅图3,图3是本申请一实施例提供的一种设备的结构框图,设备包括但不限于服务器、网络设备,网络设备包括但不限于交换机、防火墙设备、负载均衡设备等。设备包括的各单元用于执行图1~图2对应的实施例中的各步骤。具体请参阅图1~图2各自对应的实施例中的相关描述。为了便于说明,仅示出了与本实施例相关的部分。参见图3,设备3包括:Please refer to FIG. 3. FIG. 3 is a structural block diagram of a device provided by an embodiment of the present application. The device includes, but is not limited to, a server and a network device. The network device includes, but is not limited to, a switch, a firewall device, a load balancing device, and the like. Each unit included in the device is configured to execute steps in the embodiments corresponding to FIG. 1 to FIG. 2. For details, please refer to related descriptions in the embodiments corresponding to FIGS. 1 to 2. For convenience of explanation, only the parts related to this embodiment are shown. Referring to FIG. 3, the device 3 includes:
请求信息获取单元310,用于获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;The request information obtaining unit 310 is configured to obtain request information for requesting configuration of an alarm template, where the request information includes a type identifier of a public alarm template to be called;
参数获取单元320,用于根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;A parameter obtaining unit 320, configured to obtain attribute parameters and obtain early-warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template;
配置单元330,用于根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A configuration unit 330 is configured to create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
进一步地,配置单元330具体包括:Further, the configuration unit 330 specifically includes:
创建单元,用于根据所述属性参数调取所述公共告警模板并创建所述公共告警模板的子模板;A creating unit, configured to retrieve the public alert template and create a sub-template of the public alert template according to the attribute parameter;
确定单元,用于确定每个所述预警参数各自对应的告警指标;A determining unit, configured to determine an alarm indicator corresponding to each of the early-warning parameters;
子模板配置单元,用于根据所述预警参数以及每个所述预警参数各自对应的所述告警指标,配置所述公共告警模板的子模板。A sub-template configuration unit is configured to configure a sub-template of the public alarm template according to the early-warning parameter and the alarm indicator corresponding to each of the early-warning parameters.
进一步地,若所述类型标识用于标识主机的公共告警模板,参数获取单元320具体用于:根据所述请求信息中的所述类型标识获取唯一标识和操作系统的类型信息,以及获取所述待配置的预警参数。Further, if the type identifier is used to identify a public alarm template of the host, the parameter obtaining unit 320 is specifically configured to: obtain the unique identifier and the type information of the operating system according to the type identifier in the request information, and obtain the type identifier. Pre-alarm parameters to be configured.
进一步地,若所述类型标识用于标识网络设备的公共告警模板,参数获取单元320具体用于:根据所述请求信息中的所述类型标识获取所述生产厂商信息和设备型号,以及获取所述待配置的预警参数。Further, if the type identifier is used to identify a public alarm template of a network device, the parameter obtaining unit 320 is specifically configured to: obtain the manufacturer information and device model according to the type identifier in the request information, and obtain all Describe the pre-alarm parameters to be configured.
进一步地,若所述类型标识用于标识应用的公共告警模板,参数获取单元320具体用于:根据所述请求信息中的所述类型标识获取已安装的应用所属的分类标识和所述应用包含的实例的标识,以及获取所述待配置的预警参数;其中,所述实例用于提供服务。Further, if the type identifier is used to identify a public alarm template of an application, the parameter obtaining unit 320 is specifically configured to obtain, according to the type identifier in the request information, the classification identifier to which the installed application belongs and the application contains Identification of the instance, and obtaining the warning parameter to be configured; wherein the instance is used to provide a service.
图4是本申请另一实施例提供的一种设备的示意图。如图4所示,该实施例的设备4包括:处理器40、存储器41以及存储在所述存储器41中并可在所述处理器40上运行的计算机可读指令42,例如设备的控制程序。所述处理器40执行所述计算机可读指令42时实现上述各个设备的基于私有云的配置运维告警模板的方法实施例中的步骤,例如图1所示的S101至S103。或者,所述处理器40执行所述计算机可读指令42时实现上述各装置实施例中各单元的功能,例如图3所示单元310至330功能。FIG. 4 is a schematic diagram of a device according to another embodiment of the present application. As shown in FIG. 4, the device 4 of this embodiment includes a processor 40, a memory 41, and computer-readable instructions 42 stored in the memory 41 and executable on the processor 40, such as a control program of the device. . When the processor 40 executes the computer-readable instructions 42, the steps in the embodiment of the method for configuring the operation and maintenance alarm template based on the private cloud of each device are implemented, for example, S101 to S103 shown in FIG. 1. Alternatively, when the processor 40 executes the computer-readable instructions 42, the functions of the units in the foregoing device embodiments are implemented, for example, the functions of the units 310 to 330 shown in FIG. 3.
示例性的,所述计算机可读指令42可以被分割成一个或多个单元,所述一个或者多个单元被存储在所述存储器41中,并由所述处理器40执行,以完成本申请。所述一个或多个单元可以是能够完成特定功能的一系列计算机可读指令的指令段,该指令段用于描述所述计算机可读指令42在所述设备4中的执行过程。例如,所述计算机可读指令42可以被分割成请求信息获取单元、参数获取单元以及配置单元,各单元具体功能如上所述。Exemplarily, the computer-readable instructions 42 may be divided into one or more units, and the one or more units are stored in the memory 41 and executed by the processor 40 to complete the present application. . The one or more units may be instruction segments of a series of computer-readable instructions capable of performing a specific function, and the instruction segments are used to describe an execution process of the computer-readable instructions 42 in the device 4. For example, the computer-readable instructions 42 may be divided into a request information acquisition unit, a parameter acquisition unit, and a configuration unit, and the specific functions of each unit are as described above.
所述设备可包括,但不仅限于,处理器40、存储器41。本领域技术人员可以理解,图4仅仅是设备4的示例,并不构成对设备4的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述设备还可以包括输入输出设备、网络接入设备、总线等。The device may include, but is not limited to, a processor 40 and a memory 41. Those skilled in the art can understand that FIG. 4 is only an example of the device 4, and does not constitute a limitation on the device 4. It may include more or fewer parts than shown in the figure, or combine some parts, or different parts, such as The device may further include an input-output device, a network access device, a bus, and the like.
所称处理器40可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 40 may be a central processing unit (Central Processing Unit (CPU), or other general-purpose processors, digital signal processors (DSPs), and application-specific integrated circuits (Applications) Specific Integrated Circuit (ASIC), off-the-shelf Programmable Gate Array (FPGA), or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
所述存储器41可以是所述设备4的内部存储单元,例如设备4的硬盘或内存。所述存储器41也可以是所述设备4的外部存储设备,例如所述设备4上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器41还可以既包括所述设备4的内部存储单元也包括外部存储设备。所述存储器41用于存储所述计算机可读指令以及所述设备所需的其他程序和数据。所述存储器41还可以用于暂时地存储已经输出或者将要输出的数据。The memory 41 may be an internal storage unit of the device 4, such as a hard disk or a memory of the device 4. The memory 41 may also be an external storage device of the device 4, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card provided on the device 4, Flash card, etc. Further, the memory 41 may include both an internal storage unit of the device 4 and an external storage device. The memory 41 is configured to store the computer-readable instructions and other programs and data required by the device. The memory 41 may also be used to temporarily store data that has been output or is to be output.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to describe the technical solution of the present application, but are not limited thereto. Although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still implement the foregoing implementations. The technical solutions described in the examples are modified, or some technical features are equivalently replaced; and these modifications or replacements do not deviate the essence of the corresponding technical solutions from the spirit and scope of the technical solutions of the embodiments of the present application, and should be included in Within the scope of this application.

Claims (20)

  1. 一种基于私有云的配置运维告警模板的方法,其特征在于,包括:A method for configuring an O & M alarm template based on a private cloud, which includes:
    获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;Obtaining request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
    根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;Obtaining attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
    根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置,包括:The method according to claim 1, wherein the creating a sub-template of the public alert template based on the attribute parameters, and configuring the sub-template of the public alert template based on the alert parameters comprises:
    根据所述属性参数调取所述公共告警模板并创建所述公共告警模板的子模板;Calling the public alarm template and creating a sub-template of the public alarm template according to the attribute parameter;
    确定每个所述预警参数各自对应的告警指标;Determining an alarm indicator corresponding to each of the early-warning parameters;
    根据所述预警参数以及每个所述预警参数各自对应的所述告警指标,配置所述公共告警模板的子模板。And configuring a sub-template of the public alarm template according to the alarm parameter and the alarm indicator corresponding to each of the alarm parameters.
  3. 根据权利要求1或2所述的方法,其特征在于,若所述类型标识用于标识主机的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The method according to claim 1 or 2, wherein if the type identifier is used to identify a public alarm template of the host; the acquiring attribute parameters and acquiring the alert parameters to be configured according to the request information includes:
    根据所述请求信息中的所述类型标识获取唯一标识和操作系统的类型信息,以及获取所述待配置的预警参数。Acquiring the unique identifier and the type information of the operating system according to the type identifier in the request information, and acquiring the alert parameter to be configured.
  4. 根据权利要求1或2所述的方法,其特征在于,若所述类型标识用于标识网络设备的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The method according to claim 1 or 2, wherein if the type identifier is used to identify a public alarm template of a network device, the acquiring the attribute parameter and the alarm parameter to be configured according to the request information includes:
    根据所述请求信息中的所述类型标识获取生产厂商信息和设备型号,以及获取所述待配置的预警参数。Acquiring manufacturer information and equipment model according to the type identifier in the request information, and acquiring the alert parameter to be configured.
  5. 根据权利要求1或2所述的方法,其特征在于,若所述类型标识用于标识应用的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The method according to claim 1 or 2, wherein if the type identifier is used to identify a public alarm template of an application; and acquiring the attribute parameters and the alarm parameters to be configured according to the request information includes:
    根据所述请求信息中的所述类型标识获取已安装的应用所属的分类标识和所述应用包含的实例的标识,以及获取所述待配置的预警参数;其中,所述实例用于提供服务。Obtaining the classification identifier to which the installed application belongs and the identifier of the instance included in the application according to the type identifier in the request information, and acquiring the alert parameter to be configured; wherein the instance is used to provide a service.
  6. 一种设备,其特征在于,包括:A device, comprising:
    请求信息获取单元,用于获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;A request information obtaining unit, configured to obtain request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
    参数获取单元,用于根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;A parameter obtaining unit, configured to obtain attribute parameters and obtain early-warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alarm template;
    配置单元,用于根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A configuration unit is configured to create a sub-template of the public alarm template according to the attribute parameters, and configure a sub-template of the public alarm template based on the early-warning parameters.
  7. 如权利要求6所述的设备,其特征在于,所述配置单元具体包括:The device according to claim 6, wherein the configuration unit specifically comprises:
    创建单元,用于根据所述属性参数调取所述公共告警模板并创建所述公共告警模板的子模板;A creating unit, configured to retrieve the public alert template and create a sub-template of the public alert template according to the attribute parameter;
    确定单元,用于确定每个所述预警参数各自对应的告警指标;A determining unit, configured to determine an alarm indicator corresponding to each of the early-warning parameters;
    子模板配置单元,用于根据所述预警参数以及每个所述预警参数各自对应的所述告警指标,配置所述公共告警模板的子模板。A sub-template configuration unit is configured to configure a sub-template of the public alarm template according to the early-warning parameter and the alarm indicator corresponding to each of the early-warning parameters.
  8. 如权利要求6或7所述的设备,其特征在于,若所述类型标识用于标识主机的公共告警模板,所述参数获取单元具体用于:根据所述请求信息中的所述类型标识获取唯一标识和操作系统的类型信息,以及获取所述待配置的预警参数。The device according to claim 6 or 7, wherein if the type identifier is used to identify a public alarm template of the host, the parameter obtaining unit is specifically configured to obtain according to the type identifier in the request information Uniquely identify and type information of the operating system, and obtain the alert parameter to be configured.
  9. 如权利要求6或7所述的设备,其特征在于,若所述类型标识用于标识网络设备的公共告警模板,所述参数获取单元具体用于:根据所述请求信息中的所述类型标识获取所述生产厂商信息和设备型号,以及获取所述待配置的预警参数。The device according to claim 6 or 7, wherein if the type identifier is used to identify a public alarm template of a network device, the parameter obtaining unit is specifically configured to: according to the type identifier in the request information Acquiring the manufacturer information and equipment model, and acquiring the warning parameters to be configured.
  10. 如权利要求6或7所述的设备,其特征在于,若所述类型标识用于标识应用的公共告警模板,所述参数获取单元具体用于:根据所述请求信息中的所述类型标识获取已安装的应用所属的分类标识和所述应用包含的实例的标识,以及获取所述待配置的预警参数;其中,所述实例用于提供服务。The device according to claim 6 or 7, wherein if the type identifier is used to identify a public alarm template of an application, the parameter obtaining unit is specifically configured to obtain according to the type identifier in the request information The classification identifier to which the installed application belongs, the identifier of the instance included in the application, and the early-warning parameter to be configured are obtained; wherein the instance is used to provide a service.
  11. 一种设备,其特征在于,所述设备包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A device, wherein the device includes a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, and when the processor executes the computer-readable instructions, To achieve the following steps:
    获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;Obtaining request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
    根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;Obtaining attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
    根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
  12. 根据权利要求11所述的设备,其特征在于,所述根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置,包括:The device according to claim 11, wherein the creating a sub-template of the public alarm template based on the attribute parameters, and configuring the sub-template of the public alarm template based on the alert parameters comprises:
    根据所述属性参数调取所述公共告警模板并创建所述公共告警模板的子模板;Calling the public alarm template and creating a sub-template of the public alarm template according to the attribute parameter;
    确定每个所述预警参数各自对应的告警指标;Determining an alarm indicator corresponding to each of the early-warning parameters;
    根据所述预警参数以及每个所述预警参数各自对应的所述告警指标,配置所述公共告警模板的子模板。And configuring a sub-template of the public alarm template according to the alarm parameter and the alarm indicator corresponding to each of the alarm parameters.
  13. 根据权利要求11或12所述的设备,其特征在于,若所述类型标识用于标识主机的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The device according to claim 11 or 12, wherein if the type identifier is used to identify a public alarm template of the host; the acquiring attribute parameters and acquiring the alert parameters to be configured according to the request information comprises:
    根据所述请求信息中的所述类型标识获取唯一标识和操作系统的类型信息,以及获取所述待配置的预警参数。Acquiring the unique identifier and the type information of the operating system according to the type identifier in the request information, and acquiring the alert parameter to be configured.
  14. 根据权利要求11或12所述的设备,其特征在于,若所述类型标识用于标识网络设备的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The device according to claim 11 or 12, wherein if the type identifier is used to identify a public alarm template of a network device; the acquiring attribute parameters and acquiring the warning parameters to be configured according to the request information includes:
    根据所述请求信息中的所述类型标识获取生产厂商信息和设备型号,以及获取所述待配置的预警参数。Acquiring manufacturer information and equipment model according to the type identifier in the request information, and acquiring the alert parameter to be configured.
  15. 根据权利要求11或12所述的设备,其特征在于,若所述类型标识用于标识应用的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The device according to claim 11 or 12, wherein if the type identifier is used to identify a public alarm template of an application; and acquiring the attribute parameters and the alarm parameters to be configured according to the request information includes:
    根据所述请求信息中的所述类型标识获取已安装的应用所属的分类标识和所述应用包含的实例的标识,以及获取所述待配置的预警参数;其中,所述实例用于提供服务。Obtaining the classification identifier to which the installed application belongs and the identifier of the instance included in the application according to the type identifier in the request information, and acquiring the alert parameter to be configured; wherein the instance is used to provide a service.
  16. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其特征在于,所述计算机可读指令被至少一个处理器执行时实现如下步骤:A computer-readable storage medium storing computer-readable instructions, wherein the computer-readable instructions implement the following steps when executed by at least one processor:
    获取用于请求配置告警模板的请求信息;其中,所述请求信息包括待调取的公共告警模板的类型标识;Obtaining request information for requesting configuration of an alarm template; wherein the request information includes a type identifier of a public alarm template to be called;
    根据所述请求信息获取属性参数以及获取待配置的预警参数;其中,所述属性参数用于创建所述公共告警模板的子模板;Obtaining attribute parameters and early warning parameters to be configured according to the request information; wherein the attribute parameters are used to create a sub-template of the public alert template;
    根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置。A sub-template of the public alert template is created according to the attribute parameters, and a sub-template of the public alert template is configured based on the alert parameters.
  17. 根据权利要求15所述的计算机可读存储介质,其特征在于,所述根据所述属性参数创建所述公共告警模板的子模板,基于所述预警参数对所述公共告警模板的子模板进行配置,包括:The computer-readable storage medium of claim 15, wherein the sub-template of the public alert template is created according to the attribute parameters, and the sub-template of the public alert template is configured based on the alert parameters. ,include:
    根据所述属性参数调取所述公共告警模板并创建所述公共告警模板的子模板;Calling the public alarm template and creating a sub-template of the public alarm template according to the attribute parameter;
    确定每个所述预警参数各自对应的告警指标;Determining an alarm indicator corresponding to each of the early-warning parameters;
    根据所述预警参数以及每个所述预警参数各自对应的所述告警指标,配置所述公共告警模板的子模板。And configuring a sub-template of the public alarm template according to the alarm parameter and the alarm indicator corresponding to each of the alarm parameters.
  18. 根据权利要求16或17所述的计算机可读存储介质,其特征在于,若所述类型标识用于标识主机的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The computer-readable storage medium according to claim 16 or 17, wherein if the type identifier is used to identify a public alarm template of the host; the acquiring attribute parameters and the alert parameters to be configured according to the request information ,include:
    根据所述请求信息中的所述类型标识获取唯一标识和操作系统的类型信息,以及获取所述待配置的预警参数。Acquiring the unique identifier and the type information of the operating system according to the type identifier in the request information, and acquiring the alert parameter to be configured.
  19. 根据权利要求16或17所述的计算机可读存储介质,其特征在于,若所述类型标识用于标识网络设备的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The computer-readable storage medium according to claim 16 or 17, wherein if the type identifier is used to identify a public alert template of a network device; the attribute parameter is acquired according to the request information and an alert to be configured is acquired Parameters, including:
    根据所述请求信息中的所述类型标识获取生产厂商信息和设备型号,以及获取所述待配置的预警参数。Acquiring manufacturer information and equipment model according to the type identifier in the request information, and acquiring the alert parameter to be configured.
  20. 根据权利要求16至17所述的计算机可读存储介质,其特征在于,若所述类型标识用于标识应用的公共告警模板;所述根据所述请求信息获取属性参数以及获取待配置的预警参数,包括:The computer-readable storage medium according to claim 16 to 17, wherein if the type identifier is used to identify a public alarm template of an application; the acquiring attribute parameters and the alert parameters to be configured according to the request information ,include:
    根据所述请求信息中的所述类型标识获取已安装的应用所属的分类标识和所述应用包含的实例的标识,以及获取所述待配置的预警参数;其中,所述实例用于提供服务。Obtaining the classification identifier to which the installed application belongs and the identifier of the instance included in the application according to the type identifier in the request information, and acquiring the alert parameter to be configured; wherein the instance is used to provide a service.
PCT/CN2018/104975 2018-08-01 2018-09-11 Method and device for configuring operation and maintenance alarm template based on private cloud WO2020024369A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810863343.9 2018-08-01
CN201810863343.9A CN108847995A (en) 2018-08-01 2018-08-01 A kind of method and apparatus of the configuration O&M alarm template based on private clound

Publications (1)

Publication Number Publication Date
WO2020024369A1 true WO2020024369A1 (en) 2020-02-06

Family

ID=64192520

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/104975 WO2020024369A1 (en) 2018-08-01 2018-09-11 Method and device for configuring operation and maintenance alarm template based on private cloud

Country Status (2)

Country Link
CN (1) CN108847995A (en)
WO (1) WO2020024369A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109495347A (en) * 2018-12-10 2019-03-19 北京北信源信息安全技术有限公司 A kind of collecting method and system

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109412871B (en) * 2018-12-20 2022-03-11 高新兴科技集团股份有限公司 Internet of things equipment access management system
CN110535701B (en) * 2019-08-30 2022-07-01 新华三大数据技术有限公司 Problem positioning method and device
CN111401840B (en) * 2020-03-13 2023-10-27 中国建设银行股份有限公司 Method, apparatus, device and computer readable medium for generating guarantor information
CN113065139A (en) * 2021-05-06 2021-07-02 携程旅游网络技术(上海)有限公司 Alarm access method and system, electronic device and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102710467A (en) * 2012-06-29 2012-10-03 北京天地云箱科技有限公司 Monitoring method and monitoring device
CN105427545A (en) * 2015-12-30 2016-03-23 山东中创软件商用中间件股份有限公司 Drools-based equipment warning management method and device
CN106301919A (en) * 2016-08-17 2017-01-04 浪潮电子信息产业股份有限公司 The warning system of a kind of privatization cloud platform and its implementation
CN106685839A (en) * 2016-11-17 2017-05-17 上海斐讯数据通信技术有限公司 Method and system for monitoring router long connection service

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102486643B (en) * 2010-12-03 2015-12-16 上海宝信软件股份有限公司 A kind of equipment monitoring system based on object model
CN102324968B (en) * 2011-06-30 2016-09-07 中兴通讯股份有限公司 A kind of method and apparatus of passive optical network terminal alarm management
CN102983999B (en) * 2012-11-22 2016-05-25 安科智慧城市技术(中国)有限公司 Method for parameter configuration, the system of a kind of monitor supervision platform system and device cluster
CN104657814B (en) * 2014-12-17 2019-03-26 国电南瑞科技股份有限公司 Protective relaying device signal templates based on EMS system extract definition method
CN111865653A (en) * 2016-08-24 2020-10-30 华为技术有限公司 Service arranging method and device and service distributing method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102710467A (en) * 2012-06-29 2012-10-03 北京天地云箱科技有限公司 Monitoring method and monitoring device
CN105427545A (en) * 2015-12-30 2016-03-23 山东中创软件商用中间件股份有限公司 Drools-based equipment warning management method and device
CN106301919A (en) * 2016-08-17 2017-01-04 浪潮电子信息产业股份有限公司 The warning system of a kind of privatization cloud platform and its implementation
CN106685839A (en) * 2016-11-17 2017-05-17 上海斐讯数据通信技术有限公司 Method and system for monitoring router long connection service

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109495347A (en) * 2018-12-10 2019-03-19 北京北信源信息安全技术有限公司 A kind of collecting method and system

Also Published As

Publication number Publication date
CN108847995A (en) 2018-11-20

Similar Documents

Publication Publication Date Title
WO2020024369A1 (en) Method and device for configuring operation and maintenance alarm template based on private cloud
US10680896B2 (en) Virtualized network function monitoring
WO2020029407A1 (en) Alarm data management method and apparatus, and computer device and storage medium
US11296960B2 (en) Monitoring distributed applications
WO2020024376A1 (en) Method and device for processing operation and maintenance monitoring alarm
WO2021129367A1 (en) Method and apparatus for monitoring distributed storage system
CN110164101B (en) Alarm information processing method and equipment
WO2021051878A1 (en) Cloud resource acquisition method and apparatus based on user permission, and computer device
WO2021068814A1 (en) Method, apparatus, server, and computer-readable storage medium for monitoring for exception of hardware device
US10181988B1 (en) Systems and methods for monitoring a network device
WO2021184587A1 (en) Prometheus-based private cloud monitoring method and apparatus, and computer device and storage medium
US11706080B2 (en) Providing dynamic serviceability for software-defined data centers
US9143412B1 (en) Proxy reporting for central management systems
WO2017107656A1 (en) Virtualized network element failure self-healing method and device
WO2020015092A1 (en) Instance monitoring method and apparatus, terminal device and medium
WO2021072847A1 (en) Method and apparatus for monitoring condition of computer network, computer device, and storage medium
US20140189103A1 (en) System for monitoring servers and method thereof
WO2016197737A1 (en) Self-check processing method, apparatus and system
WO2020000760A1 (en) Server management method and device, computer apparatus, and storage medium
WO2019232931A1 (en) Node exception processing method and system, device and computer-readable storage medium
CN111625386A (en) Monitoring method and device for power-on overtime of system equipment
WO2021213171A1 (en) Server switching method and apparatus, management node and storage medium
US10659289B2 (en) System and method for event processing order guarantee
WO2016095716A1 (en) Fault information processing method and related device
US9430481B2 (en) Storage disk file subsystem and defect management systems and methods

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18928193

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18928193

Country of ref document: EP

Kind code of ref document: A1