CN109240126B - A distributed application service monitoring system and method with simulation operation function - Google Patents
A distributed application service monitoring system and method with simulation operation function
- Publication number
- CN109240126B (application number CN201811388246.5A)
- Authority
- CN
- China
- Prior art keywords
- data
- monitoring
- host
- module
- alarm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
Landscapes
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Automation & Control Theory (AREA)
- Computer And Data Communications (AREA)
- Debugging And Monitoring (AREA)
Abstract
Description
Technical Field
The present invention relates to the technical field of application service function monitoring, and specifically to a distributed application service monitoring system and method with a simulated operation function.
Background
First-level application service systems deployed on large metropolitan area networks or the Internet are increasingly common. One example is the 95598 customer service system of power grid enterprises: after receiving an urgent task such as a power outage, a fault repair report, or a service application, a 95598 customer service specialist creates a task work order in the application service system and dispatches it to the 95598 customer service workstation responsible for the area where the fault occurred, where a local customer service specialist accepts the order and assigns it to the personnel who handle the task. This task-handling mechanism depends heavily on the application service system and places very high demands on its reliability. To monitor the system's operation effectively, notify machine room managers or information system operation and maintenance personnel as soon as a fault occurs so that troubleshooting can begin, and keep the system running safely and stably, a monitoring system with simulated operation is needed to perform simulated access to the system. Such a system can discover and resolve application service faults before customer service specialists use the system, preventing cases where work orders cannot be dispatched in time and customers' urgent electricity service requests are delayed.
Commonly used machine room and application monitoring systems have two main limitations. First, they can only monitor the power and environment conditions of the machine room and lack simulated login monitoring of the application system, so they can neither detect promptly that the application system is inaccessible nor quickly locate the cause of a fault, which makes troubleshooting very difficult. Second, there is no monitoring platform that integrates the host and distributed terminals: common monitoring systems implement host monitoring only and cannot monitor the server and remote workstations synchronously, so during troubleshooting they cannot combine multiple kinds of monitoring data to quickly locate where the fault is and what it is.
Summary of the Invention
In view of the shortcomings of the prior art, the purpose of the present invention is to provide a distributed application service monitoring system and method with a function of simulating manual operation. It mainly adopts synchronous monitoring by a host and distributed clients. The monitored items include power and environment data, whether simulated login and access succeed, the time taken by a successful simulated login or access, simulated queries, and network performance, which addresses the problems raised in the background section.
Technical solution of the present invention:
A distributed application service monitoring system with a simulated operation function comprises a monitoring host, a simulated operation host, monitoring terminals, a data analysis host, an alarm system, and a data backup cloud.
The simulated operation host is connected to the monitoring host and is used to set, on the monitoring host, the data indicators to be monitored by simulation.
The monitoring terminals are deployed in a distributed manner. Every monitoring terminal is connected to the monitoring host and transmits the data it collects to the monitoring host. The monitoring terminals and the monitoring host access the application service system over two channels, a wired network and a wireless private network.
The data analysis host is connected to the monitoring host, and the monitoring host forwards the data collected by the monitoring terminals to the data analysis host for analysis.
The alarm system is connected to the data analysis host and raises an alarm when the data analysis host detects abnormal data. The data backup cloud is connected to the data analysis host and stores in the cloud both the data collected by the monitoring terminals and the abnormal data identified by the data analysis host. Data communication between the monitoring terminals and the data analysis host passes through the monitoring host.
The simulated operation host includes a simulation function setting module; the monitoring host includes a monitoring object search module and a monitoring deployment unit; each monitoring terminal includes a data collector and a data converter; and the data analysis host includes a data anomaly judgment module, a data sorting and analysis graphical report module, and an abnormal data storage module. The simulation function setting module sets the monitoring data indicators in the monitoring object search module. The monitoring object search module searches and monitors according to the monitoring standard values set in the simulation function setting module, and passes the resulting data values to the monitoring deployment unit, which deploys the monitoring terminals. The data collector collects the data monitored by the monitoring terminal; the collected data is passed to the data converter for conversion; the converted data is sent to the data analysis host for analysis; and the analyzed data values are passed to the data anomaly judgment module for anomaly judgment. When the data anomaly judgment module determines from the monitoring data that an anomaly has occurred, the data is sent along three paths: one to the data sorting and analysis graphical report module for chart and report processing, one over the network to the abnormal data storage module for storage, and one through the wireless transmission module to the alarm system. After receiving the abnormal data, the abnormal data storage module backs it up to the data backup cloud over the wireless network.
The alarm system includes a data receiving module, a microcontroller, and an alarm module. The data receiving module receives data such as power and environment data, whether simulated login and access succeed, the time taken by a successful simulated login or access, simulated queries, and network performance. The received data values are passed to the microcontroller, which controls the operation of the alarm module.
The alarm module includes a buzzer, an alarm light, a telephone alarm, and an SMS alarm, all of which are connected to the microcontroller.
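The fan-out from one controller to several alarm devices can be illustrated in software. The following is a minimal sketch only: the patent describes a microcontroller driving hardware devices, whereas here each device is stood in for by a hypothetical Python callable, and the names `dispatch_alarm` and `AlarmChannel` are assumptions made for illustration.

```python
from typing import Callable

# Hypothetical type: one alarm device, modeled as a function that takes a message.
AlarmChannel = Callable[[str], None]


def dispatch_alarm(message: str, channels: dict[str, AlarmChannel]) -> list[str]:
    """Send the message on every channel and return the names of channels that failed.

    A failure on one channel (for example, an unreachable SMS gateway) does not
    stop the remaining channels from being tried.
    """
    failed = []
    for name, send in channels.items():
        try:
            send(message)
        except Exception:
            failed.append(name)
    return failed


# Usage with stand-in channels that only record what they were asked to send.
sent: list[str] = []


def broken_sms(message: str) -> None:
    raise RuntimeError("SMS gateway unreachable")  # simulated device failure


channels = {
    "buzzer": lambda m: sent.append("buzzer: " + m),
    "alarm_light": lambda m: sent.append("alarm_light: " + m),
    "sms": broken_sms,
}
failed_channels = dispatch_alarm("network timeout on terminal 3", channels)
```

Trying every channel independently reflects the stated goal of using several alarm devices so that staff still learn of an anomaly when one device is unavailable.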
A distributed application service monitoring method with a simulated operation function includes the following specific steps:
S1. Deploy the equipment and establish communication connections among the monitoring host, the simulated operation host, the monitoring terminals, the data analysis host, the alarm system, and the data backup cloud;
S2. Define custom monitoring indicators through the simulation function setting module of the simulated operation host; the monitoring indicators include power and environment data, simulated login, simulated access, and network performance;
S3. Obtain monitoring data through the monitoring terminals: the data collector collects the data and the data converter converts it;
S4. Transmit the converted monitoring data to the data analysis host, which analyzes it;
S5. The data analysis host comprehensively analyzes the data from the monitoring terminals and the monitoring host. If a power and environment anomaly, a network disconnection, a network timeout, or abnormal application server performance is found, it sends an alarm instruction to the alarm system;
S6. After receiving the alarm instruction, the alarm system sends alarm information to the administrator through multiple channels (buzzer, alarm light, telephone, SMS);
S7. After abnormal information is generated, the data is also transmitted to the data sorting and analysis graphical report module for statistical analysis and chart display;
S8. After abnormal information is generated, the data is also stored in the abnormal data storage module and backed up to the data backup cloud.
In step S2, the specific steps for simulating login and access in the simulation function setting module are:
a. Send a page login/access request using libcurl;
b. Capture the FormData parameters of the login/access request using the Fiddler tool;
c. Analyze the encryption rules and the key points of system login and access;
d. Obtain the verification information that indicates a successful login/access, so that normal login and access can be confirmed;
e. Carry out the coding and fix the verification information in the simulated operation host.
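Steps a, b, and d above can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: Python's standard `urllib` stands in for libcurl, and the URL, form field names, and success marker are hypothetical placeholders for values that would be captured with Fiddler from the real system. The sketch only builds the request and checks a response body, so it performs no network access.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_login_request(url: str, form_fields: dict[str, str]) -> Request:
    """Build a POST request that carries the captured form fields (steps a and b)."""
    body = urlencode(form_fields).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


def login_succeeded(response_text: str, success_marker: str) -> bool:
    """Check the response for the verification information fixed in step d."""
    return success_marker in response_text


# Hypothetical values; a real deployment would use the captured parameters.
request = build_login_request(
    "http://app.example.invalid/login",
    {"username": "monitor", "password": "secret"},
)
```

If the target system encrypts or signs form fields (step c), that transformation would have to be applied to `form_fields` before the request is built; the sketch omits it because the patent does not specify the rules.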
In step S3, the simulated login and access operation proceeds as follows:
1). The monitoring terminal periodically sends login/access requests, simulating a person accessing the system;
2). The monitoring terminal receives the data returned for the login/access request;
3). The returned data is parsed to determine whether it contains a request-success flag;
4). If step 3) shows that the monitoring terminal logged in to or accessed the business application system successfully, the elapsed time of the login/access is returned;
5). If step 3) shows that the monitoring terminal failed to log in to or access the business application system, a login/access failure signal is returned to the system.
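One probe cycle from the steps above can be sketched as a small function. The names `run_probe` and `ProbeResult` are assumptions for illustration, and the `fetch` argument is a hypothetical callable that performs the actual login/access request and returns the response body; injecting it keeps the sketch independent of any particular HTTP library.

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ProbeResult:
    success: bool
    elapsed_seconds: Optional[float]  # None when the probe failed


def run_probe(fetch: Callable[[], str], success_marker: str) -> ProbeResult:
    """Run one simulated login/access and report success with elapsed time."""
    start = time.monotonic()
    try:
        body = fetch()  # steps 1) and 2): send the request, receive the data
    except OSError:
        # Connection errors and socket timeouts are OSError subclasses.
        return ProbeResult(False, None)
    elapsed = time.monotonic() - start
    if success_marker in body:  # step 3): look for the success flag
        return ProbeResult(True, elapsed)  # step 4): return elapsed time
    return ProbeResult(False, None)  # step 5): signal failure


# Usage with a stand-in fetch function instead of a real network call.
result = run_probe(lambda: "<html>Welcome</html>", "Welcome")
```

Periodic execution (step 1) would wrap `run_probe` in a scheduler or a timed loop; the interval is a deployment choice the patent leaves open.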
In step S4, the judgment logic of the data anomaly judgment module is as follows:
(1). Obtain the monitoring data of monitoring terminal N, including data from multiple monitoring terminals;
(2). First, determine whether a power and environment parameter exceeds its threshold; if so, there is a power and environment anomaly;
(3). Second, determine whether the network connection between monitoring terminal N and the application service system under inspection is interrupted; if so, the network of monitoring terminal N cannot reach the server network. If all monitoring terminals are disconnected, the problem is in the server-side network; if only some are disconnected, the problem is in the monitoring-side network;
(4). Third, determine whether the network connection between monitoring terminal N and the application service system under inspection exceeds the threshold; if so, the connection between the network of monitoring terminal N and the server network has timed out. If all monitoring terminals time out, it is a server-side network timeout; if only some time out, it is a monitoring-side network timeout;
(5). Finally, determine whether the simulated login and access of monitoring terminal N is abnormal; if so, the application server performance is abnormal. If none of the above occurs, all monitoring indicators are normal.
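The ordered checks above can be written as a short decision function. This is a sketch under assumptions: the `Reading` fields, the status strings, and the use of a single numeric power and environment value are simplifications chosen for illustration, since the patent does not define a data format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Reading:
    env_value: float              # one power and environment reading, e.g. temperature
    connected: bool               # whether the terminal reached the application system
    latency_ms: Optional[float]   # None when not connected
    login_ok: bool                # result of the simulated login/access


def classify(reading: Reading, env_limit: float, latency_limit_ms: float) -> str:
    """Apply checks (2) to (5) in order and return the first anomaly found."""
    if reading.env_value > env_limit:
        return "environment_abnormal"
    if not reading.connected:
        return "network_down"
    if reading.latency_ms is not None and reading.latency_ms > latency_limit_ms:
        return "network_timeout"
    if not reading.login_ok:
        return "app_server_abnormal"
    return "normal"


def locate(statuses: list[str], fault: str) -> str:
    """All terminals affected means server side; only some means monitoring side."""
    affected = sum(1 for status in statuses if status == fault)
    if affected == 0:
        return "none"
    return "server_side" if affected == len(statuses) else "terminal_side"


# Usage: two terminals, one of which cannot reach the application system.
readings = [Reading(22.0, True, 40.0, True), Reading(22.0, False, None, False)]
statuses = [classify(r, env_limit=35.0, latency_limit_ms=500.0) for r in readings]
where = locate(statuses, "network_down")
```

Because the checks run in a fixed order, a terminal that is disconnected is reported as a network fault rather than a login fault, which matches the precedence in steps (2) to (5).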
Compared with the prior art, the present invention provides a distributed application service monitoring system with a simulated operation function that has the following beneficial effects:
(1) It automatically determines the network connection status, network connection speed, and channel performance between the distributed monitoring terminals and the server; (2) it determines whether the application system is working normally by means of a simulated access program; (3) it automatically measures the access speed of the application system and monitors it in real time; (4) it determines whether the application system can be accessed normally; (5) it determines whether the monitoring terminals and the local power and environment facilities (power supply, environment, switches) are operating normally.
The monitoring system is provided with a data sorting and analysis graphical report module, which turns the abnormal data detected by the monitoring system into charts and reports that staff can easily view and manage. The alarm module includes a buzzer, an alarm light, a telephone alarm, and an SMS alarm; with several alarm devices configured, staff can learn of monitoring anomalies promptly and accurately. The simulation function setting module lets staff define custom monitoring indicators and simplifies management of the monitoring equipment. The system is practical and easy to deploy widely.
Description of the Drawings
Figure 1 is a system block diagram of the present invention;
Figure 2 is a working block diagram of the system of the present invention;
Figure 3 is a block diagram of the working principle of the alarm system of the present invention;
Figure 4 is a working principle diagram of the simulated login and access of the present invention;
Figure 5 is a flow chart of the simulated login and access operation of the present invention;
Figure 6 is a block diagram of the working principle of the data anomaly judgment module of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
Referring to Figure 1, the present invention provides the following technical solution:
A distributed application service monitoring system with a simulated operation function comprises a monitoring host 100, a simulated operation host 200, monitoring terminals 300, a data analysis host 400, a data backup cloud 500, and an alarm system 600.
The simulated operation host 200 is connected to the monitoring host 100 and is used to set, on the monitoring host 100, the parameters of the login page and access pages to be monitored by simulation.
The monitoring terminals 300 are deployed in a distributed manner and comprise multiple monitoring terminals, which communicate over two channels, a wired network and a wireless network. Every monitoring terminal 300 is connected to the monitoring host 100 and transmits the data it collects to the monitoring host 100.
The data analysis host 400 is connected to the monitoring host 100. The data collected by the monitoring terminals 300 is first sent to the monitoring host 100, which then forwards it to the data analysis host 400 for analysis.
The alarm system 600 and the data backup cloud 500 are connected to the data analysis host 400. The alarm system 600 raises an alarm when the data analysis host 400 detects abnormal data. The data backup cloud 500 is also connected to the monitoring host 100 as well as the data analysis host 400, and stores in the cloud the data collected by the monitoring terminals 300 and the abnormal data identified by the data analysis host 400. Data communication between the monitoring terminals 300 and the data analysis host 400 passes through the monitoring host 100.
The simulated operation host 200 includes a simulation function setting module 1; the monitoring host 100 includes a monitoring object search module 2 and a monitoring deployment unit 3; each monitoring terminal 300 includes a data collector 4 and a data converter 5; and the data analysis host 400 includes a data anomaly judgment module 6, a data sorting and analysis graphical report module 7, and an abnormal data storage module 8. The simulation function setting module 1 sets the monitoring data indicators in the monitoring object search module 2. The monitoring object search module 2 searches and monitors according to the monitoring standard values set in the simulation function setting module 1, and passes the resulting data values to the monitoring deployment unit 3, which deploys the monitoring terminals 300. The data collector 4 collects the data monitored by the monitoring terminal 300; the collected data is passed to the data converter 5 for conversion; the converted data is sent to the data analysis host 400 for analysis; and the analyzed data values are passed to the data anomaly judgment module 6 for anomaly judgment. When the data anomaly judgment module 6 determines from the monitoring data that an anomaly has occurred, the data is sent along three paths: one to the data sorting and analysis graphical report module 7 for chart and report processing, one over the network to the abnormal data storage module 8 for storage, and one through the wireless transmission module to the alarm system 600. After receiving the abnormal data, the abnormal data storage module 8 backs it up to the data backup cloud 500 over the wireless network.
The data converted by the data converter 5 is transmitted over the mobile data network to the data backup cloud 500 for backup.
The alarm system 600 includes a data receiving module 601, a microcontroller 602, and an alarm module 603. The data receiving module 601 receives the data transmitted by the data anomaly judgment module 6, such as power and environment data, whether simulated login and access succeed, the time taken by a successful simulated login or access, simulated queries, and network performance. The received data values are passed to the microcontroller 602, which controls the operation of the alarm module 603.
The alarm module 603 includes a buzzer 604, an alarm light 605, a telephone alarm 606, and an SMS alarm 607, all of which are connected to the microcontroller 602.
A distributed application service monitoring method with a simulated operation function includes the following specific steps:
S1. Deploy the equipment and establish communication connections among the monitoring host 100, the simulated operation host 200, the monitoring terminals 300, the data analysis host 400, the data backup cloud 500, and the alarm system 600;
S2. Define custom monitoring indicators through the simulation function setting module 1 of the simulated operation host 200; the monitoring indicators include power and environment data, simulated login, simulated access, and network performance;
S3. Obtain monitoring data through the monitoring terminals 300: the data collector 4 collects the data and the data converter 5 converts it;
S4. Transmit the converted monitoring data to the data analysis host 400, which analyzes the monitoring data parameters to determine whether the simulated login is abnormal;
S5. The data analysis host 400 comprehensively analyzes the data from the monitoring terminals and the monitoring host. If a power and environment anomaly, a network disconnection, a network timeout, or abnormal application server performance is found, it sends an alarm instruction to the alarm system 600;
S6. After receiving the alarm instruction, the alarm system 600 sends alarm information to the administrator through multiple channels (buzzer, alarm light, telephone, SMS);
S7. After abnormal information is generated, the data is also transmitted to the data sorting and analysis graphical report module 7 for statistical analysis and chart display;
S8. After abnormal information is generated, the data is also stored in the abnormal data storage module 8, and the abnormal data is backed up to the data backup cloud 500.
In step S2, the flow of simulated login and query request access in the simulation function setting module is shown in Figure 4. The specific steps are as follows:
a. Send a page login/access request using libcurl;
b. Capture the FormData parameters of the login/access request using the Fiddler tool;
c. Analyze the encryption rules and the key points of system login and access;
d. Obtain the verification information that indicates a successful login/access, so that normal login and access can be confirmed;
e. Carry out the coding and fix the verification information in the simulated operation host.
In step S3, the flow of the simulated login and access operation is shown in Figure 5. The specific steps are as follows:
1). The monitoring terminal periodically sends login/access requests, simulating a person accessing the system;
2). The monitoring terminal receives the data returned for the login/access request;
3). The returned data is parsed to determine whether it contains a request-success flag;
4). If step 3) shows that the monitoring terminal logged in to or accessed the business application system successfully, the elapsed time of the login/access is returned;
5). If step 3) shows that the monitoring terminal failed to log in to or access the business application system, a login/access failure signal is returned to the system.
In step S4, the judgment logic of the data anomaly judgment module is shown in Figure 6. The specific steps are as follows:
(1). Obtain the monitoring data of monitoring terminal N, which includes the data from multiple monitoring terminals;
(2). First, determine whether any power and environment parameter exceeds its threshold; if so, a power and environment anomaly is reported;
(3). Second, determine whether the network connection between monitoring terminal N and the application service system under inspection is interrupted; if so, the network of monitoring terminal N cannot reach the server network. If all monitoring terminals are unreachable, the fault is a server-side network problem; if only some are unreachable, it is a monitoring-side network problem;
(4). Third, determine whether the network connection between monitoring terminal N and the application service system under inspection exceeds the timeout threshold; if so, the connection between the network of monitoring terminal N and the server network has timed out. If all monitoring terminals time out, the fault is a server-side network timeout; if only some time out, it is a monitoring-side network timeout;
(5). Finally, determine whether the simulated login/access performed by monitoring terminal N is abnormal; if so, the application server performance is abnormal. If none of the above conditions occurs, all monitoring indicators are normal.
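The ordered checks above can be sketched as a small Python function. This is only an illustration under stated assumptions: the `TerminalReading` fields, the `diagnose` name, the single `latency_limit_ms` threshold, and the returned strings are hypothetical, and the description does not say how readings from several terminals are combined beyond the all-versus-some rule.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TerminalReading:
    """One monitoring terminal's sample; all field names are hypothetical."""
    env_over_threshold: bool  # a power/environment parameter exceeded its threshold
    connected: bool           # network link to the checked application service is up
    latency_ms: float         # measured network latency to the service
    login_ok: bool            # simulated login/access succeeded


def diagnose(readings: List[TerminalReading], latency_limit_ms: float) -> str:
    """Apply the four checks in order and return the first matching diagnosis."""
    if not readings:
        raise ValueError("at least one terminal reading is required")

    # (2) Power/environment parameters are checked first.
    if any(r.env_over_threshold for r in readings):
        return "power/environment abnormal"

    # (3) Connectivity: all terminals down points at the server side,
    # only some down points at the monitoring side.
    down = [r for r in readings if not r.connected]
    if down:
        return ("server-side network fault" if len(down) == len(readings)
                else "monitoring-side network fault")

    # (4) Latency: same all-versus-some split as for connectivity.
    slow = [r for r in readings if r.latency_ms > latency_limit_ms]
    if slow:
        return ("server-side network timeout" if len(slow) == len(readings)
                else "monitoring-side network timeout")

    # (5) Simulated login/access is checked last.
    if any(not r.login_ok for r in readings):
        return "application server performance abnormal"

    return "normal"
```

Because each check returns as soon as it matches, a network fault masks any login failure it causes, which mirrors the ordering in the steps above.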
Although embodiments of the present invention have been shown and described, those of ordinary skill in the art will understand that various changes, modifications, substitutions, and variations can be made to these embodiments without departing from the principles and spirit of the invention; the scope of the invention is defined by the appended claims and their equivalents.
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811388246.5A CN109240126B (en) | 2018-11-21 | 2018-11-21 | A distributed application service monitoring system and method with simulation operation function |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811388246.5A CN109240126B (en) | 2018-11-21 | 2018-11-21 | A distributed application service monitoring system and method with simulation operation function |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109240126A (en) | 2019-01-18 |
CN109240126B (en) | 2024-03-08 |
Family
ID=65076101
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811388246.5A Active CN109240126B (en) | 2018-11-21 | 2018-11-21 | A distributed application service monitoring system and method with simulation operation function |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109240126B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110233760B (en) * | 2019-06-11 | 2022-09-02 | 北京搜房科技发展有限公司 | System fault monitoring method and device |
CN110768859A (en) * | 2019-09-18 | 2020-02-07 | 国网江苏省电力有限公司 | A method for automatic detection of application health based on time series data |
CN110736889A (en) * | 2019-10-28 | 2020-01-31 | 海风电气(江苏)有限公司 | underwater equipment electrical performance detection system and test method |
CN112181783A (en) * | 2020-10-16 | 2021-01-05 | 广东汉鼎蜂助手网络技术有限公司 | Hard disk detection method and device, storage medium and monitoring server |
CN112364078A (en) * | 2020-11-11 | 2021-02-12 | 国网山东省电力公司泰安供电公司 | Power supply service information automatic monitoring system and method |
CN113312233A (en) * | 2021-04-30 | 2021-08-27 | 上海英众信息科技有限公司 | Computer state monitoring system |
CN113344738B (en) * | 2021-06-09 | 2022-11-29 | 广西电网有限责任公司钦州供电局 | Access management method and system suitable for data intensive monitoring |
CN113553236B (en) * | 2021-07-20 | 2022-03-01 | 深圳阿帕云计算有限公司 | Centralized automatic management system and method for physical machines in data center |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101989925A (en) * | 2010-07-23 | 2011-03-23 | 德讯科技股份有限公司 | System and method for taking over out-of-band IT equipment by applying libcurl tool |
CN103326902A (en) * | 2013-06-28 | 2013-09-25 | 广东电网公司电力科学研究院 | Configurable monitoring system and monitoring method for distributed type mainframe performance testing data |
CN104699612A (en) * | 2015-03-25 | 2015-06-10 | 北京嘀嘀无限科技发展有限公司 | Processing method, equipment and system used in software testing |
CN105915405A (en) * | 2016-03-29 | 2016-08-31 | 深圳市中博科创信息技术有限公司 | Large-scale cluster node performance monitoring system |
CN106100938A (en) * | 2016-08-19 | 2016-11-09 | 浪潮(北京)电子信息产业有限公司 | The monitoring of a kind of distributed cluster system and alarm method and system |
CN107834703A (en) * | 2017-11-21 | 2018-03-23 | 武汉精伦电气有限公司 | A kind of intelligent grid power distribution room monitoring management system and method |
CN207677512U (en) * | 2018-04-20 | 2018-07-31 | 云南电网有限责任公司丽江供电局 | It is a kind of with electricity consumption monitoring and to manage system for power distribution network management |
CN108418903A (en) * | 2018-05-28 | 2018-08-17 | 苏州德姆斯信息技术有限公司 | Embedded software daily record remote access system and access method |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2015064872A (en) * | 2013-08-29 | 2015-04-09 | 株式会社リコー | Monitoring system, system, and monitoring method |
Also Published As
Publication number | Publication date |
---|---|
CN109240126A (en) | 2019-01-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109240126B (en) | A distributed application service monitoring system and method with simulation operation function | |
CN103473710B (en) | A kind of failure sorted processing method concentrating operational system | |
CN107994539A (en) | A kind of distribution line failure detecting system based on Cloud Server | |
CN105049223B (en) | A kind of power telecom network defect troubleshooting decision assistant analysis method | |
CN110768846A (en) | Intelligent substation network safety protection system | |
CN105262210A (en) | System and method for analysis and early warning of substation network security | |
CN111259073A (en) | An intelligent judgment system for business system running status based on logs, traffic and business access | |
CN105450472A (en) | Method and device for automatically acquiring states of physical components of servers | |
CN103295155A (en) | Security core service system monitoring method | |
CN112994972B (en) | Distributed probe monitoring platform | |
CN112505471A (en) | Transient disturbance-based early fault early warning and positioning system and method for looped network cable | |
CN112449019A (en) | IMS intelligent Internet of things operation and maintenance management platform | |
CN109698766A (en) | Method and system for fault analysis of communication power supply | |
CN110445694A (en) | A method of trigger notice is monitored based on Zabbix | |
CN104238509A (en) | Data acquisition remote monitoring system | |
CN107748946A (en) | Electric power optical transmission device state-detection evaluation system | |
CN110275509A (en) | An energy storage power station monitoring function testing method and system | |
CN110429977A (en) | A kind of optical cable fibre core real-time monitoring system and method based on light source photodetector array | |
CN104410376A (en) | Power amplifier system capable of monitoring fault | |
CN101414737A (en) | Control method and system for track traffic electric power data acquisition and surveillance | |
CN205983124U (en) | Comprehensive supervision system | |
CN209388135U (en) | A Distributed Application Service Monitoring System with Simulation Operation Function | |
CN115514099B (en) | Electric power safety inspection system and method | |
CN116506278A (en) | An anomaly monitoring platform based on zabbix | |
CN205427007U (en) | Steal electric report system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||