CN111290903B - Software system monitoring method and device based on user behavior and machine learning - Google Patents
Software system monitoring method and device based on user behavior and machine learning Download PDFInfo
- Publication number
- CN111290903B CN111290903B CN201811390496.2A CN201811390496A CN111290903B CN 111290903 B CN111290903 B CN 111290903B CN 201811390496 A CN201811390496 A CN 201811390496A CN 111290903 B CN111290903 B CN 111290903B
- Authority
- CN
- China
- Prior art keywords
- early warning
- current moment
- users
- current
- logged
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 50
- 238000000034 method Methods 0.000 title claims abstract description 45
- 238000010801 machine learning Methods 0.000 title claims abstract description 28
- 230000006399 behavior Effects 0.000 title abstract description 5
- 238000012806 monitoring device Methods 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 6
- 230000000007 visual effect Effects 0.000 claims description 6
- 238000012360 testing method Methods 0.000 description 13
- 238000012423 maintenance Methods 0.000 description 8
- 238000010586 diagram Methods 0.000 description 7
- 230000000694 effects Effects 0.000 description 7
- 238000012545 processing Methods 0.000 description 4
- 238000005070 sampling Methods 0.000 description 4
- 230000002159 abnormal effect Effects 0.000 description 2
- 230000005856 abnormality Effects 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 238000011835 investigation Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- 230000003442 weekly effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/302—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a software system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3055—Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/32—Monitoring with visual or acoustical indication of the functioning of the machine
- G06F11/324—Display of status information
- G06F11/327—Alarm or error message display
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computing Systems (AREA)
- Quality & Reliability (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Debugging And Monitoring (AREA)
Abstract
本发明实施例提供一种基于用户行为和机器学习的软件系统监控方法及装置,所述方法包括:获取当前时刻的登录用户数;若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。本发明实施例提供的基于用户行为和机器学习的软件系统监控方法及装置,通过设置动态的预警阈值,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。
Embodiments of the present invention provide a software system monitoring method and device based on user behavior and machine learning. The method includes: obtaining the number of logged-in users at the current moment; if it is determined that the number of logged-in users at the current moment is greater than the current warning threshold, A warning message will be issued. The software system monitoring method and device based on user behavior and machine learning provided by the embodiments of the present invention can monitor the operating status of the system in real time by setting a dynamic early warning threshold, thereby improving the accuracy and timeliness of system monitoring.
Description
技术领域technical field
本发明实施例涉及系统监控技术领域,尤其涉及一种基于用户行为和机器学习的软件系统监控方法及装置。The embodiments of the present invention relate to the technical field of system monitoring, in particular to a software system monitoring method and device based on user behavior and machine learning.
背景技术Background technique
软件系统的监控对于软件系统的稳定性非常重要,通过对软件系统的监控,能够及时发现软件系统的异常,并通过短信、邮件等形式通知相关运维人员。The monitoring of the software system is very important to the stability of the software system. Through the monitoring of the software system, the abnormality of the software system can be discovered in time, and the relevant operation and maintenance personnel can be notified by SMS, email, etc.
现有技术中,通过将目标软件系统链接加入拨测系统,定期拨测,来监控目标软件系统状况。拨测时,需要目标软件系统预先准备拨测链接,并加入拨测计划,设置拨测频率等。启用后,拨测系统将定期访问拨测链接,根据返回结果或是否有返回判断目标软件系统的状态。具体来说,就是目标软件系统无返回,或是从拨测系统角度认为返回异常的情况下,确定目标软件系统已经发生异常,此时,通过短信、邮件等形式通知相关运维人员。In the prior art, the status of the target software system is monitored by linking the target software system into the dial-up testing system and periodically dial-up testing. During the dial-up test, the target software system needs to prepare the dial-up test link in advance, add the dial-up test plan, set the dial-up test frequency, etc. After it is enabled, the dial test system will visit the dial test link regularly, and judge the status of the target software system according to the returned results or whether there is any return. Specifically, if the target software system does not return, or if the return is considered abnormal from the perspective of the dial test system, it is determined that the target software system has an abnormality. At this time, the relevant operation and maintenance personnel will be notified by SMS or email.
但是,拨测系统需要拨测链接,往往是利旧,如目标软件系统主页,如果覆盖面不够,还需加入其它链接,拨测频率往往不可低于用户使用频率,否则会达不到监控效果,无形中增加了软件系统的使用压力,同时机械的检查链接返回结果,只有在软件系统已经异常的情况下发出告警,针对用户使用频率高的软件系统起不到实质性作用。However, the dial-up test system needs dial-up links, which are often old ones. For example, the target software system homepage, if the coverage is not enough, other links need to be added. The dial-up test frequency should not be lower than the user frequency, otherwise the monitoring effect will not be achieved. Invisibly increases the use pressure of the software system, and at the same time mechanically checks the link to return the result, and only sends an alarm when the software system is abnormal, and does not play a substantial role in the software system that is frequently used by users.
发明内容Contents of the invention
本发明实施例的目的是提供一种克服上述问题或者至少部分地解决上述问题的基于用户行为和机器学习的软件系统监控方法及装置。The purpose of the embodiments of the present invention is to provide a software system monitoring method and device based on user behavior and machine learning that overcomes the above problems or at least partially solves the above problems.
为了解决上述技术问题,一方面,本发明实施例提供一种软件系统监控方法,包括:In order to solve the above technical problems, on the one hand, an embodiment of the present invention provides a method for monitoring a software system, including:
获取当前时刻的登录用户数;Get the number of logged-in users at the current moment;
若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。If it is judged that the number of logged-in users at the current moment is greater than the early warning threshold at the current moment, an early warning message is issued.
另一方面,本发明实施例提供一种软件系统监控装置,包括:On the other hand, an embodiment of the present invention provides a software system monitoring device, including:
获取模块,用于获取当前时刻的登录用户数;The obtaining module is used to obtain the number of logged-in users at the current moment;
预警模块,用于若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。The early warning module is used to issue early warning information if it is determined that the number of logged-in users at the current moment is greater than the early warning threshold at the current moment.
再一方面,本发明实施例提供一种电子设备,包括:In another aspect, an embodiment of the present invention provides an electronic device, including:
存储器和处理器,所述处理器和所述存储器通过总线完成相互间的通信;所述存储器存储有可被所述处理器执行的程序指令,所述处理器调用所述程序指令能够执行上述的方法。A memory and a processor, the processor and the memory communicate with each other through a bus; the memory stores program instructions that can be executed by the processor, and the processor invokes the program instructions to perform the above-mentioned method.
又一方面,本发明实施例提供一种非暂态计算机可读存储介质,其上存储有计算机程序,当所述计算机程序被处理器执行时,实现上述的方法。In yet another aspect, an embodiment of the present invention provides a non-transitory computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the above method is implemented.
本发明实施例提供的软件系统监控方法及装置,通过设置动态的预警阈值,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring method and device provided by the embodiments of the present invention monitor the operating status of the system in real time by setting a dynamic early warning threshold, thereby improving the accuracy and timeliness of system monitoring.
附图说明Description of drawings
图1为本发明实施例提供的软件系统监控方法示意图;FIG. 1 is a schematic diagram of a software system monitoring method provided by an embodiment of the present invention;
图2为本发明实施例提供的软件系统监控装置示意图;2 is a schematic diagram of a software system monitoring device provided by an embodiment of the present invention;
图3为本发明实施例提供的电子设备的结构示意图。FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention.
具体实施方式Detailed ways
为了使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts fall within the protection scope of the present invention.
图1为本发明实施例提供的软件系统监控方法示意图,如图1所示,本发明实施例提供一种软件系统监控方法,其执行主体为软件系统监控装置,该方法包括:FIG. 1 is a schematic diagram of a software system monitoring method provided by an embodiment of the present invention. As shown in FIG. 1 , an embodiment of the present invention provides a software system monitoring method, the execution subject of which is a software system monitoring device, and the method includes:
步骤S101、获取当前时刻的登录用户数;Step S101, obtaining the number of logged-in users at the current moment;
步骤S102、若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。Step S102 , if it is determined that the number of logged-in users at the current time is greater than the current warning threshold, then send a warning message.
具体来说,本发明实施例提供的软件系统监控方法,建立有效的可以持续反映系统使用情况的方法,提高了监控的实时性,配合其他监控方式使用达到监控工作的初衷目标。Specifically, the software system monitoring method provided by the embodiment of the present invention establishes an effective method that can continuously reflect the usage of the system, improves the real-time performance of monitoring, and cooperates with other monitoring methods to achieve the original goal of monitoring work.
因系统使用情况是时刻变化的,本发明实施例提供的软件系统监控方法监控预警阀值也应该是时刻变化的,因此,可保证预警的及时性和准确性。Since the usage of the system changes from time to time, the monitoring and early warning threshold of the software system monitoring method provided by the embodiment of the present invention should also change from time to time, so that the timeliness and accuracy of the early warning can be guaranteed.
首先,获取当前时刻的登录用户数。为了做到实时反映系统使用情况,抽取系统的时段登录用户数为关键数据,同时考虑算法的复杂度和可行性,需要将时刻的颗粒度设置在一个合理的范围内,颗粒度太小,则将消耗巨大的计算资源,颗粒度太大,又将起不到很好的实时监控效果。经过对实际情况的考察,以及试验结果,以分钟作为时刻的颗粒度进行统计比较合适,即,在时间间隔长短上最终确定了以分钟为统计单元时段,进行数据的获取、计算,以及系统的动态预警。当前时刻的登录用户数可以是当前一分钟内某一个采样数据,也可以是各个采样数据的平均值,具体可结合实际情况而定。First, get the number of logged-in users at the current moment. In order to reflect the system usage in real time, the number of logged-in users in the system during the time period is extracted as the key data. At the same time, considering the complexity and feasibility of the algorithm, it is necessary to set the time granularity within a reasonable range. If the granularity is too small, then It will consume huge computing resources, and the granularity is too large, which will not achieve a good real-time monitoring effect. After the investigation of the actual situation and the test results, it is more appropriate to use minutes as the granularity of time for statistics, that is, in terms of the length of the time interval, minutes are finally determined as the statistical unit period for data acquisition, calculation, and system. Dynamic warning. The number of logged-in users at the current moment can be a certain sampling data within the current minute, or the average value of each sampling data, which can be determined according to the actual situation.
获取当前时刻的登录用户数之后,如果判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。可以通过短信或者邮件的形式通知运维人员,以便做出及时的处理。After obtaining the number of logged-in users at the current moment, if it is judged that the number of logged-in users at the current moment is greater than the early-warning threshold at the current moment, an early-warning message is issued. The operation and maintenance personnel can be notified by SMS or email for timely processing.
如果判断获知当前时刻的登录用户数小于等于当前时刻的预警阈值,则说明系统正常,不发出预警信息。If it is judged that the number of logged-in users at the current moment is less than or equal to the warning threshold at the current moment, it means that the system is normal and no warning information is issued.
其中,当前时刻的预警阈值是通过预设的机器学习模型,根据历史数据计算得到的,不同时刻的预警阈值不同,从而起到动态的、实时的、准确的预警效果。Among them, the early warning threshold at the current moment is calculated based on the historical data through the preset machine learning model, and the early warning threshold at different moments is different, so as to achieve a dynamic, real-time and accurate early warning effect.
例如,获取到当前时刻的登录用户人数为100,如果通过计算得到的当前时刻的预警阈值为90,说明当前的登录用户人数超过了当前时刻的预警阈值,则发出预警信息,通过短信的形式通知运维人员,以便做出及时的处理。For example, the number of logged-in users obtained at the current moment is 100. If the calculated warning threshold at the current moment is 90, it means that the current number of logged-in users exceeds the warning threshold at the current moment. Operation and maintenance personnel, in order to make timely processing.
本发明实施例提供的软件系统监控方法,通过设置动态的预警阈值,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring method provided by the embodiment of the present invention monitors the operating state of the system in real time by setting a dynamic early warning threshold, thereby improving the accuracy and timeliness of system monitoring.
在上述实施例的基础上,进一步地,所述若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息之前,还包括:On the basis of the above embodiments, further, if it is judged that the number of logged-in users at the current moment is greater than the early warning threshold at the current moment, before sending the early warning information, it also includes:
获取若干个周的历史数据,所述历史数据包括每一时刻的登录用户数,每一时刻的预警状态、以及每一时刻的历史预警阈值;Obtain several weeks of historical data, the historical data includes the number of logged-in users at each moment, the warning status at each moment, and the historical warning threshold at each moment;
将所述历史数据或者所述历史数据中的一部分,输入至预设的机器学习模型,输出当前时刻的预警阈值。The historical data or a part of the historical data is input into a preset machine learning model, and an early warning threshold at the current moment is output.
具体来说,本发明实施例提供的软件系统监控方法,根据历史数据,利用预设的机器学习模型,计算得到当前时刻的预警阈值,不同时刻的预警阈值不同,从而起到动态的、实时的、准确的预警效果。Specifically, the software system monitoring method provided by the embodiment of the present invention uses the preset machine learning model to calculate the current warning threshold based on historical data. , Accurate early warning effect.
在预警之前,首先,获取若干个周的历史数据,历史数据包括每一时刻的登录用户数,每一时刻的预警状态、以及每一时刻的历史预警阈值。Before the early warning, firstly, several weeks of historical data are obtained. The historical data includes the number of logged-in users at each moment, the early warning status at each moment, and the historical early warning threshold at each moment.
然后,将历史数据或者历史数据中的一部分,输入至预设的机器学习模型,输出当前时刻的预警阈值。Then, input the historical data or a part of the historical data into the preset machine learning model, and output the early warning threshold at the current moment.
获取的历史数据的大小,可以根据实际情况而定,因为系统的使用情况以周为一个周期呈现出周期性的变化,因此,计算预警阈值时,要考虑这一实际情况,获取的历史数据以周为时间长度,例如,获取12个周的历史数据。The size of the historical data obtained can be determined according to the actual situation, because the usage of the system shows periodic changes on a weekly basis. Therefore, when calculating the early warning threshold, this actual situation should be considered. Week is the length of time, for example, to obtain 12 weeks of historical data.
机器学习模型是预先设计好的,使用机器学习模型计算预警阈值,人工干预度较低,具有客观的参考价值,使得到的当前时刻的预警阈值更加合理,更加准确。The machine learning model is pre-designed. Using the machine learning model to calculate the early warning threshold has a low degree of manual intervention and has an objective reference value, making the current early warning threshold more reasonable and accurate.
本发明实施例提供的软件系统监控方法,通过机器学习模型动态地计算当前时刻的预警阈值,并动态跟踪当前的登录用户数,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring method provided by the embodiment of the present invention dynamically calculates the early warning threshold at the current moment through the machine learning model, and dynamically tracks the current number of logged-in users, monitors the operating status of the system in real time, and improves the accuracy and timeliness of system monitoring sex.
在以上各实施例的基础上,进一步地,所述机器学习模型具体为:On the basis of the above embodiments, further, the machine learning model is specifically:
其中,Yt为当前时刻的预警阈值,Z为系统所能承载的最大用户数,Bt-1为当前时刻所在周之前的第1周的同一时刻的预警状态,Bt-1的值取0或1,St-1=0表示当前时刻所在周之前的第1周的同一时刻未发出预警信息,Bt-1=1表示当前时刻所在周之前的第1周的同一时刻发出了预警信息,Bt-2为当前时刻所在周之前的第2周的同一时刻的预警状态,Bt-2的值取0或1,Bt-2=0表示当前时刻所在周之前的第2周的同一时刻未发出预警信息,Bt-2=1表示当前时刻所在周之前的第2周的同一时刻发出了预警信息,Bt-m为当前时刻所在周之前的第m周的同一时刻的预警状态,Bt-m的值取0或1,Bt-m=0表示当前时刻所在周之前的第m周的同一时刻未发出预警信息,Bt-m=1表示当前时刻所在周之前的第m周的同一时刻发出了预警信息,Yt-1为当前时刻所在周之前的第1周的同一时刻的预警阈值,为所有的历史预警阈值的平均值,Xi表示当前时刻所在周之前的第i周的同一时刻的登录用户数,N为所述若干个周的数量,N>m。Among them, Y t is the early warning threshold at the current moment, Z is the maximum number of users that the system can carry, B t-1 is the early warning status at the same time in the first week before the current moment, and the value of B t-1 is taken as 0 or 1, S t-1 =0 indicates that no early warning information was issued at the same time in the first week before the current time, and B t-1 =1 indicates that an early warning was issued at the same time in the first week before the current time Information, B t-2 is the warning state at the same time of the week before the current moment, the value of B t-2 is 0 or 1, B t-2 = 0 means the second week before the week of the current moment No early warning information was issued at the same time of the week, B t-2 = 1 means that the early warning information was issued at the same time in the second week before the current time, and B tm is the warning status at the same time in the mth week before the current time. , the value of B tm is 0 or 1, B tm = 0 means that the warning information is not issued at the same time in the mth week before the current time, B tm = 1 means that the current time is issued at the same time in the mth week before the week The early warning information, Y t-1 is the early warning threshold at the same time in the first week before the current time, is the average value of all historical early warning thresholds, Xi represents the number of logged-in users at the same time in the i-th week before the current time, N is the number of weeks, and N>m.
具体来说,例如,以分钟为时刻的颗粒度,当前时刻为2018年第13周的星期三的第7分钟,获取12个周的历史数据,即N=12,另外,取m=3,即,分析当前时刻所在周之前的3个周的同一时刻的预警状态。Specifically, for example, the minute is the granularity of time, the current time is the 7th minute of Wednesday in the 13th week of 2018, and the historical data of 12 weeks is obtained, that is, N=12. In addition, m=3, that is , to analyze the warning status at the same time in the 3 weeks before the current time.
机器学习模型具体为:The machine learning model is specifically:
其中,Yt为2018年第13周的星期三的第7分钟的预警阈值,Z为系统所能承载的最大用户数,Bt-1为2018年第12周的星期三的第7分钟的预警状态,Bt-1的值取0或1,Bt-1=0表示2018年第12周的星期三的第7分钟未发出预警信息,Bt-1=1表示2018年第12周的星期三的第7分钟发出了预警信息,Bt-2为2018年第11周的星期三的第7分钟的预警状态,St-2的值取0或1,Bt-2=0表示2018年第11周的星期三的第7分钟未发出预警信息,Bt-2=1表示2018年第11周的星期三的第7分钟发出了预警信息,Bt-3为2018年第10周的星期三的第7分钟的预警状态,Bt-m的值取0或1,Bt-3=0表示2018年第10周的星期三的第7分钟未发出预警信息,Bt-3=1表示2018年第10周的星期三的第7分钟发出了预警信息,Yt-1为2018年第12周的星期三的第7分钟的预警阈值,为所有的历史预警阈值的平均值,Xi表示2018年第13周之前的第i周的星期三的第7分钟的登录用户数,例如,X2为2018年第11周的星期三的第7分钟的登录用户数。Among them, Y t is the warning threshold at the 7th minute on Wednesday of the 13th week in 2018, Z is the maximum number of users that the system can carry, and B t-1 is the warning status at the 7th minute of Wednesday in the 12th week of 2018 , the value of B t-1 is 0 or 1, B t-1 = 0 means that no warning information was issued on the 7th minute of Wednesday of the 12th week in 2018, B t-1 = 1 means that An early warning message was issued at the 7th minute, B t-2 is the warning status at the 7th minute on Wednesday of the 11th week of 2018, the value of S t-2 is 0 or 1, B t-2 = 0 means the 11th week of 2018 No early warning information was issued on the 7th minute of Wednesday of the week, B t-2 = 1 means that an early warning information was issued on the 7th minute of Wednesday of the 11th week of 2018, B t-3 is the 7th of Wednesday of the 10th week of 2018 Minute warning status, the value of B tm is 0 or 1, B t-3 = 0 means that no warning information was issued on the 7th minute of Wednesday in the 10th week of 2018, B t-3 = 1 means that the 10th week of 2018 The warning message was issued on the 7th minute of Wednesday, Y t-1 is the warning threshold of the 7th minute of Wednesday in the 12th week of 2018, is the average value of all historical warning thresholds, X i represents the number of logged-in users at the 7th minute of the Wednesday of the i-th week before the 13th week of 2018, for example, X 2 is the 7th minute of the Wednesday of the 11th week of 2018 The number of logged-in users.
本发明实施例提供的软件系统监控方法,通过机器学习模型动态地计算当前时刻的预警阈值,并动态跟踪当前的登录用户数,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring method provided by the embodiment of the present invention dynamically calculates the early warning threshold at the current moment through the machine learning model, and dynamically tracks the current number of logged-in users, monitors the operating status of the system in real time, and improves the accuracy and timeliness of system monitoring sex.
在以上各实施例的基础上,进一步地,所述输出所述预警阈值之后,还包括:On the basis of the above embodiments, further, after outputting the warning threshold, it further includes:
根据当前时刻的登录用户数和当前时刻的预警阈值,生成动态可视化监控图。According to the number of logged-in users at the current moment and the early warning threshold at the current moment, a dynamic visual monitoring map is generated.
具体来说,为了便于运维人员直观地监控系统的运行情况,根据当前时刻的登录用户数和当前时刻的预警阈值,生成动态可视化监控图,即,将登录用户数和预警阈值以动态曲线的形式显示出来,运维人员可根据动态可视化监控图直观地监控的运行情况。Specifically, in order to facilitate the operation and maintenance personnel to intuitively monitor the operation of the system, a dynamic visual monitoring graph is generated according to the number of logged-in users at the current moment and the warning threshold at the current moment, that is, the number of logged-in users and the warning threshold are represented by the dynamic curve The form is displayed, and the operation and maintenance personnel can intuitively monitor the operation status according to the dynamic visual monitoring diagram.
将获取到的当前时刻的登录用户数和当前时刻的预警阈值存储到Oracle数据库中,用来支持动态可视化监控图的快速数据查询。The obtained number of logged-in users at the current moment and the warning threshold at the current moment are stored in the Oracle database to support fast data query of dynamic visual monitoring graphs.
本发明实施例提供的软件系统监控方法,通过机器学习模型动态地计算当前时刻的预警阈值,并动态跟踪当前的登录用户数,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring method provided by the embodiment of the present invention dynamically calculates the early warning threshold at the current moment through the machine learning model, and dynamically tracks the current number of logged-in users, monitors the operating status of the system in real time, and improves the accuracy and timeliness of system monitoring sex.
在以上各实施例的基础上,进一步地,所述获取当前时刻的登录用户数,具体包括:On the basis of the above embodiments, further, the acquisition of the number of logged-in users at the current moment specifically includes:
获取当前时刻的系统日志信息;Obtain the system log information at the current moment;
从所述系统日志信息中提取当前时刻的登录用户数。The number of logged-in users at the current moment is extracted from the system log information.
具体来说,通过J2EE框架开发分析程序,从大量基础日志数据存储ES进行关键数据抓取。Specifically, the analysis program is developed through the J2EE framework, and key data is captured from a large number of basic log data storage ES.
首先,获取当前时刻的系统日志。系统日志信息包括各个系统登录日志与操作日志。First, get the syslog at the current moment. System log information includes various system login logs and operation logs.
然后,从系统日志信息中提取当前时刻的登录用户数。Then, the number of logged-in users at the current moment is extracted from the system log information.
本发明实施例提供的软件系统监控方法,通过机器学习模型动态地计算当前时刻的预警阈值,并动态跟踪当前的登录用户数,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring method provided by the embodiment of the present invention dynamically calculates the early warning threshold at the current moment through the machine learning model, and dynamically tracks the current number of logged-in users, monitors the operating status of the system in real time, and improves the accuracy and timeliness of system monitoring sex.
图2为本发明实施例提供的软件系统监控装置示意图,如图2所示,本发明实施例提供一种软件系统监控装置,用于执行上述任一实施例中所述的方法,具体包括获取模块201和预警模块202,其中:FIG. 2 is a schematic diagram of a software system monitoring device provided by an embodiment of the present invention. As shown in FIG. 2 , an embodiment of the present invention provides a software system monitoring device for performing the method described in any of the above embodiments, specifically including obtaining
获取模块201用于获取当前时刻的登录用户数;预警模块202用于若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。The obtaining
具体来说,本发明实施例提供的软件系统监控方法,建立有效的可以持续反映系统使用情况的方法,提高了监控的实时性,配合其他监控方式使用达到监控工作的初衷目标。Specifically, the software system monitoring method provided by the embodiment of the present invention establishes an effective method that can continuously reflect the usage of the system, improves the real-time performance of monitoring, and cooperates with other monitoring methods to achieve the original goal of monitoring work.
因系统使用情况是时刻变化的,本发明实施例提供的软件系统监控方法监控预警阀值也应该是时刻变化的,因此,可保证预警的及时性和准确性。Since the usage of the system changes from time to time, the monitoring and early warning threshold of the software system monitoring method provided by the embodiment of the present invention should also change from time to time, so that the timeliness and accuracy of the early warning can be guaranteed.
首先,通过获取模块201获取当前时刻的登录用户数。为了做到实时反映系统使用情况,抽取系统的时段登录用户数为关键数据,同时考虑算法的复杂度和可行性,需要将时刻的颗粒度设置在一个合理的范围内,颗粒度太小,则将消耗巨大的计算资源,颗粒度太大,又将起不到很好的实时监控效果。经过对实际情况的考察,以及试验结果,以分钟作为时刻的颗粒度进行统计比较合适,即,在时间间隔长短上最终确定了以分钟为统计单元时段,进行数据的获取、计算,以及系统的动态预警。当前时刻的登录用户数可以是当前一分钟内某一个采样数据,也可以是各个采样数据的平均值,具体可结合实际情况而定。First, the number of logged-in users at the current moment is acquired through the
获取当前时刻的登录用户数之后,通过预警模块202如果判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。可以通过短信或者邮件的形式通知运维人员,以便做出及时的处理。After obtaining the number of logged-in users at the current moment, if the early-warning
如果判断获知当前时刻的登录用户数小于等于当前时刻的预警阈值,则说明系统正常,不发出预警信息。If it is judged that the number of logged-in users at the current moment is less than or equal to the warning threshold at the current moment, it means that the system is normal and no warning information is issued.
其中,当前时刻的预警阈值是通过预设的机器学习模型,根据历史数据计算得到的,不同时刻的预警阈值不同,从而起到动态的、实时的、准确的预警效果。Among them, the early warning threshold at the current moment is calculated based on the historical data through the preset machine learning model, and the early warning threshold at different moments is different, so as to achieve a dynamic, real-time and accurate early warning effect.
例如,获取到当前时刻的登录用户人数为100,如果通过计算得到的当前时刻的预警阈值为90,说明当前的登录用户人数超过了当前时刻的预警阈值,则发出预警信息,通过短信的形式通知运维人员,以便做出及时的处理。For example, the number of logged-in users obtained at the current moment is 100. If the calculated warning threshold at the current moment is 90, it means that the current number of logged-in users exceeds the warning threshold at the current moment. Operation and maintenance personnel, in order to make timely processing.
本发明实施例提供的软件系统监控装置,通过设置动态的预警阈值,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring device provided by the embodiment of the present invention monitors the operating state of the system in real time by setting a dynamic early warning threshold, thereby improving the accuracy and timeliness of system monitoring.
在上述实施例的基础上,进一步地,还包括计算模块,On the basis of the above embodiments, it further includes a calculation module,
所述计算模块用于获取若干个周的历史数据,所述历史数据包括每一时刻的登录用户数,每一时刻的预警状态、以及每一时刻的历史预警阈值;The calculation module is used to obtain several weeks of historical data, the historical data including the number of logged-in users at each moment, the warning status at each moment, and the historical warning threshold at each moment;
将所述历史数据或者所述历史数据中的一部分,输入至预设的机器学习模型,输出当前时刻的预警阈值。The historical data or a part of the historical data is input into a preset machine learning model, and an early warning threshold at the current moment is output.
具体来说,本发明实施例提供的软件系统监控装置,根据历史数据,利用预设的机器学习模型,计算得到当前时刻的预警阈值,不同时刻的预警阈值不同,从而起到动态的、实时的、准确的预警效果。Specifically, the software system monitoring device provided by the embodiment of the present invention uses the preset machine learning model to calculate the current warning threshold according to the historical data. , Accurate early warning effect.
在预警之前,首先,通过计算模块获取若干个周的历史数据,历史数据包括每一时刻的登录用户数,每一时刻的预警状态、以及每一时刻的历史预警阈值。Before the early warning, firstly, several weeks of historical data are obtained through the calculation module. The historical data includes the number of logged-in users at each moment, the early warning status at each moment, and the historical early warning threshold at each moment.
然后,将历史数据或者历史数据中的一部分,输入至预设的机器学习模型,输出当前时刻的预警阈值。Then, input the historical data or a part of the historical data into the preset machine learning model, and output the early warning threshold at the current moment.
获取的历史数据的大小,可以根据实际情况而定,因为系统的使用情况以周为一个周期呈现出周期性的变化,因此,计算预警阈值时,要考虑这一实际情况,获取的历史数据以周为时间长度,例如,获取12个周的历史数据。The size of the historical data obtained can be determined according to the actual situation, because the usage of the system shows periodic changes on a weekly basis. Therefore, when calculating the early warning threshold, this actual situation should be considered. Week is the length of time, for example, to obtain 12 weeks of historical data.
机器学习模型是预先设计好的,使用机器学习模型计算预警阈值,人工干预度较低,具有客观的参考价值,使得到的当前时刻的预警阈值更加合理,更加准确。The machine learning model is pre-designed. Using the machine learning model to calculate the early warning threshold has a low degree of manual intervention and has an objective reference value, making the current early warning threshold more reasonable and accurate.
本发明实施例提供的软件系统监控装置,通过机器学习模型动态地计算当前时刻的预警阈值,并动态跟踪当前的登录用户数,实时地监控系统的运行状态,提高了系统监控的准确性和时效性。The software system monitoring device provided by the embodiment of the present invention dynamically calculates the warning threshold at the current moment through the machine learning model, and dynamically tracks the current number of logged-in users, monitors the operating status of the system in real time, and improves the accuracy and timeliness of system monitoring sex.
图3为本发明实施例提供的电子设备的结构示意图,如图3所示,所述设备包括:处理器301、存储器302和总线303;FIG. 3 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention. As shown in FIG. 3 , the device includes: a
其中,处理器301和存储器302通过所述总线303完成相互间的通信;Wherein, the
处理器301用于调用存储器302中的程序指令,以执行上述各方法实施例所提供的方法,例如包括:The
获取当前时刻的登录用户数;Get the number of logged-in users at the current moment;
若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。If it is judged that the number of logged-in users at the current moment is greater than the early warning threshold at the current moment, an early warning message is issued.
本发明实施例提供一种计算机程序产品,所述计算机程序产品包括存储在非暂态计算机可读存储介质上的计算机程序,所述计算机程序包括程序指令,当所述程序指令被计算机执行时,计算机能够执行上述各方法实施例所提供的方法,例如包括:An embodiment of the present invention provides a computer program product, the computer program product includes a computer program stored on a non-transitory computer-readable storage medium, the computer program includes program instructions, and when the program instructions are executed by a computer, The computer can execute the methods provided by the above method embodiments, including, for example:
获取当前时刻的登录用户数;Get the number of logged-in users at the current moment;
若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。If it is judged that the number of logged-in users at the current moment is greater than the early warning threshold at the current moment, an early warning message is issued.
本发明实施例提供一种非暂态计算机可读存储介质,所述非暂态计算机可读存储介质存储计算机指令,所述计算机指令使所述计算机执行上述各方法实施例所提供的方法,例如包括:An embodiment of the present invention provides a non-transitory computer-readable storage medium, the non-transitory computer-readable storage medium stores computer instructions, and the computer instructions cause the computer to execute the methods provided in the above method embodiments, for example include:
获取当前时刻的登录用户数;Get the number of logged-in users at the current moment;
若判断获知当前时刻的登录用户数大于当前时刻的预警阈值,则发出预警信息。If it is judged that the number of logged-in users at the current moment is greater than the early warning threshold at the current moment, an early warning message is issued.
本领域普通技术人员可以理解:实现上述方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成,前述的程序可以存储于一计算机可读取存储介质中,该程序在执行时,执行包括上述方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps for realizing the above-mentioned method embodiments can be completed by hardware related to program instructions, and the aforementioned program can be stored in a computer-readable storage medium. When the program is executed, the It includes the steps of the above method embodiments; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other various media that can store program codes.
以上所描述的装置及设备等实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。本领域普通技术人员在不付出创造性的劳动的情况下,即可以理解并实施。The above-described embodiments of devices and equipment are merely illustrative, and the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, That is, it can be located in one place, or it can also be distributed to multiple network elements. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment. It can be understood and implemented by those skilled in the art without any creative efforts.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到各实施方式可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件。基于这样的理解,上述技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在计算机可读存储介质中,如ROM/RAM、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行各个实施例或者实施例的某些部分所述的方法。Through the above description of the implementations, those skilled in the art can clearly understand that each implementation can be implemented by means of software plus a necessary general hardware platform, and of course also by hardware. Based on this understanding, the essence of the above technical solution or the part that contributes to the prior art can be embodied in the form of software products, and the computer software products can be stored in computer-readable storage media, such as ROM/RAM, magnetic discs, optical discs, etc., including several instructions to make a computer device (which may be a personal computer, server, or network device, etc.) execute the methods described in various embodiments or some parts of the embodiments.
最后应说明的是:以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still be Modifications are made to the technical solutions described in the foregoing embodiments, or equivalent replacements are made to some of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the various embodiments of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811390496.2A CN111290903B (en) | 2018-11-21 | 2018-11-21 | Software system monitoring method and device based on user behavior and machine learning |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811390496.2A CN111290903B (en) | 2018-11-21 | 2018-11-21 | Software system monitoring method and device based on user behavior and machine learning |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111290903A CN111290903A (en) | 2020-06-16 |
CN111290903B true CN111290903B (en) | 2023-04-25 |
Family
ID=71026557
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811390496.2A Active CN111290903B (en) | 2018-11-21 | 2018-11-21 | Software system monitoring method and device based on user behavior and machine learning |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111290903B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114997497A (en) * | 2022-06-06 | 2022-09-02 | 北京月新时代科技股份有限公司 | Method and system for predicting software use duration |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102325062A (en) * | 2011-09-20 | 2012-01-18 | 北京神州绿盟信息安全科技股份有限公司 | Abnormal login detecting method and device |
CN102752142A (en) * | 2012-07-05 | 2012-10-24 | 深圳市易聆科信息技术有限公司 | Monitoring method and system based on multidimensional modeled information system |
CN103905229A (en) * | 2012-12-27 | 2014-07-02 | 中国移动通信集团四川有限公司 | Early-warning method and device for terminal-user loss |
CN105743706A (en) * | 2016-03-31 | 2016-07-06 | 武汉虹信技术服务有限责任公司 | Disk array performance monitoring and early-warning method and system based on HTTP |
CN106230849A (en) * | 2016-08-22 | 2016-12-14 | 中国科学院信息工程研究所 | A kind of smart machine machine learning safety monitoring system based on user behavior |
CN107276982A (en) * | 2017-05-08 | 2017-10-20 | 微梦创科网络科技(中国)有限公司 | A kind of abnormal login detecting method and device |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10776715B2 (en) * | 2017-04-28 | 2020-09-15 | Microsoft Technology Licensing, Llc | Artificial intelligent cognition threshold |
-
2018
- 2018-11-21 CN CN201811390496.2A patent/CN111290903B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102325062A (en) * | 2011-09-20 | 2012-01-18 | 北京神州绿盟信息安全科技股份有限公司 | Abnormal login detecting method and device |
CN102752142A (en) * | 2012-07-05 | 2012-10-24 | 深圳市易聆科信息技术有限公司 | Monitoring method and system based on multidimensional modeled information system |
CN103905229A (en) * | 2012-12-27 | 2014-07-02 | 中国移动通信集团四川有限公司 | Early-warning method and device for terminal-user loss |
CN105743706A (en) * | 2016-03-31 | 2016-07-06 | 武汉虹信技术服务有限责任公司 | Disk array performance monitoring and early-warning method and system based on HTTP |
CN106230849A (en) * | 2016-08-22 | 2016-12-14 | 中国科学院信息工程研究所 | A kind of smart machine machine learning safety monitoring system based on user behavior |
CN107276982A (en) * | 2017-05-08 | 2017-10-20 | 微梦创科网络科技(中国)有限公司 | A kind of abnormal login detecting method and device |
Non-Patent Citations (2)
Title |
---|
朱黎等.基于自适应LBP人脸识别的身份验证 .《 计算机工程与科学 》.2014,第36卷(第11期),2217-2222. * |
陈俊 ; .基于大数据机器学习技术的IT运营分析系统建设.计算机时代.2018,(第3期),66-69. * |
Also Published As
Publication number | Publication date |
---|---|
CN111290903A (en) | 2020-06-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9323599B1 (en) | Time series metric data modeling and prediction | |
JP7007243B2 (en) | Anomaly detection system | |
US7437281B1 (en) | System and method for monitoring and modeling system performance | |
CN112162878A (en) | Database fault discovery method and device, electronic equipment and storage medium | |
US20190041839A1 (en) | Apparatus for troubleshooting fault component in equipment and method thereof | |
US10860405B1 (en) | System operational analytics | |
CN111314173A (en) | Monitoring information abnormity positioning method and device, computer equipment and storage medium | |
US12039045B2 (en) | Event analysis in an electric power system | |
US7369967B1 (en) | System and method for monitoring and modeling system performance | |
CN112380089A (en) | Data center monitoring and early warning method and system | |
CN115220940A (en) | Time series anomaly prediction and alerts | |
CN110739637A (en) | A transmission line monitoring method, system, terminal and storage medium | |
CN111290903B (en) | Software system monitoring method and device based on user behavior and machine learning | |
US10733514B1 (en) | Methods and apparatus for multi-site time series data analysis | |
CN106294364B (en) | Method and device for realizing web crawler to capture webpage | |
CN108248641A (en) | A kind of urban track traffic data processing method and device | |
CN118761005A (en) | Fault diagnosis method, device and computer equipment based on digital twin | |
CN117851195A (en) | Computer host operation risk monitoring management and control system based on data analysis | |
CN116506278A (en) | An anomaly monitoring platform based on zabbix | |
CN117236555A (en) | User energy consumption prediction method, device and storage medium | |
CN108632082A (en) | A kind of prediction technique and device of the load information of server | |
CN115658451A (en) | GPU fault alarm method, device, equipment and medium | |
CN111815442B (en) | Link prediction method and device and electronic equipment | |
CN111934895B (en) | Intelligent early warning method and device for network management system and computing equipment | |
CN111416744A (en) | Method and device for monitoring and alarming on Internet line |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |