CN113971187A - A service monitoring method and device - Google Patents

A service monitoring method and device Download PDF

Info

Publication number
CN113971187A
CN113971187A CN202010721171.9A CN202010721171A CN113971187A CN 113971187 A CN113971187 A CN 113971187A CN 202010721171 A CN202010721171 A CN 202010721171A CN 113971187 A CN113971187 A CN 113971187A
Authority
CN
China
Prior art keywords
user
service
server
behavior
abnormal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010721171.9A
Other languages
Chinese (zh)
Inventor
徐海平
雷希
马晓骥
王小均
瞿航
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile IoT Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile IoT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile IoT Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN202010721171.9A priority Critical patent/CN113971187A/en
Publication of CN113971187A publication Critical patent/CN113971187A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3058Monitoring arrangements for monitoring environmental properties or parameters of the computing system or of the computing system component, e.g. monitoring of power, currents, temperature, humidity, position, vibrations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/1805Append-only file systems, e.g. using logs or journals to store data
    • G06F16/1815Journaling file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computing Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

本发明的实施例提供一种服务监控方法及装置。该方法包括:获取用户的操作日志;根据用户的操作日志,统计用户的业务操作行为;赋予用户的各业务操作行为不同的权重,得到用户加权业务操作;根据用户加权业务操作,确定用户的业务操作行为是否存在导致服务器异常的行为。本发明通过获取用户的操作日志,最终能精准、快速的识别出服务异常导致的原因,对由于用户的异常操作可能导致服务不稳定的行为及时识别,并提前预警和处理,具有操作简单、成本较低的优点。

Figure 202010721171

Embodiments of the present invention provide a service monitoring method and apparatus. The method includes: acquiring a user's operation log; counting the user's business operation behavior according to the user's operation log; assigning different weights to each business operation behavior of the user to obtain the user's weighted business operation; and determining the user's business operation according to the user's weighted business operation Whether there is any behavior that causes the server to be abnormal. By obtaining the user's operation log, the present invention can finally accurately and quickly identify the cause of service abnormality, identify in time the behavior that may lead to service instability due to the user's abnormal operation, and give early warning and processing, which has the advantages of simple operation and low cost. lower advantage.

Figure 202010721171

Description

Service monitoring method and device
Technical Field
The present invention relates to the technical field of service monitoring, and in particular, to a service monitoring method and apparatus.
Background
At present, existing service monitoring schemes can be roughly divided into two categories, one is to add a monitoring interface to a service or periodically adjust an existing interface to verify the availability of the service, and when the monitoring result shows that the interface is unavailable or monitoring information is abnormal, early warning is performed. The other type is that traditional monitoring is carried out based on CPU utilization rate, memory utilization rate, inflow and outflow of flow and the like of the server, when CPU, memory utilization rate and flow increase are monitored, service abnormity is identified, relevant operation and maintenance personnel are informed, abnormity is checked by the relevant personnel, and corresponding processing measures are taken according to response checking results.
The method that a monitoring interface is added to a service or an existing interface is periodically adjusted to verify the availability of the service needs to be modified in an intrusive mode on the original service in some existing schemes, and the service modification cost is high.
The monitoring of the operation condition of the server is the whole monitoring of the service platform, and when an exception occurs, the problem of the service or the error use of the user cannot be identified. It needs to be checked and analyzed by corresponding personnel to determine.
The prior art can give an early warning notification only after the service is abnormal, and can not avoid some service abnormalities in advance.
Disclosure of Invention
The invention aims to provide a service monitoring method and a service monitoring device, and aims to solve the problems of high cost and labor consumption of the conventional service monitoring scheme.
In order to solve the technical problems, the technical scheme of the invention is as follows:
in one aspect of the present invention, a service monitoring method is provided, including:
acquiring an operation log of a user;
counting the service operation behaviors of the user according to the operation log of the user;
giving different weights to each business operation behavior of the user to obtain the weighted business operation of the user;
and determining whether the service operation behavior of the user has a behavior causing server abnormity according to the user weighted service operation.
Further, giving different weights to each business operation behavior of the user to obtain the user weighted business operation, comprising:
and according to the preset proportion of occupying server resources, giving different weights to each business operation behavior of the user to obtain the user weighted business operation.
Further, the operation log of the user comprises at least one of the following items:
the information type comprises user service behavior information or service state information;
operating time;
user service type behavior operation information in a preset period;
the address and port of the server to which it belongs.
Further, determining whether the service operation behavior of the user has a behavior causing server abnormality according to the user weighted service operation, including:
adopting a polymerization algorithm for the user weighted service operation to obtain an abnormal value;
if the abnormal value is larger than a preset threshold value, determining that the current abnormal operation of the user is performed;
calling historical operation behaviors and historical abnormal operations of the user;
and judging whether the service operation behaviors of the user have behaviors causing server abnormity or not according to the historical operation behaviors, the historical abnormal operations and the current abnormal operations of the user, and if so, outputting a judgment result.
Further, the method further comprises:
acquiring health state information reported by a server;
and judging whether the service reliability is low or not according to the health state information, and if so, outputting a judgment result.
Further, the server health status information includes:
the information type comprises user behavior information or service state information;
the number of online devices of the current server;
current server CPU utilization;
the current server memory usage rate;
the current service condition of the server disk space;
the uplink data flow of the server in a preset period;
the server downlink data flow in a preset period;
the address and port of the server.
Further, if the health status information includes at least one of the following conditions, it is determined that the service reliability is low:
the health state information reported by the server is not obtained in N continuous preset periods;
the uplink/downlink data traffic is suddenly increased or decreased;
abnormity occurs in the availability/CPU utilization rate/memory utilization rate/disk space utilization condition of the server.
Further, the method further comprises:
and early warning is carried out according to the judgment result.
In another aspect of the present invention, there is provided a service monitoring apparatus, including:
the acquisition module is used for acquiring an operation log of a user;
the statistical module is used for counting the business operation behaviors of the user according to the operation log of the user;
the weighting module is used for giving different weights to each business operation behavior of the user to obtain the weighted business operation of the user;
and the judging module is used for determining whether the service operation behavior of the user has a behavior causing the server abnormity according to the user weighted service operation.
Further, the weighting module is specifically configured to:
and according to the preset proportion of occupying server resources, giving different weights to each business operation behavior of the user to obtain the user weighted business operation.
The scheme of the invention at least comprises the following beneficial effects:
according to the scheme, the reason caused by the abnormal service can be accurately and quickly identified finally by acquiring the operation log of the user, the behavior possibly causing the unstable service due to the abnormal operation (such as pressure measurement) of the user can be identified in time, early warning and processing are carried out in advance, and the method and the device have the advantages of being simple in operation and low in cost.
Drawings
FIG. 1 is a step diagram of a service monitoring method of the present invention;
fig. 2 is a flowchart of step S4;
FIG. 3 is a device connection diagram of a service monitoring apparatus of the present invention;
fig. 4 is a flow chart of the operation of a service monitoring method of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
As shown in fig. 1, an embodiment of the present invention provides a service monitoring method, including:
s1, acquiring an operation log of the user;
s2, counting the service operation behavior of the user according to the operation log of the user;
s3, giving different weights to each business operation behavior of the user to obtain the weighted business operation of the user;
and S4, determining whether the business operation behaviors of the user have behaviors causing server abnormity according to the user weighted business operation.
According to the scheme, the reason caused by the abnormal service can be accurately and quickly identified finally by acquiring the operation log of the user, the behavior possibly causing the unstable service due to the abnormal operation (such as pressure measurement) of the user can be identified in time, early warning and processing are carried out in advance, and the method and the device have the advantages of being simple in operation and low in cost.
In an optional embodiment of the present invention, step S3 assigns different weights to each business operation behavior of the user to obtain a user weighted business operation, including:
and according to the preset proportion of occupying server resources, giving different weights to each business operation behavior of the user to obtain the user weighted business operation.
The rule to assign weights may be: the operation occupying more server resources for one service has higher weight; the weight occupied by infrequently used resources in normal services is higher. The method can be preset according to actual conditions and user requirements so as to improve adaptability and accuracy of final results.
In an optional embodiment of the present invention, the operation log of the user includes, but is not limited to, at least one of the following:
the information type comprises user service behavior information or service state information;
operating time;
behavior operation information of user service types (the user service types can be marked as 1, 2 and … … N) in a preset period;
ip address and port of the server to which it belongs.
By utilizing the information, whether the service operation behavior of the user has the behavior causing the server abnormity can be accurately identified.
Referring to fig. 2, in an alternative embodiment of the present invention, the step S4 determines whether there is a behavior causing server exception in the business operation behavior of the user according to the user-weighted business operation, including:
adopting a polymerization algorithm for the user weighted service operation to obtain an abnormal value;
if the abnormal value is larger than a preset threshold value, determining that the current abnormal operation of the user is performed;
calling historical operation behaviors and historical abnormal operations of the user;
and judging whether the service operation behaviors of the user have behaviors causing server abnormity or not according to the historical operation behaviors, the historical abnormal operations and the current abnormal operations of the user, and if so, outputting a judgment result.
If the abnormal value is larger than the preset threshold value, the possibility of abnormal operation of the user is indicated, then the weight distribution of each service is further analyzed for the information with the possibility of abnormal operation, and whether the abnormal behavior of the server caused by pressure measurement, attack, error use and the like exists in the operation of the user is judged by combining the previous operation behavior condition of the user and a determined case (such as consistent weight distribution for a plurality of times continuously and large weight ratio of the operation of the abnormal service), so that the accuracy of the judgment result is improved.
In an optional embodiment of the invention, the method further comprises:
acquiring health state information reported by a server;
and judging whether the service reliability is low or not according to the health state information, and if so, outputting a judgment result.
Not only does the possibility of whether the user has abnormal operation be considered, but also the server is judged, and the monitoring accuracy is improved.
In an optional embodiment of the present invention, the server health status information includes:
the information type comprises user behavior information or service state information;
the number of online devices of the current server;
current server CPU utilization;
the current server memory usage rate;
the current service condition of the server disk space;
the uplink data flow of the server in a preset period;
the server downlink data flow in a preset period;
the ip address and port of the server itself.
The situation basically covers basic data of the server which can cause problems, and the reliability of the server is judged by using the information, so that the method is comprehensive and is beneficial to improving the accuracy of the judgment result.
In an optional embodiment of the present invention, if the health status information includes at least one of the following conditions, it is determined that the service reliability is low:
the health state information reported by the server is not obtained in N continuous preset periods;
the uplink/downlink data traffic is suddenly increased or decreased;
abnormity occurs in the availability/CPU utilization rate/memory utilization rate/disk space utilization condition of the server.
In an optional embodiment of the invention, the method further comprises:
and early warning is carried out according to the judgment result.
The mode that can send the judged result to monitor terminal carries out the early warning to the control personnel, and is more convenient, swift, helps the control personnel in time to discover and handle the problem.
In an optional embodiment of the invention, the method further comprises:
and processing according to the judgment result.
The processing mode can be as follows: the temporary blocking or black-drawing processing is carried out on the user account, or the business data processing of the user with abnormal operation is transferred to a temporary area, and the temporary area only provides basic limited services or restarts corresponding services.
As shown in fig. 3, an embodiment of the present invention further provides a service monitoring apparatus, including:
the acquisition module is used for acquiring an operation log of a user;
the statistical module is used for counting the business operation behaviors of the user according to the operation log of the user;
the weighting module is used for giving different weights to each business operation behavior of the user to obtain the weighted business operation of the user;
and the judging module is used for determining whether the service operation behavior of the user has a behavior causing the server abnormity according to the user weighted service operation.
According to the scheme, the reason caused by the abnormal service can be accurately and quickly identified finally by acquiring the operation log of the user, the behavior possibly causing the unstable service due to the abnormal operation (such as pressure measurement) of the user can be identified in time, early warning and processing are carried out in advance, and the method and the device have the advantages of being simple in operation and low in cost.
In an optional embodiment of the present invention, the weighting module is specifically configured to:
and according to the preset proportion of occupying server resources, giving different weights to each business operation behavior of the user to obtain the user weighted business operation.
The rule to assign weights may be: the operation occupying more server resources for one service has higher weight; the weight occupied by infrequently used resources in normal services is higher. The method can be preset according to actual conditions and user requirements so as to improve adaptability and accuracy of final results.
In an optional embodiment of the present invention, the operation log of the user includes, but is not limited to, at least one of the following:
the information type comprises user service behavior information or service state information;
operating time;
behavior operation information of user service types (the user service types can be marked as 1, 2 and … … N) in a preset period;
ip address and port of the server to which it belongs.
By utilizing the information, whether the service operation behavior of the user has the behavior causing the server abnormity can be accurately identified.
In an optional embodiment of the present invention, the determining module is specifically configured to:
adopting a polymerization algorithm for the user weighted service operation to obtain an abnormal value;
if the abnormal value is larger than a preset threshold value, determining that the current abnormal operation of the user is performed;
calling historical operation behaviors and historical abnormal operations of the user;
and judging whether the service operation behaviors of the user have behaviors causing server abnormity or not according to the historical operation behaviors, the historical abnormal operations and the current abnormal operations of the user, and if so, outputting a judgment result.
If the abnormal value is larger than the preset threshold value, the possibility of abnormal operation of the user is indicated, then the weight distribution of each service is further analyzed for the information with the possibility of abnormal operation, and whether the abnormal behavior of the server caused by pressure measurement, attack, error use and the like exists in the operation of the user is judged by combining the previous operation behavior condition of the user and a determined case (such as consistent weight distribution for a plurality of times continuously and large weight ratio of the operation of the abnormal service), so that the accuracy of the judgment result is improved.
In an optional embodiment of the present invention, the obtaining module is further configured to:
acquiring health state information reported by a server;
and judging whether the service reliability is low or not according to the health state information, and if so, outputting a judgment result.
Not only does the possibility of whether the user has abnormal operation be considered, but also the server is judged, and the monitoring accuracy is improved.
In an optional embodiment of the present invention, the server health status information includes:
the information type comprises user behavior information or service state information;
the number of online devices of the current server;
current server CPU utilization;
the current server memory usage rate;
the current service condition of the server disk space;
the uplink data flow of the server in a preset period;
the server downlink data flow in a preset period;
the address and port of the server.
The situation basically covers basic data of the server which can cause problems, and the reliability of the server is judged by using the information, so that the method is comprehensive and is beneficial to improving the accuracy of the judgment result.
In an optional embodiment of the present invention, if the health status information includes at least one of the following conditions, it is determined that the service reliability is low:
the health state information reported by the server is not obtained in N continuous preset periods;
the uplink/downlink data traffic is suddenly increased or decreased;
abnormity occurs in the availability/CPU utilization rate/memory utilization rate/disk space utilization condition of the server.
In an optional embodiment of the invention, the apparatus further comprises:
and the early warning module is used for carrying out early warning according to the judgment result.
The mode that can send the judged result to monitor terminal carries out the early warning to the control personnel, and is more convenient, swift, helps the control personnel in time to discover and handle the problem.
In an optional embodiment of the invention, the apparatus further comprises:
and the processing module is used for processing according to the judgment result.
The processing mode can be as follows: the temporary blocking or black-drawing processing is carried out on the user account, or the business data processing of the user with abnormal operation is transferred to a temporary area, and the temporary area only provides basic limited services or restarts corresponding services.
It should be noted that the apparatus is an apparatus corresponding to the method described in fig. 1, and all the implementations of the illustrated method are applicable to the embodiment of the apparatus, and the same technical effects can be achieved.
Referring to fig. 4, a workflow of a service monitoring method according to an embodiment of the present invention is:
fi l ebeat (which is open-source third-party software, is applied in a lightweight way, filters the operation log of a user from a server with little overhead, and has little influence on the service) pulls the operation log of the user from the server to a message queue middleware Kafka (the message queue middleware Kafka is high-concurrency and low-delay distributed middleware and is used for caching the operation log information of the user, the operation behavior data of the user and the health state data of a server). The user business behavior statistical module (including the functions of the acquisition module and the statistical module) pulls the operation log of the user from the kafka, acquires the health state data of the server, periodically (for example, 1 minute) counts the business operation behavior information of the user and the health state information of the server, and sends the business operation behavior information and the health state information of the server to the message queue middleware kakfa. And the user behavior analysis and service health analysis module (comprising the function of the judgment module) pulls the service operation behavior information and the server health state information of the user from the kafka, and analyzes and judges whether the behavior of the user, such as pressure measurement, attack and the like, causes abnormal behaviors of the server. And analyzing the current service reliability level according to the health status. The abnormity early warning and informing module (comprising the functions of the early warning module) analyzes the user behavior and judges and early warns the service health analysis module, and can inform operation and maintenance personnel and users by sending early warning information to the terminal. The early warning information can be stored, classified according to the type and the requirement of the event, and informed to relevant operation and maintenance personnel and users in an email or short message mode, so that subsequent statistical analysis is facilitated. To prevent too many emails or short messages, the following notification strategy may be adopted: and for the short message or the mail for informing the operation and maintenance personnel, checking whether an unprocessed alarm exists every 1 hour, and if so, sending the mail or the short message to inform the operation and maintenance personnel to process as soon as possible. And when a new alarm is generated, checking whether unprocessed information exists, if so, notifying within one hour before, not repeating the notification, and if not, sending an email or short message notification. The short message or the mail for notifying the user is notified only once, and the short message or the mail notification can not be continued until the user work order is processed. The abnormal processing module (namely, the processing module) performs abnormal intelligent processing according to the judgment of the user behavior analysis and service health analysis module, and the processing mode can be temporary blocking or black-drawing processing on a user account, or processing and transferring the business data of the user with abnormal operation to a temporary area, wherein the temporary area only provides basic limited service, or restarts corresponding service.
In the embodiment of the invention, each behavior of the user is counted, and the weight of different services is given according to the principle that the operation which occupies more server resources in one service occupies higher weight and the operation which occupies less frequently used resources in normal services occupies higher weight. And judging whether the user has behavior (such as pressure measurement) causing unstable service according to the weight aggregation calculation of each service of the user and the weight distribution of the service in front of the user. After the user is judged to have the action of causing the service instability, a certain processing scheme is adopted to ensure the stability of the service, and the intervention is not carried out until the visible fault of the service occurs. And judging whether the current server normally operates according to the number of online users reported by the server, the service data flow and the like so as to evaluate the health degree and reliability of the service.
According to the service monitoring method and device provided by the embodiment of the invention, based on the monitoring scheme of user operation, the reasons of abnormal pressure measurement users and abnormal service can be accurately identified, the behavior possibly causing unstable service can be identified and processed in advance, and the stability of the service is ensured; case scenes can be accumulated, and operation and maintenance and users are actively informed when abnormality occurs; scenes can be accumulated according to the operation condition of the user, and processing measures such as black drawing, temporary area switching and the like can be actively taken.
While the foregoing is directed to the preferred embodiment of the present invention, it will be understood by those skilled in the art that various changes and modifications may be made without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1.一种服务监控方法,其特征在于,包括:1. a service monitoring method, is characterized in that, comprises: 获取用户的操作日志;Get the user's operation log; 根据用户的操作日志,统计用户的业务操作行为;According to the user's operation log, the user's business operation behavior is counted; 赋予用户的各业务操作行为不同的权重,得到用户加权业务操作;Different weights are given to each business operation behavior of the user, and the user's weighted business operation is obtained; 根据用户加权业务操作,确定用户的业务操作行为是否存在导致服务器异常的行为。According to the user's weighted service operations, it is determined whether the user's service operation behavior has any behavior that causes the server to be abnormal. 2.根据权利要求1所述的服务监控方法,其特征在于,赋予用户的各业务操作行为不同的权重,得到用户加权业务操作,包括:2. service monitoring method according to claim 1, is characterized in that, giving each business operation behavior of user different weight, obtains user's weighted business operation, comprising: 根据预设的占用服务器资源的比重,赋予用户的各业务操作行为不同的权重,得到用户加权业务操作。According to the preset proportion of occupied server resources, different weights are given to each service operation behavior of the user, and the user weighted service operation is obtained. 3.根据权利要求2所述的服务监控方法,其特征在于,用户的操作日志,包括以下至少一项:3. The service monitoring method according to claim 2, wherein the operation log of the user includes at least one of the following: 信息类型,包括用户业务行为信息或是服务状态信息;Information type, including user business behavior information or service status information; 操作时间;operating time; 一个预设周期内用户业务类型行为操作信息;User service type behavior operation information within a preset period; 所属服务器的地址和端口。The address and port of the owning server. 4.根据权利要求1所述的服务监控方法,其特征在于,根据用户加权业务操作,确定用户的业务操作行为是否存在导致服务器异常的行为,包括:4. The service monitoring method according to claim 1, wherein, according to the user's weighted business operation, it is determined whether the user's business operation behavior has the behavior that causes the server to be abnormal, comprising: 对用户加权业务操作采用聚合算法,得到一个异常值;Aggregation algorithm is used for user-weighted business operations to obtain an outlier; 若该异常值大于预设阈值,则确定为该用户当前异常操作;If the abnormal value is greater than the preset threshold, it is determined that the user is currently operating abnormally; 调取该用户的历史操作行为和历史异常操作;Retrieve the user's historical operation behavior and historical abnormal operations; 根据用户的历史操作行为、历史异常操作和当前异常操作,判断用户的业务操作行为是否存在导致服务器异常的行为,若存在,则输出判断结果。According to the user's historical operation behavior, historical abnormal operation, and current abnormal operation, determine whether the user's business operation behavior has any behavior that causes the server to be abnormal, and if so, output the judgment result. 5.根据权利要求1或4所述的服务监控方法,其特征在于,还包括:5. The service monitoring method according to claim 1 or 4, characterized in that, further comprising: 获取服务器上报的健康状态信息;Obtain the health status information reported by the server; 根据该健康状态信息,判断服务可靠性是否为低,若是,则输出判断结果。According to the health state information, it is judged whether the service reliability is low, and if so, the judgment result is output. 6.根据权利要求5所述的服务监控方法,其特征在于,所述服务器健康状态信息,包括:6. The service monitoring method according to claim 5, wherein the server health status information comprises: 信息类型,包括用户行为信息或是服务状态信息;Type of information, including user behavior information or service status information; 当前服务器在线设备数量;The current number of online devices on the server; 当前服务器CPU使用率;Current server CPU usage; 当前服务器内存使用率;Current server memory usage; 当前服务器磁盘空间使用情况;Current server disk space usage; 一个预设周期内服务器上行数据流量;The upstream data traffic of the server within a preset period; 一个预设周期内服务器下行数据流量;Server downlink data traffic within a preset period; 服务器的地址和端口。The address and port of the server. 7.根据权利要求6所述的服务监控方法,其特征在于,若该健康状态信息中包括以下情况的至少一个,则判断服务可靠性为低:7. The service monitoring method according to claim 6, wherein, if the health status information includes at least one of the following conditions, it is determined that the service reliability is low: 连续N个预设周期未获取到服务器上报的健康状态信息;The health status information reported by the server is not obtained for N consecutive preset cycles; 上/下行数据流量突增或突减;Sudden increase or decrease of uplink/downlink data traffic; 服务器的可用性/CPU使用率/内存使用率/磁盘空间使用情况出现异常。The server's availability/CPU usage/memory usage/disk space usage is abnormal. 8.根据权利要求4所述的服务监控方法,其特征在于,还包括:8. The service monitoring method according to claim 4, further comprising: 根据判断结果,进行预警。According to the judgment result, an early warning is given. 9.一种服务监控装置,其特征在于,包括:9. A service monitoring device, comprising: 获取模块,用于获取用户的操作日志;The acquisition module is used to acquire the user's operation log; 统计模块,用于根据用户的操作日志,统计用户的业务操作行为;The statistics module is used to count the user's business operation behavior according to the user's operation log; 加权模块,用于赋予用户的各业务操作行为不同的权重,得到用户加权业务操作;The weighting module is used to assign different weights to the user's business operation behaviors to obtain the user's weighted business operations; 判断模块,用于根据用户加权业务操作,确定用户的业务操作行为是否存在导致服务器异常的行为。The judgment module is used to determine whether the user's business operation behavior has any behavior that causes the server to be abnormal according to the user's weighted business operation. 10.根据权利要求9所述的服务监控装置,其特征在于,加权模块,具体用于:10. The service monitoring device according to claim 9, wherein the weighting module is specifically used for: 根据预设的占用服务器资源的比重,赋予用户的各业务操作行为不同的权重,得到用户加权业务操作。According to the preset proportion of occupied server resources, different weights are given to each service operation behavior of the user, and the user weighted service operation is obtained.
CN202010721171.9A 2020-07-24 2020-07-24 A service monitoring method and device Pending CN113971187A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010721171.9A CN113971187A (en) 2020-07-24 2020-07-24 A service monitoring method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010721171.9A CN113971187A (en) 2020-07-24 2020-07-24 A service monitoring method and device

Publications (1)

Publication Number Publication Date
CN113971187A true CN113971187A (en) 2022-01-25

Family

ID=79585800

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010721171.9A Pending CN113971187A (en) 2020-07-24 2020-07-24 A service monitoring method and device

Country Status (1)

Country Link
CN (1) CN113971187A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108268354A (en) * 2016-12-30 2018-07-10 腾讯科技(深圳)有限公司 Data safety monitoring method, background server, terminal and system
CN108512806A (en) * 2017-02-24 2018-09-07 中国移动通信集团公司 A kind of operation behavior analysis method and server based on virtual environment
CN109525611A (en) * 2019-01-11 2019-03-26 新华三信息安全技术有限公司 A kind of abnormal outgoing behavioral value method and device of Intranet user
CN110502495A (en) * 2019-09-02 2019-11-26 中国工商银行股份有限公司 A kind of log collecting method and device of application server

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108268354A (en) * 2016-12-30 2018-07-10 腾讯科技(深圳)有限公司 Data safety monitoring method, background server, terminal and system
CN108512806A (en) * 2017-02-24 2018-09-07 中国移动通信集团公司 A kind of operation behavior analysis method and server based on virtual environment
CN109525611A (en) * 2019-01-11 2019-03-26 新华三信息安全技术有限公司 A kind of abnormal outgoing behavioral value method and device of Intranet user
CN110502495A (en) * 2019-09-02 2019-11-26 中国工商银行股份有限公司 A kind of log collecting method and device of application server

Similar Documents

Publication Publication Date Title
US6993681B2 (en) Remote administration in a distributed system
CN100490397C (en) Alarm processing method and alarming system
CN106961352B (en) Monitoring system and monitoring method
US9070121B2 (en) Approach for prioritizing network alerts
CN109660380A (en) Monitoring method, platform, system and the readable storage medium storing program for executing of operation condition of server
WO2019090998A1 (en) Data monitoring method and apparatus, computer device and storage medium
CN108989135B (en) Network equipment fault detection method and device
CN103392176B (en) For predicting the apparatus and method that network event spreads unchecked
US8862119B2 (en) Method and apparatus for telecommunications network performance anomaly events detection and notification
CN113824768B (en) Health check method and device in load balancing system and flow forwarding method
US10896073B1 (en) Actionability metric generation for events
JP2011154483A (en) Failure detection device, program, and failure detection method
WO2023138058A1 (en) Alarm event processing method and apparatus, and computer-readable storage medium
EP3996348A1 (en) Predicting performance of a network order fulfillment system
CN111130912A (en) Abnormal location method, server and storage medium for content distribution network
CN113971187A (en) A service monitoring method and device
CN111949421B (en) SDK calling method, device, electronic equipment and computer readable storage medium
CN112100029A (en) Client state monitoring method and related components
CN113391611A (en) Early warning method, device and system for dynamic environment monitoring system
CN116185781A (en) Monitoring alarm method and device
CN116974869A (en) Index data monitoring method and device, electronic equipment and storage medium
CN113836043B (en) Test case based self-maintenance method and device for middlebox and storage medium
CN112969151B (en) Short message monitoring method, device and equipment
KR100697079B1 (en) Central control system of multi-air conditioner and its error notification method
CN113835961B (en) Alarm information monitoring method, device, server and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20220125