CN112817815A - Network server fault warning system based on business layer monitoring big data - Google Patents
Network server fault warning system based on business layer monitoring big data Download PDFInfo
- Publication number
- CN112817815A CN112817815A CN202110085356.XA CN202110085356A CN112817815A CN 112817815 A CN112817815 A CN 112817815A CN 202110085356 A CN202110085356 A CN 202110085356A CN 112817815 A CN112817815 A CN 112817815A
- Authority
- CN
- China
- Prior art keywords
- module
- server
- data
- cloud computing
- alarm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000012544 monitoring process Methods 0.000 title claims abstract description 29
- 238000013500 data storage Methods 0.000 claims abstract description 17
- 238000004891 communication Methods 0.000 claims description 20
- 230000002159 abnormal effect Effects 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 5
- 238000001514 detection method Methods 0.000 claims description 4
- 230000003862 health status Effects 0.000 claims description 3
- 230000036541 health Effects 0.000 description 5
- 238000000034 method Methods 0.000 description 5
- 230000008569 process Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 238000012423 maintenance Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000013024 troubleshooting Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3003—Monitoring arrangements specially adapted to the computing system or computing system component being monitored
- G06F11/3006—Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/3051—Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L63/00—Network architectures or network communication protocols for network security
- H04L63/08—Network architectures or network communication protocols for network security for authentication of entities
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
- H04L67/025—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP] for remote control or remote monitoring of applications
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computing Systems (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Computer Networks & Wireless Communication (AREA)
- Computer Hardware Design (AREA)
- Computer Security & Cryptography (AREA)
- Mathematical Physics (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
The invention relates to the technical field of network server fault alarm, and discloses a network server fault alarm system based on service layer monitoring big data, which comprises: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side; the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is respectively communicated with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module. The invention solves the problems of how to quickly and timely find fault points in monitoring logs reported by a plurality of services and intelligently analyze alarms.
Description
Technical Field
The invention relates to the technical field of network server fault alarm, in particular to a network server fault alarm system based on service layer monitoring big data.
Background
With the increasing depth of the internet, the number of servers and the service scale of the internet company are rapidly increasing, and meanwhile, the monitoring service also brings challenges to the high availability of the service and the multidimensional, intelligent and real-time performance of the monitoring service. Most of the traditional server monitoring systems are designed for hardware performance indexes of a server, such as CPU utilization rate, network I/O flow, residual amount of disk space, memory utilization rate, JVM stack memory and the like, and most of the traditional server monitoring systems are designed according to a mode of manually setting a threshold value and an alarm rule. After the alarm is sent out, the operation and maintenance personnel carry out manual fault location and troubleshooting. In the process, most of the operation is performed by depending on the experience of operation and maintenance personnel, which causes the problem of low efficiency.
Whereas, from a business level, a single product may have hundreds of millions of service monitoring logs per day. Therefore, how to quickly and timely find fault points in monitoring logs reported by a plurality of services and intelligently analyze alarms is a problem to be solved.
Disclosure of Invention
Technical problem to be solved
Aiming at the defects of the prior art, the invention provides a network server fault alarm system for monitoring big data based on a service layer, which aims to solve the technical problems of how to quickly and timely find fault points in monitoring logs reported by a plurality of services and intelligently analyze alarms.
(II) technical scheme
In order to achieve the purpose, the invention provides the following technical scheme:
a network server fault warning system based on service layer monitoring big data comprises: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side;
the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is in communication connection with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module respectively.
Further, the central computing module comprises a real-time computing submodule and an off-line computing submodule.
Furthermore, the real-time computation submodule calls data to the data storage module and the standard parameter setting module, performs alarm detection according to rules preset by the standard parameter setting module, and outputs a service health state report of a service layer to the fault alarm module; and the offline calculation submodule calls data from the data storage module, analyzes the data as historical data, and predicts a monitoring index and an alarm rule of the next period so as to provide a standard parameter setting module for use.
Further, the fault alarm module analyzes the monitoring index according to the service health status report of the service layer, judges whether the index value is abnormal or not, needs to send an alarm or not, and sends alarm information at the first time when the network service is abnormal.
(III) advantageous technical effects
Compared with the prior art, the invention has the following beneficial technical effects:
the data are called to the data storage module through the off-line calculation submodule and are used as historical data to be analyzed, and the monitoring index and the alarm rule of the next period are automatically predicted so as to be provided for the standard parameter setting module to use; alarm detection is carried out through a real-time calculation submodule according to rules preset by a standard parameter setting module, and a service layer service health state report is output to a fault alarm module; the fault alarm module analyzes the monitoring index according to the service health state report of the service layer, judges whether the index value is abnormal or not, and sends out an alarm or not, and sends out alarm information at the first time when the network service is abnormal, thereby realizing the technical effects of quickly and timely finding out a fault point and intelligently analyzing the alarm.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
A network server fault warning system based on service layer monitoring big data comprises: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side;
the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is in communication connection with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module respectively;
the system comprises a data acquisition module, a data storage module and a standard parameter setting module, wherein the data acquisition module is used for acquiring monitoring log data of a network service business layer, the data storage module is used for storing the monitoring log data of the network service business layer, and the standard parameter setting module is used for defining monitoring indexes and configuring alarm rules;
the central computing module comprises a real-time computing submodule and an off-line computing submodule, the real-time computing submodule calls data to the data storage module and the standard parameter setting module, performs alarm detection according to rules preset by the standard parameter setting module, and outputs a service layer service health state report to the fault alarm module; the off-line calculation submodule calls data from the data storage module, analyzes the data as historical data, and predicts a monitoring index and an alarm rule of the next period so as to provide a standard parameter setting module for use;
the fault alarm module analyzes the monitoring index according to the service health state report of the service layer, judges whether the index value is abnormal or not, needs to send an alarm or not, and sends alarm information at the first time when the network service is abnormal;
further, installing and operating the server software with the communication authority authentication system on the operating system of the computer terminal PCTnsfa;
in order to prevent an illegal network node impersonating a cloud computing server CCSnsfa from sending false network service fault alarm information to a computer terminal PCTnsfa through a server fault alarm system, before the computer terminal PCTnsfa reads the alarm information sent by the cloud computing server CCSnsfa, a communication authority authentication system authenticates the identity of the cloud computing server CCSnsfa, and the authentication method specifically comprises the following steps:
step one, a cloud computing server CCSnsfa registers communication authority on a communication authority authentication system, which specifically comprises the following steps:
the communication authority authentication system firstly selects two large prime numbers alpha and beta, then calculates lambda as alpha beta, then secretly stores the values of the prime numbers alpha and beta, and discloses the lambda to a cloud computing server CCSnsfa;
the cloud computing server CCSnsfa selects a communication private key k (k is more than or equal to 1 and less than or equal to lambda-1), calculates l as kmod lambda, takes l as a communication public key, and then discloses the communication public key l to a communication authority authentication system;
step two, when the cloud computing server CCSnsfa sends network service fault warning information to the computer terminal PCTnsfa, the communication authority authentication system authenticates the identity of the cloud computing server CCSnsfa, which specifically comprises the following steps:
the cloud computing server CCSnsfa randomly selects a numerical value m (m is more than or equal to 1 and less than or equal to lambda-1), and p is calculated as m2mod lambda and send p to the communication authority authentication system;
the communication authority authentication system randomly selects a value e belonging to {0,1}, and sends the value e to the cloud computing server CCSnsfa;
cloud computing server CCSnsfa calculates q ═ mkeAnd sending q to a communication authority authentication system;
verification equation q of communication authority authentication system2modλ=pleWhether the result is true or not;
if the equation is established, the cloud computing server CCSnsfa is proved to know the communication private key k and has legal communication authority, and the computer terminal PCTnsfa receives alarm information sent by the cloud computing server CCSnsfa;
in the authentication process, the key k only participates in operation in the authentication process and is not transmitted in communication, so that an illegal tracker cannot intercept the key in a circuit, and the identity authentication process of the cloud computing server CCSnsfa is zero-knowledge.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.
Claims (4)
1. A network server fault alarm system based on service layer monitoring big data is characterized by comprising: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side;
the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is in communication connection with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module respectively.
2. The business layer monitoring big data-based network server fault alarm system of claim 1, wherein the central computation module comprises a real-time computation submodule and an off-line computation submodule.
3. The network server fault alarm system based on business layer big data monitoring of claim 2, wherein the real-time computation submodule calls data to the data storage module and the standard parameter setting module, performs alarm detection according to rules preset by the standard parameter setting module, and outputs a business layer service health status report to the fault alarm module; and the offline calculation submodule calls data from the data storage module, analyzes the data as historical data, and predicts a monitoring index and an alarm rule of the next period so as to provide a standard parameter setting module for use.
4. The system of claim 3, wherein the fault alarm module analyzes the monitoring index according to the service health status report of the service layer, determines whether the index value is abnormal, needs to send an alarm, and sends an alarm message at the first time when the network service is abnormal.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110085356.XA CN112817815A (en) | 2021-01-22 | 2021-01-22 | Network server fault warning system based on business layer monitoring big data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110085356.XA CN112817815A (en) | 2021-01-22 | 2021-01-22 | Network server fault warning system based on business layer monitoring big data |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112817815A true CN112817815A (en) | 2021-05-18 |
Family
ID=75858790
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110085356.XA Pending CN112817815A (en) | 2021-01-22 | 2021-01-22 | Network server fault warning system based on business layer monitoring big data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112817815A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113448763A (en) * | 2021-07-16 | 2021-09-28 | 广东电网有限责任公司 | Dynamic expansion grouping alarm service method for full life cycle management |
CN115277358A (en) * | 2022-02-10 | 2022-11-01 | 上海贝加信息技术有限公司 | Network server monitoring method and storage medium |
CN117221151A (en) * | 2023-09-12 | 2023-12-12 | 北京城建智控科技股份有限公司 | Visual management device and method for cloud computing storage |
-
2021
- 2021-01-22 CN CN202110085356.XA patent/CN112817815A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113448763A (en) * | 2021-07-16 | 2021-09-28 | 广东电网有限责任公司 | Dynamic expansion grouping alarm service method for full life cycle management |
CN115277358A (en) * | 2022-02-10 | 2022-11-01 | 上海贝加信息技术有限公司 | Network server monitoring method and storage medium |
CN117221151A (en) * | 2023-09-12 | 2023-12-12 | 北京城建智控科技股份有限公司 | Visual management device and method for cloud computing storage |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112817815A (en) | Network server fault warning system based on business layer monitoring big data | |
US11968077B2 (en) | Link fault monitoring method and apparatus | |
US10148540B2 (en) | System and method for anomaly detection in information technology operations | |
Valdes et al. | Communication pattern anomaly detection in process control systems | |
US9967169B2 (en) | Detecting network conditions based on correlation between trend lines | |
CN112866185B (en) | Network traffic monitoring device and abnormal traffic detection method | |
WO2020228276A1 (en) | Network alert method and device | |
CN107888452B (en) | 24-hour distributed website performance monitoring and real-time alarming method | |
WO2017080161A1 (en) | Alarm information processing method and device in cloud computing | |
CN104052634A (en) | Information security monitoring system and method | |
CN108092847A (en) | A kind of electric power LTE wireless terminal remote on-line monitoring methods | |
CN112468592A (en) | Terminal online state detection method and system based on electric power information acquisition | |
CN112583643A (en) | Cross-device alarm correlation method | |
CN112769622A (en) | Cluster service fault early warning system based on RPC service monitoring | |
CN116418653A (en) | Fault positioning method and device based on multi-index root cause positioning algorithm | |
CN117520096B (en) | Intelligent server safety monitoring system | |
CN109510730B (en) | Distributed system, monitoring method and device thereof, electronic equipment and storage medium | |
US10110440B2 (en) | Detecting network conditions based on derivatives of event trending | |
CN110730087A (en) | Method and device for processing alarm storm | |
CN117118662A (en) | Zero trust mechanism-oriented power system running state safety protection system | |
CN112688970B (en) | Large-traffic DDoS attack detection method and system based on programmable chip | |
CN110737565A (en) | data monitoring method, device, electronic equipment and storage medium | |
CN115225534A (en) | Method for monitoring running state of monitoring server | |
CN114296979A (en) | Method and device for detecting abnormal state of Internet of things equipment | |
CN115834330B (en) | Group obstacle detection method, device, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20210518 |
|
WD01 | Invention patent application deemed withdrawn after publication |