CN112817815A - Network server fault warning system based on business layer monitoring big data - Google Patents

Network server fault warning system based on business layer monitoring big data Download PDF

Info

Publication number
CN112817815A
CN112817815A CN202110085356.XA CN202110085356A CN112817815A CN 112817815 A CN112817815 A CN 112817815A CN 202110085356 A CN202110085356 A CN 202110085356A CN 112817815 A CN112817815 A CN 112817815A
Authority
CN
China
Prior art keywords
module
server
data
cloud computing
alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110085356.XA
Other languages
Chinese (zh)
Inventor
王希亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202110085356.XA priority Critical patent/CN112817815A/en
Publication of CN112817815A publication Critical patent/CN112817815A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3051Monitoring arrangements for monitoring the configuration of the computing system or of the computing system component, e.g. monitoring the presence of processing resources, peripherals, I/O links, software programs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00Network architectures or network communication protocols for network security
    • H04L63/08Network architectures or network communication protocols for network security for authentication of entities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • H04L67/025Protocols based on web technology, e.g. hypertext transfer protocol [HTTP] for remote control or remote monitoring of applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Hardware Design (AREA)
  • Computer Security & Cryptography (AREA)
  • Mathematical Physics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention relates to the technical field of network server fault alarm, and discloses a network server fault alarm system based on service layer monitoring big data, which comprises: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side; the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is respectively communicated with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module. The invention solves the problems of how to quickly and timely find fault points in monitoring logs reported by a plurality of services and intelligently analyze alarms.

Description

Network server fault warning system based on business layer monitoring big data
Technical Field
The invention relates to the technical field of network server fault alarm, in particular to a network server fault alarm system based on service layer monitoring big data.
Background
With the increasing depth of the internet, the number of servers and the service scale of the internet company are rapidly increasing, and meanwhile, the monitoring service also brings challenges to the high availability of the service and the multidimensional, intelligent and real-time performance of the monitoring service. Most of the traditional server monitoring systems are designed for hardware performance indexes of a server, such as CPU utilization rate, network I/O flow, residual amount of disk space, memory utilization rate, JVM stack memory and the like, and most of the traditional server monitoring systems are designed according to a mode of manually setting a threshold value and an alarm rule. After the alarm is sent out, the operation and maintenance personnel carry out manual fault location and troubleshooting. In the process, most of the operation is performed by depending on the experience of operation and maintenance personnel, which causes the problem of low efficiency.
Whereas, from a business level, a single product may have hundreds of millions of service monitoring logs per day. Therefore, how to quickly and timely find fault points in monitoring logs reported by a plurality of services and intelligently analyze alarms is a problem to be solved.
Disclosure of Invention
Technical problem to be solved
Aiming at the defects of the prior art, the invention provides a network server fault alarm system for monitoring big data based on a service layer, which aims to solve the technical problems of how to quickly and timely find fault points in monitoring logs reported by a plurality of services and intelligently analyze alarms.
(II) technical scheme
In order to achieve the purpose, the invention provides the following technical scheme:
a network server fault warning system based on service layer monitoring big data comprises: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side;
the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is in communication connection with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module respectively.
Further, the central computing module comprises a real-time computing submodule and an off-line computing submodule.
Furthermore, the real-time computation submodule calls data to the data storage module and the standard parameter setting module, performs alarm detection according to rules preset by the standard parameter setting module, and outputs a service health state report of a service layer to the fault alarm module; and the offline calculation submodule calls data from the data storage module, analyzes the data as historical data, and predicts a monitoring index and an alarm rule of the next period so as to provide a standard parameter setting module for use.
Further, the fault alarm module analyzes the monitoring index according to the service health status report of the service layer, judges whether the index value is abnormal or not, needs to send an alarm or not, and sends alarm information at the first time when the network service is abnormal.
(III) advantageous technical effects
Compared with the prior art, the invention has the following beneficial technical effects:
the data are called to the data storage module through the off-line calculation submodule and are used as historical data to be analyzed, and the monitoring index and the alarm rule of the next period are automatically predicted so as to be provided for the standard parameter setting module to use; alarm detection is carried out through a real-time calculation submodule according to rules preset by a standard parameter setting module, and a service layer service health state report is output to a fault alarm module; the fault alarm module analyzes the monitoring index according to the service health state report of the service layer, judges whether the index value is abnormal or not, and sends out an alarm or not, and sends out alarm information at the first time when the network service is abnormal, thereby realizing the technical effects of quickly and timely finding out a fault point and intelligently analyzing the alarm.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
A network server fault warning system based on service layer monitoring big data comprises: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side;
the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is in communication connection with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module respectively;
the system comprises a data acquisition module, a data storage module and a standard parameter setting module, wherein the data acquisition module is used for acquiring monitoring log data of a network service business layer, the data storage module is used for storing the monitoring log data of the network service business layer, and the standard parameter setting module is used for defining monitoring indexes and configuring alarm rules;
the central computing module comprises a real-time computing submodule and an off-line computing submodule, the real-time computing submodule calls data to the data storage module and the standard parameter setting module, performs alarm detection according to rules preset by the standard parameter setting module, and outputs a service layer service health state report to the fault alarm module; the off-line calculation submodule calls data from the data storage module, analyzes the data as historical data, and predicts a monitoring index and an alarm rule of the next period so as to provide a standard parameter setting module for use;
the fault alarm module analyzes the monitoring index according to the service health state report of the service layer, judges whether the index value is abnormal or not, needs to send an alarm or not, and sends alarm information at the first time when the network service is abnormal;
further, installing and operating the server software with the communication authority authentication system on the operating system of the computer terminal PCTnsfa;
in order to prevent an illegal network node impersonating a cloud computing server CCSnsfa from sending false network service fault alarm information to a computer terminal PCTnsfa through a server fault alarm system, before the computer terminal PCTnsfa reads the alarm information sent by the cloud computing server CCSnsfa, a communication authority authentication system authenticates the identity of the cloud computing server CCSnsfa, and the authentication method specifically comprises the following steps:
step one, a cloud computing server CCSnsfa registers communication authority on a communication authority authentication system, which specifically comprises the following steps:
the communication authority authentication system firstly selects two large prime numbers alpha and beta, then calculates lambda as alpha beta, then secretly stores the values of the prime numbers alpha and beta, and discloses the lambda to a cloud computing server CCSnsfa;
the cloud computing server CCSnsfa selects a communication private key k (k is more than or equal to 1 and less than or equal to lambda-1), calculates l as kmod lambda, takes l as a communication public key, and then discloses the communication public key l to a communication authority authentication system;
step two, when the cloud computing server CCSnsfa sends network service fault warning information to the computer terminal PCTnsfa, the communication authority authentication system authenticates the identity of the cloud computing server CCSnsfa, which specifically comprises the following steps:
the cloud computing server CCSnsfa randomly selects a numerical value m (m is more than or equal to 1 and less than or equal to lambda-1), and p is calculated as m2mod lambda and send p to the communication authority authentication system;
the communication authority authentication system randomly selects a value e belonging to {0,1}, and sends the value e to the cloud computing server CCSnsfa;
cloud computing server CCSnsfa calculates q ═ mkeAnd sending q to a communication authority authentication system;
verification equation q of communication authority authentication system2modλ=pleWhether the result is true or not;
if the equation is established, the cloud computing server CCSnsfa is proved to know the communication private key k and has legal communication authority, and the computer terminal PCTnsfa receives alarm information sent by the cloud computing server CCSnsfa;
in the authentication process, the key k only participates in operation in the authentication process and is not transmitted in communication, so that an illegal tracker cannot intercept the key in a circuit, and the identity authentication process of the cloud computing server CCSnsfa is zero-knowledge.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (4)

1. A network server fault alarm system based on service layer monitoring big data is characterized by comprising: the cloud computing system comprises a cloud computing server CCSnsfa, a computer terminal PCTnsfa and a cloud computing server CCSnsfa, wherein the cloud computing server CCSnsfa is operated with server fault warning system server side software and is deployed at a remote cloud side;
the server fault warning system comprises a data acquisition module, a data storage module, a standard parameter setting module, a fault warning module and a central computing module, wherein the central computing module is in communication connection with the data acquisition module, the data storage module, the standard parameter setting module and the fault warning module respectively.
2. The business layer monitoring big data-based network server fault alarm system of claim 1, wherein the central computation module comprises a real-time computation submodule and an off-line computation submodule.
3. The network server fault alarm system based on business layer big data monitoring of claim 2, wherein the real-time computation submodule calls data to the data storage module and the standard parameter setting module, performs alarm detection according to rules preset by the standard parameter setting module, and outputs a business layer service health status report to the fault alarm module; and the offline calculation submodule calls data from the data storage module, analyzes the data as historical data, and predicts a monitoring index and an alarm rule of the next period so as to provide a standard parameter setting module for use.
4. The system of claim 3, wherein the fault alarm module analyzes the monitoring index according to the service health status report of the service layer, determines whether the index value is abnormal, needs to send an alarm, and sends an alarm message at the first time when the network service is abnormal.
CN202110085356.XA 2021-01-22 2021-01-22 Network server fault warning system based on business layer monitoring big data Pending CN112817815A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110085356.XA CN112817815A (en) 2021-01-22 2021-01-22 Network server fault warning system based on business layer monitoring big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110085356.XA CN112817815A (en) 2021-01-22 2021-01-22 Network server fault warning system based on business layer monitoring big data

Publications (1)

Publication Number Publication Date
CN112817815A true CN112817815A (en) 2021-05-18

Family

ID=75858790

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110085356.XA Pending CN112817815A (en) 2021-01-22 2021-01-22 Network server fault warning system based on business layer monitoring big data

Country Status (1)

Country Link
CN (1) CN112817815A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113448763A (en) * 2021-07-16 2021-09-28 广东电网有限责任公司 Dynamic expansion grouping alarm service method for full life cycle management
CN115277358A (en) * 2022-02-10 2022-11-01 上海贝加信息技术有限公司 Network server monitoring method and storage medium
CN117221151A (en) * 2023-09-12 2023-12-12 北京城建智控科技股份有限公司 Visual management device and method for cloud computing storage

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113448763A (en) * 2021-07-16 2021-09-28 广东电网有限责任公司 Dynamic expansion grouping alarm service method for full life cycle management
CN115277358A (en) * 2022-02-10 2022-11-01 上海贝加信息技术有限公司 Network server monitoring method and storage medium
CN117221151A (en) * 2023-09-12 2023-12-12 北京城建智控科技股份有限公司 Visual management device and method for cloud computing storage

Similar Documents

Publication Publication Date Title
CN112817815A (en) Network server fault warning system based on business layer monitoring big data
US11968077B2 (en) Link fault monitoring method and apparatus
US10148540B2 (en) System and method for anomaly detection in information technology operations
Valdes et al. Communication pattern anomaly detection in process control systems
US9967169B2 (en) Detecting network conditions based on correlation between trend lines
CN112866185B (en) Network traffic monitoring device and abnormal traffic detection method
WO2020228276A1 (en) Network alert method and device
CN107888452B (en) 24-hour distributed website performance monitoring and real-time alarming method
WO2017080161A1 (en) Alarm information processing method and device in cloud computing
CN104052634A (en) Information security monitoring system and method
CN108092847A (en) A kind of electric power LTE wireless terminal remote on-line monitoring methods
CN112468592A (en) Terminal online state detection method and system based on electric power information acquisition
CN112583643A (en) Cross-device alarm correlation method
CN112769622A (en) Cluster service fault early warning system based on RPC service monitoring
CN116418653A (en) Fault positioning method and device based on multi-index root cause positioning algorithm
CN117520096B (en) Intelligent server safety monitoring system
CN109510730B (en) Distributed system, monitoring method and device thereof, electronic equipment and storage medium
US10110440B2 (en) Detecting network conditions based on derivatives of event trending
CN110730087A (en) Method and device for processing alarm storm
CN117118662A (en) Zero trust mechanism-oriented power system running state safety protection system
CN112688970B (en) Large-traffic DDoS attack detection method and system based on programmable chip
CN110737565A (en) data monitoring method, device, electronic equipment and storage medium
CN115225534A (en) Method for monitoring running state of monitoring server
CN114296979A (en) Method and device for detecting abnormal state of Internet of things equipment
CN115834330B (en) Group obstacle detection method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210518

WD01 Invention patent application deemed withdrawn after publication