CN106815115B - Server running state monitoring system - Google Patents

Publication number
CN106815115B
Authority
CN
China
Prior art keywords
data
module
information
log
alarm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710025251.9A
Other languages
Chinese (zh)
Other versions
CN106815115A (en)
Inventor
张文昌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Wave Intelligent Technology Co Ltd
Original Assignee
Suzhou Wave Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Wave Intelligent Technology Co Ltd
Priority to CN201710025251.9A
Publication of CN106815115A
Application granted
Publication of CN106815115B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3003 Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3024 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a central processing unit [CPU]
    • G06F11/302 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a software system
    • G06F11/3037 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a memory, e.g. virtual memory, cache
    • G06F11/32 Monitoring with visual or acoustical indication of the functioning of the machine
    • G06F11/324 Display of status information
    • G06F11/327 Alarm or error message display
    • G06F11/34 Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3452 Performance evaluation by statistical analysis
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/18 File system types
    • G06F16/1805 Append-only file systems, e.g. using logs or journals to store data
    • G06F16/1815 Journaling file systems
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25 Integrating or interfacing systems involving database management systems
    • G06F16/28 Databases characterised by their database models, e.g. relational or object models
    • G06F16/284 Relational databases

Abstract

The invention provides a server running state monitoring system. A data acquisition module acquires data on the working condition of a server and sends the acquired data to a data processing module; the data processing module receives the data sent by the data acquisition module, sets the running time of the data acquisition module by broadcast, and uniformly sets the times at which the data acquisition module acquires and sends data; the data processing module is communicatively connected with the data acquisition module by optical cable, WiFi, or CAN; the data processing module compares the acquired data with a preset threshold and with the current data, and raises an alarm when the current data exceeds the preset threshold or when the comparison indicates an adverse effect; alarm statistics are compiled from the alarm information and stored; and the data processing module receives requests from a client and returns the processed data to the user over HTTP (Hypertext Transfer Protocol).

Description

Server running state monitoring system
Technical Field
The invention relates to the field of servers, in particular to a server running state monitoring system.
Background
With the progress and development of network technology and data processing, the processing, computing, and storage capacity of servers has become central to the daily operation of enterprises. Because enterprise data is stored on servers, servers also play an important role in enterprise data security. Most network companies run their servers around the clock, so ensuring stable operation and monitoring servers while they run is a technical problem that currently needs to be solved.
Disclosure of Invention
To overcome the shortcomings of the prior art, the invention provides a server running state monitoring system comprising: a data acquisition module, a data processing module, a log recording module, a database, and a data encapsulation module;
the data acquisition module is used for acquiring the states of each computing node, each storage node, each fan, each switch, the power module, the network card, and the operating system of the server, and for sending the acquired information to the data processing module;
the data processing module is used for receiving the data sent by the data acquisition module, storing the received data on a MySQL storage platform, and generating and storing a trend analysis report of the acquired data; the data processing module sets the running time of the data acquisition module by broadcast and uniformly sets the times at which the data acquisition module acquires and sends data; the data processing module is communicatively connected with the data acquisition module by optical cable, WiFi, or CAN; the data processing module compares the acquired data with a preset threshold and with the current data, and raises an alarm when the current data exceeds the preset threshold or when the comparison indicates an adverse effect; alarm statistics are compiled from the alarm information and stored; the data processing module receives requests from a client and returns the processed data to the user over HTTP (Hypertext Transfer Protocol);
the log recording module is connected with the data processing module;
the log recording module is used for storing the processed data and the alarm information in the database as logs, so that the corresponding history records can be looked up when needed;
the data encapsulation module is used for receiving user control instructions from the data processing module and encapsulating them into control instruction request messages, thereby controlling the data acquisition module.
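The threshold comparison and alarm statistics described for the data processing module can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Sample` record, the metric names, and the threshold values are hypothetical, and alarm statistics are assumed to be simple per-component counts.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Sample:
    component: str  # hypothetical component name, e.g. "cpu-0"
    metric: str     # hypothetical metric name, e.g. "temperature_c"
    value: float


def check_samples(samples, thresholds):
    """Return one alarm record per sample whose value exceeds its preset threshold."""
    alarms = []
    for s in samples:
        limit = thresholds.get(s.metric)
        if limit is not None and s.value > limit:
            alarms.append({"component": s.component, "metric": s.metric,
                           "value": s.value, "threshold": limit})
    return alarms


def alarm_statistics(alarms):
    """Count alarms per component (an assumed, simple form of alarm statistics)."""
    return Counter(a["component"] for a in alarms)


# Assumed example values; real thresholds would come from configuration.
thresholds = {"temperature_c": 85.0, "utilization_pct": 95.0}
samples = [
    Sample("cpu-0", "temperature_c", 91.0),
    Sample("cpu-1", "temperature_c", 62.5),
    Sample("cpu-0", "utilization_pct", 97.0),
]
alarms = check_samples(samples, thresholds)
stats = alarm_statistics(alarms)
```

In a full system the alarm records and counts would be written to the database by the log recording module; here they are kept in memory to keep the sketch self-contained.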
Preferably, the system further comprises: a user interaction module and a code monitoring module;
the database is used for storing the running state information of the various servers, the control operation logs, and the alarm information;
the user interaction module is used for displaying the running state information of the various servers, the control operation logs, and the alarm information to a user, and for providing an operation interface for starting, stopping, and resetting each element of the server;
the code monitoring module is used for acquiring and monitoring the running scripts and the running state of the application programs on the server; it acquires the running script code and compares it against a preset code script; the script code comprises a log analysis script, a server process monitoring script, and scripts for server memory occupation, server CPU utilization, database utilization, and server memory utilization.
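The text does not say how the running script code is compared with the preset code script. One plausible approach, shown here purely as an assumption, is to compare SHA-256 digests of the script contents; the script names and the four status labels are hypothetical.

```python
import hashlib


def digest(script_text: str) -> str:
    """SHA-256 digest of a script's text."""
    return hashlib.sha256(script_text.encode("utf-8")).hexdigest()


def compare_scripts(running: dict, preset: dict) -> dict:
    """Classify each script as 'ok', 'modified', 'missing', or 'unexpected'."""
    result = {}
    for name, expected in preset.items():
        if name not in running:
            result[name] = "missing"
        elif digest(running[name]) == digest(expected):
            result[name] = "ok"
        else:
            result[name] = "modified"
    for name in running:
        if name not in preset:
            result[name] = "unexpected"
    return result


# Hypothetical script names and contents.
preset = {"log_analysis.sh": "grep ERROR app.log\n", "proc_monitor.sh": "ps aux\n"}
running = {"log_analysis.sh": "grep ERROR app.log\n", "proc_monitor.sh": "ps -ef\n"}
report = compare_scripts(running, preset)
```

Digest comparison only detects that a script changed; a deployment that needs to know what changed would diff the texts instead.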
Preferably, the system further comprises: a WEB module and a mobile terminal;
the WEB module is used for providing a WEB request interface, receiving a monitoring interface request sent by the mobile terminal, and returning a monitoring interface to the mobile terminal; the monitoring interface of the mobile terminal displays the collected data and the alarm information;
the mobile terminal builds the monitoring interface from the data sent by the WEB module, receives the monitoring information sent by the WEB module in real time, and is provided with a script for establishing an HTML connection;
the mobile terminal comprises a parameter collection module;
the parameter collection module is used for integrating the data acquired by the data acquisition module and presenting it as lines, tables, and graphs; by analyzing and editing the data, it arranges and displays all the data on the display screen of the mobile terminal, helping the user understand the running state of the server and track and manage its operation.
Preferably, the system further comprises: a log management unit;
the log management unit comprises: a preprocessing module, a log editing module, a log setting time module, a log detection module, an abnormality judgment module, and a log feedback module;
the preprocessing module is used for caching the current log and transferring the legacy log file left by the previous run to the database;
the log editing module is used for installing the GNU Compiler Collection (GCC), the g++ compiler, and the gfortran compiler, constructing the compiling and running environment of the log program, and compiling the source code;
the log setting time module is used for determining the time point at which the log was last obtained and the time point at which the log is currently obtained;
the log detection module is used for running a monitor program, detecting whether the key matching information, the state of each computing node, the state of each storage node, the fan information of each fan, the state information of each switch, the state information of the power module, the working condition of the network card, and the working condition of the operating system recorded in the current data processing module log exceed a set value, and screening out the over-threshold information in the data processing module log;
the abnormality judgment module is used for judging whether the log is abnormal, and for acquiring the next log when the current log is not abnormal;
and the log feedback module is used for, when an abnormality is found, generating in a specified folder a log file whose file name is formed from the current log information and the log time, calling a web-side interface to upload the log, and prompting an administrator to process it promptly.
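The cooperation of the log setting time, log detection, abnormality judgment, and log feedback modules can be sketched as one screening pass over a time window. The log line layout (`YYYY-MM-DD HH:MM:SS key=value`), the key names, and the file name pattern are assumptions made for illustration; the text only says that the file name is formed from the log information and the log time.

```python
from datetime import datetime

TIME_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_line(line):
    """Parse 'YYYY-MM-DD HH:MM:SS key=value' (an assumed log layout)."""
    stamp, _, rest = line.strip().rpartition(" ")
    key, _, value = rest.partition("=")
    return datetime.strptime(stamp, TIME_FORMAT), key, float(value)


def screen_log(lines, last_point, current_point, limits):
    """Return entries in (last_point, current_point] whose value exceeds its set value."""
    abnormal = []
    for line in lines:
        when, key, value = parse_line(line)
        if not (last_point < when <= current_point):
            continue  # outside the window between the two log time points
        limit = limits.get(key)
        if limit is not None and value > limit:
            abnormal.append((when, key, value))
    return abnormal


def feedback_filename(when, key):
    """Build a file name from the log information and the log time (assumed pattern)."""
    return f"{key}_{when.strftime('%Y%m%d_%H%M%S')}.log"


lines = [
    "2017-01-13 09:59:00 cpu_temp=90",
    "2017-01-13 10:00:05 cpu_temp=88.5",
    "2017-01-13 10:00:10 cpu_temp=60",
]
window_start = datetime(2017, 1, 13, 10, 0, 0)
window_end = datetime(2017, 1, 13, 10, 1, 0)
abnormal = screen_log(lines, window_start, window_end, {"cpu_temp": 85.0})
names = [feedback_filename(when, key) for when, key, _ in abnormal]
```

Running the pass again with `window_end` as the new `last_point` gives the repeated, window-by-window checking that the unit describes; uploading through the web-side interface is left out because the text does not specify it.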
Preferably, the log recording module is further configured to keep a long-term log of the information recorded by the sensors and of the alarm, shutdown, and restart events that occur in the system, so that system maintenance personnel can diagnose faults by analyzing the log; the log is stored in a nonvolatile memory of the centralized monitoring system, so that it still gives maintenance personnel a way to analyze faults when the data processing module is down or the power supply is cut off;
the data classification alarm module is also used for setting an upper alarm limit, an upper safety limit, a lower safety limit, and a lower alarm limit; when the received temperature data of one or more components of the system exceeds a threshold, or when a cooling fan stops working, the condition is reported to the system administrator by an audible and visual alarm;
the system further comprises: an alarm information notification module;
the alarm information notification module is used for sending notifications by short message, by client browser interface, and by e-mail; in the client browser interface notification mode, the client browser platform searches an event list and, if alarm event information exists, displays the alarm information on the status bar of its page to prompt the user that an alarm has been generated; in the short message notification mode, when alarm information is generated, the alarm information notification module automatically sends it as a short message through a hardware device; in the e-mail notification mode, when alarm information is generated, the alarm information notification module automatically sends it to the mailbox of a specified user.
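The four limits set by the data classification alarm module can be read as two nested bands around the normal range. The sketch below assumes the ordering lower alarm ≤ lower safety ≤ upper safety ≤ upper alarm and introduces an intermediate "warning" level for values between a safety limit and an alarm limit; both the ordering and the "warning" label are assumptions, since the text names the limits without defining the levels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Limits:
    lower_alarm: float
    lower_safety: float
    upper_safety: float
    upper_alarm: float

    def __post_init__(self):
        # Assumed ordering of the four limits.
        if not (self.lower_alarm <= self.lower_safety
                <= self.upper_safety <= self.upper_alarm):
            raise ValueError("limits must be ordered from lower alarm to upper alarm")


def classify(value: float, limits: Limits) -> str:
    """Return 'alarm', 'warning' (assumed intermediate level), or 'normal'."""
    if value > limits.upper_alarm or value < limits.lower_alarm:
        return "alarm"
    if value > limits.upper_safety or value < limits.lower_safety:
        return "warning"
    return "normal"


def should_alert(temperature: float, fan_running: bool, limits: Limits) -> bool:
    """Alert when the temperature is in the alarm band or the cooling fan has stopped."""
    return (not fan_running) or classify(temperature, limits) == "alarm"


cpu_limits = Limits(lower_alarm=5.0, lower_safety=10.0,
                    upper_safety=75.0, upper_alarm=85.0)  # assumed values, in °C
levels = [classify(t, cpu_limits) for t in (60.0, 80.0, 90.0)]
```

With the assumed limits, 60 °C is normal, 80 °C falls between the upper safety and upper alarm limits, and 90 °C is in the alarm band.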
Preferably, the WEB module is further configured to buffer the data to be sent, namely the state data of each computing node, each storage node, each fan, each switch, the power module, the network card, and the operating system, and to apply spread-spectrum processing and two-phase phase-shift modulation (BPSK) to the buffered data;
the modulated state data is converted by a transmitting frequency conversion module from an analog signal to a high-frequency analog signal, then converted between analog and digital form and filtered; a data information encoder encodes the filtered data into an encoded serial data stream for communication transmission; the encoded data is divided into a plurality of segments, each transmitted in its own sub-channel; the rate of each sub-channel is 1/N of the total channel rate, that is, the symbol period of each channel is stretched to N times that of unsegmented transmission; the channel carriers are mutually orthogonal, and the spectral nulls of each subcarrier coincide with those of the adjacent subcarriers, which avoids inter-channel interference;
the mobile terminal is also used for receiving the data sent by the WEB module and combining the encoded data that was split into segments into a complete information code, and for decoding the information code to restore the state data of each computing node, each storage node, each fan, each switch, the power module, the network card, and the operating system.
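The segmentation and recombination step can be sketched at the byte level. This illustration ignores the radio layer entirely (carriers, orthogonality, filtering) and only shows splitting encoded data into N segments, merging them back, and the 1/N rate relationship; the function names and the contiguous-slice layout are assumptions.

```python
def split_segments(encoded: bytes, n: int):
    """Split encoded data into n contiguous segments, one per sub-channel."""
    if n <= 0:
        raise ValueError("n must be positive")
    size = -(-len(encoded) // n)  # ceiling division: bytes per segment
    return [encoded[i * size:(i + 1) * size] for i in range(n)]


def merge_segments(segments) -> bytes:
    """Recombine the segments, in sub-channel order, into the complete code."""
    return b"".join(segments)


def subchannel_symbol_period(total_symbol_rate: float, n: int) -> float:
    """Each sub-channel runs at 1/N of the total rate, so its symbol period is N/rate."""
    return n / total_symbol_rate


payload = b"fan=ok;psu=ok;nic=up"   # hypothetical encoded status string
segments = split_segments(payload, 4)
restored = merge_segments(segments)
period = subchannel_symbol_period(1000.0, 4)  # seconds per symbol on one sub-channel
```

At an assumed total rate of 1000 symbols per second and N = 4, each sub-channel carries 250 symbols per second, so its symbol period is four times that of unsegmented transmission.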
Preferably, the data acquisition module further comprises: a data transmission processing module;
the data transmission processing module is used for packaging the data sent to the data processing module, the packaged data comprising: a MAC address layer, a data processing terminal IP layer, and a data frame section;
in the data frame section, when the lowest bit of the byte is 0, the byte carries the address code of the component making the request or reply, and when the lowest bit is 1, it carries the software ID of the request or reply; the upper 7 bits of the byte carry the specific address code or software ID;
a parity code is set in the data frame section: when the code is even, the message is a request, and when it is odd, the message is a response; for a basic control request and its status response, the bytes are 00h and 01h, respectively;
a sequence number generated by the requester is also set in the data frame section, so that different requests can be told apart when a data processing terminal needs to send several requests; the sequence number is the sub-component number or sub-address number of the component that responds to or receives the message.
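The identifier byte described above, with a type flag in the lowest bit and a 7-bit value in the upper bits, can be packed and unpacked as follows. The function names are hypothetical; the bit layout follows the text.

```python
def pack_id_byte(identifier: int, is_software_id: bool) -> int:
    """Put a 7-bit identifier in the upper bits and the type flag in bit 0."""
    if not 0 <= identifier <= 0x7F:
        raise ValueError("identifier must fit in 7 bits")
    return (identifier << 1) | (1 if is_software_id else 0)


def unpack_id_byte(byte: int):
    """Return (kind, identifier): bit 0 selects address code (0) or software ID (1)."""
    kind = "software_id" if byte & 1 else "address_code"
    return kind, (byte >> 1) & 0x7F


address_byte = pack_id_byte(0x10, is_software_id=False)   # 0x20
software_byte = pack_id_byte(0x40, is_software_id=True)   # 0x81
```

Because the flag occupies bit 0, every address-code byte is even and every software-ID byte is odd, which makes the two kinds easy to tell apart when inspecting raw frames.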
With the above technical solution, the invention has the following advantages:
The data processing module receives the data sent by the data acquisition module, stores it on a MySQL storage platform, and generates and stores a trend analysis report of the acquired data; the data processing module sets the running time of the data acquisition module by broadcast and uniformly sets the times at which the data acquisition module acquires and sends data; the data processing module is communicatively connected with the data acquisition module by optical cable, WiFi, or CAN; the data processing module compares the acquired data with a preset threshold and with the current data, and raises an alarm when the current data exceeds the preset threshold or when the comparison indicates an adverse effect; alarm statistics are compiled from the alarm information and stored; and the data processing module receives requests from a client and returns the processed data to the user over HTTP (Hypertext Transfer Protocol).
The log recording module is connected with the data processing module; the log recording module is used for storing the data and the alarm information in the database as logs, so that the corresponding history records can be looked up when needed; the data encapsulation module is used for receiving user control instructions from the data processing module and encapsulating them into control instruction request messages, thereby controlling the data acquisition module.
Drawings
To illustrate the technical solution of the invention more clearly, the drawings used in the description are briefly introduced below. The drawings described below show only some embodiments of the invention; those skilled in the art can obtain other drawings from them without creative effort.
FIG. 1 is an overall schematic diagram of the server running state monitoring system;
FIG. 2 is a schematic diagram of the log management unit.
Detailed Description
In order to make the objects, features and advantages of the present invention more obvious and understandable, the technical solutions of the present invention will be clearly and completely described below with reference to specific embodiments and drawings. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the scope of protection of this patent.
The present embodiment provides a server running state monitoring system which, as shown in FIG. 1, comprises: a data acquisition module 1, a data processing module 2, a log recording module 4, a database 5, and a data encapsulation module 6;
the data acquisition module 1 is used for acquiring the states of each computing node, each storage node, each fan, each switch, the power module, the network card, and the operating system of the server, and for sending the acquired information to the data processing module 2;
the data processing module 2 is used for receiving the data sent by the data acquisition module 1, storing the received data on a MySQL storage platform, and generating and storing a trend analysis report of the acquired data; the data processing module sets the running time of the data acquisition module by broadcast and uniformly sets the times at which the data acquisition module acquires and sends data; the data processing module is communicatively connected with the data acquisition module by optical cable, WiFi, or CAN; the data processing module compares the acquired data with a preset threshold and with the current data, and raises an alarm when the current data exceeds the preset threshold or when the comparison indicates an adverse effect; alarm statistics are compiled from the alarm information and stored; the data processing module receives requests from a client and returns the processed data to the user over HTTP (Hypertext Transfer Protocol);
the log recording module 4 is connected with the data processing module 2; the log recording module 4 is used for storing the data and the alarm information in the database 5 as logs, so that the corresponding history records can be looked up when needed; the data encapsulation module 6 is used for receiving user control instructions from the data processing module 2 and encapsulating them into control instruction request messages, thereby controlling the data acquisition module 1.
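The embodiment does not state how the trend analysis report generated by the data processing module 2 is computed. As one hedged possibility, a report could summarize each metric with its minimum, maximum, mean, and a least-squares slope over time; the function name and the report fields below are assumptions.

```python
def trend_report(timestamps, values):
    """Summarize a series with min, max, mean, and least-squares slope per time unit."""
    n = len(values)
    if n < 2 or n != len(timestamps):
        raise ValueError("need at least two samples with matching timestamps")
    mean_t = sum(timestamps) / n
    mean_v = sum(values) / n
    var_t = sum((t - mean_t) ** 2 for t in timestamps)
    if var_t == 0:
        raise ValueError("timestamps must not all be equal")
    # Ordinary least-squares slope: cov(t, v) / var(t).
    slope = sum((t - mean_t) * (v - mean_v)
                for t, v in zip(timestamps, values)) / var_t
    return {"min": min(values), "max": max(values), "mean": mean_v, "slope": slope}


# Hypothetical CPU temperatures sampled once a minute (seconds, °C).
report = trend_report([0, 60, 120, 180], [40.0, 42.0, 44.0, 46.0])
```

For the sample series the slope is 2 °C per 60 s, about 0.033 °C per second; a positive slope sustained across reports would suggest a component heating up before any threshold is crossed.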
In this embodiment, the system further comprises: a user interaction module and a code monitoring module;
the database is used for storing the running state information of the various servers, the control operation logs, and the alarm information; the user interaction module is used for displaying the running state information of the various servers, the control operation logs, and the alarm information to a user, and for providing an operation interface for starting, stopping, and resetting each element of the server;
the code monitoring module is used for acquiring and monitoring the running scripts and the running state of the application programs on the server; it acquires the running script code and compares it against a preset code script; the script code comprises a log analysis script, a server process monitoring script, and scripts for server memory occupation, server CPU utilization, database utilization, and server memory utilization.
In this embodiment, the system further includes: a WEB module and a mobile terminal;
the WEB module is used for providing a WEB request interface, receiving a monitoring interface request sent by the mobile terminal, and returning a monitoring interface to the mobile terminal; the monitoring interface of the mobile terminal displays the collected data and the alarm information;
the mobile terminal builds the monitoring interface from the data sent by the WEB module, receives the monitoring information sent by the WEB module in real time, and is provided with a script for establishing an HTML connection;
the mobile terminal comprises a parameter collection module; the parameter collection module is used for integrating the data acquired by the data acquisition module and presenting it as lines, tables, and graphs; by analyzing and editing the data, it arranges and displays all the data on the display screen of the mobile terminal, helping the user understand the running state of the server and track and manage its operation.
In this embodiment, as shown in FIG. 2, the system further comprises: a log management unit; the log management unit comprises: a preprocessing module 11, a log editing module 12, a log setting time module 13, a log detection module 14, an abnormality judgment module 15, and a log feedback module 16;
the preprocessing module 11 is used for caching the current log and transferring the legacy log file left by the previous run to the database; the log editing module 12 is used for installing the GNU Compiler Collection (GCC), the g++ compiler, and the gfortran compiler, constructing the compiling and running environment of the log program, and compiling the source code; the log setting time module 13 is used for determining the time point at which the log was last obtained and the time point at which the log is currently obtained; the log detection module 14 is used for running a monitor program, detecting whether the key matching information, the state of each computing node, the state of each storage node, the fan information of each fan, the state information of each switch, the state information of the power module, the working condition of the network card, and the working condition of the operating system recorded in the current data processing module log exceed a set value, and screening out the over-threshold information in the data processing module log; the abnormality judgment module 15 is used for judging whether the log is abnormal, and for acquiring the next log when the current log is not abnormal; the log feedback module 16 is used for, when an abnormality is found, generating in a specified folder a log file whose file name is formed from the current log information and the log time, calling a web-side interface to upload the log, and prompting an administrator to process it promptly. During monitoring, error information in the data processing module log can be screened from the current time point to the next time point according to the test time specified by the user, and the log setting time module 13, the log detection module 14, the abnormality judgment module 15, and the log feedback module 16 can work repeatedly.
By obtaining the log file generated by the data processing module, the system detects whether any set value is exceeded in the log and raises an alarm promptly when an abnormality is found. Within a specified time, it repeatedly checks the effective information in the current log of the data processing module and raises alarms for abnormal information, so that an administrator can deal with the corresponding problems in time. This achieves the purpose of monitoring the running state of the server, and the server can still be monitored normally even if software on the server cannot be started.
In this embodiment, the log recording module is further configured to keep a long-term log of the information recorded by the sensors and of the alarm, shutdown, and restart events that occur in the system, so that system maintenance personnel can diagnose faults by analyzing the log; the log is stored in a nonvolatile memory of the centralized monitoring system, so that it still gives maintenance personnel a way to analyze faults when the data processing module is down or the power supply is cut off;
the data classification alarm module is also used for setting an upper alarm limit, an upper safety limit, a lower safety limit, and a lower alarm limit; when the received temperature data of one or more components of the system exceeds a threshold, or when a cooling fan stops working, the condition is reported to the system administrator by an audible and visual alarm;
the system further comprises: an alarm information notification module;
the alarm information notification module is used for sending notifications by short message, by client browser interface, and by e-mail; in the client browser interface notification mode, the client browser platform searches an event list and, if alarm event information exists, displays the alarm information on the status bar of its page to prompt the user that an alarm has been generated; in the short message notification mode, when alarm information is generated, the alarm information notification module automatically sends it as a short message through a hardware device; in the e-mail notification mode, when alarm information is generated, the alarm information notification module automatically sends it to the mailbox of a specified user.
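The three notification modes can be sketched as interchangeable senders behind one dispatch function. Everything named here is hypothetical: the in-memory lists stand in for the browser event list, the SMS hardware device, and the mail server, none of which the embodiment specifies in detail.

```python
event_list = []    # stand-in for the event list the browser client searches
sms_outbox = []    # stand-in for the SMS hardware device
mail_outbox = []   # stand-in for an outgoing mail server


def to_browser(alarm):
    """Queue the alarm so the browser client finds it when it searches the event list."""
    event_list.append(alarm)


def to_sms(alarm):
    """Send the alarm text as a short message (truncated to an assumed 70 characters)."""
    sms_outbox.append(alarm["message"][:70])


def to_email(alarm):
    """Send the alarm to the mailbox of the specified user."""
    mail_outbox.append((alarm["recipient"], alarm["message"]))


def notify(alarm, channels):
    """Try every channel; return the names of the channels that accepted the alarm."""
    delivered = []
    for name, send in channels.items():
        try:
            send(alarm)
            delivered.append(name)
        except Exception:
            # One failing channel should not block the others (a design assumption).
            continue
    return delivered


channels = {"browser": to_browser, "sms": to_sms, "email": to_email}
alarm = {"message": "fan-2 stopped", "recipient": "admin@example.com"}
delivered = notify(alarm, channels)
```

Keeping the senders behind a common call signature lets a deployment enable any subset of the three modes without changing the dispatch logic.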
In this embodiment, the WEB module is further configured to buffer the data to be sent, namely the status data of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system, and to apply spread-spectrum processing and binary (two-phase) phase-shift modulation to the buffered status data;
the modulated status data of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system is converted from an analog signal to a high-frequency analog signal by a transmitting frequency conversion module, then converted between analog and digital form and filtered, and a data information encoder encodes the filtered data information to form an encoded serial data stream for communication transmission; the encoded data information is divided into a plurality of segments, each segment being transmitted in its own subchannel; the rate of each subchannel is 1/N of the total channel communication rate, that is, the symbol period of each subchannel is lengthened to N times that of transmission without segmentation; the subchannel carriers are mutually orthogonal, and the spectral zero points of each subcarrier coincide with the zero points of the adjacent subcarriers, so that interference between subchannels is avoided;
the mobile terminal is further used for receiving the data sent by the WEB module and recombining the encoded data segments into a complete information code; the information code is then decoded to restore the status data of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system.
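The segmentation and recombination steps can be illustrated with a minimal Python sketch. It assumes that the encoded stream is cut into contiguous byte segments, which the description does not specify, and it does not model the modulation, frequency conversion or orthogonal carriers. The function names are hypothetical.

```python
from typing import List, Tuple


def split_into_subchannels(encoded: bytes, n: int) -> List[bytes]:
    """Divide the encoded stream into n contiguous segments, one per subchannel."""
    if n < 1:
        raise ValueError("n must be at least 1")
    size = -(-len(encoded) // n)  # ceiling division so every byte is assigned
    return [encoded[i * size:(i + 1) * size] for i in range(n)]


def merge_subchannels(segments: List[bytes]) -> bytes:
    """Receiver side: recombine the segments into the complete information code."""
    return b"".join(segments)


def subchannel_timing(total_rate: float, n: int) -> Tuple[float, float]:
    """Return (subchannel rate, subchannel symbol period).

    Each subchannel runs at 1/N of the total rate, so its symbol period is
    N times the symbol period of unsegmented transmission (1 / total_rate).
    """
    return total_rate / n, n / total_rate
```

For example, with a total rate of 1000 symbols per second and N = 4, each subchannel carries 250 symbols per second and its symbol period is four times that of the unsegmented stream.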
In this embodiment, the data acquisition module further includes: a data transmission processing module;
the data transmission processing module is used for packaging the data information sent to the data processing module, and the packaged data information comprises: a MAC address layer, a data processing terminal IP layer and a data frame segment;
in the data frame segment, when the lowest bit of the byte is 0, the byte represents the address code of the component issuing the request or response, and when the lowest bit is 1, the byte represents the software ID issuing the request or response; the upper 7 bits of the byte carry the specific address code or software ID;
a parity information code is set in the data frame segment: when the code is even, the message is a request, and when the code is odd, the message is a response; for a basic control request and its status response, the codes are 00h and 01h respectively;
a sequence number generated by the requester is also set in the data frame segment and is used for distinguishing different requests when a data processing terminal needs to send a plurality of requests; the sequence number is the sub-component number or sub-address number of the component that responds to or receives the message.
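The bit layout of the identification byte and the even/odd rule can be made concrete with a short Python sketch. The function names are hypothetical, and the sketch covers only the single byte and the request/response code described above, not the complete frame.

```python
from typing import Tuple


def encode_id_byte(value: int, is_software_id: bool) -> int:
    """Pack a 7-bit address code or software ID into one byte.

    Upper 7 bits: the value; lowest bit: 0 = address code, 1 = software ID.
    """
    if not 0 <= value < 128:
        raise ValueError("value must fit in 7 bits")
    return (value << 1) | (1 if is_software_id else 0)


def decode_id_byte(byte: int) -> Tuple[str, int]:
    """Return (kind, value) for an identification byte."""
    kind = "software_id" if byte & 1 else "address_code"
    return kind, byte >> 1


def message_kind(code: int) -> str:
    """Even codes mark requests, odd codes mark responses."""
    return "request" if code % 2 == 0 else "response"
```

Under this layout the codes 00h and 01h decode to a request and a response respectively, matching the basic control example in the text.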
The embodiments in this description are described in a progressive manner: each embodiment focuses on its differences from the other embodiments, and for the parts that are the same or similar the embodiments may be referred to one another.
The previous description of the disclosed embodiments is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other embodiments without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the embodiments shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (6)

1. A server operation state monitoring system, comprising: a data acquisition module, a data processing module, a log recording module, a database, a data encapsulation module, a user interaction module and a code monitoring module; the data acquisition module is used for acquiring the states of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system of the server, and sending the acquired information to the data processing module;
the data processing module is used for receiving the data information sent by the data acquisition module, storing the received data information on a MySQL storage platform, and generating and storing a trend analysis report of the data acquired by the data acquisition module; the data processing module sets the running time of the data acquisition module by broadcast and uniformly sets the data acquisition and transmission processing times of the data acquisition module; the data processing module is in communication connection with the data acquisition module by optical cable, WIFI or CAN; the data processing module compares the acquired data information with a preset threshold and with the current data, and raises an alarm prompt when the current data exceeds the preset threshold or the comparison indicates an adverse effect; the alarm information is compiled into alarm statistics and stored; the data processing module receives requests from a client and returns the processed data to the user over HTTP (Hypertext Transfer Protocol);
the log recording module is connected with the data processing module;
the log recording module is used for storing the processed data information and the alarm information into a database in a log mode so as to look up corresponding history records when needed;
the data encapsulation module is used for receiving the user control instruction from the data processing module and encapsulating the user control instruction into control instruction request information to realize the control of the data acquisition module;
the database is used for storing the running state information of the various servers, the control operation logs and the alarm information;
the user interaction module is used for displaying running state information of various servers, control operation logs and alarm information to a user and providing an operation interface for controlling starting, stopping and resetting of each element of the server;
the code monitoring module is used for acquiring and monitoring the running scripts and the running state of the application programs in the server; it acquires the running script code and compares it against a preset code script for judgment; the script code comprises scripts for log analysis, server process monitoring, server memory occupation, server CPU utilization, database utilization and server memory utilization.
2. The server operation state monitoring system according to claim 1,
further comprising: a WEB module and a mobile terminal;
the WEB module is used for setting a WEB request interface, receiving a monitoring interface request sent by the mobile terminal and returning a monitoring interface to the mobile terminal, and the monitoring interface of the mobile terminal displays the collected data information and the alarm information;
the mobile terminal establishes a monitoring interface according to data sent by the WEB module, receives monitoring information sent by the WEB module in real time, and is provided with a script for establishing html connection;
the mobile terminal comprises a parameter collection module;
the parameter collection module is used for integrating the data information acquired by the data acquisition module, presenting it in combined form as lines, tables and graphs, and, by analyzing and editing the data, displaying all the data information fitted to the display screen of the mobile terminal, so as to help the user understand the running state of the server and track and manage its running process.
3. The server operation state monitoring system according to claim 1,
further comprising: a log management unit;
the log management unit includes: the system comprises a preprocessing module, a log editing module, a log setting time module, a log detection module, an abnormality judgment module and a log feedback module;
the preprocessing module is used for caching the current log and transferring the legacy log file left over from the last run to a database; the log editing module is used for installing the GNU Compiler Collection (GCC), the g++ compiler and the gfortran compiler, constructing a compile-and-run environment for the log program, and compiling the source code;
the log setting time module is used for determining the last log obtaining time point and the current log obtaining time point;
the log detection module is used for running a monitor program, detecting whether corresponding key matching information, the states of each computing node, the states of each storage node, the fan information of each fan, the state information of each switch, the state information of the power module, the working condition of the network card and the working condition of the operating system in the current data processing module log exceed a set value or not, and screening the over-threshold information in the data processing module log;
the abnormality judgment module is used for judging whether the log is abnormal, and acquiring the next log when the current log is not abnormal; and the log feedback module is used for, when an abnormality is found, generating in a specified folder a log file whose file name is formed from the current log information and the log time, calling a web-side interface, uploading the log and prompting an administrator to process it in time.
4. The server operation state monitoring system according to claim 1,
the log recording module is also used for making long-term log recording of information recorded by the sensor, alarm information, shutdown and restart event information in the system, so that system maintenance personnel can diagnose faults according to log analysis; the log is stored in a nonvolatile memory of the centralized monitoring system, and a path for analyzing faults can be still provided for system maintenance personnel under the condition that the data processing module is down or the power supply is cut off;
the data classification alarm module is also used for setting an upper alarm limit, an upper safety limit, a lower safety limit and a lower alarm limit; when the received temperature data of a certain component or a plurality of components of the system exceeds a threshold value or the cooling fan stops working, reporting the data to a system manager in an audible and visual alarm mode;
further comprising: an alarm information notification module;
the alarm information notification module is used for sending notifications in three modes: a short message notification mode, a client browser interface notification mode and an e-mail notification mode; in the client browser interface notification mode, the client browser platform searches an event list and, if alarm event information exists, displays the alarm information on the status bar of its page to prompt the user that alarm information has been generated; in the short message notification mode, when alarm information is generated, the alarm information notification module automatically sends the alarm information as a short message through hardware equipment; in the e-mail notification mode, when alarm information is generated, the alarm information notification module automatically sends the alarm information to the e-mail box of a specified user.
5. The server operation state monitoring system according to claim 2,
the WEB module is also used for caching the data to be sent, namely the status data of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system, and for applying spread-spectrum processing and binary (two-phase) phase-shift modulation to the cached status data;
the modulated status data of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system is converted from an analog signal to a high-frequency analog signal by a transmitting frequency conversion module, then converted between analog and digital form and filtered, and a data information encoder encodes the filtered data information to form an encoded serial data stream for communication transmission; the encoded data information is divided into a plurality of segments, each segment being transmitted in its own subchannel; the rate of each subchannel is 1/N of the total channel communication rate, that is, the symbol period of each subchannel is lengthened to N times that of transmission without segmentation; the subchannel carriers are mutually orthogonal, and the spectral zero points of each subcarrier coincide with the zero points of the adjacent subcarriers, so that interference between subchannels is avoided;
the mobile terminal is also used for receiving the data sent by the WEB module and recombining the encoded data segments into a complete information code; the information code is then decoded to restore the status data of each computing node, each storage node, each fan, each switch, the power module, the network card and the operating system.
6. The server operation state monitoring system according to claim 1,
the data acquisition module further comprises: a data transmission processing module;
the data transmission processing module is used for packaging data information sent to the data processing module, and the data information comprises: MAC address layer, data processing terminal IP layer, data frame section;
in the data frame segment, when the lowest bit of the byte is 0, the byte represents the address code of the component issuing the request or response, and when the lowest bit is 1, the byte represents the software ID issuing the request or response; the upper 7 bits of the byte carry the specific address code or software ID;
a parity information code is set in the data frame segment: when the code is even, the message is a request, and when the code is odd, the message is a response; for a basic control request and its status response, the codes are 00h and 01h respectively;
a sequence number generated by the requester is also set in the data frame segment and is used for distinguishing different requests when a data processing terminal needs to send a plurality of requests; the sequence number is the sub-component number or sub-address number of the component that responds to or receives the message.
CN201710025251.9A 2017-01-13 2017-01-13 Server running state monitoring system Active CN106815115B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710025251.9A CN106815115B (en) 2017-01-13 2017-01-13 Server running state monitoring system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710025251.9A CN106815115B (en) 2017-01-13 2017-01-13 Server running state monitoring system

Publications (2)

Publication Number Publication Date
CN106815115A CN106815115A (en) 2017-06-09
CN106815115B true CN106815115B (en) 2020-02-21

Family

ID=59110992

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710025251.9A Active CN106815115B (en) 2017-01-13 2017-01-13 Server running state monitoring system

Country Status (1)

Country Link
CN (1) CN106815115B (en)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107256261B (en) * 2017-06-13 2021-03-19 中原工学院 Electronic information transmission system and method thereof
KR102422032B1 (en) * 2017-08-16 2022-07-19 에스케이하이닉스 주식회사 Memory system and operating method of memory system
US10747210B2 (en) * 2017-09-11 2020-08-18 Lam Research Corporation System and method for automating user interaction for semiconductor manufacturing equipment
CN108337193B (en) * 2018-01-26 2021-01-01 山东超越数控电子股份有限公司 Unified installation and rescue system and monitoring system of switch operating system
CN108459944A (en) * 2018-03-29 2018-08-28 中科创能实业有限公司 System operation monitoring method, device and server
CN108521457B (en) * 2018-03-30 2019-08-13 三盟科技股份有限公司 A kind of tracking and system of equipment control command
CN108449235B (en) * 2018-05-07 2023-12-12 苏州德姆斯信息技术有限公司 Equipment alarm calculation processing system and processing method
CN108829568A (en) * 2018-05-31 2018-11-16 达飞云贷科技(北京)有限公司 A kind of data monitoring method and device
CN108628231A (en) * 2018-07-05 2018-10-09 郑州云海信息技术有限公司 Apparatus monitoring method and device in cloud data center
CN108933698B (en) * 2018-07-05 2021-08-20 郑州云海信息技术有限公司 Short message alarm system and method for abnormity
CN109032904A (en) * 2018-07-19 2018-12-18 郑州云海信息技术有限公司 Monitored, management server and data acquisition, analysis method and management system
CN109254892A (en) * 2018-08-14 2019-01-22 北京信安世纪科技股份有限公司 A kind of method and device generating report
CN109684159A (en) * 2018-09-07 2019-04-26 平安普惠企业管理有限公司 Method for monitoring state, device, equipment and the storage medium of distributed information system
CN109240889A (en) * 2018-09-20 2019-01-18 武汉指针科技有限公司 A kind of server visualization control device and method
CN110927475A (en) * 2018-09-20 2020-03-27 湖南省计量检测研究院 Large-scale industrial equipment electromagnetic wave interference field test system
CN109408312B (en) * 2018-11-01 2021-10-29 郑州云海信息技术有限公司 Server operating temperature test system and equipment
CN111176956A (en) * 2018-11-09 2020-05-19 广州贵冠科技有限公司 Monitoring device
CN109286532B (en) * 2018-11-28 2021-07-27 郑州云海信息技术有限公司 Management method and device for alarm information in cloud computing system
CN109634982A (en) * 2018-12-14 2019-04-16 万翼科技有限公司 Method for generating alarm, device and computer readable storage medium
CN109842538B (en) * 2019-01-03 2021-03-30 苏州佳世达光电有限公司 Information prompting system
CN109871301A (en) * 2019-02-12 2019-06-11 中电科仪器仪表有限公司 A kind of remote test device, system and method
CN109918274A (en) * 2019-02-26 2019-06-21 武汉钢铁工程技术集团有限责任公司 Computer server operation monitoring system and method
CN110097113B (en) * 2019-04-26 2021-03-26 北京奇艺世纪科技有限公司 Method, device and system for monitoring working state of display information delivery system
CN110427294B (en) * 2019-06-17 2022-07-29 平安普惠企业管理有限公司 System integration environment monitoring method, apparatus, readable storage medium and program product
CN110888780A (en) * 2019-11-19 2020-03-17 泰康保险集团股份有限公司 Application monitoring method, device, equipment and storage medium
CN110932918B (en) * 2019-12-26 2023-01-10 远景智能国际私人投资有限公司 Log data acquisition method and device and storage medium
CN111800425B (en) * 2020-07-07 2023-03-24 中国人民解放军陆军防化学院 Embedded data acquisition and processing system
CN111817893A (en) * 2020-07-10 2020-10-23 深圳供电局有限公司 Fortune transferring system based on IT map visualization
CN112084090B (en) * 2020-09-03 2024-02-23 深信服科技股份有限公司 Server management method, server, management terminal and storage medium
CN112398872A (en) * 2020-12-04 2021-02-23 王志东 Service system operation abnormity identification system and identification method based on state log
CN112802276A (en) * 2020-12-17 2021-05-14 航天信息股份有限公司 Tax self-service terminal monitoring system
CN112839413A (en) * 2021-02-01 2021-05-25 山东泰和能源股份有限公司 Illumination monitoring system
CN113411398B (en) * 2021-06-18 2022-02-18 全方位智能科技(南京)有限公司 Big data-based file cleaning writing and cleaning management system and method
CN113554191A (en) * 2021-07-27 2021-10-26 桂林海威科技股份有限公司 Remote monitoring method and system for equipment with light source
CN116401128B (en) * 2023-06-06 2023-08-08 四川观想科技股份有限公司 Big data-based information operation and maintenance management system
CN117579574B (en) * 2024-01-15 2024-03-29 南京荧火泰讯信息科技有限公司 Multichannel baseband signal processing method based on broadband

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102075384A (en) * 2010-12-20 2011-05-25 创新科存储技术有限公司 Performance test system and method
CN102609789A (en) * 2012-02-21 2012-07-25 复旦大学 Information monitoring and abnormality predicting system for library
CN103036713A (en) * 2012-12-07 2013-04-10 福建亿榕信息技术有限公司 System health degree inspection method based on Domino platform

Also Published As

Publication number Publication date
CN106815115A (en) 2017-06-09

Similar Documents

Publication Publication Date Title
CN106815115B (en) Server running state monitoring system
CA3118098C (en) Alerting, diagnosing, and transmitting computer issues to a technical resource in response to a dedicated physical button or trigger
CN105159964A (en) Log monitoring method and system
CN105335271A (en) State monitoring apparatus and comprehensive monitoring system and method
CN100549975C (en) Computer maintenance support system and analysis server
CN110808856A (en) Big data operation and maintenance method and system based on data center
CN106506248B (en) intelligent monitoring system for server
US8601318B2 (en) Method, apparatus and computer program product for rule-based directed problem resolution for servers with scalable proactive monitoring
CN105825641A (en) Service alarm method and apparatus
JP4842738B2 (en) Fault management support system and information management method thereof
CN111258857A (en) Server state monitoring method and system
CN108288997A (en) A kind of transmission network luminous power automated collection systems
CN103684812B (en) Method and apparatus for managing remote equipment
CN103166779A (en) Alarm confirming and processing method and device based on mobile terminal
US20130290224A1 (en) System or Solution Index Fault - Assessment, Identification, Baseline, and Alarm Feature
CN116594840A (en) Log fault acquisition and analysis method, system, equipment and medium based on ELK
JP2009187230A (en) Monitoring device for server
CN109450103A (en) Condition detection method, device and the intelligent terminal of pressing plate
CN105827447A (en) Service alarm method and apparatus
CN116340690B (en) Page abnormality detection method and device and computer equipment
CN109144765B (en) Report generation method, report generation device, computer equipment and storage medium
CN112463526A (en) Method and related device for acquiring server state
KR101064434B1 (en) Remote Configuration Management System for IP Network Element
CN117271265A (en) IT monitoring device and monitoring method
CN114327967A (en) Equipment repairing method and device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200114

Address after: 215100 No. 1 Guanpu Road, Guoxiang Street, Wuzhong Economic Development Zone, Suzhou City, Jiangsu Province

Applicant after: Suzhou Wave Intelligent Technology Co., Ltd.

Address before: 450000 Henan province Zheng Dong New District of Zhengzhou City Xinyi Road No. 278 16 floor room 1601

Applicant before: Zhengzhou Yunhai Information Technology Co. Ltd.

GR01 Patent grant