Server remote monitoring and emergency disposal system and method
Technical field
The present invention relates to a communication operation and maintenance (O&M) system, and specifically to a remote server monitoring system.
Background technology
With the implementation of the SG186 informatization project of State Grid Corporation of China, informatization has advanced rapidly. Information systems have been integrated into every department involved in the production and operation of power grid enterprises and have become an indispensable part of those activities. As the number of servers keeps growing, the equipment operation and maintenance workload grows with it, as does the workload of the O&M personnel, and conventional O&M methods cannot ensure the safe operation of several hundred servers. When an information machine room encounters an unpredictable emergency such as a server abnormality, an air-conditioning failure or water ingress, operators need to shut down the running servers quickly to avoid major incidents such as data loss and equipment damage. Because a machine room holds a large number of servers running different operating systems, shutting them down by conventional means is complicated and time-consuming and involves many human factors. The time needed to shut down is therefore unpredictable, which creates a serious safety hazard and may cause incalculable economic loss and social impact.
Monitoring server equipment is an important prerequisite for ensuring the safe and stable operation of business systems. Existing O&M systems already monitor the running status and data of servers, but certain problems remain: they cannot monitor the operating data of every server in the machine room in real time; they cannot actively report alarm signals; and when an unexpected emergency occurs in the machine room, they cannot shut the servers down in time, so the integrity of the data on the servers cannot be guaranteed.
Summary of the invention
The technical problem to be solved by the present invention is to provide a server remote monitoring and emergency disposal system and method that not only monitors the operating data of servers in a communication machine room in real time and actively reports alarm signals, but also performs one-touch remote batch shutdown when an unexpected emergency occurs in the machine room, so as to protect the integrity of the data on the servers.
To solve the above technical problem, the present invention adopts the following technical solution:
A server remote monitoring and emergency disposal system comprises a number of monitored clients and a master station for monitoring the clients, with data transmitted between the clients and the master station over TCP/IP (Transmission Control Protocol/Internet Protocol). Each client is provided with a data link module, a data monitoring module, a process monitoring module, an alarm monitoring module and an emergency disposal module; the data monitoring module, the process monitoring module and the alarm monitoring module are each interconnected with the data link module. The master station is provided with a communication link module, a data reception module, a data analysis module, a data output module and an emergency control module. The communication link module is interconnected with the data link module of the client; the output of the communication link module is connected to the data analysis module through the data reception module; the output of the data analysis module is connected to the data output module and the emergency control module respectively; and the output of the emergency control module is connected to the emergency disposal module of the client.
A server remote monitoring and emergency disposal method is implemented on the basis of the server remote monitoring and emergency disposal system, wherein the system comprises a number of monitored clients and a master station for monitoring the clients; each client is provided with a data link module, a data monitoring module, a process monitoring module, an alarm monitoring module and an emergency disposal module; the master station is provided with a communication link module, a data reception module, a data analysis module, a data output module and an emergency control module; and the method specifically comprises the following steps:
First step, link request
When a client starts, its data link module actively initiates a data link request to the master station; the communication link module of the master station responds to this request and initiates a heartbeat link signal to all clients;
Second step, data acquisition
After the data link module of the client receives the link signal, the data monitoring module, process monitoring module and alarm monitoring module of the client start running synchronously, and the data they collect in real time is sent to the master station through the data link module;
Third step, data processing
The data reception module of the master station forwards the data collected by each monitoring module of the client, as it is received in real time, to the data analysis module for analysis; the analysis results obtained by the data analysis module are displayed intuitively by the data output module of the master station as curves, charts, tables or similar forms; the data analysis module also passes the analysis results to the emergency control module at the same time;
Fourth step, controlling the operating state of the client server
The emergency control module sends control instructions to the emergency disposal module of the client according to the signal it receives; the emergency disposal module of the client controls the operating state of the client server in real time according to those instructions.
A further improvement of the present invention is that, in the server remote monitoring and emergency disposal method, the link request of the first step proceeds as follows: after the client starts, it checks whether the network environment is available; if so, it accesses the server, establishes a communication link signal and sends a data link request. Meanwhile, the master station checks whether the network environment is available; if so, it starts listening and checks whether a client has connected. Once it finds that a client has connected successfully, the master station establishes a communication link signal to the client, and the client and the master station maintain the data link.
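The link request exchange described above can be illustrated with a minimal Python sketch over a loopback TCP connection. The message names `LINK_REQUEST` and `LINK_ACK`, the newline framing and the function names are assumptions made for illustration; the specification does not define a wire format.

```python
import socket
import threading

LINK_REQUEST = b"LINK_REQUEST\n"   # hypothetical message names, not from the source
LINK_ACK = b"LINK_ACK\n"


def master_accept_one(listener: socket.socket, linked: list) -> None:
    """Master station: accept one client and acknowledge its link request."""
    conn, addr = listener.accept()
    with conn:
        if conn.makefile("rb").readline() == LINK_REQUEST:
            conn.sendall(LINK_ACK)
            linked.append(addr)


def client_request_link(host: str, port: int, timeout: float = 2.0) -> bool:
    """Client: connect to the master station and send a data link request."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            sock.sendall(LINK_REQUEST)
            return sock.makefile("rb").readline() == LINK_ACK
    except OSError:
        # Network unavailable or master station not listening.
        return False


def demo() -> bool:
    """Run one master/client exchange on the loopback interface."""
    # Port 0 asks the operating system for any free port.
    with socket.create_server(("127.0.0.1", 0)) as listener:
        port = listener.getsockname()[1]
        linked: list = []
        master = threading.Thread(target=master_accept_one, args=(listener, linked))
        master.start()
        ok = client_request_link("127.0.0.1", port)
        master.join()
        return ok and len(linked) == 1
```

In this sketch a failed connection attempt stands in for the "network environment unavailable" branch: the client simply reports that no link was established.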
By adopting the above technical solution, the present invention achieves the following technical progress:
The present invention not only monitors the operating data of servers in a communication machine room in real time and actively reports alarm signals, but also performs one-touch remote batch shutdown when an unexpected emergency occurs in the machine room, so as to protect the integrity of the data on the servers. The data link module of the client uses heartbeat and handshake mechanisms: when either the master station or the client detects an abnormality in the data link, the link is forcibly disconnected, the site is cleaned up and resources are released. This both guarantees the stability of the data link between client and master station and allows the link state to be detected promptly, giving the system self-healing capability. The data analysis module of the master station analyzes server data continuously, and the collated results can be displayed intuitively, which makes it easy to analyze the operating data, processes and status of the servers. When an emergency occurs, the emergency control module of the master station promptly sends instructions to the emergency disposal module of the client, which performs protective handling of the client server or its running software, including shutting down and restarting the server.
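As an illustration of the heartbeat mechanism, the following Python sketch shows one way a master station might track heartbeats and forcibly drop silent links. The class name `HeartbeatMonitor`, the timeout value and the injectable clock are assumptions; the specification does not state a heartbeat interval or timeout.

```python
import time
from typing import Callable, Dict, List


class HeartbeatMonitor:
    """Tracks the last heartbeat per client and drops links that go silent."""

    def __init__(self, timeout_s: float,
                 clock: Callable[[], float] = time.monotonic):
        self.timeout_s = timeout_s          # assumed threshold, not from the source
        self._clock = clock
        self._last_seen: Dict[str, float] = {}

    def beat(self, client_id: str) -> None:
        """Record a heartbeat reply from a client."""
        self._last_seen[client_id] = self._clock()

    def drop_stale(self) -> List[str]:
        """Forcibly drop every link whose heartbeat is overdue and free its state."""
        now = self._clock()
        stale = [c for c, seen in self._last_seen.items()
                 if now - seen > self.timeout_s]
        for client_id in stale:
            del self._last_seen[client_id]  # stands in for closing the socket
        return stale

    def active(self) -> List[str]:
        """Client identifiers whose links are still considered healthy."""
        return sorted(self._last_seen)
```

Because each client's state is held and released independently, one failed link is removed without affecting the others, which is the property the text describes as self-healing.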
Brief description of the drawings
Fig. 1 is a structural block diagram of the server remote monitoring and emergency disposal system of the present invention;
Fig. 2 is a communication logic diagram of the link request of the present invention.
Embodiment
The present invention is described in further detail below with reference to the accompanying drawings:
A server remote monitoring and emergency disposal system, as shown in Fig. 1, comprises a number of monitored clients and a master station for monitoring the clients. The client is a server device located in the machine room, and the client program Client is installed on the client server; the master station is a server device located in the monitoring room, and the monitoring program Server is installed on the master station server. Data is transmitted between the clients and the master station over TCP/IP (Transmission Control Protocol/Internet Protocol), forming a remote monitoring and emergency disposal system based on the client/server (C/S) model.
The client comprises five functional modules: a data link module, a data monitoring module, a process monitoring module, an alarm monitoring module and an emergency disposal module. The master station also comprises five functional modules: a communication link module, a data reception module, a data analysis module, a data output module and an emergency control module.
The data monitoring module, process monitoring module and alarm monitoring module of the client are each interconnected with the data link module of the client. The communication link module of the master station is interconnected with the data link module of the client; the output of the communication link module is connected to the data analysis module through the data reception module; the output of the data analysis module is connected to the data output module and the emergency control module respectively; and the output of the emergency control module is connected to the emergency disposal module of the client.
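The connections between the master-station modules can be pictured as a simple pipeline. The Python sketch below is hypothetical wiring only: the class `MasterStation`, the dictionary record format and the rule "any alarm triggers a shutdown instruction" are assumptions introduced for illustration and are not taken from the specification.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class MasterStation:
    """Hypothetical wiring of the five master-station modules."""
    # Path to the client's emergency disposal module: (client_id, instruction).
    send_to_client: Callable[[str, str], None]
    displayed: List[Dict] = field(default_factory=list)

    def communication_link(self, client_id: str, payload: Dict) -> None:
        # Communication link module: hands incoming traffic to data reception.
        self.data_reception(client_id, payload)

    def data_reception(self, client_id: str, payload: Dict) -> None:
        # Data reception module: tags the record and forwards it for analysis.
        self.data_analysis({"client": client_id, **payload})

    def data_analysis(self, record: Dict) -> None:
        # Data analysis module: its result fans out to both downstream modules.
        result = {**record, "abnormal": bool(record.get("alarm"))}
        self.data_output(result)
        self.emergency_control(result)

    def data_output(self, result: Dict) -> None:
        # Data output module: stands in for curves, charts or tables.
        self.displayed.append(result)

    def emergency_control(self, result: Dict) -> None:
        # Emergency control module: assumed rule, shut down on any alarm.
        if result["abnormal"]:
            self.send_to_client(result["client"], "shutdown")
```

A short usage example: constructing `MasterStation(send_to_client=print)` and calling `communication_link("srv-2", {"alarm": "fan failure"})` would print the client identifier and the instruction while also recording the result for display.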
The server remote monitoring and emergency disposal method is implemented on the basis of the above server remote monitoring and emergency disposal system and specifically comprises the following steps:
First step, link request
When a client starts, the data link module in the client starts; this module is responsible for the communication link with the master station. Each time a server in the machine room starts up, the data link module of the client actively initiates a data link request to the master station server, and the communication link module of the master station responds to this request. To guarantee the reliability of data communication, the communication link module of the master station initiates a heartbeat link signal to all clients and checks the availability of all communication links in real time. When either the master station or a client detects an abnormality in the data link, the link is forcibly disconnected, the site is cleaned up and resources are released. This prevents a problem on one communication link from paralyzing the whole communication system and gives the system strong self-healing capability.
The data link request flow between the client and the master station is shown in Fig. 2. The client checks whether the network environment is available; if so, it accesses the server, establishes a communication link signal and sends a data link request. The master station checks whether the network environment is available; if so, it starts listening and checks whether a client has connected. Once it finds that a client has connected successfully, the master station establishes a communication link to the client, and the client and the master station maintain the data link. The functional modules of the master station and the client then keep running normally. If a communication abnormality occurs, the client and the master station each enter protective exception handling; when the handling is finished, the client initiates a link request to the master station again to establish a new communication link. The communication control logic continues to run and maintains the data link between the master station and the client modules.
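The recovery behaviour described above, protective handling followed by a renewed link request, can be sketched as a retry loop. In this Python sketch the `connect` and `cleanup` callables, the attempt limit and the exponential backoff are all assumptions; the specification only says that the client initiates a new link request after exception handling and does not state a retry policy.

```python
import time
from typing import Callable


def maintain_link(
    connect: Callable[[], bool],
    cleanup: Callable[[], None],
    max_attempts: int,
    base_delay_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Retry the link request until it succeeds.

    Returns the number of attempts used, or -1 if every attempt failed.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            if connect():
                return attempt
        except OSError:
            pass  # a network error is treated the same as a refused link
        cleanup()  # protective handling: clean up and release resources
        # Assumed backoff policy: double the wait each time, capped at 30 s.
        sleep(min(base_delay_s * 2 ** (attempt - 1), 30.0))
    return -1
```

The attempt limit is there only to keep the sketch bounded; a long-running client would more plausibly keep retrying for as long as the server is up.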
Second step, data acquisition
Once the client has established a stable communication link with the master station through the data link module, the data monitoring module, process monitoring module and alarm monitoring module of the client start running synchronously. The data monitoring module collects the basic data of the server on which it runs, including the system's basic configuration information, network configuration information and hardware configuration information. The process monitoring module monitors the running status and operating parameters of a given server process, such as CPU usage, memory demand and user connection information. The alarm monitoring module forwards the server's hardware and software alarm signals. The data collected by the data monitoring module, the process monitoring module and the alarm monitoring module is all sent to the master station through the data link module.
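A minimal Python sketch of the data monitoring side is given below, using only the standard library. The field names, the `kind` tag and the newline-terminated JSON framing are assumptions for illustration; the specification does not define a message format. Per-process CPU and memory figures are left out because they normally require an operating-system-specific source or a third-party library.

```python
import json
import os
import platform
import socket
import time


def collect_basic_info() -> dict:
    """Data monitoring module: a small sample of basic configuration data."""
    return {
        "hostname": socket.gethostname(),
        "os": platform.system(),
        "os_release": platform.release(),
        "machine": platform.machine(),
        "cpu_count": os.cpu_count(),
    }


def build_report(kind: str, body: dict) -> bytes:
    """Wrap a module's data in a newline-terminated JSON frame.

    The frame is what the data link module would send to the master station.
    """
    frame = {"kind": kind, "timestamp": time.time(), "body": body}
    return (json.dumps(frame) + "\n").encode("utf-8")
```

The same `build_report` helper could carry process or alarm data by passing a different `kind` value, so that all three monitoring modules share one path through the data link module.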
Third step, data processing
Once the master station and the client have established and maintain the communication link, the data reception module of the master station begins to receive the data sent to the master station by the monitoring modules on the client server. This includes the basic information of all equipment in the server room, the running information of each server process that the client automatically updates and uploads, and the interface information of each client server.
The received data is analyzed by the data analysis module of the master station to produce decision-support results. For example, when analyzing the CPU usage of a monitored server, the received data can be used to determine which periods of the day are peak usage periods, how CPU usage relates to processes and services, and so on. The analysis results obtained by the data analysis module are displayed intuitively by the data output module of the master station as curves, charts, tables or similar forms; the data analysis module also passes the analysis results to the emergency control module at the same time.
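The peak-period example can be made concrete with a short Python sketch. The input format, a list of (hour of day, CPU usage in percent) samples, and the threshold used to call an hour a peak are assumptions; the specification does not say how peak periods are determined.

```python
from collections import defaultdict
from statistics import mean
from typing import Dict, Iterable, List, Tuple


def hourly_cpu_average(samples: Iterable[Tuple[int, float]]) -> Dict[int, float]:
    """Average CPU usage (%) per hour of day from (hour, usage) samples."""
    buckets: Dict[int, List[float]] = defaultdict(list)
    for hour, usage in samples:
        buckets[hour].append(usage)
    return {hour: mean(values) for hour, values in sorted(buckets.items())}


def peak_hours(averages: Dict[int, float], threshold: float) -> List[int]:
    """Hours whose average usage meets or exceeds the assumed threshold."""
    return [hour for hour, avg in averages.items() if avg >= threshold]
```

For instance, with samples of 40 % and 60 % at 09:00 and 90 % and 80 % at 10:00, the hourly averages are 50 % and 85 %, and a threshold of 80 % marks only the 10:00 hour as a peak period.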
Fourth step, controlling the operating state of the client server
The emergency control module of the master station sends control instructions to the emergency disposal module of the client according to the signal it receives, so that protective emergency handling is applied to the machine room servers; the emergency disposal module of the client controls the operating state of the client server in real time according to those instructions. For example, when an emergency occurs, including when the data obtained by the data analysis module of the master station is abnormal or when an unexpected incident occurs in the client machine room, the emergency disposal module of the client, under the control of the master station, performs protective handling of the client server or its running software, including shutting down and restarting the server.
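The following Python sketch illustrates one possible shape for batch dispatch and for handling mixed operating systems on the client side. The instruction names `shutdown` and `restart`, the function names and the `send` callable are hypothetical; the command lines are the standard shutdown commands on Windows and Linux. The sketch only builds the command and does not execute it.

```python
from typing import Callable, Dict, Iterable, List

# Standard shutdown/restart command lines; the instruction names are hypothetical.
COMMANDS: Dict[str, Dict[str, List[str]]] = {
    "Windows": {"shutdown": ["shutdown", "/s", "/t", "0"],
                "restart": ["shutdown", "/r", "/t", "0"]},
    "Linux": {"shutdown": ["shutdown", "-h", "now"],
              "restart": ["shutdown", "-r", "now"]},
}


def disposal_command(os_name: str, instruction: str) -> List[str]:
    """Client emergency disposal module: translate an instruction for this OS."""
    try:
        return COMMANDS[os_name][instruction]
    except KeyError:
        raise ValueError(f"unsupported: {os_name}/{instruction}") from None


def batch_dispatch(clients: Iterable[str], instruction: str,
                   send: Callable[[str, str], None]) -> List[str]:
    """Master emergency control module: send one instruction to every client.

    Returns the clients that could not be reached.
    """
    failed = []
    for client in clients:
        try:
            send(client, instruction)
        except OSError:
            failed.append(client)  # keep going so one bad link cannot block the rest
    return failed
```

On a real client the returned command would be run with administrator privileges, for example through `subprocess.run`; `platform.system()` returns `"Windows"` or `"Linux"` and could supply the `os_name` argument.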