CN104601673B - Extensible high-availability server layered monitoring system - Google Patents

Extensible high-availability server layered monitoring system

Info

Publication number
CN104601673B
Authority
CN
China
Prior art keywords
module
monitoring
deployment
management
management module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410835821.7A
Other languages
Chinese (zh)
Other versions
CN104601673A (en)
Inventor
徐鑫朋
袁铭
王欢
曾志强
梁鲁娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
No32 Research Institute Of China Electronics Technology Group Corp
Original Assignee
No32 Research Institute Of China Electronics Technology Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by No32 Research Institute Of China Electronics Technology Group Corp
Priority to CN201410835821.7A
Publication of CN104601673A
Application granted
Publication of CN104601673B
Legal status: Active (current)
Anticipated expiration

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention provides an extensible high-availability server layered monitoring system comprising a system monitoring module, a fault warning module, a data statistics and analysis module, an extension module, an image file deployment module, a system deployment state display module, a user management module, a shutdown management module, a process management module, a file management module and a multi-node console, all of which are connected in sequence. The invention adopts a layered agent architecture to provide remote monitoring and management of domestic cluster servers and to reduce the influence of environmental factors on management personnel.

Description

Extensible high-availability server layered monitoring system
Technical field
The present invention relates to a monitoring system, and in particular to an extensible high-availability server layered monitoring system.
Background art
Domestic cluster servers and data centers are trending toward the cloud. In environments such as cluster blade servers or large data centers, factors such as noise and radiation make it impractical to monitor and manage cluster blade servers on site. Earlier remote management technologies each focus on only some aspects of server management, and their functionality is not comprehensive enough.
Summary of the invention
In view of the defects in the prior art, the object of the present invention is to provide an extensible high-availability server layered monitoring system that uses a layered agent architecture to provide remote monitoring and management of domestic cluster servers, reducing the influence of environmental factors on management personnel.
According to one aspect of the present invention, there is provided an extensible high-availability server layered monitoring system, characterized in that it comprises a system monitoring module, a fault warning module, a data statistics and analysis module, an extension module, an image file deployment module, a system deployment state display module, a user management module, a shutdown management module, a process management module, a file management module and a multi-node console, all of which are connected in sequence.
Preferably, the system monitoring module, fault warning module, data statistics and analysis module and extension module form a monitoring subsystem.
Preferably, the system monitoring module performs monitoring on the hosts, system monitoring sends alarm emails to users through the fault warning module, and the data statistics and analysis module provides analysis and graphical output of the monitoring data.
Preferably, the extension module allows users to define the plug-ins they need.
Preferably, the image file deployment module and the system deployment state display module form a deployment subsystem.
Preferably, the image file deployment module implements batch distribution and deletion of image files on the cluster server.
Preferably, the user management module, shutdown management module, process management module, file management module and multi-node console form a management subsystem.
Preferably, the user management module provides functions for viewing, adding and deleting users according to different user requirements.
Compared with the prior art, the present invention has the following beneficial effects. The invention improves the monitoring of domestic cluster server blades. Using layered monitoring technology, it provides comprehensive monitoring and management from the blade's underlying hardware through the operating system to service applications. It also provides remote server management, so that management personnel can work remotely throughout the whole process, including operating system installation, server blade operation monitoring, fault alarms, remote file browsing, remote console and remote process management. The invention uses a B/S (Browser/Server) architecture. The biggest advantage of B/S is that operations can be carried out from anywhere without installing any special software: any computer with network access can be used, the client requires zero installation and zero maintenance, and the system is easy to extend. The invention meets the need for remote monitoring, remote deployment and remote control in harsh environments, and improves both the working efficiency of management personnel and the overall operating efficiency of domestic cluster servers.
Brief description of the drawings
Other features, objects and advantages of the invention will become more apparent from the following detailed description of non-limiting embodiments, read with reference to the accompanying drawing:
Fig. 1 is a schematic block diagram of the extensible high-availability server layered monitoring system of the present invention.
Detailed description
The present invention is described in detail below with reference to specific embodiments. The following embodiments will help those skilled in the art to further understand the invention, but do not limit it in any way. It should be noted that those of ordinary skill in the art can make various modifications and improvements without departing from the inventive concept. These all fall within the scope of protection of the invention.
As shown in Fig. 1, the extensible high-availability server layered monitoring system of the invention comprises a system monitoring module, a fault warning module, a data statistics and analysis module, an extension module, an image file deployment module, a system deployment state display module, a user management module, a shutdown management module, a process management module, a file management module and a multi-node console, all of which are connected in sequence.
The system monitoring module, fault warning module, data statistics and analysis module and extension module form a monitoring subsystem. System monitoring focuses on whether a service is normal "now", which is an instantaneous state. By monitoring and alarming on this state, administrators can deal with host or service failures promptly. Administrators, however, often also care about host performance and service response times. These are continuously changing curves rather than single real-time values, and analyzing them by reading log data is both tedious and abstract, so a graphical interface is used to make the display concrete and intuitive. In system monitoring, all monitoring and detection of servers and services is done by plug-ins. Once the system monitoring service is started, it periodically calls the plug-ins to check server state. System monitoring also maintains a queue: the status information returned by every plug-in enters the queue, and system monitoring reads from the head of the queue each time, processes the information, and displays the resulting state on the web. System monitoring judges the state of a monitored object from the value returned by the plug-in. On the one hand, the state is shown on the web so that administrators can find failures in time; on the other hand, administrators can be notified by email or other means according to a predefined fault warning policy. After system monitoring obtains status information from the server side, it stores the data and produces performance graphs through a data output tool, and it also keeps the information on the system monitoring server to answer requests from the administrator's terminal.
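The polling and queue behavior described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `Monitor` class, the numeric status codes, and the plug-in signature are assumptions introduced here, since the description does not specify them.

```python
from collections import deque
from typing import Callable, Deque, Dict, List, Tuple

# Hypothetical status codes; the description does not define a numbering.
OK, WARNING, CRITICAL = 0, 1, 2

# A plug-in is assumed to be a callable returning (status code, message).
Plugin = Callable[[], Tuple[int, str]]


class Monitor:
    """Calls plug-ins, queues their results, and processes them in order."""

    def __init__(self, plugins: Dict[str, Plugin]) -> None:
        self.plugins = plugins
        self.queue: Deque[Tuple[str, int, str]] = deque()
        # Latest state per monitored object, standing in for the web display.
        self.display: Dict[str, Tuple[int, str]] = {}

    def poll_once(self) -> None:
        """Run every plug-in once and append its result to the queue."""
        for name, plugin in self.plugins.items():
            status, message = plugin()
            self.queue.append((name, status, message))

    def process_queue(self) -> List[str]:
        """Read from the head of the queue; return names that need an alarm."""
        alerts = []
        while self.queue:
            name, status, message = self.queue.popleft()
            self.display[name] = (status, message)
            if status != OK:
                alerts.append(name)
        return alerts
```

A scheduler would call `poll_once()` at a fixed interval and hand the names returned by `process_queue()` to whatever alarm policy is configured.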
The system monitoring module performs monitoring on the hosts, for example monitoring a server's processes and disk usage. A monitored-side daemon is started on each monitored host and listens. When it receives a command from the monitoring server, for example to check hard disk usage on that host, it executes the command and passes the information back to the monitoring server. To use a not entirely appropriate metaphor, this is how a Trojan horse works. The daemon is an external component package that can execute plug-ins on remote Linux/Unix hosts, making it possible to monitor local resources or attributes of remote hosts, such as disk utilization, CPU load and memory utilization.
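As a rough sketch of the monitored-side behavior, the code below maps a command name received from the monitoring server to a local check and returns the result. The command name `check_disk`, the thresholds, and the status codes are illustrative assumptions; the network listener itself is omitted.

```python
import shutil
from typing import Callable, Dict, Tuple

# Hypothetical status codes, not taken from the description.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def classify(used_pct: float, warn_pct: float = 80.0, crit_pct: float = 90.0) -> int:
    """Map a utilization percentage to a status code using assumed thresholds."""
    if used_pct >= crit_pct:
        return CRITICAL
    if used_pct >= warn_pct:
        return WARNING
    return OK


def check_disk(path: str = "/") -> Tuple[int, str]:
    """Report disk utilization for the file system containing `path`."""
    usage = shutil.disk_usage(path)
    used_pct = 100.0 * usage.used / usage.total if usage.total else 0.0
    return classify(used_pct), f"disk {path} {used_pct:.1f}% used"


# Commands the daemon is willing to execute; anything else is rejected.
COMMANDS: Dict[str, Callable[[], Tuple[int, str]]] = {"check_disk": check_disk}


def handle_request(command: str) -> Tuple[int, str]:
    """Execute a requested check and return its result to the caller."""
    handler = COMMANDS.get(command)
    if handler is None:
        return UNKNOWN, f"unknown command: {command}"
    try:
        return handler()
    except OSError as exc:
        return UNKNOWN, f"{command} failed: {exc}"
```

Restricting execution to an explicit command table is a design choice made for this sketch, so that the daemon never runs arbitrary strings sent over the network.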
While monitoring hosts and services, system monitoring can alert users by email. The policy for what currently needs an alarm is set according to the state of the servers that system monitoring observes. When a monitored device is found to be faulty, system monitoring sends an alarm email to the user through the fault warning module. The fault warning module may include a mail alarm submodule, which uses a lightweight command-line SMTP mail client; only one mail server is needed in the whole network to implement mail push.
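A mail alarm of this kind might look like the sketch below, which uses Python's standard `smtplib` and `email` modules rather than the command-line SMTP client the description mentions. The subject format, addresses, and host names are placeholders chosen for illustration.

```python
import smtplib
from email.message import EmailMessage


def build_alarm(host: str, service: str, state: str, detail: str,
                sender: str, recipient: str) -> EmailMessage:
    """Compose an alarm email; the subject layout is an assumption."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = f"[{state}] {service} on {host}"
    msg.set_content(f"Host: {host}\nService: {service}\nState: {state}\n{detail}\n")
    return msg


def send_alarm(msg: EmailMessage, smtp_host: str, smtp_port: int = 25) -> None:
    """Hand the message to the network's single mail server over SMTP."""
    with smtplib.SMTP(smtp_host, smtp_port) as smtp:
        smtp.send_message(msg)
```

For example, `send_alarm(build_alarm("blade-03", "disk", "CRITICAL", "92% used", "monitor@example.com", "admin@example.com"), "mail.example.com")` would push one alarm, assuming a reachable mail server at that placeholder address.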
The system monitoring main service implements status monitoring and fault warning for services and resources, and also monitors server performance data. Monitoring data is stored in the system monitoring statistics database, and the data statistics and analysis module provides analysis and graphical output of the monitoring data. The management module side provides external web-based PHP access through the web server, with data drawn from the system monitoring statistics data and the system monitoring analysis data. In this way the system monitoring data is stored centrally and displayed through a unified interface. Server management and report export functions can be implemented in PHP, integrated into the web server, and exposed through the relevant pages.
The function of system monitoring is to monitor services and hosts, and all of its monitoring and detection functions are performed by plug-ins. The same applies to alarms: a monitoring system that finds problems but cannot raise an alarm is of little use, so alarming is also one of the most important functions of system monitoring. The alarm function of system monitoring is likewise implemented as plug-ins. To give users broad extensibility, an extension module is provided that allows users to define the plug-ins they need without restriction, which increases the flexibility, ease of use and extensibility of system monitoring.
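One common way to let users add their own plug-ins is a registry that new checks attach themselves to. The sketch below is an assumption about how such an extension point could look, not the mechanism the patent specifies; `register`, `run_all`, and the example plug-in are hypothetical names.

```python
import tempfile
from typing import Callable, Dict, Tuple

# A plug-in is assumed to return (status code, message); 0 means healthy.
Plugin = Callable[[], Tuple[int, str]]

_REGISTRY: Dict[str, Plugin] = {}


def register(name: str) -> Callable[[Plugin], Plugin]:
    """Decorator that adds a user-defined plug-in under a unique name."""
    def decorator(func: Plugin) -> Plugin:
        if name in _REGISTRY:
            raise ValueError(f"plug-in already registered: {name}")
        _REGISTRY[name] = func
        return func
    return decorator


@register("tmp_writable")
def check_tmp_writable() -> Tuple[int, str]:
    """Example user plug-in: verify that the temp directory accepts writes."""
    try:
        with tempfile.TemporaryFile():
            pass
        return 0, "temp directory writable"
    except OSError as exc:
        return 2, f"temp directory not writable: {exc}"


def run_all() -> Dict[str, Tuple[int, str]]:
    """Run every registered plug-in and collect the results by name."""
    return {name: plugin() for name, plugin in _REGISTRY.items()}
```

With this shape, adding a new check requires only writing a function and decorating it; the monitoring core does not change.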
The image file deployment module and the system deployment state display module form a deployment subsystem. System deployment for the cluster server is an indispensable part of cluster management. It mainly includes distributed image deployment, operating system image management and deployment monitoring. System deployment is implemented as a remote data synchronization tool that can quickly synchronize files between multiple hosts over a LAN/WAN. It uses a so-called "system deployment algorithm" to synchronize files between a local host and a remote host. The algorithm transmits only the parts of the two files that differ, rather than the whole file each time, so it is quite fast.
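The description does not spell out the "system deployment algorithm". To illustrate the general idea of sending only the differing parts, the sketch below compares fixed-size block hashes; this is a deliberate simplification and is neither the patent's algorithm nor a rolling-checksum scheme. The block size and function names are assumptions.

```python
import hashlib
from typing import Dict, List

BLOCK = 4  # Tiny block size so the example is easy to follow.


def block_hashes(data: bytes, block: int = BLOCK) -> List[str]:
    """Hash each fixed-size block of the receiver's existing copy."""
    return [hashlib.sha256(data[i:i + block]).hexdigest()
            for i in range(0, len(data), block)]


def make_delta(new: bytes, old_hashes: List[str], block: int = BLOCK) -> Dict:
    """On the sender: keep only blocks whose hash differs from the receiver's."""
    changed = {}
    for idx, start in enumerate(range(0, len(new), block)):
        chunk = new[start:start + block]
        if idx >= len(old_hashes) or hashlib.sha256(chunk).hexdigest() != old_hashes[idx]:
            changed[idx] = chunk
    return {"length": len(new), "blocks": changed}


def apply_delta(old: bytes, delta: Dict, block: int = BLOCK) -> bytes:
    """On the receiver: rebuild the new file from old blocks plus the delta."""
    out = bytearray()
    n_blocks = (delta["length"] + block - 1) // block
    for idx in range(n_blocks):
        if idx in delta["blocks"]:
            out += delta["blocks"][idx]
        else:
            out += old[idx * block:(idx + 1) * block]
    return bytes(out[:delta["length"]])
```

Only the changed blocks travel over the network, which is why a small edit to a large image costs far less than a full retransmission.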
The image file deployment module implements batch operations such as distributing and deleting image files on the cluster server. The deployment subsystem provides the following functions: it can deploy entire directory trees and file systems, and can deploy operating system images and policy files in batches; it can preserve the permissions, timestamps, and soft and hard links of the original files; it can be installed without special privileges; its optimized workflow gives high file transfer efficiency; and it supports anonymous transfer. Image file deployment can run over rsh or ssh, or in daemon mode. When run in daemon mode, the system deployment service opens port 873 and waits for system deployment clients to connect. On connection, the deployment service checks whether the password matches; if the check passes, image file transfer can begin. When the first connection completes, the whole set of files is transferred once. After that, a directory is monitored using inotify, and when the directory monitored by the system deployment service changes, the changed files are deployed to the system deployment client in daemon mode. The image file deployment module works as follows: a directory is monitored using inotify, and if files in the directory are added, deleted or modified, the system deployment daemon is started; the system deployment server builds a FileList containing the file information and id of every file that needs to be deployed to the system deployment client (the id uniquely identifies a file, for example by its MD5); the system deployment server sends the FileList to the system deployment client; the background program running on the system deployment client processes the FileList and builds a NewFileList, in which, based on the MD5 comparison, entries for files that already exist on the client machine are removed and only files that are missing or changed on the system deployment client are kept; the system deployment server obtains the NewFileList and retransmits the files listed in it to the system deployment client.
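The FileList and NewFileList exchange can be sketched as below. The dictionary layout (relative path mapped to an MD5 digest) and the function names are assumptions made for illustration; the description only states that each file has an id such as its MD5.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def build_file_list(root: Path) -> Dict[str, str]:
    """Server side: map each file's relative path to the MD5 of its content."""
    return {
        path.relative_to(root).as_posix(): hashlib.md5(path.read_bytes()).hexdigest()
        for path in sorted(root.rglob("*"))
        if path.is_file()
    }


def build_new_file_list(file_list: Dict[str, str], client_root: Path) -> List[str]:
    """Client side: keep only files that are missing or differ locally."""
    local = build_file_list(client_root)
    return [name for name, digest in file_list.items() if local.get(name) != digest]
```

The server would then transmit only the paths returned by `build_new_file_list`, so unchanged files never cross the network.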
During system deployment, the "-process" option provided by system deployment shows the transfer progress. The transfer progress is obtained, and image deployment is monitored, through PHP combined with ssh2. The system deployment state display module displays the state of system deployment.
The user management module, shutdown management module, process management module, file management module and multi-node console form a management subsystem.
The user management module has the following functions: according to different user requirements, it provides functions for viewing, adding and deleting users. Under a LAMP (Linux, Apache, MySQL, PHP) architecture, viewing, adding and similar user functions are implemented through PHP combined with ssh2. The user submits a user management request form to the web server, and the server places the request data in the database. The web server manages the user state of each child node in the cluster server through ssh2, the current user state is returned to the web server, and the front end displays it to the user.
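The request flow described above (form submitted, request stored in a database, action carried out on the node, state returned) can be sketched with an in-memory SQLite database in place of MySQL and an injected `execute` callable in place of the ssh2 call. The table layout and all names are assumptions made for illustration.

```python
import sqlite3
from typing import Callable, Dict

ALLOWED_ACTIONS = ("list", "add", "delete")


def open_db() -> sqlite3.Connection:
    """Create the request table; an in-memory database stands in for MySQL."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE requests (id INTEGER PRIMARY KEY, node TEXT, action TEXT, "
        "username TEXT, status TEXT DEFAULT 'pending')"
    )
    return db


def submit(db: sqlite3.Connection, node: str, action: str, username: str) -> int:
    """Store a user management request, as the web server does with a form."""
    if action not in ALLOWED_ACTIONS:
        raise ValueError(f"unsupported action: {action}")
    cur = db.execute(
        "INSERT INTO requests (node, action, username) VALUES (?, ?, ?)",
        (node, action, username),
    )
    db.commit()
    return cur.lastrowid


def process_pending(db: sqlite3.Connection,
                    execute: Callable[[str, str, str], str]) -> Dict[int, str]:
    """Run each pending request on its node and record it as done."""
    rows = db.execute(
        "SELECT id, node, action, username FROM requests "
        "WHERE status = 'pending' ORDER BY id"
    ).fetchall()
    results = {}
    for req_id, node, action, username in rows:
        results[req_id] = execute(node, action, username)
        db.execute("UPDATE requests SET status = 'done' WHERE id = ?", (req_id,))
    db.commit()
    return results
```

In a real deployment, `execute` would open an SSH session to the node and run the corresponding account command; it is left as a parameter here so the flow can be exercised without any network access.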
The shutdown management module has the following functions: a cluster server sometimes needs a particular service node to be restarted or shut down, so server management supports remote reset and power on/off operations for the whole server or for a specified node. These operations are implemented by the shutdown management module. Under a LAMP (Linux, Apache, MySQL, PHP) architecture, remote reset and power on/off are implemented through PHP combined with ssh2. The user submits a shutdown management request form to the web server, and the server places the request data in the database. The web server manages the user state of each child node in the cluster server through ssh2, the current user state is returned to the web server, and the front end displays it to the user. The process management module implements remote process management.
The process management module has the following functions: for different application scenarios, server management provides functions for viewing and deleting the processes of the relevant nodes. Under a LAMP (Linux, Apache, MySQL, PHP) architecture, viewing and deleting the processes of a specified node are implemented through PHP combined with ssh2. The user submits a process management request form to the web server, and the server places the request data in the database. The web server manages the process information of each child node in the cluster server through ssh2, the current process information is returned to the web server, and the front end displays it to the user.
The file management module has the following functions: under a LAMP (Linux, Apache, MySQL, PHP) architecture, browsing and managing a server's file system are implemented by logging in to the server remotely over FTP. The user logs in to the specified server over FTP, each server node runs an FTP service, and after login the file system information of the server node is displayed to the user in the front end.
The multi-node console has the following functions: most of the time a cluster server needs batch remote control of processes on certain nodes, so server management provides the ability to execute console commands in batches. Under a LAMP (Linux, Apache, MySQL, PHP) architecture, the multi-node console is implemented through PHP combined with ssh2. The user selects the server nodes to connect to through the interface and submits a console command request form to the web server, and the server places the control command in the database. The web server manages each child node in the cluster server through ssh2, the execution state is returned to the web server, and the front end displays the execution state to the user.
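Batch execution across nodes can be sketched as below. The description implements this with PHP and ssh2; this Python version is only an illustration, and the function names, the use of a thread pool, and the injected `runner` are assumptions. `ssh_runner` shells out to the system `ssh` client and is shown for completeness.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable, List


def ssh_argv(node: str, command: str) -> List[str]:
    """Build an ssh command line; BatchMode avoids interactive prompts."""
    return ["ssh", "-o", "BatchMode=yes", node, command]


def ssh_runner(node: str, command: str) -> str:
    """Run the command on one node through the system ssh client."""
    done = subprocess.run(ssh_argv(node, command), capture_output=True,
                          text=True, timeout=30)
    return done.stdout


def run_on_nodes(nodes: Iterable[str], command: str,
                 runner: Callable[[str, str], str],
                 max_workers: int = 8) -> Dict[str, str]:
    """Run one console command on many nodes and collect per-node results."""
    results: Dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {node: pool.submit(runner, node, command) for node in nodes}
        for node, future in futures.items():
            try:
                results[node] = future.result()
            except Exception as exc:  # One failing node must not stop the batch.
                results[node] = f"error: {exc}"
    return results
```

Calling `run_on_nodes(["blade-01", "blade-02"], "uptime", ssh_runner)` would return the output per node, with failures reported in place rather than aborting the whole batch.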
The invention uses a web server as the management node of the blade server, which can perform system deployment and server management for the blade server. The online blade with the smallest slot number in the blade server is selected as the management node and acts as the master node for system monitoring, image deployment and server management; the other nodes are managed nodes. The management node hosts the monitoring side of the monitoring alarm module, the server side of the image deployment module and the server-side software of the server management module. The managed nodes host the monitored side of the monitoring alarm module, the client of the image deployment module and the client software of the server management module. This provides system monitoring, monitoring alarms, and data storage and analysis for the blade server, while making use of the management node's web server.
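The selection rule (the online blade with the smallest slot number becomes the management node) is simple enough to state directly in code. The `Blade` type and its fields are hypothetical names used only for this sketch.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Blade:
    slot: int      # Slot number of the blade in the chassis.
    online: bool   # Whether the blade is currently online.


def select_management_node(blades: Iterable[Blade]) -> Optional[Blade]:
    """Return the online blade with the smallest slot number, if any."""
    online = [blade for blade in blades if blade.online]
    return min(online, key=lambda blade: blade.slot) if online else None
```

Because the rule depends only on which blades are online, re-running it after a failure of the current management node picks the next lowest online slot.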
Specific embodiments of the present invention have been described above. It should be understood that the invention is not limited to the particular embodiments described; those skilled in the art can make various variations or modifications within the scope of the claims without affecting the substance of the invention.

Claims (1)

1. An extensible high-availability server layered monitoring system, characterized in that it comprises a system monitoring module, a fault warning module, a data statistics and analysis module, an extension module, an image file deployment module, a system deployment state display module, a user management module, a shutdown management module, a process management module, a file management module and a multi-node console, all of which are connected in sequence; the system monitoring module, fault warning module, data statistics and analysis module and extension module form a monitoring subsystem; the system monitoring module performs monitoring on the hosts, system monitoring sends alarm emails to users through the fault warning module, and the data statistics and analysis module provides analysis and graphical output of the monitoring data; the extension module allows users to define the plug-ins they need; the image file deployment module and the system deployment state display module form a deployment subsystem; the image file deployment module implements batch distribution and deletion of image files on the cluster server; the user management module, shutdown management module, process management module, file management module and multi-node console form a management subsystem; the user management module provides functions for viewing, adding and deleting users according to different user requirements.
CN201410835821.7A 2014-12-23 2014-12-23 Extensible high-availability server layered monitoring system Active CN104601673B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410835821.7A CN104601673B (en) 2014-12-23 2014-12-23 Extensible high-availability server layered monitoring system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410835821.7A CN104601673B (en) 2014-12-23 2014-12-23 Extensible high-availability server layered monitoring system

Publications (2)

Publication Number Publication Date
CN104601673A CN104601673A (en) 2015-05-06
CN104601673B true CN104601673B (en) 2018-01-30

Family

ID=53127167

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410835821.7A Active CN104601673B (en) 2014-12-23 2014-12-23 Extensible high-availability server layered monitoring system

Country Status (1)

Country Link
CN (1) CN104601673B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107277094A (en) * 2016-04-08 2017-10-20 北京黎阳之光科技有限公司 A kind of graph image compressibility
CN106250764A (en) * 2016-08-04 2016-12-21 四川网格新通科技有限公司 A kind of terminal control system
CN108363653B (en) * 2018-02-07 2020-08-07 平安科技(深圳)有限公司 Deployment method and device of monitoring system, computer equipment and storage medium
CN110908741A (en) * 2018-09-14 2020-03-24 阿里巴巴集团控股有限公司 Application performance management display method and device
CN112468212B (en) * 2020-11-04 2022-10-04 北京遥测技术研究所 High-availability servo system of all-weather unattended measurement and control station
CN114415575B (en) * 2022-01-27 2023-05-05 电子科技大学 Real-time data-driven welding workshop three-dimensional virtual monitoring and intelligent early warning system
CN117319196B (en) * 2023-10-26 2024-05-14 无锡芯光互连技术研究院有限公司 User server cluster environment deployment management system and method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101072129A (en) * 2007-06-25 2007-11-14 北京邮电大学 JMX based network service management method and its application system

Also Published As

Publication number Publication date
CN104601673A (en) 2015-05-06

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant