CN105227354A - Log-based method for monitoring and managing distributed system - Google Patents

Log-based method for monitoring and managing distributed system Download PDF

Info

Publication number
CN105227354A
CN105227354A CN201510561858.XA CN201510561858A CN105227354A CN 105227354 A CN105227354 A CN 105227354A CN 201510561858 A CN201510561858 A CN 201510561858A CN 105227354 A CN105227354 A CN 105227354A
Authority
CN
China
Prior art keywords
node
daily record
distributed system
monitoring
log
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510561858.XA
Other languages
Chinese (zh)
Inventor
张裕超
孙海峰
王传超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Software Group Co Ltd
Original Assignee
Inspur Software Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Software Group Co Ltd filed Critical Inspur Software Group Co Ltd
Priority to CN201510561858.XA priority Critical patent/CN105227354A/en
Publication of CN105227354A publication Critical patent/CN105227354A/en
Pending legal-status Critical Current

Links

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The invention provides a log-based method for monitoring and managing a distributed system, which is characterized by comprising the following steps: monitoring and managing each node of the distributed system through a unified monitoring system; the log of the original program is used for monitoring, and the existing program is not required to be modified; and performing log monitoring by deploying a Probe open source application. The invention enables the manager to monitor the operation state of each node in the unified management system and check the log of each node at any time under the condition of not modifying the existing system. And directly maintaining each node through a management system. The configuration maintenance work which is repeated in a large amount in a plurality of servers is avoided, and the working efficiency is greatly improved.

Description

A kind of method to distributed system monitoring management based on daily record
Technical field
The present invention relates to the management maintenance field of distributed system, be specifically related to a kind of method to distributed system monitoring management based on daily record.
Background technology
With the develop rapidly of person the Internet, single node disposes the needs that the system run cannot meet enterprise, and distributed system is arisen at the historic moment.
Initial is repeat to log in each node of distributed system to the management method of distributed system, and check the operation conditions of each node, each node manages separately.When distributed system is very huge time, this way to manage will not catch up with the needs of O&M, takes time and effort, and easily misses certain node because of error yet.Cost and the difficulty of O&M are multiplied.
Summary of the invention
Technical assignment of the present invention is for the deficiencies in the prior art, provides a kind of method to distributed system monitoring management based on daily record.The present invention serves distributed system, when not revising existing system, allowing administrative staff in system for unified management, monitor the operation conditions of each node, checking each node log at any time.Directly each node is safeguarded by management system.
The technical solution adopted for the present invention to solve the technical problems is:
Based on the method to distributed system monitoring management of daily record, comprising:
Monitor by unified supervision system each node to distributed system, manage;
Monitored by the daily record of original program, do not need to revise existing program;
Daily record monitoring is carried out by disposing Probe application of increasing income.
Safeguard the daily record configuration of each node, conveniently monitored by daily record;
In the Tomcat container of each node, dispose Probe to increase income program.
Management system, by the remote method of integrated Probe, obtains the daily record of each node, and integrated remote deployment, the function such as to restart.Administrative staff can pass through each node ruuning situation of management system real time inspection, and can long-rangely manage.The system journal contrasting each node is checked in the timing of simultaneity factor backstage, if node log does not upgrade for a long time or occurs error message in daily record, just sends mail reminder operation maintenance personnel.
After having disposed management system, in the supervisory control desk page of system, the ruuning situation of each server can be checked flexibly, Timeliness coverage problem, and the bookkeepings such as the restarting of remote service, time-out can be realized by management platform.
A kind of method to distributed system monitoring management based on daily record of the present invention compared with prior art, the beneficial effect produced is, distributed system of the present invention all can unified monitoring, maintenance, Timeliness coverage problem, does not need each node of Telnet to check operation conditions.Directly service can be restarted by management system.Avoid and carry out the configuring maintenance work repeated in a large number in multiple server, substantially increase the efficiency of work.
Accompanying drawing explanation
The overall status figure that accompanying drawing 1 is applied for remote node;
Accompanying drawing 2 is system architecture diagram.
Embodiment
Below in conjunction with accompanying drawing to a kind of being described in detail below the method for distributed system monitoring management based on daily record of the present invention.
Based on the method to distributed system monitoring management of daily record, comprising: monitor by unified supervision system each node to distributed system, manage; Monitored by the daily record of original program, do not need to revise existing program; Daily record monitoring is carried out by disposing Probe application of increasing income.
Safeguard the daily record configuration of each node, conveniently monitored by daily record; In the Tomcat container of each node, dispose Probe to increase income program.Management system, by the remote method of integrated Probe, obtains the daily record of each node, and integrated remote deployment, the function such as to restart.Administrative staff can pass through each node ruuning situation of management system real time inspection, and can long-rangely manage.The system journal contrasting each node is checked in the timing of simultaneity factor backstage, if node log does not upgrade for a long time or occurs error message in daily record, just sends mail reminder operation maintenance personnel.After having disposed management system, in the supervisory control desk page of system, the ruuning situation of each server can be checked flexibly, Timeliness coverage problem, and the bookkeepings such as the restarting of remote service, time-out can be realized by management platform.
Concrete implementation step is as follows:
Step one, the daily record safeguarding distributed system exports, and ensures the reliable, readable of daily record;
Step 2, applies at each server deploy Probe;
Step 3, the probe administrator role of configuration server container;
Step 4, in a management system, configures each nodal information;
Step 5, management system backstage logs in Probe by http mode and reads the content of Probe, and daily record, server node information, server admin function is shown on foreground;
Step 6, the timing of management system backstage is read each node log by Probe and contrasts, if find error message or the long-time not more news of daily record, just automatically sends out mail reminder administrative staff and notes server health;
Step 7, so far, can see the ruuning situation of each server node at administration page and manage each node.

Claims (3)

1., based on the method to distributed system monitoring management of daily record, it is characterized in that, comprising:
Monitor by unified supervision system each node to distributed system, manage;
Monitored by the daily record of original program, do not need to revise existing program;
Daily record monitoring is carried out by disposing Probe application of increasing income.
2. a kind of method to distributed system monitoring management based on daily record according to claim 1, is characterized in that, described supervisory systems, by the remote method of integrated Probe, obtains the daily record of each node, and integrated remote deployment, the function such as to restart.
3. a kind of method to distributed system monitoring management based on daily record according to claim 1, is characterized in that, disposes Probe and increase income program in the Tomcat container of each node.
CN201510561858.XA 2015-09-07 2015-09-07 Log-based method for monitoring and managing distributed system Pending CN105227354A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510561858.XA CN105227354A (en) 2015-09-07 2015-09-07 Log-based method for monitoring and managing distributed system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510561858.XA CN105227354A (en) 2015-09-07 2015-09-07 Log-based method for monitoring and managing distributed system

Publications (1)

Publication Number Publication Date
CN105227354A true CN105227354A (en) 2016-01-06

Family

ID=54996065

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510561858.XA Pending CN105227354A (en) 2015-09-07 2015-09-07 Log-based method for monitoring and managing distributed system

Country Status (1)

Country Link
CN (1) CN105227354A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111274091A (en) * 2020-01-17 2020-06-12 北京达佳互联信息技术有限公司 Log processing method and device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030005102A1 (en) * 2001-06-28 2003-01-02 Russell Lance W. Migrating recovery modules in a distributed computing environment
CN101067798A (en) * 2007-06-14 2007-11-07 华南理工大学 Dynamic probe method and application in embedded system thereof
CN101997925A (en) * 2010-11-22 2011-03-30 北京亮点时间科技有限公司 Server monitoring method with early warning function and system thereof
CN102480489A (en) * 2010-11-30 2012-05-30 北京千橡网景科技发展有限公司 Logging method and device used in distributed environment
CN104270268A (en) * 2014-09-28 2015-01-07 曙光信息产业股份有限公司 Network performance analysis and fault diagnosis method of distributed system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030005102A1 (en) * 2001-06-28 2003-01-02 Russell Lance W. Migrating recovery modules in a distributed computing environment
CN101067798A (en) * 2007-06-14 2007-11-07 华南理工大学 Dynamic probe method and application in embedded system thereof
CN101997925A (en) * 2010-11-22 2011-03-30 北京亮点时间科技有限公司 Server monitoring method with early warning function and system thereof
CN102480489A (en) * 2010-11-30 2012-05-30 北京千橡网景科技发展有限公司 Logging method and device used in distributed environment
CN104270268A (en) * 2014-09-28 2015-01-07 曙光信息产业股份有限公司 Network performance analysis and fault diagnosis method of distributed system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
IT: "AppDynamics的Apache Tomcat监控应用", 《HTTP://LINUX.IT.NET.CN/M/VIEW.PHP?AID=16560》 *
刘晓宇: "拓扑信息采集的监测系统设计与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111274091A (en) * 2020-01-17 2020-06-12 北京达佳互联信息技术有限公司 Log processing method and device, computer equipment and storage medium
CN111274091B (en) * 2020-01-17 2024-01-09 北京达佳互联信息技术有限公司 Log processing method, device, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN107423198B (en) EAM platform monitoring management method and system
CN104731580A (en) Automation operation and maintenance system based on Karaf and ActiveMQ and implement method thereof
CN104022903A (en) One-stop automatic operation and maintaining system
CN105653425B (en) Monitoring system based on complex event processing engine
CN102480749B (en) Method, device and system for remotely collecting host process information
CN102937930A (en) Application program monitoring system and method
US20060004830A1 (en) Agent-less systems, methods and computer program products for managing a plurality of remotely located data storage systems
CN101753357A (en) Network server centralized monitoring system and method
CN106202444A (en) A kind of implementation method of data base's O&M monitoring
CN102916839A (en) Automatic monitoring system for agricultural work in sugarhouse
US8495426B2 (en) Meta-directory control and evaluation of events
CN104486445A (en) Distributed extendable resource monitoring system and method based on cloud platform
CN105491143A (en) Software running state monitoring system and realization method thereof
CN103188088A (en) Equipment information acquisition system and equipment information acquisition method
CN105490870A (en) Method for monitoring operation state of Linux server in batch
US20160142262A1 (en) Monitoring a computing network
CN110134518A (en) A kind of method and system improving big data cluster multinode high application availability
CN103595572B (en) A kind of method of cloud computing cluster interior joint selfreparing
CN110851320A (en) Server downtime supervision method, system, terminal and storage medium
CN103856354A (en) Method for achieving unified management of logs of cluster storage system
KR20040091392A (en) Method and system for backup management of remote using the web
CN108288997A (en) A kind of transmission network luminous power automated collection systems
CN108199901A (en) Hardware reports method, system, equipment, hardware management server and storage medium for repairment
CN107943665A (en) A kind of system host monitoring method and device
CN105227354A (en) Log-based method for monitoring and managing distributed system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160106