CN102546283A - Automatic monitoring method and device for unmanned system - Google Patents

Automatic monitoring method and device for unmanned system Download PDF

Info

Publication number
CN102546283A
CN102546283A CN2010106164740A CN201010616474A CN102546283A CN 102546283 A CN102546283 A CN 102546283A CN 2010106164740 A CN2010106164740 A CN 2010106164740A CN 201010616474 A CN201010616474 A CN 201010616474A CN 102546283 A CN102546283 A CN 102546283A
Authority
CN
China
Prior art keywords
virtual machine
monitoring
automatic
fault
automatically
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010106164740A
Other languages
Chinese (zh)
Inventor
田斌
李栋
徐毅
Original Assignee
BEIJING SAFE-CODE TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING SAFE-CODE TECHNOLOGY Co Ltd filed Critical BEIJING SAFE-CODE TECHNOLOGY Co Ltd
Priority to CN2010106164740A priority Critical patent/CN102546283A/en
Publication of CN102546283A publication Critical patent/CN102546283A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses an automatic monitoring method for an unmanned system. The method comprises the following steps: using a virtual machine to build the unmanned system, mutually monitoring by nodes of the virtual machine, and finding and correcting faults in time; processing the faults in real time, and maintaining normal operation of the system by redundant nodes; automatically correcting fault points, and carrying out system restore aiming at software faults; automatically processing the fault points, and isolating and automatically alarming aiming at hardware faults; and automatic regression after fault points correction (including software restore and hardware replacement) and additionally arrangement of newly added nodes are carried out through automatic monitoring and automatic processing of the system. The automatic monitoring method solves the problems that: in the prior art, automatic emergency maintenance, automatic processing and truly automatic restore cannot be realized. The invention simultaneously discloses the automatic monitoring method and an automatic monitoring device for the unmanned system.

Description

The method of the automatic monitoring of system's unattended duty and device thereof
Technical field
The present invention relates to remote monitoring, cluster, system reducing technology, the method for the automatic monitoring of particularly a kind of system unattended duty.
Background technology
The automatic monitoring system is of many uses, also is the focus of studying at present.Particularly running up, needing the server end of process mass data simultaneously, the automatic monitoring technology of system seems particularly important.Existing remote control technology can be realized unattended operation, but system's handling failure automatically.The normal operation of maintenance system when a Clustering resolution system breaks down, but handling failure automatically.How realizing the automatic monitoring of unattended operation system, with the fastest speed and the highest timely treatment system fault of efficient, realize real unattended operation, reduce the loss that the system failure causes, is the problem that unattended operation system automatic monitoring Technology Need solves.
Common system's automatic monitoring technology comprises at present: 1, remote control technology; 2, Clustering; 3, manual system reduction technique.Brief account is following:
Prior art one: remote control technology
Principle: remote control technology mainly is made up of on-site supervision module, communication system and Surveillance center; The on-site supervision module is responsible for the collection of the information of accomplishing and the control command that send at the response monitoring center; Communication system is responsible for transmission of monitoring data and order; Surveillance center is responsible for collecting the monitor message that each monitoring module is uploaded, and sends various operational orders to monitoring module.
Shortcoming: only solved remote system unattended operation problem, can not repair automatically when breaking down, needed artificial repairing.
Prior art two: Clustering
Principle: cluster is a kind of parallel processing system (PPS), is made up of the independently computer that much links together, as the computational resource collaborative work of an integral body; Group system is meant that generally the two or more computer nodes that physically disperse link together through LAN, single system of picture for user and application program.
Shortcoming: only solved the problem of keeping system's operation when breaking down, and fault restoration still needs artificial treatment.
Prior art three: system reducing technology
Principle: the system reducing technology is exactly the original configuration of a system of backup in hard disk, when system breaks down, and the original configuration of recovery system.
Shortcoming: when system breaks down, be under people's operation, to reduce, rather than real automatic reduction.
In sum, prior art can not solve unattended system failure reparation, can not in time handle when system breaks down, and the cost of troubleshooting is high.
Summary of the invention
Instance of the present invention provides the method and apparatus of the automatic monitoring of a kind of system unattended duty, can not automatic monitoring in order to solve prior art, the defective of repairing automatically, and the high problem of system failure rehabilitation cost.
The method of the automatic monitoring of a kind of system unattended duty comprises:
Set up cluster virtual machine, through the mutual monitoring between the node, the fault point that discovery in real time can not normal access;
Trial is carried out the software fault reparation with the backup of virtual machine, the mode of system reducing;
Through the service of other virtual machine of timer access, confirm whether this virtual machine moves normally, in case can't visit, just make corresponding processing automatically.
The device of a kind of unattended operation system automatic monitoring comprises:
System has been divided into two-layer, is respectively the operation layer of cluster virtual machine composition and the supporting layer that virtual machine carrier (real equipment) cluster is formed;
The business one-tenth that virtual machine is trooped and formed is used for monitoring mutually between the node, and when certain node occurred repairing fault, this monitoring relation needed reorganization automatically;
The supporting layer that virtual machine carrier (real equipment) cluster is formed, the reduction request of sink virtual machine system.
Instance of the present invention is divided into the operation layer of cluster virtual machine composition and the supporting layer that virtual machine carrier (real equipment) cluster is formed to system; Monitor each other through certain logical relation between the virtual machine operation layer node, when certain node appearance can not be repaired fault, the monitoring relation reorganized automatically, and maintenance system normally moves, and real-time handling failure point; Except that hardware fault, software fault all can have been realized real unattended operation by system's automatic monitoring, reparation reduction automatically, has improved troubleshooting efficient, has reduced the troubleshooting cost.
Description of drawings
The unattended operation system braking monitoring that Fig. 1 provides for the embodiment of the invention, the method sketch map of handling automatically;
The apparatus structure sketch map of the unattended operation system automatic monitoring that Fig. 2 provides for the embodiment of the invention;
Fig. 3 is the environment sketch map of embodiment of the invention unattended operation system automatic monitoring;
Fig. 4 is the method flow diagram that embodiment of the invention unattended operation system virtual machine is handled automatically.

Claims (7)

1. the method for the automatic monitoring of system's unattended duty is characterized in that, this method comprises:
Being divided into system two-layer, is respectively the operation layer formed of cluster virtual machine and the supporting layer of virtual machine carrier (real equipment) cluster composition;
Supporting layer is responsible for moving the virtual machine of operation layer, by operation layer service is provided externally;
Set up cluster virtual machine, through the mutual monitoring between the node, the fault point that discovery in real time can not normal access.
2. the method for claim 1 is characterized in that, the method for monitoring each other between the said node specifically comprises:
Through the service of other virtual machine of timer access, confirm whether this virtual machine moves normally, in case can't visit, just make corresponding processing automatically;
To the virtual machine carrier transmitting system of supporting layer reduction request, let this virtual machine recover and restart; If go back primary failure, transmission can not be repaired signalling trouble, waits equipment to be replaced;
Automatic monitoring, processing automatically need not manual interventions.
3. according to claim 1 or claim 2 method is characterized in that, between the said node each other the method for monitoring specifically comprise:
Confirm a logical relation for monitoring person and custodial person, can logically be organized into a round-robin queue to the virtual machine node, a back node is guarded previous node;
When certain node occurred repairing fault, this monitoring relation can reorganize automatically.
4. the method for the automatic monitoring of system's unattended duty is characterized in that, automatically repair system fault and reduction automatically:
Monitoring person regularly makes the regular traffic visit to the custodial person, is not having to access the access result of expection under the situation of fault;
In case find and to visit that monitoring person sends request to supporting layer immediately, requires supporting layer that the custodial person is carried out system reducing;
Fault belongs to software fault, and the virtual machine carrier can reduce dummy machine system fully.
5. method as claimed in claim 4 is characterized in that, the automatic monitoring stage:
Find that visit is overtime or the result is undesired, need retry 3 times, all fail the processing stage of just getting into automatic fault for 3 times, avoid misoperation character system reducing, restart.
6. method as claimed in claim 4 is characterized in that, the processing stage of automatically:
Monitoring person sends after the system reducing request, and the virtual machine carrier needs the regular hour to remove to recover dummy machine system, restarts virtual machine;
At virtual machine carrier recovery dummy machine system, restart virtual machine during, can not redispatch to same custodial person's system reducing request, constantly restart avoiding;
Whether monitoring person sends after the request, just get into a detection custodial person and recover to repair until detecting the custodial person in the middle of the normal circulation, just turns back to the automatic monitoring stage.
7. the device of unattended operation system automatic monitoring is characterized in that, this device comprises:
Virtual machine is trooped, through the mutual monitoring between the node, and the fault point that discovery in real time can not normal access;
Trial is carried out the software fault reparation with the backup of virtual machine, the mode of system reducing, and is unsuccessful then for can not repair fault, normally hardware fault; And send alarm automatically; Wait for more exchange device of manual work, in the whole process, the unattended operation system keeps normal operation always.
CN2010106164740A 2010-12-31 2010-12-31 Automatic monitoring method and device for unmanned system Pending CN102546283A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010106164740A CN102546283A (en) 2010-12-31 2010-12-31 Automatic monitoring method and device for unmanned system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010106164740A CN102546283A (en) 2010-12-31 2010-12-31 Automatic monitoring method and device for unmanned system

Publications (1)

Publication Number Publication Date
CN102546283A true CN102546283A (en) 2012-07-04

Family

ID=46352287

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010106164740A Pending CN102546283A (en) 2010-12-31 2010-12-31 Automatic monitoring method and device for unmanned system

Country Status (1)

Country Link
CN (1) CN102546283A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104168126A (en) * 2013-05-20 2014-11-26 江苏华视文化传媒有限公司 Self-maintenance and management system and method for unattended intelligent device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6718513B1 (en) * 2000-01-31 2004-04-06 Fujitsu Limited Fault monitoring system and fault reporting method
CN101267334A (en) * 2007-03-13 2008-09-17 联想(北京)有限公司 A method and device for dynamic device allocation
CN102111303A (en) * 2009-12-28 2011-06-29 北京安码科技有限公司 Automatic monitoring method and device for unattended system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6718513B1 (en) * 2000-01-31 2004-04-06 Fujitsu Limited Fault monitoring system and fault reporting method
CN101267334A (en) * 2007-03-13 2008-09-17 联想(北京)有限公司 A method and device for dynamic device allocation
CN102111303A (en) * 2009-12-28 2011-06-29 北京安码科技有限公司 Automatic monitoring method and device for unattended system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104168126A (en) * 2013-05-20 2014-11-26 江苏华视文化传媒有限公司 Self-maintenance and management system and method for unattended intelligent device
CN104168126B (en) * 2013-05-20 2017-11-21 江苏华视文化传媒有限公司 A kind of unattended intelligent device self-maintenance management system and method

Similar Documents

Publication Publication Date Title
JP5102901B2 (en) Method and system for maintaining data integrity between multiple data servers across a data center
CN102111303A (en) Automatic monitoring method and device for unattended system
CN104717077B (en) A kind of method, apparatus and system for managing data center
CN104038373B (en) information early warning and self-repairing system and method
CN103019889A (en) Distributed file system and failure processing method thereof
CN202854567U (en) Monitoring system
CN105589756B (en) Batch processing group system and method
CN105430327A (en) NVR cluster backup method and device
CN109274531A (en) Data acquisition equipment method for restarting, system and computer readable storage medium
CN105574590A (en) Adaptive general control disaster recovery switching device and system, and signal generation method
CN111176783A (en) High-availability method and device for container treatment platform and electronic equipment
CN105790825A (en) Method and apparatus for carrying out hot backup on controllers in distributed protection
CN107291821A (en) A kind of method that same city dual-active framework is switched fast
CN106294795A (en) A kind of data base's changing method and system
CN106911522A (en) A kind of fault handling method and system based on cloud environment
CN102487332B (en) Fault processing method, apparatus thereof and system thereof
CN105743695A (en) Monitoring method and system based on IEC 104 protocol
CN102546283A (en) Automatic monitoring method and device for unmanned system
CN111614702B (en) Edge calculation method and edge calculation system
CN103731292A (en) Fault-tolerant method enabling server to realize fault seamless recovery
CN113946474A (en) Efficient disaster tolerance protection method and disaster tolerance processing system for storage system
CN107276828A (en) A kind of dispatching method and device of cluster interior joint
KR20120111143A (en) Method for managing wind power control system for high availability in wind farm
CN201491023U (en) Redundancy server structure based on RAC
CN101686251A (en) Centralized standby system of remote isomorphism real-time system and standby method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: XU QIN

Free format text: FORMER OWNER: BEIJING SAFE-CODE TECHNOLOGY CO., LTD.

Effective date: 20130529

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20130529

Address after: 100082, building 1, building 32, 612 North Main Street, Haidian District, Beijing, Xizhimen

Applicant after: Xu Qin

Address before: 100082, building 1, building 32, 612 North Main Street, Haidian District, Beijing, Xizhimen

Applicant before: Beijing Safe-Code Technology Co., Ltd.

C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120704