CN102736970A - Method for monitoring activity state of operating system - Google Patents
Method for monitoring activity state of operating system Download PDFInfo
- Publication number
- CN102736970A CN102736970A CN201210220128XA CN201210220128A CN102736970A CN 102736970 A CN102736970 A CN 102736970A CN 201210220128X A CN201210220128X A CN 201210220128XA CN 201210220128 A CN201210220128 A CN 201210220128A CN 102736970 A CN102736970 A CN 102736970A
- Authority
- CN
- China
- Prior art keywords
- network
- record
- script
- ping
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Debugging And Monitoring (AREA)
- Data Exchanges In Wide-Area Networks (AREA)
- Computer And Data Communications (AREA)
Abstract
The invention provides a method for monitoring an activity state of an operating system. After the operating system runs, monitoring scrip is automatically executed under the operating system, so that a system network and a self-activity state are detected in real time, and detection information is regularly recorded in a monitoring text log; and a time point when a server fails and a probable failure expression can be accurately judged. The method has the advantages that 1) the running state of the server is accurately recorded, so that the influence of a subjective feedback mistake on failure judgment is avoided; and 2) a user can judge whether the failure expression is caused by halt because of system halt, blue screen, power failure and the like or caused by network interruption because of abnormal internal and external factors according to a record, so that corresponding solutions can be taken.
Description
Technical field
The present invention relates to specifically a kind of method for supervising of operating system active state.
Background technology
At the IDC machine room, directly through telecommunications or be communicated with core network and need professional is provided and serves, there are windows system and linux system two macrotaxonomies in the employed system of server at present a lot of users with own trust server; Possibly carry different application; When mechanical disorder took place, the information that user oneself obtains was that machine can not be communicated with, but machine still is that the suspension fault can't be known at the definite machine of delaying that when taken place; Can only notify the computer room administrator directly machine to be forced to restart; Back logon server system, confirm and judge that there is following problem in this judgment mode through system log message:
1) daily record of linux system can not be accomplished the real-time activity condition monitoring; The time of breaking down according to probably judgement system of log record generally speaking; When business is stablized even all do not have new log record in several days; Only when system restart, produce new " syslogd 1.4.1:restart. " record and the time point of corresponding system start-up; Information before this RP then is the daily record that random device ruuning situation generates, so under the situation that the keeper can not monitor server ruuning situation in real time, the concrete time that clearly judgement system breaks down; The system journal of windows system can be write down true deadlock time point;
2) for the machine of long-range trustship; No matter the windows system still be the linux system server when crash, when accident power-off or network failure cause network to interrupt; The getable failure message of long-distance user is exactly that ping is obstructed; Can't judge that this moment, fault was network failure or mechanical disorder; Since different phenomena of the failure possibly imply different faults maybe, if can't confirm phenomenon just can't be clear and definite next step solution, so often the localization of fault of server and quick-recovery are soon brought very big difficulty.
Summary of the invention
The method for supervising that the purpose of this invention is to provide a kind of operating system active state.
The objective of the invention is to realize by following mode; System operation back automatically performs monitoring script under operating system; Real-time detecting system network and own activity state; And will detect the information time recording in the daily record of monitoring text, and accurately judge time point and the general fault performance that takes place when server breaks down, concrete steps are following:
1) will automatically perform script according to the operating system classification and insert start operation automatically, windows pulls script jiankong.cmd into startup-operation place, and the linux system will carry out script jiankong.sh and write among the rc.local, guarantees machine start operation automatically;
2) Windows script: start back system script automatically performs; Elder generation register system start-up time, and generation system log-on message record " system is on ", real time record time afterwards; And order held stationary machine ip in the direct ping network through ping; Record ping command execution results, circulation is carried out, the active situation of supervisory system and network UNICOM situation;
3) Linux script: automatically perform after the start, first register system log-on message " system is on ", then writing time point; Through fixing mode of ip machine in the ping network; Judge whether network state is unobstructed, as can leading to by ping record network is up information; Otherwise record network is down information, circulation is carried out;
4) determination methods: mechanical disorder the back occurs and carries out the failure condition judgement according to the information of collecting, and each machine startup all writes down " system is on " record, checks that the time that mechanical disorder takes place can be confirmed in the final time point record before it; If ping command execution before is normal, explain that failure condition is deadlock or accident power-off, restarts, if ping command execution aborted, but the date also have record, explain that machine does not crash, just network has interrupted.
The invention has the beneficial effects as follows:
1) accurate recording operation condition of server is avoided the influence of subjective feedback mistake to fault judgement;
2) can judge the fault performance according to record is because the machine of the delaying fault of generations such as system in case of system halt, blue screen, outage or the network that unusual inside and outside factor causes interrupt, and is convenient to take corresponding solution;
3) implementation is simple, and monitoring script directly is set to automated system operation, advantages such as easy operating realization.
Description of drawings
Fig. 1 is the monitoring step synoptic diagram.
Embodiment
Explanation at length below with reference to Figure of description method of the present invention being done.
Concrete implementation procedure is following:
1) will automatically perform script according to the operating system classification and insert start operation automatically, windows pulls script jiankong.cmd into startup-operation place, and the linux system will carry out script jiankong.sh and write among the rc.local, guarantees machine start operation automatically;
2) Windows script: start back system script automatically performs; Elder generation register system start-up time, and generation system log-on message record " system is on ", real time record time afterwards; And order held stationary machine ip in the direct ping network through ping; Record ping command execution results, circulation is carried out, the active situation of supervisory system and network UNICOM situation;
3) Linux script: automatically perform after the start, first register system log-on message " system is on ", then writing time point; Through fixing mode of ip machine in the ping network; Judge whether network state is unobstructed, as can leading to by ping record network is up information; Otherwise record network is down information, circulation is carried out;
4) determination methods: mechanical disorder the back occurs and carries out the failure condition judgement according to the information of collecting, and each machine startup all writes down " system is on " record, checks that the time that mechanical disorder takes place can be confirmed in the final time point record before it; If ping command execution before is normal, explain that failure condition is deadlock or accident power-off, restarts etc., if ping command execution aborted, but the date also have record, explain that machine does not crash, but network has interrupted;
5) realize script: (annotate: xxx.xxx.xxx.xxx is stable ip address in the network, is used for doing the network monitoring RP) windows: ping.cmd echo on
echo?system?is?on?>>c:\test.txt
:ag
echo?%date%?>>c:\test.txt
echo?%time%?>>c:\test.txt
c:\windows\system32\ping?xxx.xxx.xxx.xxx?-n?4?>>c:\test.txt
goto?ag
Linux script: jiankong.sh (annotate: xxx.xxx.xxx.xxx is stable ip address in the network, is used for doing the network monitoring RP)
#!/bin/bash
echo?"system?is?on"?>>?/test.txt
while?:;do
echo?`date`?>>/test.txt
ping?-c?2?xxx.xxx.xxx.xxx?&>/dev/null
if?[?$ =?0?];then
echo?"network?is?up"?>>/test.txt
else
echo?"network?is?down"?>>/test.txt
fi
sleep?2
done
6) instance sample execution result
The windows system test.txt displaying contents after the script executing):
system?is?on
2011/09/14 Wednesday
17:51:02.69?
The data that just have 32 bytes at Ping 10.7.255.254:
Answer from 10.7.255.254: byte=32 times=3ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
10.7.255.254 the Ping statistical information:
Packet: send=4, receive=4, lose=0 (0% loses),
The estimated time of round trip (is unit with the millisecond):
Weak point=0ms, length=3ms is average=0ms
2011/09/14 Wednesday
17:51:05.93
The data that just have 32 bytes at Ping 10.7.255.254:
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
10.7.255.254 the Ping statistical information:
Packet: send=4, receive=4, lose=0 (0% loses),
The estimated time of round trip (is unit with the millisecond):
Weak point=0ms, length=0ms is average=0ms
……
Linux script executing result:
[rootlocalhost?~]#?cat?/test.txt
system?is?on
Wed?Sep?7?12:31:54?CST?2011
network?is?up
Wed?Sep?7?12:31:57?CST?2011
network?is?up
Wed?Sep?7?12:32:00?CST?2011
network?is?up
Wed?Sep?7?12:32:03?CST?2011
network?is?up
Wed?Sep?7?12:32:06?CST?2011
network?is?up
Wed?Sep?7?12:32:09?CST?2011
network?is?down
6) the information log file is judged flow process.
Except that the described technical characterictic of instructions, be the known technology of those skilled in the art.
Claims (1)
1. the method for supervising of an operating system active state; It is characterized in that system operation back automatically performs monitoring script under operating system; Real-time detecting system network and own activity state; And will detect the information time recording in the daily record of monitoring text, and accurately judge time point and the general fault performance that takes place when server breaks down, concrete steps are following:
1) will automatically perform script according to the operating system classification and insert start operation automatically, windows pulls script jiankong.cmd into startup-operation place, and the linux system will carry out script jiankong.sh and write among the rc.local, guarantees machine start operation automatically;
2) Windows script: start back system script automatically performs; Elder generation register system start-up time, and generation system log-on message record " system is on ", real time record time afterwards; And order held stationary machine ip in the direct ping network through ping; Record ping command execution results, circulation is carried out, the active situation of supervisory system and network UNICOM situation;
3) Linux script: automatically perform after the start, first register system log-on message " system is on ", then writing time point; Through fixing mode of ip machine in the ping network; Judge whether network state is unobstructed, as can leading to by ping record network is up information; Otherwise record network is down information, circulation is carried out;
4) determination methods: mechanical disorder the back occurs and carries out the failure condition judgement according to the information of collecting, and each machine startup all writes down " system is on " record, checks that the time that mechanical disorder takes place can be confirmed in the final time point record before it; If ping command execution before is normal, explain that failure condition is deadlock or accident power-off, restarts, if ping command execution aborted, but the date also have record, explain that machine does not crash, just network has interrupted.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210220128XA CN102736970A (en) | 2012-06-29 | 2012-06-29 | Method for monitoring activity state of operating system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210220128XA CN102736970A (en) | 2012-06-29 | 2012-06-29 | Method for monitoring activity state of operating system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN102736970A true CN102736970A (en) | 2012-10-17 |
Family
ID=46992513
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210220128XA Pending CN102736970A (en) | 2012-06-29 | 2012-06-29 | Method for monitoring activity state of operating system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102736970A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104268068A (en) * | 2014-09-24 | 2015-01-07 | 浪潮电子信息产业股份有限公司 | Server log information acquisition method |
CN104407930A (en) * | 2014-10-22 | 2015-03-11 | 陆俊 | Method and system for assisting in remote mobile terminal |
CN105227347A (en) * | 2015-08-25 | 2016-01-06 | 中国联合网络通信集团有限公司 | A kind of general O&M method for supervising and O&M supervisory control system |
CN106095642A (en) * | 2016-06-16 | 2016-11-09 | 浪潮电子信息产业股份有限公司 | A kind of fan failure solution based on RMC management |
CN106133699A (en) * | 2014-06-20 | 2016-11-16 | Nec 飞鼎克株式会社 | Malfunction informing device, failure notification method and program |
CN107396183A (en) * | 2017-07-20 | 2017-11-24 | 烽火通信科技股份有限公司 | The start network detecting method and system of convergent terminal equipment |
CN107948989A (en) * | 2016-10-13 | 2018-04-20 | 北京国双科技有限公司 | A kind of computational methods and device of mobile terminal networking duration |
CN108647129A (en) * | 2018-05-22 | 2018-10-12 | 山东中创软件商用中间件股份有限公司 | A kind of monitoring method of server, device, equipment and medium |
CN110311807A (en) * | 2019-06-06 | 2019-10-08 | 东南大学 | A kind of driven hanging basket data collection system Network status adaptive process monitoring method |
CN110399145A (en) * | 2018-04-24 | 2019-11-01 | 宏碁股份有限公司 | Computer system, its update method and computer program product |
CN112860338A (en) * | 2021-01-29 | 2021-05-28 | 中国农业银行股份有限公司 | Software starting method and device |
CN112860338B (en) * | 2021-01-29 | 2024-06-11 | 中国农业银行股份有限公司 | Software starting method and device |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1581094A (en) * | 2003-08-05 | 2005-02-16 | 联想(北京)有限公司 | Method for long-range determining system state |
CN1584850A (en) * | 2003-08-20 | 2005-02-23 | 联想(北京)有限公司 | Method for passive long-range deterministic system state |
CN101662387A (en) * | 2009-10-14 | 2010-03-03 | 中国电信股份有限公司 | System for detecting computer accessing situation in network and method thereof |
CN101799751A (en) * | 2009-12-02 | 2010-08-11 | 山东浪潮齐鲁软件产业股份有限公司 | Method for building monitoring agent software of host machine |
CN102148880A (en) * | 2011-03-18 | 2011-08-10 | 浪潮电子信息产业股份有限公司 | Dual-network interface card (NIC) bonding method in LINUX system of blade server |
CN102307119A (en) * | 2011-08-18 | 2012-01-04 | 工业和信息化部电信传输研究所 | Method for discovering probe failure in Internet performance measurement system |
JP2012038257A (en) * | 2010-08-11 | 2012-02-23 | Nec Corp | Os operating state confirmation system, confirmation object device, os operating state confirmation device, and os operating state confirmation method and program |
CN102663298A (en) * | 2012-04-06 | 2012-09-12 | 北京空间飞行器总体设计部 | Safety online detecting system facing to terminal computers |
-
2012
- 2012-06-29 CN CN201210220128XA patent/CN102736970A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1581094A (en) * | 2003-08-05 | 2005-02-16 | 联想(北京)有限公司 | Method for long-range determining system state |
CN1584850A (en) * | 2003-08-20 | 2005-02-23 | 联想(北京)有限公司 | Method for passive long-range deterministic system state |
CN101662387A (en) * | 2009-10-14 | 2010-03-03 | 中国电信股份有限公司 | System for detecting computer accessing situation in network and method thereof |
CN101799751A (en) * | 2009-12-02 | 2010-08-11 | 山东浪潮齐鲁软件产业股份有限公司 | Method for building monitoring agent software of host machine |
JP2012038257A (en) * | 2010-08-11 | 2012-02-23 | Nec Corp | Os operating state confirmation system, confirmation object device, os operating state confirmation device, and os operating state confirmation method and program |
CN102148880A (en) * | 2011-03-18 | 2011-08-10 | 浪潮电子信息产业股份有限公司 | Dual-network interface card (NIC) bonding method in LINUX system of blade server |
CN102307119A (en) * | 2011-08-18 | 2012-01-04 | 工业和信息化部电信传输研究所 | Method for discovering probe failure in Internet performance measurement system |
CN102663298A (en) * | 2012-04-06 | 2012-09-12 | 北京空间飞行器总体设计部 | Safety online detecting system facing to terminal computers |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106133699A (en) * | 2014-06-20 | 2016-11-16 | Nec 飞鼎克株式会社 | Malfunction informing device, failure notification method and program |
CN104268068A (en) * | 2014-09-24 | 2015-01-07 | 浪潮电子信息产业股份有限公司 | Server log information acquisition method |
CN104407930A (en) * | 2014-10-22 | 2015-03-11 | 陆俊 | Method and system for assisting in remote mobile terminal |
CN105227347A (en) * | 2015-08-25 | 2016-01-06 | 中国联合网络通信集团有限公司 | A kind of general O&M method for supervising and O&M supervisory control system |
CN106095642A (en) * | 2016-06-16 | 2016-11-09 | 浪潮电子信息产业股份有限公司 | A kind of fan failure solution based on RMC management |
CN107948989A (en) * | 2016-10-13 | 2018-04-20 | 北京国双科技有限公司 | A kind of computational methods and device of mobile terminal networking duration |
CN107396183A (en) * | 2017-07-20 | 2017-11-24 | 烽火通信科技股份有限公司 | The start network detecting method and system of convergent terminal equipment |
CN107396183B (en) * | 2017-07-20 | 2020-04-28 | 烽火通信科技股份有限公司 | Starting-up network detection method and system for converged terminal equipment |
CN110399145A (en) * | 2018-04-24 | 2019-11-01 | 宏碁股份有限公司 | Computer system, its update method and computer program product |
CN108647129A (en) * | 2018-05-22 | 2018-10-12 | 山东中创软件商用中间件股份有限公司 | A kind of monitoring method of server, device, equipment and medium |
CN110311807A (en) * | 2019-06-06 | 2019-10-08 | 东南大学 | A kind of driven hanging basket data collection system Network status adaptive process monitoring method |
CN110311807B (en) * | 2019-06-06 | 2022-04-08 | 东南大学 | Network condition self-adaptive monitoring method for electric hanging basket data acquisition system |
CN112860338A (en) * | 2021-01-29 | 2021-05-28 | 中国农业银行股份有限公司 | Software starting method and device |
CN112860338B (en) * | 2021-01-29 | 2024-06-11 | 中国农业银行股份有限公司 | Software starting method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102736970A (en) | Method for monitoring activity state of operating system | |
CN105306272B (en) | Information system fault scenes formation gathering method and system | |
CN102325036B (en) | The method for diagnosing faults of a kind of network system, system and device | |
CA2931150C (en) | Automatic inspection and monitoring method based on time domain slotting control | |
CN107547273B (en) | Method and system for guaranteeing high availability of virtual instance of power system | |
CN105490870A (en) | Method for monitoring operation state of Linux server in batch | |
CN103378982A (en) | Internet business operation monitoring method and Internet business operation monitoring system | |
CN101997925A (en) | Server monitoring method with early warning function and system thereof | |
CN103778049A (en) | Embedded system development log system | |
CN112019370B (en) | Equipment fault processing method and system | |
CN106789306A (en) | Restoration methods and system are collected in communication equipment software fault detect | |
CN109462490B (en) | Video monitoring system and fault analysis method | |
CN109714202A (en) | A kind of client off-line reason method of discrimination and concentrating type safety management system | |
CN103198007A (en) | Multi-process log output method and system | |
CN104734904B (en) | The automatic test approach and system of bypass equipment | |
CN109586989B (en) | State checking method, device and cluster system | |
CN108762886A (en) | The fault detect restoration methods and system of virtual machine | |
JP5425720B2 (en) | Virtualization environment monitoring apparatus and monitoring method and program thereof | |
CN112994972A (en) | Distributed probe monitoring platform | |
CN114356499A (en) | Kubernetes cluster alarm root cause analysis method and device | |
CN105630647A (en) | Equipment detection method and detection equipment | |
CN109840178A (en) | A kind of method, mobile terminal and the device of monitoring journal information | |
CN101645983A (en) | Network management system and method using same for testing network equipment | |
CN110445694A (en) | A method of trigger notice is monitored based on Zabbix | |
CN108011783A (en) | The backup method and standby system of a kind of diagnosis server |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20121017 |