CN102736970A - Method for monitoring activity state of operating system - Google Patents

Method for monitoring activity state of operating system Download PDF

Info

Publication number
CN102736970A
CN102736970A CN201210220128XA CN201210220128A CN102736970A CN 102736970 A CN102736970 A CN 102736970A CN 201210220128X A CN201210220128X A CN 201210220128XA CN 201210220128 A CN201210220128 A CN 201210220128A CN 102736970 A CN102736970 A CN 102736970A
Authority
CN
China
Prior art keywords
network
record
script
ping
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210220128XA
Other languages
Chinese (zh)
Inventor
任华进
顾春波
刘海滨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd filed Critical Inspur Electronic Information Industry Co Ltd
Priority to CN201210220128XA priority Critical patent/CN102736970A/en
Publication of CN102736970A publication Critical patent/CN102736970A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Computer And Data Communications (AREA)

Abstract

The invention provides a method for monitoring an activity state of an operating system. After the operating system runs, monitoring scrip is automatically executed under the operating system, so that a system network and a self-activity state are detected in real time, and detection information is regularly recorded in a monitoring text log; and a time point when a server fails and a probable failure expression can be accurately judged. The method has the advantages that 1) the running state of the server is accurately recorded, so that the influence of a subjective feedback mistake on failure judgment is avoided; and 2) a user can judge whether the failure expression is caused by halt because of system halt, blue screen, power failure and the like or caused by network interruption because of abnormal internal and external factors according to a record, so that corresponding solutions can be taken.

Description

A kind of method for supervising of operating system active state
Technical field
The present invention relates to specifically a kind of method for supervising of operating system active state.
Background technology
At the IDC machine room, directly through telecommunications or be communicated with core network and need professional is provided and serves, there are windows system and linux system two macrotaxonomies in the employed system of server at present a lot of users with own trust server; Possibly carry different application; When mechanical disorder took place, the information that user oneself obtains was that machine can not be communicated with, but machine still is that the suspension fault can't be known at the definite machine of delaying that when taken place; Can only notify the computer room administrator directly machine to be forced to restart; Back logon server system, confirm and judge that there is following problem in this judgment mode through system log message:
1) daily record of linux system can not be accomplished the real-time activity condition monitoring; The time of breaking down according to probably judgement system of log record generally speaking; When business is stablized even all do not have new log record in several days; Only when system restart, produce new " syslogd 1.4.1:restart. " record and the time point of corresponding system start-up; Information before this RP then is the daily record that random device ruuning situation generates, so under the situation that the keeper can not monitor server ruuning situation in real time, the concrete time that clearly judgement system breaks down; The system journal of windows system can be write down true deadlock time point;
2) for the machine of long-range trustship; No matter the windows system still be the linux system server when crash, when accident power-off or network failure cause network to interrupt; The getable failure message of long-distance user is exactly that ping is obstructed; Can't judge that this moment, fault was network failure or mechanical disorder; Since different phenomena of the failure possibly imply different faults maybe, if can't confirm phenomenon just can't be clear and definite next step solution, so often the localization of fault of server and quick-recovery are soon brought very big difficulty.
Summary of the invention
The method for supervising that the purpose of this invention is to provide a kind of operating system active state.
The objective of the invention is to realize by following mode; System operation back automatically performs monitoring script under operating system; Real-time detecting system network and own activity state; And will detect the information time recording in the daily record of monitoring text, and accurately judge time point and the general fault performance that takes place when server breaks down, concrete steps are following:
1) will automatically perform script according to the operating system classification and insert start operation automatically, windows pulls script jiankong.cmd into startup-operation place, and the linux system will carry out script jiankong.sh and write among the rc.local, guarantees machine start operation automatically;
2) Windows script: start back system script automatically performs; Elder generation register system start-up time, and generation system log-on message record " system is on ", real time record time afterwards; And order held stationary machine ip in the direct ping network through ping; Record ping command execution results, circulation is carried out, the active situation of supervisory system and network UNICOM situation;
3) Linux script: automatically perform after the start, first register system log-on message " system is on ", then writing time point; Through fixing mode of ip machine in the ping network; Judge whether network state is unobstructed, as can leading to by ping record network is up information; Otherwise record network is down information, circulation is carried out;
4) determination methods: mechanical disorder the back occurs and carries out the failure condition judgement according to the information of collecting, and each machine startup all writes down " system is on " record, checks that the time that mechanical disorder takes place can be confirmed in the final time point record before it; If ping command execution before is normal, explain that failure condition is deadlock or accident power-off, restarts, if ping command execution aborted, but the date also have record, explain that machine does not crash, just network has interrupted.
The invention has the beneficial effects as follows:
1) accurate recording operation condition of server is avoided the influence of subjective feedback mistake to fault judgement;
2) can judge the fault performance according to record is because the machine of the delaying fault of generations such as system in case of system halt, blue screen, outage or the network that unusual inside and outside factor causes interrupt, and is convenient to take corresponding solution;
3) implementation is simple, and monitoring script directly is set to automated system operation, advantages such as easy operating realization.
Description of drawings
Fig. 1 is the monitoring step synoptic diagram.
Embodiment
Explanation at length below with reference to Figure of description method of the present invention being done.
Concrete implementation procedure is following:
1) will automatically perform script according to the operating system classification and insert start operation automatically, windows pulls script jiankong.cmd into startup-operation place, and the linux system will carry out script jiankong.sh and write among the rc.local, guarantees machine start operation automatically;
2) Windows script: start back system script automatically performs; Elder generation register system start-up time, and generation system log-on message record " system is on ", real time record time afterwards; And order held stationary machine ip in the direct ping network through ping; Record ping command execution results, circulation is carried out, the active situation of supervisory system and network UNICOM situation;
3) Linux script: automatically perform after the start, first register system log-on message " system is on ", then writing time point; Through fixing mode of ip machine in the ping network; Judge whether network state is unobstructed, as can leading to by ping record network is up information; Otherwise record network is down information, circulation is carried out;
4) determination methods: mechanical disorder the back occurs and carries out the failure condition judgement according to the information of collecting, and each machine startup all writes down " system is on " record, checks that the time that mechanical disorder takes place can be confirmed in the final time point record before it; If ping command execution before is normal, explain that failure condition is deadlock or accident power-off, restarts etc., if ping command execution aborted, but the date also have record, explain that machine does not crash, but network has interrupted;
5) realize script: (annotate: xxx.xxx.xxx.xxx is stable ip address in the network, is used for doing the network monitoring RP) windows: ping.cmd echo on
echo?system?is?on?>>c:\test.txt
:ag
echo?%date%?>>c:\test.txt
echo?%time%?>>c:\test.txt
c:\windows\system32\ping?xxx.xxx.xxx.xxx?-n?4?>>c:\test.txt
goto?ag
Linux script: jiankong.sh (annotate: xxx.xxx.xxx.xxx is stable ip address in the network, is used for doing the network monitoring RP)
#!/bin/bash
echo?"system?is?on"?>>?/test.txt
while?:;do
echo?`date`?>>/test.txt
ping?-c?2?xxx.xxx.xxx.xxx?&>/dev/null
if?[?$ =?0?];then
echo?"network?is?up"?>>/test.txt
else
echo?"network?is?down"?>>/test.txt
fi
sleep?2
done
6) instance sample execution result
The windows system test.txt displaying contents after the script executing):
system?is?on
2011/09/14 Wednesday
17:51:02.69?
The data that just have 32 bytes at Ping 10.7.255.254:
Answer from 10.7.255.254: byte=32 times=3ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
10.7.255.254 the Ping statistical information:
Packet: send=4, receive=4, lose=0 (0% loses),
The estimated time of round trip (is unit with the millisecond):
Weak point=0ms, length=3ms is average=0ms
2011/09/14 Wednesday
17:51:05.93
The data that just have 32 bytes at Ping 10.7.255.254:
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
Answer from 10.7.255.254: byte=32 times < 1ms TTL=255
10.7.255.254 the Ping statistical information:
Packet: send=4, receive=4, lose=0 (0% loses),
The estimated time of round trip (is unit with the millisecond):
Weak point=0ms, length=0ms is average=0ms
……
Linux script executing result:
[rootlocalhost?~]#?cat?/test.txt
system?is?on
Wed?Sep?7?12:31:54?CST?2011
network?is?up
Wed?Sep?7?12:31:57?CST?2011
network?is?up
Wed?Sep?7?12:32:00?CST?2011
network?is?up
Wed?Sep?7?12:32:03?CST?2011
network?is?up
Wed?Sep?7?12:32:06?CST?2011
network?is?up
Wed?Sep?7?12:32:09?CST?2011
network?is?down
6) the information log file is judged flow process.
Except that the described technical characterictic of instructions, be the known technology of those skilled in the art.

Claims (1)

1. the method for supervising of an operating system active state; It is characterized in that system operation back automatically performs monitoring script under operating system; Real-time detecting system network and own activity state; And will detect the information time recording in the daily record of monitoring text, and accurately judge time point and the general fault performance that takes place when server breaks down, concrete steps are following:
1) will automatically perform script according to the operating system classification and insert start operation automatically, windows pulls script jiankong.cmd into startup-operation place, and the linux system will carry out script jiankong.sh and write among the rc.local, guarantees machine start operation automatically;
2) Windows script: start back system script automatically performs; Elder generation register system start-up time, and generation system log-on message record " system is on ", real time record time afterwards; And order held stationary machine ip in the direct ping network through ping; Record ping command execution results, circulation is carried out, the active situation of supervisory system and network UNICOM situation;
3) Linux script: automatically perform after the start, first register system log-on message " system is on ", then writing time point; Through fixing mode of ip machine in the ping network; Judge whether network state is unobstructed, as can leading to by ping record network is up information; Otherwise record network is down information, circulation is carried out;
4) determination methods: mechanical disorder the back occurs and carries out the failure condition judgement according to the information of collecting, and each machine startup all writes down " system is on " record, checks that the time that mechanical disorder takes place can be confirmed in the final time point record before it; If ping command execution before is normal, explain that failure condition is deadlock or accident power-off, restarts, if ping command execution aborted, but the date also have record, explain that machine does not crash, just network has interrupted.
CN201210220128XA 2012-06-29 2012-06-29 Method for monitoring activity state of operating system Pending CN102736970A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210220128XA CN102736970A (en) 2012-06-29 2012-06-29 Method for monitoring activity state of operating system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210220128XA CN102736970A (en) 2012-06-29 2012-06-29 Method for monitoring activity state of operating system

Publications (1)

Publication Number Publication Date
CN102736970A true CN102736970A (en) 2012-10-17

Family

ID=46992513

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210220128XA Pending CN102736970A (en) 2012-06-29 2012-06-29 Method for monitoring activity state of operating system

Country Status (1)

Country Link
CN (1) CN102736970A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104268068A (en) * 2014-09-24 2015-01-07 浪潮电子信息产业股份有限公司 Server log information acquisition method
CN104407930A (en) * 2014-10-22 2015-03-11 陆俊 Method and system for assisting in remote mobile terminal
CN105227347A (en) * 2015-08-25 2016-01-06 中国联合网络通信集团有限公司 A kind of general O&M method for supervising and O&M supervisory control system
CN106095642A (en) * 2016-06-16 2016-11-09 浪潮电子信息产业股份有限公司 A kind of fan failure solution based on RMC management
CN106133699A (en) * 2014-06-20 2016-11-16 Nec 飞鼎克株式会社 Malfunction informing device, failure notification method and program
CN107396183A (en) * 2017-07-20 2017-11-24 烽火通信科技股份有限公司 The start network detecting method and system of convergent terminal equipment
CN107948989A (en) * 2016-10-13 2018-04-20 北京国双科技有限公司 A kind of computational methods and device of mobile terminal networking duration
CN108647129A (en) * 2018-05-22 2018-10-12 山东中创软件商用中间件股份有限公司 A kind of monitoring method of server, device, equipment and medium
CN110311807A (en) * 2019-06-06 2019-10-08 东南大学 A kind of driven hanging basket data collection system Network status adaptive process monitoring method
CN110399145A (en) * 2018-04-24 2019-11-01 宏碁股份有限公司 Computer system, its update method and computer program product
CN112860338A (en) * 2021-01-29 2021-05-28 中国农业银行股份有限公司 Software starting method and device
CN112860338B (en) * 2021-01-29 2024-06-11 中国农业银行股份有限公司 Software starting method and device

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1581094A (en) * 2003-08-05 2005-02-16 联想(北京)有限公司 Method for long-range determining system state
CN1584850A (en) * 2003-08-20 2005-02-23 联想(北京)有限公司 Method for passive long-range deterministic system state
CN101662387A (en) * 2009-10-14 2010-03-03 中国电信股份有限公司 System for detecting computer accessing situation in network and method thereof
CN101799751A (en) * 2009-12-02 2010-08-11 山东浪潮齐鲁软件产业股份有限公司 Method for building monitoring agent software of host machine
CN102148880A (en) * 2011-03-18 2011-08-10 浪潮电子信息产业股份有限公司 Dual-network interface card (NIC) bonding method in LINUX system of blade server
CN102307119A (en) * 2011-08-18 2012-01-04 工业和信息化部电信传输研究所 Method for discovering probe failure in Internet performance measurement system
JP2012038257A (en) * 2010-08-11 2012-02-23 Nec Corp Os operating state confirmation system, confirmation object device, os operating state confirmation device, and os operating state confirmation method and program
CN102663298A (en) * 2012-04-06 2012-09-12 北京空间飞行器总体设计部 Safety online detecting system facing to terminal computers

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1581094A (en) * 2003-08-05 2005-02-16 联想(北京)有限公司 Method for long-range determining system state
CN1584850A (en) * 2003-08-20 2005-02-23 联想(北京)有限公司 Method for passive long-range deterministic system state
CN101662387A (en) * 2009-10-14 2010-03-03 中国电信股份有限公司 System for detecting computer accessing situation in network and method thereof
CN101799751A (en) * 2009-12-02 2010-08-11 山东浪潮齐鲁软件产业股份有限公司 Method for building monitoring agent software of host machine
JP2012038257A (en) * 2010-08-11 2012-02-23 Nec Corp Os operating state confirmation system, confirmation object device, os operating state confirmation device, and os operating state confirmation method and program
CN102148880A (en) * 2011-03-18 2011-08-10 浪潮电子信息产业股份有限公司 Dual-network interface card (NIC) bonding method in LINUX system of blade server
CN102307119A (en) * 2011-08-18 2012-01-04 工业和信息化部电信传输研究所 Method for discovering probe failure in Internet performance measurement system
CN102663298A (en) * 2012-04-06 2012-09-12 北京空间飞行器总体设计部 Safety online detecting system facing to terminal computers

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106133699A (en) * 2014-06-20 2016-11-16 Nec 飞鼎克株式会社 Malfunction informing device, failure notification method and program
CN104268068A (en) * 2014-09-24 2015-01-07 浪潮电子信息产业股份有限公司 Server log information acquisition method
CN104407930A (en) * 2014-10-22 2015-03-11 陆俊 Method and system for assisting in remote mobile terminal
CN105227347A (en) * 2015-08-25 2016-01-06 中国联合网络通信集团有限公司 A kind of general O&M method for supervising and O&M supervisory control system
CN106095642A (en) * 2016-06-16 2016-11-09 浪潮电子信息产业股份有限公司 A kind of fan failure solution based on RMC management
CN107948989A (en) * 2016-10-13 2018-04-20 北京国双科技有限公司 A kind of computational methods and device of mobile terminal networking duration
CN107396183A (en) * 2017-07-20 2017-11-24 烽火通信科技股份有限公司 The start network detecting method and system of convergent terminal equipment
CN107396183B (en) * 2017-07-20 2020-04-28 烽火通信科技股份有限公司 Starting-up network detection method and system for converged terminal equipment
CN110399145A (en) * 2018-04-24 2019-11-01 宏碁股份有限公司 Computer system, its update method and computer program product
CN108647129A (en) * 2018-05-22 2018-10-12 山东中创软件商用中间件股份有限公司 A kind of monitoring method of server, device, equipment and medium
CN110311807A (en) * 2019-06-06 2019-10-08 东南大学 A kind of driven hanging basket data collection system Network status adaptive process monitoring method
CN110311807B (en) * 2019-06-06 2022-04-08 东南大学 Network condition self-adaptive monitoring method for electric hanging basket data acquisition system
CN112860338A (en) * 2021-01-29 2021-05-28 中国农业银行股份有限公司 Software starting method and device
CN112860338B (en) * 2021-01-29 2024-06-11 中国农业银行股份有限公司 Software starting method and device

Similar Documents

Publication Publication Date Title
CN102736970A (en) Method for monitoring activity state of operating system
CN105306272B (en) Information system fault scenes formation gathering method and system
CN102325036B (en) The method for diagnosing faults of a kind of network system, system and device
CA2931150C (en) Automatic inspection and monitoring method based on time domain slotting control
CN107547273B (en) Method and system for guaranteeing high availability of virtual instance of power system
CN105490870A (en) Method for monitoring operation state of Linux server in batch
CN103378982A (en) Internet business operation monitoring method and Internet business operation monitoring system
CN101997925A (en) Server monitoring method with early warning function and system thereof
CN103778049A (en) Embedded system development log system
CN112019370B (en) Equipment fault processing method and system
CN106789306A (en) Restoration methods and system are collected in communication equipment software fault detect
CN109462490B (en) Video monitoring system and fault analysis method
CN109714202A (en) A kind of client off-line reason method of discrimination and concentrating type safety management system
CN103198007A (en) Multi-process log output method and system
CN104734904B (en) The automatic test approach and system of bypass equipment
CN109586989B (en) State checking method, device and cluster system
CN108762886A (en) The fault detect restoration methods and system of virtual machine
JP5425720B2 (en) Virtualization environment monitoring apparatus and monitoring method and program thereof
CN112994972A (en) Distributed probe monitoring platform
CN114356499A (en) Kubernetes cluster alarm root cause analysis method and device
CN105630647A (en) Equipment detection method and detection equipment
CN109840178A (en) A kind of method, mobile terminal and the device of monitoring journal information
CN101645983A (en) Network management system and method using same for testing network equipment
CN110445694A (en) A method of trigger notice is monitored based on Zabbix
CN108011783A (en) The backup method and standby system of a kind of diagnosis server

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20121017