CN104714863A - Method for completely storing Raid card logs on basis of Linux operation system after system crashes - Google Patents

Method for completely storing Raid card logs on basis of Linux operation system after system crashes Download PDF

Info

Publication number
CN104714863A
CN104714863A CN201510063331.4A CN201510063331A CN104714863A CN 104714863 A CN104714863 A CN 104714863A CN 201510063331 A CN201510063331 A CN 201510063331A CN 104714863 A CN104714863 A CN 104714863A
Authority
CN
China
Prior art keywords
raid card
linux
suse
server
machine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510063331.4A
Other languages
Chinese (zh)
Inventor
刘兢
任华进
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd filed Critical Inspur Electronic Information Industry Co Ltd
Priority to CN201510063331.4A priority Critical patent/CN104714863A/en
Publication of CN104714863A publication Critical patent/CN104714863A/en
Pending legal-status Critical Current

Links

Abstract

The invention discloses a method for completely storing Raid card logs on the basis of a Linux operation system after the system crashes, belongs to the technical field of computer storage, and particularly relates to the achievement of an automatic information collection function required for analyzing the machine failure abnormity problem. According to the method, the server restart function in IPMI functions is achieved through an ipmitool tool in the Linux operation system, a crash server restart operation is automatically triggered, and the whole procedure of collecting Raid card information is automatically started. Compared with the prior art, the method has the advantages that the crash problem can be automatically solved, the logs are automatically collected without attended operation, and quite good application and popularization value is achieved.

Description

A kind of system based on (SuSE) Linux OS is delayed the method for the complete preservation of Raid card daily record after machine
Technical field
The present invention relates to computer memory technical field, specifically a kind of system based on (SuSE) Linux OS is delayed the method for the complete preservation of Raid card daily record after machine.
Background technology
Along with the development of the new techniques such as cloud computing, large data, people to server stable, reliably work to need sum-average arithmetic non-failure operation time (MTBF) to require more and more higher, how effectively shorten server failure repair time, the efficiency and the accuracy that improve fault analysis are the difficult problem of pendulum in face of each maintainer.
But, after the server in using surprisingly delays machine, allow after server autoboot, labor is just needed to cause equipment to delay the reason of machine, if but be now forced by server power-off shutdown just there will be the disappearance of Raid card recorded information or lose completely before, and then allow the process of whole analyzing failure cause lack necessary information, more likely cannot confirm the part at problem place.
Summary of the invention
Technical assignment of the present invention is for above-mentioned the deficiencies in the prior art, a kind of method that system based on (SuSE) Linux OS delays the complete preservation of Raid card daily record after machine is provided, have and automatically process machine problem of delaying, automatic collector journal, the feature that unmanned completes automatically.
Technical assignment of the present invention realizes in the following manner: a kind of system based on (SuSE) Linux OS is delayed the method for the complete preservation of Raid card daily record after machine, be characterized in realizing restarting server capability in IPMI function by the ipmitool instrument under (SuSE) Linux OS, machine server operation of delaying is restarted in automatic triggering, and starts the whole process of collecting Raid card information automatically.
As preferably, confirm whether this server is in normal state by the logical station server of continuous print ping always, if server is occurring that ping order performs failure sometime, then under operating system, ipmitool instrument is performing the operation of autoboot immediately to this server.
The operation of restarting far-end server is performed by the shell script under (SuSE) Linux OS.
Called the implementing procedure collecting the daily record of Raid card by the shell script under (SuSE) Linux OS, start the process of whole collection information.
Under the log information of Raid card is retained in the catalogue of specifying, and in script, be accompanied with the method for automatically preserving above-mentioned log information.Concrete grammar is preferably: under being kept at the system directory of specifying after using the method for compression the packing of all log information files to be compressed.
Described Raid card information comprises current Raid card-like state, disk state information, Raid array status, and the recorded information of Raid card working condition of Raid controller feedback within a period of time.
The method and apparatus related in said method comprises: (1) Raid card; (2) Raid card log information collection method; (3) shell script of robotization; (4) Raid card log information method is collected in start; (5) IPMI; (6) ipmitool instrument; (7) ping order.Wherein:
(1), Raid card: Raid card is used to the board realizing Raid function, and Raid is the abbreviation of English Redundant Array of Independent Disks, translates into Chinese and is Redundant Array of Independent Disks (RAID), or is called for short disk array.Raid be a kind of polylith independently hard disk (physical hard disk) to combine differently formation hard disk groups (logic hard disk), thus the memory property higher than single hard disk is provided and the technology of data redundancy is provided.
(2), Raid card log information collection method: under using the linux system of specifying, program completes the collection of Raid card log information.
(3), the shell script of robotization: the automated execution write collects the calling program of information, and the program can calling the collection of Raid card information completes the process of collection, realizes the collection of robotization.
(4) Raid card log information method, is collected: use the wscript.exe under operating system to complete, automatic collector journal function has started according to machine operation system and performed specific program, just can complete the flow process of whole collection information after executive routine.
(5) IPMI: wisdom platform management interface (Intelligent Platform Management Interface), IPMI can across different operating system, firmware and hardware platform, supervision that can be intelligent, control and automatically return the functioning condition of a large amount of server, IPMI is independent of the outer self-contained operation of operating system, even and if allow that supvr is lacking operating system or the system management software, or monitored system closedown but still can remote management system when connecing power supply, IPMI also can be movable after os starting, can also provide when using in the lump with system management function and add powerful, start shooting, restart, the operations such as shutdown.
(6) ipmitool instrument: ipmitool is a kind of ipmi platform management instrument of the command line mode that can be used under Linux system, its supports IPMI 2.0 specification, by it can realize obtaining sensor information, display system log content, network remote switching on and shutting down, the function such as to restart.
(7) ping order: ping is that Linux operation is issued orders, and utilizes " ping " order can check whether network is communicated with, can analyze and judge current network fault whether and equipment whether normal.
Compared with prior art, the system based on the (SuSE) Linux OS of the present invention method of the complete preservation of Raid card daily record after machine of delaying has following characteristics:
(1) can automatically process machine problem of delaying, realize automatic collector journal, unmanned completes automatically;
(2) utilize the execution of simple shell script to restart the operation of far-end server and the process of the information of collection, there is good versatility;
(3) as long as under copying script file to corresponding operating system, the order of execution is set, the whether normal instruction of automatically detecting operating system will run down always, when there is the machine of delaying of server, send the order of restarting server at once, server OS completes restarts rear log information and has also collected, and just no longer needs other intervention operation, have good ease for use once these after being provided with.
Accompanying drawing explanation
Fig. 1 is that embodiment inediting rc.local text arranges operation collector journal script example;
Fig. 2 is the script file and collection kit schematic diagram that need under system in embodiment.
Embodiment
With specific embodiment, the system based on the (SuSE) Linux OS of the present invention method of the complete preservation of Raid card daily record after machine of delaying is described in detail below.
Embodiment:
The inventive method is delayed for server the situation of machine, realize restarting server capability in IPMI function by the ipmitool instrument under (SuSE) Linux OS, machine server operation of delaying is restarted in automatic triggering, and start automatically to collect Raid card log information, preserve the relevant valuable information of up-to-date Raid card, Raid card log information comprises current Raid card-like state, disk state information, Raid array status, the recorded information of the Raid card working condition of Raid controller feedback within a period of time.Concrete grammar is as follows:
By the characteristic of ping order under linux system, can confirm whether this station server is in normal state by the logical station server of continuous print ping always, if server is occurring that ping order performs failure sometime, then under operating system, ipmitool instrument uses, under order (restarting server command) performs the operation that this server performs autoboot immediately:
Due to server OS /etc/rc.d/rc.local in the addition of a fill order, when after system autoboot, automatically will start the process of collecting the daily record of Raid card, editor's rc.local text arranges and runs collector journal script as shown in Figure 1.
The preliminary work that collection process starts, need the implementing procedure and execution script program of collecting the daily record of Raid card to be placed into (as shown in Figure 2) under some catalogues of operating system, this catalogue is exactly the Raid card information file place catalogue that later collection arrives, the information obtaining needing just directly can arrive this catalogue and check, the ease for use of the inventive method is embodied.
The shell script write in said method, the implementing procedure needed in collection process can be called automatically, simultaneously when performing this shell script, the version of program meeting automatic decision (SuSE) Linux OS itself is 32 or 64, for different versions, corresponding implementing procedure is used to carry out the work of collection information.After completing the step of collection, under the log information of Raid card can be retained in specific catalogue, in order to save the time of searching this information, also the method for automatically preserving these important informations has been attached in script, the very little file of a volume is compressed into after using the method for compression to be packed by all message files, under being finally kept at the system directory of specifying, facilitate follow-up extraction and transmission work, concrete script file is as follows:
The message sample finally collected is as follows:
The compressed package obtained after executing automatic collection process.

Claims (7)

1. the system based on (SuSE) Linux OS is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that: realize restarting server capability in IPMI function by the ipmitool instrument under (SuSE) Linux OS, machine server operation of delaying is restarted in automatic triggering, and starts the whole process of collecting Raid card information automatically.
2. the system based on (SuSE) Linux OS according to claim 1 is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that, confirm whether this server is in normal state by the logical station server of continuous print ping always, if server is occurring that ping order performs failure sometime, then under operating system, ipmitool instrument is performing the operation of autoboot immediately to this server.
3. the system based on (SuSE) Linux OS according to claim 1 is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that, is performed the operation of restarting far-end server by the shell script under (SuSE) Linux OS.
4. the system based on (SuSE) Linux OS according to claim 1 is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that, called the implementing procedure collecting the daily record of Raid card by the shell script under (SuSE) Linux OS, start the process of whole collection information.
5. the system based on (SuSE) Linux OS according to claim 4 is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that, under the log information of Raid card is retained in the catalogue of specifying, and in script, be accompanied with the method for automatically preserving above-mentioned log information.
6. the system based on (SuSE) Linux OS according to claim 5 is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that, uses the method for compression by under being kept at the system directory of specifying after all log information files packing compression.
7. the system based on (SuSE) Linux OS according to claim 1 is delayed the method for the complete preservation of Raid card daily record after machine, it is characterized in that, described Raid card information comprises current Raid card-like state, disk state information, Raid array status, and the recorded information of Raid card working condition of Raid controller feedback within a period of time.
CN201510063331.4A 2015-02-06 2015-02-06 Method for completely storing Raid card logs on basis of Linux operation system after system crashes Pending CN104714863A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510063331.4A CN104714863A (en) 2015-02-06 2015-02-06 Method for completely storing Raid card logs on basis of Linux operation system after system crashes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510063331.4A CN104714863A (en) 2015-02-06 2015-02-06 Method for completely storing Raid card logs on basis of Linux operation system after system crashes

Publications (1)

Publication Number Publication Date
CN104714863A true CN104714863A (en) 2015-06-17

Family

ID=53414225

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510063331.4A Pending CN104714863A (en) 2015-02-06 2015-02-06 Method for completely storing Raid card logs on basis of Linux operation system after system crashes

Country Status (1)

Country Link
CN (1) CN104714863A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105955875A (en) * 2016-05-04 2016-09-21 浪潮电子信息产业股份有限公司 Device and method for monitoring logs of RAID cards
CN106201799A (en) * 2016-07-14 2016-12-07 浪潮电子信息产业股份有限公司 A kind of service based on ipmi carries out, to server, the method for testing that DC is restarted
CN106776090A (en) * 2016-11-29 2017-05-31 郑州云海信息技术有限公司 A kind of method for collecting information when RHEL operating systems are without response
WO2017148271A1 (en) * 2016-03-04 2017-09-08 中兴通讯股份有限公司 Linux system reset processing method and device, and computer storage medium
CN107665260A (en) * 2017-10-24 2018-02-06 郑州云海信息技术有限公司 A kind of log collection instrument based on Linux system
CN108459932A (en) * 2018-03-02 2018-08-28 郑州云海信息技术有限公司 A kind of method, apparatus and equipment of management RAID card
CN109189601A (en) * 2018-09-06 2019-01-11 郑州云海信息技术有限公司 The grasping means of RAID card log information under a kind of linux system
CN109324834A (en) * 2018-09-19 2019-02-12 郑州云海信息技术有限公司 A kind of system and method that distributed storage server is restarted automatically
CN110297745A (en) * 2019-07-04 2019-10-01 中山大学 A kind of Fault Locating Method and system storing monitoring system
CN111506441A (en) * 2020-04-14 2020-08-07 浪潮商用机器有限公司 Method, device, equipment and storage medium for monitoring Raid card information

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102394791A (en) * 2011-10-26 2012-03-28 浪潮(北京)电子信息产业有限公司 Downtime recovery method and system
CN103593269A (en) * 2013-11-01 2014-02-19 浪潮电子信息产业股份有限公司 Automatic cyclic test method of restart pressure of multiple PCIe devices
CN103970661A (en) * 2014-05-19 2014-08-06 浪潮电子信息产业股份有限公司 Method for batched server memory fault detection through IPMI tool
CN103995772A (en) * 2014-06-10 2014-08-20 浪潮电子信息产业股份有限公司 RAID card log completely-storing method based on LINUX operation system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102394791A (en) * 2011-10-26 2012-03-28 浪潮(北京)电子信息产业有限公司 Downtime recovery method and system
CN103593269A (en) * 2013-11-01 2014-02-19 浪潮电子信息产业股份有限公司 Automatic cyclic test method of restart pressure of multiple PCIe devices
CN103970661A (en) * 2014-05-19 2014-08-06 浪潮电子信息产业股份有限公司 Method for batched server memory fault detection through IPMI tool
CN103995772A (en) * 2014-06-10 2014-08-20 浪潮电子信息产业股份有限公司 RAID card log completely-storing method based on LINUX operation system

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017148271A1 (en) * 2016-03-04 2017-09-08 中兴通讯股份有限公司 Linux system reset processing method and device, and computer storage medium
CN107153453A (en) * 2016-03-04 2017-09-12 中兴通讯股份有限公司 A kind of linux system reset processing method and device
CN105955875A (en) * 2016-05-04 2016-09-21 浪潮电子信息产业股份有限公司 Device and method for monitoring logs of RAID cards
CN106201799A (en) * 2016-07-14 2016-12-07 浪潮电子信息产业股份有限公司 A kind of service based on ipmi carries out, to server, the method for testing that DC is restarted
CN106776090A (en) * 2016-11-29 2017-05-31 郑州云海信息技术有限公司 A kind of method for collecting information when RHEL operating systems are without response
CN107665260A (en) * 2017-10-24 2018-02-06 郑州云海信息技术有限公司 A kind of log collection instrument based on Linux system
CN108459932A (en) * 2018-03-02 2018-08-28 郑州云海信息技术有限公司 A kind of method, apparatus and equipment of management RAID card
CN109189601A (en) * 2018-09-06 2019-01-11 郑州云海信息技术有限公司 The grasping means of RAID card log information under a kind of linux system
CN109324834A (en) * 2018-09-19 2019-02-12 郑州云海信息技术有限公司 A kind of system and method that distributed storage server is restarted automatically
CN110297745A (en) * 2019-07-04 2019-10-01 中山大学 A kind of Fault Locating Method and system storing monitoring system
CN111506441A (en) * 2020-04-14 2020-08-07 浪潮商用机器有限公司 Method, device, equipment and storage medium for monitoring Raid card information
CN111506441B (en) * 2020-04-14 2023-06-16 浪潮商用机器有限公司 Method, device, equipment and storage medium for monitoring Raid card information

Similar Documents

Publication Publication Date Title
CN104714863A (en) Method for completely storing Raid card logs on basis of Linux operation system after system crashes
US9377964B2 (en) Systems and methods for improving snapshot performance
CN101770410B (en) System reducing method based on client operating system, virtual machine manager and system
CN103744764A (en) Crontab based whole computer memory stability test method
CN101093462B (en) Automatization method for testing schooling pressure on database application
CN105204979A (en) Recording method of Android logs and mobile terminal
CN110750396B (en) Server operating system compatibility testing method and device and storage medium
CN106598796A (en) Method for testing hardware information stability in reboot
CN104636242A (en) Method for automatically deleting repeated content in system logs on basis of Linux operating system
CN104572422A (en) Memory monitoring achievement method based on startup and shutdown of Linux system
CN104317709A (en) Method and system for testing performance of software
CN112068852B (en) Method, system, equipment and medium for installing open-source software based on domestic server
CN110704287B (en) RAID card abnormal log collection method and system under Linux system and storage medium
CN103593269A (en) Automatic cyclic test method of restart pressure of multiple PCIe devices
CN105718330A (en) Linux system backup data recovery method and device
CN108762886B (en) Fault detection recovery method and system for virtual machine
CN103995772A (en) RAID card log completely-storing method based on LINUX operation system
CN104021058A (en) Method for quickly starting test board card
CN106557395B (en) Application performance monitoring management method, system and application method of system
CN102063365B (en) Method and device for recording operation information of single plate
CN105786679A (en) Automatic test monitoring system and method and mobile terminal
CN104572350B (en) A kind of metadata processing method and device
CN102929746A (en) Quick backup and recovery method for lottery sale system
CN107133084A (en) A kind of method of testing that XenServer virtualization system certifications are carried out to storage product
CN106775451A (en) A kind of method and device for processing logical volume

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20150617

WD01 Invention patent application deemed withdrawn after publication