CN102662787A - Method for protecting system disk RAID (redundant array of independent disks) - Google Patents

Method for protecting system disk RAID (redundant array of independent disks) Download PDF

Info

Publication number
CN102662787A
CN102662787A CN201210116418XA CN201210116418A CN102662787A CN 102662787 A CN102662787 A CN 102662787A CN 201210116418X A CN201210116418X A CN 201210116418XA CN 201210116418 A CN201210116418 A CN 201210116418A CN 102662787 A CN102662787 A CN 102662787A
Authority
CN
China
Prior art keywords
raid
user
detection module
module
allowing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210116418XA
Other languages
Chinese (zh)
Inventor
孙磊
李瑞东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Electronic Information Industry Co Ltd
Original Assignee
Inspur Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Electronic Information Industry Co Ltd filed Critical Inspur Electronic Information Industry Co Ltd
Priority to CN201210116418XA priority Critical patent/CN102662787A/en
Publication of CN102662787A publication Critical patent/CN102662787A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention provides a method for protecting system disk RAID (redundant array of independent disks). A system comprises a detection module and an operation module. The detection module is used for detecting status of system disk RAID. The operation module is used for shutting down or resuming system operation according to the status fed back by the detection module. The detection module issues a mandatory resuming command to the operation module to force a user to operate resuming according to the RAID information acquired by the detection module, and otherwise the system is kept shut down. The method includes: using the system to acquire disk-unrecognized RAIDdegrade information through interface information provided by an API (application program interface) of a RAID card, namely receiving alarm logs sent from the RAID card to determine the status or allowing the detection module to detect the status of RAID through the provided API; allowing the detection module to automatically trigger next operation when acquiring disk recognition failure of RAID, and issuing alert to a user using the system; and allowing the operation module to force-suspend the system used by the user and shut down if the user does not handle in time, by the time the user forces to boot up, allowing the operation module to refuse booting, and allowing the user to normally boot up only when the user really replaces the failing disk and the RAID status is recovered.

Description

The method of a kind of protection system dish raid
Technical field
The present invention relates to the Computer Storage field tests, be specifically related to the method for a kind of protection system dish raid.
Background technology
There is a fatal problem in present many system disk raid, if be exactly the untimely reparation of disk failures, cause entirely collapsing of system possibly.And when raid broke down, general storage all can have the alarm mode, had red alarm, also can follow day to reporting to the police such as controller buzzer warning, dish cabinet; But machine is if at remote equipment room, and the user does not go to check daily record again timely, does not go to repair fault timely; Cause the further degradation of raid so possibly; Causing the damage of irrecoverable property. the present invention addresses this is that exactly, through the identification to the raid state, comes force users to carry out raid and repairs.
Summary of the invention
The method that the purpose of this invention is to provide a kind of protection system dish raid.
The objective of the invention is to realize that by following mode system comprises detection module and operational module, the state of detection module detection system dish raid; Operational module is closed or the operation of recovery system according to the state of detection module feedback, and detection module is according to collecting raid information; Send compulsory restore instruction and give operational module; Thereby force users is carried out recovery operation, otherwise system will keep shut, and concrete steps are following:
The interface message that system utilizes the API of raid card to provide is earlier obtained the raid degrade information of dish, comprises through receiving the alarm log that the raid card sends coming the judgement state or removing to detect the raid state through the api interface that provides by detection module oneself, and detection module is getting access to after raid falls dish information; Automatically trigger next operation; The user who just gives at using system gives a warning, if the untimely processing of user, operational module will be to the custom system mandatory pause so; Carry out power-off operation; If at this time the user goes to start shooting by force, the start of operational module refusal has only after the hard disk that the real handle of user breaks down is changed; After recovering the raid state, the user just can normal boot-strap.
The invention has the beneficial effects as follows: solved because user's carelessness or network manager's quality are not high, since negligence of management, the data degradation that can't retrieve that is caused.Because being force users, way of the present invention carries out fault handling and recovery; These reliability and stability of having strengthened system have greatly been simplified raid false alarm mechanism; When the raid mistake takes place, always there was the daily record of a group to wait deciphering in the past, judging how to do then.And the present invention simplifies these processes, only needs the user to carry out fault recovery just by prompting.
Description of drawings
Fig. 1 is a system flowchart;
Fig. 2 is the system architecture synoptic diagram.
Embodiment
Explanation at length below with reference to Figure of description method of the present invention being done.
The method of a kind of protection system dish raid of the present invention; Be the raid degrade information that the interface message of at first utilizing the API of raid card to provide is obtained dish, this can be in several ways: can come the judgement state through receiving the alarm log that the raid card sends; Can certainly remove to detect the raid state by detection module oneself through the api interface that provides. detection module is getting access to after raid falls dish information, and unlike the operation that kind of the routine fault of dishing out, but trigger next operation automatically; The user who just just gives at using system gives a warning, if the untimely processing of user, operational module will be to the custom system mandatory pause so; Carry out power-off operation; If at this time the user goes to start shooting by force, operational module does not allow to start shooting, and has only after the hard disk that the real handle of user breaks down is worse; After recovering the raid state, the user just can normal boot-strap.
Embodiment
As shown in the figure, through the api interface information of raid card, can obtain the status information of present raid, if the state of degrade, detection module can send the instruction of hard closing system and give operational module after receiving information.
Operational module forces the user to carry out system recovery to the operation that system carries out hard closing; Can't not get into system if do not recover the user, wait the user to recover after, through the new raid status information of RAID card API feedback, detection module starts the instruction of open system, at this time the user just can get into system.
Can realize that detection module is responsible for sticking into row with raid and is connected alternately, detects and exchange raid information constantly through embedded detection module under OS and operational module software.Operational module and OS bind, and carry out corresponding operation according to the input of detection module.
Except that the described technical characterictic of instructions, be the known technology of those skilled in the art.

Claims (1)

1. the method for a protection system dish raid is characterized in that, system comprises detection module and operational module; The state of detection module detection system dish raid, operational module is closed or the operation of recovery system according to the state of detection module feedback; Detection module sends compulsory restore instruction and give operational module, thereby force users is carried out recovery operation according to collecting raid information; Otherwise system will keep shut, and concrete steps are following:
The interface message that system utilizes the API of raid card to provide is earlier obtained the raid degrade information of dish, comprises through receiving the alarm log that the raid card sends coming the judgement state or removing to detect the raid state through the api interface that provides by detection module oneself, and detection module is getting access to after raid falls dish information; Automatically trigger next operation; The user who just gives at using system gives a warning, if the untimely processing of user, operational module will be to the custom system mandatory pause so; Carry out power-off operation; If at this time the user goes to start shooting by force, the start of operational module refusal has only after the hard disk that the real handle of user breaks down is changed; After recovering the raid state, the user just can normal boot-strap.
CN201210116418XA 2012-04-20 2012-04-20 Method for protecting system disk RAID (redundant array of independent disks) Pending CN102662787A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210116418XA CN102662787A (en) 2012-04-20 2012-04-20 Method for protecting system disk RAID (redundant array of independent disks)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210116418XA CN102662787A (en) 2012-04-20 2012-04-20 Method for protecting system disk RAID (redundant array of independent disks)

Publications (1)

Publication Number Publication Date
CN102662787A true CN102662787A (en) 2012-09-12

Family

ID=46772286

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210116418XA Pending CN102662787A (en) 2012-04-20 2012-04-20 Method for protecting system disk RAID (redundant array of independent disks)

Country Status (1)

Country Link
CN (1) CN102662787A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049345A (en) * 2012-12-10 2013-04-17 北京百度网讯科技有限公司 Magnetic disk state transition detection method and device based on asynchronous communication mechanism
CN103207820A (en) * 2013-02-05 2013-07-17 北京百度网讯科技有限公司 Method and device for fault positioning of hard disk on basis of raid card log
CN103995772A (en) * 2014-06-10 2014-08-20 浪潮电子信息产业股份有限公司 RAID card log completely-storing method based on LINUX operation system
CN104679623A (en) * 2013-11-29 2015-06-03 中国移动通信集团公司 Server hard disk maintaining method, system and server monitoring equipment
CN105045689A (en) * 2015-06-25 2015-11-11 浪潮电子信息产业股份有限公司 Method for using RAID card to perform hard disk batch detection, monitoring and alerting
CN106021065A (en) * 2016-05-19 2016-10-12 浪潮电子信息产业股份有限公司 Method for automatically detecting bad track information of raid disk under linux

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030204788A1 (en) * 2002-04-29 2003-10-30 International Business Machines Corporation Predictive failure analysis for storage networks
CN1808365A (en) * 2005-01-17 2006-07-26 英业达股份有限公司 Automatic reconstruction method for disk redundancy array device
US20080040540A1 (en) * 2006-08-11 2008-02-14 Intel Corporation On-disk caching for raid systems
CN201546683U (en) * 2009-09-28 2010-08-11 高建军 Radio-alarming self-stopping device of pumping unit when in breaking or disconnection of polish rod, sand plugging or wax plugging

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030204788A1 (en) * 2002-04-29 2003-10-30 International Business Machines Corporation Predictive failure analysis for storage networks
CN1808365A (en) * 2005-01-17 2006-07-26 英业达股份有限公司 Automatic reconstruction method for disk redundancy array device
US20080040540A1 (en) * 2006-08-11 2008-02-14 Intel Corporation On-disk caching for raid systems
CN201546683U (en) * 2009-09-28 2010-08-11 高建军 Radio-alarming self-stopping device of pumping unit when in breaking or disconnection of polish rod, sand plugging or wax plugging

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049345A (en) * 2012-12-10 2013-04-17 北京百度网讯科技有限公司 Magnetic disk state transition detection method and device based on asynchronous communication mechanism
CN103049345B (en) * 2012-12-10 2015-11-25 北京百度网讯科技有限公司 Based on Disk State transition detection method and the device of asynchronous mechanism
CN103207820A (en) * 2013-02-05 2013-07-17 北京百度网讯科技有限公司 Method and device for fault positioning of hard disk on basis of raid card log
CN103207820B (en) * 2013-02-05 2016-06-29 北京百度网讯科技有限公司 The Fault Locating Method of hard disk and device based on raid card log
CN104679623A (en) * 2013-11-29 2015-06-03 中国移动通信集团公司 Server hard disk maintaining method, system and server monitoring equipment
CN103995772A (en) * 2014-06-10 2014-08-20 浪潮电子信息产业股份有限公司 RAID card log completely-storing method based on LINUX operation system
CN105045689A (en) * 2015-06-25 2015-11-11 浪潮电子信息产业股份有限公司 Method for using RAID card to perform hard disk batch detection, monitoring and alerting
CN106021065A (en) * 2016-05-19 2016-10-12 浪潮电子信息产业股份有限公司 Method for automatically detecting bad track information of raid disk under linux

Similar Documents

Publication Publication Date Title
CN107179957B (en) Physical machine fault classification processing method and device and virtual machine recovery method and system
CN102279775B (en) Method for processing failure of hard disk under Linux system
CN102591591B (en) Disk detection system, disk detection method and network store system
CN102662787A (en) Method for protecting system disk RAID (redundant array of independent disks)
CN104536855B (en) Fault detection method and device
CN102662821B (en) Method, device and system for auxiliary diagnosis of virtual machine failure
CN106789306B (en) Method and system for detecting, collecting and recovering software fault of communication equipment
CN102880522B (en) Hardware fault-oriented method and device for correcting faults in key files of system
CN101097531A (en) Computer RAID array early-warning system and method
CN101866271A (en) Security early warning system and method based on RAID
CN101582046B (en) High-available system state monitoring, forcasting and intelligent management method
CN102404141B (en) Method and device of alarm inhibition
CN101373450B (en) Method and system for processing CPU abnormity
CN109284207A (en) Hard disc failure processing method, device, server and computer-readable medium
CN103955417A (en) Computer hard disc data detecting equipment and method
CN101556679A (en) Method for processing failures in integrated front-end system and computer equipment
CN105740110A (en) Detection method for smart information of hard disk in linux system
CN105607973B (en) Method, device and system for processing equipment fault in virtual machine system
CN101145983B (en) A self-diagnosis and self-discovery subsystem and method of network management system
CN108958965A (en) A kind of BMC monitoring can restore the method, device and equipment of ECC error
CN108762886A (en) The fault detect restoration methods and system of virtual machine
CN102841589A (en) Remote intelligent maintenance system for online environment monitoring instrument
JP2009276929A (en) Automatic fault handling system
CN103995759A (en) High-availability computer system failure handling method and device based on core internal-external synergy
JP6216621B2 (en) Plant monitoring and control system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120912