CN102662787A

CN102662787A - Method for protecting system disk RAID (redundant array of independent disks)

Info

Publication number: CN102662787A
Application number: CN201210116418XA
Authority: CN
Inventors: 孙磊; 李瑞东
Original assignee: Inspur Electronic Information Industry Co Ltd
Current assignee: Inspur Electronic Information Industry Co Ltd
Priority date: 2012-04-20
Filing date: 2012-04-20
Publication date: 2012-09-12

Abstract

The invention provides a method for protecting system disk RAID (redundant array of independent disks). A system comprises a detection module and an operation module. The detection module is used for detecting status of system disk RAID. The operation module is used for shutting down or resuming system operation according to the status fed back by the detection module. The detection module issues a mandatory resuming command to the operation module to force a user to operate resuming according to the RAID information acquired by the detection module, and otherwise the system is kept shut down. The method includes: using the system to acquire disk-unrecognized RAIDdegrade information through interface information provided by an API (application program interface) of a RAID card, namely receiving alarm logs sent from the RAID card to determine the status or allowing the detection module to detect the status of RAID through the provided API; allowing the detection module to automatically trigger next operation when acquiring disk recognition failure of RAID, and issuing alert to a user using the system; and allowing the operation module to force-suspend the system used by the user and shut down if the user does not handle in time, by the time the user forces to boot up, allowing the operation module to refuse booting, and allowing the user to normally boot up only when the user really replaces the failing disk and the RAID status is recovered.

Description

The method of a kind of protection system dish raid

Technical field

The present invention relates to the Computer Storage field tests, be specifically related to the method for a kind of protection system dish raid.

Background technology

There is a fatal problem in present many system disk raid, if be exactly the untimely reparation of disk failures, cause entirely collapsing of system possibly.And when raid broke down, general storage all can have the alarm mode, had red alarm, also can follow day to reporting to the police such as controller buzzer warning, dish cabinet; But machine is if at remote equipment room, and the user does not go to check daily record again timely, does not go to repair fault timely; Cause the further degradation of raid so possibly; Causing the damage of irrecoverable property. the present invention addresses this is that exactly, through the identification to the raid state, comes force users to carry out raid and repairs.

Summary of the invention

The method that the purpose of this invention is to provide a kind of protection system dish raid.

The objective of the invention is to realize that by following mode system comprises detection module and operational module, the state of detection module detection system dish raid; Operational module is closed or the operation of recovery system according to the state of detection module feedback, and detection module is according to collecting raid information; Send compulsory restore instruction and give operational module; Thereby force users is carried out recovery operation, otherwise system will keep shut, and concrete steps are following:

The interface message that system utilizes the API of raid card to provide is earlier obtained the raid degrade information of dish, comprises through receiving the alarm log that the raid card sends coming the judgement state or removing to detect the raid state through the api interface that provides by detection module oneself, and detection module is getting access to after raid falls dish information; Automatically trigger next operation; The user who just gives at using system gives a warning, if the untimely processing of user, operational module will be to the custom system mandatory pause so; Carry out power-off operation; If at this time the user goes to start shooting by force, the start of operational module refusal has only after the hard disk that the real handle of user breaks down is changed; After recovering the raid state, the user just can normal boot-strap.

The invention has the beneficial effects as follows: solved because user's carelessness or network manager's quality are not high, since negligence of management, the data degradation that can't retrieve that is caused.Because being force users, way of the present invention carries out fault handling and recovery; These reliability and stability of having strengthened system have greatly been simplified raid false alarm mechanism; When the raid mistake takes place, always there was the daily record of a group to wait deciphering in the past, judging how to do then.And the present invention simplifies these processes, only needs the user to carry out fault recovery just by prompting.

Description of drawings

Fig. 1 is a system flowchart;

Fig. 2 is the system architecture synoptic diagram.

Embodiment

Explanation at length below with reference to Figure of description method of the present invention being done.

The method of a kind of protection system dish raid of the present invention; Be the raid degrade information that the interface message of at first utilizing the API of raid card to provide is obtained dish, this can be in several ways: can come the judgement state through receiving the alarm log that the raid card sends; Can certainly remove to detect the raid state by detection module oneself through the api interface that provides. detection module is getting access to after raid falls dish information, and unlike the operation that kind of the routine fault of dishing out, but trigger next operation automatically; The user who just just gives at using system gives a warning, if the untimely processing of user, operational module will be to the custom system mandatory pause so; Carry out power-off operation; If at this time the user goes to start shooting by force, operational module does not allow to start shooting, and has only after the hard disk that the real handle of user breaks down is worse; After recovering the raid state, the user just can normal boot-strap.

Embodiment

As shown in the figure, through the api interface information of raid card, can obtain the status information of present raid, if the state of degrade, detection module can send the instruction of hard closing system and give operational module after receiving information.

Operational module forces the user to carry out system recovery to the operation that system carries out hard closing; Can't not get into system if do not recover the user, wait the user to recover after, through the new raid status information of RAID card API feedback, detection module starts the instruction of open system, at this time the user just can get into system.

Can realize that detection module is responsible for sticking into row with raid and is connected alternately, detects and exchange raid information constantly through embedded detection module under OS and operational module software.Operational module and OS bind, and carry out corresponding operation according to the input of detection module.

Except that the described technical characterictic of instructions, be the known technology of those skilled in the art.

Claims

1. the method for a protection system dish raid is characterized in that, system comprises detection module and operational module; The state of detection module detection system dish raid, operational module is closed or the operation of recovery system according to the state of detection module feedback; Detection module sends compulsory restore instruction and give operational module, thereby force users is carried out recovery operation according to collecting raid information; Otherwise system will keep shut, and concrete steps are following: