CN106708646A - Hard disk abnormal condition automatic resetting method and device thereof - Google Patents

Hard disk abnormal condition automatic resetting method and device thereof Download PDF

Info

Publication number
CN106708646A
CN106708646A CN201611200471.2A CN201611200471A CN106708646A CN 106708646 A CN106708646 A CN 106708646A CN 201611200471 A CN201611200471 A CN 201611200471A CN 106708646 A CN106708646 A CN 106708646A
Authority
CN
China
Prior art keywords
hard disk
abnormal
reset
module
disk
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611200471.2A
Other languages
Chinese (zh)
Inventor
孙磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201611200471.2A priority Critical patent/CN106708646A/en
Publication of CN106708646A publication Critical patent/CN106708646A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/2205Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing using arrangements specific to the hardware being tested

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention belongs to the field of hard disk checking, and discloses a hard disk abnormal condition automatic resetting method. The method comprises the steps that a disk array system detected an abnormal hard disk; a case management system of a disk array cabinet is used for positioning the abnormal hard disk, the abnormal hard disks are reset, and the reset hard disks are verified. The invention further discloses a hard disk abnormal condition automatic resetting device. The device comprises a detecting module for detecting the abnormal hard disk, a trigger module for positioning the abnormal hard disk, a resetting module for resetting the abnormal hard disk, a verification module for verifying the reset hard disk. The hard disk abnormal condition automatic resetting method and device thereof can achieve the automatic resetting of the hard disk, make hung hard disks be uploaded in milliseconds, and reinstate the hard disk application in the system in time, and therefore the situation that the hard disk is mistakenly removed is ruled out, the system work stability is effectively guaranteed, and the hard disk application risks are lowered.

Description

Hard disk exception auto-reset method and its device
Technical field
The invention belongs to hard disk performance detection field, more particularly to a kind of hard disk exception auto-reset method and its device.
Background technology
Hard disk is used as one of main storage core component of computer, and its reliability is most important;But current system sets In meter, the attention rate on hard disk is concentrated mainly on software aspects, such as hard disk modification, raid, data clone etc., and for Hard disk integrity problem attention rate in itself is not high.
Numerical monitor according to statistics, in the hard disk changed by storage producer, the hard disk for having 60% belongs to without exception, can make Hard disk, only in system because certain reason hard disk once is rammed without response, makes hard disk be in " seemingly-dead " state, leads Cause system takes for hard disk corruptions and rejects hard disk, make hard disk be forced it is offline be retracted into storage producer, but at this time hard disk is in fact Do not damage, can be used, as long as once being plugged operation to hard disk again, hard disk can reach the standard grade again again;Hard disk is " false Extremely " failure belongs to not timing phenomenon, does not have systematicness, occurs once within sometimes one day, sometimes occurs without for a long time once, shadow The normal operation of acoustic system, increases maintenance cost, therefore, for the art member, how in system operation The middle seemingly-dead failure for solving hard disk is the technical problem of urgent need to resolve.
The content of the invention
The present invention provides a kind of hard disk exception auto-reset method and its device, when hard disk is without response, can be automatically right Hard disk is resetted so that hard disk is reached the standard grade again, recovers the use of hard disk in systems in time, so as to avoid hard disk from being picked by mistake Remove.
To achieve these goals, the present invention uses following technical scheme:
A kind of hard disk exception auto-reset method, comprises the following steps:
Disc array system detects abnormal hard disk;
Abnormal hard disk is oriented by the shelf management system of magnetic disk array cabinet;
Abnormal hard disk is resetted;
Hard disk after checking reset.
Preferably, disc array system detects abnormal hard disk, including:Enter when disc array system is to disk read-write data Row verification, when verification generation is without response or mistake, then judges that the hard disk is abnormal hard disk.
Preferably, abnormal hard disk is resetted, including:The shelf management system of magnetic disk array cabinet sends a signal to control Device, controller control logic circuit power-off after the Preset Time of interval, restores electricity, and return to operation signal by bus again To computer management system.
Preferably, it is described to be verified as being written and read test to the hard disk after reset.
Preferably, after the hard disk after checking resets, also include:Hard disk after reset is then reached the standard grade by validation test;It is no Then, it is offline.
A kind of hard disk exception automatic reset device, including:
Detection module, for detecting abnormal hard disk;
Trigger module, for being positioned to abnormal hard disk;
Reseting module, for being resetted to abnormal hard disk;
Authentication module, for verifying the hard disk after resetting.
Preferably, also include:Processing module, hard disk reaches the standard grade or offline after being verified for treatment.
Beneficial effects of the present invention:
Whether verification of the present invention when reading and writing judges hard disk in abnormal, if hard disk exception, then by triggering, reset behaviour Make, the hard disk after reset is verified again after reset, if being proved to be successful, judge that hard disk is torpor, in systems Hard disk is reached the standard grade again, realization automatically resets to hard disk, hard disk is reached the standard grade again with the Millisecond time, in time in system It is middle recover hard disk use, and hard disk need not be plugged can recover hard disk, it is to avoid hard disk is rejected by mistake, effectively guarantor The stability of card system work, reduces the risk of hard disk applications, improves the service life of hard disk.
Existing technology is that hard disk is rammed the short time without response because there are abnormal conditions, causes system to take for hard disk Damage and reject hard disk, the hard disk being removed in fact is probably, for " seemingly-dead " state, to cause hard disk misjudged, is then carried out down Line treatment, increased the maintenance cost of hard disk, and the present invention is by triggering, reset and verifies the asking without response for the hard disk short time Topic is screened to determine whether hard disk can be continuing with, and prevents hard disk by system erroneous judgement, can effectively save 60% HD management expense.
Brief description of the drawings
Fig. 1 is one of schematic flow sheet of hard disk exception auto-reset method of the present invention;
Fig. 2 is the two of the schematic flow sheet of hard disk exception auto-reset method of the present invention;
Fig. 3 is the structural representation of hard disk exception automatic reset device of the present invention.
Specific embodiment
In order to make it easy to understand, the part noun to occurring in the present invention makees explanation explained below:
Disk array:English full name is Redundant Arrays of Independent Disks, hereinafter referred to as RAID, is By multiple disk combinations into a disk group, the whole disk system of addition effect promoting produced by data is provided using indivedual disks Efficiency.
SES management systems:SES is the abbreviation of SCSI Enclosure Service, is the use that T10 technical committees formulate In the standard of shelf management, mainly technology, the exploitation and formulation of standard such as responsible SSA/SCSI/SAS, hard disk array cabinet is all designed Bus loop come the order in be allowed various status datas and transmission SES specifications, in SES specifications when sending Scsi command parcel arrives I2Transmitted in C buses, be transferred to afterwards in the controller of hard disk array cabinet.
CPLD:English full name is Complex Programmable Logic Device, below Abbreviation CPLD, is the device that logic behavior is constituted with product term frame mode, by FPGA macroelement(Macro Cell) Programmable interconnection matrix unit around center is constituted.
I2C buses:It is a two-way continuous bus of two lines, there is provided integrated circuit(integrated circuit)Between Communication line.
Operating system:English full name is Operating System, hereinafter referred to as OS, is that management and control computer are hard The computer program of part and software resource.
Script:English full name is script, is the extension of autoexec, is the program that a kind of plain text is preserved;Script Can temporarily be called and be performed by application program.
With reference to the accompanying drawings and examples, specific embodiment of the invention is described in further detail:
Embodiment one
As shown in figure 1, a kind of hard disk exception auto-reset method, comprises the following steps:
Step S101:Using RAID as inventive disk array system, verified during system RAID read-write data, worked as verification Occur without response or during mistake, then to judge that the hard disk occurs abnormal.
Step S102:Using SES management systems as the shelf management system of magnetic disk array cabinet of the invention, hard disk is found After exception, abnormal hard disk is oriented by SES management systems.
Step S103:Using CPLD as controller of the invention, SES management systems are sent signal by managing bus To CPLD, the power-off of CPLD control logics circuit after the Preset Time of interval, restores electricity again, and returns to behaviour by managing bus Make signal and give SES management systems.
Above-mentioned management bus is I2C buses.
Used as a kind of embodiment, interval Preset Time is the Millisecond time, and the Millisecond time could be arranged to 10ms.
Step S104:Shell script under SES management systems triggering OS, completes readwrite tests.
Step S105:By the hard disk of readwrite tests, then reach the standard grade, go to step S101.
Step S106:It is not by the hard disk of readwrite tests, then offline.
Embodiment two
As shown in Fig. 2 a kind of hard disk exception auto-reset method, comprises the following steps:
Step S201:Disc array system detects abnormal hard disk.
Step S202:Abnormal hard disk is oriented by the shelf management system of magnetic disk array cabinet.
Step S203:Abnormal hard disk is resetted.
Step S204:Hard disk after checking reset.
Embodiment three
As shown in figure 3, a kind of hard disk exception automatic reset device, including detection module 301, trigger module 302, reseting module 303rd, authentication module 304 and processing module 305, the detection module 301 are linked in sequence trigger module 302, reseting module successively 303rd, authentication module 304 and processing module 305.
Detection module 301, for detecting abnormal hard disk;Trigger module 302, for being positioned to abnormal hard disk;Reset Module 303, for being resetted to abnormal hard disk;Authentication module 304, for verifying the hard disk after resetting;Processing module 305, Hard disk reaches the standard grade or offline after being verified for treatment.
Illustrated above is only the preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, under the premise without departing from the principles of the invention, some improvements and modifications can also be made, these improvements and modifications also should It is considered as protection scope of the present invention.

Claims (7)

1. a kind of hard disk exception auto-reset method, it is characterised in that comprise the following steps:
Disc array system detects abnormal hard disk;
Abnormal hard disk is oriented by the shelf management system of magnetic disk array cabinet;
Abnormal hard disk is resetted;
Hard disk after checking reset.
2. hard disk according to claim 1 exception auto-reset method, it is characterised in that disc array system detects different Normal hard disk, including:Verified when disc array system is to disk read-write data, when verification generation is without response or mistake, then Judge that the hard disk is abnormal hard disk.
3. hard disk according to claim 1 exception auto-reset method, it is characterised in that resetted to abnormal hard disk, Including:The shelf management system of magnetic disk array cabinet sends a signal to controller, and controller control logic circuit power-off, interval is default After time, restore electricity again, and operation signal is returned to computer management system by bus.
4. hard disk according to claim 1 exception auto-reset method, it is characterised in that described to be verified as to after reset Hard disk is written and read test.
5. the abnormal auto-reset method of hard disk according to claim 1 or 4, it is characterised in that the hard disk after checking reset Afterwards, also include:Hard disk after reset is then reached the standard grade by validation test;Otherwise, it is offline.
6. a kind of hard disk exception automatic reset device, it is characterised in that including:
Detection module, for detecting abnormal hard disk;
Trigger module, for being positioned to abnormal hard disk;
Reseting module, for being resetted to abnormal hard disk;
Authentication module, for verifying the hard disk after resetting.
7. hard disk according to claim 6 exception automatic reset device, it is characterised in that also include:Processing module, is used for Hard disk reaches the standard grade or offline after treatment checking.
CN201611200471.2A 2016-12-22 2016-12-22 Hard disk abnormal condition automatic resetting method and device thereof Pending CN106708646A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611200471.2A CN106708646A (en) 2016-12-22 2016-12-22 Hard disk abnormal condition automatic resetting method and device thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611200471.2A CN106708646A (en) 2016-12-22 2016-12-22 Hard disk abnormal condition automatic resetting method and device thereof

Publications (1)

Publication Number Publication Date
CN106708646A true CN106708646A (en) 2017-05-24

Family

ID=58903019

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611200471.2A Pending CN106708646A (en) 2016-12-22 2016-12-22 Hard disk abnormal condition automatic resetting method and device thereof

Country Status (1)

Country Link
CN (1) CN106708646A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108287770A (en) * 2018-03-01 2018-07-17 联想(北京)有限公司 Electronic equipment, information processing method and readable storage medium storing program for executing
CN109284207A (en) * 2018-08-30 2019-01-29 紫光华山信息技术有限公司 Hard disc failure processing method, device, server and computer-readable medium
CN109376029A (en) * 2018-09-27 2019-02-22 郑州云海信息技术有限公司 A kind of processing method and processing system that SCSI hard disk is extremely overtime
CN109710323A (en) * 2018-12-28 2019-05-03 联想(北京)有限公司 A kind of control method and electronic equipment
CN110457278A (en) * 2018-05-07 2019-11-15 百度在线网络技术(北京)有限公司 A kind of document copying method, device, equipment and storage medium
CN113110958A (en) * 2021-03-30 2021-07-13 宁波三星医疗电气股份有限公司 Intelligent power terminal based system file verification method
CN113868009A (en) * 2021-10-20 2021-12-31 南昌逸勤科技有限公司 Automatic repairing method, equipment and storage medium of SAS expander

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101276302A (en) * 2007-03-29 2008-10-01 中国科学院计算技术研究所 Magnetic disc fault processing and data restructuring method in magnetic disc array system
CN102819480A (en) * 2011-06-08 2012-12-12 联想(北京)有限公司 Computer and method for monitoring memory thereof
CN105119767A (en) * 2015-06-29 2015-12-02 北京宇航时代科技发展有限公司 Data self-check and self-cleaning software operation state monitoring method and system
CN105808161A (en) * 2016-02-26 2016-07-27 四川效率源信息安全技术股份有限公司 Reading method of bad sector data of hard disk

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101276302A (en) * 2007-03-29 2008-10-01 中国科学院计算技术研究所 Magnetic disc fault processing and data restructuring method in magnetic disc array system
CN102819480A (en) * 2011-06-08 2012-12-12 联想(北京)有限公司 Computer and method for monitoring memory thereof
CN105119767A (en) * 2015-06-29 2015-12-02 北京宇航时代科技发展有限公司 Data self-check and self-cleaning software operation state monitoring method and system
CN105808161A (en) * 2016-02-26 2016-07-27 四川效率源信息安全技术股份有限公司 Reading method of bad sector data of hard disk

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108287770A (en) * 2018-03-01 2018-07-17 联想(北京)有限公司 Electronic equipment, information processing method and readable storage medium storing program for executing
CN108287770B (en) * 2018-03-01 2020-12-18 联想(北京)有限公司 Electronic device, information processing method, and readable storage medium
CN110457278A (en) * 2018-05-07 2019-11-15 百度在线网络技术(北京)有限公司 A kind of document copying method, device, equipment and storage medium
CN109284207A (en) * 2018-08-30 2019-01-29 紫光华山信息技术有限公司 Hard disc failure processing method, device, server and computer-readable medium
CN109376029A (en) * 2018-09-27 2019-02-22 郑州云海信息技术有限公司 A kind of processing method and processing system that SCSI hard disk is extremely overtime
CN109376029B (en) * 2018-09-27 2021-11-19 郑州云海信息技术有限公司 Processing method and processing system for SCSI hard disk abnormal overtime
CN109710323A (en) * 2018-12-28 2019-05-03 联想(北京)有限公司 A kind of control method and electronic equipment
CN113110958A (en) * 2021-03-30 2021-07-13 宁波三星医疗电气股份有限公司 Intelligent power terminal based system file verification method
CN113868009A (en) * 2021-10-20 2021-12-31 南昌逸勤科技有限公司 Automatic repairing method, equipment and storage medium of SAS expander

Similar Documents

Publication Publication Date Title
CN106708646A (en) Hard disk abnormal condition automatic resetting method and device thereof
CN100504795C (en) Computer RAID array early-warning system and method
US6715101B2 (en) Redundant controller data storage system having an on-line controller removal system and method
CN100388217C (en) Dynamic threshold scaling method and system in communication system
US20020133740A1 (en) Redundant controller data storage system having system and method for handling controller resets
CN102279775B (en) Method for processing failure of hard disk under Linux system
EP2366148B1 (en) Apparatus and method for controlling a solid state disk ( ssd ) device
US7774646B2 (en) Surviving storage system takeover by replaying operations in an operations log mirror
US20200233823A1 (en) Connector, NVMe Storage Device, and Computer Device
CN101477480B (en) Memory control method, apparatus and memory read-write system
JP2004038290A (en) Information processing system and disk control method for use in same system
CN102135925B (en) Method and device for detecting error check and correcting memory
CN105259863B (en) A kind of PLC warm spares redundancy approach and system
CN112732477B (en) Method for fault isolation by out-of-band self-checking
CN1841547B (en) Method and apparatus for identifying failure module
CN109684141A (en) A kind of disk failure diagnostic method, device, terminal and readable storage medium storing program for executing
JP2011043957A (en) Fault monitoring circuit, semiconductor integrated circuit, and faulty part locating method
CN105760247A (en) System and method for processing hard disk faults
CN102915260B (en) The method that solid state hard disc is fault-tolerant and solid state hard disc thereof
CN103049345B (en) Based on Disk State transition detection method and the device of asynchronous mechanism
CN102819480A (en) Computer and method for monitoring memory thereof
CN108170375B (en) Overrun protection method and device in distributed storage system
CN102662787A (en) Method for protecting system disk RAID (redundant array of independent disks)
CN102520223B (en) Software anti-interference method used for electric energy meter
CN104636082A (en) Disk array RAID control method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170524

RJ01 Rejection of invention patent application after publication