CN110018939A - A kind of hard disk on-line fault diagnosis method based on SCSI protocol - Google Patents

A kind of hard disk on-line fault diagnosis method based on SCSI protocol Download PDF

Info

Publication number
CN110018939A
CN110018939A CN201910243012.XA CN201910243012A CN110018939A CN 110018939 A CN110018939 A CN 110018939A CN 201910243012 A CN201910243012 A CN 201910243012A CN 110018939 A CN110018939 A CN 110018939A
Authority
CN
China
Prior art keywords
lun
test
hard disk
subregion
command word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910243012.XA
Other languages
Chinese (zh)
Inventor
张武
吴登勇
李德国
李亚杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Chaoyue CNC Electronics Co Ltd
Original Assignee
Shandong Chaoyue CNC Electronics Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Chaoyue CNC Electronics Co Ltd filed Critical Shandong Chaoyue CNC Electronics Co Ltd
Priority to CN201910243012.XA priority Critical patent/CN110018939A/en
Publication of CN110018939A publication Critical patent/CN110018939A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/22Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
    • G06F11/26Functional testing
    • G06F11/263Generation of test inputs, e.g. test vectors, patterns or sequences ; with adaptation of the tested hardware for testability with external testers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

A kind of hard disk on-line fault diagnosis method based on SCSI protocol, when creating Test LUN, client Initiator is sent by SCSI protocol to server end Target Server, the creation control command word of Test LUN, generate two LUN subregions of Data LUN and Test LUN, and Test LUN is hidden by hiding command word after the completion of creation, hide after usually normal use when Test LUN subregion it is invisible to user, wherein, the Test LUN for on-line checking subregion storage performance when something goes wrong, the subregion is carried out visible, and test its readwrite performance.The present invention realizes hard disk on-line checking using SCSI protocol, can be realized under hard disk in place not offline situation, guarantees in storage server continuous service, while also can quickly position the reason of investigation causes storage server performance to decline.

Description

A kind of hard disk on-line fault diagnosis method based on SCSI protocol
Technical field
The hard disk on-line fault diagnosis method based on SCSI protocol that the present invention relates to a kind of, belongs to the skill of on-line fault diagnosis Art field.
Background technique
With the arriving in the epoch of modern information technologies and big data, user is quick-fried to memory capacity and storage service demand Fried property increases.Major businesses and institutions establish the data center of oneself one after another, and along with the exponential increase of data volume, in data The expansion of heart scale, the hard disk quantity stored in equipment is also increasing, and hard disk and server be in lasting operation, data Readwrite performance necessarily will appear some problems.
And the readwrite performance of storage server when something goes wrong, can not quickly check is asking due to server itself The performance decline of storage server caused by topic or the failure of hard disk.Since the hard disk quantity in storage server is much general At more than ten to tens.And traditional diagnostic method needs to shut down to storage server, more renews hard disk confirmation first It whether is performance decline caused by the decline of readwrite performance caused by hard disk reason or server itself.
Therefore a kind of efficient hard disk inline diagnosis mode is constructed, it can be quickly fixed in the case where system not continuous service The reason of position is server itself or some hard disk failure cause readwrite performance to decline, and are to be badly in need of solution in large-scale data center Certainly the problem of.
Summary of the invention
In view of the problems of the existing technology, the present invention provides a kind of hard disk on-line fault diagnosis side based on SCSI protocol Method.The present invention be directed to the command control words of SCSI protocol and hard disk controller to be designed, and mainly be ordered using on-line testing It enables control word be written and read test to the hidden partition on single hard disk, realizes in place and do not influence legacy data in hard disk In the case of carry out function and performance detection to carrying out hard disk.
The frame of detection method of the present invention is realized, as shown in Figure 1, client is known as Initiator (usually as OS One subsystem), it initiates to request to server end Target Server, server is known as TargetServer and (sets comprising storage Scsi controller in standby, such as hard disk drive, disk array), the request of Initiator initiation is received and processed, and will place Reason result returns to Initiator.
Technical scheme is as follows:
A kind of hard disk on-line fault diagnosis method based on SCSI protocol, which is characterized in that including Test LUN subregion and Execute hard disk on-line checking:
When creating Test LUN, client Initiator is sent out by SCSI protocol to server end TargetServer It send, the creation control command word of Test LUN, generates two LUN subregions of Data LUN and Test LUN, and after the completion of creation Test LUN hidden by hiding command word, after hiding usually normal use when Test LUN subregion it is invisible to user, In,
The Data LUN subregion stores LUN for data, is supplied to user and carrys out storing data;
The Test LUN for on-line checking subregion storage performance when something goes wrong, to the subregion carry out as it can be seen that simultaneously Test its readwrite performance.
It is preferred according to the present invention, the step are as follows:
It is sent by SCSI protocol, sends LUN and create command word, two LUN of Data LUN and Test LUN are respectively created Subregion;
Hiding command word is sent to hide Test LUN;
When client uses storage server equipment, discovery readwrite performance declines, client, which sends the visible command word of LUN, to be made Test LUN is visible;
LUN command word is sent, Test LUN is tested, is hard disk failure or clothes according to test result judgement Business device failure itself.Take countermeasure.
The technical advantages of the present invention are that:
The present invention realizes hard disk on-line checking using SCSI protocol, can be realized under hard disk in place not offline situation, protects It demonstrate,proves in storage server continuous service, while also can quickly position the reason of investigation causes storage server performance to decline, In the case where not influencing normal use, the efficiency of positioning problems is improved, there is very high actual use value.
Detailed description of the invention
Fig. 1 is the overall framework figure of Initiator client and the communication of Target Server server end;
Fig. 2 is hard disk on-line fault diagnosis flow chart of the present invention.
Specific embodiment
It is described in detail below with reference to embodiment and Figure of description, but not limited to this.
Embodiment,
A kind of hard disk on-line fault diagnosis method based on SCSI protocol, including Test LUN subregion and execute hard disk it is online Detection:
When creating Test LUN, client Initiator is sent out by SCSI protocol to server end TargetServer It send, the creation control command word of Test LUN, generates two LUN subregions of Data LUN and Test LUN, and after the completion of creation Test LUN hidden by hiding command word, after hiding usually normal use when Test LUN subregion it is invisible to user, In,
The Data LUN subregion stores LUN for data, is supplied to user and carrys out storing data;
The Test LUN for on-line checking subregion storage performance when something goes wrong, to the subregion carry out as it can be seen that simultaneously Test its readwrite performance.
Specific steps are as follows:
Sent by SCSI protocol, client Initiator send LUN create command word, be respectively created DataLUN and Two LUN subregions of Test LUN;
Client Initiator send hide command word Test LUN is hidden, lose client, prevent client to its into Row maloperation;
When client uses storage server equipment, discovery readwrite performance declines, client, which sends the visible command word of LUN, to be made Test LUN as it can be seen that i.e. send visible Test LUN command word, the Test LUN subregion on hard disk is visible;
LUN command word is sent, Test LUN is tested, is hard disk failure or clothes according to test result judgement Business device failure itself:
If Test LUN performance equally declines, which goes wrong, and replacement hard disk is completed Data Migration and rebuild, if Test LUN performance does not decline, then checks the decline of performance caused by storage server failure itself, and connection professional carries out Maintenance.

Claims (2)

1. a kind of hard disk on-line fault diagnosis method based on SCSI protocol, which is characterized in that including Test LUN subregion and hold Row hard disk on-line checking:
When creating Test LUN, client Initiator is sent by SCSI protocol to server end Target Server, The creation control command word of Test LUN generates two LUN subregions of Data LUN and Test LUN, and passes through after the completion of creation Hide command word Test LUN is hidden, hide after usually normal use when Test LUN subregion it is invisible to user, wherein
The Data LUN subregion stores LUN for data, is supplied to user and carrys out storing data;
The Test LUN for on-line checking subregion storage performance when something goes wrong, to the subregion carry out as it can be seen that and testing Its readwrite performance.
2. a kind of hard disk on-line fault diagnosis method based on SCSI protocol according to claim 1, which is characterized in that institute State diagnostic method step are as follows:
It is sent by SCSI protocol, sends LUN and create command word, two LUN subregions of Data LUN and Test LUN are respectively created;
Hiding command word is sent to hide Test LUN;
When client uses storage server equipment, discovery readwrite performance declines, client, which sends the visible command word of LUN, makes Test LUN is visible;
LUN command word is sent, Test LUN is tested, is hard disk failure or server according to test result judgement Failure itself, takes countermeasure.
CN201910243012.XA 2019-03-28 2019-03-28 A kind of hard disk on-line fault diagnosis method based on SCSI protocol Pending CN110018939A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910243012.XA CN110018939A (en) 2019-03-28 2019-03-28 A kind of hard disk on-line fault diagnosis method based on SCSI protocol

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910243012.XA CN110018939A (en) 2019-03-28 2019-03-28 A kind of hard disk on-line fault diagnosis method based on SCSI protocol

Publications (1)

Publication Number Publication Date
CN110018939A true CN110018939A (en) 2019-07-16

Family

ID=67190115

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910243012.XA Pending CN110018939A (en) 2019-03-28 2019-03-28 A kind of hard disk on-line fault diagnosis method based on SCSI protocol

Country Status (1)

Country Link
CN (1) CN110018939A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111048138A (en) * 2019-12-22 2020-04-21 北京浪潮数据技术有限公司 Hard disk fault detection method and related device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103561064A (en) * 2013-10-22 2014-02-05 华为技术有限公司 Method and device for LUN switching
CN104317726A (en) * 2014-11-19 2015-01-28 浪潮电子信息产业股份有限公司 Testing method for storage IO performance
US9740566B2 (en) * 2015-07-31 2017-08-22 Netapp, Inc. Snapshot creation workflow
CN108282347A (en) * 2016-12-30 2018-07-13 航天信息股份有限公司 A kind of server data online management method and system
US10089037B1 (en) * 2013-10-29 2018-10-02 EMC IP Holding Company LLC Block active/active access to data storage systems at different locations
CN108664363A (en) * 2018-05-17 2018-10-16 北京鲸鲨软件科技有限公司 A kind of NAS LUN access control methods and device based on dual control

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103561064A (en) * 2013-10-22 2014-02-05 华为技术有限公司 Method and device for LUN switching
US10089037B1 (en) * 2013-10-29 2018-10-02 EMC IP Holding Company LLC Block active/active access to data storage systems at different locations
CN104317726A (en) * 2014-11-19 2015-01-28 浪潮电子信息产业股份有限公司 Testing method for storage IO performance
US9740566B2 (en) * 2015-07-31 2017-08-22 Netapp, Inc. Snapshot creation workflow
CN108282347A (en) * 2016-12-30 2018-07-13 航天信息股份有限公司 A kind of server data online management method and system
CN108664363A (en) * 2018-05-17 2018-10-16 北京鲸鲨软件科技有限公司 A kind of NAS LUN access control methods and device based on dual control

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
网友: ""ISCSI Target&Lun 的访问控制调查"", 《HTTPS://BLOG.CSDN.NET/CSND_PAN/ARTICLE/DETAILS/79016430》 *
网友: ""target and iSCSI Interfaces Guide"", 《HTTPS://WEB.ARCHIVE.ORG/WEB/20181219133029/HTTPS://WWW.KERNEL.ORG/DOC/HTML/LATEST/DRIVER-API/TARGET.HTML》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111048138A (en) * 2019-12-22 2020-04-21 北京浪潮数据技术有限公司 Hard disk fault detection method and related device

Similar Documents

Publication Publication Date Title
US10936447B2 (en) Resynchronizing to a first storage system after a failover to a second storage system mirroring the first storage system
JP4606455B2 (en) Storage management device, storage management program, and storage system
US7805566B2 (en) Replication in storage systems using a target port mimicking a host initiator port
US7870093B2 (en) Storage subsystem
TWI414992B (en) Method for remote asynchronous replication of volumes and apparatus therefor
JP4646574B2 (en) Data processing system
JP5523468B2 (en) Active-active failover for direct attached storage systems
CN107346210B (en) Hard disk data erasing method, server and system
US8689044B2 (en) SAS host controller cache tracking
US8707076B2 (en) System and method for power management of storage resources
US20090265510A1 (en) Systems and Methods for Distributing Hot Spare Disks In Storage Arrays
US7530000B2 (en) Early detection of storage device degradation
JP4783076B2 (en) Disk array device and control method thereof
CN110807064A (en) Data recovery device in RAC distributed database cluster system
CN109783280A (en) Shared memory systems and shared storage method
US9063854B1 (en) Systems and methods for cluster raid data consistency
US10572188B2 (en) Server-embedded distributed storage system
CN110018939A (en) A kind of hard disk on-line fault diagnosis method based on SCSI protocol
US8996805B2 (en) Shared cache module and method thereof
CN107273251A (en) A kind of method of testing of the racks of Rack in a production environment JBOD storages
US20130227341A1 (en) Sas host cache control
US6389559B1 (en) Controller fail-over without device bring-up
CN103176745A (en) Hard disc array takeover method of storage system with double controllers
US10203890B1 (en) Multi-tier mechanism to achieve high availability in a multi-controller system
JP2007257667A (en) Data processing system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190716

RJ01 Rejection of invention patent application after publication