CN110018939A - A kind of hard disk on-line fault diagnosis method based on SCSI protocol - Google Patents
A kind of hard disk on-line fault diagnosis method based on SCSI protocol Download PDFInfo
- Publication number
- CN110018939A CN110018939A CN201910243012.XA CN201910243012A CN110018939A CN 110018939 A CN110018939 A CN 110018939A CN 201910243012 A CN201910243012 A CN 201910243012A CN 110018939 A CN110018939 A CN 110018939A
- Authority
- CN
- China
- Prior art keywords
- lun
- test
- hard disk
- subregion
- command word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/22—Detection or location of defective computer hardware by testing during standby operation or during idle time, e.g. start-up testing
- G06F11/26—Functional testing
- G06F11/263—Generation of test inputs, e.g. test vectors, patterns or sequences ; with adaptation of the tested hardware for testability with external testers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computer Hardware Design (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Debugging And Monitoring (AREA)
Abstract
A kind of hard disk on-line fault diagnosis method based on SCSI protocol, when creating Test LUN, client Initiator is sent by SCSI protocol to server end Target Server, the creation control command word of Test LUN, generate two LUN subregions of Data LUN and Test LUN, and Test LUN is hidden by hiding command word after the completion of creation, hide after usually normal use when Test LUN subregion it is invisible to user, wherein, the Test LUN for on-line checking subregion storage performance when something goes wrong, the subregion is carried out visible, and test its readwrite performance.The present invention realizes hard disk on-line checking using SCSI protocol, can be realized under hard disk in place not offline situation, guarantees in storage server continuous service, while also can quickly position the reason of investigation causes storage server performance to decline.
Description
Technical field
The hard disk on-line fault diagnosis method based on SCSI protocol that the present invention relates to a kind of, belongs to the skill of on-line fault diagnosis
Art field.
Background technique
With the arriving in the epoch of modern information technologies and big data, user is quick-fried to memory capacity and storage service demand
Fried property increases.Major businesses and institutions establish the data center of oneself one after another, and along with the exponential increase of data volume, in data
The expansion of heart scale, the hard disk quantity stored in equipment is also increasing, and hard disk and server be in lasting operation, data
Readwrite performance necessarily will appear some problems.
And the readwrite performance of storage server when something goes wrong, can not quickly check is asking due to server itself
The performance decline of storage server caused by topic or the failure of hard disk.Since the hard disk quantity in storage server is much general
At more than ten to tens.And traditional diagnostic method needs to shut down to storage server, more renews hard disk confirmation first
It whether is performance decline caused by the decline of readwrite performance caused by hard disk reason or server itself.
Therefore a kind of efficient hard disk inline diagnosis mode is constructed, it can be quickly fixed in the case where system not continuous service
The reason of position is server itself or some hard disk failure cause readwrite performance to decline, and are to be badly in need of solution in large-scale data center
Certainly the problem of.
Summary of the invention
In view of the problems of the existing technology, the present invention provides a kind of hard disk on-line fault diagnosis side based on SCSI protocol
Method.The present invention be directed to the command control words of SCSI protocol and hard disk controller to be designed, and mainly be ordered using on-line testing
It enables control word be written and read test to the hidden partition on single hard disk, realizes in place and do not influence legacy data in hard disk
In the case of carry out function and performance detection to carrying out hard disk.
The frame of detection method of the present invention is realized, as shown in Figure 1, client is known as Initiator (usually as OS
One subsystem), it initiates to request to server end Target Server, server is known as TargetServer and (sets comprising storage
Scsi controller in standby, such as hard disk drive, disk array), the request of Initiator initiation is received and processed, and will place
Reason result returns to Initiator.
Technical scheme is as follows:
A kind of hard disk on-line fault diagnosis method based on SCSI protocol, which is characterized in that including Test LUN subregion and
Execute hard disk on-line checking:
When creating Test LUN, client Initiator is sent out by SCSI protocol to server end TargetServer
It send, the creation control command word of Test LUN, generates two LUN subregions of Data LUN and Test LUN, and after the completion of creation
Test LUN hidden by hiding command word, after hiding usually normal use when Test LUN subregion it is invisible to user,
In,
The Data LUN subregion stores LUN for data, is supplied to user and carrys out storing data;
The Test LUN for on-line checking subregion storage performance when something goes wrong, to the subregion carry out as it can be seen that simultaneously
Test its readwrite performance.
It is preferred according to the present invention, the step are as follows:
It is sent by SCSI protocol, sends LUN and create command word, two LUN of Data LUN and Test LUN are respectively created
Subregion;
Hiding command word is sent to hide Test LUN;
When client uses storage server equipment, discovery readwrite performance declines, client, which sends the visible command word of LUN, to be made
Test LUN is visible;
LUN command word is sent, Test LUN is tested, is hard disk failure or clothes according to test result judgement
Business device failure itself.Take countermeasure.
The technical advantages of the present invention are that:
The present invention realizes hard disk on-line checking using SCSI protocol, can be realized under hard disk in place not offline situation, protects
It demonstrate,proves in storage server continuous service, while also can quickly position the reason of investigation causes storage server performance to decline,
In the case where not influencing normal use, the efficiency of positioning problems is improved, there is very high actual use value.
Detailed description of the invention
Fig. 1 is the overall framework figure of Initiator client and the communication of Target Server server end;
Fig. 2 is hard disk on-line fault diagnosis flow chart of the present invention.
Specific embodiment
It is described in detail below with reference to embodiment and Figure of description, but not limited to this.
Embodiment,
A kind of hard disk on-line fault diagnosis method based on SCSI protocol, including Test LUN subregion and execute hard disk it is online
Detection:
When creating Test LUN, client Initiator is sent out by SCSI protocol to server end TargetServer
It send, the creation control command word of Test LUN, generates two LUN subregions of Data LUN and Test LUN, and after the completion of creation
Test LUN hidden by hiding command word, after hiding usually normal use when Test LUN subregion it is invisible to user,
In,
The Data LUN subregion stores LUN for data, is supplied to user and carrys out storing data;
The Test LUN for on-line checking subregion storage performance when something goes wrong, to the subregion carry out as it can be seen that simultaneously
Test its readwrite performance.
Specific steps are as follows:
Sent by SCSI protocol, client Initiator send LUN create command word, be respectively created DataLUN and
Two LUN subregions of Test LUN;
Client Initiator send hide command word Test LUN is hidden, lose client, prevent client to its into
Row maloperation;
When client uses storage server equipment, discovery readwrite performance declines, client, which sends the visible command word of LUN, to be made
Test LUN as it can be seen that i.e. send visible Test LUN command word, the Test LUN subregion on hard disk is visible;
LUN command word is sent, Test LUN is tested, is hard disk failure or clothes according to test result judgement
Business device failure itself:
If Test LUN performance equally declines, which goes wrong, and replacement hard disk is completed Data Migration and rebuild, if
Test LUN performance does not decline, then checks the decline of performance caused by storage server failure itself, and connection professional carries out
Maintenance.
Claims (2)
1. a kind of hard disk on-line fault diagnosis method based on SCSI protocol, which is characterized in that including Test LUN subregion and hold
Row hard disk on-line checking:
When creating Test LUN, client Initiator is sent by SCSI protocol to server end Target Server,
The creation control command word of Test LUN generates two LUN subregions of Data LUN and Test LUN, and passes through after the completion of creation
Hide command word Test LUN is hidden, hide after usually normal use when Test LUN subregion it is invisible to user, wherein
The Data LUN subregion stores LUN for data, is supplied to user and carrys out storing data;
The Test LUN for on-line checking subregion storage performance when something goes wrong, to the subregion carry out as it can be seen that and testing
Its readwrite performance.
2. a kind of hard disk on-line fault diagnosis method based on SCSI protocol according to claim 1, which is characterized in that institute
State diagnostic method step are as follows:
It is sent by SCSI protocol, sends LUN and create command word, two LUN subregions of Data LUN and Test LUN are respectively created;
Hiding command word is sent to hide Test LUN;
When client uses storage server equipment, discovery readwrite performance declines, client, which sends the visible command word of LUN, makes Test
LUN is visible;
LUN command word is sent, Test LUN is tested, is hard disk failure or server according to test result judgement
Failure itself, takes countermeasure.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910243012.XA CN110018939A (en) | 2019-03-28 | 2019-03-28 | A kind of hard disk on-line fault diagnosis method based on SCSI protocol |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910243012.XA CN110018939A (en) | 2019-03-28 | 2019-03-28 | A kind of hard disk on-line fault diagnosis method based on SCSI protocol |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110018939A true CN110018939A (en) | 2019-07-16 |
Family
ID=67190115
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910243012.XA Pending CN110018939A (en) | 2019-03-28 | 2019-03-28 | A kind of hard disk on-line fault diagnosis method based on SCSI protocol |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110018939A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111048138A (en) * | 2019-12-22 | 2020-04-21 | 北京浪潮数据技术有限公司 | Hard disk fault detection method and related device |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103561064A (en) * | 2013-10-22 | 2014-02-05 | 华为技术有限公司 | Method and device for LUN switching |
CN104317726A (en) * | 2014-11-19 | 2015-01-28 | 浪潮电子信息产业股份有限公司 | Testing method for storage IO performance |
US9740566B2 (en) * | 2015-07-31 | 2017-08-22 | Netapp, Inc. | Snapshot creation workflow |
CN108282347A (en) * | 2016-12-30 | 2018-07-13 | 航天信息股份有限公司 | A kind of server data online management method and system |
US10089037B1 (en) * | 2013-10-29 | 2018-10-02 | EMC IP Holding Company LLC | Block active/active access to data storage systems at different locations |
CN108664363A (en) * | 2018-05-17 | 2018-10-16 | 北京鲸鲨软件科技有限公司 | A kind of NAS LUN access control methods and device based on dual control |
-
2019
- 2019-03-28 CN CN201910243012.XA patent/CN110018939A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103561064A (en) * | 2013-10-22 | 2014-02-05 | 华为技术有限公司 | Method and device for LUN switching |
US10089037B1 (en) * | 2013-10-29 | 2018-10-02 | EMC IP Holding Company LLC | Block active/active access to data storage systems at different locations |
CN104317726A (en) * | 2014-11-19 | 2015-01-28 | 浪潮电子信息产业股份有限公司 | Testing method for storage IO performance |
US9740566B2 (en) * | 2015-07-31 | 2017-08-22 | Netapp, Inc. | Snapshot creation workflow |
CN108282347A (en) * | 2016-12-30 | 2018-07-13 | 航天信息股份有限公司 | A kind of server data online management method and system |
CN108664363A (en) * | 2018-05-17 | 2018-10-16 | 北京鲸鲨软件科技有限公司 | A kind of NAS LUN access control methods and device based on dual control |
Non-Patent Citations (2)
Title |
---|
网友: ""ISCSI Target&Lun 的访问控制调查"", 《HTTPS://BLOG.CSDN.NET/CSND_PAN/ARTICLE/DETAILS/79016430》 * |
网友: ""target and iSCSI Interfaces Guide"", 《HTTPS://WEB.ARCHIVE.ORG/WEB/20181219133029/HTTPS://WWW.KERNEL.ORG/DOC/HTML/LATEST/DRIVER-API/TARGET.HTML》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111048138A (en) * | 2019-12-22 | 2020-04-21 | 北京浪潮数据技术有限公司 | Hard disk fault detection method and related device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10936447B2 (en) | Resynchronizing to a first storage system after a failover to a second storage system mirroring the first storage system | |
JP4606455B2 (en) | Storage management device, storage management program, and storage system | |
US7805566B2 (en) | Replication in storage systems using a target port mimicking a host initiator port | |
US7870093B2 (en) | Storage subsystem | |
TWI414992B (en) | Method for remote asynchronous replication of volumes and apparatus therefor | |
JP4646574B2 (en) | Data processing system | |
JP5523468B2 (en) | Active-active failover for direct attached storage systems | |
CN107346210B (en) | Hard disk data erasing method, server and system | |
US8689044B2 (en) | SAS host controller cache tracking | |
US8707076B2 (en) | System and method for power management of storage resources | |
US20090265510A1 (en) | Systems and Methods for Distributing Hot Spare Disks In Storage Arrays | |
US7530000B2 (en) | Early detection of storage device degradation | |
JP4783076B2 (en) | Disk array device and control method thereof | |
CN110807064A (en) | Data recovery device in RAC distributed database cluster system | |
CN109783280A (en) | Shared memory systems and shared storage method | |
US9063854B1 (en) | Systems and methods for cluster raid data consistency | |
US10572188B2 (en) | Server-embedded distributed storage system | |
CN110018939A (en) | A kind of hard disk on-line fault diagnosis method based on SCSI protocol | |
US8996805B2 (en) | Shared cache module and method thereof | |
CN107273251A (en) | A kind of method of testing of the racks of Rack in a production environment JBOD storages | |
US20130227341A1 (en) | Sas host cache control | |
US6389559B1 (en) | Controller fail-over without device bring-up | |
CN103176745A (en) | Hard disc array takeover method of storage system with double controllers | |
US10203890B1 (en) | Multi-tier mechanism to achieve high availability in a multi-controller system | |
JP2007257667A (en) | Data processing system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190716 |
|
RJ01 | Rejection of invention patent application after publication |