WO2018040115A1 - Determination of faulty state of storage device - Google Patents

Determination of faulty state of storage device Download PDF

Info

Publication number
WO2018040115A1
WO2018040115A1 PCT/CN2016/098142 CN2016098142W WO2018040115A1 WO 2018040115 A1 WO2018040115 A1 WO 2018040115A1 CN 2016098142 W CN2016098142 W CN 2016098142W WO 2018040115 A1 WO2018040115 A1 WO 2018040115A1
Authority
WO
WIPO (PCT)
Prior art keywords
time length
access
blocks
determining
electronic device
Prior art date
Application number
PCT/CN2016/098142
Other languages
French (fr)
Inventor
Yantao WANG
Jing Liu
Lijun Shen
Original Assignee
Telefonaktiebolaget Lm Ericsson (Publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget Lm Ericsson (Publ) filed Critical Telefonaktiebolaget Lm Ericsson (Publ)
Priority to EP16914680.0A priority Critical patent/EP3507695A4/en
Priority to US16/330,301 priority patent/US11977434B2/en
Priority to PCT/CN2016/098142 priority patent/WO2018040115A1/en
Publication of WO2018040115A1 publication Critical patent/WO2018040115A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • G06F11/0757Error or fault detection not based on redundancy by exceeding limits by exceeding a time limit, i.e. time-out, e.g. watchdogs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0727Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a storage system, e.g. in a DASD or network based storage system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0793Remedial or corrective actions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3034Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a storage system, e.g. DASD based or network based
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • G06F11/3419Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment by assessing time
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3452Performance evaluation by statistical analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/81Threshold
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C29/00Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
    • G11C29/04Detection or location of defective memory elements, e.g. cell constructio details, timing of test signals
    • G11C2029/0409Online test
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C29/00Checking stores for correct operation ; Subsequent repair; Testing stores during standby or offline operation
    • G11C29/52Protection of memory contents; Detection of errors in memory contents

Definitions

  • Embodiments of the present disclosure generally relate to the field of storage management, and in particular, to a method, an electronic device, and a computer program for determining a faulty state of a storage device.
  • S.M.A.R.T. self-monitoring, analysis and reporting technology
  • storage devices such as Advanced Technology Attachment (ATA) /Serial ATA (SATA) and Small Computer System Interface (SCSI) /Serial Attached SCSI (SAS) disks.
  • ATA Advanced Technology Attachment
  • SATA Serial ATA
  • SAS Small Computer System Interface
  • Such tools may provide early warning of an impending failure of a disk, for example.
  • Such tools are conventionally specific to individual vendors. For example, attributes of the tools are stored in a vendor-specific storage area. Furthermore, representatives of the attributes for one vendor are different from those for another vendor. Accordingly, the tools lack universality and openness and further lack reliable validity.
  • HDD Hard Disk Drive
  • SSD Solid State Disk
  • example embodiments of the present disclosure provide a method and electronic device for determining a faulty state of a storage device.
  • a method implemented in an electronic device is provided. According to the method, a first time length of a first access to a set of blocks of a storage device is determined. Then, the first time length of the access and a threshold time length are compared. If the first time length exceeds the threshold time length, it is determined that the blocks are in a potential faulty state.
  • a statistical time length of accesses to the blocks may be determined. Then, the threshold time length may be determined at least in part based on the statistical time length.
  • a second time length of a second access to the blocks may be determined.
  • the second access being prior to the first access.
  • the statistical time length may be determined based on the first and second time lengths.
  • the threshold time length may be adjusted based on a user input.
  • the access is of an access type. Further, the threshold time length for the access type is determined.
  • the first time length may be determined by monitoring start time and end time of the first access. In some embodiments, if it is determined that the blocks are in the potential faulty state, the blocks are isolated from the storage device.
  • an electronic device comprising: a first determining unit configured to determine a first time length of a first access to a set of blocks of a storage device; a comparing unit configured to compare the first time length of the access and a threshold time length; and a second determining unit configured to determine, in response to the first time length exceeding the threshold time length, determine that the blocks are in a potential faulty state.
  • an electronic device comprising at least one processor and a memory.
  • the memory contains instructions executable by the at least one processor, whereby the apparatus is operative to perform the method according to the first aspect.
  • a computer program and a computer program product that is tangibly stored on a computer readable storage medium.
  • Each of the computer program and the computer program product includes instructions which, when executed on at least one processor, cause the at least one processor to carry out the method according to the first aspect.
  • a set of blocks of a storage device is determined to be in a potential faulty state by comparing a time length of an access to the sets of the storage device and a threshold time length. In this way, the determination of the potential faulty state of the storage device may be more effective and efficient.
  • Fig. 1 is a block diagram of an example environment in which embodiments of the present disclosure can be implemented
  • Fig. 2 is a diagram illustrating time lengths of accesses to storage devices in good and faulty states in accordance with some embodiments of the present disclosure
  • Fig. 3 is a flowchart of a method in accordance with some embodiments of the present disclosure.
  • Fig. 4 is a flowchart of illustrates an example process of monitoring start time end time of an access according to some embodiments of the present disclosure.
  • Fig. 5 is a block diagram of an electronic device in accordance with some embodiments of the present disclosure.
  • the term “storage device” refers to a device for storing information.
  • the storage device include, but are not limited to, a hard disk (HDD/SSD) , a portable computer diskette, a random access memory (RAM) , a read-only memory (ROM) , an erasable programmable read-only memory (EPROM or Flash memory) , a portable compact disc read-only memory (CD-ROM) , an optical storage device, a magnetic storage device, and the like.
  • time length of an access or “access time length” refers to a time period during which data is input to or output from a storage device.
  • the access type may include a write access or a read access.
  • Fig. 1 shows an electronic device 110 in which embodiments of the present disclosure can be implemented.
  • the electronic device 110 include, but are not limited to, computing devices such as a desktop personal computer (PC) , tablet computer, laptop computer and a computer server, and communication devices capable of communications, such as a base station and a terminal device, and the like.
  • Other devices may also easily employ embodiments of the present invention, such as a personal digital assistant (PDA) , a pager, a mobile computer, a mobile TV, a gaming device, a laptop computer, a camera, a video camera, a GPS device, and the like.
  • PDA personal digital assistant
  • the electronic device 110 comprises a storage device 120 which is divided into a set of blocks (not shown) .
  • the storage device 120 can be accessed by applications executed on or remotely to the electronic device 110.
  • the application may send an access request, such as a write request and/or a read request, for the storage device 120.
  • an access request such as a write request and/or a read request
  • data can be written into or read from the storage device 120.
  • Fig. 1 the number of storage devices as shown in Fig. 1 are only for the purpose of illustration without suggesting any limitation of the present disclosure. It is also to be understood that although the storage device 120 is shown as being located within the electronic device 110, this is merely for the purpose of illustration. The storage device 120 can be located remotely and communicatively coupled to the electronic device 110.
  • the electronic device 110 also includes a processing unit 130 and a memory 140 coupled to the processing unit 130.
  • the processing unit 130 may be of any type suitable to the local technical environment, and may include one or more of general purpose computers, special purpose computers, microprocessors, digital signal processors (DSPs) and processors based on multicore processor architecture, as non-limiting examples.
  • the electronic device 110 may have multiple processors, such as an application specific integrated circuit chip that is slaved in time to a clock which synchronizes the main processor.
  • the memory 140 may be of any type suitable to the local technical environment and may be implemented using any suitable data storage technology, such as semiconductor based memory devices, magnetic memory devices and systems, optical memory devices and systems, fixed memory and rernovable memory, as non-limiting examples. While only one memory 140 is shown in the electronic device 110, there may be several physically distinct memory modules in the electronic device 110.
  • the storage device 120 As described above, if the conventional monitoring tools report the faulty state of the storage device 120, the storage device 120 generally has been disabled for a while. Such a delay report significantly degrades user experiences especially for the applications that require high read/write performances. Therefore, there is a need for real-time detection of the faulty state of the storage device 120.
  • the electronic device 110 includes a monitoring application 150 which, when executed by the processing unit 130, is at least partially loaded into the memory 140 and adapted to implement embodiments of the present disclosure.
  • the operations of the monitoring application 150 will be detailed in the following paragraphs.
  • Fig. 2 shows time lengths of accesses to storage devices in good and faulty states. These time lengths are detected for a set of HDD/SSD blocks in a disk. As shown, ifblocks are faulty, the time lengths of the accesses to the blocks may over 100 times longer than the time lengths of the accesses to the good blocks.
  • embodiments of the present disclosure provide determining a set of blocks of the storage device 120 to be in a potential faulty state by comparing a time length of an access to the blocks of the storage device 120 and a threshold time length.
  • a potential faulty state of a storage device refers to a state where the performance of the storage device is significantly degrading or deteriorating, especially, before the conventional tools report the faulty state of the storage device.
  • the determination of the potential faulty state of the storage device 120 in accordance with embodiments of the present disclosure is more effective and efficient.
  • Fig. 3 shows a flowchart of an example method 300 in accordance with some embodiments of the present disclosure.
  • the method 300 can be implemented, for example, by the monitoring application 150 as shown in Fig. 1.
  • the method 300 will be described with reference to Fig. 1.
  • a time length of an access to a set of blocks of the storage device 120 is determined.
  • the access is referred to as a “first access, ” and the time length is referred to as a “first time length. ”
  • the access may be of any suitable type. As described above, the access type may include the write access and/or the read access, for example.
  • the first time length of the first access may be determined in any suitable method.
  • the first time length may be determined by monitoring start time and end time of the first access. For example, during the procedure of the access, the start time and the end time of the access may be monitored. Then, a time difference between the start time and the end time may be calculated as the time length of the access. An example process of monitoring start time and end time of the access will be described below with reference to Fig. 4.
  • an application sends (405) a disk access request for a hardware storage device, which can be a SCSI device, for example.
  • a kernel of the operating system in the electronic device 110 starts (410) the access to the hardware storage device.
  • the kernel or core refers to a control center of the operating system.
  • the starting (410) of the access may be implemented via a system function in a disk drive within the kernel.
  • the start time is monitored (415) , for example, by the monitoring application 150.
  • the kernel In response to returning of the access results by the hardware storage device, the kernel performs (415) hard interrupt and further performs (420) soft interrupt to obtain the access results. Then, the kernel ends (430) the access and returns the access results to the application. Likewise, the hard interrupt (420) , the soft interrupt (425) , and the ending (430) of the access may also be implemented via system functions in the disk drive. In response to the ending (430) of the access, the end time is monitored (435) as shown. After the access is ended (430) , the time difference between the start time and the end time may be calculated as the time length of the access.
  • the access procedure as described above with reference to Fig. 4 is only for the purpose of illustration.
  • the access procedure may involve one or more other actions implemented by the kernel, which are shown by dashed boxes, and/or other actions implemented by other elements (not shown) .
  • the access procedure is known in the art, and more details thereof will be omitted.
  • the access to the hardware storage device by the user space application as shown in Fig. 4 is for the purpose of illustration without suggesting any limitation.
  • the start and end time of the accesses to the hardware storage device by other applications, such as system space applications, may also be monitored to determine the time lengths of these accesses.
  • the first time length of the access and a threshold time length are compared.
  • the threshold time length it is determined that the blocks of the storage device 120 are in a potential faulty state.
  • the time length of the access to the storage device becomes extremely long.
  • the potential fault of the storage device 120 can be determined.
  • the threshold time length may be determined in any suitable way. In some embodiments, the threshold time length may be determined based on statistics of a plurality of accesses. For example, as shown in Fig. 3, at block 320, a statistical time length of the accesses to the blocks of the storage device 120 may be detenained. At block 325, the threshold time length may be determined based on the statistical time length.
  • the accesses may contain an access (referred to as a “second access” ) prior to the first access.
  • a time length (referred to as a “second time length” ) of the second access to the blocks may be determined. Similar to the determination of the first time length of the first access at block 305, any suitable method may be used to implement the determination of the second time length.
  • the statistical time length may be determined based on the statistics of the first and second time lengths. These statistics may be performed according to any statistics methods either currently known or to be developed in the future.
  • the threshold time length that is determined based on the statistics may be further adjusted based on a user input. For example, users may adjust the determined threshold time length by taking a specific factor into account. By way of example, for the application requiring the higher read/write performance, the threshold time length may be further adjusted to be shorter. The adjustment may further improve the universality and openness of the detection the potential faulty state of the storage device.
  • the threshold time length may be determined for a specific access type. For example, two threshold time lengths may be determined respectively for the write access and the read access.
  • the potential faulty blocks are isolated from the storage device 120 at block 335.
  • the accessed blocks may also be located. Then, if it is determined that the time difference between the start and end time of the first access exceeds the threshold time length at block 310, the accessed blocks is determined to be in the potential faulty state at block 315, and then the accessed blocks are isolated from the storage device 120 at block 320. Any suitable isolation method that is either currently known or to be developed in the future will be used. In this way, life time of the storage device 120 may be extended.
  • Fig. 5 shows a block diagram of an electronic device 500 in accordance with some other embodiments of the present disclosure.
  • the electronic device 500 can be considered an example implementation of the electronic device 110 as shown in Fig. 1.
  • the electronic device 500 comprises a first determining unit 505, a comparing unit 510, and a second determining unit 515.
  • the first determining unit 505 is configured to determine a first time length of a first access to a set of blocks of a storage device.
  • the comparing unit 510 is configured to compare the first time length of the access and a threshold time length.
  • the second determining unit 515 is configured to determine, in response to the first time length exceeding the threshold time length, determine that the blocks are in a potential faulty state.
  • the electronic device 500 may further comprise: a third determining unit configured to determine a statistical time length of accesses to the blocks; and a fourth determining unit configured to determine the threshold time length at least in part based on the statistical time length.
  • the third determining unit may comprise: a fifth determining unit configured to determine a second time length of a second access to the blocks, the second access being prior to the first access; and a sixth determining unit configured to determine the statistical time length based on the first and second time lengths.
  • the fourth determining unit may comprise: an adjusting unit configured to adjust the threshold time length based on a user input.
  • the access may be of an access type.
  • the electronic device 500 may further comprise: a seventh determining unit configured to determine the threshold time length for the access type.
  • the first determining unit 505 may comprise an eighth determining unit.
  • the eighth determining unit is configured to determine the first time length by monitoring start time and end time of the first access.
  • the electronic device 500 may further comprise an isolating unit.
  • the isolating unit is configured to, in response to determining that the blocks are in the potential faulty state, isolating the blocks from the storage device.
  • units included in the electronic device 500 correspond to the blocks of the method 300. Therefore, all operations and features described above with reference to Figs. 1 to 3 are likewise applicable to the units included in the electronic device 500 and have similar effects. For the purpose of simplification, the details will be omitted.
  • the units included in the electronic device 500 may be implemented in various manners, including software, hardware, firmware, or any combination thereof.
  • one or more units may be implemented using software and/or firmware, for example, machine-executable instructions stored on the storage medium,
  • parts or all of the units in the electronic device 500 may be implemented, at least in part, by one or more hardware logic components.
  • FPGAs Field-programmable Gate Arrays
  • ASICs Application-specific Integrated Circuits
  • ASSPs Application-specific Standard Products
  • SOCs System-on-a-chip systems
  • CPLDs Complex Programmable Logic Devices
  • various embodiments of the present disclosure may be implemented in hardware or special purpose circuits, software, logic or any combination thereof. Some aspects may be implemented in hardware, while other aspects may be implemented in firmware or software which may be executed by a controller, microprocessor or other computing device. While various aspects of embodiments of the present disclosure are illustrated and described as block diagrams, flowcharts, or using some other pictorial representation, it will be appreciated that the blocks, apparatus, systems, techniques or methods described herein may be implemented in, as non-limiting examples, hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controller or other computing devices, or some combination thereof.
  • embodiments of the present disclosure can be described in the general context of machine-executable instructions, such as those included in program modules, being executed in a device on a target real or virtual processor.
  • program modules include routines, programs, libraries, objects, classes, components, data structures, or the like that perform particular tasks or implement particular abstract data types.
  • the functionality of the program modules may be combined or split between program modules as desired in various embodiments.
  • Machine-executable instructions for program modules may be executed within a local or distributed device. In a distributed device, program modules may be located in both local and remote storage media.
  • Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program codes, when executed by the processor or controller, cause the functions/operations specified in the flowcharts and/or block diagrams to be implemented.
  • the program code may execute entirely on a machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
  • a machine readable medium may be any tangible medium that may contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
  • the machine readable medium may be a machine readable signal medium or a machine readable storage medium.
  • a machine readable medium may include but not limited to an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing.
  • machine readable storage medium More specific examples of the machine readable storage medium would include an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM) , a read-only memory (ROM) , an erasable programmable read-only memory (EPROM or Flash memory) , an optical fiber, a portable compact disc read-only memory (CD-ROM) , an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read-only memory
  • EPROM or Flash memory erasable programmable read-only memory
  • CD-ROM portable compact disc read-only memory
  • magnetic storage device or any suitable combination of the foregoing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • General Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Computer Hardware Design (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Debugging And Monitoring (AREA)

Abstract

Disclosed are a method and electronic device for determining a faulty state of a storage device. A first time length of a first access to a set of blocks of a storage device is determined. Then, the first time length of the access and a threshold time length are compared. If the first time length exceeds the threshold time length, it is determined that the blocks are in a potential faulty state.

Description

DETERMINATION OF FAULTY STATE OF STORAGE DEVICE TECHNICAL FIELD
Embodiments of the present disclosure generally relate to the field of storage management, and in particular, to a method, an electronic device, and a computer program for determining a faulty state of a storage device.
BACKGROUND
It is desired to monitor failures of computer storage systems. To this end, some tools or solutions have been proposed. For example, self-monitoring, analysis and reporting technology (S.M.A.R.T. ) can be built into storage devices, such as Advanced Technology Attachment (ATA) /Serial ATA (SATA) and Small Computer System Interface (SCSI) /Serial Attached SCSI (SAS) disks. These tools may provide early warning of an impending failure of a disk, for example. Such tools are conventionally specific to individual vendors. For example, attributes of the tools are stored in a vendor-specific storage area. Furthermore, representatives of the attributes for one vendor are different from those for another vendor. Accordingly, the tools lack universality and openness and further lack reliable validity.
In addition, when a bad health state of a storage device, such as Hard Disk Drive (HDD) /Solid State Disk (SSD) , is reported by these tools, the storage device generally has already been in a faulty state for a period of time. This results in degraded user experiences, in particular, for some applications that require high read/write performances. Furthermore, these tools have to be persistently running in computer systems to monitor the state of the storage device. However, the persistent running of these tools may significantly increase loads of the systems.
SUMMARY
In general, example embodiments of the present disclosure provide a method and electronic device for determining a faulty state of a storage device.
In a first aspect, a method implemented in an electronic device is provided. According to the method, a first time length of a first access to a set of blocks of a storage device is determined. Then, the first time length of the access and a threshold time length  are compared. If the first time length exceeds the threshold time length, it is determined that the blocks are in a potential faulty state.
In some embodiments, a statistical time length of accesses to the blocks may be determined. Then, the threshold time length may be determined at least in part based on the statistical time length.
In some embodiments, a second time length of a second access to the blocks may be determined. The second access being prior to the first access. The statistical time length may be determined based on the first and second time lengths. In some embodiments, the threshold time length may be adjusted based on a user input.
In some embodiments, the access is of an access type. Further, the threshold time length for the access type is determined.
In some embodiments, the first time length may be determined by monitoring start time and end time of the first access. In some embodiments, if it is determined that the blocks are in the potential faulty state, the blocks are isolated from the storage device.
In a second aspect, an electronic device is provided. The electronic device comprises: a first determining unit configured to determine a first time length of a first access to a set of blocks of a storage device; a comparing unit configured to compare the first time length of the access and a threshold time length; and a second determining unit configured to determine, in response to the first time length exceeding the threshold time length, determine that the blocks are in a potential faulty state.
In a third aspect, there is provided an electronic device. The electronic device comprises at least one processor and a memory. The memory contains instructions executable by the at least one processor, whereby the apparatus is operative to perform the method according to the first aspect.
In a fourth aspect, there is provided a computer program and a computer program product that is tangibly stored on a computer readable storage medium. Each of the computer program and the computer program product includes instructions which, when executed on at least one processor, cause the at least one processor to carry out the method according to the first aspect.
Through the following description, it will be appreciated that according to embodiments of the present disclosure, a set of blocks of a storage device is determined to  be in a potential faulty state by comparing a time length of an access to the sets of the storage device and a threshold time length. In this way, the determination of the potential faulty state of the storage device may be more effective and efficient.
It is to be understood that the Summary section is not intended to identify key or essential features of embodiments of the present disclosure, nor is it intended to be used to limit the scope of the present disclosure. Other features of the present disclosure will become easily comprehensible through the following description.
BRIEF DESCRIPTION OF THE DRAWINGS
Through the more detailed description of some embodiments of the present disclosure with reference to the accompanying drawings, the above and other objects, features and advantages of the present disclosure will become more apparent, wherein:
Fig. 1 is a block diagram of an example environment in which embodiments of the present disclosure can be implemented;
Fig. 2 is a diagram illustrating time lengths of accesses to storage devices in good and faulty states in accordance with some embodiments of the present disclosure;
Fig. 3 is a flowchart of a method in accordance with some embodiments of the present disclosure;
Fig. 4 is a flowchart of illustrates an example process of monitoring start time end time of an access according to some embodiments of the present disclosure; and
Fig. 5 is a block diagram of an electronic device in accordance with some embodiments of the present disclosure.
Throughout the drawings, the same or similar reference numerals represent the same or similar element.
DETAILED DESCRIPTION
Principle of the present disclosure will now be described with reference to some example embodiments. It is to be understood that these embodiments are described only for the purpose of illustration and help those skilled in the art to understand and implement the present disclosure, without suggesting any limitations as to the scope of the disclosure. The disclosure described herein can be implemented in various manners other than the ones  described below.
In the following description and claims, unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs.
As used herein, the term “storage device” refers to a device for storing information. Examples of the storage device include, but are not limited to, a hard disk (HDD/SSD) , a portable computer diskette, a random access memory (RAM) , a read-only memory (ROM) , an erasable programmable read-only memory (EPROM or Flash memory) , a portable compact disc read-only memory (CD-ROM) , an optical storage device, a magnetic storage device, and the like.
As used herein, the term “time length of an access” or “access time length” refers to a time period during which data is input to or output from a storage device. The access type may include a write access or a read access.
As used herein, the singular forms “a” , “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise. The term “includes” and its variants are to be read as open terms that mean “includes, but is not limited to. ” The term “based on” is to be read as “based at least in part on. ” The term “one embodiment” and “an embodiment” are to be read as “at least one embodiment. ” The term “another embodiment” is to be read as “at least one other embodiment. ” Other definitions, explicit and implicit, may be included below.
Fig. 1 shows an electronic device 110 in which embodiments of the present disclosure can be implemented. Examples of the electronic device 110 include, but are not limited to, computing devices such as a desktop personal computer (PC) , tablet computer, laptop computer and a computer server, and communication devices capable of communications, such as a base station and a terminal device, and the like. Other devices may also easily employ embodiments of the present invention, such as a personal digital assistant (PDA) , a pager, a mobile computer, a mobile TV, a gaming device, a laptop computer, a camera, a video camera, a GPS device, and the like.
The electronic device 110 comprises a storage device 120 which is divided into a set of blocks (not shown) . The storage device 120 can be accessed by applications executed on or remotely to the electronic device 110. For example, the application may send an access request, such as a write request and/or a read request, for the storage device  120. In response to the access request, data can be written into or read from the storage device 120.
It is to be understood that the number of storage devices as shown in Fig. 1 are only for the purpose of illustration without suggesting any limitation of the present disclosure. It is also to be understood that although the storage device 120 is shown as being located within the electronic device 110, this is merely for the purpose of illustration. The storage device 120 can be located remotely and communicatively coupled to the electronic device 110.
The electronic device 110 also includes a processing unit 130 and a memory 140 coupled to the processing unit 130. The processing unit 130 may be of any type suitable to the local technical environment, and may include one or more of general purpose computers, special purpose computers, microprocessors, digital signal processors (DSPs) and processors based on multicore processor architecture, as non-limiting examples. The electronic device 110 may have multiple processors, such as an application specific integrated circuit chip that is slaved in time to a clock which synchronizes the main processor.
The memory 140 may be of any type suitable to the local technical environment and may be implemented using any suitable data storage technology, such as semiconductor based memory devices, magnetic memory devices and systems, optical memory devices and systems, fixed memory and rernovable memory, as non-limiting examples. While only one memory 140 is shown in the electronic device 110, there may be several physically distinct memory modules in the electronic device 110.
As described above, if the conventional monitoring tools report the faulty state of the storage device 120, the storage device 120 generally has been disabled for a while. Such a delay report significantly degrades user experiences especially for the applications that require high read/write performances. Therefore, there is a need for real-time detection of the faulty state of the storage device 120.
To this end, embodiments of the present disclosure propose a new solution for fault detection. For example, in some embodiments, According to embodiments of the present disclosure, the electronic device 110 includes a monitoring application 150 which, when executed by the processing unit 130, is at least partially loaded into the memory 140 and adapted to implement embodiments of the present disclosure. The operations of the  monitoring application 150 will be detailed in the following paragraphs.
The inventors found that when the performance of a storage device is significantly deteriorating, a time length of an access to the storage device becomes very long. Fig. 2 shows time lengths of accesses to storage devices in good and faulty states. These time lengths are detected for a set of HDD/SSD blocks in a disk. As shown, ifblocks are faulty, the time lengths of the accesses to the blocks may over 100 times longer than the time lengths of the accesses to the good blocks.
In this situation, embodiments of the present disclosure provide determining a set of blocks of the storage device 120 to be in a potential faulty state by comparing a time length of an access to the blocks of the storage device 120 and a threshold time length. In the context of the present disclosure, a potential faulty state of a storage device refers to a state where the performance of the storage device is significantly degrading or deteriorating, especially, before the conventional tools report the faulty state of the storage device. Compared with the conventional monitoring methods, the determination of the potential faulty state of the storage device 120 in accordance with embodiments of the present disclosure is more effective and efficient.
Principles and implementations of the present disclosure will be described in detail below with reference to Fig. 3 which shows a flowchart of an example method 300 in accordance with some embodiments of the present disclosure. The method 300 can be implemented, for example, by the monitoring application 150 as shown in Fig. 1. For the purpose of discussion, the method 300 will be described with reference to Fig. 1.
At block 305, a time length of an access to a set of blocks of the storage device 120 is determined. For the sake of discussion, the access is referred to as a “first access, ” and the time length is referred to as a “first time length. ” The access may be of any suitable type. As described above, the access type may include the write access and/or the read access, for example.
According to embodiments of the present disclosure, the first time length of the first access may be determined in any suitable method. In some embodiments, the first time length may be determined by monitoring start time and end time of the first access. For example, during the procedure of the access, the start time and the end time of the access may be monitored. Then, a time difference between the start time and the end time may be calculated as the time length of the access. An example process of monitoring  start time and end time of the access will be described below with reference to Fig. 4.
As shown in Fig. 4, an application sends (405) a disk access request for a hardware storage device, which can be a SCSI device, for example. In response to the disk access request, a kernel of the operating system in the electronic device 110 starts (410) the access to the hardware storage device. In the context of the present disclosure, the kernel or core refers to a control center of the operating system. The starting (410) of the access may be implemented via a system function in a disk drive within the kernel. In this example, as shown, in response to the starting (410) of the access, the start time is monitored (415) , for example, by the monitoring application 150.
In response to returning of the access results by the hardware storage device, the kernel performs (415) hard interrupt and further performs (420) soft interrupt to obtain the access results. Then, the kernel ends (430) the access and returns the access results to the application. Likewise, the hard interrupt (420) , the soft interrupt (425) , and the ending (430) of the access may also be implemented via system functions in the disk drive. In response to the ending (430) of the access, the end time is monitored (435) as shown. After the access is ended (430) , the time difference between the start time and the end time may be calculated as the time length of the access.
It is to be understood that the access procedure as described above with reference to Fig. 4 is only for the purpose of illustration. The access procedure may involve one or more other actions implemented by the kernel, which are shown by dashed boxes, and/or other actions implemented by other elements (not shown) . The access procedure is known in the art, and more details thereof will be omitted.
It is also to be understood that the access to the hardware storage device by the user space application as shown in Fig. 4 is for the purpose of illustration without suggesting any limitation. The start and end time of the accesses to the hardware storage device by other applications, such as system space applications, may also be monitored to determine the time lengths of these accesses.
Still in reference with Fig. 3, at block 310, the first time length of the access and a threshold time length are compared. At block 315, if the first time length exceeds the threshold time length, it is determined that the blocks of the storage device 120 are in a potential faulty state.
As described above, it is found that if the performance of the storage device is  significant degrading, the time length of the access to the storage device becomes extremely long. As a result, by comparing the first time length of the access and the threshold time length, the potential fault of the storage device 120 can be determined.
The threshold time length may be determined in any suitable way. In some embodiments, the threshold time length may be determined based on statistics of a plurality of accesses. For example, as shown in Fig. 3, at block 320, a statistical time length of the accesses to the blocks of the storage device 120 may be detenained. At block 325, the threshold time length may be determined based on the statistical time length.
In some embodiments, the accesses may contain an access (referred to as a “second access” ) prior to the first access. For example, a time length (referred to as a “second time length” ) of the second access to the blocks may be determined. Similar to the determination of the first time length of the first access at block 305, any suitable method may be used to implement the determination of the second time length. After the second time length is determined, the statistical time length may be determined based on the statistics of the first and second time lengths. These statistics may be performed according to any statistics methods either currently known or to be developed in the future.
In addition, in some embodiments, the threshold time length that is determined based on the statistics may be further adjusted based on a user input. For example, users may adjust the determined threshold time length by taking a specific factor into account. By way of example, for the application requiring the higher read/write performance, the threshold time length may be further adjusted to be shorter. The adjustment may further improve the universality and openness of the detection the potential faulty state of the storage device.
The inventors also found that the time lengths associated with different access types are different. For example, for HDD/SSD blocks, when the blocks are deteriorating, the time length of the write access to the blocks is longer than the time length of the read access to the blocks. Accordingly, in some embodiments, at block 330, the threshold time length may be determined for a specific access type. For example, two threshold time lengths may be determined respectively for the write access and the read access.
As shown in Fig. 3, in some embodiments, if it is determined at block 315 that the blocks of the storage device 120 are in the potential faulty state, the potential faulty blocks are isolated from the storage device 120 at block 335. For example, during the  determination of the first time length of the first access at block 305, in addition to the start and end time of the first access, the accessed blocks may also be located. Then, if it is determined that the time difference between the start and end time of the first access exceeds the threshold time length at block 310, the accessed blocks is determined to be in the potential faulty state at block 315, and then the accessed blocks are isolated from the storage device 120 at block 320. Any suitable isolation method that is either currently known or to be developed in the future will be used. In this way, life time of the storage device 120 may be extended.
Fig. 5 shows a block diagram of an electronic device 500 in accordance with some other embodiments of the present disclosure. The electronic device 500 can be considered an example implementation of the electronic device 110 as shown in Fig. 1.
As shown, the electronic device 500 comprises a first determining unit 505, a comparing unit 510, and a second determining unit 515. The first determining unit 505 is configured to determine a first time length of a first access to a set of blocks of a storage device. The comparing unit 510 is configured to compare the first time length of the access and a threshold time length. The second determining unit 515 is configured to determine, in response to the first time length exceeding the threshold time length, determine that the blocks are in a potential faulty state.
In some embodiments, the electronic device 500 may further comprise: a third determining unit configured to determine a statistical time length of accesses to the blocks; and a fourth determining unit configured to determine the threshold time length at least in part based on the statistical time length. In some embodiments, the third determining unit may comprise: a fifth determining unit configured to determine a second time length of a second access to the blocks, the second access being prior to the first access; and a sixth determining unit configured to determine the statistical time length based on the first and second time lengths. In some embodiments, the fourth determining unit may comprise: an adjusting unit configured to adjust the threshold time length based on a user input.
In some embodiments, the access may be of an access type. In these embodiments, the electronic device 500 may further comprise: a seventh determining unit configured to determine the threshold time length for the access type.
In some embodiments, the first determining unit 505 may comprise an eighth determining unit. The eighth determining unit is configured to determine the first time  length by monitoring start time and end time of the first access.
In some embodiments, the electronic device 500 may further comprise an isolating unit. The isolating unit is configured to, in response to determining that the blocks are in the potential faulty state, isolating the blocks from the storage device.
It should be appreciated that units included in the electronic device 500 correspond to the blocks of the method 300. Therefore, all operations and features described above with reference to Figs. 1 to 3 are likewise applicable to the units included in the electronic device 500 and have similar effects. For the purpose of simplification, the details will be omitted.
The units included in the electronic device 500 may be implemented in various manners, including software, hardware, firmware, or any combination thereof. In one embodiment, one or more units may be implemented using software and/or firmware, for example, machine-executable instructions stored on the storage medium, In addition to or instead of machine-executable instructions, parts or all of the units in the electronic device 500 may be implemented, at least in part, by one or more hardware logic components. For example, and without limitation, illustrative types of hardware logic components that can be used include Field-programmable Gate Arrays (FPGAs) , Application-specific Integrated Circuits (ASICs) , Application-specific Standard Products (ASSPs) , System-on-a-chip systems (SOCs) , Complex Programmable Logic Devices (CPLDs) , and the like.
Generally, various embodiments of the present disclosure may be implemented in hardware or special purpose circuits, software, logic or any combination thereof. Some aspects may be implemented in hardware, while other aspects may be implemented in firmware or software which may be executed by a controller, microprocessor or other computing device. While various aspects of embodiments of the present disclosure are illustrated and described as block diagrams, flowcharts, or using some other pictorial representation, it will be appreciated that the blocks, apparatus, systems, techniques or methods described herein may be implemented in, as non-limiting examples, hardware, software, firmware, special purpose circuits or logic, general purpose hardware or controller or other computing devices, or some combination thereof.
By way of example, embodiments of the present disclosure can be described in the general context of machine-executable instructions, such as those included in program modules, being executed in a device on a target real or virtual processor. Generally,  program modules include routines, programs, libraries, objects, classes, components, data structures, or the like that perform particular tasks or implement particular abstract data types. The functionality of the program modules may be combined or split between program modules as desired in various embodiments. Machine-executable instructions for program modules may be executed within a local or distributed device. In a distributed device, program modules may be located in both local and remote storage media.
Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program codes, when executed by the processor or controller, cause the functions/operations specified in the flowcharts and/or block diagrams to be implemented. The program code may execute entirely on a machine, partly on the machine, as a stand-alone software package, partly on the machine and partly on a remote machine or entirely on the remote machine or server.
In the context of this disclosure, a machine readable medium may be any tangible medium that may contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine readable medium may be a machine readable signal medium or a machine readable storage medium. A machine readable medium may include but not limited to an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of the machine readable storage medium would include an electrical connection having one or more wires, a portable computer diskette, a hard disk, a random access memory (RAM) , a read-only memory (ROM) , an erasable programmable read-only memory (EPROM or Flash memory) , an optical fiber, a portable compact disc read-only memory (CD-ROM) , an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
Further, while operations are depicted in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Likewise, while several specific implementation details are contained in the above discussions, these should not be construed as limitations on the scope of the present disclosure, but rather as descriptions of features that may be specific to particular  embodiments. Certain features that are described in the context of separate embodiments may also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment may also be implemented in multiple embodiments separately or in any suitable sub-combination.
Although the present disclosure has been described in language specific to structural features and/or functional actions, it is to be understood that the present disclosure defined in the appended claims is not necessarily limited to the specific features or actions described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims.

Claims (16)

  1. A method (300) implemented in an electronic device (110, 500) , comprising:
    determining (305) a first time length of a first access to a set of blocks of a storage device (120) ;
    comparing (310) the first time length of the access and a threshold time length; and
    in response to the first time length exceeding the threshold time length, determining (315) that the blocks are in a potential faulty state.
  2. The method (300) of claim 1, further comprising:
    determining (320) a statistical time length of accesses to the blocks; and
    determining (325) the threshold time length at least in part based on the statistical time length.
  3. The method (300) of claim 2, wherein determining (320) the statistical time length comprising:
    determining a second time length of a second access to the blocks, the second access being prior to the first access; and
    determining the statistical time length based on the first and second time lengths.
  4. The method (300) of claim 2, wherein determining (325) the threshold time length further comprises:
    adjusting the threshold time length based on a user input.
  5. The method (300) of claim 1, wherein the access is of an access type, and the method (300) further comprises:
    determining (330) the threshold time length for the access type.
  6. The method (300) of any of claims 1-5, wherein determining (305) the first time length comprises:
    determining the first time length by monitoring start time and end time of the first access.
  7. The method (300) of any of claims 1-5, further comprising:
    in response to determining (315) that the blocks are in the potential faulty state, isolating (335) the blocks from the storage device.
  8. An electronic device (110, 500) , comprising:
    a first determining unit (505) configured to determine a first time length of a first access to a set of blocks of a storage device;
    a comparing unit (510) configured to compare the first time length of the access and a threshold time length; and
    a second determining unit (515) configured to determine, in response to the first time length exceeding the threshold time length, determine that the blocks are in a potential faulty state.
  9. The electronic device (110, 500) of claim 8, further comprising:
    a third determining unit configured to determine a statistical time length of accesses to the blocks; and
    a fourth determining unit configured to determine the threshold time length at least in part based on the statistical time length.
  10. The electronic device (110, 500) of claim 9, wherein the third determining unit comprises:
    a fifth determining unit configured to determine a second time length of a second access to the blocks, the second access being prior to the first access; and
    a sixth determining unit configured to determine the statistical time length based on the first and second time lengths.
  11. The electronic device (110, 500) of claim 9, wherein the fourth determining unit comprises:
    an adjusting unit configured to adjust the threshold time length based on a user input.
  12. The electronic device (110, 500) of claim 8, wherein the access is of an access type, and the electronic device (110, 500) further comprises:
    a seventh determining unit configured to determine the threshold time length for the access type.
  13. The electronic device (110, 500) of any of claims 8-12, wherein the first determining unit (505) comprises:
    an eighth determining unit configured to determine the first time length by monitoring start time and end time of the first access.
  14. The electronic device (110, 500) of any of claims 8-12, further comprising:
    an isolating unit configured to, in response to determining that the blocks are in the potential faulty state, isolating the blocks from the storage device.
  15. An electronic device (110, 500) , comprising:
    a processing unit (130) and a memory (140) , the memory (140) containing instructions executable by the processing unit (130) whereby the electronic device (110, 500) is operative to perform the method (300) according to any of claims 1-7.
  16. A computer program product being tangibly stored on a computer readable storage medium and including instructions which, when executed on at least one processor, cause the at least one processor to carry out the method (300) according to any of claims 1-7.
PCT/CN2016/098142 2016-09-05 2016-09-05 Determination of faulty state of storage device WO2018040115A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP16914680.0A EP3507695A4 (en) 2016-09-05 2016-09-05 Determination of faulty state of storage device
US16/330,301 US11977434B2 (en) 2016-09-05 2016-09-05 Determination of faulty state of storage device
PCT/CN2016/098142 WO2018040115A1 (en) 2016-09-05 2016-09-05 Determination of faulty state of storage device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/098142 WO2018040115A1 (en) 2016-09-05 2016-09-05 Determination of faulty state of storage device

Publications (1)

Publication Number Publication Date
WO2018040115A1 true WO2018040115A1 (en) 2018-03-08

Family

ID=61299797

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/098142 WO2018040115A1 (en) 2016-09-05 2016-09-05 Determination of faulty state of storage device

Country Status (3)

Country Link
US (1) US11977434B2 (en)
EP (1) EP3507695A4 (en)
WO (1) WO2018040115A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112506420A (en) * 2019-09-16 2021-03-16 伊姆西Ip控股有限责任公司 Method, apparatus and article of manufacture for managing scrubbing operations in a storage system
US11182237B1 (en) * 2020-08-21 2021-11-23 Micron Technology, Inc. Selective sampling of a data unit based on program/erase execution time

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122574A (en) 1997-05-20 2000-09-19 Unisia Jecs Corporation Method for detecting abnormality of control unit and control unit employing same
US20060112232A1 (en) * 2004-08-09 2006-05-25 Xiv Ltd. Method and apparatus for managing a cache memory in a mass-storage system
CN101127233A (en) 2007-09-25 2008-02-20 Ut斯达康通讯有限公司 Hard disc error detection and fault-tolerant method in stream media uses
CN101894083A (en) * 2009-05-22 2010-11-24 瑞萨电子株式会社 Interrupt processing apparatus and method
US8467281B1 (en) 2010-09-17 2013-06-18 Emc Corporation Techniques for identifying devices having slow response times
CN104516786A (en) * 2013-09-27 2015-04-15 日本电气株式会社 Information processing device, fault avoidance method, and program storage medium
US20150131379A1 (en) * 2012-07-30 2015-05-14 Empire Technology Development Llc Bad block compensation for solid state storage devices
CN105589785A (en) * 2015-12-08 2016-05-18 中国银联股份有限公司 Device and method for monitoring IO (Input/Output) performance of storage equipment

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20020046627A (en) * 2000-12-15 2002-06-21 박종섭 Device and method for controling load mobile bts/bsc system
CN106909318B (en) * 2013-12-23 2020-05-08 华为技术有限公司 Solid state disk using method and device
US9595352B2 (en) * 2014-03-17 2017-03-14 Seagate Technology Llc Manufacturer self-test for solid-state drives
US11960348B2 (en) * 2016-09-07 2024-04-16 Pure Storage, Inc. Cloud-based monitoring of hardware components in a fleet of storage systems

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6122574A (en) 1997-05-20 2000-09-19 Unisia Jecs Corporation Method for detecting abnormality of control unit and control unit employing same
US20060112232A1 (en) * 2004-08-09 2006-05-25 Xiv Ltd. Method and apparatus for managing a cache memory in a mass-storage system
CN101127233A (en) 2007-09-25 2008-02-20 Ut斯达康通讯有限公司 Hard disc error detection and fault-tolerant method in stream media uses
CN101894083A (en) * 2009-05-22 2010-11-24 瑞萨电子株式会社 Interrupt processing apparatus and method
US8467281B1 (en) 2010-09-17 2013-06-18 Emc Corporation Techniques for identifying devices having slow response times
US20150131379A1 (en) * 2012-07-30 2015-05-14 Empire Technology Development Llc Bad block compensation for solid state storage devices
CN104516786A (en) * 2013-09-27 2015-04-15 日本电气株式会社 Information processing device, fault avoidance method, and program storage medium
CN105589785A (en) * 2015-12-08 2016-05-18 中国银联股份有限公司 Device and method for monitoring IO (Input/Output) performance of storage equipment

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3507695A4

Also Published As

Publication number Publication date
US20190205198A1 (en) 2019-07-04
EP3507695A1 (en) 2019-07-10
US11977434B2 (en) 2024-05-07
EP3507695A4 (en) 2020-05-06

Similar Documents

Publication Publication Date Title
US9898341B2 (en) Adjustable priority ratios for multiple task queues
US10223224B1 (en) Method and system for automatic disk failure isolation, diagnosis, and remediation
US12008406B1 (en) Predictive workload placement amongst storage systems
US9983812B1 (en) Automated node failure detection in an active/hot-standby storage cluster
US9619478B1 (en) Method and system for compressing logs
US20100023740A1 (en) Diagnostic utility and method for a data storage device
US20140201442A1 (en) Cache based storage controller
US20110153928A1 (en) Memory utilization tracking
KR20180092435A (en) Data storage device and operating method thereof
US20180137059A1 (en) Migrating buffer for direct memory access in a computer system
US20150006998A1 (en) Memory scrub management
US20180218766A1 (en) Method for low power operation and test using DRAM device
KR20190030428A (en) Method of estimating device life-time, method of designing device, and computer readable storage medium
US10339053B2 (en) Variable cache flushing
US20160357623A1 (en) Abnormality detection method and information processing apparatus
US10191668B1 (en) Method for dynamically modeling medium error evolution to predict disk failure
US11977434B2 (en) Determination of faulty state of storage device
US9891846B2 (en) System and method for preventing solid state drive corruption after dirty shutdown power loss
US9411695B2 (en) Provisioning memory in a memory system for mirroring
US20150286548A1 (en) Information processing device and method
US20160196064A1 (en) Storage control device, control method and storage system
US10474371B1 (en) Method and apparatus for SSD/flash device replacement policy
US10534683B2 (en) Communicating outstanding maintenance tasks to improve disk data integrity
US9390751B1 (en) Reducing overcounting of track-level damage caused by adjacent-track and far-track interference
US20130262727A1 (en) Race free interrupt

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16914680

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2016914680

Country of ref document: EP