US20170212814A1 - Accelerated data recovery in a storage system - Google Patents
Accelerated data recovery in a storage system Download PDFInfo
- Publication number
- US20170212814A1 US20170212814A1 US15/398,651 US201715398651A US2017212814A1 US 20170212814 A1 US20170212814 A1 US 20170212814A1 US 201715398651 A US201715398651 A US 201715398651A US 2017212814 A1 US2017212814 A1 US 2017212814A1
- Authority
- US
- United States
- Prior art keywords
- storage device
- lba
- lbas
- mapped
- control logic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/1658—Data re-synchronization of a redundant component, or initial sync of replacement, additional or spare unit
- G06F11/1662—Data re-synchronization of a redundant component, or initial sync of replacement, additional or spare unit the resynchronized component or unit being a persistent storage device
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/2053—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
- G06F11/2056—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
- G06F11/2082—Data synchronisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F13/00—Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F13/38—Information transfer, e.g. on bus
- G06F13/42—Bus transfer protocol, e.g. handshake; Synchronisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/16—Error detection or correction of the data by redundancy in hardware
- G06F11/20—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
- G06F11/2053—Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
- G06F11/2094—Redundant storage or storage space
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G06F2201/805—Real-time
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G06F2201/82—Solving problems relating to consistency
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2213/00—Indexing scheme relating to interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
- G06F2213/0036—Small computer system interface [SCSI]
Definitions
- the present disclosure relates to accelerated data recovery, in particular to, accelerated data recovery in a storage system.
- RAID (redundant array of independent (inexpensive) disks) systems include a plurality of storage devices configured to provide improved performance and/or fault tolerance relative to a single storage device.
- redundant storage systems e.g., RAID systems above level 0
- one or more copies of data and/or parity information may be stored on a plurality of storage devices.
- Data stored on a failed storage device may be recovered from the cop(ies) and/or related parity information and stored on a replacement storage device.
- Such recovery and storage operations are known as “rebuilding”.
- rebuilding a failed storage device includes writing an entire address space of the failed storage device from the cop(ies) to the replacement storage device.
- the time duration of rebuild operations for a failed storage device also increases.
- FIG. 1 illustrates a functional block diagram of a system consistent with various embodiments of the present disclosure
- FIG. 2 is a flowchart of accelerated data recovery operations according to various embodiments of the present disclosure
- FIG. 3 is a flowchart of storage device operations according to various embodiments of the present disclosure.
- FIGS. 4A and 4B illustrate one example of accelerated data recovery in a RAID (redundant array of independent (inexpensive) disks) level 1 system.
- the storage system may include a plurality of storage devices configured to provide fault tolerance.
- the methods (and systems) are configured query one or more other storage device(s) to identify mapped logical block addresses (LBAs) related to the failed storage device.
- LBAs mapped logical block addresses
- the methods and systems are further configured to provide an accelerated rebuild of a replacement storage device by reading and writing (i.e., copying) data associated with mapped LBA(s).
- a time duration associated with rebuilding the failed storage device may be reduced relative to copying an entire span (i.e., range of LBAs available for user data) of a storage device.
- Such rebuilds may be performed independent of existence and/or type of file system that may be resident on a host computing device coupled to the storage system. Such rebuilds may also be performed independent of partitioning of the storage system by, e.g., an operating system (OS).
- OS operating system
- an unmapped device LBA is a storage device LBA that has not been written to or has been subject to a trim command (TRIM) and has not been written to since the TRIM, as described herein.
- a mapped device LBA is a storage device LBA that has been written to and has not been subject to a TRIM since being written to. Mapped and unmapped, as applied to a device LBA, correspond to respective states of the device LBA. Mapping within a storage device is configured to relate a device LBA to a physical block address (PBA) associated with a physical storage medium included in the storage device. Mapping may also be used to relate a host LBA to a device LBA.
- PBA physical block address
- FIG. 1 illustrates a functional block diagram of a system 100 consistent with various embodiments of the present disclosure.
- System 100 includes a computing device 102 and a storage system 104 .
- the storage system 104 includes storage system control logic 106 and one or more storage device(s) 110 a , 110 b , . . . , 110 m .
- System 100 may include one or more peripheral device(s) 140 coupled to computing device 102 .
- Computing device 102 may include, but is not limited to, a server, a workstation computer, a desktop computer, a laptop computer, a tablet computer (e.g., iPad®, GalaxyTab® and the like), an ultraportable computer, an ultramobile computer, a netbook computer and/or a subnotebook computer; a mobile telephone including, but not limited to a smart phone, (e.g., iPhone®, Android®-based phone, Blackberry®, Symbian®-based phone, Palm®-based phone, etc.) and/or a feature phone.
- a smart phone e.g., iPhone®, Android®-based phone, Blackberry®, Symbian®-based phone, Palm®-based phone, etc.
- a feature phone e.g., Nokia®-based phone, Samsung®-based phone, etc.
- Computing device 102 includes a processor 120 , a chipset 122 and a memory 126 .
- Processor 120 may include one or more processing unit(s) (e.g., processor(s) and/or core(s)) and is configured to perform operations associated with computing device 102 .
- Chipset 122 may include a storage device interface 124 .
- Chipset 122 and/or storage device interface 124 may be configured to couple processor 120 to storage system 104 , storage system control logic 106 and/or peripheral device(s) 140 .
- Storage device interface 124 may be configured to communicate with storage system 104 , storage system control logic 106 and/or peripheral device(s) 140 via one or more interface and/or interconnect protocols, e.g., PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA), NVMe (Non-Volatile Memory Host Controller Interface Express), etc.
- PCIe® Peripheral Component Interconnect Express
- SAS Serial Attached SCSI (Small Computer System Interface)
- ATA Advanced Technology Attachment
- SATA Serial ATA
- NVMe Non-Volatile Memory Host Controller Interface Express
- Peripheral device(s) 140 may include, for example, user interface device(s) including a display, a touch-screen display, printer, keypad, keyboard, etc., communication logic, wired and/or wireless, other storage device(s) including hard disk drives, solid-state drives, removable storage media, etc.
- Memory 126 may include cache and/or random access memory. Memory 126 is configured to store an OS 130 and one or more application(s) 134 . The application(s) 134 may be configured to initiate memory access operations related to storage system 104 .
- OS 130 may include a file system 132 .
- File system 132 is configured to manage file storage and/or retrieval for computing device 102 .
- file system 132 may be included in a storage stack that includes one or more of the interface and/or interconnect protocols, storage system control logic 106 and storage system 104 (e.g., storage device 110 a and/or device control logic 150 , as described herein).
- Storage system 104 includes one or more storage device(s) 110 a , 110 b , . . . , 110 m .
- the storage devices 110 a , 110 b , . . . , 110 m may include, but are not limited to, solid state drives (SSDs), hard disk drives (HDD), etc.
- SSDs solid state drives
- HDD hard disk drives
- Each storage device, e.g., storage device 110 a includes device control logic 150 , a host interface 152 , a cache memory 154 , a non-volatile memory (NVM) 156 and a map table 158 .
- NVM non-volatile memory
- Cache memory 154 may include non-volatile memory, as described herein, or volatile memory, e.g., dynamic random access memory (DRAM), synchronous DRAM (SDRAM), etc.
- Device control logic 150 is configured to manage operations of storage device 110 a .
- Device control logic 150 is configured to write data to and read data from NVM 156 .
- NVM 156 is configured to store user data, parity information and/or NVM operational information.
- NVM operational information may include logic associated with operation of storage device 110 a and/or metadata.
- NVM 156 includes a plurality of storage elements that may be organized in pages and/or blocks. Block(s) of storage elements in NVM 156 may be identified by physical block address(es) (PBA(s)).
- PBA physical block address
- Data received from computing device 102 and/or storage system control logic 106 may be identified by device LBA(s).
- Device control logic 150 may be configured to relate each device LBA to a respective PBA and store the result in map table 158 .
- Map table 158 may further include a state indicator associated with each device LBA. Map table 158 may be resident in NVM 156 and/or cache 154 .
- the state indicator may have two values that correspond to a mapped device LBA (e.g., logic one) and an unmapped device LBA (e.g., logic zero), respectively.
- device control logic 150 may be configured to set the associated state indicator to logic one when a respective device LBA is written to (i.e., mapped) and reset the associated state indicator from logic one to logic zero when a previously mapped device LBA becomes unmapped.
- map table 158 may be utilized by device control logic 150 to identify a state of each device LBA(s), as described herein.
- Storage device(s) 110 a , 110 b , . . . , 110 m may correspond to hard disk drive(s) (HDD(s)) and/or solid state drive(s) (SSD(s)).
- HDD hard disk drive
- SSD solid state drive
- NVM 156 may include ferromagnetic material.
- NVM 156 may include, but is not limited to, magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM, NAND flash memory, etc.
- MRAM magnetoresistive random access memory
- PCM phase change memory
- PRAM phase change memory
- resistive memory ferroelectric memory
- F-RAM ferroelectric memory
- FeRAM spin-transfer torque memory
- STT thermal assisted switching memory
- FJG RAM floating junction gate memory
- MTJ magnetic tunnel junction
- ECM electrochemical cells
- binary oxide filament cell memory binary oxide filament cell memory
- interfacial switching memory interfacial switching memory
- NAND flash memory etc.
- Host interface 152 is configured to receive commands and/or data from computing device 102 and/or storage system control logic 106 and to transmit responses (i.e., replies) and/or data to computing device 102 and/or storage system control logic 106 .
- Host interface 152 may be configured to couple storage device 110 a to storage system control logic 106 and/or host device 102 .
- Host interface 152 may comply and/or be compatible with one or more interface and/or interconnect protocol(s), as described herein.
- Host interface 152 may be further configured to decode received commands and/or data and to provide the decoded commands and/or data to device control logic 150 .
- Host interface 152 may be further configured to encode data and/or response(s) from device control logic 150 to a format (and/or syntax) that complies and/or is compatible with the interface and/or interconnect protocol(s).
- Cache 154 is configured to store commands and/or data. Commands and/or data may be stored in cache 154 prior to decoding and/or prior to encoding. Received data, e.g., data received from computing device 102 , may be stored in cache 154 prior to being written to NVM 156 .
- host interface 152 may provide an interface, interconnect and/or communication between storage device 110 a and computing device 102 and/or storage system control logic 106 .
- Storage system control logic 106 may be resident in storage system 104 , may be coupled between computing device 102 and storage system 104 and/or may be resident in computing device 102 .
- Storage system control logic 106 is configured to manage memory access operations (e.g., read and/or write operations) between computing device 102 , storage system 104 (and storage devices 110 a , 110 b , . . . , 110 m ).
- Storage system control logic 106 is configured to abstract storage devices 110 a , 110 b , . . . , 110 m to computing device 102 .
- storage system 104 may correspond to a RAID system and storage system control logic 106 may then correspond to a RAID controller.
- Storage system control logic 106 is configured to receive read and/or write commands and associated host LBA(s) from computing device 102 (e.g., from file system 132 ) and to determine, select and/or identify appropriate storage device(s) 110 a , 110 b , . . . , and/or 110 m based, at least in part, on the host LBA(s).
- Storage system control logic 106 may be configured to map each host LBA to one or more respective device LBA(s). In a redundant storage system, one host LBA may be mapped to device LBAs associated with more than one storage device.
- Storage system control logic 106 is further configured to receive user data associated with the host LBA(s). Storage system control logic 106 may receive read data from one or more storage device(s) 110 a , 110 b , . . . , 110 m and provide the read data to computing device 102 .
- storage system control logic 106 may be configured to determine and/or select appropriate storage device(s) 110 a , 110 b , . . . , and/or 110 m based, at least in part, on the host LBA(s). For example, storage system control logic 106 may be configured to distribute user data associated with a write command across a plurality of the storage devices 110 a and 110 b , . . . , and/or 110 m in order to provide redundancy and/or relatively better performance compared to a storage system that includes a single storage device. Thus, in this example, storage system control logic 106 may be configured to select a plurality of storage devices 110 a and 110 b , . . .
- Storage system control logic 106 may be further configured to relate each subset of host LBA(s) to a respective plurality of device LBA(s) for each storage device.
- Storage system control logic 106 may be further configured to determine parity information, e.g., a checksum, based, at least in part, on data to be written, and to store the parity information to device LBA(s) of one or more of the storage device(s) 110 a , 110 b , . . . , and/or 110 m .
- storage system control logic 106 may be configured to select one storage device, e.g., storage device 110 a , to associate with the host LBAs and to receive the user data.
- Storage system control logic 106 may be further configured to determine parity information based, at least in part, on data to be written, and to store the parity information to another of the storage devices 110 b , . . . , or 110 m .
- storage system control logic 106 may be configured to provide write command(s), data (e.g., user data and/or parity information) and associated device LBA(s) to selected storage device(s) 110 a , 110 b , . . . , and/or 110 m.
- storage system control logic 106 is configured to identify appropriate storage device(s) 110 a , 110 b , . . . , and/or 110 m and to provide read command(s) and associated device LBA(s) to identified storage device(s) 110 a , 110 b , . . . , and/or 110 m based, at least in part, on host LBA(s).
- Storage system control logic 106 may be further configured to receive the read user data from the identified storage device(s) 110 a , 110 b , . . . , and/or 110 m and to provide the read data to computing device 102 .
- Storage system control logic 106 may be configured to verify that read data is not corrupted based, at least in part, on parity and/or redundant information associated with the read data. Storage system control logic 106 may be further configured to correct corrupted data, if possible, based, at least in part, on the parity and/or redundant information.
- Storage system control logic 106 is configured to detect failure of one or more of storage devices 110 a , 110 b , . . . , and/or 110 m . Storage system control logic 106 may be further configured to initiate rebuild operations in response to a detected failure. Storage system control logic 106 may be configured to communicate the failure to computing device 102 . Computing device 102 and/or OS 130 may be configured to notify, e.g., a user via peripheral device(s) 140 , that a storage device has failed. The notice is configured to prompt the user to replace the failed storage device with a replacement storage device.
- storage system control logic 106 may be configured to receive an indication from a user via computing device 102 and/or peripheral device(s) 140 that a replacement storage device has been coupled to storage system control logic 106 .
- storage system control logic 106 may be configured to detect that a replacement storage device has been coupled to storage system control logic 106 in place of the failed storage device.
- storage system 104 may include a replacement storage device configured to be utilized if a storage device fails. The replacement storage device may exist in storage system 104 prior to a failure of a storage device and/or may be added by the user after the failure is detected.
- the failed storage device may be storing less than a maximum storage capacity of the failed storage device.
- fewer device LBA(s) than the span (i.e., the range of device LBAs that may store user data) of the failed storage device may be mapped.
- Rebuild operations may be accelerated by building mapped portions of the failed storage device rather than the entire span of the failed storage device.
- storage system control logic 106 may be configured to request respective state(s) of target device LBA(s) from each storage device 110 a , 110 b , . . . , and/or 110 m (“target storage device(s)”) that may have mapped device LBAs related to the failed storage device.
- Storage system control logic 106 may be configured to identify the target storage device(s) based, at least in part, on a configuration of storage system 104 .
- Storage system control logic 106 may be further configured to identify target device LBA(s) based, at least in part, on the configuration of storage system 104 . For example, in a mirrored configuration, one target storage device may include mapped LBA(s) related to the failed storage device.
- a plurality of storage devices may include mapped device LBAs related to the failed storage device.
- the plurality of storage devices may be included in a storage pool.
- Storage system control logic 106 may be configured to identify the plurality of target storage devices based, at least in part, on the storage pool.
- Storage system control logic 106 may be configured to identify the target storage device(s) and request the state(s) of target device LBA(s) in response to an indication that a storage device has failed.
- storage system 104 may be configured to provide redundancy by mirroring stored data on a first storage device and a second storage device (e.g., RAID 1).
- Storage system control logic 106 may then be configured to request respective state(s) of target device LBA(s) from the second storage device if the first storage device fails.
- rebuild operations may then include copying data associated with the mapped device LBA(s) from the target storage device (i.e., the second storage device) to the replacement storage device.
- storage system 104 may be configured to provide fault tolerance by “striping” blocks of data across a plurality of storage devices (i.e., storage pool) and distributing parity information across the plurality of storage devices (e.g., RAID 5).
- a stripe of data includes segments of logically sequential data that are distributed across a plurality of storage devices, i.e., a storage pool.
- Storage system control logic 106 may be configured to identify the plurality of storage devices in the storage pool. Storage system control logic 106 may then be configured to request respective state(s) of target device LBA(s) from each of the plurality of storage devices (except the failed storage device) in the storage pool. The state(s) of target device LBA(s) may be requested for a stripe of data.
- Whether to read and write the target device LBAs in the stripe may then be determined for the plurality of target storage devices in the storage pool.
- rebuild operations may include determining and/or verifying values of stored data based, at least in part, on the parity information. Stripes of unmapped target device LBAs may not be read nor written.
- file system 132 may be configured to provide a trim (TRIM) command and associated host LBA(s) 162 to storage system control logic 106 .
- TRIM is configured to communicate host LBAs to storage system 104 that have been unmapped by the file system 132 .
- host LBAs may be unmapped in response to a permanent deletion of a file, emptying a “recycle bin” and/or performing a “quick format”, e.g., in a Microsoft® Windows® OS.
- TRIM is configured to optimize garbage collection and erase operations in SSDs that erase blocks of storage elements prior to writing new data to the storage elements.
- storage system control logic 106 may be configured to send a TRIM request 160 to computing device 102 in response to a storage device failure.
- the TRIM request 160 is configured to trigger transmission of a TRIM command and associated host LBA(s) 162 from file system 132 to storage system 104 .
- the TRIM command may comply and/or be compatible with one or more interface and/or interconnect protocols, as described herein.
- the TRIM command may correspond to an ATA DATA SET MANAGEMENT command, a SCSI (and/or SAS) UNMAP command and/or an NVMe DEALLOCATE command, as described herein.
- Storage system control logic 106 may receive the TRIM command and associated host LBAs 162 and may provide a corresponding TRIM command and associated device LBAs 164 to the target storage device(s).
- a host interface of each target storage device e.g., host interface 152 of storage device 110 a
- Device control logic of each target storage device e.g., device control logic 150 , may then be configured to unmap the device LBA(s) that correspond to received TRIM command.
- the target storage device(s) may have a most up-to-date mapping (and unmapping) of device LBA(s).
- Storage system control logic 106 is configured to send a request for state of target device LBA(s) command (LBA state request command) 166 to each target storage device, e.g., storage device 110 a , associated with the failed storage device.
- the LBA state request command may include one or more target device LBA(s).
- the LBA state request command may request a listing of mapped device LBA(s) and/or a listing of unmapped device LBA(s).
- Storage system control logic 106 may send the LBA state request command 166 in response to a failure of a storage device, e.g., storage device 110 b .
- the LBA state request command 166 may be sent without first sending the TRIM request command 160 .
- the LBA state request command 166 may be sent after sending the TRIM request command 160 and/or after receiving the TRIM command and associated host LBA(s) 162 .
- the LBA state request command 166 may correspond to and/or be compatible with one or more of the interface and/or interconnect protocols, as described herein.
- the LBA state request command 166 may have an associated syntax that is specific to a selected storage device, e.g., storage device 110 a .
- a specific syntax may be configured to provide a level of security by limiting use of the command to selected storage devices (e.g., vendor-specific).
- the specific syntax may be further configured to provide a vendor-specific command.
- a vendor-specific command may simplify a command structure and/or facilitate device testing.
- Each target storage device e.g., storage device 110 a
- the repl(ies) 168 may comply and/or be compatible with one or more interface and/or interconnect protocols.
- host interface 152 may be configured to receive the LBA state request command 166 , to decode the command 166 and to provide the decoded request to device control logic 150 .
- Device control logic 150 may be configured to identify the state(s) of target device LBA(s) based, at least in part, on map table 158 .
- map table 158 may relate device LBA(s) to NVM 156 PBA(s) and may further include a state indicator, e.g., a flag, that corresponds to a state of an associated device LBA.
- the flag may be a bit and may have a value of one (e.g., TRUE) for a target device LBA that is mapped and a value of zero (e.g., FALSE) for a target device LBA that is unmapped.
- Device control logic 150 may be configured to read map table 158 and the associated indicators to identify the state(s) of target device LBA(s). Device control logic 150 may then be configured to provide the state(s) to host interface 152 .
- Host interface 152 may then generate the reply 168 that includes the state(s) of target device LBA(s) and to transmit the response to storage system control logic 106 .
- Responses that include the states of a plurality of target device LBA(s) may be configured to relate each state indicator to its respective target device LBA.
- the reply 168 may comply and/or be compatible with one or more interface and/or interconnect protocols, as described herein.
- storage system control logic 106 may be configured to include a target device LBA with the request for mapped and/or unmapped device LBA(s) command 166 .
- each target storage device is configured to determine a state of the target device LBA and to provide the resulting state indicator to storage system control logic 106 in reply 168 .
- the LBA state request command 166 that includes the target device LBA may be sent during rebuild, e.g., prior to building each target device LBA.
- storage system control logic 106 may be configured to include a range of target device LBAs with the LBA state request command 166 .
- each target storage device may be configured to determine a respective state of each device LBA included in the range of target device LBA(s) and to provide the resulting state indicator(s) to storage system control logic 106 in reply 168 .
- the LBA state request command 166 that includes the range of target device LBAs may be sent prior to and/or during rebuild, e.g., prior to building the target device LBAs.
- storage system control logic 106 may be configured to request a listing of mapped device LBA(s) and/or a listing of unmapped device LBA(s).
- each target storage device may be configured to provide device LBA(s) and associated state indicators from map table 158 in their respective replies 168 .
- the LBA state request command 166 that includes the request for the listing(s) may be sent prior to rebuild.
- the storage system control logic 106 may request state(s) of target device LBA(s) from target storage device(s) in response to detection of a failed storage device.
- the target storage device(s) may then provide the state indicator(s) of the target device LBA(s) to the storage system control logic 106 in response to the request.
- the storage system control logic 106 may then be configured to identify a respective state of each target device LBA.
- storage system control logic 106 may be configured to perform a logical OR operation (e.g., exclusive OR (XOR)) of the state indicator(s) and a test value to identify the respective state(s).
- XOR exclusive OR
- Storage system control logic 106 may then be configured to build a replacement storage device, e.g., storage device 110 a , configured to replace the failed storage device, e.g., storage device 110 b .
- storage system control logic 106 may be configured to send a read command that includes mapped device LBA(s) to the target storage device(s) and to write the read data to corresponding device LBA(s) of the replacement storage device 110 a .
- Unmapped device LBA(s) may not be read and may not be written.
- the data may include user data and/or parity information, as described herein.
- Storage system control logic 106 may be further configured to update parity information associated with the replacement storage device 110 a .
- a rebuild of the failed storage device 110 b may be accelerated as described herein by reading and writing mapped device LBA(s) and not reading and not writing contents of unmapped device LBA(s).
- the rebuild may be performed independent of a file system, e.g., file system 130 , file structure and/or partitioning related to computing device 102 .
- FIG. 2 is a flowchart 200 of accelerated data recovery operations according to various embodiments of the present disclosure.
- flowchart 200 illustrates acquiring state(s) of target device LBA(s) associated with a failed storage device and building a replacement storage device by writing the contents of the mapped device LBA(s) to the replacement storage device.
- the operations may be performed, for example, by storage system control logic 106 of FIG. 1 .
- a TRIM command may be requested at operation 204 .
- the TRIM command may be requested from a file system included in a computing device, e.g., file system 132 of computing device 102 .
- Target storage device(s) may be identified at operation 206 .
- Target storage device(s) may be storing data related to the failed storage device.
- one failed storage device may correspond to one target storage device.
- one failed storage device may be related to a plurality of target storage devices.
- Target device LBA(s) may be identified at operation 208 .
- a respective state of each identified target device LBA may be requested at operation 210 .
- the respective state(s) may be requested from target storage device(s) that include data related to data stored on the failed storage device.
- Requested state indicator(s) may be received at operation 212 .
- Data associated with mapped device LBA(s) may be read from the target storage device(s) and written to at least one replacement storage device at operation 214 .
- data associated with the mapped device LBA(s) may include user data and/or parity information.
- Program flow may then continue at operation 216 .
- a rebuild of a failed storage device may be accelerated by copying data associated with mapped device LBA(s) from target storage device(s) (i.e., reading) to the replacement storage device (i.e., writing). Whether a device LBA is mapped or unmapped may be determined based, at least in part, on LBA state information received from the target storage device(s).
- FIG. 3 is a flowchart 300 of storage device operations according to various embodiments of the present disclosure.
- flowchart 300 illustrates unmapping device LBA(s) in response to a TRIM command and providing respective state(s) of target device LBA(s) in response to a request for them.
- the operations may be performed by a storage device, for example, one or more of storage device(s) 110 a , 110 b , . . . , and/or 110 m of FIG. 1 .
- a TRIM command may be received at operation 304 .
- the TRIM command is configured to identify device LBA(s) that correspond to host LBA(s) that have been unmapped by, e.g., a file system.
- Device LBA(s) associated with the TRIM command may be unmapped at operation 306 . Whether operations 304 and 306 are performed may depend on whether an OS, e.g., OS 130 , includes a TRIM command.
- Operations of this embodiment include receiving a request for state(s) of target device LBA(s) at operation 308 .
- One or more state(s) of target device LBA(s) may be determined in response to the request at operation 310 .
- state(s) of target device LBA(s) may be determined based, at least in part, on a map table, e.g., map table 158 of FIG. 1 .
- Operation 312 includes providing a reply that includes a respective state indicator related to the state of each target device LBA.
- the response may be provided to storage system control logic.
- Program flow may end at operation 314 .
- a storage device may provide respective state indicator(s) of one or more target device LBA(s) in response to a request from, e.g., storage system control logic.
- FIGS. 2 and 3 illustrate operations according various embodiments, it is to be understood that not all of the operations depicted in FIGS. 2 and/or 3 are necessary for other embodiments.
- the operations depicted in FIGS. 2 and/or 3 , and/or other operations described herein may be combined in a manner not specifically shown in any of the drawings, and such embodiments may include less or more operations than are illustrated in FIGS. 2 and/or 3 .
- claims directed to features and/or operations that are not exactly shown in one drawing are deemed within the scope and content of the present disclosure.
- FIGS. 4A and 4B illustrate one example of accelerated data recovery in a RAID level 1 (i.e., mirrored) system.
- FIG. 4A illustrates an example 400 of NVM 402 , 404 of a RAID 1 that includes two storage devices (Device 0 and Device 1 ) configured to mirror stored data.
- the NVM 402 , 404 of each device is configured to store the same data as the NVM 404 , 402 of the other device.
- device LBA(s) A 1 , A 2 , . . . , AN are mapped and LBA(s) AN+1, . . . , AY are unmapped.
- Device 1 has failed.
- Example 400 further includes a replacement device (Device R) that is configured to replace failed Device 1 .
- Device R includes NVM 406 that includes device LBA(s) A 1 , A 2 , . . . , AY that are unmapped. In other words the span of NVM 406 is unmapped. A user may remove failed Device 1 and replace Device 1 with Device R.
- FIG. 4B illustrates the example 400 ′ of FIG. 4A after the failed Device 1 has been replaced with Device R.
- Storage system control logic e.g., storage system control logic 106 of FIG. 1 , is configured to request state(s) of target device LBA(s) from Device 0 , for example, by sending a LBA state request command to Device 0 , as described herein.
- Device 0 is configured to receive the request and to transmit respective state indicator(s) of device LBA(s), A 1 , A 2 , . . . , AN (i.e., mapped), to the storage system control logic in response to receiving the LBA state request command, as described herein.
- the storage system control logic may then be configured to copy the data associated with mapped device LBA(s) A 1 , A 2 , . . . , AN from NVM 402 to NVM 406 .
- Device LBA(s) AN+1, . . . , AY of NVM 406 may remain unmapped. Thus, rebuilding a failed storage device may be accelerated, as described herein, compared to copying the full span.
- OS 130 may be configured to manage system 100 resources and control tasks that are run on each respective device and/or system, e.g., computing device 102 and/or storage system 104 .
- the OS may be implemented using Microsoft® Windows®, HP-UX®, Linux®, or UNIX®, although other operating systems may be used.
- the OS 130 may be replaced by a virtual machine monitor (or hypervisor) which may provide a layer of abstraction for underlying hardware to various operating systems (virtual machines) running on one or more processing units.
- Memory 126 may include one or more of the following types of memory: semiconductor firmware memory, programmable memory, non-volatile memory, read only memory, electrically programmable memory, random access memory, flash memory, magnetic disk memory, and/or optical disk memory. Either additionally or alternatively device memory may include other and/or later-developed types of computer-readable memory.
- Embodiments of the operations described herein may be implemented in a computer-readable storage device having stored thereon instructions that when executed by one or more processors perform the methods.
- the processor may include, for example, a processing unit and/or programmable circuitry.
- the storage device may include a machine readable storage device including any type of tangible, non-transitory storage device, for example, any type of disk including floppy disks, optical disks, compact disk read-only memories (CD-ROMs), compact disk rewritables (CD-RWs), and magneto-optical disks, semiconductor devices such as read-only memories (ROMs), random access memories (RAMs) such as dynamic and static RAMs, erasable programmable read-only memories (EPROMs), electrically erasable programmable read-only memories (EEPROMs), flash memories, magnetic or optical cards, or any type of storage devices suitable for storing electronic instructions.
- ROMs read-only memories
- RAMs random access memories
- EPROMs erasable programm
- logic may refer to an app, software, firmware and/or circuitry configured to perform any of the aforementioned operations.
- Software may be embodied as a software package, code, instructions, instruction sets and/or data recorded on non-transitory computer readable storage medium.
- Firmware may be embodied as code, instructions or instruction sets and/or data that are hard-coded (e.g., nonvolatile) in memory devices.
- Circuitry may comprise, for example, singly or in any combination, hardwired circuitry, programmable circuitry such as computer processors comprising one or more individual instruction processing cores, state machine circuitry, and/or firmware that stores instructions executed by programmable circuitry.
- the logic may, collectively or individually, be embodied as circuitry that forms part of a larger system, for example, an integrated circuit (IC), an application-specific integrated circuit (ASIC), a system on-chip (SoC), desktop computers, laptop computers, tablet computers, servers, smart phones, etc.
- IC integrated circuit
- ASIC application-specific integrated circuit
- SoC system on-chip
- Computing device 102 and storage system 104 may be configured to communicate with each other via one or more interface and/or interconnect protocols.
- Storage system control logic 106 and storage device(s) 110 a , 110 b , . . . , and/or 110 m may be configured to communicate with each other via one or more interface and/or interconnect protocols.
- the interface and/or interconnect protocol may comply and/or be compatible with PCIe® (Peripheral Component Interconnect Express) specification, titled: PCI Express ® Base Specification, Revision 3.0, released by PCI-SIG® (PCI Special Interest Group), Beaverton, Oreg., November 2010, and/or later versions of this specification, for example, version 3.1.
- PCIe® Peripheral Component Interconnect Express
- PCI-SIG® PCI Special Interest Group
- Beaverton Oreg., November 2010, and/or later versions of this specification, for example, version 3.1.
- the interface and/or interconnect protocol may comply and/or be compatible with SAS (Serial Attached SCSI (Small Computer System Interface)) standard SAS-3, titled: Serial Attached SCSI-3 (SAS-3) 2212-D ANSI-approved, published by American National Standards Institute (ANSI) InterNational Committee for Information Technology Standards (INCITS), November, 2013 , and/or later versions of this standard.
- SAS Serial Attached SCSI
- SAS-3 Serial Attached SCSI-3 2212-D ANSI-approved, published by American National Standards Institute (ANSI) InterNational Committee for Information Technology Standards (INCITS), November, 2013 , and/or later versions of this standard.
- the interface and/or interconnect protocol may comply and/or be compatible with ATA (Advanced Technology Attachment) standard, document number: INCITS 452-2009/AM 1 -2010, titled: Information Technology-AT Attachment 8-ATA/ATAPI Command Set (ATA8-ACS) Amendment 1, published in 2010 by ANSI INCITS, and/or related and/or later versions of this standard, for example, document number: INCITS 482-2012, titled: Information Technology-ATA/ATAPI Command Set-2 (ACS-2), published in 2012, and/or later and/or related versions of this standard.
- ATA Advanced Technology Attachment
- the interface and/or interconnect protocol may comply and/or be compatible with SATA (Serial ATA (Advanced Technology Attachment)) specification, titled: Serial ATA Specification, Revision 3.2, released August 2013, by SATA International Organization (SATA-IO), Beaverton, Oreg., and/or earlier and/or later versions of this specification.
- SATA Serial ATA (Advanced Technology Attachment)
- SATA-IO Serial ATA International Organization
- Beaverton Oreg., and/or earlier and/or later versions of this specification.
- the interface and/or interconnect protocol may comply and/or be compatible with NVMe (Non-Volatile Memory Host Controller Interface Express) specification titled: NVM ExpressTM, Revision 1.2, released Nov. 3, 2014 by NVM ExpressTM Work Group, and/or earlier and/or later versions of this specification.
- NVMe Non-Volatile Memory Host Controller Interface Express
- a hardware description language may be used to specify circuit and/or logic implementation(s) for the various logic and/or circuitry described herein.
- the hardware description language may comply or be compatible with a very high speed integrated circuits (VHSIC) hardware description language (VHDL) that may enable semiconductor fabrication of one or more circuits and/or logic described herein.
- VHSIC very high speed integrated circuits
- VHDL may comply or be compatible with IEEE Standard 1076-1987, IEEE Standard 1076.2, IEEE1076.1, IEEE Draft 3.0 of VHDL-2006, IEEE Draft 4.0 of VHDL-2008 and/or other versions of the IEEE VHDL standards and/or other hardware description standards.
- a system and method are configured to provide an accelerated rebuild of a replacement storage device by reading and writing (e.g., copying) data associated with mapped device LBA(s).
- the data associated with device LBA(s) determined to be mapped may be copied.
- a time duration associated with rebuilding the failed storage device may be reduced relative to copying an entire span of a storage device.
- Examples of the present disclosure include subject material such as a method, means for performing acts of the method, a device, or of an apparatus or computer-readable storage device related to accelerated data recovery in a storage system, as discussed below.
- the storage system includes storage system control logic.
- the storage system control logic is to identify at least one target storage device in response to detection of a failed storage device, request a state of a target device logical block address (LBA) from each of the at least one target storage device, and read data associated with a mapped device LBA from each target storage device and write the data to at least one replacement storage device.
- LBA target device logical block address
- This example includes the elements of example 1, wherein the storage system control logic is further to request a trim command (TRIM) from a computing device to be provided to each target storage device.
- TIM trim command
- This example includes the elements of example 1, wherein the storage system control logic is further to identify each target device LBA.
- This example includes the elements of example 1, wherein the storage system control logic is further to request from each target storage device a respective state of each target device LBA of a range of target device LBAs.
- This example includes the elements of example 1, wherein the storage system control logic is further to request a listing of mapped device LBAs from each target storage device.
- This example includes the elements of example 1, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 1 through 6, wherein the storage system control logic is to request the state of each target device LBA at least one of prior to and/or during a rebuild of the failed storage device.
- This example includes the elements according to any one of examples 1 through 6, wherein the request includes a respective target device LBA and the storage system control logic is to receive an indicator related to the state of the respective target device LBA.
- This example includes the elements according to any one of examples 1 through 6, wherein the request includes a range of target device LBAs and the storage system control logic is to receive indicators related to the respective state of each target device LBA in the range of target device LBAs.
- This example includes the elements according to any one of examples 1 through 6, wherein the storage system control logic is further to determine parity information based, at least in part, on the data associated with the mapped device LBA.
- the storage device includes device control logic to determine a state of a target device logical block address (LBA) in response to a request; a host interface to provide a reply to the request, the reply including a state indicator related to the state of the target device LBA; a map table including a plurality of device LBAs and respective state indicators; and non-volatile memory (NVM) including data related to at least one mapped LBA.
- LBA target device logical block address
- NVM non-volatile memory
- This example includes the elements of example 11, wherein the device control logic is to unmap a device LBA in response to a receiving trim command (TRIM).
- TIM trim command
- This example includes the elements of example 11, wherein the device control logic is further to determine a respective state of each target device LBA of a range of target device LBAs in response to the request.
- This example includes the elements of example 11, wherein the device control logic is further to determine a respective state of each device LBA of the plurality of device LBAs in response to the request.
- This example includes the elements of example 14, wherein the reply includes at least a portion of the plurality of device LBAs and a respective state of each device LBA in the portion is mapped.
- This example includes the elements of example 11, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 11 through 16, wherein the host interface is compatible with at least one of PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA) and/or NVMe (Non-Volatile Memory
- PCIe® Peripheral Component Interconnect Express
- SAS Serial Attached SCSI (Small Computer System Interface)
- ATA Advanced Technology Attachment
- SATA Serial ATA
- NVMe Non-Volatile Memory
- NVM includes one or more of magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM and/or NAND flash memory.
- MRAM magnetoresistive random access memory
- PCM phase change memory
- PRAM phase change memory
- resistive memory ferroelectric memory
- F-RAM ferroelectric memory
- STT spin-transfer torque memory
- TAS thermal assisted switching memory
- FJG RAM floating junction gate memory
- MTJ magnetic tunnel junction
- ECM electrochemical cells
- This example includes the elements according to any one of examples 11 through 16, wherein the host interface is to couple the storage device to at least one of a storage system controller logic and/or a computing device.
- This example includes the elements according to any one of examples 11 through 16, wherein the request is received from and the reply is provided to storage system control logic.
- the method includes identifying, by storage system control logic, at least one target storage device in response to detection of a failed storage device; requesting, by storage system control logic, a state of a target device logical block address (LBA) from each of the at least one target storage device; reading, by storage system control logic, data associated with a mapped device LBA from each target storage device; and writing, by storage system control logic, the data to at least one replacement storage device.
- LBA target device logical block address
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, a trim command (TRIM) from a computing device to be provided to each target storage device.
- TIM trim command
- This example includes the elements of example 21 and further includes identifying, by storage system control logic, each target device LBA.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, from each target storage device a respective state of each target device LBA of a range of target device LBAs.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, a listing of mapped device LBAs from each target storage device.
- This example includes the elements of example 21, wherein the state corresponds to mapped or unmapped.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, the state of each target device LBA at least one of prior to and/or during a rebuild of the failed storage device.
- This example includes the elements of example 21, wherein the request includes a respective target device LBA and further including receiving, by storage system control logic, an indicator related to the state of the respective target device LBA.
- This example includes the elements of example 21, wherein the request includes a range of target device LBAs and further including receiving, by storage system control logic, indicators related to the respective state of each target device LBA in the range of target device LBAs.
- This example includes the elements of example 21 and further includes determining, by storage system control logic, parity information based, at least in part, on the data associated with the mapped device LBA.
- the method includes determining, by device control logic, a state of a target device logical block address (LBA) in response to a request based, at least in part, on a map table including a plurality of device LBAs and respective state indicators, the map table related to a non-volatile memory (NVM) that includes data related to at least one mapped LBA; and providing, by a host interface, a reply to the request, the reply including a state indicator related to the state of the target device LBA.
- LBA target device logical block address
- NVM non-volatile memory
- This example includes the elements of example 31 and further includes unmapping, by the device control logic, a device LBA in response to a receiving trim command (TRIM).
- TIM trim command
- This example includes the elements of example 31 and further includes determining, by the device control logic, a respective state of each target device LBA of a range of target device LBAs in response to the request.
- This example includes the elements of example 31 and further includes determining, by the device control logic, a respective state of each device LBA of the plurality of device LBAs in response to the request.
- This example includes the elements of example 34, wherein the reply includes at least a portion of the plurality of device LBAs and a respective state of each device LBA in the portion is mapped.
- This example includes the elements of example 31, wherein the state corresponds to mapped or unmapped.
- This example includes the elements of example 3 1 , wherein the reply is compatible with at least one of PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA) and/or NVMe (Non-Volatile Memory Host Controller Interface Express).
- PCIe® Peripheral Component Interconnect Express
- SAS Serial Attached SCSI (Small Computer System Interface)
- ATA Advanced Technology Attachment
- SATA Serial ATA
- NVMe Non-Volatile Memory Host Controller Interface Express
- NVM includes one or more of magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM and/or NAND flash memory.
- MRAM magnetoresistive random access memory
- PCM phase change memory
- PRAM phase change memory
- resistive memory ferroelectric memory
- F-RAM ferroelectric memory
- STT spin-transfer torque memory
- TAS thermal assisted switching memory
- FJG RAM floating junction gate memory
- MTJ magnetic tunnel junction
- ECM electrochemical cells
- This example includes the elements of example 31 and further includes coupling, by the host interface, a storage device to at least one of a storage system controller logic and/or a computing device.
- This example includes the elements of example 31, wherein the request is received from and the reply is provided to storage system control logic.
- a computer-readable storage device having stored thereon instructions that when executed by one or more processors result in the following operations including identifying at least one target storage device in response to detection of a failed storage device; requesting a state of a target device logical block address (LBA) from each of the at least one target storage device; reading data associated with a mapped device LBA from each target storage device; and writing the data to at least one replacement storage device.
- LBA target device logical block address
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting a trim command (TRIM) from a computing device to be provided to each target storage device.
- TIM trim command
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including identifying each target device LBA.
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting from each target storage device a respective state of each target device LBA of a range of target device LBAs.
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting a listing of mapped device LBAs from each target storage device.
- This example includes the elements of example 41, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 41 through 46, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting the state of each target device LBA at least one of prior to and/or during a rebuild of the failed storage device.
- This example includes the elements according to any one of examples 41 through 46, wherein the request includes a respective target device LBA and the instructions that when executed by one or more processors results in the following additional operations including receiving an indicator related to the state of the respective target device LBA.
- This example includes the elements according to any one of examples 41 through 46, wherein the request includes a range of target device LBAs and the instructions that when executed by one or more processors results in the following additional operations including receiving indicators related to the respective state of each target device LBA in the range of target device LBAs.
- This example includes the elements according to any one of examples 41 through 46, wherein the instructions that when executed by one or more processors results in the following additional operations including determining parity information based, at least in part, on the data associated with the mapped device LBA.
- a computer-readable storage device having stored thereon instructions that when executed by one or more processors result in the following operations including determining a state of a target device logical block address (LBA) in response to a request, based, at least in part, on a map table including a plurality of device LBAs and respective state indicators, the map table related to a non-volatile memory (NVM) that includes data related to at least one mapped LBA; and providing a reply to the request, the reply including a state indicator related to the state of the target device LBA.
- LBA target device logical block address
- NVM non-volatile memory
- This example includes the elements of example 51, wherein the instructions that when executed by one or more processors results in the following additional operations including unmapping a device LBA in response to a receiving trim command (TRIM).
- TIM trim command
- This example includes the elements of example 51, wherein the instructions that when executed by one or more processors results in the following additional operations including determining a respective state of each target device LBA of a range of target device LBAs in response to the request.
- This example includes the elements of example 51, wherein the instructions that when executed by one or more processors results in the following additional operations including determining a respective state of each device LBA of the plurality of device LBAs in response to the request.
- This example includes the elements of example 54, wherein the reply includes at least a portion of the plurality of device LBAs and a respective state of each device LBA in the portion is mapped.
- This example includes the elements of example 51, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 51 through 56, wherein the reply is compatible with at least one of PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA) and/or NVMe (Non-Volatile Memory Host Controller Interface Express).
- PCIe® Peripheral Component Interconnect Express
- SAS Serial Attached SCSI (Small Computer System Interface)
- ATA Advanced Technology Attachment
- SATA Serial ATA
- NVMe Non-Volatile Memory Host Controller Interface Express
- NVM includes one or more of magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM and/or NAND flash memory.
- MRAM magnetoresistive random access memory
- PCM phase change memory
- PRAM phase change memory
- resistive memory ferroelectric memory
- F-RAM ferroelectric memory
- STT spin-transfer torque memory
- TAS thermal assisted switching memory
- FJG RAM floating junction gate memory
- MTJ magnetic tunnel junction
- ECM electrochemical cells
- binary oxide filament cell memory interfacial switching memory
- battery-backed RAM battery-backed RAM and/or NAND flash memory
- This example includes the elements according to any one of examples 51 through 56, wherein the instructions that when executed by one or more processors results in the following additional operations including coupling a storage device to at least one of a storage system controller logic and/or a computing device.
- This example includes the elements according to any one of examples 51 through 56, wherein the request is received from and the reply is provided to storage system control logic.
- Another example of the present disclosure is a system including at least one device arranged to perform the method of any one of examples 21 to 30.
- Another example of the present disclosure is a system including at least one device arranged to perform the method of any one of examples 31 to 40.
- Another example of the present disclosure is a device including means to perform the method of any one of examples 21 to 30.
- Another example of the present disclosure is a device including means to perform the method of any one of examples 31 to 40.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Quality & Reliability (AREA)
- Debugging And Monitoring (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
One embodiment provides a storage system. The storage system includes storage system control logic to identify at least one target storage device in response to detection of a failed storage device, request a state of a target device logical block address (LBA) from each of the at least one target storage device, and read data associated with a mapped device LBA from each target storage device and write the data to at least one replacement storage device. Another embodiment provides a storage device. The storage device includes device control logic to determine a state of a target device logical block address (LBA) in response to a request; a host interface to provide a reply to the request, the reply including a state indicator related to the state of the target device LBA; a map table including a plurality of device LBAs and respective state indicators; and non-volatile memory (NVM) including data related to at least one mapped LBA.
Description
- The present application is a continuation of U.S. patent application Ser. No. 14/568,417 filed Dec. 12, 2014, the entire disclosure of which is incorporated herein by reference.
- The present disclosure relates to accelerated data recovery, in particular to, accelerated data recovery in a storage system.
- RAID (redundant array of independent (inexpensive) disks) systems include a plurality of storage devices configured to provide improved performance and/or fault tolerance relative to a single storage device. In redundant storage systems (e.g., RAID systems above level 0), one or more copies of data and/or parity information may be stored on a plurality of storage devices. Data stored on a failed storage device may be recovered from the cop(ies) and/or related parity information and stored on a replacement storage device. Such recovery and storage operations are known as “rebuilding”. Generally, rebuilding a failed storage device includes writing an entire address space of the failed storage device from the cop(ies) to the replacement storage device. As storage capacities of storage devices increase, the time duration of rebuild operations for a failed storage device also increases.
- Features and advantages of the claimed subject matter will be apparent from the following detailed description of embodiments consistent therewith, which description should be considered with reference to the accompanying drawings, wherein:
-
FIG. 1 illustrates a functional block diagram of a system consistent with various embodiments of the present disclosure; -
FIG. 2 is a flowchart of accelerated data recovery operations according to various embodiments of the present disclosure; -
FIG. 3 is a flowchart of storage device operations according to various embodiments of the present disclosure; and -
FIGS. 4A and 4B illustrate one example of accelerated data recovery in a RAID (redundant array of independent (inexpensive) disks)level 1 system. - Although the following Detailed Description will proceed with reference being made to illustrative embodiments, many alternatives, modifications, and variations thereof will be apparent to those skilled in the art.
- Generally, this disclosure relates to accelerated recovery methods (and systems) in a storage system. The storage system may include a plurality of storage devices configured to provide fault tolerance. In response to a failure of a storage device, the methods (and systems) are configured query one or more other storage device(s) to identify mapped logical block addresses (LBAs) related to the failed storage device.
- The methods and systems are further configured to provide an accelerated rebuild of a replacement storage device by reading and writing (i.e., copying) data associated with mapped LBA(s). In other words, rather than copying contents of all storage elements associated with the failed storage device to the replacement storage device, the contents of storage elements associated with mapped LBA(s) may be copied. Thus, a time duration associated with rebuilding the failed storage device may be reduced relative to copying an entire span (i.e., range of LBAs available for user data) of a storage device. By querying the storage device(s), i.e., by requesting state(s) of device LBA(s) from the storage device(s), such rebuilds may be performed independent of existence and/or type of file system that may be resident on a host computing device coupled to the storage system. Such rebuilds may also be performed independent of partitioning of the storage system by, e.g., an operating system (OS).
- As used herein, an unmapped device LBA is a storage device LBA that has not been written to or has been subject to a trim command (TRIM) and has not been written to since the TRIM, as described herein. A mapped device LBA is a storage device LBA that has been written to and has not been subject to a TRIM since being written to. Mapped and unmapped, as applied to a device LBA, correspond to respective states of the device LBA. Mapping within a storage device is configured to relate a device LBA to a physical block address (PBA) associated with a physical storage medium included in the storage device. Mapping may also be used to relate a host LBA to a device LBA.
-
FIG. 1 illustrates a functional block diagram of asystem 100 consistent with various embodiments of the present disclosure.System 100 includes acomputing device 102 and astorage system 104. Thestorage system 104 includes storagesystem control logic 106 and one or more storage device(s) 110 a, 110 b, . . . , 110 m.System 100 may include one or more peripheral device(s) 140 coupled tocomputing device 102.Computing device 102 may include, but is not limited to, a server, a workstation computer, a desktop computer, a laptop computer, a tablet computer (e.g., iPad®, GalaxyTab® and the like), an ultraportable computer, an ultramobile computer, a netbook computer and/or a subnotebook computer; a mobile telephone including, but not limited to a smart phone, (e.g., iPhone®, Android®-based phone, Blackberry®, Symbian®-based phone, Palm®-based phone, etc.) and/or a feature phone. -
Computing device 102 includes aprocessor 120, achipset 122 and amemory 126.Processor 120 may include one or more processing unit(s) (e.g., processor(s) and/or core(s)) and is configured to perform operations associated withcomputing device 102.Chipset 122 may include astorage device interface 124.Chipset 122 and/orstorage device interface 124 may be configured to coupleprocessor 120 tostorage system 104, storagesystem control logic 106 and/or peripheral device(s) 140.Storage device interface 124 may be configured to communicate withstorage system 104, storagesystem control logic 106 and/or peripheral device(s) 140 via one or more interface and/or interconnect protocols, e.g., PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA), NVMe (Non-Volatile Memory Host Controller Interface Express), etc. Peripheral device(s) 140 may include, for example, user interface device(s) including a display, a touch-screen display, printer, keypad, keyboard, etc., communication logic, wired and/or wireless, other storage device(s) including hard disk drives, solid-state drives, removable storage media, etc. -
Memory 126 may include cache and/or random access memory.Memory 126 is configured to store an OS 130 and one or more application(s) 134. The application(s) 134 may be configured to initiate memory access operations related tostorage system 104. OS 130 may include afile system 132.File system 132 is configured to manage file storage and/or retrieval forcomputing device 102. For example,file system 132 may be included in a storage stack that includes one or more of the interface and/or interconnect protocols, storagesystem control logic 106 and storage system 104 (e.g.,storage device 110 a and/ordevice control logic 150, as described herein). -
Storage system 104 includes one or more storage device(s) 110 a, 110 b, . . . , 110 m. For example, thestorage devices storage device 110 a, includesdevice control logic 150, ahost interface 152, acache memory 154, a non-volatile memory (NVM) 156 and a map table 158.Cache memory 154 may include non-volatile memory, as described herein, or volatile memory, e.g., dynamic random access memory (DRAM), synchronous DRAM (SDRAM), etc.Device control logic 150 is configured to manage operations ofstorage device 110 a.Device control logic 150 is configured to write data to and read data from NVM 156. NVM 156 is configured to store user data, parity information and/or NVM operational information. NVM operational information may include logic associated with operation ofstorage device 110 a and/or metadata. NVM 156 includes a plurality of storage elements that may be organized in pages and/or blocks. Block(s) of storage elements inNVM 156 may be identified by physical block address(es) (PBA(s)). Data received fromcomputing device 102 and/or storagesystem control logic 106 may be identified by device LBA(s).Device control logic 150 may be configured to relate each device LBA to a respective PBA and store the result in map table 158. Map table 158 may further include a state indicator associated with each device LBA. Map table 158 may be resident in NVM 156 and/orcache 154. The state indicator may have two values that correspond to a mapped device LBA (e.g., logic one) and an unmapped device LBA (e.g., logic zero), respectively. For example,device control logic 150 may be configured to set the associated state indicator to logic one when a respective device LBA is written to (i.e., mapped) and reset the associated state indicator from logic one to logic zero when a previously mapped device LBA becomes unmapped. Thus, map table 158 may be utilized bydevice control logic 150 to identify a state of each device LBA(s), as described herein. - Storage device(s)110 a, 110 b, . . . , 110 m may correspond to hard disk drive(s) (HDD(s)) and/or solid state drive(s) (SSD(s)). For storage devices that are HDDs,
NVM 156 may include ferromagnetic material. For storage devices that are SSDs,NVM 156 may include, but is not limited to, magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM, NAND flash memory, etc. -
Host interface 152 is configured to receive commands and/or data fromcomputing device 102 and/or storagesystem control logic 106 and to transmit responses (i.e., replies) and/or data tocomputing device 102 and/or storagesystem control logic 106.Host interface 152 may be configured to couplestorage device 110 a to storagesystem control logic 106 and/orhost device 102.Host interface 152 may comply and/or be compatible with one or more interface and/or interconnect protocol(s), as described herein.Host interface 152 may be further configured to decode received commands and/or data and to provide the decoded commands and/or data todevice control logic 150.Host interface 152 may be further configured to encode data and/or response(s) fromdevice control logic 150 to a format (and/or syntax) that complies and/or is compatible with the interface and/or interconnect protocol(s).Cache 154 is configured to store commands and/or data. Commands and/or data may be stored incache 154 prior to decoding and/or prior to encoding. Received data, e.g., data received fromcomputing device 102, may be stored incache 154 prior to being written toNVM 156. Thus,host interface 152 may provide an interface, interconnect and/or communication betweenstorage device 110 a andcomputing device 102 and/or storagesystem control logic 106. - Storage
system control logic 106 may be resident instorage system 104, may be coupled betweencomputing device 102 andstorage system 104 and/or may be resident incomputing device 102. Storagesystem control logic 106 is configured to manage memory access operations (e.g., read and/or write operations) betweencomputing device 102, storage system 104 (andstorage devices system control logic 106 is configured toabstract storage devices computing device 102. For example,storage system 104 may correspond to a RAID system and storagesystem control logic 106 may then correspond to a RAID controller. - Storage
system control logic 106 is configured to receive read and/or write commands and associated host LBA(s) from computing device 102 (e.g., from file system 132) and to determine, select and/or identify appropriate storage device(s) 110 a, 110 b, . . . , and/or 110 m based, at least in part, on the host LBA(s). Storagesystem control logic 106 may be configured to map each host LBA to one or more respective device LBA(s). In a redundant storage system, one host LBA may be mapped to device LBAs associated with more than one storage device. Storagesystem control logic 106 is further configured to receive user data associated with the host LBA(s). Storagesystem control logic 106 may receive read data from one or more storage device(s) 110 a, 110 b, . . . , 110 m and provide the read data tocomputing device 102. - In write operations, storage
system control logic 106 may be configured to determine and/or select appropriate storage device(s) 110 a, 110 b, . . . , and/or 110 m based, at least in part, on the host LBA(s). For example, storagesystem control logic 106 may be configured to distribute user data associated with a write command across a plurality of thestorage devices system control logic 106 may be configured to select a plurality ofstorage devices system control logic 106 may be further configured to relate each subset of host LBA(s) to a respective plurality of device LBA(s) for each storage device. Storagesystem control logic 106 may be further configured to determine parity information, e.g., a checksum, based, at least in part, on data to be written, and to store the parity information to device LBA(s) of one or more of the storage device(s) 110 a, 110 b, . . . , and/or 110 m. In another example, storagesystem control logic 106 may be configured to select one storage device, e.g.,storage device 110 a, to associate with the host LBAs and to receive the user data. Storagesystem control logic 106 may be further configured to determine parity information based, at least in part, on data to be written, and to store the parity information to another of thestorage devices 110 b, . . . , or 110 m. In both examples, storagesystem control logic 106 may be configured to provide write command(s), data (e.g., user data and/or parity information) and associated device LBA(s) to selected storage device(s) 110 a, 110 b, . . . , and/or 110 m. - In read operations, storage
system control logic 106 is configured to identify appropriate storage device(s) 110 a, 110 b , . . . , and/or 110 m and to provide read command(s) and associated device LBA(s) to identified storage device(s) 110 a, 110 b , . . . , and/or 110 m based, at least in part, on host LBA(s). Storagesystem control logic 106 may be further configured to receive the read user data from the identified storage device(s) 110 a, 110 b, . . . , and/or 110 m and to provide the read data tocomputing device 102. Storagesystem control logic 106 may be configured to verify that read data is not corrupted based, at least in part, on parity and/or redundant information associated with the read data. Storagesystem control logic 106 may be further configured to correct corrupted data, if possible, based, at least in part, on the parity and/or redundant information. - Storage
system control logic 106 is configured to detect failure of one or more ofstorage devices system control logic 106 may be further configured to initiate rebuild operations in response to a detected failure. Storagesystem control logic 106 may be configured to communicate the failure tocomputing device 102.Computing device 102 and/orOS 130 may be configured to notify, e.g., a user via peripheral device(s) 140, that a storage device has failed. The notice is configured to prompt the user to replace the failed storage device with a replacement storage device. For example, storagesystem control logic 106 may be configured to receive an indication from a user viacomputing device 102 and/or peripheral device(s) 140 that a replacement storage device has been coupled to storagesystem control logic 106. In another example, storagesystem control logic 106 may be configured to detect that a replacement storage device has been coupled to storagesystem control logic 106 in place of the failed storage device. In another example,storage system 104 may include a replacement storage device configured to be utilized if a storage device fails. The replacement storage device may exist instorage system 104 prior to a failure of a storage device and/or may be added by the user after the failure is detected. - Generally, when rebuild operations are initiated, the failed storage device may be storing less than a maximum storage capacity of the failed storage device. In other words, fewer device LBA(s) than the span (i.e., the range of device LBAs that may store user data) of the failed storage device may be mapped. Rebuild operations may be accelerated by building mapped portions of the failed storage device rather than the entire span of the failed storage device.
- In an embodiment, storage
system control logic 106 may be configured to request respective state(s) of target device LBA(s) from eachstorage device system control logic 106 may be configured to identify the target storage device(s) based, at least in part, on a configuration ofstorage system 104. Storagesystem control logic 106 may be further configured to identify target device LBA(s) based, at least in part, on the configuration ofstorage system 104. For example, in a mirrored configuration, one target storage device may include mapped LBA(s) related to the failed storage device. In another example, in a configuration that implements striping (e.g., RAID), a plurality of storage devices may include mapped device LBAs related to the failed storage device. The plurality of storage devices may be included in a storage pool. Storagesystem control logic 106 may be configured to identify the plurality of target storage devices based, at least in part, on the storage pool. - Storage
system control logic 106 may be configured to identify the target storage device(s) and request the state(s) of target device LBA(s) in response to an indication that a storage device has failed. For example,storage system 104 may be configured to provide redundancy by mirroring stored data on a first storage device and a second storage device (e.g., RAID 1). Storagesystem control logic 106 may then be configured to request respective state(s) of target device LBA(s) from the second storage device if the first storage device fails. In this example, rebuild operations may then include copying data associated with the mapped device LBA(s) from the target storage device (i.e., the second storage device) to the replacement storage device. - In another example,
storage system 104 may be configured to provide fault tolerance by “striping” blocks of data across a plurality of storage devices (i.e., storage pool) and distributing parity information across the plurality of storage devices (e.g., RAID 5). A stripe of data includes segments of logically sequential data that are distributed across a plurality of storage devices, i.e., a storage pool. Storagesystem control logic 106 may be configured to identify the plurality of storage devices in the storage pool. Storagesystem control logic 106 may then be configured to request respective state(s) of target device LBA(s) from each of the plurality of storage devices (except the failed storage device) in the storage pool. The state(s) of target device LBA(s) may be requested for a stripe of data. Whether to read and write the target device LBAs in the stripe may then be determined for the plurality of target storage devices in the storage pool. In this example, rebuild operations may include determining and/or verifying values of stored data based, at least in part, on the parity information. Stripes of unmapped target device LBAs may not be read nor written. - In some embodiments,
file system 132 may be configured to provide a trim (TRIM) command and associated host LBA(s) 162 to storagesystem control logic 106. TRIM is configured to communicate host LBAs tostorage system 104 that have been unmapped by thefile system 132. For example, host LBAs may be unmapped in response to a permanent deletion of a file, emptying a “recycle bin” and/or performing a “quick format”, e.g., in a Microsoft® Windows® OS. TRIM is configured to optimize garbage collection and erase operations in SSDs that erase blocks of storage elements prior to writing new data to the storage elements. - In these embodiments, storage
system control logic 106 may be configured to send aTRIM request 160 tocomputing device 102 in response to a storage device failure. TheTRIM request 160 is configured to trigger transmission of a TRIM command and associated host LBA(s) 162 fromfile system 132 tostorage system 104. The TRIM command may comply and/or be compatible with one or more interface and/or interconnect protocols, as described herein. For example, the TRIM command may correspond to an ATA DATA SET MANAGEMENT command, a SCSI (and/or SAS) UNMAP command and/or an NVMe DEALLOCATE command, as described herein. Storagesystem control logic 106 may receive the TRIM command and associated host LBAs 162 and may provide a corresponding TRIM command and associated device LBAs 164 to the target storage device(s). For example, a host interface of each target storage device, e.g.,host interface 152 ofstorage device 110 a, may be configured to receive command(s), data and/or device LBA(s) formatted according to a respective interface and/or interconnect protocol and to provide the command(s), data and/or device LBA(s) to, e.g.,device control logic 150. Device control logic of each target storage device, e.g.,device control logic 150, may then be configured to unmap the device LBA(s) that correspond to received TRIM command. Thus, in response to the TRIM command the target storage device(s) may have a most up-to-date mapping (and unmapping) of device LBA(s). - Storage
system control logic 106 is configured to send a request for state of target device LBA(s) command (LBA state request command) 166 to each target storage device, e.g.,storage device 110 a, associated with the failed storage device. In an embodiment, the LBA state request command may include one or more target device LBA(s). In another embodiment, the LBA state request command may request a listing of mapped device LBA(s) and/or a listing of unmapped device LBA(s). Storagesystem control logic 106 may send the LBAstate request command 166 in response to a failure of a storage device, e.g.,storage device 110 b. For example, the LBAstate request command 166 may be sent without first sending theTRIM request command 160. In another example, the LBAstate request command 166 may be sent after sending theTRIM request command 160 and/or after receiving the TRIM command and associated host LBA(s) 162. - The LBA
state request command 166 may correspond to and/or be compatible with one or more of the interface and/or interconnect protocols, as described herein. In some embodiments, the LBAstate request command 166 may have an associated syntax that is specific to a selected storage device, e.g.,storage device 110 a. A specific syntax may be configured to provide a level of security by limiting use of the command to selected storage devices (e.g., vendor-specific). The specific syntax may be further configured to provide a vendor-specific command. For example, a vendor-specific command may simplify a command structure and/or facilitate device testing. - Each target storage device, e.g.,
storage device 110 a, may be configured to provide a response 168 (i.e., reply 168) to storagesystem control logic 106 that includes state indicator(s) corresponding to the requested state(s) of the target device LBA(s). The repl(ies) 168 may comply and/or be compatible with one or more interface and/or interconnect protocols. For example,host interface 152 may be configured to receive the LBAstate request command 166, to decode thecommand 166 and to provide the decoded request todevice control logic 150.Device control logic 150 may be configured to identify the state(s) of target device LBA(s) based, at least in part, on map table 158. For example, map table 158 may relate device LBA(s) toNVM 156 PBA(s) and may further include a state indicator, e.g., a flag, that corresponds to a state of an associated device LBA. For example, the flag may be a bit and may have a value of one (e.g., TRUE) for a target device LBA that is mapped and a value of zero (e.g., FALSE) for a target device LBA that is unmapped.Device control logic 150 may be configured to read map table 158 and the associated indicators to identify the state(s) of target device LBA(s).Device control logic 150 may then be configured to provide the state(s) tohost interface 152.Host interface 152 may then generate thereply 168 that includes the state(s) of target device LBA(s) and to transmit the response to storagesystem control logic 106. Responses that include the states of a plurality of target device LBA(s) may be configured to relate each state indicator to its respective target device LBA. Thereply 168 may comply and/or be compatible with one or more interface and/or interconnect protocols, as described herein. - In an embodiment, storage
system control logic 106 may be configured to include a target device LBA with the request for mapped and/or unmapped device LBA(s)command 166. In this embodiment, each target storage device is configured to determine a state of the target device LBA and to provide the resulting state indicator to storagesystem control logic 106 inreply 168. For example, the LBAstate request command 166 that includes the target device LBA may be sent during rebuild, e.g., prior to building each target device LBA. - In another embodiment, storage
system control logic 106 may be configured to include a range of target device LBAs with the LBAstate request command 166. In this embodiment, each target storage device may be configured to determine a respective state of each device LBA included in the range of target device LBA(s) and to provide the resulting state indicator(s) to storagesystem control logic 106 inreply 168. For example, the LBAstate request command 166 that includes the range of target device LBAs may be sent prior to and/or during rebuild, e.g., prior to building the target device LBAs. - In another embodiment, storage
system control logic 106 may be configured to request a listing of mapped device LBA(s) and/or a listing of unmapped device LBA(s). In this embodiment, each target storage device may be configured to provide device LBA(s) and associated state indicators from map table 158 in theirrespective replies 168. For example, the LBAstate request command 166 that includes the request for the listing(s) may be sent prior to rebuild. - Thus, the storage
system control logic 106 may request state(s) of target device LBA(s) from target storage device(s) in response to detection of a failed storage device. The target storage device(s) may then provide the state indicator(s) of the target device LBA(s) to the storagesystem control logic 106 in response to the request. The storagesystem control logic 106 may then be configured to identify a respective state of each target device LBA. For example, storagesystem control logic 106 may be configured to perform a logical OR operation (e.g., exclusive OR (XOR)) of the state indicator(s) and a test value to identify the respective state(s). - Storage
system control logic 106 may then be configured to build a replacement storage device, e.g.,storage device 110 a, configured to replace the failed storage device, e.g.,storage device 110 b. For example, storagesystem control logic 106 may be configured to send a read command that includes mapped device LBA(s) to the target storage device(s) and to write the read data to corresponding device LBA(s) of thereplacement storage device 110 a. Unmapped device LBA(s) may not be read and may not be written. The data may include user data and/or parity information, as described herein. Storagesystem control logic 106 may be further configured to update parity information associated with thereplacement storage device 110 a. Thus, a rebuild of the failedstorage device 110 b may be accelerated as described herein by reading and writing mapped device LBA(s) and not reading and not writing contents of unmapped device LBA(s). The rebuild may be performed independent of a file system, e.g.,file system 130, file structure and/or partitioning related tocomputing device 102. -
FIG. 2 is aflowchart 200 of accelerated data recovery operations according to various embodiments of the present disclosure. In particular,flowchart 200 illustrates acquiring state(s) of target device LBA(s) associated with a failed storage device and building a replacement storage device by writing the contents of the mapped device LBA(s) to the replacement storage device. The operations may be performed, for example, by storagesystem control logic 106 ofFIG. 1 . - Operations of this embodiment may begin with a
storage device failure 202. In some embodiments, a TRIM command may be requested atoperation 204. For example, the TRIM command may be requested from a file system included in a computing device, e.g.,file system 132 ofcomputing device 102. Target storage device(s) may be identified atoperation 206. Target storage device(s) may be storing data related to the failed storage device. For example, in a mirrored system, one failed storage device may correspond to one target storage device. In another example, in a striped system, one failed storage device may be related to a plurality of target storage devices. Target device LBA(s) may be identified atoperation 208. A respective state of each identified target device LBA may be requested atoperation 210. For example, the respective state(s) may be requested from target storage device(s) that include data related to data stored on the failed storage device. Requested state indicator(s) may be received atoperation 212. Data associated with mapped device LBA(s) may be read from the target storage device(s) and written to at least one replacement storage device atoperation 214. For example, data associated with the mapped device LBA(s) may include user data and/or parity information. Program flow may then continue atoperation 216. - Thus, a rebuild of a failed storage device may be accelerated by copying data associated with mapped device LBA(s) from target storage device(s) (i.e., reading) to the replacement storage device (i.e., writing). Whether a device LBA is mapped or unmapped may be determined based, at least in part, on LBA state information received from the target storage device(s).
-
FIG. 3 is aflowchart 300 of storage device operations according to various embodiments of the present disclosure. In particular,flowchart 300 illustrates unmapping device LBA(s) in response to a TRIM command and providing respective state(s) of target device LBA(s) in response to a request for them. The operations may be performed by a storage device, for example, one or more of storage device(s) 110 a, 110 b, . . . , and/or 110 m ofFIG. 1 . - Operations of this embodiment may begin with
start 302. In some embodiments, a TRIM command may be received atoperation 304. The TRIM command is configured to identify device LBA(s) that correspond to host LBA(s) that have been unmapped by, e.g., a file system. Device LBA(s) associated with the TRIM command may be unmapped at operation 306. Whetheroperations 304 and 306 are performed may depend on whether an OS, e.g.,OS 130, includes a TRIM command. - Operations of this embodiment include receiving a request for state(s) of target device LBA(s) at
operation 308. One or more state(s) of target device LBA(s) may be determined in response to the request atoperation 310. For example, state(s) of target device LBA(s) may be determined based, at least in part, on a map table, e.g., map table 158 ofFIG. 1 .Operation 312 includes providing a reply that includes a respective state indicator related to the state of each target device LBA. For example, the response may be provided to storage system control logic. Program flow may end atoperation 314. Thus, a storage device may provide respective state indicator(s) of one or more target device LBA(s) in response to a request from, e.g., storage system control logic. - While the flowcharts of
FIGS. 2 and 3 illustrate operations according various embodiments, it is to be understood that not all of the operations depicted inFIGS. 2 and/or 3 are necessary for other embodiments. In addition, it is fully contemplated herein that in other embodiments of the present disclosure, the operations depicted inFIGS. 2 and/or 3 , and/or other operations described herein may be combined in a manner not specifically shown in any of the drawings, and such embodiments may include less or more operations than are illustrated inFIGS. 2 and/or 3 . Thus, claims directed to features and/or operations that are not exactly shown in one drawing are deemed within the scope and content of the present disclosure. -
FIGS. 4A and 4B illustrate one example of accelerated data recovery in a RAID level 1 (i.e., mirrored) system.FIG. 4A illustrates an example 400 ofNVM RAID 1 that includes two storage devices (Device 0 and Device 1) configured to mirror stored data. In other words, theNVM NVM Device 1 has failed. Example 400 further includes a replacement device (Device R) that is configured to replace failedDevice 1. Device R includesNVM 406 that includes device LBA(s) A1, A2, . . . , AY that are unmapped. In other words the span ofNVM 406 is unmapped. A user may remove failedDevice 1 and replaceDevice 1 with Device R. -
FIG. 4B illustrates the example 400′ ofFIG. 4A after the failedDevice 1 has been replaced with Device R. Storage system control logic, e.g., storagesystem control logic 106 ofFIG. 1 , is configured to request state(s) of target device LBA(s) fromDevice 0, for example, by sending a LBA state request command toDevice 0, as described herein.Device 0 is configured to receive the request and to transmit respective state indicator(s) of device LBA(s), A1, A2, . . . , AN (i.e., mapped), to the storage system control logic in response to receiving the LBA state request command, as described herein. The storage system control logic may then be configured to copy the data associated with mapped device LBA(s) A1, A2, . . . , AN fromNVM 402 toNVM 406. Device LBA(s) AN+1, . . . , AY ofNVM 406 may remain unmapped. Thus, rebuilding a failed storage device may be accelerated, as described herein, compared to copying the full span. - Turning again to
FIG. 1 ,OS 130 may be configured to managesystem 100 resources and control tasks that are run on each respective device and/or system, e.g.,computing device 102 and/orstorage system 104. For example, the OS may be implemented using Microsoft® Windows®, HP-UX®, Linux®, or UNIX®, although other operating systems may be used. In some embodiments, theOS 130 may be replaced by a virtual machine monitor (or hypervisor) which may provide a layer of abstraction for underlying hardware to various operating systems (virtual machines) running on one or more processing units. -
Memory 126 may include one or more of the following types of memory: semiconductor firmware memory, programmable memory, non-volatile memory, read only memory, electrically programmable memory, random access memory, flash memory, magnetic disk memory, and/or optical disk memory. Either additionally or alternatively device memory may include other and/or later-developed types of computer-readable memory. - Embodiments of the operations described herein may be implemented in a computer-readable storage device having stored thereon instructions that when executed by one or more processors perform the methods. The processor may include, for example, a processing unit and/or programmable circuitry. The storage device may include a machine readable storage device including any type of tangible, non-transitory storage device, for example, any type of disk including floppy disks, optical disks, compact disk read-only memories (CD-ROMs), compact disk rewritables (CD-RWs), and magneto-optical disks, semiconductor devices such as read-only memories (ROMs), random access memories (RAMs) such as dynamic and static RAMs, erasable programmable read-only memories (EPROMs), electrically erasable programmable read-only memories (EEPROMs), flash memories, magnetic or optical cards, or any type of storage devices suitable for storing electronic instructions.
- As used in any embodiment herein, the term “logic” may refer to an app, software, firmware and/or circuitry configured to perform any of the aforementioned operations. Software may be embodied as a software package, code, instructions, instruction sets and/or data recorded on non-transitory computer readable storage medium. Firmware may be embodied as code, instructions or instruction sets and/or data that are hard-coded (e.g., nonvolatile) in memory devices.
- “Circuitry”, as used in any embodiment herein, may comprise, for example, singly or in any combination, hardwired circuitry, programmable circuitry such as computer processors comprising one or more individual instruction processing cores, state machine circuitry, and/or firmware that stores instructions executed by programmable circuitry. The logic may, collectively or individually, be embodied as circuitry that forms part of a larger system, for example, an integrated circuit (IC), an application-specific integrated circuit (ASIC), a system on-chip (SoC), desktop computers, laptop computers, tablet computers, servers, smart phones, etc.
-
Computing device 102 andstorage system 104 may be configured to communicate with each other via one or more interface and/or interconnect protocols. Storagesystem control logic 106 and storage device(s) 110 a, 110 b, . . . , and/or 110 m may be configured to communicate with each other via one or more interface and/or interconnect protocols. - In one example, the interface and/or interconnect protocol may comply and/or be compatible with PCIe® (Peripheral Component Interconnect Express) specification, titled: PCI Express ® Base Specification, Revision 3.0, released by PCI-SIG® (PCI Special Interest Group), Beaverton, Oreg., November 2010, and/or later versions of this specification, for example, version 3.1.
- In another example, the interface and/or interconnect protocol may comply and/or be compatible with SAS (Serial Attached SCSI (Small Computer System Interface)) standard SAS-3, titled: Serial Attached SCSI-3 (SAS-3) 2212-D ANSI-approved, published by American National Standards Institute (ANSI) InterNational Committee for Information Technology Standards (INCITS), November, 2013, and/or later versions of this standard.
- In another example, the interface and/or interconnect protocol may comply and/or be compatible with ATA (Advanced Technology Attachment) standard, document number: INCITS 452-2009/AM1-2010, titled: Information Technology-AT Attachment 8-ATA/ATAPI Command Set (ATA8-ACS)
Amendment 1, published in 2010 by ANSI INCITS, and/or related and/or later versions of this standard, for example, document number: INCITS 482-2012, titled: Information Technology-ATA/ATAPI Command Set-2 (ACS-2), published in 2012, and/or later and/or related versions of this standard. - In another example, the interface and/or interconnect protocol may comply and/or be compatible with SATA (Serial ATA (Advanced Technology Attachment)) specification, titled: Serial ATA Specification, Revision 3.2, released August 2013, by SATA International Organization (SATA-IO), Beaverton, Oreg., and/or earlier and/or later versions of this specification.
- In another example, the interface and/or interconnect protocol may comply and/or be compatible with NVMe (Non-Volatile Memory Host Controller Interface Express) specification titled: NVM Express™, Revision 1.2, released Nov. 3, 2014 by NVM Express™ Work Group, and/or earlier and/or later versions of this specification.
- In some embodiments, a hardware description language (HDL) may be used to specify circuit and/or logic implementation(s) for the various logic and/or circuitry described herein. For example, in one embodiment the hardware description language may comply or be compatible with a very high speed integrated circuits (VHSIC) hardware description language (VHDL) that may enable semiconductor fabrication of one or more circuits and/or logic described herein. The VHDL may comply or be compatible with IEEE Standard 1076-1987, IEEE Standard 1076.2, IEEE1076.1, IEEE Draft 3.0 of VHDL-2006, IEEE Draft 4.0 of VHDL-2008 and/or other versions of the IEEE VHDL standards and/or other hardware description standards.
- Thus, consistent with the teachings of the present disclosure, a system and method are configured to provide an accelerated rebuild of a replacement storage device by reading and writing (e.g., copying) data associated with mapped device LBA(s). In other words, rather than copying all device LBAs associated with the failed storage device to the replacement storage device, the data associated with device LBA(s) determined to be mapped may be copied. Thus, a time duration associated with rebuilding the failed storage device may be reduced relative to copying an entire span of a storage device. By requesting state(s) of target device LBA(s) from the storage device(s), such rebuilds may be performed independent of existence and/or type of file system that may be resident on a host computing device coupled to the storage system.
- Examples of the present disclosure include subject material such as a method, means for performing acts of the method, a device, or of an apparatus or computer-readable storage device related to accelerated data recovery in a storage system, as discussed below.
- According to this example there is provided a storage system. The storage system includes storage system control logic. The storage system control logic is to identify at least one target storage device in response to detection of a failed storage device, request a state of a target device logical block address (LBA) from each of the at least one target storage device, and read data associated with a mapped device LBA from each target storage device and write the data to at least one replacement storage device.
- This example includes the elements of example 1, wherein the storage system control logic is further to request a trim command (TRIM) from a computing device to be provided to each target storage device.
- This example includes the elements of example 1, wherein the storage system control logic is further to identify each target device LBA.
- This example includes the elements of example 1, wherein the storage system control logic is further to request from each target storage device a respective state of each target device LBA of a range of target device LBAs.
- This example includes the elements of example 1, wherein the storage system control logic is further to request a listing of mapped device LBAs from each target storage device.
- This example includes the elements of example 1, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 1 through 6, wherein the storage system control logic is to request the state of each target device LBA at least one of prior to and/or during a rebuild of the failed storage device.
- This example includes the elements according to any one of examples 1 through 6, wherein the request includes a respective target device LBA and the storage system control logic is to receive an indicator related to the state of the respective target device LBA.
- This example includes the elements according to any one of examples 1 through 6, wherein the request includes a range of target device LBAs and the storage system control logic is to receive indicators related to the respective state of each target device LBA in the range of target device LBAs.
- This example includes the elements according to any one of examples 1 through 6, wherein the storage system control logic is further to determine parity information based, at least in part, on the data associated with the mapped device LBA.
- According to this example there is provided a storage device. The storage device includes device control logic to determine a state of a target device logical block address (LBA) in response to a request; a host interface to provide a reply to the request, the reply including a state indicator related to the state of the target device LBA; a map table including a plurality of device LBAs and respective state indicators; and non-volatile memory (NVM) including data related to at least one mapped LBA.
- This example includes the elements of example 11, wherein the device control logic is to unmap a device LBA in response to a receiving trim command (TRIM).
- This example includes the elements of example 11, wherein the device control logic is further to determine a respective state of each target device LBA of a range of target device LBAs in response to the request.
- This example includes the elements of example 11, wherein the device control logic is further to determine a respective state of each device LBA of the plurality of device LBAs in response to the request.
- This example includes the elements of example 14, wherein the reply includes at least a portion of the plurality of device LBAs and a respective state of each device LBA in the portion is mapped.
- This example includes the elements of example 11, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 11 through 16, wherein the host interface is compatible with at least one of PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA) and/or NVMe (Non-Volatile Memory
- Host Controller Interface Express).
- This example includes the elements according to any one of examples 11 through 16, wherein the NVM includes one or more of magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM and/or NAND flash memory.
- This example includes the elements according to any one of examples 11 through 16, wherein the host interface is to couple the storage device to at least one of a storage system controller logic and/or a computing device.
- This example includes the elements according to any one of examples 11 through 16, wherein the request is received from and the reply is provided to storage system control logic.
- According to this example there is provided a method. The method includes identifying, by storage system control logic, at least one target storage device in response to detection of a failed storage device; requesting, by storage system control logic, a state of a target device logical block address (LBA) from each of the at least one target storage device; reading, by storage system control logic, data associated with a mapped device LBA from each target storage device; and writing, by storage system control logic, the data to at least one replacement storage device.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, a trim command (TRIM) from a computing device to be provided to each target storage device.
- This example includes the elements of example 21 and further includes identifying, by storage system control logic, each target device LBA.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, from each target storage device a respective state of each target device LBA of a range of target device LBAs.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, a listing of mapped device LBAs from each target storage device.
- This example includes the elements of example 21, wherein the state corresponds to mapped or unmapped.
- This example includes the elements of example 21 and further includes requesting, by storage system control logic, the state of each target device LBA at least one of prior to and/or during a rebuild of the failed storage device.
- This example includes the elements of example 21, wherein the request includes a respective target device LBA and further including receiving, by storage system control logic, an indicator related to the state of the respective target device LBA.
- This example includes the elements of example 21, wherein the request includes a range of target device LBAs and further including receiving, by storage system control logic, indicators related to the respective state of each target device LBA in the range of target device LBAs.
- This example includes the elements of example 21 and further includes determining, by storage system control logic, parity information based, at least in part, on the data associated with the mapped device LBA.
- According to this example there is provided a method. The method includes determining, by device control logic, a state of a target device logical block address (LBA) in response to a request based, at least in part, on a map table including a plurality of device LBAs and respective state indicators, the map table related to a non-volatile memory (NVM) that includes data related to at least one mapped LBA; and providing, by a host interface, a reply to the request, the reply including a state indicator related to the state of the target device LBA.
- This example includes the elements of example 31 and further includes unmapping, by the device control logic, a device LBA in response to a receiving trim command (TRIM).
- This example includes the elements of example 31 and further includes determining, by the device control logic, a respective state of each target device LBA of a range of target device LBAs in response to the request.
- This example includes the elements of example 31 and further includes determining, by the device control logic, a respective state of each device LBA of the plurality of device LBAs in response to the request.
- This example includes the elements of example 34, wherein the reply includes at least a portion of the plurality of device LBAs and a respective state of each device LBA in the portion is mapped.
- This example includes the elements of example 31, wherein the state corresponds to mapped or unmapped.
- This example includes the elements of example 31, wherein the reply is compatible with at least one of PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA) and/or NVMe (Non-Volatile Memory Host Controller Interface Express).
- This example includes the elements of example 31, wherein the NVM includes one or more of magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM and/or NAND flash memory.
- This example includes the elements of example 31 and further includes coupling, by the host interface, a storage device to at least one of a storage system controller logic and/or a computing device.
- This example includes the elements of example 31, wherein the request is received from and the reply is provided to storage system control logic.
- According to this example there is provided a computer-readable storage device having stored thereon instructions that when executed by one or more processors result in the following operations including identifying at least one target storage device in response to detection of a failed storage device; requesting a state of a target device logical block address (LBA) from each of the at least one target storage device; reading data associated with a mapped device LBA from each target storage device; and writing the data to at least one replacement storage device.
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting a trim command (TRIM) from a computing device to be provided to each target storage device.
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including identifying each target device LBA.
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting from each target storage device a respective state of each target device LBA of a range of target device LBAs.
- This example includes the elements of example 41, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting a listing of mapped device LBAs from each target storage device.
- This example includes the elements of example 41, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 41 through 46, wherein the instructions that when executed by one or more processors results in the following additional operations including requesting the state of each target device LBA at least one of prior to and/or during a rebuild of the failed storage device.
- This example includes the elements according to any one of examples 41 through 46, wherein the request includes a respective target device LBA and the instructions that when executed by one or more processors results in the following additional operations including receiving an indicator related to the state of the respective target device LBA.
- This example includes the elements according to any one of examples 41 through 46, wherein the request includes a range of target device LBAs and the instructions that when executed by one or more processors results in the following additional operations including receiving indicators related to the respective state of each target device LBA in the range of target device LBAs.
- This example includes the elements according to any one of examples 41 through 46, wherein the instructions that when executed by one or more processors results in the following additional operations including determining parity information based, at least in part, on the data associated with the mapped device LBA.
- According to this example there is provided a computer-readable storage device having stored thereon instructions that when executed by one or more processors result in the following operations including determining a state of a target device logical block address (LBA) in response to a request, based, at least in part, on a map table including a plurality of device LBAs and respective state indicators, the map table related to a non-volatile memory (NVM) that includes data related to at least one mapped LBA; and providing a reply to the request, the reply including a state indicator related to the state of the target device LBA.
- This example includes the elements of example 51, wherein the instructions that when executed by one or more processors results in the following additional operations including unmapping a device LBA in response to a receiving trim command (TRIM).
- This example includes the elements of example 51, wherein the instructions that when executed by one or more processors results in the following additional operations including determining a respective state of each target device LBA of a range of target device LBAs in response to the request.
- This example includes the elements of example 51, wherein the instructions that when executed by one or more processors results in the following additional operations including determining a respective state of each device LBA of the plurality of device LBAs in response to the request.
- This example includes the elements of example 54, wherein the reply includes at least a portion of the plurality of device LBAs and a respective state of each device LBA in the portion is mapped.
- This example includes the elements of example 51, wherein the state corresponds to mapped or unmapped.
- This example includes the elements according to any one of examples 51 through 56, wherein the reply is compatible with at least one of PCIe® (Peripheral Component Interconnect Express), SAS (Serial Attached SCSI (Small Computer System Interface)), ATA (Advanced Technology Attachment), SATA (Serial ATA) and/or NVMe (Non-Volatile Memory Host Controller Interface Express).
- This example includes the elements according to any one of examples 51 through 56, wherein the NVM includes one or more of magnetoresistive random access memory (MRAM), phase change memory (PCM, PRAM), three dimensional crosspoint memory, resistive memory, ferroelectric memory (F-RAM, FeRAM), spin-transfer torque memory (STT), thermal assisted switching memory (TAS), millipede memory, floating junction gate memory (FJG RAM), magnetic tunnel junction (MTJ) memory, electrochemical cells (ECM) memory, binary oxide filament cell memory, interfacial switching memory, battery-backed RAM and/or NAND flash memory.
- This example includes the elements according to any one of examples 51 through 56, wherein the instructions that when executed by one or more processors results in the following additional operations including coupling a storage device to at least one of a storage system controller logic and/or a computing device.
- This example includes the elements according to any one of examples 51 through 56, wherein the request is received from and the reply is provided to storage system control logic.
- According to this example there is computer-readable storage device having stored thereon instructions that when executed by one or more processors result in the following operations including the method according to any one of examples 21 to 30.
- According to this example there is computer-readable storage device having stored thereon instructions that when executed by one or more processors result in the following operations including the method according to any one of examples 31 to 40.
- Another example of the present disclosure is a system including at least one device arranged to perform the method of any one of examples 21 to 30.
- Another example of the present disclosure is a system including at least one device arranged to perform the method of any one of examples 31 to 40.
- Another example of the present disclosure is a device including means to perform the method of any one of examples 21 to 30.
- Another example of the present disclosure is a device including means to perform the method of any one of examples 31 to 40.
- The terms and expressions which have been employed herein are used as terms of description and not of limitation, and there is no intention, in the use of such terms and expressions, of excluding any equivalents of the features shown and described (or portions thereof), and it is recognized that various modifications are possible within the scope of the claims. Accordingly, the claims are intended to cover all such equivalents.
- Various features, aspects, and embodiments have been described herein. The features, aspects, and embodiments are susceptible to combination with one another as well as to variation and modification, as will be understood by those having skill in the art. The present disclosure should, therefore, be considered to encompass such combinations, variations, and modifications.
Claims (26)
1. -24. (canceled)
25. One or more non-transitory computer-readable storage media having instructions stored thereon that when executed by at least one processor to cause the at least one processor of a first computing device to perform operations, comprising:
detect a failed storage device in a plurality of storage devices;
identify at least one target storage device of the plurality of storage devices;
identify mapped logical block addresses (LBAs) of the at least one target storage device, the at least one target storage device to have associated mapped and unmapped LBAs;
read data associated with mapped LBAs, and not read data associated with unmapped LBAs, in response to the detection of the failed storage device, from the at least one target storage device;
write the data associated with mapped LBAs, and not write the data associated with unmapped LBAs, in response to the detection of the failed storage device, to at least one replacement storage device;
wherein at least one of the unmapped LBAs is an LBA that has not been written to or has been subject to an UNMAP command and has not been written to since the UNMAP command.
26. The one or more non-transitory computer-readable storage media of claim 25 , wherein at least one of the mapped LBAs is an LBA that has been subject to a write operation and has not been subject to an UNMAP command since the write operation.
27. The one or more non-transitory computer-readable storage media of claim 25 , wherein the UNMAP command is a TRIM command.
28. The one or more non-transitory computer-readable storage media of claim 25 , wherein the UNMAP command is a Small Computer System Interface (SCSI) command.
29. The one or more non-transitory computer-readable storage media of claim 25 , wherein the at least one target storage device is at least one solid state drive (SSD).
30. The one or more non-transitory computer-readable storage media of claim 25 , wherein the instructions, when executed by the at least one processor of the first computing device, result in additional operations comprising:
request an UNMAP command from a computing device to be provided to at least one target storage device.
31. The one or more non-transitory computer-readable storage media of claim 25 , wherein the instructions, when executed by the at least one processor of the first computing device, result in additional operations comprising:
identify a state of at least one LBA in the at least one target storage device, said state being mapped or unmapped.
32. The one or more non-transitory computer-readable storage media of claim 31 , wherein the instructions, when executed by the at least one processor of the first computing device, result in additional operations comprising:
identify a state of at least one target storage device LBA in a range of target storage device LBAs, said state being mapped or unmapped.
33. The one or more non-transitory computer-readable storage media of claim 32 , wherein the instructions, when executed by the at least one processor of the first computing device, result in additional operations comprising:
cause device control logic of the at least one storage device to provide a listing of mapped LBAs in at least one target storage device.
34. The one or more non-transitory computer-readable storage media of claim 33 , wherein the device control logic is further to unmap a device LBA in response to an UNMAP command.
35. A device, comprising:
storage device interface circuitry to communicate with a plurality of storage devices; and
storage system control logic to:
detect a failed storage device in the plurality of storage devices;
identify at least one target storage device of the plurality of storage devices;
identify mapped logical block addresses (LBAs) of the at least one target storage device, the at least one target storage device to have associated mapped and unmapped LBAs;
read data associated with mapped LBAs, and not read data associated with unmapped LBAs, in response to the detection of the failed storage device, from the at least one target storage device;
write the data associated with mapped LBAs, and not write the data associated with unmapped LBAs, in response to the detection of the failed storage device, to at least one replacement storage device; and
wherein at least one of the unmapped LBAs is an LBA that has not been written to or has been subject to an UNMAP command and has not been written to since the UNMAP command.
36. The device of claim 35 , wherein at least one of the mapped LBAs is an LBA that has been subject to a write operation and has not been subject to an UNMAP command since the write operation.
37. The device of claim 35 , wherein the UNMAP command is a TRIM command.
38. The device of claim 35 , wherein the UNMAP command is a Small Computer System Interface (SCSI) command.
39. The device of claim 35 , wherein the at least one target storage device is at least one solid state drive (SSD).
40. The device of claim 35 , wherein the storage system control logic is further to request an UNMAP command from a computing device to be provided to at least one target storage device.
41. The device of claim 35 , wherein the storage system control logic is further to identify a state of at least one LBA in the at least one target storage device, said state being mapped or unmapped.
42. The device of claim 41 , wherein the storage system control logic is further to identify a state of at least one target storage device LBA in a range of target storage device LBAs, said state being mapped or unmapped.
43. The device of claim 42 , wherein the storage system control logic is further to cause device control logic of at least one storage device to provide a listing of mapped LBAs in at least one target storage device.
44. The device of claim 43 , wherein the storage system control logic is further to unmap a device LBA in response to an UNMAP command.
45. A system, comprising:
a storage system, including:
a plurality of storage devices; and
storage system control logic to:
detect a failed storage device in the plurality of storage devices;
identify at least one target storage device of the plurality of storage devices;
identify mapped logical block addresses (LBAs) of the at least one target storage device, the at least one target storage device to have associated mapped and unmapped LBAs;
read data associated with mapped LBAs, and not read data associated with unmapped LBAs, in response to the detection of the failed storage device, from the at least one target storage device;
write the data associated with mapped LBAs, and not write the data associated with unmapped LBAs, in response to the detection of the failed storage device, to at least one replacement storage device; and
wherein at least one of the unmapped LBAs is an LBA that has not been written to or has been subject to an UNMAP command and has not been written to since the UNMAP command; and
a computing device, including:
a processor; and
chipset circuitry to couple the processor to the storage system.
46. The system of claim 45 , further comprising a display, wherein the chipset circuitry is further to couple the processor to the display.
47. The system of claim 45 , wherein the storage system control logic is further to identify a state of at least one LBA in the at least one target storage device, said state being mapped or unmapped.
48. The system of claim 47 , wherein the storage system control logic is further to identify a state of at least one target storage device LBA in a range of target storage device LBAs, said state being mapped or unmapped.
49. The system of claim 48 , wherein the storage system control logic is further to cause device control logic of at least one storage device to provide a listing of mapped LBAs in at least one target storage device.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US15/398,651 US10289500B2 (en) | 2014-12-12 | 2017-01-04 | Accelerated data recovery in a storage system |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/568,417 US9575853B2 (en) | 2014-12-12 | 2014-12-12 | Accelerated data recovery in a storage system |
US15/398,651 US10289500B2 (en) | 2014-12-12 | 2017-01-04 | Accelerated data recovery in a storage system |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/568,417 Continuation US9575853B2 (en) | 2014-12-12 | 2014-12-12 | Accelerated data recovery in a storage system |
Publications (2)
Publication Number | Publication Date |
---|---|
US20170212814A1 true US20170212814A1 (en) | 2017-07-27 |
US10289500B2 US10289500B2 (en) | 2019-05-14 |
Family
ID=56107932
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/568,417 Active 2035-04-02 US9575853B2 (en) | 2014-12-12 | 2014-12-12 | Accelerated data recovery in a storage system |
US15/398,651 Active 2035-03-31 US10289500B2 (en) | 2014-12-12 | 2017-01-04 | Accelerated data recovery in a storage system |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US14/568,417 Active 2035-04-02 US9575853B2 (en) | 2014-12-12 | 2014-12-12 | Accelerated data recovery in a storage system |
Country Status (4)
Country | Link |
---|---|
US (2) | US9575853B2 (en) |
KR (2) | KR102502352B1 (en) |
CN (2) | CN107111535B (en) |
WO (1) | WO2016094032A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020041444A1 (en) | 2018-08-23 | 2020-02-27 | Micron Technology, Inc. | Data recovery within a memory sub-system |
US11625193B2 (en) * | 2020-07-10 | 2023-04-11 | Samsung Electronics Co., Ltd. | RAID storage device, host, and RAID system |
Families Citing this family (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10528272B2 (en) * | 2015-02-20 | 2020-01-07 | International Business Machines Corporation | RAID array systems and operations using mapping information |
US10157112B2 (en) * | 2015-09-03 | 2018-12-18 | Toshiba Memory Corporation | Storage device |
US9766965B2 (en) * | 2015-11-25 | 2017-09-19 | Salesforce.Com, Inc. | System and method for monitoring and detecting faulty storage devices |
WO2018041258A1 (en) * | 2016-09-05 | 2018-03-08 | 北京忆恒创源科技有限公司 | Method for processing de-allocation command, and storage device |
US10642720B2 (en) * | 2016-09-15 | 2020-05-05 | Talend, Inc. | Test case generator built into data-integration workflow editor |
US11461273B1 (en) * | 2016-12-20 | 2022-10-04 | Pure Storage, Inc. | Modifying storage distribution in a storage system that includes one or more storage devices |
KR102395538B1 (en) * | 2017-04-28 | 2022-05-10 | 에스케이하이닉스 주식회사 | Data storage device and operating method thereof |
CN107765999A (en) * | 2017-09-30 | 2018-03-06 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
CN107728946A (en) * | 2017-09-30 | 2018-02-23 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107728945A (en) * | 2017-09-30 | 2018-02-23 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107728944A (en) * | 2017-09-30 | 2018-02-23 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
CN107608637A (en) * | 2017-09-30 | 2018-01-19 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
CN107765998A (en) * | 2017-09-30 | 2018-03-06 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
CN107608638A (en) * | 2017-09-30 | 2018-01-19 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
CN107817949A (en) * | 2017-09-30 | 2018-03-20 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107704205A (en) * | 2017-09-30 | 2018-02-16 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
CN107748651A (en) * | 2017-09-30 | 2018-03-02 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107741830A (en) * | 2017-09-30 | 2018-02-27 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107678693A (en) * | 2017-09-30 | 2018-02-09 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107728942A (en) * | 2017-09-30 | 2018-02-23 | 深圳市华德安科技有限公司 | Hanging method, Android device and the storage medium of disk array |
CN107741829A (en) * | 2017-09-30 | 2018-02-27 | 深圳市华德安科技有限公司 | RAID management methods, device and computer-readable recording medium |
KR102423278B1 (en) | 2017-11-28 | 2022-07-21 | 에스케이하이닉스 주식회사 | Memory system and operating method thereof |
KR102586741B1 (en) * | 2018-04-23 | 2023-10-11 | 에스케이하이닉스 주식회사 | Memory system and operating method thereof |
CN110413202B (en) * | 2018-04-28 | 2024-03-08 | 伊姆西Ip控股有限责任公司 | Data replication method, apparatus and computer program product |
CN109032859A (en) * | 2018-06-26 | 2018-12-18 | 郑州云海信息技术有限公司 | A kind of NVME SSD logical block detection method, system and relevant apparatus |
CN111542803B (en) * | 2018-06-30 | 2021-10-01 | 华为技术有限公司 | NVMe-based data writing method, device and system |
US10409511B1 (en) * | 2018-06-30 | 2019-09-10 | Western Digital Technologies, Inc. | Multi-device storage system with distributed read/write processing |
CN111095231B (en) | 2018-06-30 | 2021-08-03 | 华为技术有限公司 | NVMe-based data reading method, device and system |
CN112631965A (en) * | 2018-10-19 | 2021-04-09 | 华为技术有限公司 | Method and device for controlling hard disk access |
CN111552436B (en) * | 2018-10-25 | 2022-02-25 | 华为技术有限公司 | Data recovery method, system and device in storage system |
US11531590B2 (en) * | 2019-09-17 | 2022-12-20 | Western Digital Technologies, Inc. | Method and system for host-assisted data recovery assurance for data center storage device architectures |
CN112764667B (en) * | 2019-10-21 | 2024-09-06 | 伊姆西Ip控股有限责任公司 | Method, apparatus, storage system and computer program product for storage management |
JP7167078B2 (en) * | 2020-03-27 | 2022-11-08 | 株式会社日立製作所 | Distributed storage system and storage control method |
CN112214356B (en) * | 2020-10-21 | 2024-06-18 | 深圳忆联信息系统有限公司 | File data recovery method, device, computer equipment and storage medium |
US11429543B2 (en) * | 2020-10-22 | 2022-08-30 | Micron Technology, Inc. | Managed NAND flash memory region control against endurance hacking |
CN113485872B (en) * | 2021-07-14 | 2024-10-01 | 北京金山云网络技术有限公司 | Fault processing method and device and distributed storage system |
CN113641437B (en) * | 2021-08-16 | 2023-06-16 | 深圳技德智能科技研究院有限公司 | Android-compatible application interface rotation method and device for Linux |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7139931B2 (en) | 2002-08-19 | 2006-11-21 | Aristos Logic Corporation | Method of controlling the system performance and reliability impact of hard disk drive rebuild |
US7188212B2 (en) * | 2004-05-06 | 2007-03-06 | International Business Machines Corporation | Method and system for storing data in an array of storage devices with additional and autonomic protection |
US9396103B2 (en) * | 2007-06-08 | 2016-07-19 | Sandisk Technologies Llc | Method and system for storage address re-mapping for a memory device |
US20090271659A1 (en) | 2008-04-24 | 2009-10-29 | Ulf Troppens | Raid rebuild using file system and block list |
US8041991B2 (en) * | 2008-11-18 | 2011-10-18 | Lsi Corporation | System and method for recovering solid state drive data |
US20110029716A1 (en) * | 2009-07-29 | 2011-02-03 | Stec, Inc. | System and method of recovering data in a flash storage system |
US8667248B1 (en) | 2010-08-31 | 2014-03-04 | Western Digital Technologies, Inc. | Data storage device using metadata and mapping table to identify valid user data on non-volatile media |
EP2666091A2 (en) * | 2011-01-18 | 2013-11-27 | LSI Corporation | Higher-level redundancy information computation |
US9058243B2 (en) * | 2011-06-10 | 2015-06-16 | International Business Machines Corporation | Releasing blocks of storage class memory |
US8589724B2 (en) | 2011-06-30 | 2013-11-19 | Seagate Technology Llc | Rapid rebuild of a data set |
JP5807458B2 (en) * | 2011-08-31 | 2015-11-10 | 富士通株式会社 | Storage system, storage control device, and storage control method |
US9069657B2 (en) * | 2011-12-12 | 2015-06-30 | Apple Inc. | LBA bitmap usage |
TWI563383B (en) * | 2012-04-25 | 2016-12-21 | Phison Electronics Corp | Memory formatting method, memory controller and memory storage apparatus |
CN102708019B (en) * | 2012-04-28 | 2014-12-03 | 华为技术有限公司 | Method, device and system for hard disk data recovery |
US8966160B2 (en) * | 2012-09-28 | 2015-02-24 | Intel Corporation | Storage device trimming |
US8943359B2 (en) * | 2012-10-05 | 2015-01-27 | Lsi Corporation | Common hot spare for multiple RAID groups |
JP6136460B2 (en) * | 2013-03-28 | 2017-05-31 | 富士通株式会社 | Information processing apparatus, information processing apparatus control program, and information processing apparatus control method |
CN103534688B (en) * | 2013-05-29 | 2016-03-09 | 华为技术有限公司 | Data reconstruction method, memory device and storage system |
CN103699457A (en) * | 2013-09-26 | 2014-04-02 | 深圳市泽云科技有限公司 | Method and device for restoring disk arrays based on stripping |
US9658923B2 (en) * | 2014-09-30 | 2017-05-23 | International Business Machines Corporation | Optimization of rebuilding in solid state drives |
US10528272B2 (en) * | 2015-02-20 | 2020-01-07 | International Business Machines Corporation | RAID array systems and operations using mapping information |
-
2014
- 2014-12-12 US US14/568,417 patent/US9575853B2/en active Active
-
2015
- 2015-11-13 KR KR1020187002334A patent/KR102502352B1/en active IP Right Grant
- 2015-11-13 CN CN201580061632.1A patent/CN107111535B/en active Active
- 2015-11-13 KR KR1020177012863A patent/KR102487790B1/en active IP Right Grant
- 2015-11-13 WO PCT/US2015/060632 patent/WO2016094032A1/en active Application Filing
- 2015-11-13 CN CN201810052068.2A patent/CN108089951B/en active Active
-
2017
- 2017-01-04 US US15/398,651 patent/US10289500B2/en active Active
Non-Patent Citations (2)
Title |
---|
McKean US Patent app Pub 2010/0125751 * |
Werner et al US Patent app pub 2013/0290618 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020041444A1 (en) | 2018-08-23 | 2020-02-27 | Micron Technology, Inc. | Data recovery within a memory sub-system |
US11068365B2 (en) | 2018-08-23 | 2021-07-20 | Micron Technology, Inc. | Data recovery within a memory sub-system without moving or processing the data through a host |
EP3841474A4 (en) * | 2018-08-23 | 2022-05-04 | Micron Technology, Inc. | Data recovery within a memory sub-system |
US12124345B2 (en) | 2018-08-23 | 2024-10-22 | Micron Technology, Inc. | Data recovery within a memory sub-system without moving or processing the data through a host |
US11625193B2 (en) * | 2020-07-10 | 2023-04-11 | Samsung Electronics Co., Ltd. | RAID storage device, host, and RAID system |
Also Published As
Publication number | Publication date |
---|---|
KR20180011365A (en) | 2018-01-31 |
KR20170093798A (en) | 2017-08-16 |
US20160170846A1 (en) | 2016-06-16 |
US9575853B2 (en) | 2017-02-21 |
CN107111535A (en) | 2017-08-29 |
KR102502352B1 (en) | 2023-02-23 |
KR102487790B1 (en) | 2023-01-13 |
US10289500B2 (en) | 2019-05-14 |
CN108089951B (en) | 2022-04-08 |
CN107111535B (en) | 2021-05-11 |
CN108089951A (en) | 2018-05-29 |
WO2016094032A1 (en) | 2016-06-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10289500B2 (en) | Accelerated data recovery in a storage system | |
CN111433732B (en) | Storage device and computer-implemented method performed by the storage device | |
US8788876B2 (en) | Stripe-based memory operation | |
US11474899B2 (en) | Operation method of open-channel storage device | |
US9519577B2 (en) | Method and system for migrating data between flash memory devices | |
US9304685B2 (en) | Storage array system and non-transitory recording medium storing control program | |
US10120769B2 (en) | Raid rebuild algorithm with low I/O impact | |
US9026845B2 (en) | System and method for failure protection in a storage array | |
US20130173954A1 (en) | Method of managing bad storage region of memory device and storage device using the method | |
US20140281216A1 (en) | Vertically integrated storage | |
US20160011786A1 (en) | Storage system and data write method | |
CN112558860A (en) | Memory system for distributing and reading data and operating method thereof | |
US10031689B2 (en) | Stream management for storage devices | |
US10642531B2 (en) | Atomic write method for multi-transaction | |
US11106372B2 (en) | Asynchronous power loss handling approach for a memory sub-system | |
CN112585586A (en) | Data recovery within a memory subsystem | |
CN111752475A (en) | Method and device for data access management in storage server |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 4 |
|
AS | Assignment |
Owner name: SK HYNIX NAND PRODUCT SOLUTIONS CORP., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTEL CORPORATION;REEL/FRAME:062702/0001 Effective date: 20211229 |