CN104813290B - Raid调查器 - Google Patents

Raid调查器 Download PDF

Info

Publication number
CN104813290B
CN104813290B CN201380059018.2A CN201380059018A CN104813290B CN 104813290 B CN104813290 B CN 104813290B CN 201380059018 A CN201380059018 A CN 201380059018A CN 104813290 B CN104813290 B CN 104813290B
Authority
CN
China
Prior art keywords
disk
disks
data
failing
data storage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201380059018.2A
Other languages
English (en)
Chinese (zh)
Other versions
CN104813290A (zh
Inventor
A·J·弗勒德
D·J·安德森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Compellent Technologies Inc
Original Assignee
Compellent Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Compellent Technologies Inc filed Critical Compellent Technologies Inc
Publication of CN104813290A publication Critical patent/CN104813290A/zh
Application granted granted Critical
Publication of CN104813290B publication Critical patent/CN104813290B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1076Parity data used in redundant arrays of independent storages, e.g. in RAID systems
    • G06F11/1088Reconstruction on already foreseen single or plurality of spare disks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/004Error avoidance
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/008Reliability or availability analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0727Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a storage system, e.g. in a DASD or network based storage system
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751Error or fault detection not based on redundancy
    • G06F11/0754Error or fault detection not based on redundancy by exceeding limits
    • G06F11/076Error or fault detection not based on redundancy by exceeding limits by exceeding a count or rate limit, e.g. word- or bit count limit
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1076Parity data used in redundant arrays of independent storages, e.g. in RAID systems
    • G06F11/1092Rebuilding, e.g. when physically replacing a failing disk
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2069Management of state, configuration or failover
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operations
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2211/00Indexing scheme relating to details of data-processing equipment not covered by groups G06F3/00 - G06F13/00
    • G06F2211/10Indexing scheme relating to G06F11/10
    • G06F2211/1002Indexing scheme relating to G06F11/1076
    • G06F2211/1088Scrubbing in RAID systems with parity

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
CN201380059018.2A 2012-12-06 2013-12-05 Raid调查器 Active CN104813290B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/706,553 US9135096B2 (en) 2012-12-06 2012-12-06 RAID surveyor
US13/706,553 2012-12-06
PCT/US2013/073347 WO2014089311A2 (en) 2012-12-06 2013-12-05 Raid surveyor

Publications (2)

Publication Number Publication Date
CN104813290A CN104813290A (zh) 2015-07-29
CN104813290B true CN104813290B (zh) 2018-09-21

Family

ID=50882388

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201380059018.2A Active CN104813290B (zh) 2012-12-06 2013-12-05 Raid调查器

Country Status (5)

Country Link
US (2) US9135096B2 (https=)
EP (1) EP2929435B1 (https=)
CN (1) CN104813290B (https=)
IN (1) IN2015DN02706A (https=)
WO (1) WO2014089311A2 (https=)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9104604B2 (en) * 2013-02-26 2015-08-11 International Business Machines Corporation Preventing unrecoverable errors during a disk regeneration in a disk array
CN106557389B (zh) * 2015-09-29 2019-03-08 成都华为技术有限公司 一种慢盘检测方法和装置
CN106250278A (zh) * 2016-08-04 2016-12-21 深圳市泽云科技有限公司 一种一键执行的磁盘阵列数据恢复方法
US10678643B1 (en) * 2017-04-26 2020-06-09 EMC IP Holding Company LLC Splitting a group of physical data storage drives into partnership groups to limit the risk of data loss during drive rebuilds in a mapped RAID (redundant array of independent disks) data storage system
US10346247B1 (en) * 2017-04-27 2019-07-09 EMC IP Holding Company LLC Adjustable error sensitivity for taking disks offline in a mapped RAID storage array
US10210045B1 (en) * 2017-04-27 2019-02-19 EMC IP Holding Company LLC Reducing concurrency bottlenecks while rebuilding a failed drive in a data storage system
CN109725838B (zh) * 2017-10-27 2022-02-25 伊姆西Ip控股有限责任公司 用于管理多个盘的方法、装置以及计算机可读介质
US10691543B2 (en) 2017-11-14 2020-06-23 International Business Machines Corporation Machine learning to enhance redundant array of independent disks rebuilds
US10740181B2 (en) * 2018-03-06 2020-08-11 Western Digital Technologies, Inc. Failed storage device rebuild method
CN110413454B (zh) * 2018-04-28 2022-04-05 华为技术有限公司 基于存储阵列的数据重建方法、装置及存储介质
CN111124264B (zh) * 2018-10-31 2023-10-27 伊姆西Ip控股有限责任公司 用于重建数据的方法、设备和计算机程序产品
CN109873985A (zh) * 2019-03-01 2019-06-11 苏州星奥达科技有限公司 一种对视频平台集群的智能备份恢复方法
US11182358B2 (en) 2019-07-18 2021-11-23 International Business Machines Corporation Performance enhanced data scrubbing
CN115206406B (zh) 2021-04-12 2026-03-31 伊姆西Ip控股有限责任公司 管理独立磁盘冗余阵列的方法和装置
US12271756B2 (en) 2021-09-22 2025-04-08 International Business Machines Corporation Identifying slow nodes in a computing environment

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6934904B2 (en) * 2001-04-30 2005-08-23 Sun Microsystems, Inc. Data integrity error handling in a redundant storage array
US7017107B2 (en) * 2001-04-30 2006-03-21 Sun Microsystems, Inc. Storage array employing scrubbing operations at the disk-controller level
US6871263B2 (en) * 2001-08-28 2005-03-22 Sedna Patent Services, Llc Method and apparatus for striping data onto a plurality of disk drives
US7363546B2 (en) * 2002-07-31 2008-04-22 Sun Microsystems, Inc. Latent fault detector
US7613945B2 (en) 2003-08-14 2009-11-03 Compellent Technologies Virtual disk drive system and method
US7590801B1 (en) * 2004-02-12 2009-09-15 Netapp, Inc. Identifying suspect disks
US7313721B2 (en) * 2004-06-21 2007-12-25 Dot Hill Systems Corporation Apparatus and method for performing a preemptive reconstruct of a fault-tolerant RAID array
US7574623B1 (en) 2005-04-29 2009-08-11 Network Appliance, Inc. Method and system for rapidly recovering data from a “sick” disk in a RAID disk group
US8145941B2 (en) 2006-10-31 2012-03-27 Hewlett-Packard Development Company, L.P. Detection and correction of block-level data corruption in fault-tolerant data-storage systems
US20090055682A1 (en) * 2007-07-18 2009-02-26 Panasas Inc. Data storage systems and methods having block group error correction for repairing unrecoverable read errors
US7971093B1 (en) * 2008-01-16 2011-06-28 Network Appliance, Inc. Apparatus and method to proactively address hard disk drive inefficiency and failure
US8463991B2 (en) 2010-09-28 2013-06-11 Pure Storage Inc. Intra-device data protection in a raid array
US8689040B2 (en) 2010-10-01 2014-04-01 Lsi Corporation Method and system for data reconstruction after drive failures

Also Published As

Publication number Publication date
WO2014089311A3 (en) 2014-07-31
EP2929435A4 (en) 2016-11-02
US20150347232A1 (en) 2015-12-03
CN104813290A (zh) 2015-07-29
EP2929435A2 (en) 2015-10-14
EP2929435B1 (en) 2020-03-25
IN2015DN02706A (https=) 2015-09-04
US10025666B2 (en) 2018-07-17
US20140164849A1 (en) 2014-06-12
WO2014089311A2 (en) 2014-06-12
US9135096B2 (en) 2015-09-15

Similar Documents

Publication Publication Date Title
CN104813290B (zh) Raid调查器
Bairavasundaram et al. An analysis of data corruption in the storage stack
US10901848B2 (en) Storage systems with peer data recovery
US8839028B1 (en) Managing data availability in storage systems
US9286002B1 (en) Dynamic restriping in nonvolatile memory systems
CN107870730B (zh) 用于管理存储系统的方法和系统
CN104484251B (zh) 一种硬盘故障的处理方法及装置
US9104604B2 (en) Preventing unrecoverable errors during a disk regeneration in a disk array
US9558068B1 (en) Recovering from metadata inconsistencies in storage systems
CN104094236B (zh) 防止数据丢失的系统和方法
Thomasian et al. Higher reliability redundant disk arrays: Organization, operation, and coding
CN107515726B (zh) 用于管理存储设备的方法和系统
US8484506B2 (en) Redundant array of independent disks level 5 (RAID 5) with a mirroring functionality
US10275312B1 (en) Systems and methods for selecting a set of storage nodes for use in reconstructing data on a faulted node in an erasure-coded system
US8650435B2 (en) Enhanced storage device replacement system and method
CN106959912A (zh) 磁盘检测方法及装置
CN103593260B (zh) 一种元数据的保护方法和装置
US20190354452A1 (en) Parity log with delta bitmap
EP3794451B1 (en) Parity log with by-pass
US11416357B2 (en) Method and system for managing a spare fault domain in a multi-fault domain data cluster
US20060215456A1 (en) Disk array data protective system and method
CN114610235B (zh) 分布式存储集群、存储引擎、两副本存储方法及设备
US11592994B2 (en) Providing preferential treatment to metadata over user data
US11080136B2 (en) Dropped write error detection
US11119858B1 (en) Method and system for performing a proactive copy operation for a spare persistent storage

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant