WO2005104731A3 - Reactive deadlock management in storage area networks - Google Patents

Reactive deadlock management in storage area networks Download PDF

Info

Publication number
WO2005104731A3
WO2005104731A3 PCT/US2005/014335 US2005014335W WO2005104731A3 WO 2005104731 A3 WO2005104731 A3 WO 2005104731A3 US 2005014335 W US2005014335 W US 2005014335W WO 2005104731 A3 WO2005104731 A3 WO 2005104731A3
Authority
WO
WIPO (PCT)
Prior art keywords
command
target
storage area
transfer ready
physical
Prior art date
Application number
PCT/US2005/014335
Other languages
French (fr)
Other versions
WO2005104731A2 (en
Inventor
Robert Tower Frey
Chao Zhang
Original Assignee
Emc Corp
Robert Tower Frey
Chao Zhang
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/833,457 external-priority patent/US7484058B2/en
Priority claimed from US10/833,438 external-priority patent/US20050262309A1/en
Application filed by Emc Corp, Robert Tower Frey, Chao Zhang filed Critical Emc Corp
Publication of WO2005104731A2 publication Critical patent/WO2005104731A2/en
Publication of WO2005104731A3 publication Critical patent/WO2005104731A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2058Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring using more than 2 mirrored copies
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2069Management of state, configuration or failover
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2071Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring using a plurality of controllers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Computer And Data Communications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Systems and methods to detect and alleviate potential or actual deadlock of a storage switch or storage area network when attempting to write data to a mirrored virtual target. In accordance with one embodiment, a timer is started when a storage switch routes a write command to the physical targets corresponding to a virtual target of the write command (1404). If each physical target does not return a transfer ready signal resource within a predetermined time period, the switch determines that a potential or actual deadlock has occurred (1406), An abort command is sent to each of the physical devices (1408). The abort command can clear the command from the targets and also free any allocated transfer ready resources (1410 and 1412). In one embodiment, a queue depth for the virtual target can be lowered after failing to receive transfer ready resources form each target (1414).
PCT/US2005/014335 2004-04-28 2005-04-26 Reactive deadlock management in storage area networks WO2005104731A2 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US10/833,457 2004-04-28
US10/833,457 US7484058B2 (en) 2004-04-28 2004-04-28 Reactive deadlock management in storage area networks
US10/833,438 2004-04-28
US10/833,438 US20050262309A1 (en) 2004-04-28 2004-04-28 Proactive transfer ready resource management in storage area networks

Publications (2)

Publication Number Publication Date
WO2005104731A2 WO2005104731A2 (en) 2005-11-10
WO2005104731A3 true WO2005104731A3 (en) 2007-05-24

Family

ID=35242147

Family Applications (2)

Application Number Title Priority Date Filing Date
PCT/US2005/014335 WO2005104731A2 (en) 2004-04-28 2005-04-26 Reactive deadlock management in storage area networks
PCT/US2005/014307 WO2005104727A2 (en) 2004-04-28 2005-04-26 Proactive transfer ready resource management in storage area networks

Family Applications After (1)

Application Number Title Priority Date Filing Date
PCT/US2005/014307 WO2005104727A2 (en) 2004-04-28 2005-04-26 Proactive transfer ready resource management in storage area networks

Country Status (1)

Country Link
WO (2) WO2005104731A2 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5666559A (en) * 1994-04-06 1997-09-09 Advanced Micro Devices Fail-safe communication abort mechanism for parallel ports with selectable NMI or parallel port interrupt
US7013336B1 (en) * 1999-03-31 2006-03-14 International Business Machines Corporation Method and structure for efficiently retrieving status for SCSI accessed fault-tolerant enclosure (SAF-TE) systems

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6341315B1 (en) * 1999-02-26 2002-01-22 Crossroads Systems, Inc. Streaming method and system for fiber channel network devices
US20030079018A1 (en) * 2001-09-28 2003-04-24 Lolayekar Santosh C. Load balancing in a storage network

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5666559A (en) * 1994-04-06 1997-09-09 Advanced Micro Devices Fail-safe communication abort mechanism for parallel ports with selectable NMI or parallel port interrupt
US7013336B1 (en) * 1999-03-31 2006-03-14 International Business Machines Corporation Method and structure for efficiently retrieving status for SCSI accessed fault-tolerant enclosure (SAF-TE) systems

Also Published As

Publication number Publication date
WO2005104727A2 (en) 2005-11-10
WO2005104727A3 (en) 2009-05-28
WO2005104731A2 (en) 2005-11-10

Similar Documents

Publication Publication Date Title
CN101425022B (en) Dynamic allocation of virtual machine devices
WO2006026655A3 (en) Systems and methods to avoid deadlock and guarantee mirror consistency during online mirror synchronization and verification
CN110175140A (en) Fusion memory part and its operating method
CN105224255B (en) A kind of storage file management method and device
CN105468291B (en) Dynamic and static wear balance control method and device
RU2285947C2 (en) Method for ensuring safety with determined execution in real time of multi-task application of control-adjustment type with localization of errors
CN109309631A (en) A kind of method and device based on universal network file system write-in data
WO2007067279A3 (en) Methods and systems for staging configuration data for aircraft computers
WO2004066093A3 (en) Distributed memory computing environment and implantation thereof
EP1845439A3 (en) Storage area dynamic assignment method
WO2004040404A3 (en) Abstracted node discovery
WO2006062328A1 (en) Retransmission and delayed ack timer management logic for tcp protocol
WO2004051471A3 (en) Cross partition sharing of state information
EP1821186A3 (en) Virtual storage system and control method thereof
WO2001001241A3 (en) Method and apparatus for identifying network devices on a storage network
WO2004081699A3 (en) Apparatus and method for controlling resource transfers in a logically partitioned computer system
CN102713854A (en) Method and apparatus for saving and restoring container state
AU2003251066A1 (en) Moving data among storage units
WO2009097005A3 (en) Fast write operations to a mirrored volume in a volume manager
CN106354431A (en) Data storage method and device
WO2001004743A3 (en) Methods and apparatus for managing an application according to an application lifecycle
CN106294019A (en) A kind of operating system mirror image preserves and restoration methods and device
WO2004061672A3 (en) Read-write switching method for a memory controller
EP2363794A3 (en) Systems and methods for simulations utilizing a virtual coupling
CN105335306A (en) Memory control method and memory control device

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE