IN2014DE00743A - - Google Patents

Download PDF

Info

Publication number
IN2014DE00743A
IN2014DE00743A IN743DE2014A IN2014DE00743A IN 2014DE00743 A IN2014DE00743 A IN 2014DE00743A IN 743DE2014 A IN743DE2014 A IN 743DE2014A IN 2014DE00743 A IN2014DE00743 A IN 2014DE00743A
Authority
IN
India
Prior art keywords
logged
node
operations
replay
partner
Prior art date
Application number
Inventor
Prakash Ameya Usgaonkar
Siddhartha Nandi
Original Assignee
Netapp Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Netapp Inc filed Critical Netapp Inc
Priority to IN743DE2014 priority Critical patent/IN2014DE00743A/en
Priority to US14/280,139 priority patent/US9342417B2/en
Publication of IN2014DE00743A publication Critical patent/IN2014DE00743A/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2064Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring while ensuring consistency
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2002Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant
    • G06F11/2005Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant using redundant communication controllers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2002Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant
    • G06F11/2007Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant using redundant communication media
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2097Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2071Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring using a plurality of controllers

Abstract

A live non-volatile (NV) replay technique enables a partner node to efficiently takeover a failed node of a high-availability pair in a multi-node storage cluster by dynamically replaying operations synchronously logged in a non-volatile random access memory (NVRAM) of the partner node, while also providing high performance during normal operation. Dynamic live replay may be effected through interpretation of metadata describing the logged operations. The metadata may specify a location and type of each logged operation within a partner portion of the NVRAM, as well as any dependency among the logged operation and any other logged operations that would impose an ordering constraint. During normal operation, the partner node may consult the metadata to identify dependent logged operations and dynamically replay those operations to satisfy one or more requests. Upon failure of the node, the partner node may replay, in parallel, those logged operations having no imposed ordering constraint, thereby reducing time needed to complete takeover of the failed node.
IN743DE2014 2014-03-13 2014-03-13 IN2014DE00743A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
IN743DE2014 IN2014DE00743A (en) 2014-03-13 2014-03-13
US14/280,139 US9342417B2 (en) 2014-03-13 2014-05-16 Live NV replay for enabling high performance and efficient takeover in multi-node storage cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
IN743DE2014 IN2014DE00743A (en) 2014-03-13 2014-03-13

Publications (1)

Publication Number Publication Date
IN2014DE00743A true IN2014DE00743A (en) 2015-09-18

Family

ID=54069018

Family Applications (1)

Application Number Title Priority Date Filing Date
IN743DE2014 IN2014DE00743A (en) 2014-03-13 2014-03-13

Country Status (2)

Country Link
US (1) US9342417B2 (en)
IN (1) IN2014DE00743A (en)

Families Citing this family (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR3030076B1 (en) * 2014-12-10 2016-12-09 Bull Sas METHOD FOR MANAGING A NETWORK OF CALCULATION NODES
US10417093B2 (en) * 2016-05-13 2019-09-17 Netapp, Inc. Methods for providing global spare data storage device management and devices thereof
US9794366B1 (en) 2016-10-19 2017-10-17 Red Hat, Inc. Persistent-memory management
US10635552B1 (en) * 2017-08-02 2020-04-28 EMC IP Holding Company LLC Method for tracking validity of journal copies to allow journal mirroring
US11301433B2 (en) * 2017-11-13 2022-04-12 Weka.IO Ltd. Metadata journal in a distributed storage system
US10795782B2 (en) * 2018-04-02 2020-10-06 Hewlett Packard Enterprise Development Lp Data processing apparatuses and methods to support transferring control between a primary data processing system and a secondary data processing system in response to an event
US11048559B2 (en) 2019-07-08 2021-06-29 Hewlett Packard Enterprise Development Lp Managing ownership transfer of file system instance in virtualized distributed storage system
US10922009B2 (en) * 2019-07-08 2021-02-16 International Business Machines Corporation Mirroring write operations across data storage devices
US11409715B2 (en) * 2019-10-04 2022-08-09 Hewlett Packard Enterprise Development Lp Maintaining high-availability of a file system instance in a cluster of computing nodes
US11231934B2 (en) * 2020-03-05 2022-01-25 Samsung Electronics Co., Ltd. System and method for controlling the order of instruction execution by a target device
US11216350B2 (en) * 2020-04-22 2022-01-04 Netapp, Inc. Network storage failover systems and associated methods
US11269744B2 (en) 2020-04-22 2022-03-08 Netapp, Inc. Network storage failover systems and associated methods
US11416356B2 (en) * 2020-04-22 2022-08-16 Netapp, Inc. Network storage failover systems and associated methods
CN113946275B (en) * 2020-07-15 2024-04-09 中移(苏州)软件技术有限公司 Cache management method and device and storage medium
US11683363B1 (en) 2021-04-12 2023-06-20 Parallels International Gmbh Re-directed folder access on remote volumes and drives
US11481326B1 (en) 2021-07-28 2022-10-25 Netapp, Inc. Networked storage system with a remote storage location cache and associated methods thereof
US11768775B2 (en) 2021-07-28 2023-09-26 Netapp, Inc. Methods and systems for managing race conditions during usage of a remote storage location cache in a networked storage system
US11544011B1 (en) 2021-07-28 2023-01-03 Netapp, Inc. Write invalidation of a remote location cache entry in a networked storage system
US11500591B1 (en) 2021-07-28 2022-11-15 Netapp, Inc. Methods and systems for enabling and disabling remote storage location cache usage in a networked storage system

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7249150B1 (en) 2001-07-03 2007-07-24 Network Appliance, Inc. System and method for parallelized replay of an NVRAM log in a storage appliance
US7305421B2 (en) * 2001-07-16 2007-12-04 Sap Ag Parallelized redo-only logging and recovery for highly available main memory database systems
US7730153B1 (en) 2001-12-04 2010-06-01 Netapp, Inc. Efficient use of NVRAM during takeover in a node cluster
GB0308262D0 (en) * 2003-04-10 2003-05-14 Ibm Recovery from failures within data processing systems
US7895286B1 (en) 2004-04-30 2011-02-22 Netapp, Inc. Network storage system with NVRAM and cluster interconnect adapter implemented in a single circuit module
US7376866B1 (en) * 2004-10-22 2008-05-20 Network Appliance, Inc. Method and an apparatus to perform fast log replay
US7844584B1 (en) 2006-06-23 2010-11-30 Netapp, Inc. System and method for persistently storing lock state information
US7613947B1 (en) 2006-11-30 2009-11-03 Netapp, Inc. System and method for storage takeover
US7620669B1 (en) 2006-12-15 2009-11-17 Netapp, Inc. System and method for enhancing log performance
US7962686B1 (en) 2009-02-02 2011-06-14 Netapp, Inc. Efficient preservation of the ordering of write data within a subsystem that does not otherwise guarantee preservation of such ordering
US8688798B1 (en) 2009-04-03 2014-04-01 Netapp, Inc. System and method for a shared write address protocol over a remote direct memory access connection

Also Published As

Publication number Publication date
US9342417B2 (en) 2016-05-17
US20150261633A1 (en) 2015-09-17

Similar Documents

Publication Publication Date Title
IN2014DE00743A (en)
SG11201907942QA (en) Blockchain cluster processing system and method, computer device and storage medium
PH12019501943A1 (en) Business verification method and apparatus
MY191655A (en) Method for controlling transmission of data
EA201990251A1 (en) SYSTEM OF DISTRIBUTED PROCESSING OF TRANSACTIONS AND AUTHENTICATION
MX2018004690A (en) Resource response expansion.
GB2538654A (en) Prioritizing data reconstruction in distributed storage systems
JP2017527911A5 (en)
SG11201807494UA (en) Optimization method, evaluation method and processing method and apparatuses for data migration
BR112016021485A2 (en) HASH-BASED ENCRYPTOR SEARCH FOR INTRA-BLOCK COPY
IN2013CH05115A (en)
GB2539605A (en) Evaluation system and method
BR112016007142A2 (en) disaster recovery using nonvolatile memory
BR112015003406A2 (en) block level access for parallel storage
EP4246295A3 (en) Composite graphical interface with shareable data-objects
WO2014020032A3 (en) High-availability computer system, working method and the use thereof
CN106687911A8 (en) The online data movement of data integrity is not damaged
SG11201811389PA (en) Data processing method and device
JP2019526106A5 (en)
BR112016015988A2 (en) NON-HEVC BASE LAYER SUPPORT IN HEVC MULTI-LAYER EXTENSIONS
GB2529117A (en) Application Backup and restore
GB2557478A (en) Manegement of virtual machine in virtualized computing environment based on fabric limit
EP3614266A3 (en) Recoverable stream processing
PH12019502832A1 (en) Communication apparatus, method and computer program
TW201612743A (en) Bit group interleave processors, methods, systems, and instructions