CN109496401A - 一种业务接管方法、存储设备和业务接管装置 - Google Patents

一种业务接管方法、存储设备和业务接管装置 Download PDF

Info

Publication number
CN109496401A
CN109496401A CN201580003092.1A CN201580003092A CN109496401A CN 109496401 A CN109496401 A CN 109496401A CN 201580003092 A CN201580003092 A CN 201580003092A CN 109496401 A CN109496401 A CN 109496401A
Authority
CN
China
Prior art keywords
storage equipment
operating status
state
storage
equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201580003092.1A
Other languages
English (en)
Other versions
CN109496401B (zh
Inventor
张程
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of CN109496401A publication Critical patent/CN109496401A/zh
Application granted granted Critical
Publication of CN109496401B publication Critical patent/CN109496401B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques
    • G06F11/2033Failover techniques switching over of hardware resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/1629Error detection by comparing the output of redundant processing systems
    • G06F11/1641Error detection by comparing the output of redundant processing systems where the comparison is not performed by the redundant processing components
    • G06F11/1645Error detection by comparing the output of redundant processing systems where the comparison is not performed by the redundant processing components and the comparison itself uses redundant hardware
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/1629Error detection by comparing the output of redundant processing systems
    • G06F11/165Error detection by comparing the output of redundant processing systems with continued operation after detection of the error
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques
    • G06F11/2025Failover techniques using centralised failover control functionality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2035Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2048Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant where the redundant components share neither address space nor persistent storage
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2097Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/22Arrangements for detecting or preventing errors in the information received using redundant apparatus to increase reliability
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques
    • G06F11/2028Failover techniques eliminating a faulty processor or activating a spare
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3024Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a central processing unit [CPU]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3485Performance evaluation by tracing or monitoring for I/O devices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/805Real-time

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Hardware Redundancy (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

本实施例提供了业务接管方法、存储设备和业务接管装置。当存储系统中的两个存储设备之间发送通信故障时,两个存储设备分别获取自己的运行状态。运行状态反映的是存储设备的系统资源的当前利用情况。然后,各自根据运行状态确定延迟时长,延迟时长是存储设备向仲裁服务器发送仲裁请求之前等待的时长。两个存储设备分别在所述延迟时长之后,再向仲裁服务器发送仲裁请求以请求接管业务。有利于仲裁服务器选择出运行状态较好的存储设备来接管主机业务。

Description

PCT国内申请,说明书已公开。

Claims (15)

  1. PCT国内申请,权利要求书已公开。
CN201580003092.1A 2015-12-23 2015-12-23 一种业务接管方法、存储设备和业务接管装置 Active CN109496401B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/098487 WO2017107110A1 (zh) 2015-12-23 2015-12-23 一种业务接管方法、存储设备和业务接管装置

Publications (2)

Publication Number Publication Date
CN109496401A true CN109496401A (zh) 2019-03-19
CN109496401B CN109496401B (zh) 2021-01-05

Family

ID=59088782

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580003092.1A Active CN109496401B (zh) 2015-12-23 2015-12-23 一种业务接管方法、存储设备和业务接管装置

Country Status (4)

Country Link
US (3) US10705930B2 (zh)
EP (1) EP3319258B1 (zh)
CN (1) CN109496401B (zh)
WO (1) WO2017107110A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114780272A (zh) * 2022-04-18 2022-07-22 北京亚康万玮信息技术股份有限公司 基于共享存储和虚拟化的智能故障自愈调度方法和装置

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10521344B1 (en) * 2017-03-10 2019-12-31 Pure Storage, Inc. Servicing input/output (‘I/O’) operations directed to a dataset that is synchronized across a plurality of storage systems
CN110535714B (zh) 2018-05-25 2023-04-18 华为技术有限公司 一种仲裁方法及相关装置
US11194682B2 (en) * 2019-10-15 2021-12-07 EMC IP Holding Company LLC Connectivity-aware witness for active-active storage
CN114520892B (zh) * 2020-11-18 2023-04-07 华为技术有限公司 通信控制方法、装置及光网络单元
US11487635B2 (en) * 2020-11-20 2022-11-01 Netapp Inc. Mediator assisted switchover between clusters
US11556441B2 (en) * 2021-04-16 2023-01-17 EMC IP Holding Company LLC Data storage cluster with quorum service protection
US11740803B2 (en) * 2021-10-22 2023-08-29 EMC IP Holding Company, LLC System and method for stretching storage protection configurations in a storage cluster

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101060391A (zh) * 2007-05-16 2007-10-24 华为技术有限公司 主备服务器切换方法及系统及主用服务器、备用服务器
CN101124543A (zh) * 2004-07-20 2008-02-13 Ut斯达康公司 切换辅助设备和方法
CN101835062A (zh) * 2010-04-29 2010-09-15 中兴通讯股份有限公司 业务板倒换的处理方法及机架控制装置
US20110173233A1 (en) * 2010-01-13 2011-07-14 Fujitsu Limited Database system and database control method
CN102833096A (zh) * 2012-08-06 2012-12-19 杭州华三通信技术有限公司 一种低成本的高可用系统实现方法及装置
US20150257141A1 (en) * 2014-03-04 2015-09-10 Cisco Technology, Inc. Resource allocation for control channel
CN104980693A (zh) * 2014-04-11 2015-10-14 深圳中兴力维技术有限公司 媒体服务备份方法及系统
CN105049258A (zh) * 2015-08-14 2015-11-11 深圳市傲冠软件股份有限公司 网络容灾系统的数据传输方法
CN105095125A (zh) * 2015-07-08 2015-11-25 北京飞杰信息技术有限公司 基于仲裁磁盘的高可用双控存储系统及其运行方法

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040025166A1 (en) * 2002-02-02 2004-02-05 International Business Machines Corporation Server computer and a method for accessing resources from virtual machines of a server computer via a fibre channel
JP4855355B2 (ja) * 2007-07-18 2012-01-18 株式会社日立製作所 フェールオーバにおける引き継ぎ先を自律的に変更する計算機システム及び方法
JP4609521B2 (ja) * 2008-04-21 2011-01-12 ソニー株式会社 情報処理装置、および情報処理方法、並びにコンピュータ・プログラム
JP5422147B2 (ja) * 2008-07-08 2014-02-19 株式会社日立製作所 リモートコピーシステム及びリモートコピー方法
US7882389B2 (en) * 2008-11-18 2011-02-01 International Business Machines Corporation Dynamic reassignment of devices attached to redundant controllers
US8966326B2 (en) * 2010-04-30 2015-02-24 SK Hynix Inc. Error detecting circuit and semiconductor apparatus including the same
TWI378344B (en) * 2010-06-30 2012-12-01 Ind Tech Res Inst Data backup, recovery and deletion method through a distributed network and system thereof
BR112013004072B1 (pt) * 2010-08-23 2022-03-03 Nokia Technologies Oy Aparelho, método e meio de armazenamento legível por computador
US8619555B2 (en) * 2010-11-17 2013-12-31 Netapp, Inc. Method and system for path selection in a network
US8868731B1 (en) * 2011-06-06 2014-10-21 Cisco Technology, Inc. Technique for false positives prevention in high availability network
US8682852B1 (en) * 2012-03-29 2014-03-25 Emc Corporation Asymmetric asynchronous mirroring for high availability
US9323702B2 (en) * 2012-11-27 2016-04-26 International Business Machines Corporation Increasing coverage of delays through arbitration logic
US9378145B2 (en) * 2013-03-05 2016-06-28 Dot Hill Systems Corporation Storage controller cache synchronization method and apparatus
US9104643B2 (en) * 2013-03-15 2015-08-11 International Business Machines Corporation OpenFlow controller master-slave initialization protocol
US9887924B2 (en) * 2013-08-26 2018-02-06 Vmware, Inc. Distributed policy-based provisioning and enforcement for quality of service
US9497071B2 (en) * 2014-04-01 2016-11-15 Ca, Inc. Multi-hop root cause analysis
US10027748B2 (en) * 2015-07-10 2018-07-17 Facebook, Inc. Data replication in a tree based server architecture
US10868743B2 (en) * 2016-06-01 2020-12-15 Intel Corporation System and method for providing fast platform telemetry data
US10721296B2 (en) * 2017-12-04 2020-07-21 International Business Machines Corporation Optimized rolling restart of stateful services to minimize disruption

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101124543A (zh) * 2004-07-20 2008-02-13 Ut斯达康公司 切换辅助设备和方法
CN101060391A (zh) * 2007-05-16 2007-10-24 华为技术有限公司 主备服务器切换方法及系统及主用服务器、备用服务器
US20110173233A1 (en) * 2010-01-13 2011-07-14 Fujitsu Limited Database system and database control method
CN101835062A (zh) * 2010-04-29 2010-09-15 中兴通讯股份有限公司 业务板倒换的处理方法及机架控制装置
CN102833096A (zh) * 2012-08-06 2012-12-19 杭州华三通信技术有限公司 一种低成本的高可用系统实现方法及装置
US20150257141A1 (en) * 2014-03-04 2015-09-10 Cisco Technology, Inc. Resource allocation for control channel
CN104980693A (zh) * 2014-04-11 2015-10-14 深圳中兴力维技术有限公司 媒体服务备份方法及系统
CN105095125A (zh) * 2015-07-08 2015-11-25 北京飞杰信息技术有限公司 基于仲裁磁盘的高可用双控存储系统及其运行方法
CN105049258A (zh) * 2015-08-14 2015-11-11 深圳市傲冠软件股份有限公司 网络容灾系统的数据传输方法

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114780272A (zh) * 2022-04-18 2022-07-22 北京亚康万玮信息技术股份有限公司 基于共享存储和虚拟化的智能故障自愈调度方法和装置

Also Published As

Publication number Publication date
US20200250055A1 (en) 2020-08-06
EP3319258A4 (en) 2018-11-14
EP3319258A1 (en) 2018-05-09
CN109496401B (zh) 2021-01-05
EP3319258B1 (en) 2019-11-27
WO2017107110A1 (zh) 2017-06-29
US20220283914A1 (en) 2022-09-08
US11347603B2 (en) 2022-05-31
US20180143887A1 (en) 2018-05-24
US10705930B2 (en) 2020-07-07
US11740982B2 (en) 2023-08-29

Similar Documents

Publication Publication Date Title
CN109496401A (zh) 一种业务接管方法、存储设备和业务接管装置
US9146684B2 (en) Storage architecture for server flash and storage array operation
US10366106B2 (en) Quorum-based replication of data records
US9047306B1 (en) Method of writing data
US8707085B2 (en) High availability data storage systems and methods
US20210075665A1 (en) Implementing switchover operations between computing nodes
EP2830284A1 (en) Caching method for distributed storage system, node and computer readable medium
JP2018522358A (ja) ネットワークフロー制御に基づく動的リソース割当て
US9329956B2 (en) Retrieving diagnostics information in an N-way clustered RAID subsystem
US20090228672A1 (en) Remote copy system and check method
CN101923442B (zh) iSCSI存储设备访问过程中的缓存数据同步系统及方法
US8700726B2 (en) Storage replication systems and methods
US20040139196A1 (en) System and method for releasing device reservations
KR102495724B1 (ko) 비정상 트랜잭션 요청의 발생 위치 제공 방법 및 그 장치
US20170116096A1 (en) Preserving coredump data during switchover operation
US9990148B2 (en) Storage control device and storage system for data backup
JPWO2015056301A1 (ja) ストレージシステム及びキャッシュ制御方法
JP2012168907A (ja) 相互監視システム
EP2616938B1 (en) Fault handling systems and methods
US11403001B2 (en) System and method for storage system node fencing
CN115129236A (zh) 一种双活存储系统的管理方法及装置
US9256566B1 (en) Managed reliability of data storage
CN110166558B (zh) 一种多控存储集群的通信方法、装置及设备
US7558886B2 (en) Method and apparatus for controlling data flows in distributed storage systems
CN116560905A (zh) 数据备份系统、方法和设备

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant