CN111813348A - Device, method, device and medium for node event processing in unified storage device - Google Patents

Device, method, device and medium for node event processing in unified storage device Download PDF

Info

Publication number
CN111813348A
CN111813348A CN202010934769.6A CN202010934769A CN111813348A CN 111813348 A CN111813348 A CN 111813348A CN 202010934769 A CN202010934769 A CN 202010934769A CN 111813348 A CN111813348 A CN 111813348A
Authority
CN
China
Prior art keywords
node
node event
event
module
cluster
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010934769.6A
Other languages
Chinese (zh)
Inventor
刘文志
侯红生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202010934769.6A priority Critical patent/CN111813348A/en
Publication of CN111813348A publication Critical patent/CN111813348A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0614Improving the reliability of storage systems
    • G06F3/0619Improving the reliability of storage systems in relation to data integrity, e.g. data losses, bit errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0629Configuration or reconfiguration of storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0638Organizing or formatting or addressing of data
    • G06F3/064Management of blocks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device
    • G06F3/0679Non-volatile semiconductor memory device, e.g. flash memory, one time programmable memory [OTP]

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The application discloses a node event processing device, a node event processing method, electronic equipment and a computer readable storage medium in unified storage equipment, wherein the device comprises a node event judgment module and a service processing submodule; the node event judging module is used for determining the contents of the node events which are sequentially generated by the cluster at present and the corresponding nodes of the generated node events according to the change records of the plurality of node event messages and the node online flag bits which are sequentially acquired, and sending corresponding operation notification messages to the corresponding service processing sub-modules; and the service processing submodule is used for executing corresponding configuration operation according to the operation notification message. According to the method and the device, by combining the analysis of the node event message and the change record of the node online zone bit, the specific content of the node event can be accurately acquired and positioned to the specific node, so that the related configuration operation of adding or quitting the cluster of the node can be efficiently processed, and the integrity and consistency of data in the cluster are effectively guaranteed.

Description

统一存储设备中的节点事件处理装置、方法、设备及介质Device, method, device and medium for node event processing in unified storage device

技术领域technical field

本申请涉及统一存储技术领域,特别涉及一种统一存储设备中的节点事件处理装置、方法、电子设备及计算机可读存储介质。The present application relates to the technical field of unified storage, and in particular, to a node event processing apparatus, method, electronic device, and computer-readable storage medium in a unified storage device.

背景技术Background technique

统一存储是一个能在单一设备上运行和管理文件和应用程序的存储系统,整合了基于文件和基于块的访问。由于其支持不同的存储协议为主机系统提供数据存储,因此也被称为多协议存储。Unified storage is a storage system that can run and manage files and applications on a single device, combining file-based and block-based access. Because it supports different storage protocols to provide data storage for the host system, it is also called multi-protocol storage.

当前统一存储设备通常都由多个控制器组成冗余结构,最常用的方式为两个控制器即两个节点组成一个IO Group,进而再由多个IO Group组成最终的多控集群。在这一架构下,由于掉电、业务故障或者其他意外导致的节点退出以及恢复加入事件就可能频繁的出现,因此如何及时、准确的处理相关场景,尤其是复杂的节点事件场景就变得非常重要。The current unified storage device usually consists of multiple controllers to form a redundant structure. The most common method is that two controllers, that is, two nodes, form an IO group, and then multiple IO groups form a final multi-controller cluster. Under this architecture, node exit and rejoin events may occur frequently due to power failure, business failure or other accidents. Therefore, how to deal with relevant scenarios in a timely and accurate manner, especially complex node event scenarios, becomes very important. important.

鉴于此,提供一种解决上述技术问题的方案,已经是本领域技术人员所亟需关注的。In view of this, providing a solution to the above-mentioned technical problems is an urgent need for those skilled in the art.

发明内容SUMMARY OF THE INVENTION

本申请的目的在于提供一种统一存储设备中的节点事件处理装置、方法、电子设备及计算机可读存储介质,以便精确地获取节点事件的具体内容,高效地处理好节点加入或退出集群的相关配置操作,保证集群中数据的完整性与一致性。The purpose of the present application is to provide a node event processing device, method, electronic device and computer-readable storage medium in a unified storage device, so as to accurately obtain the specific content of node events and efficiently handle the related nodes joining or exiting the cluster. Configure operations to ensure the integrity and consistency of data in the cluster.

为解决上述技术问题,第一方面,本申请公开了一种统一存储设备中的节点事件处理装置,包括:In order to solve the above technical problems, in a first aspect, the present application discloses a node event processing device in a unified storage device, including:

节点事件判断模块;用于根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点,并向对应的业务处理子模块发送对应的操作通知消息;The node event judgment module is used to determine the content of the node events currently occurring in the cluster and the corresponding node where the node event occurs according to the successively obtained multiple node event messages and the change records of the node online flags, and send them to the corresponding business processing. The sub-module sends the corresponding operation notification message;

所述业务处理子模块;用于根据所述操作通知消息执行相应的配置操作。the service processing sub-module; configured to perform a corresponding configuration operation according to the operation notification message.

可选地,所述节点在线标志位以数据对的形式存储,用以表示同一IO Group中的两个节点的在线状态。Optionally, the node online flag is stored in the form of a data pair to indicate the online status of two nodes in the same IO Group.

可选地,所述节点事件判断模块中记录有节点状态完成标志位;所述节点事件判断模块还用于:Optionally, a node state completion flag is recorded in the node event judging module; the node event judging module is further used for:

在所述业务处理子模块根据所述操作通知消息执行相应的配置操作之后,对所述节点状态完成标志位进行对应的置位操作,以示对应节点已完成与其在线状态对应的配置操作。After the service processing submodule performs a corresponding configuration operation according to the operation notification message, a corresponding setting operation is performed on the node status completion flag to indicate that the corresponding node has completed the configuration operation corresponding to its online status.

可选地,所述节点事件消息中携带有集群当前运行状态信息。Optionally, the node event message carries information about the current running state of the cluster.

第二方面,本申请还公开了一种统一存储设备中的节点事件处理方法,包括:In a second aspect, the present application also discloses a method for processing node events in a unified storage device, including:

基于节点事件判断模块,根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点;Based on the node event judging module, according to the successively acquired multiple node event messages and the change records of the node online flag bits, determine the node event content currently occurring in the cluster in sequence and the corresponding node where the node event occurs;

向对应的业务处理子模块发送对应的操作通知消息;Send the corresponding operation notification message to the corresponding service processing sub-module;

基于所述业务处理子模块根据所述操作通知消息执行相应的配置操作。A corresponding configuration operation is performed based on the service processing sub-module according to the operation notification message.

可选地,在所述基于所述业务处理子模块根据所述操作通知消息执行相应的配置操作之后,还包括:Optionally, after the sub-module based on the service processing performs a corresponding configuration operation according to the operation notification message, the method further includes:

对所述节点事件判断模块中记录的节点状态完成标志位进行对应的置位操作,以示对应节点已完成与其在线状态对应的配置操作。A corresponding setting operation is performed on the node status completion flag bit recorded in the node event judging module to indicate that the corresponding node has completed the configuration operation corresponding to its online status.

可选地,所述节点在线标志位以数据对的形式存储,用以表示同一IO Group中的两个节点的在线状态。Optionally, the node online flag is stored in the form of a data pair to indicate the online status of two nodes in the same IO Group.

可选地,所述节点事件消息中携带有集群当前运行状态信息。Optionally, the node event message carries information about the current running state of the cluster.

第三方面,本申请还公开了一种电子设备,包括:In a third aspect, the present application also discloses an electronic device, comprising:

存储器,用于存储计算机程序;memory for storing computer programs;

处理器,用于执行所述计算机程序以实现如上所述的任一种统一存储设备中的节点事件处理方法的步骤。The processor is configured to execute the computer program to implement the steps of the node event processing method in any of the above-mentioned unified storage devices.

第四方面,本申请还公开了一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机程序,所述计算机程序被处理器执行时用以实现如上所述的任一种统一存储设备中的节点事件处理方法的步骤。In a fourth aspect, the present application also discloses a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, any one of the above-mentioned unified Steps of a node event processing method in a storage device.

本申请所提供的统一存储设备中的节点事件处理装置包括节点事件判断模块和业务处理子模块;所述节点事件判断模块用于根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点,并向对应的业务处理子模块发送对应的操作通知消息;所述业务处理子模块用于根据所述操作通知消息执行相应的配置操作。The node event processing device in the unified storage device provided by the present application includes a node event judgment module and a service processing sub-module; the node event judgment module is configured to obtain a plurality of node event messages sequentially and changes in the node online flag bit. record, determine the content of the node events currently occurring in the cluster and the corresponding nodes where the node events occur, and send the corresponding operation notification message to the corresponding service processing sub-module; the service processing sub-module is used for executing according to the operation notification message The corresponding configuration action.

本申请所提供的统一存储设备中的节点事件处理装置、方法、电子设备及计算机可读存储介质所具有的有益效果是:本申请通过结合分析节点事件消息与节点在线标志位的变化记录,可精确地获取节点事件的具体内容并定位到具体节点,进而可以高效地处理节点加入或退出集群的相关配置操作,有效保障了集群中数据的完整性与一致性。The node event processing device, method, electronic device and computer-readable storage medium in the unified storage device provided by the present application have the following beneficial effects: by combining the analysis of the node event message and the change record of the node online flag bit, the present application can Accurately obtain the specific content of the node event and locate the specific node, and then can efficiently handle the related configuration operations of the node joining or exiting the cluster, effectively ensuring the integrity and consistency of the data in the cluster.

附图说明Description of drawings

为了更清楚地说明现有技术和本申请实施例中的技术方案,下面将对现有技术和本申请实施例描述中需要使用的附图作简要的介绍。当然,下面有关本申请实施例的附图描述的仅仅是本申请中的一部分实施例,对于本领域普通技术人员来说,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图,所获得的其他附图也属于本申请的保护范围。In order to more clearly illustrate the prior art and the technical solutions in the embodiments of the present application, the following will briefly introduce the drawings to be used in the description of the prior art and the embodiments of the present application. Of course, the following drawings related to the embodiments of the present application describe only a part of the embodiments of the present application. For those of ordinary skill in the art, without any creative effort, they can also obtain other embodiments according to the provided drawings. The accompanying drawings and other drawings obtained also belong to the protection scope of the present application.

图1为本申请实施例公开的一种统一存储设备的架构示意图;FIG. 1 is a schematic structural diagram of a unified storage device disclosed in an embodiment of the present application;

图2为本申请实施例公开的一种统一存储设备中的节点事件处理装置的结构框图;2 is a structural block diagram of a node event processing apparatus in a unified storage device disclosed in an embodiment of the present application;

图3为本申请实施例公开的一种统一存储设备中的节点事件处理方法的流程图;3 is a flowchart of a method for processing node events in a unified storage device disclosed in an embodiment of the present application;

图4为本申请实施例公开的一种电子设备的结构框图。FIG. 4 is a structural block diagram of an electronic device disclosed in an embodiment of the present application.

具体实施方式Detailed ways

本申请的核心在于提供一种统一存储设备中的节点事件处理装置、方法、电子设备及计算机可读存储介质,以便精确地获取节点事件的具体内容,高效地处理好节点加入或退出集群的相关配置操作,保证集群中数据的完整性与一致性。The core of the present application is to provide a node event processing device, method, electronic device and computer-readable storage medium in a unified storage device, so as to accurately obtain the specific content of node events and efficiently handle the related nodes joining or exiting the cluster. Configure operations to ensure the integrity and consistency of data in the cluster.

为了对本申请实施例中的技术方案进行更加清楚、完整地描述,下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行介绍。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to describe the technical solutions in the embodiments of the present application more clearly and completely, the technical solutions in the embodiments of the present application will be introduced below with reference to the drawings in the embodiments of the present application. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of this application.

当前,统一存储是一个能在单一设备上运行和管理文件和应用程序的存储系统,整合了基于文件和基于块的访问。由于其支持不同的存储协议为主机系统提供数据存储,因此也被称为多协议存储。Currently, unified storage is a storage system that can run and manage files and applications on a single device, combining file-based and block-based access. Because it supports different storage protocols to provide data storage for the host system, it is also called multi-protocol storage.

参见图1,图1为本申请实施例公开的一种统一存储设备的架构示意图。当前,统一存储设备通常都由多个控制器组成冗余结构,最常用的方式为两个控制器即两个节点组成一个IO Group,进而再由多个IO Group组成最终的多控集群。Referring to FIG. 1, FIG. 1 is a schematic structural diagram of a unified storage device disclosed in an embodiment of the present application. Currently, a unified storage device usually consists of multiple controllers to form a redundant structure. The most common method is that two controllers, that is, two nodes, form an IO group, and then multiple IO groups form a final multi-controller cluster.

在这一架构下,由于掉电、业务故障或者其他意外导致的节点退出以及恢复加入事件就可能频繁的出现,因此如何及时、准确的处理相关场景,尤其是复杂的节点事件场景就变得非常重要。鉴于此,本申请提供了一种统一存储设备中的节点事件处理方案,可有效解决上述问题。Under this architecture, node exit and rejoin events may occur frequently due to power failure, business failure or other accidents. Therefore, how to deal with relevant scenarios in a timely and accurate manner, especially complex node event scenarios, becomes very important. important. In view of this, the present application provides a node event processing solution in a unified storage device, which can effectively solve the above problems.

参见图2所示,本申请实施例公开了一种统一存储设备中的节点事件处理装置,主要包括:Referring to FIG. 2 , an embodiment of the present application discloses a node event processing apparatus in a unified storage device, which mainly includes:

节点事件判断模块101;用于根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点,并向对应的业务处理子模块102发送对应的操作通知消息;The node event judgment module 101 is used to determine the content of the node events currently occurring in the cluster and the corresponding nodes in which the node events occur according to the multiple node event messages and the change records of the node online flag bits obtained in sequence, and report to the corresponding service. The processing submodule 102 sends a corresponding operation notification message;

业务处理子模块102;用于根据操作通知消息执行相应的配置操作。The service processing sub-module 102 is used for executing the corresponding configuration operation according to the operation notification message.

其中,需要强调的是,本申请实施例中,节点事件判断模块101与业务处理子模块102是集群中相互独立的两个模块,节点事件判断模块101专用于判断获取节点事件具体内容,进而可通知包括如业务处理子模块102等在内的其他模块进行相应的处理。Among them, it should be emphasized that in the embodiment of the present application, the node event judgment module 101 and the service processing sub-module 102 are two independent modules in the cluster, and the node event judgment module 101 is dedicated to judging and acquiring the specific content of the node event, and then can Notify other modules including the service processing sub-module 102 to perform corresponding processing.

值得一提的是,本申请实施例中,节点事件判断模块101不仅从集群管理中心接收节点事件消息,而且还会跟踪获取节点在线标志位的变化情况。具体地,考虑到相关技术中节点事件消息一般仅仅告知发生了节点加入事件或者节点退出事件,而并不会具体通知到哪个节点在哪些具体情况下发生了加入或者退出集群的事件,因此,本申请中,则具体设置有节点在线标志位,用以表示每个节点的当前实际在线状态,节点事件判断模块101可同时从集群管理中心获取节点的节点在线标志位的变化记录,从而结合节点事件消息获取节点事件的具体内容,以及发生该节点事件的具体是哪个节点。It is worth mentioning that, in the embodiment of the present application, the node event judgment module 101 not only receives the node event message from the cluster management center, but also tracks and obtains the change of the online flag bit of the node. Specifically, considering that the node event message in the related art generally only informs the occurrence of a node join event or a node exit event, and does not specifically notify which node has joined or exited the cluster under which specific circumstances, therefore, this In the application, a node online flag bit is specifically set to indicate the current actual online status of each node, and the node event judgment module 101 can simultaneously obtain the change record of the node online flag bit of the node from the cluster management center, so as to combine the node event The message obtains the specific content of the node event, and which node the node event occurs on.

进一步地,作为一个具体实施例,节点在线标志位可以以二进制的数据对的形式存储,用以表示同一IO Group中的两个节点的在线状态。并且,不妨以0表示节点不在线,以1表示节点在线。如此,同一个IO Group中的两个节点的在线状态可具体包括00、01、10、11这四种状态。Further, as a specific embodiment, the node online flag bit may be stored in the form of a binary data pair to represent the online status of two nodes in the same IO group. Also, it is possible to use 0 to indicate that the node is not online, and 1 to indicate that the node is online. In this way, the online states of two nodes in the same IO Group may specifically include four states of 00, 01, 10, and 11.

需要说明的是,基于节点在线标志位的变化记录,本申请不仅可以确定所发生的节点事件的具体类型,而且可以确定出节点事件是在哪个节点状态下发生的。容易理解的是,从11状态变为10状态,与从01状态变为10状态所需要进行的配置操作处理过程是不同的。It should be noted that, based on the change record of the node online flag bit, the present application can determine not only the specific type of the node event that occurs, but also the node state in which the node event occurs. It is easy to understand that the configuration operation process required to change from the 11 state to the 10 state is different from that of changing from the 01 state to the 10 state.

由此,当有节点事件发生时,可以通过节点事件消息以及节点在线状态标志位,判断当前事件的具体内容,并可为每种具体的事件内容设置对应的事件枚举变量,以便基于事件枚举变量传递消息至业务处理子模块102。节点事件判断模块101以通知的形式将对应的事件枚举变量传递到对应的业务处理子模块102中,同时可以在节点事件判断模块101中根据具体业务需求控制其他各子模块的执行顺序及相互依赖关系。Therefore, when a node event occurs, the specific content of the current event can be judged through the node event message and the node online status flag bit, and the corresponding event enumeration variable can be set for each specific event content, so that based on the event enumeration The arguments are passed to the business processing sub-module 102 . The node event judging module 101 transmits the corresponding event enumeration variable to the corresponding business processing sub-module 102 in the form of notification, and at the same time, the node event judging module 101 can control the execution sequence and mutual interaction of other sub-modules according to specific business requirements. dependencies.

其他子模块在收到来自节点事件判断模块101的通知后,通过监听具体的事件类型,从而触发自身模块内的状态机的运行,完成与业务处理子模块102相适应的配置修改。After receiving the notification from the node event judging module 101 , the other sub-modules trigger the operation of the state machine in their own modules by monitoring specific event types, and complete the configuration modification suitable for the service processing sub-module 102 .

其中,常见的具体事件内容可具体包括如下几种:初始化过程中第一个节点加入(00 → 01或10)、第二个节点加入(01或10 → 11)、整机掉电状态下第一个节点的恢复(00→ 01或10),单节点故障(11 → 01或10)等等。Among them, the common specific event contents can include the following: the first node joins (00 → 01 or 10) during the initialization process, the second node joins (01 or 10 → 11), the Recovery of one node (00 → 01 or 10), single node failure (11 → 01 or 10), etc.

具体地,在实际应用过程中,由于各种不可控因素,节点事件的发生可能会导致很多更为复杂的场景。例如,一个节点加入集群,在业务处理子模块102内部进行相应配置操作尚未完全完成时(比如仅处理到第二个修改相关配置的阶段),该节点又因为掉电等因素再次退出集群,造成节点事件叠加的情况。而本申请对于此类多事件叠加的复杂节点事件场景也同样适用。Specifically, in the actual application process, due to various uncontrollable factors, the occurrence of node events may lead to many more complex scenarios. For example, when a node joins the cluster, and the corresponding configuration operation in the service processing sub-module 102 has not been completely completed (for example, only the second stage of modifying the relevant configuration is processed), the node exits the cluster again due to factors such as power failure, resulting in The case where node events are superimposed. The present application is also applicable to such complex node event scenarios where multiple events are superimposed.

例如,在一个正常的单节点恢复流程中(01 → 11),该恢复节点再次退出,则此时集群管理中心提供的节点在线标志位立刻再次变为01,节点事件判断模块101内部的处理目标就是使业务处理子模块102内部的相关配置流程恢复为该节点上线之前的状态。由此可保证业务内部能实时准确跟踪集群节点变化情况,避免影响到具体的业务流程。For example, in a normal single-node recovery process (01 → 11), the recovery node exits again, then the node online flag provided by the cluster management center immediately becomes 01 again, and the processing target inside the node event judgment module 101 It is to restore the relevant configuration process inside the service processing sub-module 102 to the state before the node goes online. This can ensure that the changes of cluster nodes can be accurately tracked in real time within the business, so as to avoid affecting specific business processes.

在面对复杂的节点事件组合情况时,基于节点事件判断模块101的实时监控与及时响应,以及按照需求及时通知到其他子模块,业务处理子模块102可以在整个流程处理过程中及时准确地完成对应的操作流程。In the face of complex node event combinations, based on the real-time monitoring and timely response of the node event judging module 101, and timely notification to other sub-modules as required, the business processing sub-module 102 can complete the process in a timely and accurate manner during the entire process. corresponding operating procedures.

本申请实施例所公开的统一存储设备中的节点事件处理装置包括节点事件判断模块101和业务处理子模块102,节点事件判断模块101用于根据获取到的节点事件消息以及节点在线标志位的变化记录,确定集群当前所发生的节点事件内容以及发生该节点事件的节点,并向对应的业务处理子模块102发送对应的操作通知消息;业务处理子模块102;业务处理子模块102用于根据操作通知消息执行相应的配置操作。The node event processing apparatus in the unified storage device disclosed in the embodiment of the present application includes a node event judgment module 101 and a service processing sub-module 102. The node event judgment module 101 is configured to obtain the node event message and the change of the node online flag according to the acquired node event message. record, determine the node event content currently occurring in the cluster and the node where the node event occurs, and send the corresponding operation notification message to the corresponding service processing sub-module 102; the service processing sub-module 102; the service processing sub-module 102 is used for operating The notification message performs the corresponding configuration action.

可见,本申请实施例所公开的统一存储设备中的节点事件处理装置,通过结合分析节点事件消息与节点在线标志位的变化记录,可精确地获取节点事件的具体内容并定位到具体节点,进而可以高效地处理节点加入或退出集群的相关配置操作,有效保障了集群中数据的完整性与一致性。It can be seen that the node event processing device in the unified storage device disclosed in the embodiment of the present application can accurately obtain the specific content of the node event and locate the specific node by combining the analysis of the node event message and the change record of the node online flag bit, and then It can efficiently handle configuration operations related to node joining or exiting the cluster, effectively ensuring the integrity and consistency of data in the cluster.

作为一种具体实施例,本申请实施例所公开的统一存储设备中的节点事件处理装置在上述内容的基础上,节点事件判断模块101中记录有节点状态完成标志位;节点事件判断模块101还用于:As a specific embodiment, the node event processing device in the unified storage device disclosed in the embodiment of the present application is based on the above content, and the node event judgment module 101 records the node state completion flag bit; the node event judgment module 101 also Used for:

在业务处理子模块102根据操作通知消息执行相应的配置操作之后,对节点状态完成标志位进行对应的置位操作,以示对应节点已完成与其在线状态对应的配置操作。After the service processing sub-module 102 performs a corresponding configuration operation according to the operation notification message, a corresponding setting operation is performed on the node status completion flag to indicate that the corresponding node has completed the configuration operation corresponding to its online status.

容易理解的是,对于业务处理子模块102,应当认为一个节点在完成了加入或者退出集群的相关配置操作后,才被视为真正已经在线或者离线的节点。因此在本实施例中,除了节点在线状态标志位以外,还在节点事件判断模块101中设置有节点状态完成标志位。It is easy to understand that, for the service processing sub-module 102, it should be considered that a node is only regarded as a node that is actually online or offline until the related configuration operations of joining or exiting the cluster are completed. Therefore, in this embodiment, in addition to the node online status flag, a node status completion flag is also set in the node event judging module 101 .

当完成了节点加入或者退出集群的相关配置流程操作后,节点事件判断模块101才进行置位操作,如此可保障在节点完成了加入集群的相关配置操作、能够正常处理业务后,才与集群其他模块进行节点间的通信,保障通信数据的有效性。The node event judging module 101 performs the setting operation only after completing the relevant configuration process operations of the node joining or exiting the cluster, so as to ensure that after the node has completed the relevant configuration operations of joining the cluster and can process services normally, it can communicate with others in the cluster. The module communicates between nodes to ensure the validity of the communication data.

如上,本实施例利用节点在线标志位表示集群提供的节点在线情况,并利用节点事件判断模块101内部的节点状态完成标志位来表示此时与业务处理子模块102的处理流程进度相匹配的节点在线状态处理情况,特别适用于复杂的叠加事件场景。当出现叠加事件时,节点事件判断模块101可以依赖两个标志位,判断需要进行何种操作以将节点事件判断模块101内部的节点状态完成标志位更新到与节点在线标志位一致。As above, in this embodiment, the node online flag is used to indicate the online status of the node provided by the cluster, and the node status completion flag in the node event judgment module 101 is used to indicate the node that matches the processing flow progress of the service processing sub-module 102 at this time. Online status processing, especially suitable for complex superimposed event scenarios. When a superimposed event occurs, the node event judging module 101 can rely on two flags to determine what operation needs to be performed to update the node state completion flag inside the node event judging module 101 to be consistent with the node online flag.

作为一种具体实施例,本申请实施例所公开的统一存储设备中的节点事件处理装置在上述内容的基础上,节点事件消息中携带有集群当前运行状态信息。As a specific embodiment, the node event processing apparatus in the unified storage device disclosed in the embodiment of the present application is based on the above content, and the node event message carries the current running state information of the cluster.

具体地,本实施例中,集群当前运行状态信息用于反馈集群状态,例如可具体包括集群初始化状态、集群掉电状态等。Specifically, in this embodiment, the current running state information of the cluster is used to feed back the cluster state, for example, it may specifically include the cluster initialization state, the cluster power-off state, and the like.

参见图3所示,本申请实施例公开了一种统一存储设备中的节点事件处理方法,主要包括:Referring to FIG. 3 , an embodiment of the present application discloses a method for processing node events in a unified storage device, which mainly includes:

S201:基于节点事件判断模块,根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点。S201: Based on the node event judging module, according to the successively acquired multiple node event messages and the change records of the node online flags, determine the content of the node events currently occurring in the cluster in sequence and the corresponding nodes where the node events occur.

S202:向对应的业务处理子模块发送对应的操作通知消息。S202: Send a corresponding operation notification message to the corresponding service processing sub-module.

S203:基于业务处理子模块根据操作通知消息执行相应的配置操作。S203: Perform a corresponding configuration operation according to the operation notification message based on the service processing sub-module.

可见,本申请所提供的统一存储设备中的节点事件处理方法,通过结合分析节点事件消息与节点在线标志位的变化记录,可精确地获取节点事件的具体内容并定位到具体节点,进而可以高效地处理节点加入或退出集群的相关配置操作,有效保障了集群中数据的完整性与一致性。It can be seen that the node event processing method in the unified storage device provided by the present application can accurately obtain the specific content of the node event and locate the specific node by combining the analysis of the node event message and the change record of the node online flag bit, which can efficiently It handles the configuration operations of nodes joining or exiting the cluster, effectively ensuring the integrity and consistency of the data in the cluster.

作为一种具体实施例,本申请实施例所提供的统一存储设备中的节点事件处理方法在上述内容的基础上,在基于业务处理子模块根据操作通知消息执行相应的配置操作之后,还包括:As a specific embodiment, the method for processing node events in a unified storage device provided by the embodiment of the present application, on the basis of the above content, after performing a corresponding configuration operation based on the service processing sub-module according to the operation notification message, further includes:

对节点事件判断模块中记录的节点状态完成标志位进行对应的置位操作,以示对应节点已完成与其在线状态对应的配置操作。A corresponding setting operation is performed on the node status completion flag recorded in the node event judging module to indicate that the corresponding node has completed the configuration operation corresponding to its online status.

作为一种具体实施例,本申请实施例所提供的统一存储设备中的节点事件处理方法在上述内容的基础上,所述节点在线标志位以数据对的形式存储,用以表示同一IOGroup中的两个节点的在线状态。As a specific embodiment, the method for processing node events in a unified storage device provided by the embodiment of the present application is based on the above content, and the node online flag bit is stored in the form of a data pair, which is used to represent the data in the same IOGroup. The online status of both nodes.

作为一种具体实施例,本申请实施例所提供的统一存储设备中的节点事件处理方法在上述内容的基础上,所述节点事件消息中携带有集群当前运行状态信息。As a specific embodiment, the method for processing a node event in a unified storage device provided by an embodiment of the present application is based on the above content, and the node event message carries information about the current running state of the cluster.

关于上述统一存储设备中的节点事件处理方法的具体内容,可参考前述关于统一存储设备中的节点事件处理装置的详细介绍,这里就不再赘述。For the specific content of the node event processing method in the unified storage device, reference may be made to the foregoing detailed introduction on the node event processing device in the unified storage device, and details are not repeated here.

参见图4所示,本申请实施例公开了一种电子设备,包括:Referring to FIG. 4 , an embodiment of the present application discloses an electronic device, including:

存储器301,用于存储计算机程序;a memory 301 for storing computer programs;

处理器302,用于执行所述计算机程序以实现如上所述的任一种统一存储设备中的节点事件处理方法的步骤。The processor 302 is configured to execute the computer program to implement the steps of the node event processing method in any of the above-mentioned unified storage devices.

进一步地,本申请实施例还公开了一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机程序,所述计算机程序被处理器执行时用以实现如上所述的任一种统一存储设备中的节点事件处理方法的步骤。Further, an embodiment of the present application also discloses a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and the computer program is used to implement any of the above when executed by a processor. Steps of a method for processing node events in a unified storage device.

关于上述电子设备和计算机可读存储介质的具体内容,可参考前述关于统一存储设备中的节点事件处理方法的详细介绍,这里就不再赘述。For the specific content of the electronic device and the computer-readable storage medium, reference may be made to the foregoing detailed introduction on the node event processing method in the unified storage device, which will not be repeated here.

本申请中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似部分互相参见即可。对于实施例公开的设备而言,由于其与实施例公开的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。The various embodiments in this application are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments may be referred to each other. For the device disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and the relevant part can be referred to the description of the method.

还需说明的是,在本申请文件中,诸如“第一”和“第二”之类的关系术语,仅仅用来将一个实体或者操作与另一个实体或者操作区分开来,而不一定要求或者暗示这些实体或者操作之间存在任何这种实际的关系或者顺序。此外,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should also be noted that, in this application document, relational terms such as "first" and "second" are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require Or imply that there is any such actual relationship or order between these entities or operations. Furthermore, the terms "comprising", "comprising" or any other variation thereof are intended to encompass a non-exclusive inclusion such that a process, method, article or device that includes a list of elements includes not only those elements, but also includes not explicitly listed or other elements inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.

以上对本申请所提供的技术方案进行了详细介绍。本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想。应当指出,对于本技术领域的普通技术人员来说,在不脱离本申请原理的前提下,还可以对本申请进行若干改进和修饰,这些改进和修饰也落入本申请的保护范围内。The technical solutions provided by the present application have been introduced in detail above. Specific examples are used herein to illustrate the principles and implementations of the present application, and the descriptions of the above embodiments are only used to help understand the methods and core ideas of the present application. It should be pointed out that for those skilled in the art, without departing from the principles of the present application, several improvements and modifications can also be made to the present application, and these improvements and modifications also fall within the protection scope of the present application.

Claims (10)

1.一种统一存储设备中的节点事件处理装置,其特征在于,包括:1. A node event processing device in a unified storage device, characterized in that, comprising: 节点事件判断模块;用于根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点,并向对应的业务处理子模块发送对应的操作通知消息;The node event judgment module is used to determine the content of the node events currently occurring in the cluster and the corresponding node where the node event occurs according to the successively obtained multiple node event messages and the change records of the node online flags, and send them to the corresponding business processing. The sub-module sends the corresponding operation notification message; 所述业务处理子模块;用于根据所述操作通知消息执行相应的配置操作。the service processing sub-module; configured to perform a corresponding configuration operation according to the operation notification message. 2.根据权利要求1所述的节点事件处理装置,其特征在于,所述节点在线标志位以数据对的形式存储,用以表示同一IO Group中的两个节点的在线状态。2 . The node event processing device according to claim 1 , wherein the node online flag is stored in the form of a data pair to indicate the online status of two nodes in the same IO Group. 3 . 3.根据权利要求1所述的节点事件处理装置,其特征在于,所述节点事件判断模块中记录有节点状态完成标志位;所述节点事件判断模块还用于:3. The node event processing device according to claim 1, wherein a node state completion flag is recorded in the node event judging module; the node event judging module is also used for: 在所述业务处理子模块根据所述操作通知消息执行相应的配置操作之后,对所述节点状态完成标志位进行对应的置位操作,以示对应节点已完成与其在线状态对应的配置操作。After the service processing submodule performs a corresponding configuration operation according to the operation notification message, a corresponding setting operation is performed on the node status completion flag to indicate that the corresponding node has completed the configuration operation corresponding to its online status. 4.根据权利要求1至3任一项所述的节点事件处理装置,其特征在于,所述节点事件消息中携带有集群当前运行状态信息。4. The node event processing apparatus according to any one of claims 1 to 3, wherein the node event message carries information about the current running state of the cluster. 5.一种统一存储设备中的节点事件处理方法,其特征在于,包括:5. A method for processing node events in a unified storage device, comprising: 基于节点事件判断模块,根据依次获取到的多个节点事件消息以及节点在线标志位的变化记录,确定集群当前依次所发生的节点事件内容以及发生节点事件的对应节点;Based on the node event judging module, according to the successively acquired multiple node event messages and the change records of the node online flag bits, determine the node event content currently occurring in the cluster in sequence and the corresponding node where the node event occurs; 向对应的业务处理子模块发送对应的操作通知消息;Send the corresponding operation notification message to the corresponding service processing sub-module; 基于所述业务处理子模块根据所述操作通知消息执行相应的配置操作。A corresponding configuration operation is performed based on the service processing sub-module according to the operation notification message. 6.根据权利要求5所述的节点事件处理方法,其特征在于,在所述基于所述业务处理子模块根据所述操作通知消息执行相应的配置操作之后,还包括:6 . The node event processing method according to claim 5 , wherein after the service processing sub-module performs a corresponding configuration operation according to the operation notification message, the method further comprises: 6 . 对所述节点事件判断模块中记录的节点状态完成标志位进行对应的置位操作,以示对应节点已完成与其在线状态对应的配置操作。A corresponding setting operation is performed on the node status completion flag bit recorded in the node event judging module to indicate that the corresponding node has completed the configuration operation corresponding to its online status. 7.根据权利要求5所述的节点事件处理方法,其特征在于,所述节点在线标志位以数据对的形式存储,用以表示同一IO Group中的两个节点的在线状态。7 . The node event processing method according to claim 5 , wherein the node online flag is stored in the form of a data pair to indicate the online status of two nodes in the same IO Group. 8 . 8.根据权利要求5至7任一项所述的节点事件处理方法,其特征在于,所述节点事件消息中携带有集群当前运行状态信息。8. The node event processing method according to any one of claims 5 to 7, wherein the node event message carries information about the current running state of the cluster. 9.一种电子设备,其特征在于,包括:9. An electronic device, characterized in that, comprising: 存储器,用于存储计算机程序;memory for storing computer programs; 处理器,用于执行所述计算机程序以实现如权利要求5至8任一项所述的节点事件处理装置的步骤。A processor for executing the computer program to implement the steps of the node event processing apparatus according to any one of claims 5 to 8. 10.一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有计算机程序,所述计算机程序被处理器执行时用以实现如权利要求5至8任一项所述的节点事件处理装置的步骤。10. A computer-readable storage medium, characterized in that, a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the computer program is used to realize any one of claims 5 to 8. The steps of the node event handler.
CN202010934769.6A 2020-09-08 2020-09-08 Device, method, device and medium for node event processing in unified storage device Pending CN111813348A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010934769.6A CN111813348A (en) 2020-09-08 2020-09-08 Device, method, device and medium for node event processing in unified storage device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010934769.6A CN111813348A (en) 2020-09-08 2020-09-08 Device, method, device and medium for node event processing in unified storage device

Publications (1)

Publication Number Publication Date
CN111813348A true CN111813348A (en) 2020-10-23

Family

ID=72860179

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010934769.6A Pending CN111813348A (en) 2020-09-08 2020-09-08 Device, method, device and medium for node event processing in unified storage device

Country Status (1)

Country Link
CN (1) CN111813348A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113703669A (en) * 2021-07-16 2021-11-26 苏州浪潮智能科技有限公司 Management method, system, equipment and storage medium for cache partition
CN114546686A (en) * 2022-02-28 2022-05-27 西安热工研究院有限公司 Method and system for defining event and state based on message flow real-time acquisition equipment
CN114844799A (en) * 2022-05-27 2022-08-02 深信服科技股份有限公司 Cluster management method and device, host equipment and readable storage medium
CN115150459A (en) * 2022-06-30 2022-10-04 苏州浪潮智能科技有限公司 A method and system for managing a unified storage device cluster controller node

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105335251A (en) * 2015-09-23 2016-02-17 浪潮(北京)电子信息产业有限公司 Fault recovery method and system
CN107544758A (en) * 2017-08-29 2018-01-05 新华三云计算技术有限公司 The magnetic disk heartbeat detection method and device of cluster
CN108959045A (en) * 2018-06-08 2018-12-07 郑州云海信息技术有限公司 A kind of test method and system of NAS clustering fault performance of handoffs
CN109460315A (en) * 2018-12-20 2019-03-12 华迪计算机集团有限公司 Processing method, device, the computer equipment of shared disk malfunctioning node

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105335251A (en) * 2015-09-23 2016-02-17 浪潮(北京)电子信息产业有限公司 Fault recovery method and system
CN107544758A (en) * 2017-08-29 2018-01-05 新华三云计算技术有限公司 The magnetic disk heartbeat detection method and device of cluster
CN108959045A (en) * 2018-06-08 2018-12-07 郑州云海信息技术有限公司 A kind of test method and system of NAS clustering fault performance of handoffs
CN109460315A (en) * 2018-12-20 2019-03-12 华迪计算机集团有限公司 Processing method, device, the computer equipment of shared disk malfunctioning node

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113703669A (en) * 2021-07-16 2021-11-26 苏州浪潮智能科技有限公司 Management method, system, equipment and storage medium for cache partition
CN113703669B (en) * 2021-07-16 2023-08-04 苏州浪潮智能科技有限公司 Management method, system, device and storage medium of a cache partition
CN114546686A (en) * 2022-02-28 2022-05-27 西安热工研究院有限公司 Method and system for defining event and state based on message flow real-time acquisition equipment
CN114844799A (en) * 2022-05-27 2022-08-02 深信服科技股份有限公司 Cluster management method and device, host equipment and readable storage medium
CN115150459A (en) * 2022-06-30 2022-10-04 苏州浪潮智能科技有限公司 A method and system for managing a unified storage device cluster controller node
CN115150459B (en) * 2022-06-30 2023-08-04 苏州浪潮智能科技有限公司 Method and system for managing unified storage device cluster controller nodes

Similar Documents

Publication Publication Date Title
CN111813348A (en) Device, method, device and medium for node event processing in unified storage device
TWI229796B (en) Method and system to implement a system event log for system manageability
US9240937B2 (en) Fault detection and recovery as a service
CN108600029A (en) A kind of configuration file update method, device, terminal device and storage medium
CN103069749B (en) The method and system of the isolation of the problem in virtual environment
CN112468361A (en) Network connection state monitoring method and device, electronic equipment and storage medium
WO2015062200A1 (en) Method, apparatus, and system for processing distributed transaction submission faults
CN114531373A (en) Node state detection method, node state detection device, equipment and medium
WO2021174684A1 (en) Cutover information processing method, system and apparatus
CN103701657A (en) Device and method for monitoring and processing dysfunction of continuously running data processing system
CN113010337B (en) Fault detection method, master control node, working node and distributed system
CN118779232A (en) Test case execution method, device and server
CN118646640A (en) Network card fault repair method, device, baseboard management controller, system and medium
CN118550752A (en) Cloud platform fault detection and operation and maintenance system, method, equipment and storage medium
WO2024239569A1 (en) Cluster service processing method, server, and system
CN113381887B (en) Method and device for processing faults of computing nodes
CN118535402A (en) Memory test method, device, test equipment and system
CN111984475A (en) Troubleshooting method, device, equipment and medium
CN115567372A (en) A data backup method, device, storage medium and electronic equipment
CN116069583A (en) A database cluster management method, device and network equipment
CN114564321B (en) A snapshot creation method, system and computer readable storage medium
CN114253846B (en) Automatic test abnormality positioning method, device, equipment and readable storage medium
CN112148214B (en) A node information processing method, device and medium in a dual-control environment
CN113806123B (en) Server downtime positioning system and method and server
CN116155691B (en) Data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201023

RJ01 Rejection of invention patent application after publication