CN105634813A

CN105634813A - Method for automatically switching nodes under dual-computer environment based on network

Info

Publication number: CN105634813A
Application number: CN201610000774.3A
Authority: CN
Inventors: 宋辰
Original assignee: Inspur Electronic Information Industry Co Ltd
Current assignee: Inspur Electronic Information Industry Co Ltd
Priority date: 2016-01-04
Filing date: 2016-01-04
Publication date: 2016-06-01

Abstract

The invention discloses a method for automatically switching nodes under a dual-computer environment based on a network, belongs to a method for automatically switching nodes, and solves the problem that how to avoid the unavailability of the whole Lustre file system caused by the downtime of a single-point metadata server. The technical scheme is as follows: the management node, the standby management node and the login node are all connected to the mdt node and the ost node through the Ethernet switch, and the storage server is respectively connected to the management node, the standby management node, the login node, the mdt node and the ost node through the Ethernet switch; (1) deploying heartbeat service at all mds nodes and oss nodes; (2) modifying ha.cf file codes according to the actual environment of the cluster; (3) starting heartbeat service, and checking whether all IO nodes run the service; (4) manually dropping the Ethernet port of the MDS node, and observing the switching process; (5) and confirming the residual recovery time, and confirming that the Lustre partition is still normal after the time _ remaining is timed out.

Description

A kind of method that network two-shipper environment lower node automatically switches

Technical field

The present invention relates to a kind of method that node automatically switches, a kind of method that specifically network two-shipper environment lower node automatically switches.

Background technology

Instantly HPC high-performance computing sector, the requirement of I/O bandwidth is increased by be skyrocketed through and the computational tasks of data volume day by day, and NFS file system can not meet the demand of NFS. Lustre is as a parallel file system increased income, and its powerful scalability has been widely used in HPCC environment.

But while capacity and bandwidth disclosure satisfy that calculating I/O bandwidth demand along with Lustre file system, the pressure of Lustre server is also gradually increased, especially meta data server (MDS). As the node of storage Lustre metadata, pressure is more big, and fault rate is also more high. High availability is self-evident for the importance of cluster, not only safeguards stablizing of cluster hardware structure, reduces the generation of fault, and can ensure that stablizing of file system. Once cluster file system breaks down, being catastrophic for cluster, bring the interruption even loss of data of production environment, risk is self-evident.

Summary of the invention

The technical assignment of the present invention is to provide a kind of method that network two-shipper environment lower node automatically switches, and solves the disabled problem how avoiding the single-point meta data server machine of delaying to cause whole Lustre file system.

The technical assignment of the present invention realizes in the following manner,

A kind of method that network two-shipper environment lower node automatically switches, involved hardware includes storage server, InfiniBand switch, Ethernet switch, management node, standby management node, logs in node, mds node and oss node, management node, standby management node, logging in node and connect to mdt node and ost node each through Ethernet switch, storage server via Ethernet switch is connected respectively to management node, standby management node, logs in node, mdt node and ost node; Described method comprises the steps:

(1), service at all mds nodes and oss node deployment heartbeat;

(2), ha.cf document code is revised according to cluster actual environment;

(3), open heartbeat service, check whether that all I/O node have all run this service;

(4), do not unload Lustre subregion, manually the Ethernet interface down of MDS node is fallen, observe handoff procedure;

(5), confirm to remain recovery time, after treating time_remaining timing, confirm that Lustre subregion is still normal.

Mds node includes MDS01 node and MDS02 node, and MDS01 node is mdt host node, and MDS02 node is mdt secondary node.

Oss node includes OSS01 node, OSS02 node, OSS03 node and OSS04 node; OSS01 node, OSS02 node, OSS03 node and OSS04 node are ost carry node.

OSS01 node carry ost00 and ost01; OSS02 node carry ost02 and ost03; OSS03 node carry ost04 and ost05; OSS04 node carry ost06, ost07.

The method that a kind of network two-shipper environment lower node of the present invention automatically switches has the advantage that

1, by the method monitor in real time network Heartbeat, under two-shipper environment when host node is due to malfunction and failure, host node fault-signal is informed that secondary node, secondary node take over the service of host node or the carry of memory space by heartbeat mechanism automatically. By writing script the MDS node being deployed in Lustre file system and OSS node, by the service redundant of both nodes, it is achieved the non-stop run of mdt, it is ensured that the normal operation of Lustre file system;

2, this deployment way is disposed based on script, and by installing related service under assigned catalogue, timing detects network environment, and self only takes up a small amount of system resource. And by the amendment to script, can be applicable to multiple different HA environment, colony environment;

3, after this application is disposed, not affecting storage and file system performance, take storage server resource little, after MDS active node switches, mdt recovers availability automatically, it is not necessary to manual operation; When, after OSS single point failure, another OSS being mutually redundant takes over the ost of inefficacy, automatic carry, and checks availability. To be checked complete, recover the read-write of former ost.

Accompanying drawing explanation

Below in conjunction with accompanying drawing, the present invention is further described.

Accompanying drawing 1 is the hardware block diagram of a kind of method that network two-shipper environment lower node automatically switches.

Detailed description of the invention

The method a kind of network two-shipper environment lower node of the present invention automatically switched with reference to Figure of description and specific embodiment is described in detail below.

Embodiment 1:

The method that a kind of network two-shipper environment lower node of the present invention automatically switches, involved hardware includes storage server, InfiniBand switch, Ethernet switch, management node, standby management node, logs in node, mds node and oss node, management node, standby management node, logging in node and connect to mdt node and ost node each through Ethernet switch, storage server via Ethernet switch is connected respectively to management node, standby management node, logs in node, mdt node and ost node; Described method comprises the steps:

(1), service at all mds nodes and oss node deployment heartbeat;

(2), ha.cf document code is revised according to cluster actual environment;

In step (2), ha.cf document code is:

keepalive2

deadtime30

initdead120

#definedifferentudpportfordifferentpairs

#

udpport694

bcasteth0

use_logdoff

logfile/var/log/ha-log

auto_failbackoff

#

#youmustchangehere

#

nodemds01mds02

ping11.11.11.111.11.11.2

respawnhacluster/usr/lib64/heartbeat/ipfail

#addstonith

#stonith_hostmd2external/rackpdu

#stonithexternal/rackpdu/etc/ha.d/rackpdu.conf��

In step (4), observing handoff procedure is check MDS node or OSS node:

/ proc/fs/lustre/mdt/lustre-MDT0000/recovery_status and

/proc/fs/lustre/obdfilter/lustre-OST0000/recovery_status��

By detailed description of the invention above, described those skilled in the art can be easy to realize the present invention. It is understood that the present invention is not limited to above-mentioned detailed description of the invention. On the basis of disclosed embodiment, described those skilled in the art can the different technical characteristic of combination in any, thus realizing different technical schemes.

Except the technical characteristic described in description, it is the known technology of those skilled in the art.

Claims

1. the method that a network two-shipper environment lower node automatically switches, it is characterized in that involved hardware includes storage server, InfiniBand switch, Ethernet switch, management node, standby management node, logs in node, mds node and oss node, management node, standby management node, logging in node and connect to mdt node and ost node each through Ethernet switch, storage server via Ethernet switch is connected respectively to management node, standby management node, logs in node, mdt node and ost node; Described method comprises the steps:

(1), service at all mds nodes and oss node deployment heartbeat;

(2), ha.cf document code is revised according to cluster actual environment;

(4), manually the Ethernet interface down of MDS node is fallen, observe handoff procedure;

2. the method that a kind of network two-shipper environment lower node according to claim 1 automatically switches, it is characterised in that mds node includes MDS01 node and MDS02 node, and MDS01 node is mdt host node, and MDS02 node is mdt secondary node.

3. the method that a kind of network two-shipper environment lower node according to claim 1 automatically switches, it is characterised in that oss node includes OSS01 node, OSS02 node, OSS03 node and OSS04 node; OSS01 node, OSS02 node, OSS03 node and OSS04 node are ost carry node.

4. the method that a kind of network two-shipper environment lower node according to claim 3 automatically switches, it is characterised in that OSS01 node carry ost00 and ost01; OSS02 node carry ost02 and ost03; OSS03 node carry ost04 and ost05; OSS04 node carry ost06, ost07.