CN103986789A

CN103986789A - Method for realizing dual redundant of NFS (network file system) nodes in HADOOP HA (home address) cluster based on NFS

Info

Publication number: CN103986789A
Application number: CN201410246212.8A
Authority: CN
Inventors: 张宪昭
Original assignee: Inspur Electronic Information Industry Co Ltd
Current assignee: Inspur Electronic Information Industry Co Ltd
Priority date: 2014-06-05
Filing date: 2014-06-05
Publication date: 2014-08-13

Abstract

The invention discloses a method for realizing dual redundant of NFS (network file system) nodes in an HADOOP HA (home address) cluster based on an NFS and belongs to the technical field of computer servers. The method for realizing the dual redundant of the NFS nodes in the HADOOP HA cluster based on the NFS comprises the following steps: 1) mounting a DRBD (distributed replicated block device) on two servers respectively by virtue of network interaction, so that /dev/sdb1 real-time synchronization of the two servers is realized; 2) configuring NFS service, and then configuring heartbeat; 3) mounting NFS sharing on two servers used as Namenodes, and setting automatic mounting of the NFS; 4) installing HADOOP, and configuring HA of each Namenode; 5) when an NFS server monitors that resource of a main NFS server is abnormal through heatbeat, automatically taking over IP resource and DRDB resource of the NFS from the NFS server, and starting the NFS service. The method for realizing the dual redundant of the NFS nodes in the HADOOP HA cluster based on the NFS has the advantages that the stability of the HADOOP HA cluster is improved and continuous operation of a service is guaranteed.

Description

A kind of method of NFS node dual-computer redundancy in HADOOP HA cluster of realizing based on NFS

Technical field

The present invention relates to computer server technical field, specifically a kind of method of NFS node dual-computer redundancy in HADOOP HA cluster of realizing based on NFS.

Background technology

Just as we all know, NameNode is the important component part of cloud computing technology at Hadoop(hadoop, it is the most popular and the most stable instrument in current cloud computing, large data solution, be one and can carry out to mass data the software frame of distributed treatment) there is Single Point of Faliure problem in system, for addressing this problem, Hadoop 2.0 has released namenode node HA function, one is active node, one is standby node, and Active Node externally provides service as Primary NameNode.Standby Node, in Safe mode pattern, preserves the up-to-date metadata information of Primary NameNode in internal memory.Active Node and Standby Node share storage by NFS and carry out mutual edits.DataNode sends Block location information to Active Node and Standby Node simultaneously.After keeper determines that Primary NameNode breaks down, Standby Node is switched to Primary NameNode.Owing to having preserved the up-to-date information of all metadata in Standby Node internal memory, therefore can directly externally provide service.The shortcoming of this sets of plan is, it is a single-point that NFS shares storage, when NFS(is provided, is writing a Chinese character in simplified form of Network File System, i.e. NFS) during the mechanical disorder of service, the whole Hadoop cluster machine of will delaying.

Summary of the invention

Technical assignment of the present invention is to provide the method for NFS node dual-computer redundancy in a kind of HADOOP HA cluster of realizing based on NFS.

Technical assignment of the present invention is realized in the following manner, and the method step is as follows:

1) two-server is by the network interconnection, and DRBD is installed, realize two machines /dev/sdb1 real-time synchronization;

2) configuration NFS service, configuration heartbeat after completing;

3) two server carry NFS for namenode share, and the automatic carry of NFS is set;

4) Hadoop is installed, and configures the HA of namenode;

5) when monitoring the resource exception of main nfs server from nfs server by heatbeat, from nfs server, take over NFS ip resource and drbd resource, start NFS service.

Described network is Ethernet.

Described namenode node carries out carry by NFS ip, when NFS master server fault, can confirm that NFS carry is normal by df order, thereby guaranteed the normal operation of Hadoop cluster on namenode.

The HA resource of described configuration heartbeat comprises ip, the drbd resource of NFS service.

In a kind of HADOOP HA cluster of realizing based on NFS of the present invention the method for NFS node dual-computer redundancy compared to the prior art, improved cluster stability, guaranteed the continuous operation of business.

Accompanying drawing explanation

Accompanying drawing 1 is nfs server failover flow chart.

Accompanying drawing 2 is Hadoop cluster topology graph.

Embodiment

Embodiment 1:

The method step is as follows:

1) two-server is by the network interconnection, DRBD(Distributed Replicated Block Device be installed be one with software realize, without share, the storage replication solution of mirror image block device content between server), realize two machines /dev/sdb1 real-time synchronization;

2) configuration NFS service, configuration heartbeat after completing;

4) Hadoop is installed, and configures the HA of namenode;

Embodiment 2:

The method step is as follows:

1) two-server (is referred to by Xerox company and creates and by Xerox by Ethernet, the baseband LAN standard that Intel and DEC develop jointly) interconnected, DRBD(Distributed Replicated Block Device be installed be one with software realize, without share, the storage replication solution of mirror image block device content between server), realize two machines /dev/sdb1 real-time synchronization;

2) configuration NFS service, configuration heartbeat after completing;

4) Hadoop is installed, and configures the HA of namenode;

Embodiment 3:

The method step is as follows:

2) configuration NFS service, configuration heartbeat after completing, the HA resource of configuration heartbeat comprises ip, the drbd resource of NFS service;

4) Hadoop is installed, and configures the HA of namenode;

5) when monitoring the resource exception of main nfs server from nfs server by heatbeat, from nfs server, take over NFS ip resource and drbd resource, start NFS service; Namenode node carries out carry by NFS ip, when NFS master server fault, can confirm that NFS carry is normal by df order, thereby guaranteed the normal operation of Hadoop cluster on namenode.

Claims

1. a method for NFS node dual-computer redundancy in the HADOOP HA cluster of realization based on NFS, is characterized in that, the method step is as follows:

2) configuration NFS service, configuration heartbeat after completing;

4) Hadoop is installed, and configures the HA of namenode;

2. the method for NFS node dual-computer redundancy in a kind of HADOOP HA cluster of realizing based on NFS according to claim 1, is characterized in that, described network is Ethernet.

3. the method for NFS node dual-computer redundancy in a kind of HADOOP HA cluster of realizing based on NFS according to claim 1, it is characterized in that, described namenode node carries out carry by NFS ip, when NFS master server fault, on namenode, can confirm that NFS carry is normal by df order, thereby guarantee the normal operation of Hadoop cluster.

4. the method for NFS node dual-computer redundancy in a kind of HADOOP HA cluster of realizing based on NFS according to claim 1, is characterized in that, the HA resource of described configuration heartbeat comprises ip, the drbd resource of NFS service.