WO2022116766A1 - Data storage system and construction method therefor - Google Patents

Data storage system and construction method therefor Download PDF

Info

Publication number
WO2022116766A1
WO2022116766A1 PCT/CN2021/128409 CN2021128409W WO2022116766A1 WO 2022116766 A1 WO2022116766 A1 WO 2022116766A1 CN 2021128409 W CN2021128409 W CN 2021128409W WO 2022116766 A1 WO2022116766 A1 WO 2022116766A1
Authority
WO
WIPO (PCT)
Prior art keywords
node
hdfs
nfs
server
client
Prior art date
Application number
PCT/CN2021/128409
Other languages
French (fr)
Chinese (zh)
Inventor
潘道华
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2022116766A1 publication Critical patent/WO2022116766A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • G06F16/1824Distributed file systems implemented using Network-attached Storage [NAS] architecture
    • G06F16/183Provision of network file services by network file servers, e.g. by using NFS, CIFS
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/176Support for shared access to files; File sharing support
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems

Abstract

A data storage system and a construction method therefor. The data storage system (100) comprises an HDFS server node (110), a central node (120), and an NFS client node (130), wherein the HDFS server node (110) is configured with an NFS service function; the central node (120) is configured with an HDFS client (121) and an NFS server (122), the HDFS client (121) is used for accessing shared information provided by the HDFS server node (110), and the NFS server (122) is used for providing an access entry for accessing the shared information by means of the HDFS client (121); and the NFS client node (130) is configured to access the shared information by means of the access entry.

Description

数据存储系统及其构建方法Data storage system and its construction method
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请基于申请号为202011404986.0、申请日为2020年12月04日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。This application is based on the Chinese patent application with the application number of 202011404986.0 and the filing date of December 4, 2020, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is incorporated herein by reference.
技术领域technical field
本申请实施例涉及但不限于数据处理技术领域,尤其涉及一种数据存储系统及其构建方法。The embodiments of the present application relate to, but are not limited to, the technical field of data processing, and in particular, relate to a data storage system and a construction method thereof.
背景技术Background technique
在任何系统进行商业使用时,数据都是系统的核心价值,有些数据集中存储于数据库中,有些数据则以文件的形式存放于磁盘上,而不论哪种存储形式,数据的安全、维护及操作等都是一个复杂并且非常重要的事情。在相关技术中,常用的方式是采用硬件(如磁盘阵列等)或软件(如共享存储等)的方式存储数据。采用硬件的方式往往都需要另外增加硬件配置,从而使得成本相对较高;而采用软件的方式则往往存在安全性较差的问题,并且,随着网络环境的复杂化,数据维护的工作量也随之变得很大。因此,如何解决相关技术中所存在的问题,是一个亟待解决的事情。When any system is used commercially, data is the core value of the system. Some data are centrally stored in the database, and some data are stored in the form of files on the disk. Regardless of the storage form, data security, maintenance and operation Etc. is a complex and very important thing. In the related art, a common way is to use hardware (such as a disk array, etc.) or software (such as shared storage, etc.) to store data. The way of using hardware often requires additional hardware configuration, which makes the cost relatively high; while the way of using software often has the problem of poor security, and with the complexity of the network environment, the workload of data maintenance also increases. Then it became huge. Therefore, how to solve the problems existing in the related technologies is an urgent matter to be solved.
发明内容SUMMARY OF THE INVENTION
以下是对本文详细描述的主题的概述。本概述并非是为了限制权利要求的保护范围。The following is an overview of the topics detailed in this article. This summary is not intended to limit the scope of protection of the claims.
本申请实施例提供了一种数据存储系统及其构建方法,能够在提高数据存储的安全性的情况下维持较高的性价比。The embodiments of the present application provide a data storage system and a construction method thereof, which can maintain high cost performance while improving the security of data storage.
第一方面,本申请实施例提供了一种数据存储系统,包括:分布式文件系统(Hadoop Distributed File System,HDFS)服务端节点,被配置有网络文件系统(Network File System,NFS)服务功能;中心节点,与所述HDFS服务端节点通信连接,所述中心节点被配置有HDFS客户端和NFS服务端,所述HDFS客户端用于访问由所述HDFS服务端节点提供的共享信息,所述NFS服务端用于提供通过所述HDFS客户端访问所述共享信息的访问入口;NFS客户端节点,与所述中心节点通信连接,所述NFS客户端节点被配置为通过所述访问入口访问所述共享信息。In a first aspect, an embodiment of the present application provides a data storage system, including: a distributed file system (Hadoop Distributed File System, HDFS) server node configured with a network file system (Network File System, NFS) service function; The central node is in communication connection with the HDFS server node, the central node is configured with an HDFS client and an NFS server, the HDFS client is used to access the shared information provided by the HDFS server node, the The NFS server is used to provide an access portal for accessing the shared information through the HDFS client; the NFS client node is connected to the central node in communication, and the NFS client node is configured to access the shared information through the access portal. shared information.
第二方面,本申请实施例还提供了一种数据存储系统的构建方法,包括:部署HDFS服务端节点,并在所述HDFS服务端节点配置NFS服务功能;部署中心节点,使得所述中心节点与所述HDFS服务端节点通信连接;在所述中心节点配置HDFS客户端,其中,所述HDFS客户端用于访问由所述HDFS服务端节点提供的共享信息;在所述中心节点配置NFS服务端,其中,所述NFS服务端用于提供通过所述HDFS客户端访问所述共享信息的访问入口;部署NFS客户端节点,使得所述NFS客户端节点与所述中心节点通信连接,并将所述NFS客户端节点配置为通过所述访问入口访问所述共享信息。In a second aspect, an embodiment of the present application further provides a method for constructing a data storage system, including: deploying an HDFS server node, and configuring an NFS service function on the HDFS server node; deploying a central node so that the central node communicate with the HDFS server node; configure an HDFS client on the central node, wherein the HDFS client is used to access the shared information provided by the HDFS server node; configure the NFS service on the central node The NFS server is used to provide an access portal for accessing the shared information through the HDFS client; deploy an NFS client node, so that the NFS client node is connected to the central node in communication, and The NFS client node is configured to access the shared information through the access portal.
本申请的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本申请而了解。本申请的目的和其他优点可通过在说明书、权利要求书以及附图中所特别指出的结构来实现和获得。Other features and advantages of the present application will be set forth in the description which follows, and in part will be apparent from the description, or may be learned by practice of the present application. The objectives and other advantages of the application may be realized and attained by the structure particularly pointed out in the description, claims and drawings.
附图说明Description of drawings
附图用来提供对本申请技术方案的进一步理解,并且构成说明书的一部分,与本申请的实施例一起用于解释本申请的技术方案,并不构成对本申请技术方案的限制。The accompanying drawings are used to provide a further understanding of the technical solutions of the present application, and constitute a part of the specification. They are used to explain the technical solutions of the present application together with the embodiments of the present application, and do not constitute a limitation on the technical solutions of the present application.
图1是本申请一个实施例提供的数据存储系统的示意图;1 is a schematic diagram of a data storage system provided by an embodiment of the present application;
图2是本申请另一实施例提供的数据存储系统的示意图;2 is a schematic diagram of a data storage system provided by another embodiment of the present application;
图3是本申请一个实施例提供的数据存储系统的构建方法的流程图;3 is a flowchart of a method for constructing a data storage system provided by an embodiment of the present application;
图4是本申请另一实施例提供的构建方法中部署HDFS服务端节点的流程图;4 is a flowchart of deploying an HDFS server node in a construction method provided by another embodiment of the present application;
图5是本申请另一实施例提供的构建方法中在中心节点配置HDFS客户端的流程图;5 is a flowchart of configuring an HDFS client on a central node in a construction method provided by another embodiment of the present application;
图6是本申请另一实施例提供的构建方法中在中心节点配置NFS服务端的流程图;6 is a flowchart of configuring an NFS server in a central node in a construction method provided by another embodiment of the present application;
图7是本申请一个具体示例提供的数据存储系统的构建方法的流程图。FIG. 7 is a flowchart of a method for constructing a data storage system provided by a specific example of the present application.
具体实施方式Detailed ways
为了使本申请的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本申请进行进一步详细说明。应当理解,此处所描述的具体实施例仅用以解释本申请,并不用于限定本申请。In order to make the purpose, technical solutions and advantages of the present application more clearly understood, the present application will be described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present application, but not to limit the present application.
需要说明的是,虽然在装置示意图中进行了功能模块划分,在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于装置中的模块划分,或流程图中的顺序执行所示出或描述的步骤。It should be noted that although the functional modules are divided in the schematic diagram of the device, and the logical sequence is shown in the flowchart, in some cases, the modules may be divided differently from the device, or executed in the order in the flowchart. steps shown or described.
本申请提供了一种数据存储系统及其构建方法,通过在面向数据存储的HDFS服务端节点和面向用户的NFS客户端节点之间增设中心节点,并且在中心节点配置HDFS客户端和NFS服务端,使得中心节点既是HDFS服务端节点的客户端,又是NFS客户端节点的服务端,因此,可以将HDFS服务端节点和NFS客户端节点分设于不同的网络中,例如将HDFS服务端节点设置于安全性较高的内部网络,将NFS客户端节点设置于公共网络,从而可以提高数据存储的安全性。此外,由于HDFS服务端节点和NFS客户端节点可以分设于不同的网络中从而提高数据存储的安全性,因此不需要采用磁盘阵列等硬件方式实现数据的存储,不需要额外增加硬件配置,从而可以在提高数据存储的安全性的情况下维持较高的性价比。The present application provides a data storage system and a construction method thereof. By adding a central node between a data storage-oriented HDFS server node and a user-oriented NFS client node, and configuring the HDFS client and NFS server on the central node , so that the central node is both the client of the HDFS server node and the server of the NFS client node. Therefore, the HDFS server node and the NFS client node can be set in different networks. For example, the HDFS server node can be set For the internal network with high security, set the NFS client node on the public network, so as to improve the security of data storage. In addition, because the HDFS server node and NFS client node can be located in different networks to improve the security of data storage, there is no need to use hardware methods such as disk arrays to achieve data storage, and no additional hardware configuration is required. Maintain a high price/performance ratio while improving the security of data storage.
下面结合附图,对本申请实施例作进一步阐述。The embodiments of the present application will be further described below with reference to the accompanying drawings.
如图1所示,图1是本申请一个实施例提供的数据存储系统的示意图。在图1的示例中,该数据存储系统100包括HDFS服务端节点110、中心节点120和NFS客户端节点130,其中,HDFS服务端节点110与中心节点120通信连接,中心节点120与NFS客户端节点130通信连接。As shown in FIG. 1 , FIG. 1 is a schematic diagram of a data storage system provided by an embodiment of the present application. In the example of FIG. 1 , the data storage system 100 includes an HDFS server node 110, a central node 120 and an NFS client node 130, wherein the HDFS server node 110 is communicatively connected to the central node 120, and the central node 120 is connected to the NFS client Node 130 is communicatively connected.
其中,HDFS服务端节点110被配置有NFS服务功能;中心节点120被配置有HDFS客户端121和NFS服务端122,HDFS客户端用于访问由HDFS服务端节点提供的共享信息,NFS服务端用于提供通过该HDFS客户端访问该共享信息的访问入口;NFS客户端节点130被配置为通过该访问入口访问该共享信息。由于中心节点120配置有HDFS客户端121和NFS服务端122,因此,中心节点120既是HDFS服务端节点110的客户端,又是NFS客户端节点130的 服务端,所以,HDFS服务端节点110和NFS客户端节点130可以分设于不同的网络中,例如HDFS服务端节点110设置于安全性较高的内部网络中,NFS客户端节点130设置于安全性并不高的公共网络中,从而可以提高HDFS服务端节点110存储数据的安全性。此外,由于HDFS服务端节点110和NFS客户端节点130可以分设于不同的网络中从而提高数据存储的安全性,因此,在HDFS服务端节点110中,不需要采用磁盘阵列等硬件方式实现数据的存储,不需要增加额外的硬件配置,可以采用软件(如共享存储等)的方式存储数据,从而可以达到在提高数据存储的安全性的情况下维持较高的性价比的目的。The HDFS server node 110 is configured with the NFS service function; the central node 120 is configured with an HDFS client 121 and an NFS server 122. The HDFS client is used to access the shared information provided by the HDFS server node, and the NFS server is used for to provide an access portal for accessing the shared information through the HDFS client; the NFS client node 130 is configured to access the shared information through the access portal. Since the central node 120 is configured with the HDFS client 121 and the NFS server 122, the central node 120 is both the client of the HDFS server node 110 and the server of the NFS client node 130. Therefore, the HDFS server node 110 and The NFS client nodes 130 can be located in different networks. For example, the HDFS server node 110 is located in an internal network with high security, and the NFS client node 130 is located in a public network with low security. The HDFS server node 110 stores data security. In addition, since the HDFS server node 110 and the NFS client node 130 can be located in different networks to improve the security of data storage, in the HDFS server node 110, there is no need to use a hardware method such as a disk array to realize data storage. Storage, without adding additional hardware configuration, can use software (such as shared storage, etc.) to store data, so as to achieve the purpose of maintaining high cost performance while improving the security of data storage.
HDFS服务端节点110中配置有以Hadoop文件存储计算框架为核心的文件存储系统,当HDFS服务端节点110的数量为多个时,多个HDFS服务端节点110之间可以构建成分布式文件系统。其中,Hadoop是一种分布式系统基础架构,用户可以在不了解分布式底层细节的情况下,利用Hadoop开发分布式程序,充分利用集群的优势进行高速运算和存储。HDFS服务端节点110所配置的文件存储系统中包括有HDFS组件,HDFS组件具有高可靠性、高扩展性、高效性、高容错性和低成本等优点,可以用来部署在低廉的硬件配置上,并提供高吞吐量来访问应用程序的数据,适用于有着超大数据集的应用程序。The HDFS server node 110 is configured with a file storage system centered on the Hadoop file storage computing framework. When the number of HDFS server nodes 110 is multiple, a distributed file system can be constructed among the multiple HDFS server nodes 110 . Among them, Hadoop is a distributed system infrastructure. Users can use Hadoop to develop distributed programs without knowing the underlying details of distribution, and make full use of the advantages of clusters for high-speed computing and storage. The file storage system configured by the HDFS server node 110 includes HDFS components. The HDFS components have the advantages of high reliability, high scalability, high efficiency, high fault tolerance and low cost, and can be used to deploy on low-cost hardware configurations. , and provides high throughput to access application data, suitable for applications with very large data sets.
需要说明的是,虽然HDFS组件能够提供对所存储的数据文件进行操作的接口,但这些接口对于系统的使用并不够友好,为了在使用上能够更为简便友好,本实施例在HDFS服务端节点110内的HDFS组件中,集成并启用NFS服务功能,使得用户可以采用NFS的方式访问HDFS组件所维护的共享信息,以满足用户在使用上的简单、友好、便捷。It should be noted that although the HDFS component can provide interfaces for operating the stored data files, these interfaces are not friendly enough for the use of the system. In the HDFS component in 110, the NFS service function is integrated and enabled, so that users can use NFS to access the shared information maintained by the HDFS component, so as to meet the user's simplicity, friendliness and convenience in use.
中心节点120中配置有用于访问由HDFS服务端节点110提供的共享信息的HDFS客户端121,以及用于提供通过HDFS客户端121访问该共享信息的访问入口的NFS服务端122。其中,NFS是基于用户数据报协议(User Datagram Protocol,UDP)或者网际互连协议(Internet Protocol,IP)的应用,主要采用远程过程调用(Remote Procedure Call,RPC)机制实现对远程文件的存取操作,并且该操作与机器、操作系统以及底层传送协议无关。NFS能够支持多主机之间共享数据文件,可以将另一机器上的某个目录或分区挂载到当前机器上,此时,通过当前机器访问另一机器上的某个目录或分区中的数据文件时,就像访问本地机器上的数据文件一样。因此,NFS服务端122即是能够提供当前机器上的某个目录或分区供NFS客户端进行挂载使用的服务端程序。The central node 120 is configured with an HDFS client 121 for accessing the shared information provided by the HDFS server node 110 , and an NFS server 122 for providing an access portal for accessing the shared information through the HDFS client 121 . Among them, NFS is an application based on User Datagram Protocol (UDP) or Internet Protocol (Internet Protocol, IP), mainly using Remote Procedure Call (RPC) mechanism to access remote files operation, and the operation is independent of the machine, the operating system, and the underlying transport protocol. NFS can support sharing data files between multiple hosts, and can mount a directory or partition on another machine to the current machine. At this time, the data in a directory or partition on another machine can be accessed through the current machine. file as if accessing a data file on the local machine. Therefore, the NFS server 122 is a server program that can provide a certain directory or partition on the current machine for the NFS client to mount and use.
NFS客户端节点130能够通过由中心节点120的NFS服务端122所提供的访问入口对HDFS服务端节点110中的共享信息进行访问。NFS客户端节点130被配置为能够将由NFS服务端122提供的访问目录挂载在本地,以便于用户通过该访问目录访问该共享信息。值得注意的是,如果NFS客户端节点130当前所挂载的实例出现异常,NFS客户端节点130能够自动完成新的服务端链路的建立,并自动完成实例的挂载,使得使用上的稳定性能够得到保障,而使用上的简易性也能够方便了用户的使用。由于NFS客户端节点130所挂载的访问目录最终仍然是HDFS服务端节点110所维护的目录,因此,数据文件的安全性能够得到保障。需要说明的是,NFS客户端节点130为用户提供了采用NFS的方式访问HDFS服务端节点110所维护的共享信息,NFS方式的操作友好简便,能够为数据存储系统100的使用处理提供便捷的访问方式。The NFS client node 130 can access the shared information in the HDFS server node 110 through the access portal provided by the NFS server 122 of the central node 120 . The NFS client node 130 is configured to be able to mount the access directory provided by the NFS server 122 locally, so that the user can access the shared information through the access directory. It is worth noting that if the instance currently mounted by the NFS client node 130 is abnormal, the NFS client node 130 can automatically complete the establishment of a new server link and automatically complete the mounting of the instance, so that the use is stable. The performance can be guaranteed, and the simplicity of use can also facilitate the use of users. Since the access directory mounted by the NFS client node 130 is still the directory maintained by the HDFS server node 110 in the end, the security of the data file can be guaranteed. It should be noted that the NFS client node 130 provides users with access to the shared information maintained by the HDFS server node 110 in the NFS mode. The NFS mode is friendly and simple to operate, and can provide convenient access for the use and processing of the data storage system 100. Way.
通过采用包括有上述的HDFS服务端节点110、中心节点120和NFS客户端节点130的数据存储系统100,能够整合HDFS和NFS的各自优点,为用户提供友好简便的操作,并且,结 合HDFS服务端节点110、中心节点120和NFS客户端节点130的多层部署,能够在多网络混合的复杂环境中,提高数据存储的安全性并维持较高的性价比。By adopting the data storage system 100 including the above-mentioned HDFS server node 110 , the central node 120 and the NFS client node 130 , the respective advantages of HDFS and NFS can be integrated to provide users with friendly and simple operations, and combined with the HDFS server The multi-layer deployment of the node 110, the central node 120 and the NFS client node 130 can improve the security of data storage and maintain a relatively high cost performance in a complex environment where multiple networks are mixed.
在一实施例中,NFS服务端122能够提供通过HDFS客户端121访问HDFS服务端节点110中的共享信息的访问目录,其中,该访问目录为该共享信息在中心节点120的映射目录。In one embodiment, the NFS server 122 can provide an access directory for accessing the shared information in the HDFS server node 110 through the HDFS client 121 , wherein the access directory is a mapping directory of the shared information in the central node 120 .
需要说明的是,NFS服务端122所提供的访问目录为HDFS服务端节点110采用NFS方式对外共享的目录在中心节点120中的映射目录,即是说,中心节点120可以以独立镜像的方式供用户使用,从而能够在云化方向上实现数据存储,使得数据存储系统100的弹性部署得以实现。It should be noted that the access directory provided by the NFS server 122 is the mapping directory in the central node 120 of the directory shared by the HDFS server node 110 in the NFS manner. Users can use it, so that data storage can be realized in the direction of cloudification, so that the elastic deployment of the data storage system 100 can be realized.
在一实施例中,NFS客户端节点130还被配置为可以执行如下操作:In one embodiment, the NFS client node 130 is further configured to perform the following operations:
当获取访问共享信息的访问请求,根据该访问请求通过中心节点120向HDFS服务端节点110请求共享信息,并在通过中心节点120获取到来自HDFS服务端节点110的共享信息后,展示该共享信息。When an access request for accessing shared information is obtained, the central node 120 requests the shared information from the HDFS server node 110 according to the access request, and after obtaining the shared information from the HDFS server node 110 through the central node 120, the shared information is displayed .
基于上述实施例中的数据存储系统100,当用户在NFS客户端节点130中进行操作而访问存储于HDFS服务端节点110中的共享信息时,NFS客户端节点130会获取到对应的访问请求,此时,NFS客户端节点130会根据该访问请求通过中心节点120向HDFS服务端节点110请求对应的共享信息,当HDFS服务端节点110接收到该访问请求后,HDFS服务端节点110会通过中心节点120向NFS客户端节点130反馈对应的共享信息,当NFS客户端节点130接收到该共享信息后,即可以将该共享信息向用户展示。以一个具体示例来说,用户在NFS客户端节点130中进行操作以访问HDFS服务端节点110中的共享目录时,就像访问本地目录一样方便快捷,从而能够提高用户的使用体验。Based on the data storage system 100 in the above embodiment, when a user operates in the NFS client node 130 to access the shared information stored in the HDFS server node 110, the NFS client node 130 will obtain the corresponding access request, At this time, the NFS client node 130 will request the corresponding shared information from the HDFS server node 110 through the central node 120 according to the access request. When the HDFS server node 110 receives the access request, the HDFS server node 110 will pass the central node 110 The node 120 feeds back the corresponding shared information to the NFS client node 130, and when the NFS client node 130 receives the shared information, it can display the shared information to the user. As a specific example, when a user operates in the NFS client node 130 to access a shared directory in the HDFS server node 110, it is as convenient and fast as accessing a local directory, thereby improving the user experience.
此外,在一实施例中,如图2所示,数据存储系统100中的HDFS服务端节点110和NFS客户端节点130均可以有多个,这多个HDFS服务端节点110分别与中心节点120通信连接,中心节点120与每一个NFS客户端节点130分别连接,其中,HDFS服务端节点110可以部署于安全性较高的内部网络,而NFS客户端节点130则可以部署于外部网络供用户使用。值得注意的是,不同的NFS客户端节点130可以部署于不同的外部网络,本实施例对此并不作具体限定。这多个HDFS服务端节点110之间相互独立,并且,每个HDFS服务端节点110所存储的数据文件均不相同,此外,每一个HDFS服务端节点110所对外共享的目录,均可以在每一个NFS客户端节点130中挂载,因此,用户可以利用任何一个NFS客户端节点130通过中心节点120访问任意一个HDFS服务端节点110所存储的数据文件。In addition, in an embodiment, as shown in FIG. 2 , there may be multiple HDFS server nodes 110 and NFS client nodes 130 in the data storage system 100 , and the multiple HDFS server nodes 110 are respectively connected to the central node 120 . For communication connection, the central node 120 is connected to each NFS client node 130 respectively, wherein, the HDFS server node 110 can be deployed in the internal network with higher security, and the NFS client node 130 can be deployed in the external network for users to use . It should be noted that different NFS client nodes 130 may be deployed in different external networks, which is not specifically limited in this embodiment. The multiple HDFS server nodes 110 are independent of each other, and the data files stored by each HDFS server node 110 are different. In addition, the externally shared directory of each HDFS server node 110 can be stored in each HDFS server node 110. It is mounted on an NFS client node 130, therefore, a user can use any NFS client node 130 to access data files stored in any HDFS server node 110 through the central node 120.
需要说明的是,上述实施例中所描述的数据存储系统的架构,是为了更加清楚的说明本申请实施例的技术方案,并不构成对于本申请实施例提供的技术方案的限定,本领域技术人员可知,随着系统架构的演变和新应用场景的出现,本申请实施例提供的技术方案对于类似的技术问题,同样适用。It should be noted that the architecture of the data storage system described in the above embodiments is for the purpose of illustrating the technical solutions of the embodiments of the present application more clearly, and does not constitute a limitation on the technical solutions provided by the embodiments of the present application. Personnel know that, with the evolution of the system architecture and the emergence of new application scenarios, the technical solutions provided in the embodiments of the present application are also applicable to similar technical problems.
本领域技术人员可以理解的是,图1和图2中所示出的数据存储系统的架构并不构成对本申请实施例的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。It can be understood by those skilled in the art that the architecture of the data storage system shown in FIG. 1 and FIG. 2 does not constitute a limitation to the embodiments of the present application, and may include more or less components than those shown in the figure, or a combination of certain components may be included. some components, or a different arrangement of components.
基于上述实施例中数据存储系统的架构,提出对该数据存储系统的构建方法的各个实施例。Based on the architecture of the data storage system in the above embodiments, various embodiments of the construction method of the data storage system are proposed.
如图3所示,图3是本申请一个实施例提供的数据存储系统的构建方法的流程图,该构 建方法包括但不限于有以下步骤:As shown in Figure 3, Figure 3 is a flowchart of a construction method of a data storage system provided by an embodiment of the present application, and the construction method includes but is not limited to the following steps:
步骤S100,部署HDFS服务端节点,并在HDFS服务端节点配置NFS服务功能;Step S100, deploying an HDFS server node, and configuring the NFS service function on the HDFS server node;
步骤S200,部署中心节点,使得中心节点与HDFS服务端节点通信连接;Step S200, deploying a central node, so that the central node is communicatively connected to the HDFS server node;
步骤S300,在中心节点配置HDFS客户端,其中,HDFS客户端用于访问由HDFS服务端节点提供的共享信息;Step S300, configure an HDFS client on the central node, wherein the HDFS client is used to access the shared information provided by the HDFS server node;
步骤S400,在中心节点配置NFS服务端,其中,NFS服务端用于提供通过HDFS客户端访问共享信息的访问入口;Step S400, configure an NFS server at the central node, wherein the NFS server is used to provide an access portal for accessing shared information through an HDFS client;
步骤S500,部署NFS客户端节点,使得NFS客户端节点与中心节点通信连接,并将NFS客户端节点配置为通过访问入口访问共享信息。Step S500, deploying an NFS client node so that the NFS client node is connected to the central node in communication, and configuring the NFS client node to access shared information through an access portal.
需要说明的是,HDFS服务端节点中配置有HDFS组件,通过在该HDFS组件中集成并启用NFS服务功能,能够使得用户可以采用NFS的方式访问HDFS服务端节点所维护的共享信息,从而能够满足用户在使用上的简单、友好、便捷。值得注意的是,可以参考HDFS的官方配置方法在HDFS服务端节点中配置HDFS组件,本实施例对此并不作具体限定。It should be noted that the HDFS server node is configured with an HDFS component. By integrating and enabling the NFS service function in the HDFS component, users can use NFS to access the shared information maintained by the HDFS server node. It is simple, friendly and convenient for users to use. It is worth noting that the HDFS component can be configured in the HDFS server node by referring to the official HDFS configuration method, which is not specifically limited in this embodiment.
需要说明的是,通过在中心节点配置用于访问由HDFS服务端节点提供的共享信息的HDFS客户端,以及配置用于提供通过HDFS客户端访问该共享信息的访问入口的NFS服务端,使得中心节点既是HDFS服务端节点的客户端,又是NFS客户端节点的服务端,所以,HDFS服务端节点和NFS客户端节点能够分设于不同的网络中,例如HDFS服务端节点设置于安全性较高的内部网络中,而NFS客户端节点则设置于安全性并不高的公共网络中,从而能够提高HDFS服务端节点存储数据的安全性。It should be noted that, by configuring an HDFS client on the central node for accessing the shared information provided by the HDFS server node, and configuring an NFS server for providing an access entry for accessing the shared information through the HDFS client, the central The node is not only the client of the HDFS server node, but also the server of the NFS client node. Therefore, the HDFS server node and the NFS client node can be located in different networks. For example, the HDFS server node is set to a higher security. In the internal network of HDFS, and the NFS client node is set in the public network with low security, which can improve the security of the data stored by the HDFS server node.
需要说明的是,通过将NFS客户端节点配置为能够通过由NFS服务端提供的访问入口访问HDFS服务端节点所存储的共享信息,使得用户能够通过对NFS客户端节点进行操作而访问存储于HDFS服务端节点中的共享信息。例如,当用户在NFS客户端节点中进行操作而访问存储于HDFS服务端节点中的共享信息时,NFS客户端节点会获取到对应的访问请求,此时,NFS客户端节点会根据该访问请求通过中心节点向HDFS服务端节点请求对应的共享信息,当HDFS服务端节点接收到该访问请求后,HDFS服务端节点会通过中心节点向NFS客户端节点反馈对应的共享信息,当NFS客户端节点接收到该共享信息后,即可以将该共享信息向用户展示。因此,对于用户来说,用户在NFS客户端节点中进行操作以访问HDFS服务端节点中的共享目录时,就像访问本地目录一样方便快捷,从而能够提高用户的使用体验。It should be noted that by configuring the NFS client node to be able to access the shared information stored by the HDFS server node through the access portal provided by the NFS server, users can access the shared information stored in HDFS by operating the NFS client node. Shared information in server nodes. For example, when a user operates on the NFS client node and accesses the shared information stored in the HDFS server node, the NFS client node will obtain the corresponding access request, and at this time, the NFS client node will follow the access request. The central node requests the corresponding shared information from the HDFS server node. When the HDFS server node receives the access request, the HDFS server node will feed back the corresponding shared information to the NFS client node through the central node. After receiving the shared information, the shared information can be displayed to the user. Therefore, for the user, when the user operates on the NFS client node to access the shared directory in the HDFS server node, it is as convenient and fast as accessing the local directory, thereby improving the user experience.
通过采用包括有上述步骤S100、步骤S200、步骤S300、步骤S400和步骤S500的构建方法,部署面向数据存储的HDFS服务端节点、面向用户的NFS客户端节点和用于连接该HDFS服务端节点以及该NFS客户端节点的中心节点,并且在中心节点配置HDFS客户端和NFS服务端,使得中心节点既是HDFS服务端节点的客户端,又是NFS客户端节点的服务端,从而使得HDFS服务端节点和NFS客户端节点可以分设于不同的网络中,例如将HDFS服务端节点设置于安全性较高的内部网络,将NFS客户端节点设置于安全性较低的公共网络,因此,可以提高数据存储的安全性。此外,由于HDFS服务端节点和NFS客户端节点可以分设于不同的网络中从而提高数据存储的安全性,因此不需要采用磁盘阵列等硬件方式实现数据的存储,不需要额外增加硬件配置,从而可以在提高数据存储的安全性的情况下维持较高的性价比。By adopting the construction method including the above steps S100, S200, S300, S400 and S500, deploy the data storage-oriented HDFS server node, the user-oriented NFS client node and the HDFS server node for connecting to the HDFS server node and The central node of the NFS client node, and the HDFS client and NFS server are configured on the central node, so that the central node is both the client of the HDFS server node and the server of the NFS client node, so that the HDFS server node is The NFS client node and the NFS client node can be located in different networks. For example, the HDFS server node can be set on the internal network with high security, and the NFS client node can be set on the public network with low security. Therefore, data storage can be improved. security. In addition, since the HDFS server node and NFS client node can be located in different networks to improve the security of data storage, there is no need to use hardware methods such as disk arrays to store data, and no additional hardware configuration is required, so that the Maintain a high price/performance ratio while improving the security of data storage.
在一实施例中,访问入口为访问目录,该访问目录为共享信息在中心节点的映射目录。In one embodiment, the access entry is an access directory, and the access directory is a mapping directory of shared information in the central node.
需要说明的是,NFS服务端所提供的访问目录为HDFS服务端节点采用NFS方式对外共享 的目录在中心节点中的映射目录,即是说,中心节点可以以独立镜像的方式供用户使用,从而能够在云化方向上实现数据存储,使得数据存储系统的弹性部署得以实现。It should be noted that the access directory provided by the NFS server is the mapping directory of the directory shared by the HDFS server node in the NFS mode in the central node. Data storage can be implemented in the direction of cloudification, enabling flexible deployment of data storage systems.
另外,在一实施例中,步骤S500中的将NFS客户端节点配置为通过访问入口访问共享信息,可以包括但不限于有以下步骤:In addition, in an embodiment, in step S500, configuring the NFS client node to access the shared information through the access portal may include, but is not limited to, the following steps:
对NFS客户端节点进行配置,使得NFS客户端节点将访问目录挂载在本地,以通过访问目录访问共享信息。Configure the NFS client node so that the NFS client node mounts the access directory locally to access shared information through the access directory.
可以理解的是,由于NFS服务端所提供的访问目录为HDFS服务端节点中的共享信息在中心节点的映射目录,因此,通过对NFS客户端节点进行配置,使得NFS客户端节点将该访问目录挂载在本地,当用户需要访问HDFS服务端节点中的共享信息时,直接对挂载在NFS客户端节点的访问目录进行操作即可,就像访问本地目录一样方便快捷,因此能够提高用户的使用体验。It can be understood that since the access directory provided by the NFS server is the mapping directory of the shared information in the HDFS server node on the central node, the NFS client node is configured so that the NFS client node will access the directory. It is mounted locally. When the user needs to access the shared information in the HDFS server node, he can directly operate the access directory mounted on the NFS client node, which is as convenient and fast as accessing the local directory, so it can improve the user's experience. Use experience.
值得注意的是,如果NFS客户端节点当前所挂载的实例出现异常,NFS客户端节点能够自动完成新的服务端链路的建立,并自动完成实例的挂载,使得使用上的稳定性能够得到保障,而使用上的简易性也能够方便了用户的使用。由于NFS客户端节点所挂载的访问目录最终仍然是HDFS服务端节点所维护的目录,因此,数据文件的安全性能够得到保障。需要说明的是,NFS客户端节点为用户提供了采用NFS的方式访问HDFS服务端节点所维护的共享信息,NFS方式的操作友好简便,能够为数据存储系统的使用处理提供便捷的访问方式。It is worth noting that if the currently mounted instance of the NFS client node is abnormal, the NFS client node can automatically complete the establishment of a new server link and automatically complete the mounting of the instance, so that the stability of use can be improved. It is guaranteed, and the simplicity of use can also facilitate the use of users. Since the access directory mounted by the NFS client node is still the directory maintained by the HDFS server node, the security of data files can be guaranteed. It should be noted that the NFS client node provides users with access to the shared information maintained by the HDFS server node in the NFS mode. The NFS mode is friendly and simple to operate, and can provide a convenient access mode for the use and processing of the data storage system.
另外,在一实施例中,如图4所示,步骤S100可以包括但不限于有以下步骤:In addition, in an embodiment, as shown in FIG. 4 , step S100 may include, but is not limited to, the following steps:
步骤S110,在HDFS服务端节点中安装HDFS服务端;Step S110, install the HDFS server in the HDFS server node;
步骤S120,设置HDFS服务端的配置数据,使得HDFS服务端支持NFS服务功能;Step S120, setting the configuration data of the HDFS server, so that the HDFS server supports the NFS service function;
步骤S130,重启HDFS服务端节点,使得HDFS服务端节点启动NFS服务功能。Step S130: Restart the HDFS server node, so that the HDFS server node starts the NFS service function.
可以理解的是,HDFS服务端节点是一个应用HDFS的数据存储中心,因此,需要在HDFS服务端节点中安装HDFS服务端。虽然HDFS服务端能够提供对所存储的数据文件进行操作的接口,但这些接口对于系统的使用并不够友好,因此,为了在使用上能够更为简便友好,在HDFS服务端节点中完成HDFS服务端的安装后,可以修改HDFS服务端的配置数据,以增加NFS服务功能参数,使得HDFS服务端能够支持NFS服务功能。在完成对HDFS服务端的配置数据的设置操作后,当配置正常,即可通过重启HDFS服务端节点,使得HDFS服务端节点中的HDFS服务端启动NFS服务功能,从而为用户提供采用NFS的方式访问HDFS服务端节点所维护的共享信息,以满足用户在使用上的简单、友好、便捷。It can be understood that the HDFS server node is a data storage center that applies HDFS. Therefore, the HDFS server needs to be installed in the HDFS server node. Although the HDFS server can provide interfaces for operating the stored data files, these interfaces are not friendly enough for the system. After installation, you can modify the configuration data of the HDFS server to increase the NFS service function parameters, so that the HDFS server can support the NFS service function. After completing the setting operation of the configuration data of the HDFS server, when the configuration is normal, you can restart the HDFS server node to enable the HDFS server in the HDFS server node to start the NFS service function, thereby providing users with access using NFS. The shared information maintained by the HDFS server node is simple, friendly and convenient for users to use.
另外,在一实施例中,如图5所示,步骤S300中的在中心节点配置HDFS客户端,可以包括但不限于有以下步骤:In addition, in an embodiment, as shown in FIG. 5 , in step S300, configuring the HDFS client on the central node may include but not be limited to the following steps:
步骤S310,在中心节点安装HDFS客户端;Step S310, install the HDFS client on the central node;
步骤S320,将HDFS服务端的配置数据导入至HDFS客户端,使得HDFS客户端可通过HDFS服务端访问HDFS服务端节点的共享信息。Step S320, import the configuration data of the HDFS server to the HDFS client, so that the HDFS client can access the shared information of the HDFS server node through the HDFS server.
可以理解的是,通过在中心节点安装HDFS客户端,并且将HDFS服务端的配置数据导入至HDFS客户端,使得中心节点能够通过HDFS客户端访问HDFS服务端节点中的HDFS服务端,从而可以满足用户访问HDFS服务端节点所存储的共享信息的需求。另外,通过在中心节点配置HDFS客户端,使得中心节点能够获取HDFS服务端节点的共享信息所处的目录或分区,以便于中心节点能够在后续步骤中通过NFS服务端将该目录或分区提供给NFS客户端节点以进 行挂载,从而实现整个数据存储系统的构建。It is understandable that by installing the HDFS client on the central node and importing the configuration data of the HDFS server to the HDFS client, the central node can access the HDFS server in the HDFS server node through the HDFS client, so as to satisfy user needs. Requirements for accessing shared information stored by HDFS server nodes. In addition, by configuring the HDFS client on the central node, the central node can obtain the directory or partition where the shared information of the HDFS server node is located, so that the central node can provide the directory or partition to the NFS server in subsequent steps. NFS client node to mount, so as to realize the construction of the entire data storage system.
另外,在一实施例中,如图6所示,步骤S400中的在中心节点配置NFS服务端,可以包括但不限于有以下步骤:In addition, in an embodiment, as shown in FIG. 6 , the configuration of the NFS server at the central node in step S400 may include, but is not limited to, the following steps:
步骤S410,在中心节点安装NFS服务端;Step S410, install the NFS server on the central node;
步骤S420,设置NFS服务端的配置数据,使得NFS服务端提供通过HDFS客户端访问共享信息的访问入口;Step S420, setting the configuration data of the NFS server, so that the NFS server provides an access entry for accessing the shared information through the HDFS client;
步骤S430,重启中心节点,使得中心节点启用NFS服务端。Step S430, restart the central node, so that the central node enables the NFS server.
可以理解的是,当需要在中心节点配置NFS服务端时,可以先在中心节点安装NFS服务端,而后修改NFS服务端的配置数据(如增加NFS服务的相关配置参数),使得NFS服务端提供通过HDFS客户端访问共享信息的访问入口。在完成对NFS服务端的配置数据的设置操作后,当配置正常,即可通过重启中心节点,使得中心节点启用NFS服务端。It can be understood that when the NFS server needs to be configured on the central node, the NFS server can be installed on the central node first, and then the configuration data of the NFS server can be modified (such as adding relevant configuration parameters of the NFS service), so that the NFS server can provide The access portal for HDFS clients to access shared information. After completing the setting operation of the configuration data of the NFS server, when the configuration is normal, the central node can be restarted to enable the NFS server.
需要说明的是,当完成上述步骤S310、步骤S320,以及步骤S410、步骤S420、步骤S430之后,即完成了对中心节点的全部部署操作,此时,中心节点配置有HDFS客户端和NFS服务端,因此,中心节点既是HDFS服务端节点的客户端,又是NFS客户端节点的服务端,即,中心节点可以将HDFS服务端节点和NFS客户端节点分隔于不同的网络中,所以,HDFS服务端节点和NFS客户端节点可以分设于不同的网络中,例如HDFS服务端节点可以设置于安全性较高的内部网络中,而NFS客户端节点则可以设置于安全性并不高的公共网络中。由于HDFS服务端节点可以设置于安全性较高的内部网络中,因此可以提高数据存储的安全性。此外,由于HDFS服务端节点和NFS客户端节点可以分设于不同的网络中从而提高数据存储的安全性,所以,在HDFS服务端节点中,不需要采用磁盘阵列等硬件方式实现数据的存储,不需要增加额外的硬件配置,可以采用软件(如共享存储等)的方式存储数据,从而可以达到在提高数据存储的安全性的情况下维持较高的性价比的目的。It should be noted that after completing the above steps S310, S320, and steps S410, S420, and S430, all deployment operations on the central node are completed. At this time, the central node is configured with an HDFS client and an NFS server. , therefore, the central node is not only the client of the HDFS server node, but also the server of the NFS client node, that is, the central node can separate the HDFS server node and the NFS client node into different networks. Therefore, the HDFS service The end node and NFS client node can be located in different networks. For example, the HDFS server node can be set in the internal network with high security, while the NFS client node can be set in the public network with low security. . Since the HDFS server node can be set in the internal network with high security, the security of data storage can be improved. In addition, since the HDFS server node and NFS client node can be located in different networks to improve the security of data storage, in the HDFS server node, there is no need to use hardware methods such as disk arrays to store data. Additional hardware configuration needs to be added, and data can be stored in the form of software (such as shared storage, etc.), so as to achieve the purpose of maintaining high cost performance while improving the security of data storage.
为了更加清楚的说明上述各个实施例中数据存储系统的构建方法的具体步骤流程,下面以具体的示例进行说明。In order to more clearly describe the specific steps and processes of the construction method of the data storage system in the above-mentioned embodiments, a specific example is used for description below.
示例一:Example one:
如图7所示,图7为一个具体示例所提供的数据存储系统的构建方法的流程图。在图7中,HDFS服务端节点为单节点,中心节点的NFS服务端采用独立镜像供用户进行访问操作,HDFS服务端节点、中心节点和NFS客户端节点均部署在同一个安全性较高的内部网络。该数据存储系统的构建方法包括步骤S11、步骤S12和步骤S13。As shown in FIG. 7 , FIG. 7 is a flowchart of a method for constructing a data storage system provided by a specific example. In Figure 7, the HDFS server node is a single node, and the NFS server of the central node uses an independent image for users to access and operate. The HDFS server node, central node and NFS client node are all deployed in the same high security Internal network. The construction method of the data storage system includes step S11, step S12 and step S13.
步骤S11,在HDFS服务端节点中安装HDFS服务端,并修改HDFS服务端节点的配置数据,以使HDFS服务端节点支持NFS服务功能,当配置完成后,重启HDFS服务端节点以启动NFS服务功能。Step S11, install the HDFS server in the HDFS server node, and modify the configuration data of the HDFS server node, so that the HDFS server node supports the NFS service function, when the configuration is completed, restart the HDFS server node to start the NFS service function .
在步骤S11中,配置HDFS服务端节点的方法可以参考官方配置方法,如果配置正常,可以通过showmount命令查看到HDFS服务端节点所提供的共享目录的信息。In step S11, for the method of configuring the HDFS server node, you can refer to the official configuration method. If the configuration is normal, you can use the showmount command to view the information of the shared directory provided by the HDFS server node.
步骤S12,部署中心节点,在中心节点安装HDFS客户端,并在HDFS客户端中导入HDFS服务端的相关配置数据;安装NFS服务端,同时修改NFS服务端的相关配置数据,增加有关NFS服务功能的相关配置;当配置完成后,重启中心节点。Step S12, deploying the central node, installing the HDFS client on the central node, and importing the relevant configuration data of the HDFS server into the HDFS client; installing the NFS server, modifying the relevant configuration data of the NFS server at the same time, and adding the relevant NFS service functions. Configuration; when the configuration is complete, restart the central node.
在步骤S12中,当在HDFS客户端中导入HDFS服务端节点的相关配置数据后,可以通过HDFS的客户端命令查看HDFS服务端节点的目录信息。当启用NFS服务端后,如果对中心节 点的配置正常,则可以通过showmount命令查看到HDFS服务端节点所提供的共享目录的信息。In step S12, after importing the relevant configuration data of the HDFS server node into the HDFS client, the directory information of the HDFS server node can be viewed through the HDFS client command. After the NFS server is enabled, if the configuration of the central node is normal, you can view the shared directory information provided by the HDFS server node through the showmount command.
步骤S13,部署NFS客户端节点,在NFS客户端节点中安装NFS客户端,并将中心节点的NFS服务端所提供的共享目录挂载到NFS客户端。Step S13, deploy the NFS client node, install the NFS client in the NFS client node, and mount the shared directory provided by the NFS server of the central node to the NFS client.
在步骤S13中,当完成NFS客户端节点的部署后,用户通过NFS客户端节点的NFS客户端发起访问HDFS服务端节点所提供的共享目录的操作时,就像访问NFS客户端节点的本地目录一样简单、友好、便捷。In step S13, after the deployment of the NFS client node is completed, when the user initiates an operation to access the shared directory provided by the HDFS server node through the NFS client of the NFS client node, it is like accessing the local directory of the NFS client node Just as simple, friendly and convenient.
示例二:Example two:
参照如图2所示的数据存储系统的架构,HDFS服务端节点的数量为多个,NFS客户端节点的数量也为多个,中心节点与每一个HDFS服务端节点和每一个NFS客户端节点通信连接。其中,HDFS服务端节点部署于安全性较高的内部网络,而NFS客户端节点则部署于外部网络供用户使用。值得注意的是,不同的NFS客户端节点可以部署于不同的外部网络。该数据存储系统的构建方法包括以下步骤:Referring to the architecture of the data storage system shown in Figure 2, the number of HDFS server nodes is multiple, the number of NFS client nodes is also multiple, the central node and each HDFS server node and each NFS client node communication connection. Among them, the HDFS server node is deployed on the internal network with high security, and the NFS client node is deployed on the external network for users to use. It is worth noting that different NFS client nodes can be deployed on different external networks. The construction method of the data storage system includes the following steps:
第一步,在内部网络中,部署启动多个HDFS服务端节点,并在每一个HDFS服务端节点中配置支持NFS服务功能。The first step is to deploy and start multiple HDFS server nodes in the internal network, and configure each HDFS server node to support the NFS service function.
在该步骤中,对每一个HDFS服务端节点的配置均可以参考官方配置方法,当配置完成后,重启每一个HDFS服务端节点以启动NFS服务功能,如果配置正常,可以通过showmount命令查看到每一个HDFS服务端节点所提供的共享目录的信息。In this step, you can refer to the official configuration method for the configuration of each HDFS server node. After the configuration is completed, restart each HDFS server node to start the NFS service function. If the configuration is normal, you can use the showmount command to view each Information about shared directories provided by an HDFS server node.
第二步,部署中心节点,在中心节点安装HDFS客户端,并在HDFS客户端中导入每一个HDFS服务端节点的相关配置数据;安装NFS服务端,同时修改NFS服务端的相关配置数据,增加有关NFS服务功能的相关配置。The second step is to deploy the central node, install the HDFS client on the central node, and import the relevant configuration data of each HDFS server node into the HDFS client; install the NFS server, and at the same time modify the relevant configuration data of the NFS server, add relevant Related configuration of NFS service function.
在该步骤中,如果配置正常,则重启中心节点,使得中心节点启用HDFS客户端和NFS服务端。如果操作正常,可以通过HDFS的客户端命令查看每一个HDFS服务端节点的目录信息,也可以通过showmount命令查看到每一个HDFS服务端节点所提供的共享目录的信息。值得注意的是,中心节点的HDFS客户端和NFS服务端,功能相互独立,相关配置操作相互独立,因此可以制作成一个独立的镜像以供云化环境快速部署。In this step, if the configuration is normal, restart the central node, so that the central node enables the HDFS client and the NFS server. If the operation is normal, you can use the HDFS client command to view the directory information of each HDFS server node, and you can also use the showmount command to view the shared directory information provided by each HDFS server node. It is worth noting that the functions of the HDFS client and NFS server of the central node are independent of each other, and related configuration operations are independent of each other, so they can be made into an independent image for rapid deployment in a cloud environment.
第三步,在外部网络中,部署多个NFS客户端节点,并在每一个NFS客户端节点中安装NFS客户端,将中心节点的NFS服务端所提供的共享目录分别挂载到每一个NFS客户端。The third step is to deploy multiple NFS client nodes in the external network, install the NFS client in each NFS client node, and mount the shared directory provided by the NFS server of the central node to each NFS respectively. client.
在该步骤中,当完成全部NFS客户端节点的部署后,用户可以利用任何一个NFS客户端节点通过中心节点访问任意一个HDFS服务端节点所提供的共享目录。当用户可以通过NFS客户端节点的NFS客户端发起访问HDFS服务端节点所提供的共享目录的操作时,就像访问NFS客户端节点的本地目录一样简单、友好、便捷。In this step, after the deployment of all NFS client nodes is completed, the user can use any NFS client node to access the shared directory provided by any HDFS server node through the central node. When a user can initiate an operation to access the shared directory provided by the HDFS server node through the NFS client of the NFS client node, it is as simple, friendly and convenient as accessing the local directory of the NFS client node.
本申请实施例包括:数据存储系统包括HDFS服务端节点、中心节点和NFS客户端节点,其中,HDFS服务端节点被配置有NFS服务功能;中心节点与HDFS服务端节点通信连接,中心节点被配置有HDFS客户端和NFS服务端,HDFS客户端用于访问由HDFS服务端节点提供的共享信息,NFS服务端用于提供通过HDFS客户端访问共享信息的访问入口;NFS客户端节点与中心节点通信连接,NFS客户端节点被配置为通过访问入口访问共享信息。根据本申请实施例提供的方案,通过在面向数据存储的HDFS服务端节点和面向用户的NFS客户端节点之间增设中心节点,并且在中心节点配置HDFS客户端和NFS服务端,使得中心节点既是HDFS服务端节点的客户端,又是NFS客户端节点的服务端,因此,可以将HDFS服务端节点和NFS客 户端节点分设于不同的网络中,例如将HDFS服务端节点设置于安全性较高的内部网络,将NFS客户端节点设置于公共网络,从而可以提高数据存储的安全性。此外,由于HDFS服务端节点和NFS客户端节点可以分设于不同的网络中从而提高数据存储的安全性,因此不需要采用磁盘阵列等硬件方式实现数据的存储,不需要额外增加硬件配置,从而可以在提高数据存储的安全性的情况下维持较高的性价比。The embodiments of the present application include: a data storage system includes an HDFS server node, a central node, and an NFS client node, wherein the HDFS server node is configured with an NFS service function; the central node is communicatively connected to the HDFS server node, and the central node is configured with an NFS service function. There are HDFS client and NFS server. The HDFS client is used to access the shared information provided by the HDFS server node, and the NFS server is used to provide the access portal for accessing the shared information through the HDFS client; the NFS client node communicates with the central node To connect, NFS client nodes are configured to access shared information through an access portal. According to the solution provided by the embodiments of the present application, by adding a central node between the data storage-oriented HDFS server node and the user-oriented NFS client node, and configuring the HDFS client and NFS server on the central node, the central node is both The client of the HDFS server node is also the server of the NFS client node. Therefore, the HDFS server node and the NFS client node can be located in different networks. For example, the HDFS server node can be set to a higher security. The internal network of the NFS client node is set on the public network, which can improve the security of data storage. In addition, since the HDFS server node and NFS client node can be located in different networks to improve the security of data storage, there is no need to use hardware methods such as disk arrays to store data, and no additional hardware configuration is required, so that the Maintain a high price/performance ratio while improving the security of data storage.
本领域普通技术人员可以理解,上文中所公开方法中的全部或某些步骤、系统可以被实施为软件、固件、硬件及其适当的组合。某些物理组件或所有物理组件可以被实施为由处理器,如中央处理器、数字信号处理器或微处理器执行的软件,或者被实施为硬件,或者被实施为集成电路,如专用集成电路。这样的软件可以分布在计算机可读介质上,计算机可读介质可以包括计算机存储介质(或非暂时性介质)和通信介质(或暂时性介质)。如本领域普通技术人员公知的,术语计算机存储介质包括在用于存储信息(诸如计算机可读指令、数据结构、程序模块或其他数据)的任何方法或技术中实施的易失性和非易失性、可移除和不可移除介质。计算机存储介质包括但不限于RAM、ROM、EEPROM、闪存或其他存储器技术、CD-ROM、数字多功能盘(DVD)或其他光盘存储、磁盒、磁带、磁盘存储或其他磁存储装置、或者可以用于存储期望的信息并且可以被计算机访问的任何其他的介质。此外,本领域普通技术人员公知的是,通信介质通常包含计算机可读指令、数据结构、程序模块或者诸如载波或其他传输机制之类的调制数据信号中的其他数据,并且可包括任何信息递送介质。Those of ordinary skill in the art can understand that all or some of the steps and systems in the methods disclosed above can be implemented as software, firmware, hardware, and appropriate combinations thereof. Some or all physical components may be implemented as software executed by a processor, such as a central processing unit, digital signal processor or microprocessor, or as hardware, or as an integrated circuit, such as an application specific integrated circuit . Such software may be distributed on computer-readable media, which may include computer storage media (or non-transitory media) and communication media (or transitory media). As is known to those of ordinary skill in the art, the term computer storage media includes both volatile and nonvolatile implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data flexible, removable and non-removable media. Computer storage media include, but are not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disk (DVD) or other optical disk storage, magnetic cartridges, magnetic tape, magnetic disk storage or other magnetic storage devices, or may Any other medium used to store desired information and which can be accessed by a computer. In addition, communication media typically embodies computer readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave or other transport mechanism, and can include any information delivery media, as is well known to those of ordinary skill in the art .
以上是对本申请的若干实施例进行了具体说明,但本申请并不局限于上述实施方式,熟悉本领域的技术人员在不违背本申请精神的前提下还可作出种种的等同变形或替换,这些等同的变形或替换均包含在本申请权利要求所限定的范围内。The above is a specific description of several embodiments of the present application, but the present application is not limited to the above-mentioned embodiments, and those skilled in the art can also make various equivalent modifications or replacements on the premise of not violating the spirit of the present application. Equivalent modifications or substitutions are included within the scope defined by the claims of the present application.

Claims (10)

  1. 一种数据存储系统,包括:A data storage system comprising:
    分布式文件系统HDFS服务端节点,被配置有网络文件系统NFS服务功能;The distributed file system HDFS server node is configured with the network file system NFS service function;
    中心节点,与所述HDFS服务端节点通信连接,所述中心节点被配置有HDFS客户端和NFS服务端,所述HDFS客户端用于访问由所述HDFS服务端节点提供的共享信息,所述NFS服务端用于提供通过所述HDFS客户端访问所述共享信息的访问入口;The central node is in communication connection with the HDFS server node, the central node is configured with an HDFS client and an NFS server, the HDFS client is used to access the shared information provided by the HDFS server node, the The NFS server is used to provide an access entry for accessing the shared information through the HDFS client;
    NFS客户端节点,与所述中心节点通信连接,所述NFS客户端节点被配置为通过所述访问入口访问所述共享信息。An NFS client node is connected in communication with the central node, and the NFS client node is configured to access the shared information through the access portal.
  2. 根据权利要求1所述的数据存储系统,其中,所述NFS服务端,用于提供通过所述HDFS客户端访问所述共享信息的访问目录,其中,所述访问目录为所述共享信息在所述中心节点的映射目录。The data storage system according to claim 1, wherein the NFS server is configured to provide an access directory for accessing the shared information through the HDFS client, wherein the access directory is where the shared information is stored. The mapping directory of the central node described above.
  3. 根据权利要求2所述的数据存储系统,其中,所述NFS客户端节点,被配置为将所述访问目录挂载在本地,以通过所述访问目录访问所述共享信息。The data storage system according to claim 2, wherein the NFS client node is configured to mount the access directory locally, so as to access the shared information through the access directory.
  4. 根据权利要求1至3任意一项所述的数据存储系统,其中,所述NFS客户端节点还被配置为:当获取访问所述共享信息的访问请求,根据所述访问请求通过所述中心节点向所述HDFS服务端节点请求所述共享信息,并在通过所述中心节点获取到来自所述HDFS服务端节点的所述共享信息后,展示所述共享信息。The data storage system according to any one of claims 1 to 3, wherein the NFS client node is further configured to: when acquiring an access request for accessing the shared information, pass the central node through the central node according to the access request Request the shared information from the HDFS server node, and display the shared information after obtaining the shared information from the HDFS server node through the central node.
  5. 一种数据存储系统的构建方法,包括:A construction method of a data storage system, comprising:
    部署HDFS服务端节点,并在所述HDFS服务端节点配置NFS服务功能;Deploy the HDFS server node, and configure the NFS service function on the HDFS server node;
    部署中心节点,使得所述中心节点与所述HDFS服务端节点通信连接;Deploying a central node so that the central node communicates with the HDFS server node;
    在所述中心节点配置HDFS客户端,其中,所述HDFS客户端用于访问由所述HDFS服务端节点提供的共享信息;Configure an HDFS client on the central node, wherein the HDFS client is used to access the shared information provided by the HDFS server node;
    在所述中心节点配置NFS服务端,其中,所述NFS服务端用于提供通过所述HDFS客户端访问所述共享信息的访问入口;An NFS server is configured on the central node, wherein the NFS server is used to provide an access entry for accessing the shared information through the HDFS client;
    部署NFS客户端节点,使得所述NFS客户端节点与所述中心节点通信连接,并将所述NFS客户端节点配置为通过所述访问入口访问所述共享信息。An NFS client node is deployed, so that the NFS client node is communicatively connected to the central node, and the NFS client node is configured to access the shared information through the access portal.
  6. 根据权利要求5所述的构建方法,其中,所述访问入口为访问目录,所述访问目录为所述共享信息在所述中心节点的映射目录。The construction method according to claim 5, wherein the access entry is an access directory, and the access directory is a mapping directory of the shared information in the central node.
  7. 根据权利要求6所述的构建方法,其中,所述将所述NFS客户端节点配置为通过所述访问入口访问所述共享信息,包括:The construction method according to claim 6, wherein the configuring the NFS client node to access the shared information through the access portal comprises:
    对所述NFS客户端节点进行配置,使得所述NFS客户端节点将所述访问目录挂载在本地,以通过所述访问目录访问所述共享信息。The NFS client node is configured so that the NFS client node mounts the access directory locally, so as to access the shared information through the access directory.
  8. 根据权利要求5所述的构建方法,其中,所述部署HDFS服务端节点,并在所述HDFS服务端节点配置NFS服务功能,包括:The construction method according to claim 5, wherein the deploying an HDFS server node, and configuring the NFS service function on the HDFS server node, comprises:
    在HDFS服务端节点中安装HDFS服务端;Install the HDFS server in the HDFS server node;
    设置所述HDFS服务端的配置数据,使得所述HDFS服务端支持NFS服务功能;Setting the configuration data of the HDFS server so that the HDFS server supports the NFS service function;
    重启所述HDFS服务端节点,使得所述HDFS服务端节点启动所述NFS服务功能。Restart the HDFS server node, so that the HDFS server node starts the NFS service function.
  9. 根据权利要求8所述的构建方法,其中,所述在所述中心节点配置HDFS客户端,包括:The construction method according to claim 8, wherein the configuring an HDFS client on the central node comprises:
    在所述中心节点安装HDFS客户端;Install the HDFS client on the central node;
    将所述HDFS服务端的配置数据导入至所述HDFS客户端,使得所述HDFS客户端可通过所述HDFS服务端访问所述HDFS服务端节点的共享信息。Import the configuration data of the HDFS server to the HDFS client, so that the HDFS client can access the shared information of the HDFS server node through the HDFS server.
  10. 根据权利要求9所述的构建方法,其中,所述在所述中心节点配置NFS服务端,包括:The construction method according to claim 9, wherein the configuring the NFS server on the central node comprises:
    在所述中心节点安装NFS服务端;Install an NFS server on the central node;
    设置所述NFS服务端的配置数据,使得所述NFS服务端提供通过所述HDFS客户端访问所述共享信息的访问入口;Setting the configuration data of the NFS server so that the NFS server provides an access entry for accessing the shared information through the HDFS client;
    重启所述中心节点,使得所述中心节点启用所述NFS服务端。Restart the central node, so that the central node enables the NFS server.
PCT/CN2021/128409 2020-12-04 2021-11-03 Data storage system and construction method therefor WO2022116766A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011404986.0A CN114610690A (en) 2020-12-04 2020-12-04 Data storage system and construction method thereof
CN202011404986.0 2020-12-04

Publications (1)

Publication Number Publication Date
WO2022116766A1 true WO2022116766A1 (en) 2022-06-09

Family

ID=81853821

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/128409 WO2022116766A1 (en) 2020-12-04 2021-11-03 Data storage system and construction method therefor

Country Status (2)

Country Link
CN (1) CN114610690A (en)
WO (1) WO2022116766A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103561101A (en) * 2013-11-06 2014-02-05 中国联合网络通信集团有限公司 Network file system
CN107066205A (en) * 2016-12-30 2017-08-18 曙光信息产业(北京)有限公司 A kind of data-storage system
US10298709B1 (en) * 2014-12-31 2019-05-21 EMC IP Holding Company LLC Performance of Hadoop distributed file system operations in a non-native operating system
US10498804B1 (en) * 2016-06-29 2019-12-03 EMC IP Holding Company LLC Load balancing Hadoop distributed file system operations in a non-native operating system
CN111400271A (en) * 2020-03-18 2020-07-10 北京东方金信科技有限公司 Method for integrating NFS in HDFS plug-in
CN111708738A (en) * 2020-05-29 2020-09-25 深圳市瑞驰信息技术有限公司 Method and system for realizing data inter-access between hdfs of hadoop file system and s3 of object storage

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103561101A (en) * 2013-11-06 2014-02-05 中国联合网络通信集团有限公司 Network file system
US10298709B1 (en) * 2014-12-31 2019-05-21 EMC IP Holding Company LLC Performance of Hadoop distributed file system operations in a non-native operating system
US10498804B1 (en) * 2016-06-29 2019-12-03 EMC IP Holding Company LLC Load balancing Hadoop distributed file system operations in a non-native operating system
CN107066205A (en) * 2016-12-30 2017-08-18 曙光信息产业(北京)有限公司 A kind of data-storage system
CN111400271A (en) * 2020-03-18 2020-07-10 北京东方金信科技有限公司 Method for integrating NFS in HDFS plug-in
CN111708738A (en) * 2020-05-29 2020-09-25 深圳市瑞驰信息技术有限公司 Method and system for realizing data inter-access between hdfs of hadoop file system and s3 of object storage

Also Published As

Publication number Publication date
CN114610690A (en) 2022-06-10

Similar Documents

Publication Publication Date Title
CN110990047B (en) Fusion method and device for multiple microservice architectures
US8970873B2 (en) System and method for managing printer resources on an internal network
CN107210924B (en) Method and apparatus for configuring a communication system
US10069941B2 (en) Scalable event-based notifications
GB2502408A (en) File synchronization using timestamp data
CN109417492B (en) Network function NF management method and NF management equipment
CN107172214B (en) Service node discovery method and device with load balancing function
CN109525590B (en) Data packet transmission method and device
US11637914B2 (en) Multiple geography service routing
WO2022007908A1 (en) Method for service collaboration between network element devices, and network element device
US20200045139A1 (en) Remote procedure call using quorum state store
CN112181049B (en) Cluster time synchronization method, device, system, equipment and readable storage medium
WO2021047227A1 (en) Cross-region service sharing method, apparatus and management device, and storage medium
WO2022116766A1 (en) Data storage system and construction method therefor
CN115604120A (en) Multi-cloud cluster resource sharing method, device, equipment and storage medium
WO2022042393A1 (en) Configuration error information transmission method and device
WO2021109784A1 (en) Application management method, system and server in hybrid cloud environment
CN112351114B (en) Information processing method and device and storage medium
US10637924B2 (en) Cloud metadata discovery API
US20070162577A1 (en) System for providing managed computing service
CN112910796A (en) Traffic management method, apparatus, device, storage medium, and program product
CN116668372B (en) Flow control method and related device
US11663058B1 (en) Preemptive filtering of events of an event bus with a deterministic filter
Kenny et al. Kubernetes for HPC Administration.
CN117435608A (en) Multi-cluster data query method, device, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21899800

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 301023)