CN105516313A - Distributed storage system for big data
- Publication number
- CN105516313A (application CN201510910421.2A)
- Authority
- CN
- China
- Prior art keywords
- server
- data
- storage
- server cluster
- application server
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
- H04L67/1097: Protocols in which an application is distributed across nodes in the network for distributed storage of data, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
- H04L67/1044: Peer-to-peer [P2P] networks; group management mechanisms
Abstract
The invention discloses a distributed storage system for big data, comprising an application server cluster composed of several application servers, a storage server cluster composed of several storage servers, and a metadata server cluster composed of several metadata servers. The application servers, storage servers, and metadata servers exchange data over an interconnecting communication network. The application server cluster generates and receives production data, the storage server cluster provides storage space, and the metadata server cluster manages the storage space. The distributed storage system for big data provided by the invention has the advantages of easy expansion, low cost, high security, and good compatibility.
Description
Technical field
The present invention relates to the field of computer application technology, and in particular to a distributed storage system for big data.
Background art
With the development of technologies such as cloud computing, the Internet of Things, and the mobile Internet, the volume of data produced in every field of human society is increasing sharply. This big data has become an important resource, and a data-centered, storage-centered development trend has taken shape; how to store such big data has therefore become extremely important.
Conventional storage uses a centralized architecture for data storage: all data is stored centrally on dedicated storage devices, the security of the data is guaranteed by disk-array RAID technology, and all production data is transferred over a network or a dedicated storage network. This centralized architecture, however, suffers from the shortcomings of difficult expansion, high cost, low security, and poor compatibility.
Summary of the invention
In view of the above technical problems, the object of the present invention is to provide a distributed storage system for big data that can effectively solve these problems.
To achieve this object, the present invention adopts the following technical solution:
A distributed storage system for big data comprises an application server cluster composed of several application servers, a storage server cluster composed of several storage servers, and a metadata server cluster composed of several metadata servers. The application servers, storage servers, and metadata servers exchange data over an interconnecting communication network. The application server cluster generates and receives production data, the storage server cluster provides storage space, and the metadata server cluster manages the storage space.
In particular, the storage servers and metadata servers are all x86-architecture servers and support both file and block protocols.
In particular, the data stored on each application server, storage server, and metadata server is protected by a replica mechanism, and each replica is stored on a node different from the node holding the original data.
In particular, the application servers, storage servers, and metadata servers are interconnected by a full-line-rate InfiniBand network.
The distributed storage system for big data proposed by the present invention adopts a distributed architecture: the storage servers support both file and block protocols, all storage servers store data concurrently, the data stored on the storage servers is protected by a replica mechanism, each replica is stored on a storage server different from the one holding the original data, and all data is sliced at the front end and stored in distributed fashion across the back end. The storage system therefore has the advantages of easy expansion, low cost, high security, and good compatibility.
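The replica rule above (every copy must live on a node other than the one holding the original data) can be sketched in a few lines. The following Python fragment is an illustration only; the node names, the replica count, and the random choice of targets are assumptions, not the patent's actual placement algorithm:

```python
import random

def place_replicas(nodes, primary, num_replicas=2):
    """Choose nodes to hold the replicas of a data block.

    Each replica must be placed on a node different from the one
    holding the original (primary) copy, as the system requires.
    """
    candidates = [n for n in nodes if n != primary]
    if len(candidates) < num_replicas:
        raise ValueError("not enough nodes to satisfy replica placement")
    return random.sample(candidates, num_replicas)

nodes = ["storage-1", "storage-2", "storage-3", "storage-4"]
replicas = place_replicas(nodes, primary="storage-1")
assert "storage-1" not in replicas   # never co-located with the original
assert len(set(replicas)) == 2       # two distinct replica nodes
```

Because the primary is excluded from the candidate set, losing the node that holds the original data can never also destroy all of its replicas.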
Brief description of the drawings
Fig. 1 is a structural block diagram of the distributed storage system for big data provided by an embodiment of the present invention.
Detailed description of the embodiments
The present invention is further described below in conjunction with the drawings and embodiments. It should be understood that the specific embodiments described herein are intended only to explain the present invention, not to limit it. It should also be noted that, for ease of description, the drawings show only the parts relevant to the present invention rather than the entire system. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the art to which the present invention belongs. The terms used in the description of the invention are only for describing specific embodiments and are not intended to limit the present invention. The term "and/or" as used herein includes any and all combinations of one or more of the associated listed items.
Embodiment 1
Please refer to Fig. 1, a structural block diagram of the distributed storage system for big data provided by this embodiment of the present invention.
In this embodiment, the distributed storage system for big data comprises an application server cluster composed of several application servers, a storage server cluster composed of several storage servers, and a metadata server cluster composed of several metadata servers. The application server cluster generates and receives production data, the storage server cluster provides storage space, and the metadata server cluster manages the storage space. The storage servers are x86-architecture rack servers equipped with SATA, SAS, or SSD drives; the metadata servers are x86-architecture rack servers equipped with SSDs; the application servers may be any x86-architecture rack server equipment. The application servers, storage servers, and metadata servers are interconnected by InfiniBand and exchange data over it.
The data storage procedure of the storage system comprises a write process and a read process, specifically:
Write process:
1. Production data is generated by the application server cluster.
2. The application server submits a write request to the metadata server; the request includes the size of the data file to be written.
3. The metadata server queries the address space of the whole storage system, finds allocatable address blocks, and returns the information of the blocks to be allocated to the application server.
4. According to the returned address block information, the application server slices the file and transfers the slices to the respective storage nodes. On a successful write, each node returns a write-success signal to the metadata server, and the metadata server marks the address block space as "in use".
5. The write of the data is complete.
Read process:
1. The application server issues a read request to the metadata server; the request includes the path of the file to be read.
2. The metadata server queries the address block information of the relevant storage servers according to the file path and returns that information to the application server.
3. According to the address block information of each storage server, the application server sends block-read requests to each storage node.
4. Each storage node returns the corresponding data block to the application server according to the block information.
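The write and read steps above can be sketched end to end as a small in-memory model. The following Python code is illustrative only: the 64-byte block size, the per-node block pool, and all data structures are assumptions made for the sketch, not details taken from the patent:

```python
import math

class MetadataServer:
    """In-memory model of the metadata server's bookkeeping."""
    BLOCK_SIZE = 64  # bytes per address block (illustrative)

    def __init__(self, storage_nodes):
        # The metadata server knows the whole address space (write step 3).
        # Blocks are interleaved so allocations spread across nodes.
        self.free_blocks = [(n, i) for i in range(16) for n in storage_nodes]
        self.files = {}  # file path -> list of (node, block_id)

    def request_write(self, size):
        """Write steps 2-3: return allocatable blocks for `size` bytes."""
        needed = math.ceil(size / self.BLOCK_SIZE)
        return self.free_blocks[:needed]

    def commit_write(self, path, blocks):
        """Write step 4: nodes reported success; mark blocks 'in use'."""
        for b in blocks:
            self.free_blocks.remove(b)
        self.files[path] = blocks

    def request_read(self, path):
        """Read steps 1-2: look up the block addresses for a file path."""
        return self.files[path]

class StorageNode:
    def __init__(self):
        self.blocks = {}
    def write_block(self, block_id, data):
        self.blocks[block_id] = data
        return True  # write-success signal (write step 4)
    def read_block(self, block_id):
        return self.blocks[block_id]

def write_file(meta, nodes, path, data):
    """Write steps 1-5: slice the file over the allocated blocks."""
    size = MetadataServer.BLOCK_SIZE
    blocks = meta.request_write(len(data))
    for (node, block_id), off in zip(blocks, range(0, len(data), size)):
        assert nodes[node].write_block(block_id, data[off:off + size])
    meta.commit_write(path, blocks)

def read_file(meta, nodes, path):
    """Read steps 1-4: fetch and reassemble the data blocks."""
    return b"".join(nodes[n].read_block(b) for n, b in meta.request_read(path))

nodes = {"s1": StorageNode(), "s2": StorageNode(), "s3": StorageNode()}
meta = MetadataServer(list(nodes))
write_file(meta, nodes, "/prod/run1.dat", b"x" * 200)
assert read_file(meta, nodes, "/prod/run1.dat") == b"x" * 200
```

Note how the application server never scans the storage cluster itself: all placement knowledge lives in the metadata server, and the data path (application server to storage node) stays separate from the metadata path, as the out-of-band design described below requires.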
The distributed storage system for big data described in this embodiment adopts a large-scale open cluster storage architecture in out-of-band mode. Over the InfiniBand high-speed network, the CPUs, memory, network bandwidth, disk speed, and bus performance of commodity storage servers are combined into one high-end storage system, so that the overall storage performance exceeds the computing capability and network transmission speed, thereby masking the storage performance bottleneck caused by the slow improvement of disk and bus performance. The front-end computing nodes and the back-end storage and metadata nodes are interconnected by a full-line-rate InfiniBand network to maximize the performance output of the storage devices. The out-of-band mode effectively eliminates mutual interference between the metadata path and the data path. Moreover, because both the metadata servers and the storage servers are clustered, the system has no single performance bottleneck of the kind found in NAS or SAN systems, and can fully meet the highly concurrent demands of high-performance computing applications.
Meanwhile, on the basis of unrestricted online expansion and fast load-balancing techniques, the distributed storage system for big data described in this embodiment adopts highly scalable storage technology, so that the scale of the storage system can be expanded online, efficiently and without service interruption, as the computing applications develop, while performance grows linearly with capacity.
Meanwhile, the distributed storage system for big data described in this embodiment builds on metadata server clustering in distributed storage. Taking into account the file-creation, access, and directory-structure characteristics of high-performance computing applications, it adopts multi-metadata-server clustering: within one storage system, the metadata server cluster can be dynamically configured according to the actual number of files produced in the user's production system, and all metadata servers in the whole storage system participate in the retrieval and positioning of files, eliminating the metadata-channel bottleneck. Furthermore, as application characteristics change and demands grow, metadata servers can be extended online with a plug-and-play effect.
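One plausible way to let every metadata server participate in file retrieval is to map each file path deterministically to one server in the cluster by hashing the path. The patent does not specify its distribution function; the hash-modulo scheme below is an assumption made purely for illustration:

```python
import hashlib

def metadata_server_for(path, metadata_servers):
    """Spread file-lookup load across the metadata server cluster by
    hashing the file path (an assumed scheme, not the patent's own)."""
    digest = hashlib.md5(path.encode("utf-8")).digest()
    index = int.from_bytes(digest[:4], "big") % len(metadata_servers)
    return metadata_servers[index]

cluster = ["mds-1", "mds-2", "mds-3"]
owner = metadata_server_for("/prod/2015/run1.dat", cluster)
assert owner in cluster
# The mapping is stable, so lookups for a path always reach the same server:
assert owner == metadata_server_for("/prod/2015/run1.dat", cluster)
```

A production system would more likely use consistent hashing so that adding a metadata server online ("plug and play") remaps only a small fraction of paths; plain modulo is used here only to keep the sketch short.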
Meanwhile, the distributed storage system for big data described in this embodiment adopts multi-level intelligent data caching to achieve efficient caching. First, a self-developed client kernel module on the application server analyzes the data-access characteristics of the application and uses a specific cache algorithm to effectively cache the application's data on the application server. Second, the memory of all storage servers is aggregated into one large cache pool, and a cache algorithm caches the data accessed by different applications uniquely across the storage servers. Finally, the memory of all metadata servers is aggregated into an efficient metadata cache pool, and a sophisticated metadata life-cycle algorithm caches the metadata. With such multi-level data caching, the performance of the whole storage system is greatly increased.
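The tiered lookup described above (client cache first, then the pooled storage-server memory, and only then the disks) can be sketched with a simple least-recently-used cache per tier. The patent says only "a specific cache algorithm"; LRU is an assumed stand-in chosen for the sketch:

```python
from collections import OrderedDict

class LRUCache:
    """One tier of the multi-level cache: fixed-capacity LRU."""
    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()
    def get(self, key):
        if key not in self.entries:
            return None
        self.entries.move_to_end(key)  # mark as recently used
        return self.entries[key]
    def put(self, key, value):
        self.entries[key] = value
        self.entries.move_to_end(key)
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the coldest entry

def cached_read(path, client_cache, storage_pool, backend_read):
    """Check the client tier, then the pooled storage-server memory,
    and only then hit the disks; hot data is promoted on the way back."""
    for tier in (client_cache, storage_pool):
        data = tier.get(path)
        if data is not None:
            return data
    data = backend_read(path)
    storage_pool.put(path, data)
    client_cache.put(path, data)
    return data

reads = []
def backend_read(path):
    reads.append(path)              # count actual disk accesses
    return b"data:" + path.encode()

client = LRUCache(capacity=2)
pool = LRUCache(capacity=8)
cached_read("/a", client, pool, backend_read)
cached_read("/a", client, pool, backend_read)  # served from the client tier
assert reads == ["/a"]                          # disks were hit only once
```

The same `LRUCache` shape could model the metadata cache pool, with metadata entries instead of file data as values.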
Meanwhile, the distributed storage system for big data described in this embodiment provides enhanced rights management. In addition to being compatible with the original operating-system-user-based rights management, it adds its own enhanced rights management: through the back-end management platform of the storage system, an administrator can directly specify which users or which application servers may access a given directory, and which directories users are forbidden to delete or modify.
Meanwhile, in the distributed storage system for big data described in this embodiment, both the storage servers and the metadata servers adopt clustering technology. Furthermore, the storage system software layer adopts multi-replica technology, dynamic file slicing, file-level RAID, and per-directory configurable redundancy policies to ensure the high availability of the whole system. Together, these efficient system-wide data high-availability techniques thoroughly eliminate single points of failure in the storage system. In addition, the storage system has a built-in automatic fault-detection mechanism that dynamically detects different faults and automatically launches a data-recovery flow to guarantee data integrity. Data recovery is carried out entirely on the back-end storage side and has no impact on front-end applications, ensuring data security while keeping the user's applications running continuously and stably: the failure of a single storage server or metadata server has no impact on application data reads and writes, and as the number of storage servers grows, the data-recovery speed increases in proportion, i.e. the larger the serving cluster, the higher the data security.
Meanwhile, the distributed storage system for big data described in this embodiment adopts object storage technology and can perform online real-time access flow control at disk granularity. When real-time monitoring finds that some devices or disks are under high load, the storage system can, on the one hand, forbid writes to those devices and thereby reduce their load; on the other hand, it can start the system load-adjustment function, which moves hot data according to its own intelligent load mechanism so that the load is balanced across all storage nodes.
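The two flow-control reactions just described, refusing writes to overloaded devices and migrating hot data toward under-loaded ones, can be sketched as two small policy functions. The 0.85 high-water threshold and the choice of the least-loaded disk as migration target are illustrative assumptions, not values from the patent:

```python
def admit_writes(disk_load, high_water=0.85):
    """Flow control at disk granularity: devices whose monitored load
    is at or above the high-water mark stop accepting new writes."""
    return {disk: load < high_water for disk, load in disk_load.items()}

def plan_hot_data_moves(disk_load, high_water=0.85):
    """Load adjustment: pair every overloaded disk with the currently
    least-loaded disk as the migration target for its hot data."""
    target = min(disk_load, key=disk_load.get)
    return [(disk, target) for disk, load in disk_load.items()
            if load >= high_water and disk != target]

load = {"disk-1": 0.95, "disk-2": 0.40, "disk-3": 0.70}
assert admit_writes(load) == {"disk-1": False, "disk-2": True, "disk-3": True}
assert plan_hot_data_moves(load) == [("disk-1", "disk-2")]
```

In a real system these decisions would be recomputed continuously from the monitoring feed, so a disk that drops back below the threshold automatically resumes accepting writes.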
Meanwhile, the distributed storage system for big data described in this embodiment adopts file slicing: files are distributed evenly across the whole storage server cluster, ensuring that applications can effectively use the performance of the entire cluster regardless of whether the access pattern targets a single file or many files. In addition, the application server side communicates directly with the storage servers over the high-speed network, efficiently using the bandwidth the network provides and eliminating the network bottleneck in the access path. With this technique, when thousands of application nodes of the whole computing platform access data simultaneously, every device in the storage system can deliver its full hardware capability and provide the maximum aggregate bandwidth to front-end applications.
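The even distribution of a file over the whole cluster can be sketched as round-robin striping: cut the file into fixed-size stripes and deal them across the storage servers in turn, so even a single file is served by every server. The 4-byte stripe size is for illustration only; a real system would use stripes of kilobytes or megabytes:

```python
def stripe_file(data, servers, stripe_size=4):
    """Cut `data` into fixed-size stripes and assign them round-robin
    across the storage server cluster (stripe size is illustrative)."""
    stripes = [data[i:i + stripe_size] for i in range(0, len(data), stripe_size)]
    return [(servers[i % len(servers)], stripe)
            for i, stripe in enumerate(stripes)]

def reassemble(placement):
    """Reading concatenates the stripes back in their original order."""
    return b"".join(stripe for _server, stripe in placement)

placement = stripe_file(b"ABCDEFGHIJKLM", ["s1", "s2", "s3"])
assert [s for s, _ in placement] == ["s1", "s2", "s3", "s1"]  # round-robin
assert reassemble(placement) == b"ABCDEFGHIJKLM"
```

Because consecutive stripes land on different servers, a sequential read of one large file naturally fans out into parallel requests against the whole cluster, which is what lets the aggregate bandwidth scale with the number of storage servers.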
The specific steps for setting up the distributed storage system for big data described in this embodiment are:
1. Install the metadata servers:
(1) Prepare the metadata server operating system.
(2) Install the metadata server software.
(3) Start the metadata service.
2. Install the storage servers and associate them with the metadata servers, thereby achieving unified management of every storage server node and of space allocation:
(1) Prepare the storage server operating system.
(2) Install the storage server software.
(3) Start the storage data service.
3. Install and configure the application server client, implementing the client's front-end file-slicing function:
(1) Install the application server client.
(2) Configure the application server client.
According to the technical scheme of the present invention, the distributed storage system for big data adopts a distributed architecture: the storage servers support both file and block protocols, all storage servers store data concurrently, the data stored on the storage servers is protected by a replica mechanism, each replica is stored on a storage server different from the one holding the original data, and all data is sliced at the front end and stored in distributed fashion across the back end. The storage system therefore has the advantages of easy expansion, low cost, high security, and good compatibility.
Note that the above are only preferred embodiments of the present invention and the technical principles applied. Those skilled in the art will appreciate that the invention is not limited to the specific embodiments described here; various obvious changes, readjustments, and substitutions can be made without departing from the scope of protection of the present invention. Therefore, although the present invention has been described in further detail through the above embodiments, it is not limited to them and may, without departing from the concept of the present invention, include other equivalent embodiments; the scope of the present invention is determined by the appended claims.
Claims (4)
1. A distributed storage system for big data, characterized in that it comprises: an application server cluster composed of several application servers, a storage server cluster composed of several storage servers, and a metadata server cluster composed of several metadata servers, wherein the application servers, storage servers, and metadata servers exchange data over an interconnecting communication network; the application server cluster generates and receives production data, the storage server cluster provides storage space, and the metadata server cluster manages the storage space.
2. The distributed storage system for big data according to claim 1, characterized in that the storage servers and metadata servers are all x86-architecture servers and support both file and block protocols.
3. The distributed storage system for big data according to claim 1, characterized in that the data stored on each application server, storage server, and metadata server is protected by a replica mechanism, and each replica is stored on a server different from the one holding the original data.
4. The distributed storage system for big data according to claim 1, characterized in that the application servers, storage servers, and metadata servers are interconnected by a full-line-rate InfiniBand network.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title
---|---|---|---
CN201510910421.2A | 2015-12-10 | 2015-12-10 | Distributed storage system used for big data
Publications (1)
Publication Number | Publication Date
---|---
CN105516313A | 2016-04-20
Legal Events
- C06 / PB01: Publication
- C10 / SE01: Entry into force of request for substantive examination
- RJ01: Rejection of invention patent application after publication (application publication date: 2016-04-20)