CN103440244A - Large-data storage and optimization method - Google Patents

Large-data storage and optimization method

Info

Publication number
CN103440244A
CN103440244A
Authority
CN
China
Prior art keywords
data
datanode
namenode
optimization
file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201310293482XA
Other languages
Chinese (zh)
Inventor
An Hongwei (安宏伟)
Ji Tongkai (季统凯)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Computing Technology of CAS
Original Assignee
Institute of Computing Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Computing Technology of CAS filed Critical Institute of Computing Technology of CAS
Priority to CN201310293482XA priority Critical patent/CN103440244A/en
Publication of CN103440244A publication Critical patent/CN103440244A/en
Pending legal-status Critical Current

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of data processing, and in particular to a big data storage optimization method oriented toward sea-cloud collaboration. The method comprises the steps of data preprocessing, computation optimization and mass data optimization. The data preprocessing step comprises data collection, multi-source data organization and aggregation, data redundancy processing, and compressed data storage; the computation optimization step comprises HDFS (Hadoop Distributed File System) file transfer optimization and Map/Reduce parallel computation optimization; and the mass data optimization step comprises data disaster-recovery backup, data encryption, CCIndex indexing and CCT backup. The big data storage optimization method disclosed by the invention can be applied to big data storage on a cloud platform.

Description

A big data storage optimization method
Technical field
The present invention relates to the technical field of data processing, and in particular to a big data storage optimization method oriented toward sea-cloud collaboration.
Background art
With the rapid development of information technology, traditional persistent storage architectures have found it increasingly difficult to keep pace with the growth of information services. The Hadoop distributed system uses distributed algorithms to spread data access and storage across a large number of servers, and can further distribute each server's access load across many reliable backup copies within the cluster; it is a disruptive departure from conventional storage architectures. Opportunity, however, coexists with challenge: the open-source distributed architecture still appears particularly heavyweight for distributed applications, and its response performance is insufficient for big data storage and for frequent file write and read operations.
Summary of the invention
The technical problem solved by the present invention is to provide a big data storage optimization method oriented toward sea-cloud collaboration that effectively optimizes big data storage.
The technical scheme by which the present invention solves the above technical problem is as follows:
The method comprises data preprocessing, computation optimization and mass data optimization. Data preprocessing comprises data collection, multi-source data organization and aggregation, data redundancy processing, and compressed data storage; computation optimization comprises HDFS file transfer optimization and Map/Reduce parallel computation optimization; and mass data optimization comprises data disaster-recovery backup, data encryption, CCIndex indexing and CCT backup. After the data submitted by the client is gathered by data collection, it is standardized through multi-source data organization and aggregation and data redundancy processing, and is then compressed and stored using RCFile: the data is split horizontally, a block-then-shard mechanism is introduced (blocks are formed first and then sharded), and row-oriented storage is used within a block and column-oriented storage within a shard. In computation optimization, CCIndex is adopted to convert random traversal of the data into traversal by row index, and CCT is adopted to perform record-level column replication for incremental data backup. In mass data optimization, the parallel computation component completes the configuration-level optimization of the HDFS file system and the Map/Reduce computation model and integrates seamlessly with the G-cloud cloud platform, so that the infrastructure and basic services provided by the G-cloud cloud platform can be used flexibly.
The storage optimization system is configured as follows (see the configuration sketch after this list):
Step 1: optimize the Linux file system mount parameters by adding the noatime option;
Step 2: optimize the NameNode parameter configuration; for massive data file processing, dfs.block.size is set to 64M*N (N = 1, 2, 3, 4), and dfs.namenode.handler.count is raised from its default value to 64;
Step 3: optimize the DataNode; dfs.datanode.handler.count, the number of service threads opened for remote calls on a DataNode, is set to 8;
Step 4: optimize the job tracker (job.tracker) configuration; mapred.job.tracker.handler.count, the number of service threads opened on the job tracker to handle RPCs from the task trackers, is set to 64; mapred.map.tasks, the number of map tasks per job, is set to a value close to the number of hosts in the cluster; mapred.reduce.tasks, the number of reduce tasks per job, is likewise set to a value close to the number of hosts in the cluster;
Step 5: optimize the task tracker (task.tracker) configuration;
mapred.tasktracker.map.tasks.maximum, the maximum number of map tasks that can run concurrently on a task tracker, is set to the number of server CPU cores, or to that number minus 1;
mapred.tasktracker.reduce.tasks.maximum, which controls the number of reduce tasks that can run concurrently on a task tracker, is set to 2; tasktracker.http.threads, the number of threads of the HTTP server running on each TaskTracker and used to serve map task output, can be set to 40-50;
Step 6: optimize the map-side configuration; io.sort.mb can be set to 200 MB; the io.sort.factor property (int type), which sets the maximum number of streams merged at once when sorting files on both the Map side and the Reduce side, is set to 100; the io.file.buffer.size property, which sets the size in bytes of the buffer used for I/O operations in a MapReduce job, is adjusted to 64 KB or 128 KB; the tasktracker.http.threads property (int type), the number of worker threads on each tasktracker in the cluster used to serve map output to the reducers, is increased to between 40 and 50;
Step 7: optimize the reduce-side configuration; mapred.reduce.parallel.copies, which increases the parallelism of the reduce-side copy phase, is adjusted to 20; the mapred.child.java.opts property is adjusted to 2 MB;
the mapred.job.shuffle.input.buffer.percent property is raised appropriately so that Map output does not spill to disk; the mapred.job.shuffle.merge.percent property is raised appropriately to reduce the number of disk spills; the mapred.inmem.merge.threshold property can be set to 0 when the Reduce function requires little memory, so that spilling is controlled solely by the mapred.job.shuffle.merge.percent property; the mapred.job.reduce.input.buffer.percent property is set to 1.0.
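The following is a minimal, non-authoritative sketch of how the parameter values listed above could be applied through the standard Hadoop Configuration API; the patent itself only names the properties and target values, and in practice they are normally placed in hdfs-site.xml and mapred-site.xml. The block-size multiplier N = 2 is an illustrative assumption.

```java
import org.apache.hadoop.conf.Configuration;

// Sketch only: applies the classic (1.x-era) property names used in this
// document programmatically instead of via the XML configuration files.
public class StorageTuningSketch {
    public static Configuration tunedConf() {
        Configuration conf = new Configuration();
        // NameNode: 64 MB * N block size (here N = 2) and more handler threads
        conf.setLong("dfs.block.size", 2L * 64 * 1024 * 1024);
        conf.setInt("dfs.namenode.handler.count", 64);
        // DataNode: more service threads for remote calls
        conf.setInt("dfs.datanode.handler.count", 8);
        // Job tracker / task tracker settings from steps 4 and 5
        conf.setInt("mapred.job.tracker.handler.count", 64);
        conf.setInt("mapred.tasktracker.reduce.tasks.maximum", 2);
        conf.setInt("tasktracker.http.threads", 48);          // within 40-50
        // Map- and reduce-side settings from steps 6 and 7
        conf.setInt("io.sort.mb", 200);
        conf.setInt("io.sort.factor", 100);
        conf.setInt("io.file.buffer.size", 128 * 1024);       // 128 KB
        conf.setInt("mapred.reduce.parallel.copies", 20);
        conf.setFloat("mapred.job.reduce.input.buffer.percent", 1.0f);
        return conf;
    }
}
```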
The HDFS distributed file storage workflow is as follows (a client-side API sketch follows this list):
Step 1: the client passes identity authentication, establishes a TCP/IP connection, connects to the NameNode through a configurable port and initiates an RPC remote request;
Step 2: the NameNode checks whether the file to be created already exists and whether the creator has permission to operate; on success it creates a record for the file, otherwise it throws an exception back to the client;
Step 3: the client writes the file; the file is cut into multiple packets, which are managed internally as a data queue, while new blocks are requested from the NameNode to obtain a list of suitable DataNodes for storing the replicas; the size of the list is determined by the replication setting in the NameNode;
Step 4: the packets are written to all replicas in pipeline fashion; each packet is streamed to the first DataNode, which stores it and then forwards it to the next DataNode in the pipeline, and so on until the last DataNode;
Step 5: if a DataNode fails during transmission, the current pipeline is closed, the failed DataNode is removed from the pipeline, the remaining blocks continue to be transmitted in pipeline fashion through the remaining DataNodes, and the NameNode allocates a new DataNode to keep the configured number of replicas; the write operation then completes;
Step 6: the NameNode maps the stored block addresses to the communication addresses of the corresponding DataNode blocks and returns some or all of the block list of the file;
Step 7: the NameNode selects the nearest DataNode, the block list is read, and reading of the file begins.
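As an illustration only, the write path described above is driven from the client through the standard Hadoop FileSystem API; the packet splitting, DataNode pipeline and replica recovery of steps 3 to 5 are handled inside the client library. The NameNode URI and file path below are hypothetical.

```java
import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataOutputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// Sketch of an HDFS client write: create() triggers the NameNode checks and
// block allocation, and the returned stream feeds the DataNode pipeline.
public class HdfsWriteSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(URI.create("hdfs://namenode:9000"), conf);
        Path file = new Path("/data/example.txt");           // hypothetical path
        try (FSDataOutputStream out = fs.create(file, true)) {
            out.writeBytes("payload split into packets and pipelined to DataNodes\n");
        }
        fs.close();
    }
}
```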
The detailed data processing procedure is as follows (a digest-based deduplication sketch follows this list):
Step 1: analyze the availability of the multi-source mass information from multiple perspectives such as the information source, the information body and the user request;
Step 2: after multi-source data is organized and aggregated, multiple identical copies may be produced; when a newly added file is aggregated for storage, the system detects the event, computes the digest value of the new file, and requests the new file from the system; the system checks whether the digest value already exists; if it does not, a message is returned allowing the client to aggregate and store the data and the file is newly created; if the digest value exists, the system newly creates the file together with its permission and attribute information, but the file data directly references the existing content and does not need to be aggregated and stored again;
Step 3: use RCFile to compress the data; the relational data is split horizontally and stored column by column within each shard, turning the record-oriented storage structure of the distributed data processing system into a column-oriented one;
Step 4: the storage of unstructured file data is handled by the data cluster; block partitioning and block replication mechanisms are introduced for storage, and data indexing and tree-node optimization are added;
Step 5: adopt transmission channel encryption and data storage encryption, combining symmetric encryption with asymmetric encryption;
Step 6: use a disk array to back up production data in real time; CCIndex is introduced into mass data processing optimization to convert random traversal of the data into efficient traversal by row index, and CCT is introduced to perform record-level column replication for incremental data backup;
Step 7: access the G-cloud cloud platform synchronously and use its computing, virtualization and management resources for mass data processing, deduplication filtering and mining analysis, while introducing operations such as mass data search indexing and tree-node optimization.
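A minimal sketch, not taken from the patent, of the digest-based duplicate check in step 2: the digest of a newly aggregated file is computed and looked up in an in-memory index standing in for the system's digest store; class and method names, and the choice of SHA-256, are illustrative assumptions.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.MessageDigest;
import java.util.HashMap;
import java.util.Map;

// Sketch: store a file only if its content digest has not been seen before,
// otherwise reference the already-stored content.
public class DigestDedupSketch {
    private final Map<String, Path> digestIndex = new HashMap<>();

    public boolean storeIfNew(Path file) throws Exception {
        MessageDigest md = MessageDigest.getInstance("SHA-256");
        try (InputStream in = Files.newInputStream(file)) {
            byte[] buf = new byte[8192];
            for (int n; (n = in.read(buf)) != -1; ) {
                md.update(buf, 0, n);
            }
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : md.digest()) {
            hex.append(String.format("%02x", b));
        }
        String digest = hex.toString();
        if (digestIndex.containsKey(digest)) {
            return false;                 // duplicate: reference existing data
        }
        digestIndex.put(digest, file);    // new content: aggregate and store it
        return true;
    }
}
```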
The detailed HDFS distributed file read procedure is as follows (a client-side API sketch follows this list):
Step 1: the client connects to the NameNode through a configurable port; the connection is established over TCP/IP;
Step 2: the client interacts with the NameNode through the ClientProtocol;
Step 3: the DataNodes interact with the NameNode through the DatanodeProtocol and establish connections with the NameNode;
Step 4: each DataNode maintains its communication connection with the NameNode by periodically sending heartbeats and block reports to the NameNode;
Step 5: the block information includes the block attributes, the file the block belongs to, the block address ID, the modification time, and so on;
Step 6: the NameNode responds to RPC requests from the client and the DataNodes and receives heartbeat signals and block status reports from all DataNodes;
Step 7: the block status report is returned to the client; the report contains the complete block list of a given DataNode;
Step 8: based on the address information returned in the block report, the client chooses a DataNode and reads the data;
Step 9: the DataNode connection is closed and the read is finished.
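For illustration, the read path above corresponds to standard client-side use of the Hadoop FileSystem API: open() obtains the block locations from the NameNode, and the returned stream pulls the blocks from suitable DataNodes. The NameNode URI and path are hypothetical.

```java
import java.io.BufferedReader;
import java.io.InputStreamReader;
import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FSDataInputStream;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

// Sketch of an HDFS client read; block location lookup, DataNode selection
// and connection teardown are handled by the client library.
public class HdfsReadSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        FileSystem fs = FileSystem.get(URI.create("hdfs://namenode:9000"), conf);
        try (FSDataInputStream in = fs.open(new Path("/data/example.txt"));
             BufferedReader reader = new BufferedReader(new InputStreamReader(in))) {
            for (String line; (line = reader.readLine()) != null; ) {
                System.out.println(line);   // data streamed block by block from DataNodes
            }
        }
        fs.close();
    }
}
```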
The present invention realizes HDFS file transfer optimization, Map/Reduce parallel computation optimization and mass data query optimization, and achieves the following performance indicators: a stable and efficient big data storage optimization method with optimized mass data query processing and good scalability, supporting a storage capacity of no less than the 100 PB level and expandable to the EB level; good reliability and security, with multi-copy redundancy protection for critical data and a number of copies of no less than 3; an off-site data disaster recovery and backup system; elastic resource allocation based on the G-cloud platform; good response speed; and support for mass data analysis and mining services.
Description of the drawings
The present invention is further described below in conjunction with the accompanying drawings.
Fig. 1 is a schematic diagram of the system architecture of the present invention;
Fig. 2 is a schematic diagram of the unstructured data storage system;
Fig. 3 is a schematic diagram of the sea-cloud collaborative platform HDFS distributed file system;
Fig. 4 is a schematic diagram of the network topology of the present invention.
Embodiment
The present invention proposes a big data storage optimization method based on the G-cloud cloud platform. The JobClient submits data to the data acquisition system; the mass data submitted by the JobClient is standardized using data preprocessing technology; the data compression technology adopts the efficient storage structure RCFile, splitting the data horizontally and introducing a block-then-shard mechanism (blocks are formed first and then sharded), with row-oriented storage within a block and column-oriented storage within a shard. CCIndex is introduced into mass data processing optimization to convert random traversal of the data into efficient traversal by row index, and CCT is introduced to perform record-level column replication for incremental data backup. The parallel computation component completes the configuration-level optimization of the HDFS file system and the Map/Reduce computation model, provides a fault-tolerant, high-throughput mass data storage scheme, significantly improves file processing and computing performance, integrates seamlessly with the G-cloud cloud platform, flexibly uses the infrastructure and basic services provided by the G-cloud cloud platform, and supports virtualization of large-scale computing, storage and network resources as well as data analysis management.
As shown in Fig. 1, the detailed procedure for implementing the storage optimization method of the present invention is (a shuffle tuning sketch follows this list):
Step 1: optimize the Linux file system mount parameters by adding the noatime option; Linux provides the noatime option to disable recording of the last access timestamp when the file system is mounted, which significantly improves disk I/O efficiency; after the setting is modified the file system only needs to be remounted, and the change takes effect without a restart;
Step 2: optimize the NameNode parameter configuration; for massive data file processing, dfs.block.size is set to 64M*N (N = 1, 2, 3, 4); dfs.namenode.handler.count defaults to 10 and is set to 64 for a massive data file cluster;
Step 3: optimize the DataNode; dfs.datanode.handler.count, the number of service threads opened for remote calls on a DataNode, defaults to 3 and is set to 8 in the present invention;
Step 4: optimize the job tracker (job.tracker) configuration; mapred.job.tracker.handler.count, the number of service threads opened on the job tracker to handle RPCs from the task trackers, is generally set to about 4% of the number of task tracker nodes, and is set to 64 in the present invention. mapred.map.tasks, the number of map tasks per job, is usually set to a value close to the number of hosts in the cluster. mapred.reduce.tasks, the number of reduce tasks per job, is likewise usually set to a value close to the number of hosts in the cluster;
Step 5: optimize the task tracker (task.tracker) configuration;
mapred.tasktracker.map.tasks.maximum, the maximum number of map tasks that can run concurrently on a task tracker, gives the highest operational efficiency when set to the number of server CPU cores or to that number minus 1. mapred.tasktracker.reduce.tasks.maximum controls the number of reduce tasks that can run concurrently on a task tracker and is set to 2 in the present invention. tasktracker.http.threads is the number of threads of the HTTP server running on each TaskTracker and used to serve map task output; for a large cluster it can be set to 40-50;
Step 6: optimize the map-side configuration; io.sort.mb can be set to 200 MB for a large cluster. The io.sort.factor property (int type), which sets the maximum number of streams merged at once when sorting files on both the Map side and the Reduce side, has a default value of 10 and is increased to 100. The io.file.buffer.size property sets the size in bytes of the buffer used for I/O operations in a MapReduce job; its default is 4 KB and it is adjusted to 64 KB or 128 KB. The tasktracker.http.threads property (int type), the number of worker threads on each tasktracker in the cluster used to serve map output to the reducers, defaults to 40 and can be increased to between 40 and 50; increasing the thread count improves cluster performance;
Step 7: optimize the reduce-side configuration; mapred.reduce.parallel.copies increases the parallelism of the reduce-side copy phase; its default value is 5 and it is adjusted to 20 in the present invention. The mapred.child.java.opts property is adjusted to 2 MB to improve MapReduce job performance. The mapred.job.shuffle.input.buffer.percent property defaults to 0.70 and is raised appropriately so that Map output does not spill to disk;
the mapred.job.shuffle.merge.percent property is raised appropriately to reduce the number of disk spills. The mapred.inmem.merge.threshold property defaults to 1000; when the Reduce function requires little memory, it can be set to 0 so that there is no threshold limit and spilling is controlled solely by the mapred.job.shuffle.merge.percent property. The mapred.job.reduce.input.buffer.percent property is set to 1.0;
Step 8: CCIndex is introduced into mass data processing optimization to convert random traversal of the data into efficient traversal by row index, and CCT is introduced to perform record-level column replication for incremental data backup.
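A minimal sketch, under the assumption of the classic (1.x-style) MapReduce property names used in the text, of how the shuffle-related values from steps 6 and 7 might be applied to a JobConf; the patent specifies only the property names and target values, so the exact buffer percentage chosen below is illustrative.

```java
import org.apache.hadoop.mapred.JobConf;

// Sketch: map- and reduce-side shuffle tuning from steps 6 and 7 applied to
// a classic JobConf instead of mapred-site.xml.
public class ShuffleTuningSketch {
    public static JobConf tunedJob() {
        JobConf job = new JobConf();
        job.setInt("io.sort.mb", 200);                       // sort buffer, MB
        job.setInt("io.sort.factor", 100);                   // streams merged per pass
        job.setInt("io.file.buffer.size", 64 * 1024);        // 64 KB I/O buffer
        job.setInt("tasktracker.http.threads", 50);          // map-output server threads
        job.setInt("mapred.reduce.parallel.copies", 20);     // parallel copy threads
        job.setFloat("mapred.job.shuffle.input.buffer.percent", 0.80f); // raised from 0.70
        job.setInt("mapred.inmem.merge.threshold", 0);       // spill driven by merge.percent only
        job.setFloat("mapred.job.reduce.input.buffer.percent", 1.0f);
        return job;
    }
}
```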
As shown in Fig. 2, the detailed unstructured data storage procedure is:
Step 1: mass multi-source data often contains unclean and non-standard formats that pose potential risks to the use and statistical analysis of the application system; the data must be converted through data preprocessing standards into standardized data of the system platform;
Step 2: analyze the availability of the multi-source mass information from multiple perspectives such as the information source, the information body and the user request, and establish an availability evaluation and inference model suited to the development and application of the information;
Step 3: after multi-source data is organized and aggregated, multiple identical copies of the same file may be produced. When a newly added file is aggregated for storage, the system detects the event, computes the digest value of the new file, and requests the new file from the system. The system checks whether the digest value already exists; if it does not, a message is returned allowing the client to aggregate and store the data and the file is newly created. If the digest value exists, the system newly creates the file together with its permission and attribute information, but the file data directly references the existing content and does not need to be aggregated and stored again;
Step 4: the present invention adopts an efficient data storage structure, RCFile (Record Columnar File), to compress the data. The RCFile storage structure is based on the Hadoop system; it combines the advantages of row storage and column storage and follows the design concept of "first horizontal division, then vertical division";
Step 5: the storage of unstructured file data is handled by the data cluster; block partitioning and block replication mechanisms are introduced for storage, and data indexing and tree-node optimization are added to speed up data retrieval;
Step 6: to increase data security, adopt transmission channel encryption and data storage encryption, combining a symmetric encryption algorithm with an asymmetric encryption algorithm (see the sketch after this list);
Step 7: use a disk array to back up production data in real time.
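The combination of symmetric and asymmetric encryption in step 6 can be illustrated, purely as an assumption about one possible realization, by encrypting the data with an AES key and protecting that key with an RSA key pair:

```java
import javax.crypto.Cipher;
import javax.crypto.KeyGenerator;
import javax.crypto.SecretKey;
import javax.crypto.spec.SecretKeySpec;
import java.security.KeyPair;
import java.security.KeyPairGenerator;

// Sketch of hybrid encryption: bulk data is encrypted symmetrically, and the
// symmetric key is wrapped with an asymmetric key pair.
public class HybridEncryptionSketch {
    public static void main(String[] args) throws Exception {
        // Symmetric key for the bulk data
        KeyGenerator keyGen = KeyGenerator.getInstance("AES");
        keyGen.init(128);
        SecretKey dataKey = keyGen.generateKey();

        Cipher aes = Cipher.getInstance("AES");
        aes.init(Cipher.ENCRYPT_MODE, dataKey);
        byte[] cipherData = aes.doFinal("stored block contents".getBytes("UTF-8"));

        // Asymmetric key pair protects the symmetric key
        KeyPairGenerator rsaGen = KeyPairGenerator.getInstance("RSA");
        rsaGen.initialize(2048);
        KeyPair rsaPair = rsaGen.generateKeyPair();

        Cipher rsa = Cipher.getInstance("RSA");
        rsa.init(Cipher.ENCRYPT_MODE, rsaPair.getPublic());
        byte[] wrappedKey = rsa.doFinal(dataKey.getEncoded());

        // Recipient unwraps the AES key with the RSA private key, then decrypts the data
        rsa.init(Cipher.DECRYPT_MODE, rsaPair.getPrivate());
        SecretKey recovered = new SecretKeySpec(rsa.doFinal(wrappedKey), "AES");
        aes.init(Cipher.DECRYPT_MODE, recovered);
        System.out.println(new String(aes.doFinal(cipherData), "UTF-8"));
    }
}
```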
As shown in Fig. 3, the detailed procedure of sea-cloud collaborative distributed file storage is:
Step 1: the client connects to the NameNode through a configurable port; the connection is established over TCP/IP;
Step 2: the client interacts with the NameNode through the ClientProtocol;
Step 3: the DataNodes interact with the NameNode through the DatanodeProtocol and establish connections with the NameNode;
Step 4: each DataNode maintains its communication connection with the NameNode by periodically sending heartbeats and block reports to the NameNode;
Step 5: the block information includes the block attributes, the file the block belongs to, the block address ID, the modification time, and so on;
Step 6: the NameNode responds to RPC requests from the client and the DataNodes and receives heartbeat signals and block status reports from all DataNodes;
Step 7: the block status report is returned to the client; the report contains the complete block list of a given DataNode;
Step 8: based on the address information returned in the block report, the client chooses a DataNode and reads the data;
Step 9: the DataNode connection is closed and the read is finished.
As shown in Fig. 4, the present invention is composed of three parts: mass data storage management, the distributed data platform and the G-cloud cloud operating system. The client passes identity authentication, establishes a TCP/IP connection, connects to the NameNode through a configurable port to initiate an RPC request, interacts with the distributed file processing domain to store data, accesses the cloud platform at the bottom layer, and flexibly uses the cloud infrastructure and basic services for data mining and analysis, thereby completing the sea-cloud collaborative computing service.

Claims (7)

1. A big data storage optimization method, characterized in that: the method comprises data preprocessing, computation optimization and mass data optimization; data preprocessing comprises data collection, multi-source data organization and aggregation, data redundancy processing, and compressed data storage; computation optimization comprises HDFS file transfer optimization and Map/Reduce parallel computation optimization; and mass data optimization comprises data disaster-recovery backup, data encryption, CCIndex indexing and CCT backup; after the data submitted by the client is gathered by data collection, it is standardized through multi-source data organization and aggregation and data redundancy processing, and is then compressed and stored using RCFile: the data is split horizontally, a block-then-shard mechanism is introduced (blocks are formed first and then sharded), and row-oriented storage is used within a block and column-oriented storage within a shard; then, in computation optimization, CCIndex is adopted to convert random traversal of the data into traversal by row index, and CCT is adopted to perform record-level column replication for incremental data backup; in mass data optimization, the parallel computation component completes the configuration-level optimization of the HDFS file system and the Map/Reduce computation model and integrates seamlessly with the G-cloud cloud platform, so that the infrastructure and basic services provided by the G-cloud cloud platform can be used flexibly.
2. The big data storage optimization method according to claim 1, characterized in that the storage optimization system is configured as follows:
Step 1: optimize the Linux file system mount parameters by adding the noatime option;
Step 2: optimize the NameNode parameter configuration; for massive data file processing, dfs.block.size is set to 64M*N (N = 1, 2, 3, 4), and dfs.namenode.handler.count is raised from its default value to 64;
Step 3: optimize the DataNode; dfs.datanode.handler.count, the number of service threads opened for remote calls on a DataNode, is set to 8;
Step 4: optimize the job tracker (job.tracker) configuration; mapred.job.tracker.handler.count, the number of service threads opened on the job tracker to handle RPCs from the task trackers, is set to 64; mapred.map.tasks, the number of map tasks per job, is set to a value close to the number of hosts in the cluster; mapred.reduce.tasks, the number of reduce tasks per job, is likewise set to a value close to the number of hosts in the cluster;
Step 5: optimize the task tracker (task.tracker) configuration;
mapred.tasktracker.map.tasks.maximum, the maximum number of map tasks that can run concurrently on a task tracker, is set to the number of server CPU cores, or to that number minus 1;
mapred.tasktracker.reduce.tasks.maximum, which controls the number of reduce tasks that can run concurrently on a task tracker, is set to 2; tasktracker.http.threads, the number of threads of the HTTP server running on each TaskTracker and used to serve map task output, can be set to 40-50;
Step 6: optimize the map-side configuration; io.sort.mb can be set to 200 MB; the io.sort.factor property (int type), which sets the maximum number of streams merged at once when sorting files on both the Map side and the Reduce side, is set to 100; the io.file.buffer.size property, which sets the size in bytes of the buffer used for I/O operations in a MapReduce job, is adjusted to 64 KB or 128 KB; the tasktracker.http.threads property (int type), the number of worker threads on each tasktracker in the cluster used to serve map output to the reducers, is increased to between 40 and 50;
Step 7: optimize the reduce-side configuration; mapred.reduce.parallel.copies, which increases the parallelism of the reduce-side copy phase, is adjusted to 20; the mapred.child.java.opts property is adjusted to 2 MB;
the mapred.job.shuffle.input.buffer.percent property is raised appropriately so that Map output does not spill to disk; the mapred.job.shuffle.merge.percent property is raised appropriately to reduce the number of disk spills; the mapred.inmem.merge.threshold property can be set to 0 when the Reduce function requires little memory, so that spilling is controlled solely by the mapred.job.shuffle.merge.percent property; the mapred.job.reduce.input.buffer.percent property is set to 1.0.
3. The big data storage optimization method according to claim 1, characterized in that:
the HDFS distributed file storage workflow is as follows:
Step 1: the client passes identity authentication, establishes a TCP/IP connection, connects to the NameNode through a configurable port and initiates an RPC remote request;
Step 2: the NameNode checks whether the file to be created already exists and whether the creator has permission to operate; on success it creates a record for the file, otherwise it throws an exception back to the client;
Step 3: the client writes the file; the file is cut into multiple packets, which are managed internally as a data queue, while new blocks are requested from the NameNode to obtain a list of suitable DataNodes for storing the replicas; the size of the list is determined by the replication setting in the NameNode;
Step 4: the packets are written to all replicas in pipeline fashion; each packet is streamed to the first DataNode, which stores it and then forwards it to the next DataNode in the pipeline, and so on until the last DataNode;
Step 5: if a DataNode fails during transmission, the current pipeline is closed, the failed DataNode is removed from the pipeline, the remaining blocks continue to be transmitted in pipeline fashion through the remaining DataNodes, and the NameNode allocates a new DataNode to keep the configured number of replicas; the write operation then completes;
Step 6: the NameNode maps the stored block addresses to the communication addresses of the corresponding DataNode blocks and returns some or all of the block list of the file;
Step 7: the NameNode selects the nearest DataNode, the block list is read, and reading of the file begins.
4. The big data storage optimization method according to claim 2, characterized in that:
the HDFS distributed file storage workflow is as follows:
Step 1: the client passes identity authentication, establishes a TCP/IP connection, connects to the NameNode through a configurable port and initiates an RPC remote request;
Step 2: the NameNode checks whether the file to be created already exists and whether the creator has permission to operate; on success it creates a record for the file, otherwise it throws an exception back to the client;
Step 3: the client writes the file; the file is cut into multiple packets, which are managed internally as a data queue, while new blocks are requested from the NameNode to obtain a list of suitable DataNodes for storing the replicas; the size of the list is determined by the replication setting in the NameNode;
Step 4: the packets are written to all replicas in pipeline fashion; each packet is streamed to the first DataNode, which stores it and then forwards it to the next DataNode in the pipeline, and so on until the last DataNode;
Step 5: if a DataNode fails during transmission, the current pipeline is closed, the failed DataNode is removed from the pipeline, the remaining blocks continue to be transmitted in pipeline fashion through the remaining DataNodes, and the NameNode allocates a new DataNode to keep the configured number of replicas; the write operation then completes;
Step 6: the NameNode maps the stored block addresses to the communication addresses of the corresponding DataNode blocks and returns some or all of the block list of the file;
Step 7: the NameNode selects the nearest DataNode, the block list is read, and reading of the file begins.
5. The big data storage optimization method according to any one of claims 1 to 4, characterized in that:
the detailed data processing procedure is:
Step 1: analyze the availability of the multi-source mass information from multiple perspectives such as the information source, the information body and the user request;
Step 2: after multi-source data is organized and aggregated, multiple identical copies may be produced; when a newly added file is aggregated for storage, the system detects the event, computes the digest value of the new file, and requests the new file from the system; the system checks whether the digest value already exists; if it does not, a message is returned allowing the client to aggregate and store the data and the file is newly created; if the digest value exists, the system newly creates the file together with its permission and attribute information, but the file data directly references the existing content and does not need to be aggregated and stored again;
Step 3: use RCFile to compress the data; the relational data is split horizontally and stored column by column within each shard, turning the record-oriented storage structure of the distributed data processing system into a column-oriented one;
Step 4: the storage of unstructured file data is handled by the data cluster; block partitioning and block replication mechanisms are introduced for storage, and data indexing and tree-node optimization are added;
Step 5: adopt transmission channel encryption and data storage encryption, combining symmetric encryption with asymmetric encryption;
Step 6: use a disk array to back up production data in real time; CCIndex is introduced into mass data processing optimization to convert random traversal of the data into efficient traversal by row index, and CCT is introduced to perform record-level column replication for incremental data backup;
Step 7: access the G-cloud cloud platform synchronously and use its computing, virtualization and management resources for mass data processing, deduplication filtering and mining analysis, while introducing operations such as mass data search indexing and tree-node optimization.
6. The big data storage optimization method according to any one of claims 1 to 4, characterized in that:
the detailed HDFS distributed file read procedure is:
Step 1: the client connects to the NameNode through a configurable port; the connection is established over TCP/IP;
Step 2: the client interacts with the NameNode through the ClientProtocol;
Step 3: the DataNodes interact with the NameNode through the DatanodeProtocol and establish connections with the NameNode;
Step 4: each DataNode maintains its communication connection with the NameNode by periodically sending heartbeats and block reports to the NameNode;
Step 5: the block information includes the block attributes, the file the block belongs to, the block address ID, the modification time, and so on;
Step 6: the NameNode responds to RPC requests from the client and the DataNodes and receives heartbeat signals and block status reports from all DataNodes;
Step 7: the block status report is returned to the client; the report contains the complete block list of a given DataNode;
Step 8: based on the address information returned in the block report, the client chooses a DataNode and reads the data;
Step 9: the DataNode connection is closed and the read is finished.
7. The big data storage optimization method according to claim 5, characterized in that:
the detailed HDFS distributed file read procedure is:
Step 1: the client connects to the NameNode through a configurable port; the connection is established over TCP/IP;
Step 2: the client interacts with the NameNode through the ClientProtocol;
Step 3: the DataNodes interact with the NameNode through the DatanodeProtocol and establish connections with the NameNode;
Step 4: each DataNode maintains its communication connection with the NameNode by periodically sending heartbeats and block reports to the NameNode;
Step 5: the block information includes the block attributes, the file the block belongs to, the block address ID, the modification time, and so on;
Step 6: the NameNode responds to RPC requests from the client and the DataNodes and receives heartbeat signals and block status reports from all DataNodes;
Step 7: the block status report is returned to the client; the report contains the complete block list of a given DataNode;
Step 8: based on the address information returned in the block report, the client chooses a DataNode and reads the data;
Step 9: the DataNode connection is closed and the read is finished.
CN201310293482XA 2013-07-12 2013-07-12 Large-data storage and optimization method Pending CN103440244A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310293482XA CN103440244A (en) 2013-07-12 2013-07-12 Large-data storage and optimization method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310293482XA CN103440244A (en) 2013-07-12 2013-07-12 Large-data storage and optimization method

Publications (1)

Publication Number Publication Date
CN103440244A true CN103440244A (en) 2013-12-11

Family

ID=49693935

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310293482XA Pending CN103440244A (en) 2013-07-12 2013-07-12 Large-data storage and optimization method

Country Status (1)

Country Link
CN (1) CN103440244A (en)

Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104077355A (en) * 2014-05-29 2014-10-01 中国银行股份有限公司 Methods, devices and system for storing and inquiring unstructured data
WO2014180411A1 (en) * 2013-12-17 2014-11-13 中兴通讯股份有限公司 Distributed index generation method and device
CN104317738A (en) * 2014-10-24 2015-01-28 中国科学技术大学 Incremental computation method on basis of MapReduce
CN104598562A (en) * 2015-01-08 2015-05-06 浪潮软件股份有限公司 XML file processing method and device based on MapReduce parallel computing model
CN104933042A (en) * 2013-09-29 2015-09-23 国家电网公司 Large-data-volume based database table acquisition optimizing technique
CN105138615A (en) * 2015-08-10 2015-12-09 北京思特奇信息技术股份有限公司 Method and system for building big data distributed log
CN105426493A (en) * 2015-11-24 2016-03-23 北京中电普华信息技术有限公司 Data processing system and method applied to distributed storage system
CN105608160A (en) * 2015-12-21 2016-05-25 浪潮软件股份有限公司 Distributed big data analysis method
CN105677710A (en) * 2015-12-28 2016-06-15 曙光信息产业(北京)有限公司 Processing method and system of big data
CN105787597A (en) * 2016-01-20 2016-07-20 北京优弈数据科技有限公司 Data optimizing processing system
CN105893435A (en) * 2015-12-11 2016-08-24 乐视网信息技术(北京)股份有限公司 Data loading and storing equipment, method and system
CN106021268A (en) * 2015-03-26 2016-10-12 国际商业机器公司 File system block-level tiering and co-allocation
CN106250784A (en) * 2016-07-20 2016-12-21 乐视控股(北京)有限公司 Full disk encryption method and device
CN106339473A (en) * 2016-08-29 2017-01-18 北京百度网讯科技有限公司 Method and device for copying file
CN106570425A (en) * 2015-10-10 2017-04-19 北京奇虎科技有限公司 Hard disk data encryption method and system
CN106845276A (en) * 2017-02-13 2017-06-13 湖南财政经济学院 A kind of big data based on network security implements system
CN107329982A (en) * 2017-06-01 2017-11-07 华南理工大学 A kind of big data parallel calculating method stored based on distributed column and system
CN107342914A (en) * 2017-06-07 2017-11-10 同济大学 A kind of high availability for cloud platform verifies system
CN107480283A (en) * 2017-08-23 2017-12-15 九次方大数据信息集团有限公司 Realize the method, apparatus and storage system of big data quick storage
CN107506394A (en) * 2017-07-31 2017-12-22 武汉工程大学 Optimization method for eliminating big data standard relation connection redundancy
CN108681487A (en) * 2018-05-21 2018-10-19 千寻位置网络有限公司 The distributed system and tuning method of sensing algorithm arameter optimization
WO2019006640A1 (en) * 2017-07-04 2019-01-10 深圳齐心集团股份有限公司 Big data management system
CN109753306A (en) * 2018-12-28 2019-05-14 北京东方国信科技股份有限公司 A kind of big data processing method of because precompiled function caching engine
CN109981674A (en) * 2019-04-04 2019-07-05 北京信而泰科技股份有限公司 A kind of remote procedure calling (PRC) method, device, equipment and medium
CN110268397A (en) * 2016-12-30 2019-09-20 日彩电子科技(深圳)有限公司 Effectively optimizing data layout method applied to data warehouse
CN111539029A (en) * 2020-04-25 2020-08-14 章稳建 Industrial internet-based big data storage rate optimization method and cloud computing center
CN111930731A (en) * 2020-07-28 2020-11-13 苏州亿歌网络科技有限公司 Data dump method, device, equipment and storage medium
CN112084158A (en) * 2020-09-25 2020-12-15 北京百家科技集团有限公司 Data set file compression method and device
CN112134914A (en) * 2020-02-10 2020-12-25 北京天德科技有限公司 Distributed secure storage strategy based on cryptography
CN112597348A (en) * 2020-12-15 2021-04-02 电子科技大学中山学院 Method and device for optimizing big data storage
TWI760403B (en) * 2017-03-23 2022-04-11 韓商愛思開海力士有限公司 Data storage device and operating method thereof

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102222092A (en) * 2011-06-03 2011-10-19 复旦大学 Massive high-dimension data clustering method for MapReduce platform
US20120182891A1 (en) * 2011-01-19 2012-07-19 Youngseok Lee Packet analysis system and method using hadoop based parallel computation

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120182891A1 (en) * 2011-01-19 2012-07-19 Youngseok Lee Packet analysis system and method using hadoop based parallel computation
CN102222092A (en) * 2011-06-03 2011-10-19 复旦大学 Massive high-dimension data clustering method for MapReduce platform

Non-Patent Citations (9)

* Cited by examiner, † Cited by third party
Title
Liu Wenjuan, "Design and Implementation of a Hadoop-Based File Synchronization Storage System", China Master's Theses Full-text Database, Information Science and Technology *
Liu Fei, "Research and Application of a Cloud-Computing-Based Distributed Storage System", China Master's Theses Full-text Database, Information Science and Technology *
Zhang Shu, "Research on Computer-Based Intelligent Monitoring and Control Methods for Machining Centers", China Master's Theses Full-text Database, Engineering Science and Technology I *
Lin Guoqing, "Research on Key Technologies in Network Information Security Systems", China Doctoral Dissertations Full-text Database, Information Science and Technology *
Zha Li, "Big Data Computing Technology Based on Hadoop", E-Science Technology & Application *
Luo Junzhou, "Cloud Computing: Architecture and Key Technologies", Journal on Communications *
Xin Daxin, "Research on Hadoop Cluster Performance Optimization Techniques", Computer Knowledge and Technology *
Gao Jichao, "Research and Optimization of Storage Strategies on the Hadoop Platform", China Master's Theses Full-text Database, Information Science and Technology *
Gong Gaosheng, "Research and Improvement of a General-Purpose Distributed File System", China Master's Theses Full-text Database, Information Science and Technology *

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104933042A (en) * 2013-09-29 2015-09-23 国家电网公司 Large-data-volume based database table acquisition optimizing technique
CN104933042B (en) * 2013-09-29 2018-04-13 国家电网公司 Database table optimization of collection technology based on big data quantity
WO2014180411A1 (en) * 2013-12-17 2014-11-13 中兴通讯股份有限公司 Distributed index generation method and device
CN104077355A (en) * 2014-05-29 2014-10-01 中国银行股份有限公司 Methods, devices and system for storing and inquiring unstructured data
CN104317738B (en) * 2014-10-24 2017-07-25 中国科学技术大学 A kind of incremental calculation method based on MapReduce
CN104317738A (en) * 2014-10-24 2015-01-28 中国科学技术大学 Incremental computation method on basis of MapReduce
CN104598562A (en) * 2015-01-08 2015-05-06 浪潮软件股份有限公司 XML file processing method and device based on MapReduce parallel computing model
US10558399B2 (en) 2015-03-26 2020-02-11 International Business Machines Corporation File system block-level tiering and co-allocation
CN106021268B (en) * 2015-03-26 2020-04-10 国际商业机器公司 File system block level layering and co-allocation
US11593037B2 (en) 2015-03-26 2023-02-28 International Business Machines Corporation File system block-level tiering and co-allocation
CN106021268A (en) * 2015-03-26 2016-10-12 国际商业机器公司 File system block-level tiering and co-allocation
CN105138615B (en) * 2015-08-10 2019-02-26 北京思特奇信息技术股份有限公司 A kind of method and system constructing big data distributed information log
CN105138615A (en) * 2015-08-10 2015-12-09 北京思特奇信息技术股份有限公司 Method and system for building big data distributed log
CN106570425A (en) * 2015-10-10 2017-04-19 北京奇虎科技有限公司 Hard disk data encryption method and system
CN105426493A (en) * 2015-11-24 2016-03-23 北京中电普华信息技术有限公司 Data processing system and method applied to distributed storage system
CN105893435A (en) * 2015-12-11 2016-08-24 乐视网信息技术(北京)股份有限公司 Data loading and storing equipment, method and system
CN105608160A (en) * 2015-12-21 2016-05-25 浪潮软件股份有限公司 Distributed big data analysis method
CN105677710A (en) * 2015-12-28 2016-06-15 曙光信息产业(北京)有限公司 Processing method and system of big data
CN105787597A (en) * 2016-01-20 2016-07-20 北京优弈数据科技有限公司 Data optimizing processing system
CN105787597B (en) * 2016-01-20 2019-12-06 大连优弈数据科技有限公司 Data optimization processing system
CN106250784A (en) * 2016-07-20 2016-12-21 乐视控股(北京)有限公司 Full disk encryption method and device
CN106339473A (en) * 2016-08-29 2017-01-18 北京百度网讯科技有限公司 Method and device for copying file
CN110268397A (en) * 2016-12-30 2019-09-20 日彩电子科技(深圳)有限公司 Effectively optimizing data layout method applied to data warehouse
CN106845276A (en) * 2017-02-13 2017-06-13 湖南财政经济学院 A kind of big data based on network security implements system
TWI760403B (en) * 2017-03-23 2022-04-11 韓商愛思開海力士有限公司 Data storage device and operating method thereof
CN107329982A (en) * 2017-06-01 2017-11-07 华南理工大学 A kind of big data parallel calculating method stored based on distributed column and system
CN107342914A (en) * 2017-06-07 2017-11-10 同济大学 A kind of high availability for cloud platform verifies system
WO2019006640A1 (en) * 2017-07-04 2019-01-10 深圳齐心集团股份有限公司 Big data management system
CN107506394A (en) * 2017-07-31 2017-12-22 武汉工程大学 Optimization method for eliminating big data standard relation connection redundancy
CN107480283A (en) * 2017-08-23 2017-12-15 九次方大数据信息集团有限公司 Realize the method, apparatus and storage system of big data quick storage
CN108681487A (en) * 2018-05-21 2018-10-19 千寻位置网络有限公司 The distributed system and tuning method of sensing algorithm arameter optimization
CN108681487B (en) * 2018-05-21 2021-08-24 千寻位置网络有限公司 Distributed system and method for adjusting and optimizing sensor algorithm parameters
CN109753306A (en) * 2018-12-28 2019-05-14 北京东方国信科技股份有限公司 A kind of big data processing method of because precompiled function caching engine
CN109981674B (en) * 2019-04-04 2021-08-17 北京信而泰科技股份有限公司 Remote procedure calling method, device, equipment and medium
CN109981674A (en) * 2019-04-04 2019-07-05 北京信而泰科技股份有限公司 A kind of remote procedure calling (PRC) method, device, equipment and medium
CN112134914A (en) * 2020-02-10 2020-12-25 北京天德科技有限公司 Distributed secure storage strategy based on cryptography
CN112134914B (en) * 2020-02-10 2021-08-06 北京天德科技有限公司 Distributed secure storage strategy based on cryptography
CN111539029A (en) * 2020-04-25 2020-08-14 章稳建 Industrial internet-based big data storage rate optimization method and cloud computing center
CN111930731A (en) * 2020-07-28 2020-11-13 苏州亿歌网络科技有限公司 Data dump method, device, equipment and storage medium
CN112084158A (en) * 2020-09-25 2020-12-15 北京百家科技集团有限公司 Data set file compression method and device
CN112597348A (en) * 2020-12-15 2021-04-02 电子科技大学中山学院 Method and device for optimizing big data storage

Similar Documents

Publication Publication Date Title
CN103440244A (en) Large-data storage and optimization method
US11423015B2 (en) Log-structured storage systems
US11093455B2 (en) Log-structured storage systems
TWI737395B (en) Log-structured storage systems and method
TWI733514B (en) A storage system, a network node of a blockchain network, and a blockchain-based log-structured storage system
CN102411637B (en) Metadata management method of distributed file system
US11422728B2 (en) Log-structured storage systems
WO2019228572A2 (en) Log-structured storage systems
US11294881B2 (en) Log-structured storage systems
EP3695303B1 (en) Log-structured storage systems
US10903981B1 (en) Log-structured storage systems
CN102413172B (en) Parallel data sharing method based on cluster technology and apparatus thereof
US10942852B1 (en) Log-structured storage systems
CN102833580A (en) High-definition video application system and method based on infiniband
CN102480489A (en) Logging method and device used in distributed environment
CN202872848U (en) Cloud storage terminal equipment based on cloud information and cloud computing services

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20131211