CN104113597A - Multi- data-centre hadoop distributed file system (HDFS) data read-write system and method - Google Patents
Multi- data-centre hadoop distributed file system (HDFS) data read-write system and method Download PDFInfo
- Publication number
- CN104113597A CN104113597A CN201410344218.9A CN201410344218A CN104113597A CN 104113597 A CN104113597 A CN 104113597A CN 201410344218 A CN201410344218 A CN 201410344218A CN 104113597 A CN104113597 A CN 104113597A
- Authority
- CN
- China
- Prior art keywords
- data
- metadata
- data center
- client
- hdfs
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (5)
- The HDFS data read-write system of 1.Yi Zhongduo data center, is characterized in that, comprises a global metadata server, a n data center, a client, and all there are a metadata node and a plurality of back end in each data center; Between global metadata server and client He Ge data center metadata node, adopt wide area network to link, between the metadata node of each data center and back end, by local area network (LAN), linked; Global metadata server, for the metadata information of the store and management overall situation, is responsible for each data center and distributes metadata NameSpace; The metadata node of each data center all comprises a GMS plugin module, is responsible for to global metadata server registration and regularly reports data center's resource using status and metadata information; Global metadata server is responsible for receiving client HDFS reading and writing data access request, and selects according to default dispatching algorithm the data center meeting the demands; The metadata node at client-access selected data center, by this metadata node, carried out the scheduling of HDFS reading and writing data, client is after HDFS reading and writing data completes, and the metadata node of data center is synchronized to global metadata server by the change information of metadata again.
- The HDFS data read-write method of 2.Yi Zhongduo data center, is characterized in that, comprises the large step of read and write two:The first step, HDFS data are read, and comprising:(1) set up global metadata server, for the metadata information of the store and management overall situation; Global metadata server Wei Ge data center distributes NameSpace, and metadata information is reported to global metadata server by each data center;(2) global metadata server receives client read data request, selects to meet the data center of reading requirement by preset algorithm, returns to the metadata node information at selected data center;(3) metadata node of client-access data center, metadata node returns to client according to default dispatching algorithm data block and back end information;(4) client and back end carry out alternately, and reading out data has read rear notice metadata node, and read data has operated;Second step, HDFS data are write, and comprising:(1) step of reading with HDFS data (1);(2) global metadata server receives client read data request, by preset algorithm, selects to meet the data center that writes requirement, returns to the metadata node information at selected data center;(3) metadata node of the selected HDFS of client-access data center, metadata node creates metadata information, and according to preset algorithm distribute data node, and back end information is returned to client;(4) client and back end carry out alternately, carrying out data writing operation, have write rear notice metadata node; During client data writing, adopt piecemeal writing mechanism, data block copy copy is completed automatically by back end, and all data blocks all write successfully notifies metadata node to write afterwards;(5), after ablation process completes, the metadata node of data center is synchronized to global metadata server by the change information of metadata.
- 3. the HDFS data read-write method of many data centers as claimed in claim 2, is characterized in that, described client read data request comprises any feature of file path, data block index, buffer size; Described client write data requests comprises any feature of new establishment file path, data writing size, access rights.
- 4. the HDFS data read-write method of many data centers as claimed in claim 2, it is characterized in that, the default data center selection algorithm of described global metadata server is according to reading or writing data distribution, the systematic function of request of data He Ge data center, any feature of loading condition, adopts distribute preferential, performance preference strategy of data to select data center.
- 5. the HDFS data read-write method of many data centers as claimed in claim 2, it is characterized in that, the default dispatching algorithm of described metadata node comprises any feature according to the distance of size of data, piecemeal quantity, data block and client, data block distribution, by distance priority, distribution fairness policy, selects.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410344218.9A CN104113597B (en) | 2014-07-18 | 2014-07-18 | The HDFS data read-write method of a kind of many Data centres |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410344218.9A CN104113597B (en) | 2014-07-18 | 2014-07-18 | The HDFS data read-write method of a kind of many Data centres |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104113597A true CN104113597A (en) | 2014-10-22 |
CN104113597B CN104113597B (en) | 2016-06-08 |
Family
ID=51710229
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410344218.9A Active CN104113597B (en) | 2014-07-18 | 2014-07-18 | The HDFS data read-write method of a kind of many Data centres |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104113597B (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104506527A (en) * | 2014-12-23 | 2015-04-08 | 苏州海博智能系统有限公司 | Multidimensional information pointer platform and data access method thereof |
CN105049504A (en) * | 2015-07-09 | 2015-11-11 | 国云科技股份有限公司 | Big data transit transmission synchronization and storage method |
CN105760556A (en) * | 2016-04-19 | 2016-07-13 | 江苏物联网研究发展中心 | Low-time delay high-throughput multi-copy file read-write optimization method |
CN105847392A (en) * | 2016-04-25 | 2016-08-10 | 乐视控股(北京)有限公司 | HDFS writing method and device |
CN106357723A (en) * | 2016-08-15 | 2017-01-25 | 杭州古北电子科技有限公司 | Synchronous system and method for multi-cluster information caching based on cloud host |
CN106502795A (en) * | 2016-11-03 | 2017-03-15 | 郑州云海信息技术有限公司 | The method and system of scientific algorithm application deployment are realized on distributed type assemblies |
WO2017206754A1 (en) * | 2016-05-30 | 2017-12-07 | 中兴通讯股份有限公司 | Storage method and storage device for distributed file system |
CN107483571A (en) * | 2017-08-08 | 2017-12-15 | 柏域信息科技(上海)有限公司 | A kind of dynamic cloud storage method and system |
CN107562926A (en) * | 2017-09-14 | 2018-01-09 | 丙申南京网络技术有限公司 | For more hadoop distributed file systems of big data analysis |
CN107958159A (en) * | 2017-11-15 | 2018-04-24 | 广东电网有限责任公司电力调度控制中心 | A kind of method and system of big data migration |
CN109582686A (en) * | 2018-12-13 | 2019-04-05 | 中山大学 | Distributed meta-data management consistency ensuring method, device, system and application |
CN109726250A (en) * | 2018-12-27 | 2019-05-07 | 星环信息科技(上海)有限公司 | Data-storage system, metadatabase synchronization and data cross-domain calculation method |
CN110022338A (en) * | 2018-01-09 | 2019-07-16 | 阿里巴巴集团控股有限公司 | File reading, system, meta data server and user equipment |
CN110213352A (en) * | 2019-05-17 | 2019-09-06 | 北京航空航天大学 | The unified Decentralized Autonomous storage resource polymerization of name space |
CN111030858A (en) * | 2019-12-06 | 2020-04-17 | 北京浪潮数据技术有限公司 | Data management method, system and related device for distributed multi-cluster system |
CN111124301A (en) * | 2019-12-18 | 2020-05-08 | 深圳供电局有限公司 | Data consistency storage method and system of object storage device |
CN111198849A (en) * | 2020-01-10 | 2020-05-26 | 国网福建省电力有限公司 | Power supply data read-write system based on Hadoop and working method thereof |
CN111327681A (en) * | 2020-01-21 | 2020-06-23 | 北京工业大学 | Cloud computing data platform construction method based on Kubernetes |
CN112395354A (en) * | 2020-11-05 | 2021-02-23 | 深圳市中博科创信息技术有限公司 | Distributed relational database based on HDFS metadata server and construction method |
CN113419687A (en) * | 2021-07-13 | 2021-09-21 | 广东电网有限责任公司 | Object storage method, system, equipment and storage medium |
US20220075757A1 (en) * | 2019-09-27 | 2022-03-10 | Huawei Technologies Co., Ltd. | Data read method, data write method, and server |
CN117076391A (en) * | 2023-10-12 | 2023-11-17 | 长江勘测规划设计研究有限责任公司 | Water conservancy metadata management system |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102419766A (en) * | 2011-11-01 | 2012-04-18 | 西安电子科技大学 | Data redundancy and file operation methods based on Hadoop distributed file system (HDFS) |
CN103473365A (en) * | 2013-09-25 | 2013-12-25 | 北京奇虎科技有限公司 | File storage method and device based on HDFS (Hadoop Distributed File System) and distributed file system |
US20140122429A1 (en) * | 2012-10-31 | 2014-05-01 | International Business Machines Corporation | Data processing method and apparatus for distributed systems |
-
2014
- 2014-07-18 CN CN201410344218.9A patent/CN104113597B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102419766A (en) * | 2011-11-01 | 2012-04-18 | 西安电子科技大学 | Data redundancy and file operation methods based on Hadoop distributed file system (HDFS) |
US20140122429A1 (en) * | 2012-10-31 | 2014-05-01 | International Business Machines Corporation | Data processing method and apparatus for distributed systems |
CN103473365A (en) * | 2013-09-25 | 2013-12-25 | 北京奇虎科技有限公司 | File storage method and device based on HDFS (Hadoop Distributed File System) and distributed file system |
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104506527B (en) * | 2014-12-23 | 2021-12-17 | 苏州海博智能系统有限公司 | Multi-dimensional information pointer platform and data access method thereof |
CN104506527A (en) * | 2014-12-23 | 2015-04-08 | 苏州海博智能系统有限公司 | Multidimensional information pointer platform and data access method thereof |
CN105049504A (en) * | 2015-07-09 | 2015-11-11 | 国云科技股份有限公司 | Big data transit transmission synchronization and storage method |
CN105049504B (en) * | 2015-07-09 | 2019-03-05 | 国云科技股份有限公司 | A kind of big data transfer transmission is synchronous and storage method |
CN105760556A (en) * | 2016-04-19 | 2016-07-13 | 江苏物联网研究发展中心 | Low-time delay high-throughput multi-copy file read-write optimization method |
CN105760556B (en) * | 2016-04-19 | 2019-05-24 | 江苏物联网研究发展中心 | More wave files of low delay high-throughput read and write optimization method |
CN105847392A (en) * | 2016-04-25 | 2016-08-10 | 乐视控股(北京)有限公司 | HDFS writing method and device |
WO2017206754A1 (en) * | 2016-05-30 | 2017-12-07 | 中兴通讯股份有限公司 | Storage method and storage device for distributed file system |
CN107451138A (en) * | 2016-05-30 | 2017-12-08 | 中兴通讯股份有限公司 | A kind of distributed file system storage method and system |
CN106357723A (en) * | 2016-08-15 | 2017-01-25 | 杭州古北电子科技有限公司 | Synchronous system and method for multi-cluster information caching based on cloud host |
CN106502795A (en) * | 2016-11-03 | 2017-03-15 | 郑州云海信息技术有限公司 | The method and system of scientific algorithm application deployment are realized on distributed type assemblies |
CN107483571A (en) * | 2017-08-08 | 2017-12-15 | 柏域信息科技(上海)有限公司 | A kind of dynamic cloud storage method and system |
CN107562926A (en) * | 2017-09-14 | 2018-01-09 | 丙申南京网络技术有限公司 | For more hadoop distributed file systems of big data analysis |
CN107562926B (en) * | 2017-09-14 | 2023-09-26 | 丙申南京网络技术有限公司 | Multi-hadoop distributed file system for big data analysis |
CN107958159A (en) * | 2017-11-15 | 2018-04-24 | 广东电网有限责任公司电力调度控制中心 | A kind of method and system of big data migration |
CN110022338A (en) * | 2018-01-09 | 2019-07-16 | 阿里巴巴集团控股有限公司 | File reading, system, meta data server and user equipment |
CN109582686A (en) * | 2018-12-13 | 2019-04-05 | 中山大学 | Distributed meta-data management consistency ensuring method, device, system and application |
CN109582686B (en) * | 2018-12-13 | 2021-01-15 | 中山大学 | Method, device, system and application for ensuring consistency of distributed metadata management |
CN109726250A (en) * | 2018-12-27 | 2019-05-07 | 星环信息科技(上海)有限公司 | Data-storage system, metadatabase synchronization and data cross-domain calculation method |
CN109726250B (en) * | 2018-12-27 | 2020-01-17 | 星环信息科技(上海)有限公司 | Data storage system, metadata database synchronization method and data cross-domain calculation method |
CN110213352A (en) * | 2019-05-17 | 2019-09-06 | 北京航空航天大学 | The unified Decentralized Autonomous storage resource polymerization of name space |
US20220075757A1 (en) * | 2019-09-27 | 2022-03-10 | Huawei Technologies Co., Ltd. | Data read method, data write method, and server |
CN111030858A (en) * | 2019-12-06 | 2020-04-17 | 北京浪潮数据技术有限公司 | Data management method, system and related device for distributed multi-cluster system |
CN111124301A (en) * | 2019-12-18 | 2020-05-08 | 深圳供电局有限公司 | Data consistency storage method and system of object storage device |
CN111124301B (en) * | 2019-12-18 | 2024-02-23 | 深圳供电局有限公司 | Data consistency storage method and system of object storage device |
CN111198849A (en) * | 2020-01-10 | 2020-05-26 | 国网福建省电力有限公司 | Power supply data read-write system based on Hadoop and working method thereof |
CN111327681A (en) * | 2020-01-21 | 2020-06-23 | 北京工业大学 | Cloud computing data platform construction method based on Kubernetes |
CN112395354A (en) * | 2020-11-05 | 2021-02-23 | 深圳市中博科创信息技术有限公司 | Distributed relational database based on HDFS metadata server and construction method |
CN112395354B (en) * | 2020-11-05 | 2022-08-02 | 深圳市中博科创信息技术有限公司 | Distributed relational database based on HDFS metadata server and construction method |
CN113419687A (en) * | 2021-07-13 | 2021-09-21 | 广东电网有限责任公司 | Object storage method, system, equipment and storage medium |
CN117076391A (en) * | 2023-10-12 | 2023-11-17 | 长江勘测规划设计研究有限责任公司 | Water conservancy metadata management system |
CN117076391B (en) * | 2023-10-12 | 2024-03-22 | 长江勘测规划设计研究有限责任公司 | Water conservancy metadata management system |
Also Published As
Publication number | Publication date |
---|---|
CN104113597B (en) | 2016-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104113597B (en) | The HDFS data read-write method of a kind of many Data centres | |
US9460185B2 (en) | Storage device selection for database partition replicas | |
US10459898B2 (en) | Configurable-capacity time-series tables | |
US9489443B1 (en) | Scheduling of splits and moves of database partitions | |
US9304815B1 (en) | Dynamic replica failure detection and healing | |
CN105190533B (en) | Snapshot in situ | |
US8271455B2 (en) | Storing replication requests for objects in a distributed storage system | |
EP3040886A1 (en) | Service oriented data management and architecture | |
CN110447021A (en) | For maintaining the methods, devices and systems of the consistency of metadata and data between data center | |
CN109314721B (en) | Management of multiple clusters of a distributed file system | |
CN105190623A (en) | Log record management | |
CN103067461A (en) | Metadata management system of document and metadata management method thereof | |
CN103458044A (en) | Metadata sharing management method for multi-storage clusters under wide area network environment | |
US9983823B1 (en) | Pre-forking replicas for efficient scaling of a distribued data storage system | |
CN103067488A (en) | Implement method of unified storage | |
CN105069151A (en) | HBase secondary index construction apparatus and method | |
US20170351620A1 (en) | Caching Framework for Big-Data Engines in the Cloud | |
CN109992373B (en) | Resource scheduling method, information management method and device and task deployment system | |
CN103365740B (en) | A kind of data cold standby method and device | |
CN102693312A (en) | Flexible transaction management method in key-value store data storage | |
CN202872848U (en) | Cloud storage terminal equipment based on cloud information and cloud computing services | |
US8543700B1 (en) | Asynchronous content transfer | |
Salehian et al. | Comparison of spark resource managers and distributed file systems | |
JP2015114913A (en) | Storage device, storage system, and data management program | |
RU2721235C2 (en) | Method and system for routing and execution of transactions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C53 | Correction of patent of invention or patent application | ||
CB03 | Change of inventor or designer information |
Inventor after: Dong Bo Inventor after: Ruan Jianfei Inventor after: Zheng Qinghua Inventor after: He Huan Inventor after: Zhang Hanning Inventor after: Zhang Weizhan Inventor before: Dong Bo Inventor before: Zhang Hanning Inventor before: Zheng Qinghua Inventor before: He Huan Inventor before: Zhang Weizhan |
|
COR | Change of bibliographic data |
Free format text: CORRECT: INVENTOR; FROM: DONG BO ZHANG HANNING ZHENG QINGHUA HE HUAN ZHANG WEIZHAN TO: DONG BO RUANJIANFEI ZHENG QINGHUA HE HUAN ZHANG HANNING ZHANG WEIZHAN |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |