CN102932443A - HDFS (hadoop distributed file system) cluster based distributed cloud storage system - Google Patents

HDFS (hadoop distributed file system) cluster based distributed cloud storage system Download PDF

Info

Publication number
CN102932443A
CN102932443A CN2012104191579A CN201210419157A CN102932443A CN 102932443 A CN102932443 A CN 102932443A CN 2012104191579 A CN2012104191579 A CN 2012104191579A CN 201210419157 A CN201210419157 A CN 201210419157A CN 102932443 A CN102932443 A CN 102932443A
Authority
CN
China
Prior art keywords
user
file
storage
cloud storage
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012104191579A
Other languages
Chinese (zh)
Inventor
陈国庆
郭蒙蒙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SUZHOU LIANGJIANG TECHNOLOGY Co Ltd
Original Assignee
SUZHOU LIANGJIANG TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SUZHOU LIANGJIANG TECHNOLOGY Co Ltd filed Critical SUZHOU LIANGJIANG TECHNOLOGY Co Ltd
Priority to CN2012104191579A priority Critical patent/CN102932443A/en
Publication of CN102932443A publication Critical patent/CN102932443A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses an HDFS (hadoop distributed file system) cluster based distributed cloud storage system which comprises a name node and a data node, wherein the name node is responsible for managing file namespaces and client access, and the data node is responsible for managing data storage. The system is characterized in that the name node comprises a registration/certification module and a user customization module, wherein the registration/certification module is used for providing users with a login service and carrying out certification on user login requests; and the user customization module is used for customizing the capacity of a storage space of a login user, and after the user logins successfully, the user sets a storage space allowance reminder. According to the method, the size of a storage space of a user is customized, therefore, an effect of distribution based on need is achieved, and resources are rationally used; the application of distributed storage improves the storage efficiency; and the application of multi-copy storage improves the data reliability of storage.

Description

Distributed cloud storage system based on the HDFS cluster
Technical field
The invention belongs to the cloud technical field of memory, be specifically related to a kind of distributed cloud storage system based on the HDFS cluster.
Background technology
Along with the development of Internet technology, amount of information is explosive increase, and the data storage becomes the key issue of restriction enterprise development gradually.Increasing enterprise begins the data storage separated as project independently and manages.High reliability, high universalizable, high scalability, large capacity, cloud is stored with the incomparable advantages characteristic of conventional data centers, is just becoming the important selection that enterprise realizes raising the efficiency, reducing cost.
Compare with traditional memory device, the cloud storage is not only a hardware, but the system that a plurality of parts such as the network equipment, memory device, server, application software, public visit interface, Access Network and a client-side program form.
The cloud storage does not refer to some concrete equipment, and refers to an aggregate that is made of various memory devices and server the user.The user uses the cloud storage, is not to use some memory devices, but a kind of data access service of using whole cloud storage system to bring.The core of cloud storage is that application software combines with memory device, realizes that by application software memory device is to the transformation of stores service.
At present cloud storage is apparent to user's's (no matter being individual or enterprise) meaning: be stored in data on the network and can access whenever and wherever possible and read (as long as can network), save local storage factor and acquire cost according to the additional hardware that growth brings, substantially need not consider the maintenance issues such as data backup, only need to select suitable cloud storage service provider and pay as required correlative charges to get final product.Use the cloud storage to become a kind of trend as stores service, the research that cloud is stored has broad application prospects.
What traditional cloud storage File storage was adopted is centralised storage, and file is stored in the local file system.The deficiency of this storage mode then is when the same data of needs multiaccess, to need many parts of backups.In addition, for the resource that only is kept in the file system, if there is system's machine of delaying, then can cause loss of data.For these problems, existing solution has user data is saved in server, only has and can network, and the user can access its data anywhere or anytime, need not many backups.But the problem of this existence then is the problem of a data reliability, and the machine in case server is delayed will cause user data loss.The present invention therefore.
Summary of the invention
The object of the invention is to provide a kind of distributed cloud storage system based on the HDFS cluster, has solved the storage of prior art medium cloud storage File and has adopted centralized stores, and the user can not customize, and file transfer is difficult to the problems such as control.
In order to solve these problems of the prior art, technical scheme provided by the invention is:
A kind of distributed cloud storage system based on the HDFS cluster, comprise the title node of being in charge of file name space and client-access and the back end of being responsible for data are stored into the line pipe reason, it is characterized in that described title node comprises authentication registration module and customization module, described authentication registration module is used for providing the user to register and user's logging request being authenticated; Described customization module is used for registered user's storage space volume is customized, and the user arranges the prompting of memory space surplus after the user successfully logins.
Preferably, described user directly uploads download file or uploads download file by the title node to back end to back end by client.
Preferably, when the user carried out upload operation, the user at first connected with the server end of title node, obtained user's remaining space size, if user's remaining space is not enough, then pointed out the memory space surplus, and returned the user and store main interface; Otherwise upload user's file, and prompting memory space surplus.
Preferably, when the user carried out down operation, the user at first connected with server end by client, and the sign that user's transmission is read and file identification are to server end, and server end reads sign, returns to the client file data according to file identification.
The registrable account of user in the technical solution of the present invention is applied for a certain size memory space.But the user user name that succeeds in registration and password login storage system.But the user can check its usage space size and the file of having uploaded, and the user can upload local file to server, and server deposits user data among the HDFS in; The user can download its file that has uploaded onto the server.When user's space is not enough, prompting user.
Technical solution of the present invention adopts the piecemeal storage, accelerates storage efficiency; The many copies of file of storage, the reliability of assurance user data.Compare with traditional document storage system, technical solution of the present invention has user's usability, and the user is the accessor server data whenever and wherever possible, adopts the HDFS distributed memory system to improve the storage efficiency of system.The cloud storage is applied in the storage of subscriber data system, possess the user data reliability, even certain loom in the server stores system has been delayed machine, the user still can obtain correct data.
The present invention compared with prior art has following beneficial effect:
Technical solution of the present invention has compared with prior art realized the customization of user's size, and resource is rationally used in distribution according to need; The user can be under networking situation calling party data whenever and wherever possible, need not to copy everywhere; Use distributed storage to improve storage efficiency; Use many copy storages, improved the data reliability of storage.
Description of drawings
The invention will be further described below in conjunction with drawings and Examples:
Fig. 1 is that technical solution of the present invention is based on the Organization Chart of the distributed cloud storage system of HDFS cluster;
Fig. 2 is that technical solution of the present invention is based on the overview flow chart of the distributed cloud storage system of HDFS cluster;
Fig. 3 is the workflow diagram that the technical solution of the present invention user carries out upload file;
Fig. 4 is the workflow diagram that the technical solution of the present invention user carries out download file;
Fig. 5 is the workflow diagram that the technical solution of the present invention user carries out deleted file.
Embodiment
Below in conjunction with specific embodiment such scheme is described further.Should be understood that these embodiment are not limited to limit the scope of the invention for explanation the present invention.The implementation condition that adopts among the embodiment can be done further adjustment according to the condition of concrete producer, and not marked implementation condition is generally the condition in the normal experiment.
Embodiment
The distributed cloud storage system based on the HDFS cluster that the present embodiment obtains, comprise the title node of being in charge of file name space and client-access and the back end of being responsible for data are stored into the line pipe reason, described title node comprises authentication registration module and customization module, and described authentication registration module is used for providing the user to register and user's logging request being authenticated; Described customization module is used for registered user's storage space volume is customized, and the user arranges the prompting of memory space surplus after the user successfully logins.
As shown in Figure 1, carry out the process of map-reduce data decomposition in the HDFS cluster.When specifically disposing, the implementation step is described below referring to Fig. 2:
1) build cluster, configuration HDFS arranges the Namenode/Datanode node, a host node, and two from node, and the copy number.
2) user's registration: when the user registers, judge mainly whether user name is used, what still adopt is the storage mode of structural data herein, if user name exists, then can not register; If user name does not exist, judge then whether the password that the user inputs has contained capital and small letter, digital alphabet etc., the user creates successfully if meet the requirements then, otherwise the prompting user registration failure.
3) user user name and password login system, background data base connects, and judges whether user's input is correct, if checking correctly then enter user storage space, shows that the user stores main interface, otherwise returns login interface.
4) enter after the user stores the interface, control page jump with the action class, operate execution according to the user, the user can select to check the operations such as individual storage information, upload file, then jumps to the corresponding page.
5) be illustrated in figure 3 as the workflow diagram that the user carries out upload file.Select upload file, at first subscriber's main station and server end connect, and obtain user's remaining space size, if user's remaining space is not enough, then provide corresponding prompting, and return the user and store main interface; Otherwise server reads the user and writes sign, and the spanned file path returns to client, and server end writes HDFS with user data.HDFS is then according to load balancing, choose load in the cluster lower from node, piecemeal storage user data, block size can configure according to actual conditions, usually be set to 64M, after the data of having stored a block size, back up according to number of copies, HDFS at first selects to carry out copy storage with the node of storage node on same frame, and then selects not another node on same frame to store.
6) be illustrated in figure 4 as the workflow diagram that the user carries out download file.Select viewing files, the user can download or delete the user file that it has been uploaded.During download file, client at first connects with server end, the sign that the client transmission is downloaded and file identification are to server end, server end reads sign, file identification is sent to HDFS, HDFS finds the nearest position from node of its storage in host node, file reading returns to the client file data according to file identification.Be illustrated in figure 5 as the workflow diagram that the user carries out deleted file.During deleted file, server end according to file identification, is done a mark to file after reading sign, transfer to the HDFS deleted file.HDFS can detect the heartbeat that each file sends every certain time interval, if heartbeat is designated deletion, then HDFS can delete this document.
The effect that below is the centralised storage of routine in the present embodiment and the prior art compares, and is as shown in table 1.
The effect of table 1 technical solution of the present invention and centralised storage relatively
Figure BDA00002320432900051
Above-mentioned example only is explanation technical conceive of the present invention and characteristics, and its purpose is to allow the people who is familiar with technique can understand content of the present invention and according to this enforcement, can not limit protection scope of the present invention with this.All equivalent transformations that Spirit Essence is done according to the present invention or modification all should be encompassed within protection scope of the present invention.

Claims (4)

1. distributed cloud storage system based on the HDFS cluster, comprise the title node of being in charge of file name space and client-access and the back end of being responsible for data are stored into the line pipe reason, it is characterized in that described title node comprises authentication registration module and customization module, described authentication registration module is used for providing the user to register and user's logging request being authenticated; Described customization module is used for registered user's storage space volume is customized, and the user arranges the prompting of memory space surplus after the user successfully logins.
2. the distributed cloud storage system based on the HDFS cluster according to claim 1 is characterized in that described user directly uploads download file or uploads download file by the title node to back end to back end by client.
3. the distributed cloud storage system based on the HDFS cluster according to claim 2, it is characterized in that when the user carries out upload operation, the user at first connects with the server end of title node, obtain user's remaining space size, if user's remaining space is not enough, then point out the memory space surplus, and return the user and store main interface; Otherwise upload user's file, and prompting memory space surplus.
4. the distributed cloud storage system based on the HDFS cluster according to claim 2, it is characterized in that when the user carries out down operation, the user at first connects with server end by client, the sign that user's transmission is read and file identification are to server end, server end reads sign, returns to the client file data according to file identification.
CN2012104191579A 2012-10-29 2012-10-29 HDFS (hadoop distributed file system) cluster based distributed cloud storage system Pending CN102932443A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012104191579A CN102932443A (en) 2012-10-29 2012-10-29 HDFS (hadoop distributed file system) cluster based distributed cloud storage system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012104191579A CN102932443A (en) 2012-10-29 2012-10-29 HDFS (hadoop distributed file system) cluster based distributed cloud storage system

Publications (1)

Publication Number Publication Date
CN102932443A true CN102932443A (en) 2013-02-13

Family

ID=47647140

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012104191579A Pending CN102932443A (en) 2012-10-29 2012-10-29 HDFS (hadoop distributed file system) cluster based distributed cloud storage system

Country Status (1)

Country Link
CN (1) CN102932443A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103116643A (en) * 2013-02-25 2013-05-22 江苏物联网研究发展中心 Hadoop-based intelligent medical data management method
CN103442057A (en) * 2013-08-27 2013-12-11 玉林师范学院 Cloud storage system based on user collaboration cloud
CN105898601A (en) * 2015-12-11 2016-08-24 乐视网信息技术(北京)股份有限公司 File play method and device
CN107592503A (en) * 2017-09-22 2018-01-16 唐山开用网络信息服务有限公司 Audio-video collection management system of enforcing the law and management method
CN107613026A (en) * 2017-10-31 2018-01-19 四川仕虹腾飞信息技术有限公司 Distributed file management system based on cloud storage system
CN107741981A (en) * 2017-10-16 2018-02-27 桂进林 A kind of e-book management method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201657029U (en) * 2010-04-15 2010-11-24 王鹏 Cloud storage system based on cloud computing framework
CN102196049A (en) * 2011-05-31 2011-09-21 北京大学 Method suitable for secure migration of data in storage cloud
CN102404399A (en) * 2011-11-18 2012-04-04 浪潮电子信息产业股份有限公司 Fuzzy dynamic allocation method for cloud storage resource
CN102594899A (en) * 2011-12-31 2012-07-18 成都市华为赛门铁克科技有限公司 Storage service method and storage server using the same
US20120182891A1 (en) * 2011-01-19 2012-07-19 Youngseok Lee Packet analysis system and method using hadoop based parallel computation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN201657029U (en) * 2010-04-15 2010-11-24 王鹏 Cloud storage system based on cloud computing framework
US20120182891A1 (en) * 2011-01-19 2012-07-19 Youngseok Lee Packet analysis system and method using hadoop based parallel computation
CN102196049A (en) * 2011-05-31 2011-09-21 北京大学 Method suitable for secure migration of data in storage cloud
CN102404399A (en) * 2011-11-18 2012-04-04 浪潮电子信息产业股份有限公司 Fuzzy dynamic allocation method for cloud storage resource
CN102594899A (en) * 2011-12-31 2012-07-18 成都市华为赛门铁克科技有限公司 Storage service method and storage server using the same

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103116643A (en) * 2013-02-25 2013-05-22 江苏物联网研究发展中心 Hadoop-based intelligent medical data management method
CN103442057A (en) * 2013-08-27 2013-12-11 玉林师范学院 Cloud storage system based on user collaboration cloud
CN105898601A (en) * 2015-12-11 2016-08-24 乐视网信息技术(北京)股份有限公司 File play method and device
CN107592503A (en) * 2017-09-22 2018-01-16 唐山开用网络信息服务有限公司 Audio-video collection management system of enforcing the law and management method
CN107741981A (en) * 2017-10-16 2018-02-27 桂进林 A kind of e-book management method and device
CN107613026A (en) * 2017-10-31 2018-01-19 四川仕虹腾飞信息技术有限公司 Distributed file management system based on cloud storage system

Similar Documents

Publication Publication Date Title
US11750607B2 (en) Identifying accounts having shared credentials
CN104008028B (en) Intelligent mobile terminal data backup memory method and system based on many cloud storages
US9130922B2 (en) Using a session continuity token to access an online content management system
CN102904870B (en) Server unit and information processing method
CN110827097A (en) Tax management method, apparatus, medium and electronic device based on block chain system
CN102932443A (en) HDFS (hadoop distributed file system) cluster based distributed cloud storage system
CN103036956A (en) Filing system and implement method of distributed configured massive data
CN103095848B (en) The cloud folder arrangement of To enterprises client and the method for information interaction
CN103731489B (en) A kind of date storage method, system and equipment
CN103180842A (en) Cloud computing system and data synchronization method therefor
CN103605798A (en) Method for directly operating file stored at cloud end
CN110191128A (en) A kind of tax shared file system and implementation method based on HDFS
CN104462185A (en) Digital library cloud storage system based on mixed structure
CN108776758A (en) The block level data De-weight method of dynamic ownership management is supported in a kind of storage of mist
US20190354395A1 (en) Limiting folder and link sharing
CN104333553A (en) Mass data authority control strategy based on combination of blacklist and whitelist
US20240106902A1 (en) Communication protocols for an online content management system
US20120047568A1 (en) Digital Asset Management on the Internet
US9436769B2 (en) Automatic device upload configuration
CN109951567A (en) A kind of Double Data center applications dispositions method
US10839090B2 (en) Digital data processing system for efficiently storing, moving, and/or processing data across a plurality of computing clusters
CN111565144A (en) Data layered storage management method for instant communication tool
CN113840013B (en) Document system for hierarchical management
Rongqiang et al. Sceapi: A unified restful web api for high-performance computing
US10412586B2 (en) Limited-functionality accounts

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
AD01 Patent right deemed abandoned

Effective date of abandoning: 20170111

AD01 Patent right deemed abandoned