CN104902022B - Distributed file acquisition method and distributed file acquisition system - Google Patents

Distributed file acquisition method and distributed file acquisition system Download PDF

Info

Publication number
CN104902022B
CN104902022B CN201510280153.0A CN201510280153A CN104902022B CN 104902022 B CN104902022 B CN 104902022B CN 201510280153 A CN201510280153 A CN 201510280153A CN 104902022 B CN104902022 B CN 104902022B
Authority
CN
China
Prior art keywords
user
download
file
hadoop
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201510280153.0A
Other languages
Chinese (zh)
Other versions
CN104902022A (en
Inventor
葛祺
窦乐建
崔晶晶
林佳婕
姜兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xiaoxiang Innovation Artificial Intelligence Technology Co ltd
Original Assignee
Beijing Geo Polymerization Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Geo Polymerization Technology Co ltd filed Critical Beijing Geo Polymerization Technology Co ltd
Priority to CN201510280153.0A priority Critical patent/CN104902022B/en
Publication of CN104902022A publication Critical patent/CN104902022A/en
Application granted granted Critical
Publication of CN104902022B publication Critical patent/CN104902022B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a distributed file acquisition method and a distributed file acquisition system, wherein the method comprises the following steps: the client sends a file downloading request to a Hadoop file system; mapping a file list to be downloaded by a user to the user from a Hadoop file system; and returning a file list to be downloaded by the user to the client. The step of the client sending a file downloading request to the Hadoop file system further comprises: the client sends a file downloading request to a downloading interface; the download interface sends a permission verification request to a permission verification module; the authority verification module verifies whether the user has the downloading authority; if the download authority exists, the execution is continued; otherwise, ending. The distributed file acquisition method and the distributed file acquisition system provided by the invention do not need to occupy too much network bandwidth and Hadoop resources, so that the Hadoop resources can be saved.

Description

A kind of distributed document acquisition methods and distributed document obtain system
Technical field
The invention belongs to distributed data processing field more particularly to a kind of distributed document acquisition methods and distribution text Part obtains system.
Background technique
Hadoop distributed file system (Hadoop Distributed File System, HDFS) is a kind of suitable fortune Distributed file system of the row on common hardware (commodity hardware).The data that HDFS can provide high-throughput are visited It asks, the application being very suitable on large-scale dataset.For external client, HDFS is just as a traditional hierarchial file structure system System.It can create, delete, moving or Rename file, etc..The framework of HDFS is constructed based on one group of specific node, These nodes include NameNode (only one), and Metadata Service is provided inside HDFS;DataNode provides for HDFS Memory block.Wherein, NameNode is the software run on an independent machine usually in HDFS example.It is responsible for management text The access of part system name space and control external client.NameNode decides whether will be on File Mapping to DataNode In copy block.For most common 3 copy blocks, first copy block is stored on the different nodes of same rack, last A copy block is stored on some node of different racks.NameNode stores all about file system name in one file Claim the information in space.This file and a record file comprising all affairs will be stored in the local file system of NameNode On system.
Existing file acquisition method is the file of the HDFS file system directories of timing scan Hadoop, downloads file Onto local file system, when user will download file, then file is directly obtained from local file system.Such way It will cause the waste of Hadoop resource rather.
Therefore, better file acquisition method how is provided, becomes technical staff's problem in need of consideration.
Summary of the invention
Technical problem to be solved by the invention is to provide a kind of distributed document acquisition methods and distributed document to obtain System saves Hadoop resource.
In order to solve the above-mentioned technical problems, the present invention provides a kind of distributed document acquisition methods, comprising:
Client sends file download request to Hadoop file system;
The listed files that user to be downloaded is mapped to user from Hadoop file system;
The listed files to be downloaded of user is returned to client.
As the preferred embodiment of the present invention, the client sends file download request to Hadoop file system Step further comprises:
Client sends file download request to download interface;
Download interface sending permission checking request is to Authority Verification module;
Whether Authority Verification module verification user has download permission;If there is download permission then continues to execute;Otherwise terminate.
As the preferred embodiment of the present invention, the step of the end, further comprise:
Lack of competence download information is returned to download interface;
Download interface sends unexpected message to client and disconnects the connection with client, terminates.
In order to solve the above-mentioned technical problem, the present invention also provides a kind of distributed documents to obtain system, comprising:
Client modules, for sending file download request to Hadoop file system module;
Hadoop File Mapping module is reflected for the listed files to be downloaded user from Hadoop file system module It penetrates to user;
Hadoop file system module, for returning to listed files that user to be downloaded to client modules.
As the preferred embodiment of the present invention, the system also includes:
Download interface module for receiving the file download request, sending permission checking request, and receives downloading result Return to client modules;
Authority Verification module verifies whether user has download permission for receiving the Authority Verification request;If so, Then send the message for having permission downloading;Correspondingly,
The Hadoop File Mapping module is further used for wanting user when receiving the message for having permission downloading The listed files of downloading is mapped to user from Hadoop file system.
As the preferred embodiment of the present invention,
The Authority Verification module is further used for returning lack of competence download information to download interface module;
The download interface module is further used for lower transmission unexpected message to client modules and disconnects and client mould The connection of block.
Distributed document acquisition methods and distributed document provided by the invention obtain system, do not need to occupy too many net Network bandwidth and Hadoop resource, so as to save Hadoop resource.
Detailed description of the invention
Fig. 1 is the distributed document acquisition methods flow chart of one embodiment of the invention.
Fig. 2 is the distributed document acquisition methods flow chart of another embodiment of the invention.
Fig. 3 is the distributed file system structural schematic diagram of one embodiment of the invention.
Fig. 4 is the distributed file system structural schematic diagram of another embodiment of the invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that the described embodiment is only a part of the embodiment of the present invention, rather than all.Based in the present invention Embodiment, every other embodiment obtained by those of ordinary skill in the art without making creative efforts, all Belong to the scope of protection of the invention.
Main idea is that when user will download file, by the HDFS file of Hadoop by way of mapping Some catalogue of system is mapped to local file system, similar shortcut.In this case there is no need to occupy more Netowrk tapes Wide and Hadoop resource.
It is the distributed document acquisition methods flow chart of one embodiment of the invention shown in referring to Fig.1.The method packet It includes:
101, client sends file download request to Hadoop file system;
102, the listed files that user to be downloaded is mapped to user from Hadoop file system;
103, the listed files to be downloaded of user is returned to client.
Referring to shown in Fig. 2, for the distributed document acquisition methods flow chart of another embodiment of the invention.The method packet It includes:
201, client sends file download request to download interface;
202, download interface sending permission checking request to Authority Verification module;
203, whether Authority Verification module verification user has download permission;If so, thening follow the steps 204, otherwise execute Step 208;
204, Authority Verification module, which is sent, has permission the message of downloading to Hadoop File Mapping module;
205, the Hadoop File Mapping module listed files to be downloaded user, are mapped to use from Hadoop file system Family;
206, the listed files that return user needs to download interface;
207, download interface transmission returns results to client, terminates;
208, lack of competence download information is returned to download interface module;
209, download interface module sends unexpected message and disconnects, and terminates.
It is the distributed file system structural schematic diagram of one embodiment of the invention referring to shown in Fig. 3.The system packet It includes:
Client modules 301, for sending file download request to Hadoop file system module;
Hadoop File Mapping module 302, for the listed files to be downloaded user, from Hadoop file system module It is mapped to user;
Hadoop file system module 303, for returning to listed files that user to be downloaded to client modules.
Referring to shown in Fig. 4, for the distributed file system structural schematic diagram of another embodiment of the invention.The system packet It includes:
Client modules 401, for sending file download request;
Download interface module 402 for receiving the file download request, sending permission checking request, and receives downloading As a result client modules 401 are returned to;
Authority Verification module 403, for verifying whether user has download permission;Downloading is had permission if so, then sending Otherwise message returns to lack of competence download information to download interface module 402;
Hadoop File Mapping module 404, for receiving the message for having permission downloading;And the text to be downloaded user Part list is mapped to user from Hadoop file system;
Hadoop file system 405, for returning to the listed files of user's needs to download interface module 402.
In other embodiment of the present invention, the Authority Verification module 403 is further used for returning to information carrying under lack of competence Cease download interface module 402;
The download interface module 402 is further used for lower transmission unexpected message to client modules 401 and disconnects and visitor The connection of family end module 401.
Above-described specific embodiment has carried out further the purpose of the present invention, technical scheme and beneficial effects It is described in detail, it should be understood that being not intended to limit the present invention the foregoing is merely a specific embodiment of the invention Protection scope, all within the spirits and principles of the present invention, any modification, equivalent substitution, improvement and etc. done should all include Within protection scope of the present invention.

Claims (2)

1. a kind of distributed document acquisition methods characterized by comprising
Client sends file download request to Hadoop file system, specific sub-step are as follows: client is sent under file It carries request and arrives download interface, download interface sending permission checking request to Authority Verification module, Authority Verification module verification user Whether download permission is had: if there is download permission then continues to execute;Otherwise lack of competence download information is returned to download interface, downloading Interface sends unexpected message to client and disconnects the connection with client, terminates;
The listed files that user to be downloaded is mapped to user from Hadoop file system;
The listed files to be downloaded of user is returned to client.
2. a kind of distributed document obtains system characterized by comprising
Client modules, for sending file download request to Hadoop file system module;
Hadoop File Mapping module is mapped to for the listed files to be downloaded user from Hadoop file system module User;
Hadoop file system module, for returning to listed files that user to be downloaded to client modules;
Download interface module for receiving the file download request, sending permission checking request, and receives downloading result and returns To client modules;
Authority Verification module verifies whether user has download permission for receiving the Authority Verification request;If so, then sending out Send the message for having permission downloading;Correspondingly, the Hadoop File Mapping module, is further used for working as to receive having permission downloading Message when, the listed files that user to be downloaded is mapped to user from Hadoop file system;
The Authority Verification module is further used for returning lack of competence download information to download interface module;
The download interface module is further used for lower transmission unexpected message to client modules and disconnects and client modules Connection.
CN201510280153.0A 2015-05-27 2015-05-27 Distributed file acquisition method and distributed file acquisition system Expired - Fee Related CN104902022B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510280153.0A CN104902022B (en) 2015-05-27 2015-05-27 Distributed file acquisition method and distributed file acquisition system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510280153.0A CN104902022B (en) 2015-05-27 2015-05-27 Distributed file acquisition method and distributed file acquisition system

Publications (2)

Publication Number Publication Date
CN104902022A CN104902022A (en) 2015-09-09
CN104902022B true CN104902022B (en) 2019-02-26

Family

ID=54034418

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510280153.0A Expired - Fee Related CN104902022B (en) 2015-05-27 2015-05-27 Distributed file acquisition method and distributed file acquisition system

Country Status (1)

Country Link
CN (1) CN104902022B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110597764B (en) * 2019-10-10 2024-05-07 深圳前海微众银行股份有限公司 File downloading and version management method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750391A (en) * 2012-07-06 2012-10-24 深圳市远行科技有限公司 File previewing method and system based on Hadoop distribution type
CN103279474A (en) * 2013-04-10 2013-09-04 深圳康佳通信科技有限公司 Video file index method and system
CN103581190A (en) * 2013-11-07 2014-02-12 江南大学 Method for control over file safety access based on cloud computing technology
CN103577500A (en) * 2012-08-10 2014-02-12 腾讯科技(深圳)有限公司 Method for carrying out data processing by distributed file system and distributed file system
CN104038771A (en) * 2014-06-19 2014-09-10 常州大学 High-effect streaming media file distributed storage system and method based on Hadoop2

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750391A (en) * 2012-07-06 2012-10-24 深圳市远行科技有限公司 File previewing method and system based on Hadoop distribution type
CN103577500A (en) * 2012-08-10 2014-02-12 腾讯科技(深圳)有限公司 Method for carrying out data processing by distributed file system and distributed file system
CN103279474A (en) * 2013-04-10 2013-09-04 深圳康佳通信科技有限公司 Video file index method and system
CN103581190A (en) * 2013-11-07 2014-02-12 江南大学 Method for control over file safety access based on cloud computing technology
CN104038771A (en) * 2014-06-19 2014-09-10 常州大学 High-effect streaming media file distributed storage system and method based on Hadoop2

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
《基于HDFS的云计算安全技术研究与实现》;郭圣昌;《中国优秀硕士学位论文全文数据库信息科技辑》;20131015;正文第25、32-33页,图3-1 *
郭圣昌.《基于HDFS的云计算安全技术研究与实现》.《中国优秀硕士学位论文全文数据库信息科技辑》.2013,正文第25-33页. *

Also Published As

Publication number Publication date
CN104902022A (en) 2015-09-09

Similar Documents

Publication Publication Date Title
AU2016346890B2 (en) Selective synchronization and distributed content item block caching for multi-premises hosting of digital content items
CN106537881B (en) Method and computing equipment for allowing synchronous access to cloud storage system based on stub tracking
CN105045802B (en) A kind of polymorphic type previewing file system of message-driven
CN103237046B (en) Support distributed file system and the implementation method of mixed cloud storage application
CN102404338B (en) File synchronization method and device
WO2017167100A1 (en) Data migration method and device
CN105740418A (en) File monitoring and message pushing based real-time synchronization system
CN109542865A (en) Distributed cluster system configuration file synchronous method, device, system and medium
CN103873290A (en) Evaluating distributed application performance in a new environment
CN105025053A (en) Distributed file upload method based on cloud storage technology and system
CN102882985A (en) File sharing method based on cloud storage
US9847903B2 (en) Method and apparatus for configuring a communication system
CN106294870B (en) Object-based distribution cloud storage method
CN109818934A (en) A kind of method, apparatus and calculating equipment of automation daily record processing
CN102880658A (en) Distributed file management system based on seismic data processing
CN111400777B (en) Network storage system, user authentication method, device and equipment
CN104348859B (en) File synchronisation method, device, server, terminal and system
EP2842034B1 (en) Providing client and service compatibility through cloud-hosted adapters
CN106953910A (en) A kind of Hadoop calculates storage separation method
CN104104582B (en) A kind of data storage path management method, client and server
CN105490843A (en) Information processing method and system
CN106250571A (en) The method and system that a kind of ETL data process
CN109525590A (en) The transmission method and device of data packet
CN114357252A (en) Storage method, system and storage medium of cross-source multi-domain distributed data
CN101483668A (en) Network storage and access method, device and system for hot spot data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220428

Address after: 100000 room 116, building 3, Shuangqiao (Shuangqiao dairy factory), Chaoyang District, Beijing

Patentee after: Beijing Xiaoxiang innovation Artificial Intelligence Technology Co.,Ltd.

Address before: 100085 901, 9th floor, building 5, yard 1, Shangdi East Road, Haidian District, Beijing

Patentee before: BEIJING GEO POLYMERIZATION TECHNOLOGY Co.,Ltd.

TR01 Transfer of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190226

CF01 Termination of patent right due to non-payment of annual fee