IN2014MU00872A
Authority
IN
India
Prior art keywords
data
distributed file
file system
archiving
onto
Prior art date
Application number
Inventor
Binesh KUTTAN
Vivek JACOB
Abraham Varghese
Thomas Jeby JOHN
Original Assignee
Tata Consultancy Services Ltd
Priority date
Filing date
Publication date
Application filed by Tata Consultancy Services Ltd
Priority to IN872MU2014
Publication of IN2014MU00872A
Information Retrieval, Db Structures And Fs Structures Therefor
(AREA)
Abstract
ACTIVE ARCHIVING OF DATA ON A DISTRIBUTED FILE SYSTEM
Systems and methods for active archiving of data onto a distributed file system are described. A data archiving system may implement an archiving method that includes receiving a request from an authenticated user to archive data onto the distributed file system, wherein the data to be archived is located at a data source, and transferring the data to be archived onto the distributed file system based on a distributed file transfer mechanism, wherein the data is one of structured data and unstructured data. The method further includes loading the data to be archived onto the distributed file system based on an HBase bulk load mechanism. The method also includes indexing the data loaded onto the distributed file system to generate at least one index corresponding to the data, wherein the indexing comprises segmenting the data into a plurality of index segments.
<To be published with Figure 2>
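The steps named in the abstract (authenticated request, transfer, load, segmented indexing) can be illustrated with a small in-memory sketch. This is not the claimed implementation: the class `ArchiveSketch`, its methods, the `segment_size` parameter, and the whitespace tokenizer are all hypothetical, and a plain dictionary stands in for the distributed file system and the HBase bulk load.

```python
from dataclasses import dataclass, field


@dataclass
class ArchiveSketch:
    """In-memory stand-in for the archiving flow; not a real HDFS/HBase client."""
    authorized_users: set
    segment_size: int = 2                         # records per index segment (assumed)
    store: dict = field(default_factory=dict)     # stands in for the distributed file system
    segments: list = field(default_factory=list)  # one inverted index per segment

    def archive(self, user, records):
        # Step 1: accept the request only from an authenticated user.
        if user not in self.authorized_users:
            raise PermissionError(f"user {user!r} is not authenticated")
        # Steps 2-3: transfer and load. A real system would use a distributed
        # copy and a bulk load here; this sketch just writes to a dict.
        start = len(self.store)
        for offset, text in enumerate(records):
            self.store[start + offset] = text
        # Step 4: index the loaded records, one inverted index per segment.
        ids = list(range(start, start + len(records)))
        for i in range(0, len(ids), self.segment_size):
            segment = {}
            for rid in ids[i:i + self.segment_size]:
                for token in self.store[rid].lower().split():
                    segment.setdefault(token, set()).add(rid)
            self.segments.append(segment)
        return ids

    def search(self, token):
        # Query every segment and merge the hits.
        hits = set()
        for segment in self.segments:
            hits |= segment.get(token.lower(), set())
        return sorted(hits)
```

In this sketch, archiving three records with `segment_size=2` produces two index segments, and `search` consults each segment in turn. Keeping one index per segment means newly archived data can be indexed without rebuilding the indices of earlier segments, which is one plausible reading of why the abstract segments the index.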