IN2013MU03472A - - Google Patents

Info

Publication number
IN2013MU03472A
IN2013MU03472A IN3472MU2013A IN2013MU03472A IN 2013MU03472 A IN2013MU03472 A IN 2013MU03472A IN 3472MU2013 A IN3472MU2013 A IN 3472MU2013A IN 2013MU03472 A IN2013MU03472 A IN 2013MU03472A
Authority
IN
India
Prior art keywords
file
indexing
segments
index
nodes
Prior art date
Application number
Inventor
Arun Vasu
Jishnu Kurunthala
Original Assignee
Tata Consultancy Services Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tata Consultancy Services Ltd filed Critical Tata Consultancy Services Ltd
Priority to IN3472MU2013 priority Critical patent/IN2013MU03472A/en
Priority to US14/498,598 priority patent/US9846702B2/en
Publication of IN2013MU03472A publication Critical patent/IN2013MU03472A/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures

Abstract

ABSTRACT INDEXING OF FILE IN A HADOOP CLUSTER A file indexing system (102) for indexing a file to be stored onto a distributed file system (104) includes a segmentation module (122) to segment the file into a plurality of segments. The file indexing system (102) further includes an index generation module (124) to initiate indexing of the file through a plurality of nodes of a Hadoop cluster, where each of the plurality of nodes indexes one or more segments from amongst the plurality of segments to generate at least one index corresponding to the one or more segments. The file indexing system (102) further includes an index transfer module (126) to store the at least one index onto the distributed file system (104). <To be published with Figure 1>
IN3472MU2013 2013-10-31 2013-10-31 IN2013MU03472A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
IN3472MU2013 IN2013MU03472A (en) 2013-10-31 2013-10-31
US14/498,598 US9846702B2 (en) 2013-10-31 2014-09-26 Indexing of file in a hadoop cluster

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
IN3472MU2013 IN2013MU03472A (en) 2013-10-31 2013-10-31

Publications (1)

Publication Number Publication Date
IN2013MU03472A true IN2013MU03472A (en) 2015-07-24

Family

ID=52996626

Family Applications (1)

Application Number Title Priority Date Filing Date
IN3472MU2013 IN2013MU03472A (en) 2013-10-31 2013-10-31

Country Status (2)

Country Link
US (1) US9846702B2 (en)
IN (1) IN2013MU03472A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294721A (en) * 2016-08-08 2017-01-04 无锡天脉聚源传媒科技有限公司 A kind of company-data statistics and deriving method and device

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104834730B (en) * 2015-05-15 2018-06-01 北京京东尚科信息技术有限公司 data analysis system and method
US9961068B2 (en) 2015-07-21 2018-05-01 Bank Of America Corporation Single sign-on for interconnected computer systems
CN105354251B (en) * 2015-10-19 2018-10-30 国家电网公司 Electric power cloud data management indexing means based on Hadoop in electric system
CN105868253A (en) * 2015-12-23 2016-08-17 乐视网信息技术(北京)股份有限公司 Data importing and query methods and apparatuses
CN105740727A (en) * 2016-02-02 2016-07-06 上海斐讯数据通信技术有限公司 Distributed storage method and system of private data
US20200126010A1 (en) * 2016-06-15 2020-04-23 Solix Technologies, Inc. Enterprise Business Record Management System
CN106294842A (en) * 2016-08-19 2017-01-04 浪潮(北京)电子信息产业有限公司 A kind of data interactive method, platform and distributed file system
CN106487582A (en) * 2016-09-21 2017-03-08 努比亚技术有限公司 A kind of method and apparatus of deployment search server
CN106776929A (en) * 2016-11-30 2017-05-31 北京锐安科技有限公司 A kind of method for information retrieval and device
CN106649800A (en) * 2016-12-29 2017-05-10 南威软件股份有限公司 Solr-based Chinese search method
CN106844700A (en) * 2017-02-03 2017-06-13 山东浪潮商用系统有限公司 It is a kind of to ask tax system based on Sorl
CN107066595A (en) * 2017-04-19 2017-08-18 济南浪潮高新科技投资发展有限公司 A kind of many application searches method of servicing of big data and system
CN107273515A (en) * 2017-06-21 2017-10-20 国网内蒙古东部电力有限公司信息通信分公司 The retrieval of electric network data asset source and displaying based on polymorphic data directory technology
US10936681B2 (en) * 2017-08-03 2021-03-02 International Business Machines Corporation Generalized search engine for abstract data types with skimming and approximate retrieval
WO2019113197A1 (en) 2017-12-05 2019-06-13 Walmart Apollo, Llc System and method for an index search engine
US11392544B2 (en) * 2018-02-06 2022-07-19 Samsung Electronics Co., Ltd. System and method for leveraging key-value storage to efficiently store data and metadata in a distributed file system
US11748495B2 (en) * 2018-11-28 2023-09-05 Jpmorgan Chase Bank, N.A. Systems and methods for data usage monitoring in multi-tenancy enabled HADOOP clusters
US11294938B2 (en) 2019-01-03 2022-04-05 International Business Machines Corporation Generalized distributed framework for parallel search and retrieval of unstructured and structured patient data across zones with hierarchical ranking
CN109766360A (en) * 2019-01-09 2019-05-17 北京一览群智数据科技有限责任公司 A kind of list screening method and device
CN110297971B (en) * 2019-05-30 2022-09-20 百度在线网络技术(北京)有限公司 Personalized resource retrieval method, device, equipment and computer readable storage medium
US20220277054A1 (en) * 2021-02-26 2022-09-01 State Farm Mutual Automobile Insurance Company Data migration of search indexes across search-engine deployments

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008085204A2 (en) * 2006-12-29 2008-07-17 Prodea Systems, Inc. Demarcation between application service provider and user in multi-services gateway device at user premises
US8082258B2 (en) * 2009-02-10 2011-12-20 Microsoft Corporation Updating an inverted index in a real time fashion
US20110196854A1 (en) * 2010-02-05 2011-08-11 Sarkar Zainul A Providing a www access to a web page
US20120030018A1 (en) * 2010-07-28 2012-02-02 Aol Inc. Systems And Methods For Managing Electronic Content
US8650159B1 (en) * 2010-08-26 2014-02-11 Symantec Corporation Systems and methods for managing data in cloud storage using deduplication techniques
US9092151B1 (en) * 2010-09-17 2015-07-28 Permabit Technology Corporation Managing deduplication of stored data
CN103620591A (en) * 2011-06-14 2014-03-05 惠普发展公司,有限责任合伙企业 Deduplication in distributed file systems
US20150112996A1 (en) * 2013-10-23 2015-04-23 Microsoft Corporation Pervasive search architecture

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294721A (en) * 2016-08-08 2017-01-04 无锡天脉聚源传媒科技有限公司 A kind of company-data statistics and deriving method and device

Also Published As

Publication number Publication date
US9846702B2 (en) 2017-12-19
US20150120695A1 (en) 2015-04-30

Similar Documents

Publication Publication Date Title
IN2013MU03472A (en)
IN2015DN03160A (en)
SG10201906917QA (en) Processing data from multiple sources
IL252772A0 (en) Generating card stacks with queries on online social networks
MX356565B (en) Modifying structured search queries on online social networks.
PH12016500957A1 (en) Data management for connected devices
MX347812B (en) Using inverse operators for queries on online social networks.
WO2014179145A3 (en) Drive level encryption key management in a distributed storage system
MX353716B (en) Structured search queries based on social-graph information.
NZ754204A (en) Object tracking system optimization and tools
CL2015003348A1 (en) Hybrid power / fiber cable
JP2014096164A5 (en)
WO2014165439A3 (en) Automated storage and retrieval system and control system thereof
MX369047B (en) Systems and methods for mapping and routing based on clustering.
SA515360346B1 (en) Method for operating an arrangement for storing thermal energy
GB2525788A (en) Data synchronization
IN2012DE01073A (en)
GB2514275A (en) Identifying and ranking solutions from multiple data sources
WO2015038508A3 (en) Techniques to manage color representations for a digital map
IN2013MU03094A (en)
ES2722408T3 (en) A wind power plant, and a method to increase the reactive power capacity of a wind power plant
WO2015167427A3 (en) Data distribution based on network information
GB2530454A (en) Optimization of instruction groups across group boundaries
BR112015029297A2 (en) passive distribution system used fiber indexing
MX356937B (en) Contact aggregation in a social network.