WO2001031512A3 - Fast indexing of web objects - Google Patents

Fast indexing of web objects Download PDF

Info

Publication number
WO2001031512A3
WO2001031512A3 PCT/US2000/041334 US0041334W WO0131512A3 WO 2001031512 A3 WO2001031512 A3 WO 2001031512A3 US 0041334 W US0041334 W US 0041334W WO 0131512 A3 WO0131512 A3 WO 0131512A3
Authority
WO
WIPO (PCT)
Prior art keywords
bits
linked list
cache server
unique set
substantially unique
Prior art date
Application number
PCT/US2000/041334
Other languages
French (fr)
Other versions
WO2001031512A2 (en
WO2001031512A9 (en
Inventor
Nitin S Sonawane
William J Carpenter
David J Yates
Abdelsalam A Heddaya
Original Assignee
Infolibria Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Infolibria Inc filed Critical Infolibria Inc
Priority to AU27460/01A priority Critical patent/AU2746001A/en
Publication of WO2001031512A2 publication Critical patent/WO2001031512A2/en
Publication of WO2001031512A3 publication Critical patent/WO2001031512A3/en
Publication of WO2001031512A9 publication Critical patent/WO2001031512A9/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9014Indexing; Data structures therefor; Storage structures hash tables

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A cache server is used to store objects such as web pages transferred over a network system. Requests for the stored objects, in the form of URLs, are processed by a computer to determine if the object resides in the cache server. In particular, a URL character string is converted to a substantially unique set of bits using a hashing function. A first part of the substantially unique set of bits is used to identify a linked list to be searched while a second part of the substantially unique set of bits is used to identify a target index pointer within the linked list. Based upon a located target index pointer in the linked list corresponding to the URL character string, a desired web page or document object is quickly retrieved from the cache server.
PCT/US2000/041334 1999-10-25 2000-10-20 Fast indexing of web objects WO2001031512A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU27460/01A AU2746001A (en) 1999-10-25 2000-10-20 Fast indexing of web objects

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US42644399A 1999-10-25 1999-10-25
US09/426,443 1999-10-25

Publications (3)

Publication Number Publication Date
WO2001031512A2 WO2001031512A2 (en) 2001-05-03
WO2001031512A3 true WO2001031512A3 (en) 2002-06-13
WO2001031512A9 WO2001031512A9 (en) 2002-11-14

Family

ID=23690827

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/041334 WO2001031512A2 (en) 1999-10-25 2000-10-20 Fast indexing of web objects

Country Status (2)

Country Link
AU (1) AU2746001A (en)
WO (1) WO2001031512A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9116812B2 (en) 2012-01-27 2015-08-25 Intelligent Intellectual Property Holdings 2 Llc Systems and methods for a de-duplication cache

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2418999A (en) 2004-09-09 2006-04-12 Surfcontrol Plc Categorizing uniform resource locators
US9654495B2 (en) 2006-12-01 2017-05-16 Websense, Llc System and method of analyzing web addresses
WO2008070803A1 (en) 2006-12-06 2008-06-12 Fusion Multisystems, Inc. (Dba Fusion-Io) Apparatus, system, and method for managing data from a requesting device with an empty data token directive
US8151082B2 (en) 2007-12-06 2012-04-03 Fusion-Io, Inc. Apparatus, system, and method for converting a storage request into an append data storage command
US9104599B2 (en) 2007-12-06 2015-08-11 Intelligent Intellectual Property Holdings 2 Llc Apparatus, system, and method for destaging cached data
US8161353B2 (en) * 2007-12-06 2012-04-17 Fusion-Io, Inc. Apparatus, system, and method for validating that a correct data segment is read from a data storage device
GB2445764A (en) * 2007-01-22 2008-07-23 Surfcontrol Plc Resource access filtering system and database structure for use therewith
US9519540B2 (en) 2007-12-06 2016-12-13 Sandisk Technologies Llc Apparatus, system, and method for destaging cached data
US7836226B2 (en) 2007-12-06 2010-11-16 Fusion-Io, Inc. Apparatus, system, and method for coordinating storage requests in a multi-processor/multi-thread environment
WO2011031796A2 (en) 2009-09-08 2011-03-17 Fusion-Io, Inc. Apparatus, system, and method for caching data on a solid-state storage device
US8429436B2 (en) 2009-09-09 2013-04-23 Fusion-Io, Inc. Apparatus, system, and method for power reduction in a storage device
US9223514B2 (en) 2009-09-09 2015-12-29 SanDisk Technologies, Inc. Erase suspend/resume for memory
US8984216B2 (en) 2010-09-09 2015-03-17 Fusion-Io, Llc Apparatus, system, and method for managing lifetime of a storage device
US10817421B2 (en) 2010-12-13 2020-10-27 Sandisk Technologies Llc Persistent data structures
US9208071B2 (en) 2010-12-13 2015-12-08 SanDisk Technologies, Inc. Apparatus, system, and method for accessing memory
US9047178B2 (en) 2010-12-13 2015-06-02 SanDisk Technologies, Inc. Auto-commit memory synchronization
EP2652623B1 (en) 2010-12-13 2018-08-01 SanDisk Technologies LLC Apparatus, system, and method for auto-commit memory
US9218278B2 (en) 2010-12-13 2015-12-22 SanDisk Technologies, Inc. Auto-commit memory
US10817502B2 (en) 2010-12-13 2020-10-27 Sandisk Technologies Llc Persistent memory management
WO2012106362A2 (en) 2011-01-31 2012-08-09 Fusion-Io, Inc. Apparatus, system, and method for managing eviction of data
US9003104B2 (en) 2011-02-15 2015-04-07 Intelligent Intellectual Property Holdings 2 Llc Systems and methods for a file-level cache
US9201677B2 (en) 2011-05-23 2015-12-01 Intelligent Intellectual Property Holdings 2 Llc Managing data input/output operations
US8874823B2 (en) 2011-02-15 2014-10-28 Intellectual Property Holdings 2 Llc Systems and methods for managing data input/output operations
WO2012116369A2 (en) 2011-02-25 2012-08-30 Fusion-Io, Inc. Apparatus, system, and method for managing contents of a cache
US9767032B2 (en) 2012-01-12 2017-09-19 Sandisk Technologies Llc Systems and methods for cache endurance
US9251052B2 (en) 2012-01-12 2016-02-02 Intelligent Intellectual Property Holdings 2 Llc Systems and methods for profiling a non-volatile cache having a logical-to-physical translation layer
US10102117B2 (en) 2012-01-12 2018-10-16 Sandisk Technologies Llc Systems and methods for cache and storage device coordination
US9251086B2 (en) 2012-01-24 2016-02-02 SanDisk Technologies, Inc. Apparatus, system, and method for managing a cache
US10359972B2 (en) 2012-08-31 2019-07-23 Sandisk Technologies Llc Systems, methods, and interfaces for adaptive persistence
US10019353B2 (en) 2012-03-02 2018-07-10 Longitude Enterprise Flash S.A.R.L. Systems and methods for referencing data on a storage medium
US10339056B2 (en) 2012-07-03 2019-07-02 Sandisk Technologies Llc Systems, methods and apparatus for cache transfers
US9612966B2 (en) 2012-07-03 2017-04-04 Sandisk Technologies Llc Systems, methods and apparatus for a virtual machine cache
US9842053B2 (en) 2013-03-15 2017-12-12 Sandisk Technologies Llc Systems and methods for persistent cache logging
CN109165096B (en) * 2018-08-20 2021-10-15 四川长虹电器股份有限公司 Cache utilization system and method for web cluster

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5822759A (en) * 1996-11-22 1998-10-13 Versant Object Technology Cache system

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5822759A (en) * 1996-11-22 1998-10-13 Versant Object Technology Cache system

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
CHANG D-W ET AL: "Adaptive-level memory caches on World Wide Web servers", COMPUTER NETWORKS, ELSEVIER SCIENCE PUBLISHERS B.V., AMSTERDAM, NL, vol. 32, no. 3, March 2000 (2000-03-01), pages 261 - 275, XP004304669, ISSN: 1389-1286 *
DANZIG P: "NetCache architecture and deployment", COMPUTER NETWORKS AND ISDN SYSTEMS, NORTH HOLLAND PUBLISHING. AMSTERDAM, NL, vol. 30, no. 22-23, 25 November 1998 (1998-11-25), pages 2081 - 2091, XP004152160, ISSN: 0169-7552 *
DATABASE INSPEC THE INSTITUTION OF ELECTRICAL ENGINEERS, STEVENAGE, GB; XP002190278 *
KUN-LUNG WU ET AL.: "Load balancing and hot spot relief for hash routing among a collection of proxy caches", PROC. 19TH. IEEE INT. CONF. ON DISTRIBUTED COMPUTING SYSTEMS, 31 May 1999 (1999-05-31) - 4 June 1999 (1999-06-04), pages 536 - 543, XP010340602, Retrieved from the Internet <URL:http://ieeexplore.ieee.org/iel5/6307/16865/00776556.pdf?isNumber=16865&prod=CNF&arnumber=776556&arSt=536&ared=543&arAuthor=Kun-Lung+Wu%3B+Yu%2C+P.S.> [retrieved on 20020214], DOI: doi:10.1109/ICDCS.1999.776556 *
ROSS K W: "HASH ROUTING FOR COLLECTIONS OF SHARED WEB CACHES", IEEE NETWORK, IEEE INC. NEW YORK, US, vol. 11, no. 6, 1 November 1997 (1997-11-01), pages 37 - 44, XP000737464, ISSN: 0890-8044 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9116812B2 (en) 2012-01-27 2015-08-25 Intelligent Intellectual Property Holdings 2 Llc Systems and methods for a de-duplication cache

Also Published As

Publication number Publication date
WO2001031512A2 (en) 2001-05-03
AU2746001A (en) 2001-05-08
WO2001031512A9 (en) 2002-11-14

Similar Documents

Publication Publication Date Title
WO2001031512A3 (en) Fast indexing of web objects
US6301614B1 (en) System and method for efficient representation of data set addresses in a web crawler
KR100313462B1 (en) A method of displaying searched information in distance order in web search engine
WO2000054182A8 (en) Systems, methods and computer program products for performing internet searches utilizing bookmarks
WO2007065105A3 (en) Method for tracking of non-resident pages
CA2409642A1 (en) Method and apparatus for identifying related searches in a database search system
EP1552425A4 (en) A link generation system
EP1241594A3 (en) System and method for locating pages on the world wide web and for locating documents from a network of computers
WO2002042863A3 (en) A system and process for network site fragmented search
EP0924628A3 (en) Methods and system for using web browser to search large collections of documents
WO2000005663A3 (en) Distributed computer database system and method for performing object search
CA2248911A1 (en) System and method for locating resources on a network using resource evaluations derived from electronic messages
AU2008327678A1 (en) Federated search implemented across multiple search engines
BR0016397A (en) Real time search engine
EP2267618A3 (en) Method and system for forming a keyword database for referencing physical locations
CN102411617B (en) Method for storing and inquiring a large quantity of URLs
US8392366B2 (en) Changing number of machines running distributed hyperlink database
CA2353533A1 (en) Search engine for video and graphics
DE60038307D1 (en) ASSOCIATIVE MEMORY FOR CACHE STORAGE
HUP0004164A2 (en) An internet caching system and a method and an arrangement in such a system
CN102541924B (en) A kind of caching method of retrieving information and search engine system
Chowdhary et al. Study of web page ranking algorithms: a review
CN103136294B (en) File operating method and device
Zhang et al. Efficient search in large textual collections with redundancy
WO2001004811A8 (en) A system for searching multiple job posting web sites through a single web site

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

AK Designated states

Kind code of ref document: C2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP