DE69330993D1 - Method for compressing full text indexes (Verfahren zur Komprimierung von Indizen für komplette Texte) - Google Patents
Info
- Publication number
- DE69330993D1
- Authority
- DE
- Germany
- Prior art keywords
- data
- offset
- document
- data key
- identifier
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/316—Indexing structures
- G06F16/328—Management therefor
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Software Systems (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/986,754 US5649183A (en) | 1992-12-08 | 1992-12-08 | Method for compressing full text indexes with document identifiers and location offsets |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69330993D1 (de) | 2001-11-29 |
DE69330993T2 (de) | 2002-04-04 |
Family
ID=25532705
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
DE69330993T | Expired - Lifetime | DE69330993T2 (de) | 1992-12-08 | 1993-12-08 | Method for compressing full text indexes |
Country Status (6)
Country | Link |
---|---|
US (2) | US5649183A (de) |
EP (1) | EP0601569B1 (de) |
JP (1) | JP3550173B2 (de) |
AT (1) | ATE207635T1 (de) |
CA (1) | CA2110870A1 (de) |
DE (1) | DE69330993T2 (de) |
Families Citing this family (43)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5649183A (en) * | 1992-12-08 | 1997-07-15 | Microsoft Corporation | Method for compressing full text indexes with document identifiers and location offsets |
US5870739A (en) * | 1996-09-20 | 1999-02-09 | Novell, Inc. | Hybrid query apparatus and method |
US6057790A (en) * | 1997-02-28 | 2000-05-02 | Fujitsu Limited | Apparatus and method for data compression/expansion using block-based coding with top flag |
US6029167A (en) * | 1997-07-25 | 2000-02-22 | Claritech Corporation | Method and apparatus for retrieving text using document signatures |
CN100535889C (zh) * | 1997-10-21 | 2009-09-02 | 富士通株式会社 | File processing method and data processing apparatus |
US6094649A (en) * | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
US6055526A (en) * | 1998-04-02 | 2000-04-25 | Sun Microsystems, Inc. | Data indexing technique |
US6473774B1 (en) | 1998-09-28 | 2002-10-29 | Compaq Computer Corporation | Method and apparatus for record addressing in partitioned files |
WO2000034897A1 (en) * | 1998-12-07 | 2000-06-15 | Bloodhound Software, Inc. | System and method for finding near matches among records in databases |
US6502088B1 (en) | 1999-07-08 | 2002-12-31 | International Business Machines Corporation | Method and system for improved access to non-relational databases |
US6772141B1 (en) | 1999-12-14 | 2004-08-03 | Novell, Inc. | Method and apparatus for organizing and using indexes utilizing a search decision table |
US7233942B2 (en) * | 2000-10-10 | 2007-06-19 | Truelocal Inc. | Method and apparatus for providing geographically authenticated electronic documents |
US7685224B2 (en) * | 2001-01-11 | 2010-03-23 | Truelocal Inc. | Method for providing an attribute bounded network of computers |
US6920477B2 (en) * | 2001-04-06 | 2005-07-19 | President And Fellows Of Harvard College | Distributed, compressed Bloom filter Web cache server |
WO2003017023A2 (en) * | 2001-08-14 | 2003-02-27 | Quigo Technologies, Inc. | System and method for extracting content for submission to a search engine |
US20030157470A1 (en) * | 2002-02-11 | 2003-08-21 | Michael Altenhofen | E-learning station and interface |
US20050149507A1 (en) * | 2003-02-05 | 2005-07-07 | Nye Timothy G. | Systems and methods for identifying an internet resource address |
US7613687B2 (en) * | 2003-05-30 | 2009-11-03 | Truelocal Inc. | Systems and methods for enhancing web-based searching |
US7953720B1 (en) | 2005-03-31 | 2011-05-31 | Google Inc. | Selecting the best answer to a fact query from among a set of potential answers |
US7587387B2 (en) | 2005-03-31 | 2009-09-08 | Google Inc. | User interface for facts query engine with snippets from information sources that include query terms and answer terms |
US7386570B2 (en) * | 2005-03-31 | 2008-06-10 | International Business Machines Corporation | Method, system and program product for providing high performance data lookup |
US8239394B1 (en) | 2005-03-31 | 2012-08-07 | Google Inc. | Bloom filters for query simulation |
US8538969B2 (en) * | 2005-06-03 | 2013-09-17 | Adobe Systems Incorporated | Data format for website traffic statistics |
US8027876B2 (en) * | 2005-08-08 | 2011-09-27 | Yoogli, Inc. | Online advertising valuation apparatus and method |
US8429167B2 (en) * | 2005-08-08 | 2013-04-23 | Google Inc. | User-context-based search engine |
US20070185870A1 (en) | 2006-01-27 | 2007-08-09 | Hogue Andrew W | Data object visualization using graphs |
US8055674B2 (en) * | 2006-02-17 | 2011-11-08 | Google Inc. | Annotation framework |
US8954426B2 (en) | 2006-02-17 | 2015-02-10 | Google Inc. | Query language |
US7925676B2 (en) | 2006-01-27 | 2011-04-12 | Google Inc. | Data object visualization using maps |
US8688485B2 (en) * | 2006-07-06 | 2014-04-01 | Google Inc. | Low fare search for ticket changes using married segment indicators |
US8954412B1 (en) | 2006-09-28 | 2015-02-10 | Google Inc. | Corroborating facts in electronic documents |
US8122026B1 (en) | 2006-10-20 | 2012-02-21 | Google Inc. | Finding and disambiguating references to entities on web pages |
US8321485B2 (en) | 2006-11-08 | 2012-11-27 | Hitachi, Ltd. | Device and method for constructing inverted indexes |
US8347202B1 (en) | 2007-03-14 | 2013-01-01 | Google Inc. | Determining geographic locations for place names in a fact repository |
US20080277314A1 (en) * | 2007-05-08 | 2008-11-13 | Halsey Richard B | Olefin production utilizing whole crude oil/condensate feedstock and hydrotreating |
US8239751B1 (en) | 2007-05-16 | 2012-08-07 | Google Inc. | Data from web documents in a spreadsheet |
US8166041B2 (en) * | 2008-06-13 | 2012-04-24 | Microsoft Corporation | Search index format optimizations |
US9135277B2 (en) | 2009-08-07 | 2015-09-15 | Google Inc. | Architecture for responding to a visual query |
US9087059B2 (en) | 2009-08-07 | 2015-07-21 | Google Inc. | User interface for presenting search results for multiple regions of a visual query |
US9092507B2 (en) * | 2013-01-15 | 2015-07-28 | Marklogic Corporation | Apparatus and method for computing n-way co-occurrences of data tuples in scalar indexes |
US9087055B2 (en) | 2013-01-28 | 2015-07-21 | International Business Machines Corporation | Segmenting documents within a full text index |
US10437911B2 (en) * | 2013-06-14 | 2019-10-08 | Business Objects Software Ltd. | Fast bulk z-order for graphic elements |
US10860571B2 (en) * | 2016-11-04 | 2020-12-08 | Sap Se | Storage and pruning for faster access of a document store |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5827240A (ja) * | 1981-08-10 | 1983-02-17 | Nippon Telegr & Teleph Corp <Ntt> | File storage system |
US4606002A (en) * | 1983-05-02 | 1986-08-12 | Wang Laboratories, Inc. | B-tree structured data base using sparse array bit maps to store inverted lists |
JPS607557A (ja) * | 1983-06-27 | 1985-01-16 | Fujitsu Ltd | Segmented compression method for character-type data |
JPS60247732A (ja) * | 1984-05-23 | 1985-12-07 | Nec Corp | Data storage device |
US5062074A (en) * | 1986-12-04 | 1991-10-29 | Tnet, Inc. | Information retrieval system and method |
US5201048A (en) * | 1988-12-01 | 1993-04-06 | Axxess Technologies, Inc. | High speed computer system for search and retrieval of data within text and record oriented files |
JP2969153B2 (ja) * | 1990-06-29 | 1999-11-02 | カシオ計算機株式会社 | Record retrieval method |
US5321833A (en) * | 1990-08-29 | 1994-06-14 | Gte Laboratories Incorporated | Adaptive ranking system for information retrieval |
US5313604A (en) * | 1990-11-13 | 1994-05-17 | Hewlett-Packard Company | Method for locating compressed data in a computed memory back up device including steps of refining estimater location |
DE69231113T2 (de) * | 1991-04-08 | 2001-03-01 | Koninklijke Philips Electronics N.V., Eindhoven | Storage method for bibliographic information about data from a finite text source, and in particular document postings for use in a full-text document retrieval system |
US5488725A (en) * | 1991-10-08 | 1996-01-30 | West Publishing Company | System of document representation retrieval by successive iterated probability sampling |
US5375235A (en) * | 1991-11-05 | 1994-12-20 | Northern Telecom Limited | Method of indexing keywords for searching in a database recorded on an information recording medium |
JPH05257774A (ja) * | 1992-03-10 | 1993-10-08 | Fujitsu Ltd | Information retrieval apparatus storing compressed index record numbers |
US5440481A (en) * | 1992-10-28 | 1995-08-08 | The United States Of America As Represented By The Secretary Of The Navy | System and method for database tomography |
US5649183A (en) * | 1992-12-08 | 1997-07-15 | Microsoft Corporation | Method for compressing full text indexes with document identifiers and location offsets |
1992
- 1992-12-08: US, application US07/986,754, patent US5649183A (en), Expired - Lifetime
1993
- 1993-12-07: CA, application CA002110870A, patent CA2110870A1 (en), Abandoned
- 1993-12-08: DE, application DE69330993T, patent DE69330993T2 (de), Expired - Lifetime
- 1993-12-08: EP, application EP93119811A, patent EP0601569B1 (de), Expired - Lifetime
- 1993-12-08: JP, application JP30806093A, patent JP3550173B2 (ja), Expired - Fee Related
- 1993-12-08: AT, application AT93119811T, patent ATE207635T1 (de), IP Right Cessation
1997
- 1997-03-28: US, application US08/829,461, patent US5832479A (en), Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
EP0601569A1 (de) | 1994-06-15 |
EP0601569B1 (de) | 2001-10-24 |
DE69330993T2 (de) | 2002-04-04 |
JPH06243009A (ja) | 1994-09-02 |
JP3550173B2 (ja) | 2004-08-04 |
CA2110870A1 (en) | 1994-06-09 |
US5649183A (en) | 1997-07-15 |
ATE207635T1 (de) | 2001-11-15 |
US5832479A (en) | 1998-11-03 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
DE69330993D1 (de) | Method for compressing full text indexes | |
ATE377220T1 (de) | Method for generating passwords from biometric data | |
AU2599495A (en) | Apparatus and method for event correlation and problem reporting | |
DE60130430D1 (de) | Method and apparatus for information processing | |
ATE378643T1 (de) | Index structure of metadata, method for providing indices of metadata, and metadata search method and apparatus using the indices of metadata | |
DE69621940D1 (en) | Protein/(poly)peptide libraries | |
DE3673080D1 (de) | Method for transmitting audio information and additional information in digital form | |
DE3884438D1 (de) | Method for producing abrasion-resistant polycarbonate articles | |
DE3581501D1 (de) | Method for producing diamond-like carbon layers | |
ATE336119T1 (de) | Apparatus and method for embedding and recovering information in analog signals using distributed signal features | |
DE3577208D1 (de) | Method for producing flame-retardant polycarbonate molding compounds containing tetrafluoroethylene polymer | |
DE3778292D1 (de) | Optically readable record carrier for recording information, method and apparatus for manufacturing such a record carrier, apparatus for recording information on such a record carrier, and apparatus for reproducing information recorded on such a record carrier | |
ATE276607T1 (de) | Method and apparatus for encoding and decoding messages | |
ATE56555T1 (de) | System for identifying an object of value | |
DE3882452D1 (de) | Method for producing articles from carbon/carbon fibers | |
WO1995022230A3 (en) | A method and a system for identifying call records | |
ATE172040T1 (de) | Computer systems with a process database and methods for use in these systems | |
DE3774035D1 (de) | Method for producing pitch usable for the manufacture of carbon bodies | |
DE3860713D1 (de) | Method for producing polycarbonate/polyepoxide compositions | |
ATE107822T1 (de) | Method for the digital and/or analog coding of information of one, two or more channels and/or frequency or bandwidth reduction and/or increasing transmission reliability | |
DE59804319D1 (de) | Method for producing card-shaped data carriers | |
ATE196560T1 (de) | Method for converting spoken input information into machine-readable data | |
DE3683622D1 (de) | Method for producing high-viscosity lubricating oils | |
DE3786846D1 (de) | Method for producing polycarbonate resin | |
DE3676908D1 (de) | Method for obtaining nuclear magnetic resonance data information | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 8364 | No opposition during term of opposition | |