CA2253744A1 - Indexing databases for efficient relational querying - Google Patents

Indexing databases for efficient relational querying Download PDF

Info

Publication number
CA2253744A1
CA2253744A1 CA002253744A CA2253744A CA2253744A1 CA 2253744 A1 CA2253744 A1 CA 2253744A1 CA 002253744 A CA002253744 A CA 002253744A CA 2253744 A CA2253744 A CA 2253744A CA 2253744 A1 CA2253744 A1 CA 2253744A1
Authority
CA
Canada
Prior art keywords
source data
data
file
index files
index
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002253744A
Other languages
French (fr)
Other versions
CA2253744C (en
Inventor
Mariano Paulo Consens
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Progress Software Corp
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CA002253744A priority Critical patent/CA2253744C/en
Publication of CA2253744A1 publication Critical patent/CA2253744A1/en
Application granted granted Critical
Publication of CA2253744C publication Critical patent/CA2253744C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2237Vectors, bitmaps or matrices

Abstract

A computer system for indexing databases for efficient relational querying. A set of index files is created to index an entire set of structured or semi-structured source data. The source data is tokenized. During the tokenization of the source data the source data is manipulated in accordance with the relational data base model. Tokens are generated in the index files to represent the table and the column in the source data for following value tokens. Index files are created to map into the position of data in the source file, and index files which represent the data as sorted lexicographically. The index files include a sort vector file and an associated join bit file. The system provides for efficient relational queries across the entire source data by use of the sort vector file, the join bit file, and related inverted files.
CA002253744A 1998-11-10 1998-11-10 Indexing databases for efficient relational querying Expired - Lifetime CA2253744C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA002253744A CA2253744C (en) 1998-11-10 1998-11-10 Indexing databases for efficient relational querying

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CA002253744A CA2253744C (en) 1998-11-10 1998-11-10 Indexing databases for efficient relational querying

Publications (2)

Publication Number Publication Date
CA2253744A1 true CA2253744A1 (en) 2000-05-10
CA2253744C CA2253744C (en) 2004-08-24

Family

ID=29425632

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002253744A Expired - Lifetime CA2253744C (en) 1998-11-10 1998-11-10 Indexing databases for efficient relational querying

Country Status (1)

Country Link
CA (1) CA2253744C (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2409078A (en) * 2001-02-26 2005-06-15 Ori Software Dev Ltd Encoding semi-structured data for efficient search and browsing
GB2389690B (en) * 2001-02-26 2005-09-07 Ori Software Dev Ltd Encoding semi-structured data for efficient search and browsing
WO2013148853A1 (en) * 2012-03-29 2013-10-03 The Echo Nest Corporation Real time mapping of user models to an inverted data index for retrieval, filtering and recommendation
US9158754B2 (en) 2012-03-29 2015-10-13 The Echo Nest Corporation Named entity extraction from a block of text
US9406072B2 (en) 2012-03-29 2016-08-02 Spotify Ab Demographic and media preference prediction using media content data analysis
US9547679B2 (en) 2012-03-29 2017-01-17 Spotify Ab Demographic and media preference prediction using media content data analysis
US9798823B2 (en) 2015-11-17 2017-10-24 Spotify Ab System, methods and computer products for determining affinity to a content creator
CN110383263A (en) * 2017-03-20 2019-10-25 国际商业机器公司 The creation cognition intelligence inquiry from multiple data corpus

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2409078A (en) * 2001-02-26 2005-06-15 Ori Software Dev Ltd Encoding semi-structured data for efficient search and browsing
GB2409078B (en) * 2001-02-26 2005-09-07 Ori Software Dev Ltd Encoding semi-structured data for efficient search and browsing
GB2389690B (en) * 2001-02-26 2005-09-07 Ori Software Dev Ltd Encoding semi-structured data for efficient search and browsing
US8065308B2 (en) 2001-02-26 2011-11-22 Ori Software Development Ltd. Encoding semi-structured data for efficient search and browsing
US8489597B2 (en) 2001-02-26 2013-07-16 Ori Software Development Ltd. Encoding semi-structured data for efficient search and browsing
US9158754B2 (en) 2012-03-29 2015-10-13 The Echo Nest Corporation Named entity extraction from a block of text
WO2013148853A1 (en) * 2012-03-29 2013-10-03 The Echo Nest Corporation Real time mapping of user models to an inverted data index for retrieval, filtering and recommendation
US9406072B2 (en) 2012-03-29 2016-08-02 Spotify Ab Demographic and media preference prediction using media content data analysis
US9547679B2 (en) 2012-03-29 2017-01-17 Spotify Ab Demographic and media preference prediction using media content data analysis
US9600466B2 (en) 2012-03-29 2017-03-21 Spotify Ab Named entity extraction from a block of text
US10002123B2 (en) 2012-03-29 2018-06-19 Spotify Ab Named entity extraction from a block of text
US10459904B2 (en) 2012-03-29 2019-10-29 Spotify Ab Real time mapping of user models to an inverted data index for retrieval, filtering and recommendation
US9798823B2 (en) 2015-11-17 2017-10-24 Spotify Ab System, methods and computer products for determining affinity to a content creator
US11210355B2 (en) 2015-11-17 2021-12-28 Spotify Ab System, methods and computer products for determining affinity to a content creator
CN110383263A (en) * 2017-03-20 2019-10-25 国际商业机器公司 The creation cognition intelligence inquiry from multiple data corpus

Also Published As

Publication number Publication date
CA2253744C (en) 2004-08-24

Similar Documents

Publication Publication Date Title
EP1164510A3 (en) Virtual join index for relational databases
EP1164509A3 (en) Aggregate join index for relational databases
AU3092099A (en) Perspective transformations on relational database tables
ES2132922T3 (en) COMPUTER SYSTEM FOR THE CREATION OF MODELS OF SEMANTIC OBJECTS FROM EXISTING DATABASES OF RELATIONAL DATABASES.
WO2001042881A3 (en) System and method for the storage, indexing and retrieval of xml documents using relational databases
DE60130475D1 (en) IMPLEMENTATION OF CALCULATIONS OF THE TABLE CALCULATION TYPE IN A DATABASE SYSTEM
CA2198735A1 (en) Parallel processing system for traversing a data base
MXPA05010224A (en) System and method utilizing virtual folders.
MXPA05005209A (en) Method and system for indexing and searching databases.
WO2006009822A3 (en) Integrated database indexing system
SE0004043D0 (en) Method and apparatus for document indexing and searching
WO2002027546A3 (en) Database annotation and retrieval
NZ326982A (en) Evaluation of the content of a data set using multiple and/or complex queries
JPS6448129A (en) Information memory/operator
EP0350208A3 (en) Real-time database
CA2253744A1 (en) Indexing databases for efficient relational querying
WO2001095154A8 (en) Object-relational mapping for tables without primary keys
EP0336579A3 (en) Relational databases
WO2002069202A3 (en) Method for determining synthetic term senses using reference text
Tenopir Database selection tools
Krivda Full steam ahead.
SE8700770D0 (en) METHOD FOR QUERY HANDLING IN A DATABASE SYSTEM
Geiger et al. LEVEL CROSSINGS AND THE PANUM AREA.
Shieh et al. A tree-based inverted file for fast ranked-document retrieval
Levy et al. Challenges for global information systems

Legal Events

Date Code Title Description
EEER Examination request
MKEX Expiry

Effective date: 20181113