CA2550818C - System and method of providing relational set operations for olap data sources - Google Patents

System and method of providing relational set operations for olap data sources Download PDF

Info

Publication number
CA2550818C
CA2550818C CA2550818A CA2550818A CA2550818C CA 2550818 C CA2550818 C CA 2550818C CA 2550818 A CA2550818 A CA 2550818A CA 2550818 A CA2550818 A CA 2550818A CA 2550818 C CA2550818 C CA 2550818C
Authority
CA
Canada
Prior art keywords
query
olap
data
relational
provider
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA2550818A
Other languages
French (fr)
Other versions
CA2550818A1 (en
Inventor
Soufiane Azizi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SAP SE
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CA002519015A external-priority patent/CA2519015A1/en
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Priority to CA2550818A priority Critical patent/CA2550818C/en
Publication of CA2550818A1 publication Critical patent/CA2550818A1/en
Application granted granted Critical
Publication of CA2550818C publication Critical patent/CA2550818C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • G06F16/24534Query rewriting; Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Operations Research (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A tabular set provider is provided for processing a data agnostic business intelligence query with relational set operations, herein called a relational query against a multidimensional datasource. The tabular set provider comprises a query interceptor, a query decomposes, an OLAP query provider interface, and an operation module. The query interceptor is provided for detecting and intercepting a data agnostic Business intelligence query that has relational set operations that is issued against one or more multidimensional data sources. The query decomposes is provided for decomposing the operands of the set operation in the input query into one or more sub-queries. The OLAP query provider interface is provided for sending the OLAP supported sub- queries to an OLAP query provider and receiving results. The operation module is provided for using set operation functionality provided by a relational query provider that performs the set operations locally on the results of the OLAP data extraction.

Description

System and Method of Providing Relational Set Operations for OLAP Sources FIELD OF INVENTION
[0001 ) The present invention relates generally to a system and method for processing high level user requests containing one or more queries, and in particular to a system and method of providing relational set operations for online analytical processing (OLAP) sources.
BACKGROUND OF THE INVENTION
[0002) Many organizations use data stores for storing business data, such as financial data and operational data. In order to assist business users to examine their data, various data analyzing applications are proposed. Those data analyzing applications provide various views or reports of data to users. Those data analyzing applications typically have query engines that access the data stores to obtain desired data. The accessed data stores can be either relational or multidimensional stores, which are based on fundamentally different data storage technologies.
[0003] Those data analyzing applications issue requests of data from the data stores. A
request may contain data agnostic business intelligence (BI) queries that may express algebraic set operations, such us union, except, and intersect, which are relational style operations. BI users can make such computations on relational data. There is a need for the ability to do the same computation natively on OLAP or dimensional data.
Presently, users have had to extract dimensional data, transform, and load it (ETL) into relational database management systems (RDBMS) that provide such computation using the SQL
query language. This ETL route is time consuming, requires the interventions of IT
departments in large organizations, is error prone, and is inefficient compared to being able to perform such computation in the BI system directly.
[0004] Some data analyzing applications have Online Analytical Processing (OLAP) query engines to allow users to analyze multidimensional views of data. This type of OLAP is sometimes called Multidimensional OLAP (MOLAP). A MOLAP engine summarizes business data into multidimensional views in advance, and places the summarized data in a cube structure. When a user request is received, the MOLAP engine accesses the summarized data, and thus the MOLAP engine can provide a response to the query very fast. The user can rotate the cube structured data to see a desired view of the data using the MOLAP engine.
[0005] There also exist Relational OLAP (ROLAP) query engines that extract data from traditional relational databases. ROLAP engines are able to create multidimensional views on the fly. In order to extract data, those ROLAP engines typically use complex Structured Query Language (SQL) statements against relational tables in the relational databases.
ROLAP engines tend to be used on data that has a large number of attributes, where the data cannot be easily placed into a cube structure. ROLAP engines support multidimensional queries issued against relational databases. Some ROLAP
engines translate OLAP queries into SQL queries, and other ROLAP query engines implement the access to relational databases using internal communication between components responsible for OLAP and relational operations.
[0006] Both MOLAP and ROLAP approaches to the multidimensional data access, even though they use different data storage technologies, provide only the functionality of the multidimensional query language. This query language has a number of significant limitations when it comes to providing relational style set operations, such us the union, except, and intersect operations between set operands.
[0007] On the other side is the relational query language, SQL, providing a powerful set of operations that manipulate data in accordance with the relational algebra.
This set of operations includes the relational style operations that are not natively supported by the multidimensional technologies, including the union, except, and intersect operations between set operands. While SQL is ideal for processing transactional data, it has a number of significant limitations when it comes to data analysis and reporting.

SUMMARY OF THE INVENTION
[0008] The present invention intercepts data agnostic business intelligence (BI) query that has relational style set operations that is issued against one or more multidimensional data sources, and decomposes it into sub-queries to execute the query.
[0009] It is an object of the invention to provide an improved system and method for decomposition of a data agnostic business intelligence query against one or more multidimensional data sources.
[0010] According to the present invention there is provided a tabular set provider for processing a data agnostic business intelligence query that has relational style set operations that are not supported natively by the underlying multidimensional data sources.
The tabular set provider comprises a query interceptor, a query decomposer, an OLAP
query provider interface, and an operation module. The query interceptor is provided for detecting and intercepting a data agnostic Business intelligence query that has relational set operations that is issued against one or more multidimensional data sources.
The query decomposer is provided for decomposing the operands of the set operation in the input query into one or more sub-queries. The OLAP query provider interface is provided for sending the OLAP supported sub-queries to an OLAP query provider and receiving results.
The operation module is provided for using set operation functionality provided by a relational query provider that performs the set operations locally on the results of the OLAP data extraction.
[001 I ] In accordance with another aspect of the present application, there is provided a query decomposer for decomposing data agnostic business intelligence queries that contain relational set operations. The query decomposer comprises a set operation identifier, an OLAP sub-queries extractor, a relational set operation creator, and a post-processing query creator. The set operation identifier analyzes the incoming data agnostic query in order to identify sub-queries that represent the operands of a query set operation against OLAP data sources. The OLAP sub-queries extractor extracts the operands of the set operation; each operand represents a query that is sent to the OLAP query provider for preparation and execution. The relational set operation creator creates a plan to perform the set operation on results received from the OLAP query provider using services provided by the relational query provider. The post-processing query creator builds a plan that outlines all operations that need to be applied to the results of the execution of the relational query provider of the set operation. This post-processing phase of the query decomposition is important as it assures that the results set that is sent to the Data Analyzing System is cast in terms of the original query that was issued by this system and not in terms of decomposed sub-queries.
[0012] In accordance with another aspect of the present application, there is provided a method of processing a data agnostic business intelligence query against one or more data sources. The method comprises steps of detecting and intercepting a data agnostic business intelligence query that is issued against one or more multidimensional data sources; decomposing operands of a query operation source of the data agnostic business intelligence query into one or more OLAP sub-queries; and replacing the data agnostic business intelligence query with a stack of operations expressed by the decomposed OLAP
sub-queries and the relational set operation query for execution.
[0013] In accordance with another aspect of the present application, there is provided a method of decomposing a data agnostic business intelligence query applied against one or more multidimensional data sources. The method comprises the steps of identifying sub-queries that represent the operands of query set operation; extracting the identified operands into sub-queries; sending the extracted OLAP supported sub-queries to an OLAP
query provider, and preparing a plan to perform the set operations on results that will be received from an OLAP query provider using services provided by a relational query provider.
[0014] In accordance with another aspect of the present application, there is provided a memory containing computer executable instructions that can be read and executed by a computer for caring out a method of processing a data agnostic business intelligence query against one or more data source. The method comprises steps of detecting and intercepting a data agnostic business intelligence query that is issued against one or more multidimensional data sources; decomposing operands of a query operation source of the data agnostic business intelligence query into one or more OLAP sub-queries;
and replacing the data agnostic business intelligence query with a stack of operations expressed by the decomposed OLAP sub-queries and the relational set operation query for execution.
[0015] In accordance with another aspect of the present application, there is provided a carrier carrying a propagated signal containing computer executable instructions that can be read and executed by a computer, the computer executable instructions being used to execute a method of processing a data agnostic business intelligence query against one or more data source, the method comprises the steps of identifying sub-queries that represent the operands of query set operation; extracting the identified operands into sub-queries;
sending the extracted OLAP supported sub-queries to an OLAP query provider, and preparing a plan to perform the set operations on results that will be received from an OLAP query provider using services provided by a relational query provider.
[0016] This summary of the invention does not necessarily describe all features of the invention.
BRIEF DESCRIPTION OF THE DRAWINGS
[0017] These and other features of the invention will became more apparent from the following description in which reference is made to the appended drawings wherein:
Figure 1 shows in a block diagram an example of a query framework system in which an example of a tabular set provider is suitably used, in accordance with an embodiment of the present invention;
Figure 2 shows in a block diagram the query framework system in more detail;
Figure 3 shows in a block diagram an example of a tabular set provider, in accordance with an embodiment of the present invention;
Figure 4 schematically shows in a diagram the decomposition of a data agnostic business intelligence query, in accordance with an embodiment of the tabular set provider;
Figure 5 shows in a flowchart the operation of the tabular set provider;
Figure 6 shows in a flowchart a decomposition process by the tabular set provider;
-s-Figure 7 shows in a block diagram a query decomposer in accordance with an embodiment of the tabular set provider;
Figure 8 shows in a screen shot an example data analyzing system's representation of a query set operation that can be expressed as a request to the query framework;
Figure 9 shows in a screenshot of the two data sources that are the targets of "Queryl" and "Query2" respectively;
Figure 10 shows in a screenshot the report of Figure 9; and Figure 11 shows in a screenshot the results of the report for "Queryl ", "Query2"
and their union, "Query3".
DETAILED DESCRIPTION
[0018] The present invention allows for the computation of relational set operations such as union, intersect, and except for data extracted from OLAP (or multidimensional) sources that do not support these computations natively.
[0019] Figures 1 and 2 show a query framework system 10 in which a tabular set provider 100 (Figure 2) in accordance with an embodiment of the application is suitably used. The tabular set provider 100 may be a standalone pluggable component as shown in Figure 2, or a part of a component that processes high level user requests. The following descriptions describe an embodiment of the tabular set provider as a standalone component, but the invention is not limited to such an embodiment.
[0020] The query framework system 10 is used in a computer system 20 having an input unit 22 and an output unit 24. The query framework system 10 is provided to receive user requests from a data analyzing system 30 and process the received user requests to retrieve requested data from one or more data sources 32.
[0021 ] The data analyzing system 30 is an application that provides various views of data in the data sources 32 to allow users to analyze the data. When a user requests a view of data, the data analyzing system 30 generates a user request. A user request typically contains multiple queries. To generate user requests, the data analyzing system 30 may use a metadata model 34 that contains rnetadata of the data sources 32. The user request is in a query language that the data analyzing system 30 uses to issue the user request. Some data analyzing system 30 may issue a user request in a data source language, such as SQL, and some data analyzing system 30 may issue a user request in a language specific to the data analyzing system 30.
[0022] The data analyzing system 30 issues user requests in a data agnostic querying language or specification. The data agnostic querying specification allows combining query elements involving relational operations and OLAP operations.
Accordingly, a data agnostic query issued by the data analyzing system 30 may involve both relational and OLAP operations, only relational operations, or only OLAP operations.
[0023] The query framework system 10 intercepts user requests generated by the data analyzing system 30. It processes and executes the user requests to retrieve desired data from the data sources 32.
[0024] As shown in Figure 2, the query framework system 10 has multiple query processing components 12. Query processing components 12 share a common interface 14 and a common query language of the query framework system 10. Query processing components 12 are pluggable components. Query processing components 12 include a set of query operation providers 50, and a coordination planner 60. The query framework system 10 may also have an operation support table 16 that describes functionalities of the query operation providers 50.
[0025] The coordination planner 60 organizes interaction between the query processing components 12. The interaction is carried out through the common interface 14 and based on the common query language. The coordination planner 60 divides the query processing into two phases: query planning or preparation phase and a query execution phase.
During the query preparation phase, the coordination planner 60 interacts with components in order to identify and plan the operations associated with each component involved in the query preparation process, and to determine the sequence of these operations.
The coordination planner 60 may use one or more query operation providers 50 during the query preparation phase. During the query execution phase, the coordination planner 60 distributes the query operations to associated query operation providers 50.
The coordination planner 60 invokes the query operations in the sequence determined at the preparation phase.
[0026] During the preparation phase, the coordination planner 60 converts a user request received from the data analyzing system 30 into a converted query or a query framework (QF) query. A QF query plays the role of a query specification that the query operation providers 50 use to communicate to each other and to the coordination planner 60 within the query framework system 10. The QF query definition is an extension of the user request specification defined by the data analyzing system 30. The coordination planner 60 and the query framework system 10 are further described in Canadian Patent Application No. 2,518,902 entitled "System and Method for Query Planning and Execution".
[0027] Each query operation provider 50 is capable of performing a specific operation on queries. The query operation providers 50 include a relational query provider 52, OLAP
query provider 54, and the tabular set provider 100. There may be more query operation providers in the query framework system 10.
[0028] The OLAP query provider 54 is capable of processing OLAP queries, i.e., queries of the system query language involving OLAP operations only. The union, intersect, and except operations which act on rows of operand queries are relational type operations that are not supported by the OLAP query provider 54. The operation of the OLAP
query provider 54 involves translation of the system query language into the query language of underling multidimensional data sources 32, which is MDX.
[0029] The relational query provider 52 is capable of processing relational queries. It provides tabular operations to received queries. The operation of the relational query provider 52 involves translation of the system query language into the query language of underling relational data sources 32, which is SQL. The relational query provider 52is also capable of receiving the data upon which the relational queries act. The operations in the input relational queries are applied to the input data stream. The relational query provider 52 provides the services needed in order to compute the relational set operations _g_ like the union, intersect, and except operations on data streams representing the operands of these operations.
[0030] The tabular set provider 100 is a query transformation component plugged into the query framework system 10 governed by the coordination planner 60. The tabular set provider 100 is a transformation provider which is responsible for preprocessing of the received user request, and transforms the received user request into a QF
query in order, for example, to make it simpler or supported by other components in the query framework system 10.
[0031 ] The tabular set provider 100 uses a query decomposition process that bridges between functionality available in relational operations and multidimensional operations.
It allows relational set operations, such us union, intersect, and except, against multidimensional data sources that do not natively support these relational set operations.
[0032] The tabular set provider 100 plays part in data agnostic user experience, e.g., the same look and feel against relational and multidimensional data sources, including the set of functions shared across multidimensional and relational data sources. The tabular set provider 100 allows the relational and multidimensional operations combined in a single query. In this embodiment, the logic of the query decomposition is implemented as a tabular set provider 100 which is a pluggable component in the query framework system 10. Accordingly, its functionality can be reused for different variations of data sources.
Also, its behavior may be tweaked in the system 10 by pre-processing incoming queries or post-processing the results of the decomposition. In a different embodiment, the functionality of the tabular set provider 100 may be incorporated into other component.
[0033] As shown in Figure 3, the tabular set provider 100 has a query interceptor 102, query decomposes 104, an OLAP query provider interface 106, and an operation module 108. The query interceptor 102 detects and intercepts queries requiring its involvement, the main type of which is queries that involve relational set operations, such as the union, intersect, and except issued against multidimensional data sources 32. The query decomposes 104 decomposes these queries into three specifications, as further described below. The OLAP query provider interface 106 is used for sending multidimensional queries to a multidimensional query provider and for receiving results. The operation module 108 is a relational query interface that is used to compute the set operations on results coming from the OLAP query provider.
[0034] Figure 4 schematically illustrates an example of decomposition of a multidimensional query 110 intercepted by the tabular set provider 100. The data agnostic input query 110 is decomposed into three specifications 120. The three specifications 120 are a set of OLAP sub-queries 122 each corresponding to an operand of the set operation, a relational set operation query 126, and a post-processing operation query 128.
Each sub-query in the set of OLAP sub-queries 122 has a QF query specification that involves only OLAP operations and is sent to the OLAP query provider 54 for execution. The relational set operation query 126 has a QF query specification that involves only relational operations. This query 126 is sent to the relational query provider 52 for execution. The post processing operation query 128 is a query representing an operation of adjusting data set information to correspond to column information expressed in the metadata model rather than the structure of the input data streams, one per sub-query, that is extracted from the OLAP query provider 54 by the OLAP query provider interface 106 and passed to the relational query provider 52 by the operation module 108. The post processing operation is handled by the tabular set provider 100 itself.
[0035] Referring to Figure 5, the operation of the tabular set provider 100 and the decomposition rules are described. The tabular set provider 100 detects and intercepts an input data agnostic query issued against the multidimensional data source 32 (150). The tabular set provider 100 checks if the input data agnostic query has a source specification that represents a query operations between 2 or more sub-query operands (152).
To determine if the set operations are unsupported, the tabular set provider 100 may use the operation support table 16 to refer to the functionalities that are supported by the OLAP
query provider 54. If the input data agnostic query does not have a query operation source specification, the query is sent to the OLAP query provider 54 (154) by the coordination planner 60 for execution.
[0036] If the data agnostic query has a source specification that represents a set operation between two or more sub-queries, the tabular set provider 100 invokes the query decomposes 104 to decompose the input data agnostic query (156). The query decomposes 104 analyses the incoming query's source specification (158) in order to detect the query operation source type and extract the operands of this query operation (160).
[0037] The initial data agnostic query is then replaced with stack of operations (162) based on the results of the decomposition of the query at step 160. The set of OLAP
supported sub-queries are then sent to the OLAP query provider 54 by the OLAP query provider interface 106 and the resulting data streams, each corresponding to a sub-query operand, are sent to the relational query provider 52 along with the set operation query and results are received by the operation modules.
[0038] The tabular set provider 100 finally performs a data set information post processing operation (164). The tabular set provider adjusts the data set information to correspond to column information expressed in the metadata model rather than the structure of the input data streams that is extracted from the OLAP query provider and passed to the relational query provider.
[0039] In order to perform the decomposition of expressions at step I60, as shown in Figure 6, the query decomposes 104 may have to identify sub-queries that represent the operands of a query set operations as specified in the source of the initial data agnostic input query (190). The query decomposes 104 may then have to extract operands into sub-queries (192). The query decomposes may then have to create and send the OLAP
sub-queries to an OLAP query provider 54 (194), and performs the set operation on results received from the OLAP query provider (54) using services provided by the relational query provider (196).
[0040] The decomposition may be carned out as shown in Figure 7. The set operation identifier 200 of the query decomposes 104 identifies sub-queries that represent the operands of a query set operation in a source specification that is unsupported by an OLAP
query provider 54. The OLAP sub-queries extractor 202 extracts operands into sub-queries that are supported by the OLAP query provider 54. The operand sub-queries are extracted from the set operation as specified in the source of the incoming business intelligence data agnostic query. The query decomposes 104 uses the relational set -t1-operation creator 204 in order to create one query for the relational query provider 52 and to use its services to execute the relational set operation. Finally, the query decomposer 104 uses the post processing query creator 206 in order to prepare a post processing plan that will ensure that the data set information returned to the data analyzing system is consistent with the input data agnostic query.
[0041 ] An example of software implementation of the tabular set provider 100, TabularOpeartionProvider, in accordance with an embodiment of the invention is now described. In this embodiment, the query framework system 10 receives from the data analyzing system 30 a user request that has a <queryResultDefinition> section.
Each expression in the user request is analyzed in the context of the <queryResultDefinition>
section of the user request. An example of the user request that is described in Figures 8 to Figure 11 is the following:
<?xml version="1.0" encoding="UTF-8"?>
<qs:command xmlns:qs="http://developer.cognos.com/schemas/qsapi/2P'>
<connection>
<QFProviderType value="QueryFrameworkService"/>
<productLocale value="en"/>
<runLocale value="en-us"/>
<servicelD value="RSVP"/>
<sessionlD value="010:0743ad7d-e670-llda-8f45-cfBddOd60ece:3678379425"/>
<CAMPassport value="010:0743ad7d-e670-11 da-Sf45-cfBddOd60ece:3678379425"/>
<contentManagerURl value="http://127Ø0.1:9300/p2pd/servlet"/>
<lconnection>
<execute>
<querySet expressionLocale="en-us">
<modelPath>/content/package(@name=&apos;Two OLAP Data Sources&aposmodelPath>
<queries>
<query name="Queryl ">
<source>
<model/>
</source>
<selection>
<dataltem aggregate="none" name="Year">
<expression>[Great Outdoors Company].[Years].[Years].[Year]</expression>
</dataltem>
<dataltem name="Revenue">
<expression>[Great Outdoors Company].[Measures].[Revenue]</expression>
</dataltem>
<dataltem name="Quantity sold">
<expression>[Great Outdoors_Company].[Measures].[Quantity sold]</expression>
</dataltem>
</selection>
</query>
<query name--"Query2">
<source>
<model/>
</source>
<selection>
<dataltem aggregate="none" name--"Year">
<expression>[Great outdoors.[Years].[Years].[Year]<lexpression>

outdoors.[Measures].[Revenue]</expression>
Sold]</expression>
</dataltem>
<dataltem name="Revenue">
<expression>[Great </dataltem>
<dataltem name--"Quantity Sold">
<expression>[Great outdoors.[Measures].[Quantity </dataltem>
</selection>
</query>
<query name="Query3">
<source>
<queryOpera6on name="Union1" setOperation="UNION">
<queryRefs>
<queryRef refQuery="Query1"/>
<queryRef refQuery="Query2"/>
</queryRefs>
<projectionList autoGenerated="true">
<queryltem name="Year"/>
<queryltem name="Revenue"/>
<queryltem name="Quantity sold"/>
</projection List>
</queryOperation>
</source>
<selection>
<dataltem name="Year">
<expression>[Union 1 ]. [Year]</expression>
</dataltem>
<dataltem name="Revenue">
<expression>[Union 1 ]. [Revenue]</expression>
</dataltem>
<dataltem name="Quantity sold">
<expression>[Union1].[Quantity sold]</expression>
</dataltem>
</selection>
</query>
</queries>
<queryResuItDefinitions>
<queryResuItDefinition name="Query3.0" refQuery="Query3">
<edges>
<edge name="3">
<edgeGroups>
<edgeGroup>
<valueSets>
<valueSet name="7">
<groupBody name="7_groupBody'">
<dataltemRef refDataltem="Year"/>
<dataltemRef refDataltem="Revenue"/>
<dataltemRef refDataltem="Quantity sold"/>
</groupBody>
</valueSet>
</valueSets>
</edgeGroup>
</edgeGroups>
</edge>
</edges>
</queryResuItDefinition>
</queryResuItDefinitions>
</querySet>
</execute>
</qs:command>

[0042] The <queryResultDefinition> in this example reference "Query3" , which is defined as follows:
<query name="Query3">
<source>
<queryOperation name="Unionl" setOperation="UNION">
<queryRefs>
<queryRef refQuery="Query1 "/>
<queryRef refQuery="Query2"/>
</queryRefs>
<projectionList autoGenerated="true">
<queryltem name="Year"/>
<queryltem name="Revenue"/>
<queryltem name="Quantity sold"/>
</projectionList>
</queryOperation>
</source>
<selection>
<dataltem name-='Year">
<expression>[Union 1 ].[Year]</expression>
</dataltem>
<dataltem name="Revenue">
<expression>[Union 1 ].[Revenue]</expression>
</dataltem>
<dataltem name="Quantity sold">
<expression>[Union1].[Quantity sold]</expression>
</dataltem>
</selection>
</query>
[0043] The source of "Query3" in this example is a <queryOperation> element that has the attributes name "Unionl" and a setOperation "UNION". Hence "Query3" is a query that has a source specification indicating that it is a union operation. The operands of this union operation are the children element in this specification, which are specified under the element <queryRefs>. The referenced queries are "Queryl" and "Query2"
which are specified in the complete example above.
[0044] "Queryl" and "Query2" are the extracted sub-queries that are supported by the OLAP query provider 54. They are sent for execution to the OLAP query provider 54 for executions and the resulting data streams are fed to the relational query provider 52 along with the set operation specification for execution. The results from the relational query provider 52 are post processed and sent back to the data analyzing system.
[0045] Figures 8 to 11 show in screenshots examples of set operations handled by the tabular set provider component, in accordance with an embodiment of the tabular set provider 100.

Example 1 [0046] Figure 8 shows in a screen shot an example data analyzing system's representation of a query set operation that can be expressed as a request to the query framework as discussed above. In this screen shot, "Query3" is formulated through few user interface gestures as the set union of "Queryl" and "Query2".
Example 2 [0047] Figure 9 shows in a screenshot of the two data sources that are the targets of "Queryl" and "Query2" respectively. "Queryl" extract data from the first OLAP
data source "Great Outdoors Company" that contains sample data of a fictitious company. The data contains many dimensions along the business of this fictitious company, such us Time, Products, Geography, etc. The data contains many measures for analysis, such us Revenue and Quantity sold. "Query2" extracts data from the second OLAP data source "Great Outdoors" which contain the same business dimensions and analysis measures for the fictitious company. The difference between the two data sources is that the first data source contains recent data relating to the years 2004, 2005, 2006. The second data source contains historic data relating to the years 1996 and 1997. The user would like to produce a report that contains both recent and historic data for the revenues and quantity sold measures. Hence, he needs to union the data from "Queryl" and "Query2".
Example 3 [0048] Figure 10 shows in a screenshot the report described in Example 2.
Example 4 [0049] Figure 11 shows in a screenshot the results of the report for "Queryl ", "Query2"
and their union, "Query3".
[0050] The systems and methods according to the present invention may be implemented by any hardware, software or a combination of hardware and software having the functions described above. The software code, either in its entirety or a part thereof, may be stored in a computer readable memory. Further, a computer data signal representing the software -ls-code that may be embedded in a carrier wave may be transmitted via a communication network. Such a computer readable memory and a computer data signal are also within the scope of the present invention, as well as the hardware, software and the combination thereof.
[0051 ] While particular embodiments of the present invention have been shown and described, changes and modifications may be made to such embodiments without departing from the true scope of the invention.

Claims (6)

What is claimed is:
1. A method of processing a data agnostic business intelligence query comprising the steps of:
detecting and intercepting, by a computer, a data agnostic business intelligence query that is issued against at least one multidimensional data source;
decomposing a relational set operation contained in the data agnostic business intelligence query into at least two sub-queries handled by an online analytical processing (OLAP) query provider that queries the at least one multidimensional data source, wherein the relational set operation is not natively supported by the OLAP query provider, and wherein each sub-query is supported by the OLAP query provider and represents an operand of the relational set operation;
sending the at least two sub-queries to the OLAP query provider for execution;
receiving resultant OLAP data streams generated from the execution of the at least two sub-queries, each resultant OLAP data stream corresponding to a sub-query operand for the relational set operation; and sending the resultant OLAP data streams to a relational query provider for execution of the relational set operation.
2. The method of claim 1, wherein the decomposed at least two sub-queries comprise multidimensional sub-queries sent to the OLAP query provider that is capable of executing multidimensional query operations.
3. The method of claim 1, wherein the relational set operation and the resultant OLAP
data streams are sent to the relational query provider that is capable of executing relational query operations.
4. The method of claim 1, wherein the decomposing step includes identifying sub-queries that represent operands of the relational set operation that is unsupported by the OLAP query provider, and extracting relational set operands into the sub-queries that are supported by the OLAP query provider, and wherein the method further comprises:

creating one query for the relational query provider and executing the relational set operation along with sub-query operands represented by the resultant OLAP data streams;
and preparing a post processing plan to return a data set that is consistent with the data agnostic business intelligence query.
5. The method of claim 1, wherein the decomposing step includes using an operation support table describing operations supported by the OLAP query provider.
6. A memory containing computer executable instructions that can be read and executed by a computer for carrying out a method of processing a data agnostic business intelligence query, the method comprising the steps of:
detecting and intercepting, by a computer, a data agnostic business intelligence query that is issued against at least one multidimensional data source;
decomposing a relational set operation contained in the data agnostic business intelligence query into at least two sub-queries handled by an online analytical processing (OLAP) query provider that queries the at least one multidimensional data source, wherein the relational set operation is not natively supported by the OLAP query provider, and wherein each sub-query is supported by the OLAP query provider and represents an operand of the relational set operation;

sending the at least two sub-queries to the OLAP query provider for execution;
receiving resultant OLAP data streams generated from the execution of the at least two sub-queries, each resultant OLAP data stream corresponding to a sub-query operand for the relational set operation; and sending the resultant OLAP data streams to a relational query provider for execution of the relational set operation.
CA2550818A 2005-09-13 2006-06-22 System and method of providing relational set operations for olap data sources Expired - Fee Related CA2550818C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA2550818A CA2550818C (en) 2005-09-13 2006-06-22 System and method of providing relational set operations for olap data sources

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CA2,519,015 2005-09-13
CA002519015A CA2519015A1 (en) 2005-09-13 2005-09-13 System and method of providing relational set operations for olap data sources
CA2550818A CA2550818C (en) 2005-09-13 2006-06-22 System and method of providing relational set operations for olap data sources

Publications (2)

Publication Number Publication Date
CA2550818A1 CA2550818A1 (en) 2007-03-13
CA2550818C true CA2550818C (en) 2012-06-26

Family

ID=37872092

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2550818A Expired - Fee Related CA2550818C (en) 2005-09-13 2006-06-22 System and method of providing relational set operations for olap data sources

Country Status (1)

Country Link
CA (1) CA2550818C (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7853603B2 (en) * 2007-05-23 2010-12-14 Microsoft Corporation User-defined relevance ranking for search

Also Published As

Publication number Publication date
CA2550818A1 (en) 2007-03-13

Similar Documents

Publication Publication Date Title
US7647315B2 (en) System and method of providing relational set operations for multidimensional data sources
US7711704B2 (en) System and method of providing date, arithmetic and other relational functions for OLAP sources
US8935273B2 (en) Method of processing and decomposing a multidimensional query against a relational data source
JP7273045B2 (en) Dimensional Context Propagation Techniques for Optimizing SQL Query Plans
US10984042B2 (en) Publishing RDF quads as relational views
US10515094B2 (en) System and method for analyzing and reporting extensible data from multiple sources in multiple formats
US7716257B2 (en) Universal drill-down system for coordinated presentation of items in different databases
Vassiliadis et al. ARKTOS: towards the modeling, design, control and execution of ETL processes
US8943059B2 (en) Systems and methods for merging source records in accordance with survivorship rules
EP3259686B1 (en) Querying a data source on a network
US20070094236A1 (en) Combining multi-dimensional data sources using database operations
CN106503912A (en) A kind of data service system
US7640238B2 (en) Query planning for execution using conditional operators
US11960482B1 (en) Systems and methods for extracting data views from heterogeneous sources
CN104317936A (en) ROLAP (relational on-line analysis processing) analysis engine design method and device on basis of star models
Wada et al. Virtual database technology for distributed database
US8527552B2 (en) Database consistent sample data extraction
CA2550818C (en) System and method of providing relational set operations for olap data sources
CA2510508C (en) System and method for processing and decomposition of a multidimensional query against a relational data source
CA2545108A1 (en) System and method of providing date, arithmetic and other relational functions for olap sources
WO2024130741A1 (en) Data processing method, apparatus, device, storage medium, and program product
Dinsmore et al. The Hadoop Ecosystem: Disrupting from Below
Schneider et al. Reading Sample
Tocci A comparison of leading database storage engines in support of online analytical processing in an open source environment

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed

Effective date: 20150622