CN101008953A - Method and device for processing nonempty date in online analytical processing system - Google Patents

Method and device for processing nonempty date in online analytical processing system Download PDF

Info

Publication number
CN101008953A
CN101008953A CN 200710003016 CN200710003016A CN101008953A CN 101008953 A CN101008953 A CN 101008953A CN 200710003016 CN200710003016 CN 200710003016 CN 200710003016 A CN200710003016 A CN 200710003016A CN 101008953 A CN101008953 A CN 101008953A
Authority
CN
China
Prior art keywords
data
unit
dimension
original
row
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200710003016
Other languages
Chinese (zh)
Other versions
CN100495403C (en
Inventor
林志贤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kingdee Software China Co Ltd
Original Assignee
Kingdee Software China Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kingdee Software China Co Ltd filed Critical Kingdee Software China Co Ltd
Priority to CNB2007100030168A priority Critical patent/CN100495403C/en
Publication of CN101008953A publication Critical patent/CN101008953A/en
Application granted granted Critical
Publication of CN100495403C publication Critical patent/CN100495403C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This invention discloses one cross analysis process system method to process non vacuum data, which comprises the following steps: a, analyzing defined multi-dimension expression to get index axis original range; b, when only considering physical data, browsing the said original range each member combination and filtering the one dimension data to get initial two dimensional form; c, browsing the said two dimension each line or row units and filtering them to get needed two dimensional form. This invention also discloses one device to analyze process system for non-hollow data of cross machine.

Description

Handle the method and apparatus of non vacuum data in the on-line analysing processing system
Technical field
The present invention relates to field of computer technology, be particularly related to the method and apparatus of handling non vacuum data in a kind of on-line analysing processing system specifically.
Background technology
OLAP (On-Line Analytical Processing, on-line analytical processing) is a kind of software engineering, it make the analyst can be rapidly, consistent, observed information in all its bearings alternatively, to reach the deep purpose of understanding data, these information are directly changed from raw data and are come, and they reflect the truth of enterprise in the mode that the user understands easily.The most of strategy of OLAP all is relationship type or common data to be carried out multidimensional data store, so that analyze, thereby reaches the purpose of on-line analytical processing.This multidimensional data is counted as a hypercube, and along the direction stored data of each dimension, it allows the user to analyze data along the axis of affairs, has realized the express-analysis of shared multidimensional information.
MDX (Multi Dimensional Expressions, Multidimensional Expressions) is the OLAP query language of standard.The result set that obtains behind the application MDX inquiry OLAP is a two-dimentional form normally, the both sides of two dimension form are axles, the data that run into full line and permutation in this two-dimentional form probably are empty situation, generally need filter, be applied to axle by Non Empty key word in the MDX query language and go up to specify these needs to filter out the point of sky data to the empty data in the form.When filtering empty data in the prior art, at first concentrate and use the MDX inquiry as required and obtain whole two-dimentional form from multidimensional data, then the two-dimentional form that generates is traveled through, judge all unit of each row or column decision in the two-dimentional form, if the data in all unit all are empty, this row or column then being described for empty, is the two-dimentional form of ultimate demand with all data in the two-dimentional form for empty row or column filtration back generates new two-dimentional form at last.Especially, when some dimension the two-dimentional form that obtains from the concentrated inquiry of multidimensional data is defined new calculating member, all non-calculating member unit that occur a certain row or column decision probably all are empty, but the calculating member of this row or column decision is not empty situation, when the existing method of application is filtered this two dimension form so, then can only will only have calculate the member for empty row or column integral body remains, only calculate the member and be not empty row or column and can't select flexibly whether to keep according to the needs of business.For example, use a Multidimensional Expressions SELECT{[Measures] .[Sales_Dollars], [Measures] .[Sales_Units] } ON columns, Crossjoin ([Time] .[2000], [Time] .[2001] }, [Product] .[Bread], and [Prodct] .[Dairy], [Product] .[Meat]) ON rows FROM[Sales] concentrate inquiry to obtain from multidimensional data as following table 1:
Tolerance
Time Product Consumption sum Sales volume
1 2000 Bread
2 Dairy
3 Meat
4 Calendar year 2001 Bread 2180.74 1046
5 Dairy 4014.91 1884
6 Meat 1678.51 785
From last table 1 as can be known, member on the time dimension did not have data to take place in 2000, use Multidimensional Expressions SELECT{[Measures] .[Sales_Dollars], [Measures] .[Sales_Units] } ON columns, NON EMPTY Crossjoin ([Time] .[2000], [Time] .[2001] }, [Product] .[Bread], [Prodct] .[Dairy], [Product] .[Meat] }) ON rows FROM [Sales] WHERE ([State] .[Canada], [Employee] .[All Employee]) operates once more this form and filters out data and obtain following needed table 2 after for empty unit:
Tolerance
Time Product Consumption sum Sales volume
1 Calendar year 2001 Bread 2180.74 1046
2 Dairy 4014.91 1884
3 Meat 1678.51 785
Obtain as following table 3 when using Multidimensional Expressions inquiry cube, this epiphase increases a new calculating member " other " for the dimension of the tolerance in the table 1, calculates member's " other " and represents whether this row has data, and concrete table 3 is as follows:
Tolerance
Time Product Consumption sum Sales volume Other
1 2000 Bread Not
2 Dairy Not
3 Meat Not
4 Calendar year 2001 Bread 2180.74 1046 Be
5 Dairy 4014.91 1884 Be
6 Meat 1678.51 785 Be
When the existing method his-and-hers watches 3 of application filter, owing to went up all non-calculating member unit in 2000 all is empty, but calculate member's " other " is not empty, so can't will data not take place physically, be that the time dimension member that data do not take place for tolerance consumption sum in the fact table and sales volume filtered in 2000, therefore existing method can't generate as required and only keep the two-dimentional form that data physically take place.
As the above analysis, use existing method processing non vacuum data and have following limitation:
At first, existing method can only be carried out engine by MDX and concentrate inquiry or calculate two-dimentional form according to the MDX expression formula from multidimensional data, and then the two-dimentional form that generates filtered the two-dimentional form that generates actual needs, and can't be before obtaining two-dimentional form the filtration fraction data, to dwindle query context.When the two-dimentional form hollow row that runs into generation or much more very situations of empty row, in the process of traversal form, will waste a large amount of time, make that the whole operation process efficiency is low;
Secondly, it is data that only need consideration physically to take place that existing method is not distinguished the non-NULL filtration, still need consider to comprise all data of calculating the member, because the calculating member can't be distinguished in the NonEmpty key word in the MDX query language standard.Therefore, existing method can not satisfy some professional particular demands when carrying out the processing of non vacuum data, promptly need to filter the row or column that data physically do not take place, and have a large amount of such demands in the actual service application, greatly limited the practicality and the dirigibility of OLAP inquiry.
Summary of the invention
The purpose of this invention is to provide the method for handling non vacuum data in a kind of on-line analysing processing system, with overcome in the prior art can't be before obtaining two-dimentional form the filtration fraction data, when the two-dimentional form hollow row that runs into generation or much more very situations of empty row, in the process of traversal form, waste the plenty of time, reduced the problem of operating efficiency.
Another object of the present invention provides the device of handling non vacuum data in a kind of on-line analysing processing system, to solve the problem of handling the non vacuum data inefficiency in the prior art in the on-line analytical processing form.
For solving the problems of the technologies described above, the invention provides following technical scheme:
Handle the method for non vacuum data in a kind of on-line analysing processing system, comprising:
The original scope that the Multidimensional Expressions of A, parsing definition obtains inquiring about axle;
B, when only needing to consider the data that physics exists, travel through the combination of each dimension member on the point in the described original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters;
All unit of each row or column decision in C, the described original two-dimensional form of traversal when all unit all do not have the data of physics existence, obtain required two-dimentional form with this row or column filtration.
Described method also comprises:
S1, according to source data structure cube;
S2, when the dimension member who concentrates when multidimensional data has data, the data attribute that this dimension member is set is for true, otherwise data attribute be vacation.
Described step S2 comprises:
S21, during initially according to the dimension of the dimension table structure cube of source data, each dimension member's data attribute all is set to vacation;
S22, during according to the metric data of the fact table of source data structure cube, the data attribute that upgrades the described dimension member that data are arranged in this fact table is for true.
Among the described step B data being taken place at least one dimension member is specially for empty point filters:
Judge the data attribute of each dimension member on the described point, when at least one dimension member's data attribute is that fictitious time filters this point.
Described method also comprises:
When needs consider to comprise all data of calculating the member, according to described original scope from the concentrated original two-dimensional form that obtains of described multidimensional data;
Travel through the unit of each row or column decision in the described original two-dimensional form, when the data in all unit all are sky, this row or column filtration is obtained required two-dimentional form.
Handle the device of non vacuum data in a kind of on-line analysing processing system, comprising:
Resolve the Multidimensional Expressions unit, be used to resolve the original scope that the Multidimensional Expressions of definition obtains inquiring about axle;
The Rule of judgment unit is used to judge whether only need consider the data that physics exists;
Traversal dimension member unit, the Rule of judgment unit judges is used for when only need be considered the data of physics existence, travel through the combination of each dimension member on the point in the described original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters;
Handle the physics data cell, be used for traveling through all unit of each row or column decision of described original two-dimensional form, when all unit all do not have the data of physics existence, this row or column filtration is obtained required two-dimentional form.
Described device also comprises:
Structure cube unit is used for according to source data structure cube;
The unit that sets a property is used for when dimension member that multidimensional data is concentrated has data, and the data attribute that this dimension member is set is for true, otherwise data attribute be vacation.
Described device also comprises:
Obtain two-dimentional list cell, be used for when the Rule of judgment unit judges need be considered to comprise all data of calculating the member, concentrate from described multidimensional data according to described original scope to obtain the original two-dimensional form;
The ergodic data unit is used for traveling through the unit that each row or column of described original two-dimensional form determines, when the data in all unit all are sky, this row or column filtration is obtained required two-dimentional form.
By above technical scheme provided by the invention as seen, the present invention has following characteristics and advantage:
At first, the present invention is empty member by data on the pre-service filtration dimension, and then the original two-dimensional form that generates is filtered.This method is when only needing to consider the data of physics existence, by Non Fact Empty key word is set in the MDX expression formula, can filter in the original scope of inquiry axle at least one dimension member earlier data takes place is empty point, make that data decline to a great extent for empty unit in the original two-dimensional form of follow-up generation, when this form being traveled through the empty data of filtration, greatly reduce the running time, improved filtration efficiency;
Secondly, when needs were considered to comprise all data of calculating the member, the inventive method was by being provided with Non Empty key word in the MDX query statement, can select to keep all members in the two-dimentional form of generation and not be the point of sky.Therefore use the present invention and carry out non vacuum data when handling, not only can improve operating efficiency, and greatly improve the dirigibility of filtering, the user can carry out the processing of non vacuum data to form according to the needs of practical business, has very strong practicality.
Description of drawings
Fig. 1 is the inventive method process flow diagram;
Fig. 2 A is a time dimension structural drawing in the embodiment of the invention;
Fig. 2 B is product dimensional structure figure in the embodiment of the invention;
Fig. 3 is a preferred embodiment of the present invention process flow diagram;
Fig. 4 is the process flow diagram of second embodiment of the invention;
Fig. 5 is the process flow diagram of third embodiment of the invention;
Fig. 6 is the preferred embodiment block diagram of apparatus of the present invention.
Embodiment
Core of the present invention provides the method for handling non vacuum data in a kind of on-line analysing processing system, obtain inquiring about the original scope of axle by the Multidimensional Expressions of resolving definition, when only needing to consider the data of physics existence, travel through the combination of each dimension member on the point in this original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters; Travel through all unit of each row or column decision in this original two-dimensional form again, when all unit all do not have the data of physics existence, this row or column filtration is obtained required two-dimentional form.
The two-dimentional form that the present invention needs by MDX inquiry OLAP acquisition:
The principal feature of OLAP is the multi-angle thinking pattern of directly copying the user, sets up the data model of multidimensional in advance for the user, and wherein dimension refers to user's analytic angle.For example, the analysis of data is sold in multi-pin, and the time cycle is a dimension, and product category, channel of distribution, geographic distribution, client's realm also are respectively dimensions.In case the multidimensional data modelling is finished, the user can obtain data from each analytic angle apace, also can be dynamically switches between all angles or carries out the multi-angle comprehensive analysis, has great dirigibility.MDX is mainly used in definition, uses and retrieve data from multi dimensional object.By the MDX inquiry, multidimensional data can be generated two-dimentional form and show that the both sides of two-dimentional form are Axis (axle), generally these two axles are Columns (row axle) and Rows (row axle); Component units on the axle is Point (position), and Cube (cube) is a cube; Also comprising the Member (member) on Dimension (dimension), the dimension and calculate the member, wherein, calculate the member and do not exist physically, is that its value is calculated by expression formula, belongs to the member on the dimension in form by the member of expression formula definition.
In order to make those skilled in the art person understand the present invention program better, the present invention is described in further detail below in conjunction with drawings and embodiments.
The inventive method flow process is as shown in Figure 1:
Step 101: the data attribute that multidimensional data is concentrated the dimension member is set.
When concentrating the dimension member that data are arranged according to the multidimensional data of source data structure, the data attribute that this dimension member is set is for true, otherwise data attribute is false.Concrete, when initially constructing the dimension of cube according to the dimension table of source data, each dimension member's data attribute all is set to vacation, when constructing the metric data of cube according to the fact table of source data then, the data attribute that upgrades the described dimension member that data are arranged in this fact table is for true.
Step 102: the Multidimensional Expressions of resolving definition obtains inquiring about the original scope of axle.
Step 103: when only needing to consider the data of physics existence, execution in step 104; When needs are considered to comprise all data of calculating the member, execution in step 106.
Step 104: travel through the combination of dimension member on each aspect and filter at least one dimension member the point of data for sky takes place.
No matter in the Multidimensional Expressions whether the calculating member is arranged, when only needing to consider the data that physics exists, travel through the combination of each dimension member on the point in the described original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters.Concrete, judge the data attribute of each dimension member on this aspect, when at least one dimension member's data attribute is that fictitious time is with this some filtration.
Step 105: all do not have physics to exist the row or column of data to filter process ends all unit.
All unit of each row or column decision in the original two-dimensional form that traversal generates when all unit all do not have the data of physics existence, obtain required two-dimentional form with this row or column filtration.
Step 106: obtain the original two-dimensional form according to original scope.
The calculating member is arranged, and when need considering to comprise all data of calculating the member in Multidimensional Expressions, according to original scope from the concentrated original two-dimensional form that obtains of multidimensional data.
Step 107: the row or column that all unit all do not have data to exist is filtered process ends.
The unit of each row or column decision in the original two-dimensional form that traversal generates when the data in all unit all are sky, obtains required two-dimentional form with this row or column filtration.
When application MDX carries out the non-NULL inquiry to cube, can handle this cube by adding Non Fact Empty key word or Non Empty key word in MDX.Wherein, when adding Non Fact Empty key word among the MDX, no matter among the MDX whether the calculating member is arranged, only judges that multidimensional data concentrates the point that physically has data, when all unit physical datas of this some decision when be empty with this some filtration; When adding Non Empty key word among the MDX, and the calculating member is arranged among the MDX, judges that the institute on the axle have a few, when all cell datas that determine when this point are sky with this some filtration.
Before cube being carried out the non-NULL processing by MDX, at first need to construct cube and the data attribute that this multidimensional data is concentrated each dimension member is set according to source data.Suppose that source data has two dimensions, Time (time) and Product (product), and two metric Amount (amount of money) and Quantity (quantity).Wherein, Time (time) record sheet is as shown in table 4:
ID Year
1 2000
2 2001
Product (product) record sheet is as shown in table 5:
ID Year
1 Bread
2 Dairy
3 Meat
Fact table is as shown in table 6:
Time ID Product ID Amout Quantity
2 1 2180.74 1046
2 2 4014.91 1884
Travel through the record of above-mentioned three forms, the rise time structure as shown in Figure 2, wherein Fig. 2 A is the time dimension structure, Fig. 2 B is the product dimensional structure.
Traversal the above mentioned facts table table 6, by field Time_ID related and Product_ID with dimension, can obtain the associated dimension member of every line item, for example according to first line item in the table 6, obtain following corresponding informance corresponding to the Bread among 2001 among Time and the Product: (2001, Product): (2180.74,1046); The above corresponding informance that obtains is write in the concentrated storage system of multidimensional data, promptly obtained the cube of structure.
When initial, two members in Time (time) record sheet 4 and three members' in Product (product) record sheet 5 data attribute (hasData) is set to the vacation of giving tacit consent to (false) respectively; In the process of traversal fact table 6, corresponding modify member's data attribute as first line item in the table, all has data among the Bread corresponding to 2001 and the Product of Time, and therefore the data attribute with these two members is revised as very (true).
The preferred embodiment flow process of the inventive method the figure shows behind the data attribute that the concentrated dimension member of multidimensional data is set as shown in Figure 3, handles the flow process of non vacuum data when not calculating the member in the Multidimensional Expressions by Non FactEmpty key word.
Step 301: resolve the original scope that the Multidimensional Expressions that does not calculate the member obtains inquiring about axle.
Suppose that Multidimensional Expressions is: Select Measures.members on Columns, Non Fact EmptyCrossJoin (Time.members, Product.members) on Rows from cubeName.Obtain following each dimension member after resolving this Multidimensional Expressions,
Measures.members comprises two members:
{[Measures].[Amount],Measures].[Quantity]};
Product.member comprises three members:
{[Product].[Bread],[Product].[Dairy],[Product].[Meat]}
Time.members comprises two members:
{[Time].[2000],[Time].[2001]}
Therefore, each dimension member makes up as follows on the point in the original query scope:
([Time].[2000],[Product].[Bread]);
([Time].[2000],[Product].[Dairy]);
([Time].[2000],[Product].[Meat]);
([Time].[2001],[Product].[Bread]);
([Time].[2001],[Product].[Dairy]);
([Time].[2001],[Product].[Meat])。
Step 302: filter at least one dimension member data attribute and obtain the original two-dimensional form for false point.
The data attribute of each dimension member on this aspect is judged in each dimension member's combination on the point of original scope in the traversal step 201, is that false point filters with at least one dimension member data attribute.Because [Time] .[2000] data attribute be false, therefore will include [Time] .[2000] have a few all delete after, obtain following three combinations:
([Time].[2001],[Product].[Bread]);
([Time].[2001],[Product].[Dairy]);
([Time].[2001],[Product].[Meat])。
As shown in table 7 below to query structure that should the result:
Time Product Amout Quantity
2001 Bread
2001 Dairy
2001 Meat
It is as shown in table 8 below to obtain the original two-dimensional form according to query structure table 7:
Time Product Amout Quantity
2001 Bread 2180.74 1046
2001 Dairy 4014.91 1884
2001 Meat
Step 303: with all cell datas in the original two-dimensional form is that empty row or column is filtered.
Traversal original two-dimensional form table 8 judges whether Amount and Quantity all be empty, all is that to obtain the two-dimentional form of ultimate demand after last column filtration of sky as shown in table 9 below with data in this form:
Time Product Amout Quantity
2001 Bread 2180.74 1046
2001 Dairy 4014.91 1884
Owing to do not calculate the member in the Multidimensional Expressions of above embodiment, therefore also can handle non vacuum data by the NonEmpty key word, this Multidimensional Expressions is specially: Select Measures.memberson Columns, Non Empty CrossJoin (Time.members, Product.members) on Rowsfrom cubeName.The idiographic flow of handling does not repeat them here with Non Fact Empty.
The second embodiment flow process of the inventive method as shown in Figure 4, the figure shows behind the data attribute that the concentrated dimension member of multidimensional data is set, when the data of calculating the member and only need considering the physics existence are arranged in the Multidimensional Expressions, handle the flow process of non vacuum data by Non Fact Empty key word.
Step 401: resolving has the Multidimensional Expressions that calculates the member to obtain inquiring about the original scope of axle.
Suppose that this Multidimensional Expressions is: With[Measures] .[Note] as ' illustrates ' Select (Measures.members, [Measures] .[Note]) on Columns, Non Fact EmptyCrossJoin (Time.members, Product.members) on Rows from cubeName.This Multidimensional Expressions for the tolerance dimension definition one calculate the member, this member's value is that character string ' illustrates ' forever.Resolving this Multidimensional Expressions, to obtain each dimension member as follows,
Measures.members comprises two members:
{[Measures].[Amount],[Measures].[Quantity]}
(Measures.members, [Measures] .[Note]) comprises three members:
{[Measures].[Amount],[Measures].[Quantity],[Measures].[Note]}
Product.members comprises three members:
{[Product].[Bread],[Product].[Dairy],[Product].[Meat]}
Time.members comprises two members:
{[Time].[2000],[Time].[2001]}
Therefore, each dimension member makes up as follows on the point in the original query scope:
([Time].[2000],[Product].[Bread]);
([Time].[2000],[Product].[Dairy]);
([Time].[2000],[Product].[Meat]);
([Time].[2001],[Product].[Bread]);
([Time].[2001],[Product].[Dairy]);
([Time].[2001],[Product].[Meat])。
Step 402: filter at least one dimension member data attribute and obtain the original two-dimensional form for false point.
The data attribute of each dimension member on this aspect is judged in each dimension member's combination on the point in the traversal step 401 in the original scope, is that false point filters with at least one dimension member data attribute.Because [Time] .[2000] data attribute be false, therefore will include [Time] .[2000] have a few all delete after, obtain following three results:
([Time].[2001],[Product].[Bread]);
([Time].[2001],[Product].[Dairy]);
([Time].[2001],[Product].[Meat])。
As shown in table 10 below to query structure that should the result:
Time Product Amout Quantity Note
2001 Bread
2001 Dairy
2001 Meat
It is as shown in table 11 below to obtain the original two-dimensional form according to query structure table 10:
Time Product Amout Quantity Note
2001 Bread 2180.74 1046 Explanation
2001 Dairy 4014.91 1884 Explanation
2001 Meat Explanation
Step 403: only consider physically to exist the point of data, all unit physical datas are empty row or column in the filtration original two-dimensional form.
Traversal original two-dimensional form 11 judges whether Amount and Quantity all are empty, so physically do not exist data not consider owing to calculate member Note, the two-dimentional form that obtains ultimate demand after therefore last column in this form being filtered is as shown in table 12 below:
Time Product Amout Quantity Note
2001 Bread 2180.74 1046 Explanation
2001 Dairy 4014.91 1884 Explanation
The 3rd embodiment flow process of the inventive method as shown in Figure 5, the figure shows behind the data attribute that the concentrated dimension member of multidimensional data is set, when the member of calculating being arranged in the Multidimensional Expressions and need to consider all data, handle the flow process of non vacuum data by Non Empty key word.
Step 501: resolving has the Multidimensional Expressions that calculates the member to obtain inquiring about the original scope of axle.
Suppose that this Multidimensional Expressions is: With[Measures] .[Note] as ' illustrates ' Select (Measures.members, [Measures] .[Note]) on Columns, Non EmptyCrossJoin (Time.members, Product.members) on Rows from cubeName.This Multidimensional Expressions for the tolerance dimension definition one calculate the member, this member's value be that character string ' illustrates ' forever, need consider to calculate the data among the member ' Note ' when filtration.Resolving this Multidimensional Expressions, to obtain each dimension member as follows,
Measures.members comprises two members:
{[Measures].[Amount],[Measures].[Quantity]}
(Measures.members, [Measures] .[Note]) comprises three members:
{[Measures].[Amount],[Measures].[Quantity],[Measures].[Note]}
Product.members comprises three members:
{[Product].[Bread],[Product].[Dairy],[Product].[Meat]}
Time.members comprises two members:
{[Time].[2000],[Time].[2001]}
Therefore, each dimension member makes up as follows on the point in the original query scope:
([Time].[2000],[Product].[Bread]);
([Time].[2000],[Product].[Dairy]);
([Time].[2000],[Product].[Meat]);
([Time].[2001],[Product].[Bread]);
([Time].[2001],[Product].[Dairy]);
([Time].[2001],[Product].[Meat])。
Step 502: obtain the original two-dimensional form according to each dimension member query composition.
The query structure that at first obtains is as shown in table 13 below:
Time Product Amout Quantity Note
2000 Bread
2000 Dairy
2000 Meat
2001 Bread
2001 Dairy
2001 Meat
It is as shown in table 14 below to obtain the original two-dimensional form according to query structure table 13:
Time Product Amout Quantity Note
2000 Bread Explanation
2000 Dairy Explanation
2000 Meat Explanation
2001 Bread 2180.74 1046 Explanation
2001 Dairy 4014.91 1884 Explanation
2001 Meat Explanation
Step 503: consider to comprise all data of calculating the member, all cell datas are empty row or column in the filtration original two-dimensional form.
Travel through original two-dimensional form 14, judge whether Amount, Quantity and Note all are empty,, so obtain the two-dimentional form of ultimate demand with table 14 after traveling through this form because the data of calculating among the member ' Note ' are not empty.
The preferred embodiment block diagram of apparatus of the present invention is as shown in Figure 6:
This device comprises: structure cube module 610 is used for according to source data structure cube; With the module 620 that sets a property that structure cube module 610 links to each other, be used for when the concentrated dimension member of multidimensional data has data, the data attribute that this dimension member is set is for true, otherwise data attribute is false; The parsing Multidimensional Expressions module 630 that links to each other with the module 620 that sets a property, the Multidimensional Expressions that is used to resolve definition obtains inquiring about the original scope of axle; With the Rule of judgment module 640 that parsing Multidimensional Expressions module 630 links to each other, be used to only need to judge whether the data of consideration physics existence; The traversal dimension member module 650 that links to each other with Rule of judgment module 640, the Rule of judgment unit judges is used for when only need be considered the data of physics existence, travel through the combination of each dimension member on the point in the original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters; The traversal physical data module 660 that links to each other with traversal dimension member module 650, be used for traveling through all unit of each row or column decision of original two-dimensional form, when all unit all do not have the data of physics existence, this row or column filtration is obtained required two-dimentional form; And link to each other with Rule of judgment module 640 obtain original two-dimensional list cell 670, be used for when the Rule of judgment unit judges need be considered to comprise all data of calculating the member, concentrate from described multidimensional data according to original scope to obtain the original two-dimensional form; With obtain the ergodic data module 680 that original two-dimensional list cell 670 links to each other, be used for traveling through the unit of each row or column decision of original two-dimensional form, when data in all unit when all be empty, this row or column filtration is obtained required two-dimentional form.
Though described the present invention by embodiment, those of ordinary skills know, the present invention has many distortion and variation and do not break away from spirit of the present invention, wish that appended claim comprises these distortion and variation and do not break away from spirit of the present invention.

Claims (8)

1, handle the method for non vacuum data in a kind of on-line analysing processing system, it is characterized in that, comprising:
The original scope that the Multidimensional Expressions of A, parsing definition obtains inquiring about axle;
B, when only needing to consider the data that physics exists, travel through the combination of each dimension member on the point in the described original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters;
All unit of each row or column decision in C, the described original two-dimensional form of traversal when all unit all do not have the data of physics existence, obtain required two-dimentional form with this row or column filtration.
2, method according to claim 1 is characterized in that, described method also comprises:
S1, according to source data structure cube;
S2, when the dimension member who concentrates when multidimensional data has data, the data attribute that this dimension member is set is for true, otherwise data attribute be vacation.
3, method according to claim 2 is characterized in that, described step S2 comprises:
S21, during initially according to the dimension of the dimension table structure cube of source data, each dimension member's data attribute all is set to vacation;
S22, during according to the metric data of the fact table of source data structure cube, the data attribute that upgrades the described dimension member that data are arranged in this fact table is for true.
4, method according to claim 2 is characterized in that, among the described step B data is taken place at least one dimension member and is specially for empty point filters:
Judge the data attribute of each dimension member on the described point, when at least one dimension member's data attribute is that fictitious time filters this point.
5, method according to claim 1 is characterized in that, described method also comprises:
When needs consider to comprise all data of calculating the member, according to described original scope from the concentrated original two-dimensional form that obtains of described multidimensional data;
Travel through the unit of each row or column decision in the described original two-dimensional form, when the data in all unit all are sky, this row or column filtration is obtained required two-dimentional form.
6, handle the device of non vacuum data in a kind of on-line analysing processing system, it is characterized in that, comprising:
Resolve the Multidimensional Expressions unit, be used to resolve the original scope that the Multidimensional Expressions of definition obtains inquiring about axle;
The Rule of judgment unit is used to judge whether only need consider the data that physics exists;
Traversal dimension member unit, the Rule of judgment unit judges is used for when only need be considered the data of physics existence, travel through the combination of each dimension member on the point in the described original scope, and data are taken place at least one dimension member obtain the original two-dimensional form for empty point filters;
Handle the physics data cell, be used for traveling through all unit of each row or column decision of described original two-dimensional form, when all unit all do not have the data of physics existence, this row or column filtration is obtained required two-dimentional form.
7, device according to claim 6 is characterized in that, described device also comprises:
Structure cube unit is used for according to source data structure cube;
The unit that sets a property is used for when dimension member that multidimensional data is concentrated has data, and the data attribute that this dimension member is set is for true, otherwise data attribute be vacation.
8, device according to claim 6 is characterized in that, described device also comprises:
Obtain two-dimentional list cell, be used for when the Rule of judgment unit judges need be considered to comprise all data of calculating the member, concentrate from described multidimensional data according to described original scope to obtain the original two-dimensional form;
The ergodic data unit is used for traveling through the unit that each row or column of described original two-dimensional form determines, when the data in all unit all are sky, this row or column filtration is obtained required two-dimentional form.
CNB2007100030168A 2007-01-30 2007-01-30 Method and device for processing nonempty date in online analytical processing system Active CN100495403C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2007100030168A CN100495403C (en) 2007-01-30 2007-01-30 Method and device for processing nonempty date in online analytical processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2007100030168A CN100495403C (en) 2007-01-30 2007-01-30 Method and device for processing nonempty date in online analytical processing system

Publications (2)

Publication Number Publication Date
CN101008953A true CN101008953A (en) 2007-08-01
CN100495403C CN100495403C (en) 2009-06-03

Family

ID=38697386

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2007100030168A Active CN100495403C (en) 2007-01-30 2007-01-30 Method and device for processing nonempty date in online analytical processing system

Country Status (1)

Country Link
CN (1) CN100495403C (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102819616A (en) * 2011-12-28 2012-12-12 中华电信股份有限公司 Cloud online real-time multi-dimensional analysis system and method
CN102938097A (en) * 2012-09-28 2013-02-20 用友软件股份有限公司 Data processing device and data processing method for online analytical processing (OLAP) system
US20140074769A1 (en) * 2012-09-12 2014-03-13 International Business Machines Corporation Tuple reduction for hierarchies of a dimension
CN110442854A (en) * 2019-08-13 2019-11-12 北京源清慧虹信息科技有限公司 Generation method, device, computer equipment and the readable storage medium storing program for executing of report

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102819616A (en) * 2011-12-28 2012-12-12 中华电信股份有限公司 Cloud online real-time multi-dimensional analysis system and method
CN102819616B (en) * 2011-12-28 2015-09-16 中华电信股份有限公司 Cloud online real-time multi-dimensional analysis system and method
US20140074769A1 (en) * 2012-09-12 2014-03-13 International Business Machines Corporation Tuple reduction for hierarchies of a dimension
US9886460B2 (en) * 2012-09-12 2018-02-06 International Business Machines Corporation Tuple reduction for hierarchies of a dimension
CN102938097A (en) * 2012-09-28 2013-02-20 用友软件股份有限公司 Data processing device and data processing method for online analytical processing (OLAP) system
CN102938097B (en) * 2012-09-28 2016-09-28 用友网络科技股份有限公司 Data processing equipment and data processing method for on-line analysing processing system
CN110442854A (en) * 2019-08-13 2019-11-12 北京源清慧虹信息科技有限公司 Generation method, device, computer equipment and the readable storage medium storing program for executing of report
CN110442854B (en) * 2019-08-13 2023-04-07 北京源清慧虹信息科技有限公司 Report generation method and device, computer equipment and readable storage medium

Also Published As

Publication number Publication date
CN100495403C (en) 2009-06-03

Similar Documents

Publication Publication Date Title
CN109669934B (en) Data warehouse system suitable for electric power customer service and construction method thereof
Ghazal et al. Bigbench: Towards an industry standard benchmark for big data analytics
EP3740880A1 (en) Pick and applicator for use with a stringed instrument
Jensen et al. Converting XML DTDs to UML diagrams for conceptual data integration
Dehdouh Building OLAP cubes from columnar NoSQL data warehouses
CN102917009B (en) A kind of stock certificate data collection based on cloud computing technology and storage means and system
CN107016501A (en) A kind of efficient industrial big data multidimensional analysis method
CN105159971B (en) A kind of cloud platform data retrieval method
EP1909198A2 (en) Semantical partitioning of data
CN101008954A (en) Multidimensional expression data caching method and device in online analytical processing system
GB2522524A (en) Extract, transform and load (ETL) system and method
CN100495403C (en) Method and device for processing nonempty date in online analytical processing system
Wang et al. A review of input–output model application hot spots in the energy and environment fields based on co-words network analysis
Suriarachchi et al. Big provenance stream processing for data intensive computations
US10983997B2 (en) Path query evaluation in graph databases
CN104899652A (en) Cultural performing operation platform decision support system under integrated data interaction
MahmoudiNasab et al. AdaptRDF: adaptive storage management for RDF databases
US8250024B2 (en) Search relevance in business intelligence systems through networked ranking
WO2016119508A1 (en) Method for recognizing large-scale objects based on spark system
Hassan Storage structures in the era of big data: from data warehouse to lakehouse
Mohammed et al. Selectivity estimation of extended XML query tree patterns based on prime number labeling and synopsis modeling
Phan et al. Mining multi-relational gradual patterns
Jemal et al. MapReduce-DBMS: an integration model for big data management and optimization
Taniar et al. Performance analysis of “Groupby-After-Join” query processing in parallel database systems
CN110825786A (en) Spark-based big data association rule mining method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant