CN106933902A - Querying method and device that data multidimensional degree is freely dissected - Google Patents
Querying method and device that data multidimensional degree is freely dissected Download PDFInfo
- Publication number
- CN106933902A CN106933902A CN201511032274.XA CN201511032274A CN106933902A CN 106933902 A CN106933902 A CN 106933902A CN 201511032274 A CN201511032274 A CN 201511032274A CN 106933902 A CN106933902 A CN 106933902A
- Authority
- CN
- China
- Prior art keywords
- index
- data
- dimension
- checked
- index table
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2453—Query optimisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/283—Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
This application discloses querying method and device that a kind of data multidimensional degree is freely dissected.The method includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table set, wherein, multiple dimensions are the dimension for needing to be dissected;Determine the index table and non-index table in associated data table set, wherein, index table is the table comprising index to be checked in associated data table set, and non-index table is the table not comprising index to be checked in associated data table set;Filtration treatment is carried out to non-index table according to default filter condition;Non- index table after index table and filtering is attached, data sublist is obtained;And index to be checked is inquired about in data sublist and dimension is dissected, wherein, it refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension.By the application, solve the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low.
Description
Technical field
The application is related to data processing field, in particular to the querying method that a kind of data multidimensional degree is freely dissected
And device.
Background technology
Under distributed environment, query engine conventional at present has a Hive, Impala these structurings with metadata
Database, in these databases, generally, by the data Ji Lu of all kinds of events each tables of data.For example,
In " Education Administration Information System ", teaching management database includes following tables of data:" teacher " table, " course "
Table, " achievement " table, " student " table, " class " table and " giving lessons " table etc., are used for managing by data above table
The information such as reason teaching process middle school student, teacher, course.Again for example, the session that internet is carried out is monitored, typically have
Conversational list (Session), page browsing table (PageView), search in Website table (SiteSearch), order table
(Ecommerce), customized event table (Event) etc. much represents the tables of data of miscellaneous service scene, but all can
It is associated by the session identification (SessionID) of client, forms all entities of whole session.When user needs
Will from multiple angles across multiple tables of data data correlation check achievement data and dissect dimension when, correlation technique
In, according to user's query demand, respective code is write, so as to the achievement data and phase that go inquiry related in database
Dimension is closed, due to lacking the technical scheme that user's query demand is converted into inquiry for unification in correlation technique, causes dividing
The search efficiency that data multidimensional degree in analysis field is freely dissected is low.
The relatively low problem of the search efficiency that freely dissects for data multidimensional degree in correlation technique, not yet proposes effective at present
Solution.
The content of the invention
The main purpose of the application is to provide querying method and device that a kind of data multidimensional degree is freely dissected, to solve
The relatively low problem of data multidimensional degree is freely dissected in correlation technique search efficiency.
To achieve these goals, according to the one side of the application, there is provided what a kind of data multidimensional degree was freely dissected
Querying method.The method includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, is associated
Tables of data set, wherein, multiple dimensions are the dimension for needing to be dissected;Determine the index in associated data table set
Table and non-index table, wherein, index table is the table comprising index to be checked in associated data table set, and non-index table is
Table not comprising index to be checked in associated data table set;Non- index table is carried out at filtering according to default filter condition
Reason;Non- index table after index table and filtering is attached, data sublist is obtained;And inquired about in data sublist
Index to be checked and anatomy dimension, wherein, it refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension.
Further, after filtration treatment is carried out to non-index table according to default filter condition, by index table and mistake
Non- index table after filter is attached, and before obtaining data sublist, the method also includes:It is determined that the non-index after filtering
Connection row in table, wherein, connection is classified as the data row for needing to be attached with index table, after index table and filtering
Non- index table be attached, obtaining data sublist includes:With index in connecting the non-index table after arranging filtering
Table is attached, and obtains data sublist.
Further, after filtration treatment is carried out to non-index table according to default filter condition, by index table and mistake
Non- index table after filter is attached, and before obtaining data sublist, the method also includes:According to default filter condition pair
Index table carries out filtration treatment;Determine link field in the index table after filtration treatment and related to index to be checked
Field, wherein, link field is the field for needing to be attached with the non-index table after filtering;After filtration treatment
The table of link field and the field composition related to index to be checked in index table will refer to as the index table after filtering
Non- index table after mark table and filtering is attached, and obtaining data sublist includes:Non- index table and filtering after by filtering
Index table afterwards is attached, and obtains data sublist.
Further, index to be checked is inquired about in data sublist and dimension is dissected includes:Chosen in data sublist and treated
Inquiry dimension row, dimension to be checked is classified as dimension row to be checked in data sublist;Determine in data sublist to be checked
The corresponding index row of index;And according to the corresponding index row inquiry index to be checked of index to be checked in data sublist,
Dimension anatomy is carried out to dimension to be checked row according to multiple dimensions.
Further, the tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table collection
Conjunction includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data table;
Obtain the associated key that is mutually related between each associated data table at least one associated data table;And by associated key
At least one associated data table is associated, associated data table set is obtained.
Further, the tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table collection
Conjunction includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data table;
Duplicate removal filtration treatment is carried out to identical tables of data at least one associated data table, the associated data table after being filtered;
And using the associated data table after filtering as associated data table set.
Further, it is determined that the index table and non-index table in associated data table set include:Receive looking into for outside input
Ask instruction;Index to be checked is determined according to query statement;And divided the set of associated data table according to index to be checked
It is index table and non-index table.
To achieve these goals, according to the another aspect of the application, there is provided what a kind of data multidimensional degree was freely dissected
Inquiry unit, the device includes:Acquiring unit, for obtaining the number being associated with each dimension in multiple dimensions respectively
According to table, associated data table set is obtained, wherein, multiple dimensions are the dimension for needing to be dissected;First determining unit,
For determining the index table in associated data table set and non-index table, wherein, index table is in associated data table set
Table comprising index to be checked, non-index table is the table not comprising index to be checked in associated data table set;Treatment is single
Unit, for carrying out filtration treatment to non-index table according to default filter condition;Connection unit, for by index table and mistake
Non- index table after filter is attached, and obtains data sublist;And query unit, treated for being inquired about in data sublist
Inquiry index and anatomy dimension, wherein, it refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension.
Further, the device also includes:Second determining unit, for determining the connection in the non-index table after filtering
Row, wherein, connection is classified as the data row for needing to be attached with index table, and connection unit is additionally operable to will by connecting row
It is attached with index table in non-index table after filtering, obtains data sublist.
Further, query unit includes:Module is chosen, for choosing dimension row to be checked in data sublist, is treated
Inquiry dimension is classified as dimension row to be checked in data sublist;Determining module, it is to be checked for determining in data sublist
The corresponding index row of index;And enquiry module, for being arranged according to the corresponding index of index to be checked in data sublist
Index to be checked is inquired about, dimension anatomy is carried out to dimension to be checked row according to multiple dimensions.
By the application, using following steps:The tables of data being associated with each dimension in multiple dimensions is obtained respectively,
Associated data table set is obtained, wherein, multiple dimensions are the dimension for needing to be dissected;Determine associated data table set
In index table and non-index table, wherein, index table be associated data table set in comprising index to be checked table, it is non-
Index table is the table not comprising index to be checked in associated data table set;Non- index table is entered according to default filter condition
Row filtration treatment;Non- index table after index table and filtering is attached, data sublist is obtained;And in data
Index to be checked is inquired about in table and dimension is dissected, wherein, it refers to that data sublist is carried out according to multiple dimensions to dissect dimension
Dimension is dissected, and solves the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, is realized many
Table linkage carries out multi-dimensional free analysis, reduces performance cost, and then reached the efficiency of lifting inquiry data target
Effect.
Brief description of the drawings
The accompanying drawing for constituting the part of the application is used for providing further understanding of the present application, the schematic reality of the application
Apply example and its illustrate for explaining the application, do not constitute the improper restriction to the application.In the accompanying drawings:
Fig. 1 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application first embodiment;
Fig. 2 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application second embodiment;And
Fig. 3 is the schematic diagram of the inquiry unit freely dissected according to the data multidimensional degree of the embodiment of the present application.
Specific embodiment
It should be noted that in the case where not conflicting, the feature in embodiment and embodiment in the application can phase
Mutually combination.Describe the application in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
In order that those skilled in the art more fully understand application scheme, below in conjunction with the embodiment of the present application
Accompanying drawing, is clearly and completely described to the technical scheme in the embodiment of the present application, it is clear that described embodiment
The only embodiment of the application part, rather than whole embodiments.Based on the embodiment in the application, ability
The every other embodiment that domain those of ordinary skill is obtained under the premise of creative work is not made, should all belong to
The scope of the application protection.
It should be noted that term " first ", " in the description and claims of this application and above-mentioned accompanying drawing
Two " it is etc. for distinguishing similar object, without for describing specific order or precedence.It should be appreciated that this
The data that sample is used can be exchanged in the appropriate case, so as to embodiments herein described herein.Additionally, term
" comprising " and " having " and their any deformation, it is intended that covering is non-exclusive to be included, for example, comprising
The process of series of steps or unit, method, system, product or equipment are not necessarily limited to those steps clearly listed
Rapid or unit, but may include not listing clearly or intrinsic for these processes, method, product or equipment
Other steps or unit.
Term is explained:
Index:Refer to the value that can carry out aminated polyepichlorohydrin, for example, page views are an indexs, using sue for peace into
Row polymerization;Mean residence time is also an index, using being averagely polymerized, wherein, aminated polyepichlorohydrin include summation,
Averagely, count etc..
Dimension:Refer to the angle for checking index, for example, browser is a dimension, can be from this dimension of browser
Query page browses (Page View, referred to as PV), such that it is able to know user checks the page using which browser,
And the number of times of the page is checked using these browsers;Operating system be another dimension, it is also possible to from operating system this
Individual dimension inquires about PV, such that it is able to know user checks the page using which operating system, and operates system using these
System checks the number of times of the page.
Member:Refer to the specific corresponding object of dimension, for example, browser is a dimension, and IE browser, Chrome
Browser is a member of the dimension.
Various dimensions are dissected:Refer to that can dissect multiple from multiple dimensions to refer to target value, for example, being looked into from operating system dimension
After seeing session value, after selecting wherein several members, continuation checks session value from browser dimension, i.e., from two dimensions
Dissect session value.The like, can freely be dissected from multiple dimensions.
According to embodiments herein, there is provided the querying method that a kind of data multidimensional degree is freely dissected.
Fig. 1 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application first embodiment.As schemed
Shown in 1, the method is comprised the following steps:
Step S101, obtains the tables of data being associated with each dimension in multiple dimensions respectively, obtains associated data table collection
Close, wherein, multiple dimensions are the dimension for needing to be dissected.
Dimension (dimension) is the structural characteristics of cube.They are for describing data in tables of data
The hierarchical structure in a organized way (rank) of classification.These classification describe some similar member sets with rank, and user will
It is analyzed based on these member sets.Dimension table can be regarded as the window that user carrys out analyze data, be wrapped in dimension table
Characteristic containing true record in tables of data, some characteristics provide descriptive information, and some characteristics specify how combined data
Table data, to provide useful information, hierarchical structure of the dimension table comprising the characteristic for helping combined data for analyst.
For example, tables of data is sales volume table, dimension table is exactly regional table.Dissect somewhere commodity sales volume, be from area this
Angle observes Sales Volume of Commodity, dissects the sales volume of the regional commodity such as Beijing, Shanghai and Guangzhou, is to observe business from multiple angles
Product sales volume.
The querying method that multiple dimensions in step S101 are freely dissected for the data multidimensional degree of the application first embodiment
The middle dimension for needing to be dissected, in distributed data base, obtain is associated respectively with each dimension in multiple dimensions
Tables of data, obtain associated data table set.Because some tables of data may obtain comprising two or more dimension
To the tables of data being associated with each dimension in multiple dimensions in there is identical tables of data, in order to lifted obtain association
The accuracy of tables of data set is freely cutd open in the data multidimensional degree of the application first embodiment, it is necessary to carry out duplicate removal treatment
In the querying method of analysis, the tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtain associated data table
Set includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data
Table;Duplicate removal filtration treatment is carried out to identical tables of data at least one associated data table, the incidence number after being filtered
According to table;And using the associated data table after filtering as associated data table set.
Duplicate removal filtration treatment is carried out by the associated data table for getting, the associated data table after being filtered;And
Using the associated data table after filtering as associated data table set, the accuracy for obtaining associated data table set is improved.
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, respectively obtain with
The associated tables of data of each dimension in multiple dimensions, obtaining associated data table set includes:Obtain respectively and tieed up with multiple
The associated tables of data of each dimension, obtains at least one associated data table in degree;Obtain at least one associated data table
In be mutually related between each associated data table associated key;And at least one associated data table is associated by associated key,
Obtain associated data table set.
For example, at least one associated data table is associated by session identification (sessionID), obtain incidence number
According to table set.Specific associated key is not limited in this application.Can only be with other number if there is the tables of data having
According to the relevant key of table, and can only by other tables of data complete with gather in other tables of data associate, in the application
In its specific associated key is also not especially limited.
By above-mentioned steps, the tables of data being associated with each dimension in multiple dimensions is got, that is, get and subsequently look into
Ask the metadata information of achievement data.It should be noted that the inquiry that the data multidimensional degree that the application is provided freely is dissected
Method is, the querying method that the data multidimensional degree in analysis field is freely dissected.
Step S102, determines the index table and non-index table in associated data table set, wherein, index table is incidence number
According to the table comprising index to be checked in table set, non-index table is not comprising index to be checked in associated data table set
Table.
Above-mentioned associated data table set is made up of index table and non-index table, and index table is bag in associated data table set
Table containing index to be checked, non-index table is the table not comprising index to be checked in associated data table set.Finger to be checked
Being designated as user needs the index of inquiry, for example, index to be checked can be visit capacity, page views are average to stop
Time, jump out rate, average page access number etc..
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, incidence number is determined
Can also be realized by following steps according to the index table in table set and non-index table:Receive the query statement of outside input;
Index to be checked is determined according to query statement;And associated data table set is divided into by index table according to index to be checked
With non-index table.
By above step, associated data table set is divided into index table and non-index table.
Step S103, filtration treatment is carried out according to default filter condition to non-index table.
Filtration treatment is carried out to non-index table according to default filter condition, the default filter condition in step S103 can be:
All Session dimensions filterings and the peculiar dimension filtering of the non-index table.
Wherein, the default filter condition is the metadata according to matching dimensionality and index, forms selection required for each table
Row and filtering condition.
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, according to default
Filter condition is carried out after filtration treatment to non-index table, and the non-index table after by index table and filtering is attached,
Before obtaining data sublist, the method also includes:It is determined that filtering after non-index table in connection row, wherein, connection
The data row for needing to be attached with index table are classified as, the non-index table after index table and filtering is attached, obtained
Data sublist includes:It is attached with index table by the non-index table after connecting row by filtering, obtains data sublist.
By alternative column, and (distinct) table linkage field is chosen, for example, SessionID, ecommerceid,
Sitesearchid, does distinct operations, it is to avoid data expansion when rear continued is connected, the effect of influence inquiry achievement data
Rate.
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, according to default
Filter condition is carried out after filtration treatment to non-index table, and the non-index table after by index table and filtering is attached,
Before obtaining data sublist, the method also includes:Filtration treatment is carried out to index table according to default filter condition;It is determined that
Link field and the field related to index to be checked in index table after filtration treatment, wherein, link field is to need
The field to be attached with the non-index table after filtering;By the link field in the index table after filtration treatment and with treat
The table of the related field composition of inquiry index enters the non-index table after index table and filtering as the index table after filtering
Row connection, obtaining data sublist includes:The index table after non-index table and filtering after by filtering is attached, and obtains
Data sublist.
Filtration treatment is carried out to index table according to default filter condition, default filter condition can be:All Session
Dimension is filtered and the peculiar dimension of the index table is filtered.By determining the relevant field that table linkage field and index are calculated,
Filtration treatment is carried out to index table, it is to avoid when rear continued is connected, data expansion, the efficiency of influence inquiry achievement data.
Step S104, the non-index table after index table and filtering is attached, and obtains data sublist.
Non- index table after index table and filtering various ways are attached, for example, interior connection, outer connection and friendship
Fork connection etc..Only comprising the row for meeting condition in the Connection inquiring result set of interior connection, interior connection is SQL Server
Default connected mode, it is different according to the manner of comparison that is used, interior connection be divided into again equivalent connection, Nature Link and
Do not wait three kinds of connection;Comprising the combination of all rows in two tables in the Connection inquiring result set of interconnection;Outer connection
Both meet the row of condition in Connection inquiring result set comprising those, also the whole rows comprising wherein certain table, there are 3 kinds of shapes
The outer connection of formula:Left outside connection, right outer connection, complete outer connection.Can realize that multiple tables are inquired about by concatenation operator.
Attended operation brings very big flexibility to user, can at any time increase new data type.For different entities are created
New table is built, is then inquired about by connection.
By with any one in upper type, or other connected modes enter the non-index table after index table and filtering
Row connection, obtains data sublist.
Step S105, index to be checked is inquired about in data sublist and dimension is dissected, wherein, it refers to basis to dissect dimension
Multiple dimensions carry out dimension anatomy to data sublist.
By the method including step S101 to step S105, by defining sublist, and carried out by table linkage field
Distinct choose, distributed environment be especially good at calculate Distinct set, than table connect performance faster, then will knot
Fruit set carries out table connection, obtains data sublist, so as to avoid data expansion, reaches correct inquiry achievement data
Purpose, solves the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, realizes multilist connection
It is dynamic to carry out multi-dimensional free analysis, performance cost is reduced, and then reached the effect of the efficiency of lifting inquiry data target
Really.
In sum, the querying method that the data multidimensional degree that the application first embodiment is provided freely is dissected, by respectively
The tables of data being associated with each dimension in multiple dimensions is obtained, associated data table set is obtained, wherein, multiple dimensions
For the dimension that needs are dissected;Determine the index table and non-index table in associated data table set, wherein, index table
It is the table comprising index to be checked in associated data table set, non-index table is to be checked not include in associated data table set
Ask the table of index;Filtration treatment is carried out to non-index table according to default filter condition;By the non-finger after index table and filtering
Mark table is attached, and obtains data sublist;And index to be checked is inquired about in data sublist and dimension is dissected, wherein,
It refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension, solves data multidimensional degree in correlation technique
The relatively low problem of the search efficiency that freely dissects, realizing multilist linkage carries out multi-dimensional free analysis, reduces performance
Expense, and then reached the effect of the efficiency of lifting inquiry data target.
Fig. 2 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application second embodiment.Fig. 2
Can be as a kind of preferred embodiment of embodiment illustrated in fig. 1.As shown in Fig. 2 the method is comprised the following steps:
Step S201, obtains the tables of data being associated with multiple dimensions respectively, obtains associated data table set.
Step S201 will not be repeated here with above-mentioned steps S101.
Step S202, determines the index table and non-index table in associated data table set, wherein, index table is incidence number
According to the table comprising index to be checked in table set, non-index table is not comprising index to be checked in associated data table set
Table.
Step S202 will not be repeated here with above-mentioned steps S102.
Step S203, filtration treatment is carried out according to default filter condition to non-index table.
Step S203 will not be repeated here with above-mentioned steps S103.
Step S204, the non-index table after index table and filtering is attached, and obtains data sublist.
Step S204 will not be repeated here with above-mentioned steps S104.
Step S205, chooses dimension to be checked row in data sublist, and dimension to be checked is classified as to be checked in data sublist
Dimension row.
When multidimensional is dissected, the incoming anatomy chain (default filter condition) of meeting, and the index of correlation on the index table for dissecting,
Filtered according to chain is dissected, if the dimension filtering of other tables of data comprising non-index table, first by other data
Table is filtered by relevant dimension, then by the result after filtering, is attached (according to metadata) with index table, so
Just the filter condition in other tables of data can be applied, for example, dissect chain being:Operating system version (window7)>>
Browser (chrome)>>Whether new visitor's (YES)>>The page amount of checking (1)>>Access duration.In data
A length of dimension row to be checked during dimension row, the i.e. access to be checked are chosen in table.
Step S206, determines the corresponding index row of index to be checked in data sublist.
The corresponding index row of index to be checked are determined in data sublist.For example, according to the corresponding index of index to be checked
Be classified as queried access amount, page views, mean residence time, jump out the indexs such as rate, average page access number row.
It should be noted that the unique value of alternative column is more, performance is slower;The length of alternative column is more long, and performance is slower.
Step S207, according to the corresponding index row inquiry index to be checked of index to be checked in data sublist, according to many
Individual dimension carries out dimension anatomy to dimension to be checked row.
For example, according to the corresponding index row queried access amount of index to be checked, page views, flat in data sublist
Equal residence time, the achievement data for jumping out the indexs such as rate, average page access number row.According to be checked in data sublist
Ask whether the inquiry of dimension row is that new visitor, browser dimension, the page amount of checking etc. dissect dimension.
In the querying method that the data multidimensional degree of the application second embodiment is freely dissected, can be by Group By
The row of relevant inquiring;OrderBy indexs of correlation are arranged;Pageing is operated;GroupBy dimensions row and index row are chosen,
Inquire the corresponding achievement data of index to be checked.
By the method including step S201 to step S207, dimension to be checked can be selected to arrange in data sublist,
Can be arranged with related index, multi-dimensional free analysis are carried out so as to carry out multilist linkage, reduce performance cost, and then
The effect of the efficiency of lifting inquiry data target is reached.
In sum, the querying method that the data multidimensional degree that the application second embodiment is provided freely is dissected, by respectively
The tables of data being associated with each dimension in multiple dimensions is obtained, associated data table set is obtained, wherein, multiple dimensions
For the dimension that needs are dissected;Determine the index table and non-index table in associated data table set, wherein, index table
It is the table comprising index to be checked in associated data table set, non-index table is to be checked not include in associated data table set
Ask the table of index;Filtration treatment is carried out to non-index table according to default filter condition;By the non-finger after index table and filtering
Mark table is attached, and obtains data sublist;Dimension row to be checked are chosen in data sublist, dimension to be checked is classified as number
Arranged according to dimension to be checked in sublist;The corresponding index row of index to be checked are determined in data sublist;And in data
According to the corresponding index row inquiry achievement data of index to be checked in sublist, dimension is dissected according to dimension to be checked row inquiry,
Solve the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, realizing multilist linkage is carried out
Multi-dimensional free analysis, reduce performance cost, and then reached the effect of the efficiency of lifting inquiry data target.
It should be noted that can be in such as one group computer executable instructions the step of the flow of accompanying drawing is illustrated
Performed in computer system, and, although logical order is shown in flow charts, but in some cases, can
Shown or described step is performed with different from order herein.
The embodiment of the present application additionally provides the inquiry unit that a kind of data multidimensional degree freely dissects, it is necessary to explanation, this
Apply for that the inquiry unit that the data multidimensional degree of embodiment is freely dissected can be used for performing the use that the embodiment of the present application is provided
In the querying method that data multidimensional degree is freely dissected.The data multidimensional degree that the embodiment of the present application is provided freely is dissected below
Inquiry unit be introduced.
Fig. 3 is the schematic diagram of the inquiry unit freely dissected according to the data multidimensional degree of the embodiment of the present application.Such as Fig. 3 institutes
Show, the device includes:Acquiring unit 10, the first determining unit 20, processing unit 30, connection unit 40 and inquiry
Unit 50.
Acquiring unit 10, for obtaining the tables of data being associated with each dimension in multiple dimensions respectively, obtains incidence number
According to table set, wherein, multiple dimensions are the dimension for needing to be dissected.
First determining unit 20, for determining the index table in associated data table set and non-index table, wherein, index
Table is the table comprising index to be checked in associated data table set, and non-index table is treated not include in associated data table set
Inquire about the table of index.
Processing unit 30, for carrying out filtration treatment to non-index table according to default filter condition.
Connection unit 40, for the non-index table after index table and filtering to be attached, obtains data sublist.
Query unit 50, for inquiring about index to be checked in data sublist and dissecting dimension, wherein, dissecting dimension is
Finger carries out dimension anatomy according to multiple dimensions to data sublist.
The inquiry unit that the data multidimensional degree that the embodiment of the present application is provided freely is dissected, is obtained respectively by acquiring unit 10
The tables of data being associated with each dimension in multiple dimensions, obtains associated data table set, wherein, multiple dimensions are to need
The dimension for being dissected;First determining unit 20 determines the index table and non-index table in associated data table set, its
In, index table is the table comprising index to be checked in associated data table set, and non-index table is in associated data table set
Table not comprising index to be checked;Processing unit 30 carries out filtration treatment according to default filter condition to non-index table;Even
Be attached for non-index table after index table and filtering by order unit 40, obtains data sublist;And query unit 50
Index to be checked is inquired about in data sublist and dimension is dissected, wherein, it refers to data according to multiple dimensions to dissect dimension
Sublist carries out dimension anatomy, solves the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low,
Realizing multilist linkage carries out multi-dimensional free analysis, reduces performance cost, and then has reached lifting inquiry data and refer to
The effect of target efficiency.
Alternatively, in the inquiry unit that the data multidimensional degree that the embodiment of the present application is provided freely is dissected, the device is also wrapped
Include:Second determining unit, for determine filtering after non-index table in connection row, wherein, connection be classified as needs with
The data row that index table is attached, connection unit is additionally operable in connecting the non-index table after arranging filtering and index
Table is attached, and obtains data sublist.
Alternatively, in the inquiry unit that the data multidimensional degree that the embodiment of the present application is provided freely is dissected, query unit 50
Including:Module is chosen, for choosing dimension row to be checked in data sublist, dimension to be checked is classified as in data sublist
Dimension row to be checked;Determining module, for determining the corresponding index row of index to be checked in data sublist;And
Enquiry module, for inquiring about index to be checked according to the corresponding index row of index to be checked in data sublist, according to many
Individual dimension carries out dimension anatomy to dimension to be checked row.
It should be noted that for foregoing each method embodiment, in order to be briefly described, therefore it is all expressed as one it is
The combination of actions of row, but those skilled in the art should know, and the application is not limited by described sequence of movement
System, because according to the application, some steps can sequentially or simultaneously be carried out using other.Secondly, art technology
Personnel should also know that embodiment described in this description belongs to preferred embodiment, involved action and module
Not necessarily necessary to the application.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have the portion described in detail in certain embodiment
Point, may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed device, can be by other sides
Formula is realized.For example, device embodiment described above is only schematical, such as the division of described unit, only
Only a kind of division of logic function, can there is other dividing mode when actually realizing, such as multiple units or component can
To combine or be desirably integrated into another system, or some features can be ignored, or not perform.
The unit that is illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit
The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to
On multiple NEs.Some or all of unit therein can be according to the actual needs selected to realize the present embodiment
The purpose of scheme.
In addition, during each functional unit in the application each embodiment can be integrated in a processing unit, it is also possible to
It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.It is above-mentioned integrated
Unit can both be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.
Obviously, those skilled in the art should be understood that each module or each step of above-mentioned the application can be with general
Computing device realize that they can be concentrated on single computing device, or be distributed in multiple computing device institutes
On the network of composition, alternatively, they can be realized with the executable program code of computing device, it is thus possible to
It is stored in being performed by computing device in storage device, or they is fabricated to each integrated circuit die respectively
Block, or the multiple modules or step in them are fabricated to single integrated circuit module to realize.So, the application
Any specific hardware and software is not restricted to combine.
The preferred embodiment of the application is the foregoing is only, the application is not limited to, for those skilled in the art
For member, the application can have various modifications and variations.It is all within spirit herein and principle, made it is any
Modification, equivalent, improvement etc., should be included within the protection domain of the application.
Claims (10)
1. the querying method that a kind of data multidimensional degree is freely dissected, it is characterised in that including:
The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table set, its
In, the multiple dimension is the dimension for needing to be dissected;
Determine the index table and non-index table in the associated data table set, wherein, the index table is described
Table comprising index to be checked in associated data table set, the non-index table is in the associated data table set
Table not comprising the index to be checked;
Filtration treatment is carried out to the non-index table according to default filter condition;
Non- index table after the index table and filtering is attached, data sublist is obtained;And
Index to be checked is inquired about in the data sublist and dimension is dissected, wherein, the anatomy dimension refers to root
Dimension anatomy is carried out to the data sublist according to the multiple dimension.
2. method according to claim 1, it is characterised in that
After filtration treatment is carried out to the non-index table according to default filter condition, by the index table and
Non- index table after filtering is attached, and before obtaining data sublist, methods described also includes:Determine the mistake
Connection row in non-index table after filter, wherein, the connection is classified as to be needed to be attached with the index table
Data are arranged,
Non- index table after the index table and filtering is attached, obtaining data sublist includes:By described
It is attached with the index table in non-index table after row are connected by filtering, obtains the data sublist.
3. method according to claim 1 and 2, it is characterised in that
After filtration treatment is carried out to the non-index table according to default filter condition, by the index table and
Non- index table after filtering is attached, and before obtaining data sublist, methods described also includes:According to described pre-
If filter condition carries out filtration treatment to the index table;Determine the link field in the index table after filtration treatment
And the field related to the index to be checked, wherein, the link field be need with the filtering after it is non-
The field that index table is attached;By the link field in the index table after the filtration treatment and to be checked with described
The table of the related field composition of index is ask as the index table after filtering,
Non- index table after the index table and filtering is attached, obtaining data sublist includes:After filtering
Non- index table and filtering after index table be attached, obtain the data sublist.
4. method according to claim 1, it is characterised in that inquired about in the data sublist index to be checked and
Dissecting dimension includes:
Dimension row to be checked are chosen in the data sublist, the dimension to be checked is classified as in the data sublist
Dimension row to be checked;
The corresponding index row of the index to be checked are determined in the data sublist;And
According to the corresponding index row inquiry index to be checked of the index to be checked in the data sublist,
Dimension anatomy is carried out to the dimension row to be checked according to the multiple dimension.
5. method according to claim 1, it is characterised in that obtain related with each dimension in multiple dimensions respectively
The tables of data of connection, obtaining associated data table set includes:
The tables of data being associated with each dimension in the multiple dimension is obtained respectively, obtains at least one incidence number
According to table;
Obtain the associated key that is mutually related between each associated data table at least one associated data table;With
And
At least one associated data table is associated by the associated key, the associated data table set is obtained.
6. method according to claim 1, it is characterised in that obtain related with each dimension in multiple dimensions respectively
The tables of data of connection, obtaining associated data table set includes:
The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data table;
Duplicate removal filtration treatment is carried out to identical tables of data at least one associated data table, after being filtered
Associated data table;And
Using the associated data table after the filtering as the associated data table set.
7. method according to claim 1, it is characterised in that determine the index table in the associated data table set
Include with non-index table:
Receive the query statement of outside input;
The index to be checked is determined according to the query statement;And
The associated data table set is divided into by the index table and the non-index according to the index to be checked
Table.
8. the inquiry unit that a kind of data multidimensional degree is freely dissected, it is characterised in that including:
Acquiring unit, for obtaining the tables of data being associated with each dimension in multiple dimensions respectively, is associated
Tables of data set, wherein, the multiple dimension is the dimension for needing to be dissected;
First determining unit, for determining the index table in the associated data table set and non-index table, wherein,
The index table is the table comprising index to be checked in the associated data table set, and the non-index table is described
Table not comprising the index to be checked in associated data table set;
Processing unit, for carrying out filtration treatment to the non-index table according to default filter condition;
Connection unit, for the non-index table after the index table and filtering to be attached, obtains data sublist;
And
Query unit, for inquiring about index to be checked in the data sublist and dissecting dimension, wherein, it is described
It refers to carry out dimension anatomy to the data sublist according to the multiple dimension to dissect dimension.
9. device according to claim 8, it is characterised in that described device also includes:Second determining unit, uses
In it is determined that connection row in non-index table after the filtering, wherein, the connection is classified as needs and the index
The data row that table is attached,
The connection unit is additionally operable to enter with the index table by the non-index table after the connection row are by filtering
Row connection, obtains the data sublist.
10. device according to claim 8, it is characterised in that the query unit includes:
Module is chosen, for choosing dimension row to be checked in the data sublist, the dimension to be checked is classified as
Dimension row to be checked in the data sublist;
Determining module, for determining the corresponding index row of the index to be checked in the data sublist;And
Enquiry module, in the data sublist according to the corresponding index row inquiry institute of the index to be checked
Index to be checked is stated, dimension anatomy is carried out to the dimension row to be checked according to the multiple dimension.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511032274.XA CN106933902B (en) | 2015-12-31 | 2015-12-31 | Data multidimensional free analysis query method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511032274.XA CN106933902B (en) | 2015-12-31 | 2015-12-31 | Data multidimensional free analysis query method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106933902A true CN106933902A (en) | 2017-07-07 |
CN106933902B CN106933902B (en) | 2020-02-07 |
Family
ID=59444177
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201511032274.XA Active CN106933902B (en) | 2015-12-31 | 2015-12-31 | Data multidimensional free analysis query method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106933902B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109635014A (en) * | 2018-12-12 | 2019-04-16 | 苏州思必驰信息科技有限公司 | The display methods and system of user's assets information |
CN110209687A (en) * | 2018-02-23 | 2019-09-06 | 北京国双科技有限公司 | The querying method and device of various dimensions attribution |
CN112948441A (en) * | 2021-03-26 | 2021-06-11 | 浪潮通用软件有限公司 | Financial data-oriented multidimensional data aggregation method and equipment |
CN114579619A (en) * | 2022-04-28 | 2022-06-03 | 北京达佳互联信息技术有限公司 | Data query method and device, electronic equipment and storage medium |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103699966A (en) * | 2013-04-12 | 2014-04-02 | 国家电网公司 | Multidimensional overall-process data control system and control method based on user requirement |
CN104391951A (en) * | 2014-11-27 | 2015-03-04 | 北京国双科技有限公司 | Web page thermodynamic diagram loading method and device |
CN104392001A (en) * | 2014-12-15 | 2015-03-04 | 北京国双科技有限公司 | Database inquiry method and device |
CN104408180A (en) * | 2014-12-15 | 2015-03-11 | 北京国双科技有限公司 | Stored data inquiring method and device |
CN104408179A (en) * | 2014-12-15 | 2015-03-11 | 北京国双科技有限公司 | Method and device for processing data from data table |
CN104424229A (en) * | 2013-08-26 | 2015-03-18 | 腾讯科技(深圳)有限公司 | Calculating method and system for multi-dimensional division |
CN104484398A (en) * | 2014-12-12 | 2015-04-01 | 北京国双科技有限公司 | Method and device for aggregation of data in datasheet |
CN106933914A (en) * | 2015-12-31 | 2017-07-07 | 北京国双科技有限公司 | The data processing method and device of many tables of data |
CN106933893A (en) * | 2015-12-31 | 2017-07-07 | 北京国双科技有限公司 | The querying method and device of multi-dimensional data |
-
2015
- 2015-12-31 CN CN201511032274.XA patent/CN106933902B/en active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103699966A (en) * | 2013-04-12 | 2014-04-02 | 国家电网公司 | Multidimensional overall-process data control system and control method based on user requirement |
CN104424229A (en) * | 2013-08-26 | 2015-03-18 | 腾讯科技(深圳)有限公司 | Calculating method and system for multi-dimensional division |
CN104391951A (en) * | 2014-11-27 | 2015-03-04 | 北京国双科技有限公司 | Web page thermodynamic diagram loading method and device |
CN104484398A (en) * | 2014-12-12 | 2015-04-01 | 北京国双科技有限公司 | Method and device for aggregation of data in datasheet |
CN104392001A (en) * | 2014-12-15 | 2015-03-04 | 北京国双科技有限公司 | Database inquiry method and device |
CN104408180A (en) * | 2014-12-15 | 2015-03-11 | 北京国双科技有限公司 | Stored data inquiring method and device |
CN104408179A (en) * | 2014-12-15 | 2015-03-11 | 北京国双科技有限公司 | Method and device for processing data from data table |
CN106933914A (en) * | 2015-12-31 | 2017-07-07 | 北京国双科技有限公司 | The data processing method and device of many tables of data |
CN106933893A (en) * | 2015-12-31 | 2017-07-07 | 北京国双科技有限公司 | The querying method and device of multi-dimensional data |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110209687A (en) * | 2018-02-23 | 2019-09-06 | 北京国双科技有限公司 | The querying method and device of various dimensions attribution |
CN110209687B (en) * | 2018-02-23 | 2021-06-22 | 北京国双科技有限公司 | Multi-dimensional attribution query method and device |
CN109635014A (en) * | 2018-12-12 | 2019-04-16 | 苏州思必驰信息科技有限公司 | The display methods and system of user's assets information |
CN112948441A (en) * | 2021-03-26 | 2021-06-11 | 浪潮通用软件有限公司 | Financial data-oriented multidimensional data aggregation method and equipment |
CN112948441B (en) * | 2021-03-26 | 2023-09-29 | 浪潮通用软件有限公司 | Multi-dimensional data collection method and equipment for financial data |
CN114579619A (en) * | 2022-04-28 | 2022-06-03 | 北京达佳互联信息技术有限公司 | Data query method and device, electronic equipment and storage medium |
CN114579619B (en) * | 2022-04-28 | 2023-01-20 | 北京达佳互联信息技术有限公司 | Data query method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN106933902B (en) | 2020-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106933906A (en) | The querying method and device of data multidimensional degree | |
US9519689B2 (en) | Disambiguation framework for information searching | |
CA2751295C (en) | Analysis of object structures such as benefits and provider contracts | |
US20140040306A1 (en) | Business intelligence performance analysis system | |
CN106933902A (en) | Querying method and device that data multidimensional degree is freely dissected | |
DE102014204842A1 (en) | Clustering of data | |
US20170357708A1 (en) | Apparatus and method for processing multi-dimensional queries in a shared nothing system through tree reduction | |
JP2004528660A (en) | Method and system for improving response time of queries on partitioned database objects | |
CN106557498A (en) | Date storage method and device and data query method and apparatus | |
CN106933893B (en) | multi-dimensional data query method and device | |
CN102663048A (en) | Method and device for providing search result | |
CN103714096A (en) | Lucene-based inverted index system construction method and device, and Lucene-based inverted index system data processing method and device | |
US9477702B1 (en) | Apparatus and method for accessing materialized and non-materialized values in a shared nothing system | |
US20110066620A1 (en) | Automated Boolean Expression Generation for Computerized Search and Indexing | |
CN103605848A (en) | Method and device for analyzing paths | |
CN104182544B (en) | The dimension method for decomposing and device of analytical database | |
CN105653661A (en) | Search result re-ranking method and device | |
CN106682206A (en) | Method and system for big data processing | |
US9177079B1 (en) | Apparatus and method for processing multi-dimensional queries in a shared nothing system through tree reduction | |
CN103605744B (en) | The analysis method and device of site search engine data on flows | |
CN104794130A (en) | Inter-table correlation query method and device | |
CN106649385B (en) | Data reordering method and device based on HBase database | |
CN106933894A (en) | The querying method and device of multi-dimensional data | |
US20120197933A1 (en) | Systems and methods for assigning hosts in response to a data query | |
CN113553341A (en) | Multidimensional data analysis method, multidimensional data analysis device, multidimensional data analysis equipment and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | ||
CB02 | Change of applicant information |
Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing Applicant after: Beijing Guoshuang Technology Co.,Ltd. Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing Applicant before: Beijing Guoshuang Technology Co.,Ltd. |
|
GR01 | Patent grant | ||
GR01 | Patent grant |