CN106933902A - Querying method and device that data multidimensional degree is freely dissected - Google Patents

Querying method and device that data multidimensional degree is freely dissected Download PDF

Info

Publication number
CN106933902A
CN106933902A CN201511032274.XA CN201511032274A CN106933902A CN 106933902 A CN106933902 A CN 106933902A CN 201511032274 A CN201511032274 A CN 201511032274A CN 106933902 A CN106933902 A CN 106933902A
Authority
CN
China
Prior art keywords
index
data
dimension
checked
index table
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201511032274.XA
Other languages
Chinese (zh)
Other versions
CN106933902B (en
Inventor
洪超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201511032274.XA priority Critical patent/CN106933902B/en
Publication of CN106933902A publication Critical patent/CN106933902A/en
Application granted granted Critical
Publication of CN106933902B publication Critical patent/CN106933902B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This application discloses querying method and device that a kind of data multidimensional degree is freely dissected.The method includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table set, wherein, multiple dimensions are the dimension for needing to be dissected;Determine the index table and non-index table in associated data table set, wherein, index table is the table comprising index to be checked in associated data table set, and non-index table is the table not comprising index to be checked in associated data table set;Filtration treatment is carried out to non-index table according to default filter condition;Non- index table after index table and filtering is attached, data sublist is obtained;And index to be checked is inquired about in data sublist and dimension is dissected, wherein, it refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension.By the application, solve the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low.

Description

Querying method and device that data multidimensional degree is freely dissected
Technical field
The application is related to data processing field, in particular to the querying method that a kind of data multidimensional degree is freely dissected And device.
Background technology
Under distributed environment, query engine conventional at present has a Hive, Impala these structurings with metadata Database, in these databases, generally, by the data Ji Lu of all kinds of events each tables of data.For example, In " Education Administration Information System ", teaching management database includes following tables of data:" teacher " table, " course " Table, " achievement " table, " student " table, " class " table and " giving lessons " table etc., are used for managing by data above table The information such as reason teaching process middle school student, teacher, course.Again for example, the session that internet is carried out is monitored, typically have Conversational list (Session), page browsing table (PageView), search in Website table (SiteSearch), order table (Ecommerce), customized event table (Event) etc. much represents the tables of data of miscellaneous service scene, but all can It is associated by the session identification (SessionID) of client, forms all entities of whole session.When user needs Will from multiple angles across multiple tables of data data correlation check achievement data and dissect dimension when, correlation technique In, according to user's query demand, respective code is write, so as to the achievement data and phase that go inquiry related in database Dimension is closed, due to lacking the technical scheme that user's query demand is converted into inquiry for unification in correlation technique, causes dividing The search efficiency that data multidimensional degree in analysis field is freely dissected is low.
The relatively low problem of the search efficiency that freely dissects for data multidimensional degree in correlation technique, not yet proposes effective at present Solution.
The content of the invention
The main purpose of the application is to provide querying method and device that a kind of data multidimensional degree is freely dissected, to solve The relatively low problem of data multidimensional degree is freely dissected in correlation technique search efficiency.
To achieve these goals, according to the one side of the application, there is provided what a kind of data multidimensional degree was freely dissected Querying method.The method includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, is associated Tables of data set, wherein, multiple dimensions are the dimension for needing to be dissected;Determine the index in associated data table set Table and non-index table, wherein, index table is the table comprising index to be checked in associated data table set, and non-index table is Table not comprising index to be checked in associated data table set;Non- index table is carried out at filtering according to default filter condition Reason;Non- index table after index table and filtering is attached, data sublist is obtained;And inquired about in data sublist Index to be checked and anatomy dimension, wherein, it refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension.
Further, after filtration treatment is carried out to non-index table according to default filter condition, by index table and mistake Non- index table after filter is attached, and before obtaining data sublist, the method also includes:It is determined that the non-index after filtering Connection row in table, wherein, connection is classified as the data row for needing to be attached with index table, after index table and filtering Non- index table be attached, obtaining data sublist includes:With index in connecting the non-index table after arranging filtering Table is attached, and obtains data sublist.
Further, after filtration treatment is carried out to non-index table according to default filter condition, by index table and mistake Non- index table after filter is attached, and before obtaining data sublist, the method also includes:According to default filter condition pair Index table carries out filtration treatment;Determine link field in the index table after filtration treatment and related to index to be checked Field, wherein, link field is the field for needing to be attached with the non-index table after filtering;After filtration treatment The table of link field and the field composition related to index to be checked in index table will refer to as the index table after filtering Non- index table after mark table and filtering is attached, and obtaining data sublist includes:Non- index table and filtering after by filtering Index table afterwards is attached, and obtains data sublist.
Further, index to be checked is inquired about in data sublist and dimension is dissected includes:Chosen in data sublist and treated Inquiry dimension row, dimension to be checked is classified as dimension row to be checked in data sublist;Determine in data sublist to be checked The corresponding index row of index;And according to the corresponding index row inquiry index to be checked of index to be checked in data sublist, Dimension anatomy is carried out to dimension to be checked row according to multiple dimensions.
Further, the tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table collection Conjunction includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data table; Obtain the associated key that is mutually related between each associated data table at least one associated data table;And by associated key At least one associated data table is associated, associated data table set is obtained.
Further, the tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table collection Conjunction includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data table; Duplicate removal filtration treatment is carried out to identical tables of data at least one associated data table, the associated data table after being filtered; And using the associated data table after filtering as associated data table set.
Further, it is determined that the index table and non-index table in associated data table set include:Receive looking into for outside input Ask instruction;Index to be checked is determined according to query statement;And divided the set of associated data table according to index to be checked It is index table and non-index table.
To achieve these goals, according to the another aspect of the application, there is provided what a kind of data multidimensional degree was freely dissected Inquiry unit, the device includes:Acquiring unit, for obtaining the number being associated with each dimension in multiple dimensions respectively According to table, associated data table set is obtained, wherein, multiple dimensions are the dimension for needing to be dissected;First determining unit, For determining the index table in associated data table set and non-index table, wherein, index table is in associated data table set Table comprising index to be checked, non-index table is the table not comprising index to be checked in associated data table set;Treatment is single Unit, for carrying out filtration treatment to non-index table according to default filter condition;Connection unit, for by index table and mistake Non- index table after filter is attached, and obtains data sublist;And query unit, treated for being inquired about in data sublist Inquiry index and anatomy dimension, wherein, it refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension.
Further, the device also includes:Second determining unit, for determining the connection in the non-index table after filtering Row, wherein, connection is classified as the data row for needing to be attached with index table, and connection unit is additionally operable to will by connecting row It is attached with index table in non-index table after filtering, obtains data sublist.
Further, query unit includes:Module is chosen, for choosing dimension row to be checked in data sublist, is treated Inquiry dimension is classified as dimension row to be checked in data sublist;Determining module, it is to be checked for determining in data sublist The corresponding index row of index;And enquiry module, for being arranged according to the corresponding index of index to be checked in data sublist Index to be checked is inquired about, dimension anatomy is carried out to dimension to be checked row according to multiple dimensions.
By the application, using following steps:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, Associated data table set is obtained, wherein, multiple dimensions are the dimension for needing to be dissected;Determine associated data table set In index table and non-index table, wherein, index table be associated data table set in comprising index to be checked table, it is non- Index table is the table not comprising index to be checked in associated data table set;Non- index table is entered according to default filter condition Row filtration treatment;Non- index table after index table and filtering is attached, data sublist is obtained;And in data Index to be checked is inquired about in table and dimension is dissected, wherein, it refers to that data sublist is carried out according to multiple dimensions to dissect dimension Dimension is dissected, and solves the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, is realized many Table linkage carries out multi-dimensional free analysis, reduces performance cost, and then reached the efficiency of lifting inquiry data target Effect.
Brief description of the drawings
The accompanying drawing for constituting the part of the application is used for providing further understanding of the present application, the schematic reality of the application Apply example and its illustrate for explaining the application, do not constitute the improper restriction to the application.In the accompanying drawings:
Fig. 1 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application first embodiment;
Fig. 2 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application second embodiment;And
Fig. 3 is the schematic diagram of the inquiry unit freely dissected according to the data multidimensional degree of the embodiment of the present application.
Specific embodiment
It should be noted that in the case where not conflicting, the feature in embodiment and embodiment in the application can phase Mutually combination.Describe the application in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
In order that those skilled in the art more fully understand application scheme, below in conjunction with the embodiment of the present application Accompanying drawing, is clearly and completely described to the technical scheme in the embodiment of the present application, it is clear that described embodiment The only embodiment of the application part, rather than whole embodiments.Based on the embodiment in the application, ability The every other embodiment that domain those of ordinary skill is obtained under the premise of creative work is not made, should all belong to The scope of the application protection.
It should be noted that term " first ", " in the description and claims of this application and above-mentioned accompanying drawing Two " it is etc. for distinguishing similar object, without for describing specific order or precedence.It should be appreciated that this The data that sample is used can be exchanged in the appropriate case, so as to embodiments herein described herein.Additionally, term " comprising " and " having " and their any deformation, it is intended that covering is non-exclusive to be included, for example, comprising The process of series of steps or unit, method, system, product or equipment are not necessarily limited to those steps clearly listed Rapid or unit, but may include not listing clearly or intrinsic for these processes, method, product or equipment Other steps or unit.
Term is explained:
Index:Refer to the value that can carry out aminated polyepichlorohydrin, for example, page views are an indexs, using sue for peace into Row polymerization;Mean residence time is also an index, using being averagely polymerized, wherein, aminated polyepichlorohydrin include summation, Averagely, count etc..
Dimension:Refer to the angle for checking index, for example, browser is a dimension, can be from this dimension of browser Query page browses (Page View, referred to as PV), such that it is able to know user checks the page using which browser, And the number of times of the page is checked using these browsers;Operating system be another dimension, it is also possible to from operating system this Individual dimension inquires about PV, such that it is able to know user checks the page using which operating system, and operates system using these System checks the number of times of the page.
Member:Refer to the specific corresponding object of dimension, for example, browser is a dimension, and IE browser, Chrome Browser is a member of the dimension.
Various dimensions are dissected:Refer to that can dissect multiple from multiple dimensions to refer to target value, for example, being looked into from operating system dimension After seeing session value, after selecting wherein several members, continuation checks session value from browser dimension, i.e., from two dimensions Dissect session value.The like, can freely be dissected from multiple dimensions.
According to embodiments herein, there is provided the querying method that a kind of data multidimensional degree is freely dissected.
Fig. 1 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application first embodiment.As schemed Shown in 1, the method is comprised the following steps:
Step S101, obtains the tables of data being associated with each dimension in multiple dimensions respectively, obtains associated data table collection Close, wherein, multiple dimensions are the dimension for needing to be dissected.
Dimension (dimension) is the structural characteristics of cube.They are for describing data in tables of data The hierarchical structure in a organized way (rank) of classification.These classification describe some similar member sets with rank, and user will It is analyzed based on these member sets.Dimension table can be regarded as the window that user carrys out analyze data, be wrapped in dimension table Characteristic containing true record in tables of data, some characteristics provide descriptive information, and some characteristics specify how combined data Table data, to provide useful information, hierarchical structure of the dimension table comprising the characteristic for helping combined data for analyst. For example, tables of data is sales volume table, dimension table is exactly regional table.Dissect somewhere commodity sales volume, be from area this Angle observes Sales Volume of Commodity, dissects the sales volume of the regional commodity such as Beijing, Shanghai and Guangzhou, is to observe business from multiple angles Product sales volume.
The querying method that multiple dimensions in step S101 are freely dissected for the data multidimensional degree of the application first embodiment The middle dimension for needing to be dissected, in distributed data base, obtain is associated respectively with each dimension in multiple dimensions Tables of data, obtain associated data table set.Because some tables of data may obtain comprising two or more dimension To the tables of data being associated with each dimension in multiple dimensions in there is identical tables of data, in order to lifted obtain association The accuracy of tables of data set is freely cutd open in the data multidimensional degree of the application first embodiment, it is necessary to carry out duplicate removal treatment In the querying method of analysis, the tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtain associated data table Set includes:The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data Table;Duplicate removal filtration treatment is carried out to identical tables of data at least one associated data table, the incidence number after being filtered According to table;And using the associated data table after filtering as associated data table set.
Duplicate removal filtration treatment is carried out by the associated data table for getting, the associated data table after being filtered;And Using the associated data table after filtering as associated data table set, the accuracy for obtaining associated data table set is improved.
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, respectively obtain with The associated tables of data of each dimension in multiple dimensions, obtaining associated data table set includes:Obtain respectively and tieed up with multiple The associated tables of data of each dimension, obtains at least one associated data table in degree;Obtain at least one associated data table In be mutually related between each associated data table associated key;And at least one associated data table is associated by associated key, Obtain associated data table set.
For example, at least one associated data table is associated by session identification (sessionID), obtain incidence number According to table set.Specific associated key is not limited in this application.Can only be with other number if there is the tables of data having According to the relevant key of table, and can only by other tables of data complete with gather in other tables of data associate, in the application In its specific associated key is also not especially limited.
By above-mentioned steps, the tables of data being associated with each dimension in multiple dimensions is got, that is, get and subsequently look into Ask the metadata information of achievement data.It should be noted that the inquiry that the data multidimensional degree that the application is provided freely is dissected Method is, the querying method that the data multidimensional degree in analysis field is freely dissected.
Step S102, determines the index table and non-index table in associated data table set, wherein, index table is incidence number According to the table comprising index to be checked in table set, non-index table is not comprising index to be checked in associated data table set Table.
Above-mentioned associated data table set is made up of index table and non-index table, and index table is bag in associated data table set Table containing index to be checked, non-index table is the table not comprising index to be checked in associated data table set.Finger to be checked Being designated as user needs the index of inquiry, for example, index to be checked can be visit capacity, page views are average to stop Time, jump out rate, average page access number etc..
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, incidence number is determined Can also be realized by following steps according to the index table in table set and non-index table:Receive the query statement of outside input; Index to be checked is determined according to query statement;And associated data table set is divided into by index table according to index to be checked With non-index table.
By above step, associated data table set is divided into index table and non-index table.
Step S103, filtration treatment is carried out according to default filter condition to non-index table.
Filtration treatment is carried out to non-index table according to default filter condition, the default filter condition in step S103 can be: All Session dimensions filterings and the peculiar dimension filtering of the non-index table.
Wherein, the default filter condition is the metadata according to matching dimensionality and index, forms selection required for each table Row and filtering condition.
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, according to default Filter condition is carried out after filtration treatment to non-index table, and the non-index table after by index table and filtering is attached, Before obtaining data sublist, the method also includes:It is determined that filtering after non-index table in connection row, wherein, connection The data row for needing to be attached with index table are classified as, the non-index table after index table and filtering is attached, obtained Data sublist includes:It is attached with index table by the non-index table after connecting row by filtering, obtains data sublist.
By alternative column, and (distinct) table linkage field is chosen, for example, SessionID, ecommerceid, Sitesearchid, does distinct operations, it is to avoid data expansion when rear continued is connected, the effect of influence inquiry achievement data Rate.
Alternatively, in the querying method that the data multidimensional degree of the application first embodiment is freely dissected, according to default Filter condition is carried out after filtration treatment to non-index table, and the non-index table after by index table and filtering is attached, Before obtaining data sublist, the method also includes:Filtration treatment is carried out to index table according to default filter condition;It is determined that Link field and the field related to index to be checked in index table after filtration treatment, wherein, link field is to need The field to be attached with the non-index table after filtering;By the link field in the index table after filtration treatment and with treat The table of the related field composition of inquiry index enters the non-index table after index table and filtering as the index table after filtering Row connection, obtaining data sublist includes:The index table after non-index table and filtering after by filtering is attached, and obtains Data sublist.
Filtration treatment is carried out to index table according to default filter condition, default filter condition can be:All Session Dimension is filtered and the peculiar dimension of the index table is filtered.By determining the relevant field that table linkage field and index are calculated, Filtration treatment is carried out to index table, it is to avoid when rear continued is connected, data expansion, the efficiency of influence inquiry achievement data.
Step S104, the non-index table after index table and filtering is attached, and obtains data sublist.
Non- index table after index table and filtering various ways are attached, for example, interior connection, outer connection and friendship Fork connection etc..Only comprising the row for meeting condition in the Connection inquiring result set of interior connection, interior connection is SQL Server Default connected mode, it is different according to the manner of comparison that is used, interior connection be divided into again equivalent connection, Nature Link and Do not wait three kinds of connection;Comprising the combination of all rows in two tables in the Connection inquiring result set of interconnection;Outer connection Both meet the row of condition in Connection inquiring result set comprising those, also the whole rows comprising wherein certain table, there are 3 kinds of shapes The outer connection of formula:Left outside connection, right outer connection, complete outer connection.Can realize that multiple tables are inquired about by concatenation operator. Attended operation brings very big flexibility to user, can at any time increase new data type.For different entities are created New table is built, is then inquired about by connection.
By with any one in upper type, or other connected modes enter the non-index table after index table and filtering Row connection, obtains data sublist.
Step S105, index to be checked is inquired about in data sublist and dimension is dissected, wherein, it refers to basis to dissect dimension Multiple dimensions carry out dimension anatomy to data sublist.
By the method including step S101 to step S105, by defining sublist, and carried out by table linkage field Distinct choose, distributed environment be especially good at calculate Distinct set, than table connect performance faster, then will knot Fruit set carries out table connection, obtains data sublist, so as to avoid data expansion, reaches correct inquiry achievement data Purpose, solves the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, realizes multilist connection It is dynamic to carry out multi-dimensional free analysis, performance cost is reduced, and then reached the effect of the efficiency of lifting inquiry data target Really.
In sum, the querying method that the data multidimensional degree that the application first embodiment is provided freely is dissected, by respectively The tables of data being associated with each dimension in multiple dimensions is obtained, associated data table set is obtained, wherein, multiple dimensions For the dimension that needs are dissected;Determine the index table and non-index table in associated data table set, wherein, index table It is the table comprising index to be checked in associated data table set, non-index table is to be checked not include in associated data table set Ask the table of index;Filtration treatment is carried out to non-index table according to default filter condition;By the non-finger after index table and filtering Mark table is attached, and obtains data sublist;And index to be checked is inquired about in data sublist and dimension is dissected, wherein, It refers to carry out dimension anatomy to data sublist according to multiple dimensions to dissect dimension, solves data multidimensional degree in correlation technique The relatively low problem of the search efficiency that freely dissects, realizing multilist linkage carries out multi-dimensional free analysis, reduces performance Expense, and then reached the effect of the efficiency of lifting inquiry data target.
Fig. 2 is the flow chart of the querying method freely dissected according to the data multidimensional degree of the application second embodiment.Fig. 2 Can be as a kind of preferred embodiment of embodiment illustrated in fig. 1.As shown in Fig. 2 the method is comprised the following steps:
Step S201, obtains the tables of data being associated with multiple dimensions respectively, obtains associated data table set.
Step S201 will not be repeated here with above-mentioned steps S101.
Step S202, determines the index table and non-index table in associated data table set, wherein, index table is incidence number According to the table comprising index to be checked in table set, non-index table is not comprising index to be checked in associated data table set Table.
Step S202 will not be repeated here with above-mentioned steps S102.
Step S203, filtration treatment is carried out according to default filter condition to non-index table.
Step S203 will not be repeated here with above-mentioned steps S103.
Step S204, the non-index table after index table and filtering is attached, and obtains data sublist.
Step S204 will not be repeated here with above-mentioned steps S104.
Step S205, chooses dimension to be checked row in data sublist, and dimension to be checked is classified as to be checked in data sublist Dimension row.
When multidimensional is dissected, the incoming anatomy chain (default filter condition) of meeting, and the index of correlation on the index table for dissecting, Filtered according to chain is dissected, if the dimension filtering of other tables of data comprising non-index table, first by other data Table is filtered by relevant dimension, then by the result after filtering, is attached (according to metadata) with index table, so Just the filter condition in other tables of data can be applied, for example, dissect chain being:Operating system version (window7)>> Browser (chrome)>>Whether new visitor's (YES)>>The page amount of checking (1)>>Access duration.In data A length of dimension row to be checked during dimension row, the i.e. access to be checked are chosen in table.
Step S206, determines the corresponding index row of index to be checked in data sublist.
The corresponding index row of index to be checked are determined in data sublist.For example, according to the corresponding index of index to be checked Be classified as queried access amount, page views, mean residence time, jump out the indexs such as rate, average page access number row.
It should be noted that the unique value of alternative column is more, performance is slower;The length of alternative column is more long, and performance is slower.
Step S207, according to the corresponding index row inquiry index to be checked of index to be checked in data sublist, according to many Individual dimension carries out dimension anatomy to dimension to be checked row.
For example, according to the corresponding index row queried access amount of index to be checked, page views, flat in data sublist Equal residence time, the achievement data for jumping out the indexs such as rate, average page access number row.According to be checked in data sublist Ask whether the inquiry of dimension row is that new visitor, browser dimension, the page amount of checking etc. dissect dimension.
In the querying method that the data multidimensional degree of the application second embodiment is freely dissected, can be by Group By The row of relevant inquiring;OrderBy indexs of correlation are arranged;Pageing is operated;GroupBy dimensions row and index row are chosen, Inquire the corresponding achievement data of index to be checked.
By the method including step S201 to step S207, dimension to be checked can be selected to arrange in data sublist, Can be arranged with related index, multi-dimensional free analysis are carried out so as to carry out multilist linkage, reduce performance cost, and then The effect of the efficiency of lifting inquiry data target is reached.
In sum, the querying method that the data multidimensional degree that the application second embodiment is provided freely is dissected, by respectively The tables of data being associated with each dimension in multiple dimensions is obtained, associated data table set is obtained, wherein, multiple dimensions For the dimension that needs are dissected;Determine the index table and non-index table in associated data table set, wherein, index table It is the table comprising index to be checked in associated data table set, non-index table is to be checked not include in associated data table set Ask the table of index;Filtration treatment is carried out to non-index table according to default filter condition;By the non-finger after index table and filtering Mark table is attached, and obtains data sublist;Dimension row to be checked are chosen in data sublist, dimension to be checked is classified as number Arranged according to dimension to be checked in sublist;The corresponding index row of index to be checked are determined in data sublist;And in data According to the corresponding index row inquiry achievement data of index to be checked in sublist, dimension is dissected according to dimension to be checked row inquiry, Solve the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, realizing multilist linkage is carried out Multi-dimensional free analysis, reduce performance cost, and then reached the effect of the efficiency of lifting inquiry data target.
It should be noted that can be in such as one group computer executable instructions the step of the flow of accompanying drawing is illustrated Performed in computer system, and, although logical order is shown in flow charts, but in some cases, can Shown or described step is performed with different from order herein.
The embodiment of the present application additionally provides the inquiry unit that a kind of data multidimensional degree freely dissects, it is necessary to explanation, this Apply for that the inquiry unit that the data multidimensional degree of embodiment is freely dissected can be used for performing the use that the embodiment of the present application is provided In the querying method that data multidimensional degree is freely dissected.The data multidimensional degree that the embodiment of the present application is provided freely is dissected below Inquiry unit be introduced.
Fig. 3 is the schematic diagram of the inquiry unit freely dissected according to the data multidimensional degree of the embodiment of the present application.Such as Fig. 3 institutes Show, the device includes:Acquiring unit 10, the first determining unit 20, processing unit 30, connection unit 40 and inquiry Unit 50.
Acquiring unit 10, for obtaining the tables of data being associated with each dimension in multiple dimensions respectively, obtains incidence number According to table set, wherein, multiple dimensions are the dimension for needing to be dissected.
First determining unit 20, for determining the index table in associated data table set and non-index table, wherein, index Table is the table comprising index to be checked in associated data table set, and non-index table is treated not include in associated data table set Inquire about the table of index.
Processing unit 30, for carrying out filtration treatment to non-index table according to default filter condition.
Connection unit 40, for the non-index table after index table and filtering to be attached, obtains data sublist.
Query unit 50, for inquiring about index to be checked in data sublist and dissecting dimension, wherein, dissecting dimension is Finger carries out dimension anatomy according to multiple dimensions to data sublist.
The inquiry unit that the data multidimensional degree that the embodiment of the present application is provided freely is dissected, is obtained respectively by acquiring unit 10 The tables of data being associated with each dimension in multiple dimensions, obtains associated data table set, wherein, multiple dimensions are to need The dimension for being dissected;First determining unit 20 determines the index table and non-index table in associated data table set, its In, index table is the table comprising index to be checked in associated data table set, and non-index table is in associated data table set Table not comprising index to be checked;Processing unit 30 carries out filtration treatment according to default filter condition to non-index table;Even Be attached for non-index table after index table and filtering by order unit 40, obtains data sublist;And query unit 50 Index to be checked is inquired about in data sublist and dimension is dissected, wherein, it refers to data according to multiple dimensions to dissect dimension Sublist carries out dimension anatomy, solves the problems, such as that the search efficiency that data multidimensional degree is freely dissected in correlation technique is relatively low, Realizing multilist linkage carries out multi-dimensional free analysis, reduces performance cost, and then has reached lifting inquiry data and refer to The effect of target efficiency.
Alternatively, in the inquiry unit that the data multidimensional degree that the embodiment of the present application is provided freely is dissected, the device is also wrapped Include:Second determining unit, for determine filtering after non-index table in connection row, wherein, connection be classified as needs with The data row that index table is attached, connection unit is additionally operable in connecting the non-index table after arranging filtering and index Table is attached, and obtains data sublist.
Alternatively, in the inquiry unit that the data multidimensional degree that the embodiment of the present application is provided freely is dissected, query unit 50 Including:Module is chosen, for choosing dimension row to be checked in data sublist, dimension to be checked is classified as in data sublist Dimension row to be checked;Determining module, for determining the corresponding index row of index to be checked in data sublist;And Enquiry module, for inquiring about index to be checked according to the corresponding index row of index to be checked in data sublist, according to many Individual dimension carries out dimension anatomy to dimension to be checked row.
It should be noted that for foregoing each method embodiment, in order to be briefly described, therefore it is all expressed as one it is The combination of actions of row, but those skilled in the art should know, and the application is not limited by described sequence of movement System, because according to the application, some steps can sequentially or simultaneously be carried out using other.Secondly, art technology Personnel should also know that embodiment described in this description belongs to preferred embodiment, involved action and module Not necessarily necessary to the application.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have the portion described in detail in certain embodiment Point, may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed device, can be by other sides Formula is realized.For example, device embodiment described above is only schematical, such as the division of described unit, only Only a kind of division of logic function, can there is other dividing mode when actually realizing, such as multiple units or component can To combine or be desirably integrated into another system, or some features can be ignored, or not perform.
The unit that is illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to On multiple NEs.Some or all of unit therein can be according to the actual needs selected to realize the present embodiment The purpose of scheme.
In addition, during each functional unit in the application each embodiment can be integrated in a processing unit, it is also possible to It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.It is above-mentioned integrated Unit can both be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.
Obviously, those skilled in the art should be understood that each module or each step of above-mentioned the application can be with general Computing device realize that they can be concentrated on single computing device, or be distributed in multiple computing device institutes On the network of composition, alternatively, they can be realized with the executable program code of computing device, it is thus possible to It is stored in being performed by computing device in storage device, or they is fabricated to each integrated circuit die respectively Block, or the multiple modules or step in them are fabricated to single integrated circuit module to realize.So, the application Any specific hardware and software is not restricted to combine.
The preferred embodiment of the application is the foregoing is only, the application is not limited to, for those skilled in the art For member, the application can have various modifications and variations.It is all within spirit herein and principle, made it is any Modification, equivalent, improvement etc., should be included within the protection domain of the application.

Claims (10)

1. the querying method that a kind of data multidimensional degree is freely dissected, it is characterised in that including:
The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains associated data table set, its In, the multiple dimension is the dimension for needing to be dissected;
Determine the index table and non-index table in the associated data table set, wherein, the index table is described Table comprising index to be checked in associated data table set, the non-index table is in the associated data table set Table not comprising the index to be checked;
Filtration treatment is carried out to the non-index table according to default filter condition;
Non- index table after the index table and filtering is attached, data sublist is obtained;And
Index to be checked is inquired about in the data sublist and dimension is dissected, wherein, the anatomy dimension refers to root Dimension anatomy is carried out to the data sublist according to the multiple dimension.
2. method according to claim 1, it is characterised in that
After filtration treatment is carried out to the non-index table according to default filter condition, by the index table and Non- index table after filtering is attached, and before obtaining data sublist, methods described also includes:Determine the mistake Connection row in non-index table after filter, wherein, the connection is classified as to be needed to be attached with the index table Data are arranged,
Non- index table after the index table and filtering is attached, obtaining data sublist includes:By described It is attached with the index table in non-index table after row are connected by filtering, obtains the data sublist.
3. method according to claim 1 and 2, it is characterised in that
After filtration treatment is carried out to the non-index table according to default filter condition, by the index table and Non- index table after filtering is attached, and before obtaining data sublist, methods described also includes:According to described pre- If filter condition carries out filtration treatment to the index table;Determine the link field in the index table after filtration treatment And the field related to the index to be checked, wherein, the link field be need with the filtering after it is non- The field that index table is attached;By the link field in the index table after the filtration treatment and to be checked with described The table of the related field composition of index is ask as the index table after filtering,
Non- index table after the index table and filtering is attached, obtaining data sublist includes:After filtering Non- index table and filtering after index table be attached, obtain the data sublist.
4. method according to claim 1, it is characterised in that inquired about in the data sublist index to be checked and Dissecting dimension includes:
Dimension row to be checked are chosen in the data sublist, the dimension to be checked is classified as in the data sublist Dimension row to be checked;
The corresponding index row of the index to be checked are determined in the data sublist;And
According to the corresponding index row inquiry index to be checked of the index to be checked in the data sublist, Dimension anatomy is carried out to the dimension row to be checked according to the multiple dimension.
5. method according to claim 1, it is characterised in that obtain related with each dimension in multiple dimensions respectively The tables of data of connection, obtaining associated data table set includes:
The tables of data being associated with each dimension in the multiple dimension is obtained respectively, obtains at least one incidence number According to table;
Obtain the associated key that is mutually related between each associated data table at least one associated data table;With And
At least one associated data table is associated by the associated key, the associated data table set is obtained.
6. method according to claim 1, it is characterised in that obtain related with each dimension in multiple dimensions respectively The tables of data of connection, obtaining associated data table set includes:
The tables of data being associated with each dimension in multiple dimensions is obtained respectively, obtains at least one associated data table;
Duplicate removal filtration treatment is carried out to identical tables of data at least one associated data table, after being filtered Associated data table;And
Using the associated data table after the filtering as the associated data table set.
7. method according to claim 1, it is characterised in that determine the index table in the associated data table set Include with non-index table:
Receive the query statement of outside input;
The index to be checked is determined according to the query statement;And
The associated data table set is divided into by the index table and the non-index according to the index to be checked Table.
8. the inquiry unit that a kind of data multidimensional degree is freely dissected, it is characterised in that including:
Acquiring unit, for obtaining the tables of data being associated with each dimension in multiple dimensions respectively, is associated Tables of data set, wherein, the multiple dimension is the dimension for needing to be dissected;
First determining unit, for determining the index table in the associated data table set and non-index table, wherein, The index table is the table comprising index to be checked in the associated data table set, and the non-index table is described Table not comprising the index to be checked in associated data table set;
Processing unit, for carrying out filtration treatment to the non-index table according to default filter condition;
Connection unit, for the non-index table after the index table and filtering to be attached, obtains data sublist; And
Query unit, for inquiring about index to be checked in the data sublist and dissecting dimension, wherein, it is described It refers to carry out dimension anatomy to the data sublist according to the multiple dimension to dissect dimension.
9. device according to claim 8, it is characterised in that described device also includes:Second determining unit, uses In it is determined that connection row in non-index table after the filtering, wherein, the connection is classified as needs and the index The data row that table is attached,
The connection unit is additionally operable to enter with the index table by the non-index table after the connection row are by filtering Row connection, obtains the data sublist.
10. device according to claim 8, it is characterised in that the query unit includes:
Module is chosen, for choosing dimension row to be checked in the data sublist, the dimension to be checked is classified as Dimension row to be checked in the data sublist;
Determining module, for determining the corresponding index row of the index to be checked in the data sublist;And
Enquiry module, in the data sublist according to the corresponding index row inquiry institute of the index to be checked Index to be checked is stated, dimension anatomy is carried out to the dimension row to be checked according to the multiple dimension.
CN201511032274.XA 2015-12-31 2015-12-31 Data multidimensional free analysis query method and device Active CN106933902B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511032274.XA CN106933902B (en) 2015-12-31 2015-12-31 Data multidimensional free analysis query method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511032274.XA CN106933902B (en) 2015-12-31 2015-12-31 Data multidimensional free analysis query method and device

Publications (2)

Publication Number Publication Date
CN106933902A true CN106933902A (en) 2017-07-07
CN106933902B CN106933902B (en) 2020-02-07

Family

ID=59444177

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511032274.XA Active CN106933902B (en) 2015-12-31 2015-12-31 Data multidimensional free analysis query method and device

Country Status (1)

Country Link
CN (1) CN106933902B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109635014A (en) * 2018-12-12 2019-04-16 苏州思必驰信息科技有限公司 The display methods and system of user's assets information
CN110209687A (en) * 2018-02-23 2019-09-06 北京国双科技有限公司 The querying method and device of various dimensions attribution
CN112948441A (en) * 2021-03-26 2021-06-11 浪潮通用软件有限公司 Financial data-oriented multidimensional data aggregation method and equipment
CN114579619A (en) * 2022-04-28 2022-06-03 北京达佳互联信息技术有限公司 Data query method and device, electronic equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103699966A (en) * 2013-04-12 2014-04-02 国家电网公司 Multidimensional overall-process data control system and control method based on user requirement
CN104391951A (en) * 2014-11-27 2015-03-04 北京国双科技有限公司 Web page thermodynamic diagram loading method and device
CN104392001A (en) * 2014-12-15 2015-03-04 北京国双科技有限公司 Database inquiry method and device
CN104408180A (en) * 2014-12-15 2015-03-11 北京国双科技有限公司 Stored data inquiring method and device
CN104408179A (en) * 2014-12-15 2015-03-11 北京国双科技有限公司 Method and device for processing data from data table
CN104424229A (en) * 2013-08-26 2015-03-18 腾讯科技(深圳)有限公司 Calculating method and system for multi-dimensional division
CN104484398A (en) * 2014-12-12 2015-04-01 北京国双科技有限公司 Method and device for aggregation of data in datasheet
CN106933914A (en) * 2015-12-31 2017-07-07 北京国双科技有限公司 The data processing method and device of many tables of data
CN106933893A (en) * 2015-12-31 2017-07-07 北京国双科技有限公司 The querying method and device of multi-dimensional data

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103699966A (en) * 2013-04-12 2014-04-02 国家电网公司 Multidimensional overall-process data control system and control method based on user requirement
CN104424229A (en) * 2013-08-26 2015-03-18 腾讯科技(深圳)有限公司 Calculating method and system for multi-dimensional division
CN104391951A (en) * 2014-11-27 2015-03-04 北京国双科技有限公司 Web page thermodynamic diagram loading method and device
CN104484398A (en) * 2014-12-12 2015-04-01 北京国双科技有限公司 Method and device for aggregation of data in datasheet
CN104392001A (en) * 2014-12-15 2015-03-04 北京国双科技有限公司 Database inquiry method and device
CN104408180A (en) * 2014-12-15 2015-03-11 北京国双科技有限公司 Stored data inquiring method and device
CN104408179A (en) * 2014-12-15 2015-03-11 北京国双科技有限公司 Method and device for processing data from data table
CN106933914A (en) * 2015-12-31 2017-07-07 北京国双科技有限公司 The data processing method and device of many tables of data
CN106933893A (en) * 2015-12-31 2017-07-07 北京国双科技有限公司 The querying method and device of multi-dimensional data

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110209687A (en) * 2018-02-23 2019-09-06 北京国双科技有限公司 The querying method and device of various dimensions attribution
CN110209687B (en) * 2018-02-23 2021-06-22 北京国双科技有限公司 Multi-dimensional attribution query method and device
CN109635014A (en) * 2018-12-12 2019-04-16 苏州思必驰信息科技有限公司 The display methods and system of user's assets information
CN112948441A (en) * 2021-03-26 2021-06-11 浪潮通用软件有限公司 Financial data-oriented multidimensional data aggregation method and equipment
CN112948441B (en) * 2021-03-26 2023-09-29 浪潮通用软件有限公司 Multi-dimensional data collection method and equipment for financial data
CN114579619A (en) * 2022-04-28 2022-06-03 北京达佳互联信息技术有限公司 Data query method and device, electronic equipment and storage medium
CN114579619B (en) * 2022-04-28 2023-01-20 北京达佳互联信息技术有限公司 Data query method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN106933902B (en) 2020-02-07

Similar Documents

Publication Publication Date Title
CN106933906A (en) The querying method and device of data multidimensional degree
US9519689B2 (en) Disambiguation framework for information searching
CA2751295C (en) Analysis of object structures such as benefits and provider contracts
US20140040306A1 (en) Business intelligence performance analysis system
CN106933902A (en) Querying method and device that data multidimensional degree is freely dissected
DE102014204842A1 (en) Clustering of data
US20170357708A1 (en) Apparatus and method for processing multi-dimensional queries in a shared nothing system through tree reduction
JP2004528660A (en) Method and system for improving response time of queries on partitioned database objects
CN106557498A (en) Date storage method and device and data query method and apparatus
CN106933893B (en) multi-dimensional data query method and device
CN102663048A (en) Method and device for providing search result
CN103714096A (en) Lucene-based inverted index system construction method and device, and Lucene-based inverted index system data processing method and device
US9477702B1 (en) Apparatus and method for accessing materialized and non-materialized values in a shared nothing system
US20110066620A1 (en) Automated Boolean Expression Generation for Computerized Search and Indexing
CN103605848A (en) Method and device for analyzing paths
CN104182544B (en) The dimension method for decomposing and device of analytical database
CN105653661A (en) Search result re-ranking method and device
CN106682206A (en) Method and system for big data processing
US9177079B1 (en) Apparatus and method for processing multi-dimensional queries in a shared nothing system through tree reduction
CN103605744B (en) The analysis method and device of site search engine data on flows
CN104794130A (en) Inter-table correlation query method and device
CN106649385B (en) Data reordering method and device based on HBase database
CN106933894A (en) The querying method and device of multi-dimensional data
US20120197933A1 (en) Systems and methods for assigning hosts in response to a data query
CN113553341A (en) Multidimensional data analysis method, multidimensional data analysis device, multidimensional data analysis equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant