CN110019238A - The method, apparatus and storage medium that various dimensions range partition is cut - Google Patents

The method, apparatus and storage medium that various dimensions range partition is cut Download PDF

Info

Publication number
CN110019238A
CN110019238A CN201711478050.0A CN201711478050A CN110019238A CN 110019238 A CN110019238 A CN 110019238A CN 201711478050 A CN201711478050 A CN 201711478050A CN 110019238 A CN110019238 A CN 110019238A
Authority
CN
China
Prior art keywords
subregion
cut
data
range
expression formula
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711478050.0A
Other languages
Chinese (zh)
Other versions
CN110019238B (en
Inventor
汪龙重
徐冬
连杰红
郑君正
雷春蔚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201711478050.0A priority Critical patent/CN110019238B/en
Publication of CN110019238A publication Critical patent/CN110019238A/en
Application granted granted Critical
Publication of CN110019238B publication Critical patent/CN110019238B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2264Multidimensional index structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2282Tablespace storage structures; Management thereof

Abstract

The method, apparatus and storage medium cut the invention discloses a kind of various dimensions range partition.This method comprises: the field of multiple dimensional extents based on tables of data to be cut, is decomposed into multiple subexpressions for the expression formula for being used for cut data table;Obtain subexpression subrange targeted in tables of data;Subrange is extended, multiple spreading ranges are generated;Based on each spreading range, cut data table obtains each subregion;Merge each subregion, obtains combined region.Inventive embodiments can support various dimensions range partition to cut as a result, expand the application range of subregion cutting;More permutation and combination situations can also be handled, and subregion can be made to cut not only thoroughly but also accurate;Follow up scan range can also be reduced, it is time-consuming to shorten scanning, and support can be provided to obtain accurate target data.

Description

The method, apparatus and storage medium that various dimensions range partition is cut
Technical field
The present invention relates to database inquiry optimization technical fields more particularly to a kind of various dimensions range partition to cut The method, apparatus and storage medium of (Partition Pruning).
Background technique
With the fast development of network technology, the data volume in database rapidly increases therewith, and big data era has been come Face.Usual data are stored in the database by region.In logical layer, data can be divided into multiple subregions and be stored in different subregions In table;In physical layer, data can be divided into multiple subregions and be stored in different data file or distributed data base node.? Lane database including mass data, user can obtain target partition by the method for subregion cutting.
Wherein, subregion cutting is to be cut out according to predicate expressions to subregion, is not belonging to predicate table to crop those Up to subregion belonging to formula, remaining target partition.Predicate expressions are to indicate a kind of calculation expression in the database.For example, needle To proposition: Wang Liang is football player or volleyballer.If Wang Liangji done: a, Wang Liang are that football player's meter is done: f (a), Wang Liang be volleyballer meter do: g (a), then, the predicate expressions of the proposition may is that f (a) ∨ g (a).
Specifically, it is a kind of optimization method for scanning partition table in database that subregion, which is cut,.It is scanned when to partition table When, the filter condition that whether there is subregion field in the conditional expression of predicate can first be checked by cutting component.If there is this Filter condition, then access meets the subregion of filter condition.It for ineligible subregion, then does not need to scan, to reach The purpose of scan data volume is reduced, and then reduces the data scanning time.
Existing database product, such as Mysql etc. only carry out subregion sanction to the predicate for being related to a dimensional extent It cuts, subregion cutting can not be carried out if the predicate for being related to multiple dimensional extents.When can not subregion cut when, it is necessary to scan All subregions.Cause the subregion table scan of multiple dimensional extents whole time-consuming longer.
How solving the subregion table scans of multiple dimensional extents, time-consuming, is a technical problem to be solved urgently.
Summary of the invention
In order to solve the problems, such as the subregion table scan of multiple dimensional extents, time-consuming, and it is more that the embodiment of the invention provides one kind The method, apparatus and storage medium that dimensional extent subregion is cut.
In a first aspect, providing a kind of method that various dimensions range partition is cut.Method includes the following steps:
The field of multiple dimensional extents based on tables of data to be cut, the expression formula for being used for cut data table is decomposed into Multiple subexpressions;
Obtain subexpression subrange targeted in tables of data;
Subrange is extended, multiple spreading ranges are generated;
Based on each spreading range, cut data table obtains each subregion;
Merge each subregion, obtains combined region.
Second aspect provides a kind of device that various dimensions range partition is cut.The device includes:
Expression formula decomposition component, range securing component, range extension element, subregion cut component and subregion combining block,
Expression formula decomposition component and range extension element are coupled with range securing component respectively,
Range extension element and subregion combining block cut component with subregion respectively and couple,
Wherein:
Expression formula decomposition component, the field of multiple dimensional extents based on tables of data to be cut, will be used for cut data The expression formula of table is decomposed into multiple subexpressions;
Range securing component, for obtaining subexpression subrange targeted in tables of data;
Range extension element generates multiple spreading ranges for extending subrange;
Subregion cuts component, is based on each spreading range, and cut data table obtains each subregion;
Subregion combining block obtains combined region for merging each subregion.
The third aspect provides a kind of device that various dimensions range partition is cut.The device includes:
Memory, for storing program;
Processor, for executing the program of the memory storage, it is above-mentioned each that described program executes the processor Method described in aspect.
Fourth aspect provides a kind of computer readable storage medium.Finger is stored in the computer readable storage medium It enables, when run on a computer, so that computer executes method described in above-mentioned various aspects.
5th aspect, provides a kind of computer program product comprising instruction.When the product is run on computers, So that computer executes method described in above-mentioned various aspects.
6th aspect, provides a kind of computer program.When the computer program is run on computers, so that calculating Machine executes method described in above-mentioned various aspects.
On the one hand, foregoing invention embodiment can will be used to cut institute based on the field of each dimensional extent of tables of data The expression formula for stating tables of data is decomposed into multiple subexpressions, so as to support various dimensions range partition to cut, expands subregion The application range of cutting.
On the other hand, foregoing invention embodiment can extend subrange, generate multiple spreading ranges, be based on each extension model It encloses, cut data table obtains each subregion, and single subregion cutting method is decomposed into multiple child partitions so as to realize The mode of cutting not only can handle more permutation and combination situations, but also subregion can be made to cut not only thoroughly but also accurate.
Another aspect, foregoing invention embodiment can accurately cut subregion by merging, obtain accurately merging subregion, And then follow up scan range can be not only reduced, it is time-consuming to shorten scanning, and branch can be provided to obtain accurate target data It holds.
Detailed description of the invention
In order to illustrate the technical solution of the embodiments of the present invention more clearly, will make below to required in the embodiment of the present invention Attached drawing is briefly described, it should be apparent that, drawings described below is only some embodiments of the present invention, for For those of ordinary skill in the art, without creative efforts, it can also be obtained according to these attached drawings other Attached drawing.
Fig. 1 is the system architecture schematic diagram that the various dimensions range partition of one embodiment of the invention is cut;
Fig. 2 is the flow diagram for the method that the various dimensions range partition of one embodiment of the invention is cut;
Fig. 3 is the flow diagram for the method that the various dimensions range partition of another embodiment of the present invention is cut;
Fig. 4 is the structural schematic diagram for the device that the various dimensions range partition of another embodiment of the present invention is cut;
Fig. 5 is the block schematic illustration for the device that the various dimensions range partition of one embodiment of the invention is cut.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with the embodiment of the present invention In attached drawing, technical scheme in the embodiment of the invention is clearly and completely described, it is clear that described embodiment is A part of the embodiment of the present invention, instead of all the embodiments.Based on the embodiments of the present invention, those of ordinary skill in the art Every other embodiment obtained without creative efforts, shall fall within the protection scope of the present invention.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 is the system architecture schematic diagram that the various dimensions range partition of one embodiment of the invention is cut.
As shown in Figure 1, the framework may include: server 10, client 20 and network 30.Server 10 may include number According to library 11, tables of data 111 can be stored in database 11.
One application scenarios of the present embodiment may is that user 40 (such as Xiao Zhou) accesses server 10 using client 20, Target area is obtained by the method for subregion cutting in the tables of data 111 of database 11.After obtaining target area, user 40 can also be scanned in target area, obtain target data.
Wherein, server 10 can be equipped with database 11 and various telecommunication customer end applications, such as instant messaging tools, Mailbox client, social platform software, audio frequency and video software etc..In general, the hardware configuration with higher of server 10 and relatively strong Operational capability.In the lesser situation of operation pressure, server 10 can use desktop computer, work station, mobile device or above-mentioned Electronic equipment etc. substituted.
Database 11 can be SQL database, oracle database etc..
Tables of data 111 can be the customer transaction tables of data of certain website, can also be student information table of certain school etc..Number It for example can be as shown in following table (1) according to table 111:
Table (1)
Referring to table (1), target area is obtained by the method for subregion cutting in the tables of data 111 of database 11, it can To obtain: university is Peking University, and institute is School of Computer Science, and grade is 2017 target areas.The target area can be with As shown in following table (2):
Table (2)
Serial number Name Student number University Institute Grade
1 Li Xi 100010 Peking University School of Computer Science 2017
2 Zhou Yue 100011 Peking University School of Computer Science 2017
3 Zhang Guo 100100 Peking University School of Computer Science 2017
Then, user can be scanned in target area, available " Li Xi ", " all Yue " and " fruit " this 3 The data information of the name of classmate.
Client 20 includes but is not limited to PC, smart phone, tablet computer, personal digital assistant, server etc..
Network 30 can be to provide the medium of communication link between various electronic equipments.Specifically, network 30 can be with Including various connection types, such as wired, wireless communication link or fiber optic cables etc..
It should be understood that the number of the equipment in Fig. 1 is only schematical.According to needs are realized, which can be with With any number of server 10, database 11, tables of data 111, client 20 and network 30.Tables of data 111 can increase Or reduce ordered series of numbers.Flexible setting can be specifically carried out according to the actual situation.
Following embodiments can carry out subregion using the system architecture of the present embodiment and cut and the processing such as data scanning.
Fig. 2 is the flow diagram for the method that the various dimensions range partition of one embodiment of the invention is cut.
In the present embodiment, the executing subject of method and step can be processing component.It is appreciated that its with processing function Its processor can also execute this operation, and content is with no restrictions in this respect.
As shown in Fig. 2, method includes the following steps: S210, multiple dimensions of the processing component based on tables of data to be cut The field for spending range, is decomposed into multiple subexpressions for the expression formula for being used for cut data table;S220, processing component obtain sublist The subrange targeted in tables of data up to formula;S230, processing component extend subrange, generate multiple spreading ranges;S240, place It manages component and is based on each spreading range, cut data table obtains each subregion;S250, processing component merge each subregion, Obtain combined region.
In step S210, expression formula may include: operator and character, for example, expression formula can be predicate expressions. Predicate expressions are to indicate a kind of calculation expression in the database.Predicate expressions may include: operator and value.Wherein, Operator may include:=,==,>,<,>=,≤,<>,!=,!>,!< etc..Value can be character, number etc..With reference to upper It states table (1), predicate expressions for example may is that university=Peking University and institute=School of Computer Science of scanning etc..
In the present embodiment, the conditional relationship that can refer to according to expression formula is cut, ineligible relationship is filtered out Region obtains remaining target area.
The field of each dimensional extent of the processing component based on tables of data to be cut, will be used for the expression of cut data table It may include following sub-step that formula, which is decomposed into the step of multiple subexpressions (i.e. S210):
S211, processing component obtain the field of each dimensional extent of tables of data.
For example, name, student number, university, institute and grade can indicate a dimensional extent in table (1).Table (1) can To be the table of 15 dimensional extent.Name, student number, university, institute and grade can indicate the field of dimensional extent.Each word The corresponding column of section can be subregion column.
S212, processing component obtain character corresponding with field in expression formula.
For example, be directed to predicate expressions: university=Peking University and institute=School of Computer Science, the character of acquisition can be with Be: university, Peking University, and, institute ,=, School of Computer Science.
S213, processing component are based on character, expression formula are decomposed into multiple subexpressions.
For example, predicate university=Peking University and institute=School of Computer Science expression formula can be decomposed are as follows: university= Peking University and this 2 sub- predicates of institute=School of Computer Science.
The implementation that step obtains subexpression subrange (i.e. S220) targeted in tables of data can wrap Include following sub-step:
S221 obtains associate field and include the subexpression for having corresponding character with field;
For example, only obtaining university=Peking University because university and Peking University include university's field Predicate.
S222, with the dimensional extent where the associated field of subexpression in, obtain the subrange of subexpression.
For example, can be the data column range where university with the associated field of subexpression.
In some embodiments, in S230, subrange is extended, multiple spreading ranges is generated, may include: from multiple sons Part subrange is taken out in range;The subrange of taking-up is subjected to permutation and combination, generates multiple extension expression formulas.
Thus, it is possible to take out the sub- predicate of Partial key from multiple sub- predicates according to actual conditions of demand, be based on The sub- predicate of Partial key is carried out permutation and combination, obtains multiple combination predicates, it is possible to reduce operand subtracts by logical relation Few sweep time.
Effect is obvious especially in the website data statistics of mass data for the embodiment.In order to describe succinct and letter of arguing It is single, the present embodiment only less situation of schematic enumerated data amount.It is appreciated that the present embodiment can be applied to mass data Database in.
In some embodiments, in S230, subrange is extended, multiple spreading ranges is generated, may include: from multiple sons Whole subranges are taken out in range;The subrange of taking-up is subjected to permutation and combination, generates multiple extension expression formulas.
Thus, it is possible to take out whole sub- predicates from multiple sub- predicates according to actual conditions of demand, logic-based is closed The sub- predicate of whole is carried out permutation and combination, obtains multiple combination predicates, can increase the precision of cutting by system.
In step S230, in partition table, spreading range can be rectangular extent.Such as the rectangle number of above-mentioned table (2) According to range.
In step S240, it is based on each spreading range, cut data table, it includes as follows for obtaining each subregional step Sub-step:
S241 obtains each subregional common portion region;
S242 obtains each subregional characteristic region;
After cropping inactive area, the common portion region of acquisition can the gauge outfit as shown in following table (3) and table (4) Region (including serial number, name and institute's field).
Table (3)
Serial number Name
5 Zhao Xinyi
6 Li Jiatong
Table (4)
Serial number Institute
5 Medical college
6 The College of Physics
In step s 250, merging subregion can be as shown in following table (5):
Table (5)
Serial number Name Institute
5 Zhao Xinyi Medical college
6 Li Jiatong The College of Physics
Foregoing invention embodiment can introduce permutation and combination theory on the basis of original subregion is cut as a result, optimize The method that existing subregion is cut.Specifically, subregion cutting problem can be split into according to permutation and combination theory different one by one Child partition cut problem, each child partition is then cut into problem and is cut according to existing subregion method of cutting out, is pressed simultaneously Phototypesetting column combinatorial theory merges the result that all child partitions are cut, and carries out to realize to the subregion of multiple and different ranges It cuts.
Fig. 3 is the flow diagram for the method that the various dimensions range partition of another embodiment of the present invention is cut.
As shown in figure 3, method includes the following steps:
S310 decomposes predicate according to subregion field (for example, field of multiple dimensional extents), by different subregions word The different predicates association that Duan Yuqi is related to.
Assuming that all subregion fields of partition table (such as tables of data) are (C1, C2, C3 ..., Cn), while all predicates For P.Wherein, C1 can be " name " field in table (1), and C2 can be " student number " field, and C3 can be " university " field, C4 It can be " institute " field, C5 can be " grade " field etc..P may is that name=Li Xi and student number=100010and is big =Peking University and institute=School of Computer Science and grades=2017 etc..
Predicate P is decomposed according to all subregion fields, P can be decomposed into (P1, P2, P3 ..., Pn).For example, P1 is " name=Li Xi ", P2 are " student number=100010 ", and P3 is " university=Peking University ", and P4 is " institute=School of Computer Science " Deng.
Different subregions field is associated from the different predicates that it is related to, obtains (Ci- > Pi), wherein i=1, 2 ..., m.That is subregion field C1 is associated with predicate P1, and subregion field C2 is associated with predicate P2 ... subregion field Cm and is associated with predicate Pm.Example Such as, " name " is associated with " name=Li Xi ", " student number " is associated with " student number=100010 ".Specific associated mode can Then to stop searching simply by the presence of the predicate Pj that a subregion field can not be cut with subregion by searching for determination.Also It is, if there are subregion fields not to have predicate for centre, to illustrate that the corresponding range of predicate has cavity, so subsequent partitions field There is no need to search for the predicate being related to.
The associated predicate Pi of all subregion fields is converted range by S320.
Wherein, range can be predicate subexpression subrange targeted in tables of data.The range may be one Or it is multiple, Ri [1] can be used, Ri [2] ..., Ri [ni] are indicated.I.e. subregion field is associated with the range that predicate obtains according to it For (Ci- > Ri [1], Ri [2] ..., Ri [ni]), wherein i=1,2 ..., m.The range number is (n1 ..., nm) respectively.
For example, predicate P1 " university=Peking University " can shift the range R1 [1] for being converted into following tables of data:
Serial number University
1 Peking University
2 Peking University
3 Peking University
4 Peking University
5 Peking University
6 Peking University
For example, predicate P2 " institute=School of Computer Science " can shift the range R1 [2] for being converted into following tables of data:
Serial number Institute
1 School of Computer Science
2 School of Computer Science
3 School of Computer Science
For example, predicate P3 " name=Li Xi " can shift the range R1 [3] for being converted into following tables of data:
Serial number Name
1 Li Xi
Subregion range is cut range by the various dimensions subregion that permutation and combination method obtains multiple sons and (e.g., cut by S330 The resulting subregion of tables of data).
Subregion range is (Ci- > Ri [1], Ri [2] ..., Ri [ni]).
Various dimensions subregion cuts range:
Wherein, SubRi expression one sub- various dimensions subregion cutting range, and byIt is composed.Ri ∈ Ri, ri expression The range of each dimension derives from the range Ri of field Ci.The number of various dimensions subregion range can be n1*n2...*nm.
For example, R1 [1], R1 [2], R1 [3] are available after permutation and combination: R1 [1], R1 [2], R1 [3], R1 [1]+ R1 [2], R1 [1]+R1 [3], R1 [1]+R1 [2]+R1 [3] etc..
S340 cuts range progress subregion according to sub- various dimensions subregion and cuts to obtain cutting subregion (such as subregion).
The number of various dimensions subregion range can be n1*n2...*nm.
The number for cutting subregion is also possible to n1*n2...*nm.
Cutting subregion is division result, Ke YiyongResult (i) is indicated.
S350 merges and cuts subregion, obtains merging subregion.
MergeResult (i) division result obtains final subregion and cuts result result, it may be assumed that merges subregion.
In some embodiments, it is assumed that subregion field is (c1, c2, c3) respectively, and the cutting of various dimensions range partition can root It is realized according to following 4 kinds of situations of predicate.1st kind of situation, can be directed to predicate c1=1 the case where:
Serial number c1 c2 c3
1 1
2
3
Firstly, collecting subregion arranges corresponding predicate, i.e. c1- > (c1=1).
Then, it calculates predicate and corresponds to range, i.e. c1- > ([1,1]).I.e. 1 above-mentioned table be the 1st row the 1st column.
Since other subregions column (c2 and c3) do not have predicate, then predicate corresponds to range and is exactly: [(1, min, min), (1, Max, max)].That is: [(c1 minimum value, c2 minimum value, c3 minimum value), (c1 maximum value, c2 maximum value, c3 maximum value)].Due to The present embodiment only has assignment to c1, and c2 and c3 are without assignment, so the section of c2 and c3 can be from min to max.Therefore, c2 It may each be min with the minimum value of c3, their maximum value may each be max.
Then, subregion cutting is carried out according to the range.
2nd kind of situation, can be directed to predicate c1=1and c2=1 the case where:
Serial number c1 c2 c3
1 1 1
2
3
It is associated with firstly, subregion is arranged corresponding predicate, it may be assumed that c1- > (c1=1), c2- > (c2=1).
Then, it calculates predicate and corresponds to range, as c1- > [1,1], c2- > [1,1], then predicate, which corresponds to subregion range, is [(1,1, min), (1,1, max)].
Then, subregion cutting is carried out according to subregion range.
3rd kind of situation, can be directed to predicate (c1=1or c1=2) and c2=1 the case where:
It is associated with firstly, subregion is arranged corresponding predicate, i.e. c1- > (c1=1or c1=2), c2- > (c2=1).
Since there are the expression formulas of or, then permutation and combination is carried out, is equivalent to (c1=1and c2=1) or (c1=1and C2=1).
Then, subregion range is calculated with reference to the 2nd kind of situation.
Then, merge the result for cutting subregion.
4th kind of situation, can be directed to (c1=1or c1=2) and (c2=1or c2=2) the case where:
It is associated with firstly, subregion is arranged corresponding predicate, i.e. c1- > (c1=1or c1=2), c2- > (c2=1or c2= 2)。
It since there are or expression formulas, then carries out permutation and combination and is split, be equivalent to (c1=1and c2=1) or (c1= 1and c2=2) or (c1=2and c2=1) or (c1=1and c2=2).
Then, subregion range is calculated with reference to the 2nd kind of situation.
Then, merge the result for cutting subregion.
All expression formulas can be carried out permutation and combination and be split as smaller unit of account by the embodiment of the present invention, then be counted Calculation obtains son and obtains final result as a result, merging.
The sequence of above-mentioned operating procedure is adjusted flexibly, or above-mentioned steps are subjected to the behaviour such as flexible combination Make.For simplicity, repeating no more various implementations.In addition, the content of each embodiment can mutual reference.
It should be noted that the executing subject of the above method can be processing component, it is also possible to controller, can also be Processor etc..In the absence of conflict, those skilled in the art can be according to actual needs by the suitable of above-mentioned operating procedure Sequence is adjusted flexibly, or above-mentioned steps are carried out the operation such as flexible combination.For simplicity, repeating no more various realization sides Formula.In addition, the content of each embodiment can mutual reference.
Fig. 4 is the structural schematic diagram for the device that the various dimensions range partition of another embodiment of the present invention is cut.
As shown in figure 4, the device 400 that various dimensions range partition is cut may include: expression formula decomposition component 401, range Securing component 402, range extension element 403, subregion cut component 404 and subregion combining block 405.Expression formula decomposition component 401 and range extension element 403 coupled respectively with range securing component 402, range extension element 403 and subregion combining block 405, which cut component 404 with subregion respectively, couples.
Wherein: expression formula decomposition component 401 can will be used based on the field of multiple dimensional extents of tables of data to be cut Multiple subexpressions are decomposed into the expression formula of cut data table;Range securing component 402 can be used for obtaining subexpression and exist Targeted subrange in tables of data;Range extension element 403 can be used for extending subrange, generate multiple spreading ranges;Point Area, which cuts component 404, can be based on each spreading range, and cut data table obtains each subregion;Subregion combining block 405 can To obtain combined region for merging each subregion.
In some embodiments, expression formula decomposition component 401 can be also used for: obtain each dimensional extent of tables of data Field;Obtain character corresponding with field in expression formula;Based on character, expression formula is decomposed into multiple subexpressions.
In some embodiments, range securing component 402 can be also used for: associate field and including having corresponding word with field The subexpression of symbol;With the dimensional extent where the associated field of subexpression in, obtain the subrange of subexpression.
In some embodiments, range extension element 403 can be also used for: part subrange is taken out from multiple subranges Or whole subranges;The subrange of taking-up is subjected to permutation and combination, generates multiple extension expression formulas.
In some embodiments, subregion combining block 405 can be also used for: obtain each subregional common portion area Domain;Obtain each subregional characteristic region;Merge common portion region and characteristic region, so that combined region packet Include common portion region and each characteristic region.
In some embodiments, expression formula may include: operator and character.
In some embodiments, expression formula is predicate expressions.
It should be noted that the device of the various embodiments described above can be used as the method for each embodiment of the various embodiments described above In executing subject, the corresponding process in each method may be implemented, realize identical technical effect, for sake of simplicity, in this respect Content repeats no more.
In the above-described embodiments, can come wholly or partly by software, hardware, firmware or any combination thereof real It is existing.For example, 2 elements are integrated in a device, 1 device can also be divided into two individual elements etc..Work as use When software realization, can entirely or partly it realize in the form of a computer program product.The computer program product includes One or more computer instructions describe in above-mentioned each embodiment so that computer executes when run on a computer Method.When loading on computers and executing the computer program instructions, entirely or partly generate according to of the invention real Apply process described in example or function.The computer can be general purpose computer, special purpose computer, computer network or its His programmable device.The computer instruction may be stored in a computer readable storage medium, or can from a computer It reads storage medium to transmit to another computer readable storage medium, for example, the computer instruction can be from website station Point, computer, server or data center pass through wired (such as coaxial cable, optical fiber, Digital Subscriber Line (DSL)) or wireless (such as infrared, wireless, microwave etc.) mode is transmitted to another web-site, computer, server or data center.Institute Stating computer readable storage medium can be any usable medium or include that one or more can that computer can access The data storage devices such as server, the data center integrated with medium.The usable medium can be magnetic medium, (for example, soft Disk, hard disk, tape), optical medium (for example, DVD) or semiconductor medium (such as solid state hard disk Solid State Disk (SSD)) etc..
Fig. 5 is the block schematic illustration that the various dimensions range partition of one embodiment of the invention is cut.
It, can be according to being stored in read-only storage as shown in figure 5, the frame may include central processing unit (CPU) 501 Program in device (ROM) 502 is executed from the program that storage section 508 is loaded into random access storage device (RAM) 503 The various operations that Fig. 2 and Fig. 3 embodiment is done.In RAM503, be also stored with system architecture operation needed for various programs and Data.CPU501, ROM 502 and RAM 503 is connected with each other by bus 504.Input/output (I/O) interface 505 also connects To bus 504.
I/O interface 505 is connected to lower component: the importation 506 including keyboard, mouse etc.;It is penetrated including such as cathode The output par, c 507 of spool (CRT), liquid crystal display (LCD) etc. and loudspeaker etc.;Storage section 508 including hard disk etc.; And the communications portion 509 of the network interface card including LAN card, modem etc..Communications portion 509 via such as because The network of spy's net executes communication process.Driver 510 is also connected to I/O interface 505 as needed.Detachable media 511, such as Disk, CD, magneto-optic disk, semiconductor memory etc. are mounted on as needed on driver 510, in order to read from thereon Computer program be mounted into storage section 508 as needed.
Particularly, according to an embodiment of the invention, may be implemented as computer above with reference to the process of flow chart description Software program.For example, the embodiment of the present invention includes a kind of computer program product comprising be tangibly embodied in machine readable Computer program on medium, the computer program include the program code for method shown in execution flow chart.At this In the embodiment of sample, which can be downloaded and installed from network by communications portion 509, and/or from removable Medium 511 is unloaded to be mounted.
The apparatus embodiments described above are merely exemplary, wherein described, unit can as illustrated by the separation member It is physically separated with being or may not be, component shown as a unit may or may not be physics list Member, it can it is in one place, or may be distributed over multiple network units.It can be selected according to the actual needs In some or all of the modules achieve the purpose of the solution of this embodiment.Those of ordinary skill in the art are not paying creativeness Labour in the case where, it can understand and implement.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment can It realizes by means of software and necessary general hardware platform, naturally it is also possible to pass through hardware.Based on this understanding, on Stating technical solution, substantially the part that contributes to existing technology can be embodied in the form of software products in other words, should Computer software product may be stored in a computer readable storage medium, such as ROM/RAM, magnetic disk, CD, including several fingers It enables and using so that a computer equipment (can be personal computer, server or the network equipment etc.) executes each implementation Method described in certain parts of example or embodiment.
Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;Although Present invention has been described in detail with reference to the aforementioned embodiments, those skilled in the art should understand that: it still may be used To modify the technical solutions described in the foregoing embodiments or equivalent replacement of some of the technical features; And these are modified or replaceed, technical solution of various embodiments of the present invention that it does not separate the essence of the corresponding technical solution spirit and Range.

Claims (16)

1. a kind of method that various dimensions range partition is cut characterized by comprising
The field of multiple dimensional extents based on tables of data to be cut, the expression formula for being used to cut the tables of data is decomposed into Multiple subexpressions;
Obtain subexpression subrange targeted in the tables of data;
The subrange is extended, multiple spreading ranges are generated;
Based on each spreading range, the tables of data is cut, obtains each subregion;
Merge each subregion, obtains combined region.
2. the method according to claim 1, wherein each dimensional extent based on tables of data to be cut Field, the expression formula for being used to cut the tables of data is decomposed into multiple subexpressions, comprising:
Obtain the field of each dimensional extent of the tables of data;
Obtain character corresponding with the field in the expression formula;
Based on the character, the expression formula is decomposed into multiple subexpressions.
3. the method according to claim 1, wherein described obtain subexpression institute in the tables of data For subrange, comprising:
Acquisition is associated with the field and includes the subexpression for having corresponding character with the field;
With the dimensional extent where the associated field of the subexpression in, obtain the subrange of the subexpression.
4. multiple spreading ranges are generated the method according to claim 1, wherein described extend the subrange, Include:
Part subrange or whole subranges are taken out from multiple subranges;
The subrange of taking-up is subjected to permutation and combination, generates multiple extension expression formulas.
5. obtaining combined region the method according to claim 1, wherein described merge each subregion Include:
Obtain each subregional common portion region;
Obtain each subregional characteristic region;
Merge the common portion region and the characteristic region, so that the combined region includes the common portion area Domain and each characteristic region.
6. method according to any one of claims 1-5, which is characterized in that the expression formula includes: operator and word Symbol.
7. according to the method described in claim 6, it is characterized in that, the expression formula is predicate expressions.
8. the device that a kind of various dimensions range partition is cut characterized by comprising
Expression formula decomposition component, range securing component, range extension element, subregion cut component and subregion combining block,
The expression formula decomposition component and the range extension element are coupled with the range securing component respectively,
The range extension element and the subregion combining block cut component with the subregion respectively and couple, in which:
The expression formula decomposition component, the field of multiple dimensional extents based on tables of data to be cut, will be used for cut data The expression formula of table is decomposed into multiple subexpressions;
The range securing component, for obtaining subexpression subrange targeted in tables of data;
The range extension element generates multiple spreading ranges for extending subrange;
The subregion cuts component, is based on each spreading range, and cut data table obtains each subregion;
The subregion combining block obtains combined region for merging each subregion.
9. device according to claim 8, which is characterized in that the expression formula decomposition component is also used to:
Obtain the field of each dimensional extent of tables of data;
Obtain character corresponding with field in expression formula;
Based on character, expression formula is decomposed into multiple subexpressions.
10. device according to claim 8, which is characterized in that the range securing component is also used to:
Obtain associate field and include the subexpression for having corresponding character with field;
With the dimensional extent where the associated field of subexpression in, obtain the subrange of subexpression.
11. device according to claim 8, which is characterized in that the range extension element is also used to:
Part subrange or whole subranges are taken out from multiple subranges;
The subrange of taking-up is subjected to permutation and combination, generates multiple extension expression formulas.
12. device according to claim 8, which is characterized in that the subregion combining block is also used to:
Obtain each subregional common portion region;
Obtain each subregional characteristic region;
Merge common portion region and characteristic region, so that combined region includes common portion region and each characteristic Region.
13. according to the device of any one of claim 8-12, which is characterized in that the expression formula includes: operator and character.
14. device according to claim 13, which is characterized in that the expression formula is predicate expressions.
15. the device that a kind of various dimensions range partition is cut characterized by comprising
Memory, for storing program;
Processor, for executing the program of the memory storage, described program makes the processor execute such as claim Method described in any one of 1-7.
16. a kind of computer readable storage medium, comprising: instruction,
When described instruction is run on computers, so that computer executes the side as described in any one of claim 1-7 Method.
CN201711478050.0A 2017-12-29 2017-12-29 Method, device and storage medium for partition clipping in multi-dimensional range Active CN110019238B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711478050.0A CN110019238B (en) 2017-12-29 2017-12-29 Method, device and storage medium for partition clipping in multi-dimensional range

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711478050.0A CN110019238B (en) 2017-12-29 2017-12-29 Method, device and storage medium for partition clipping in multi-dimensional range

Publications (2)

Publication Number Publication Date
CN110019238A true CN110019238A (en) 2019-07-16
CN110019238B CN110019238B (en) 2023-01-06

Family

ID=67187199

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711478050.0A Active CN110019238B (en) 2017-12-29 2017-12-29 Method, device and storage medium for partition clipping in multi-dimensional range

Country Status (1)

Country Link
CN (1) CN110019238B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111695289A (en) * 2020-05-13 2020-09-22 中国东方电气集团有限公司 Fault diagnosis method and platform of full-power converter
CN116775698A (en) * 2023-08-23 2023-09-19 本原数据(北京)信息技术有限公司 Partition clipping method and device for database, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020194157A1 (en) * 1999-09-27 2002-12-19 Mohamed Zait Partition pruning with composite partitioning
US20060253429A1 (en) * 2005-05-04 2006-11-09 Ananth Raghavan Techniques for partition pruning
CN103177061A (en) * 2011-12-23 2013-06-26 Sap股份公司 Unique value estimation in partitioned tables
CN104685496A (en) * 2012-09-28 2015-06-03 甲骨文国际公司 Pruning disk blocks of a clustered table in a relational database management system
US20150286681A1 (en) * 2012-09-28 2015-10-08 Oracle International Corporation Techniques for partition pruning based on aggregated zone map information

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020194157A1 (en) * 1999-09-27 2002-12-19 Mohamed Zait Partition pruning with composite partitioning
US20060253429A1 (en) * 2005-05-04 2006-11-09 Ananth Raghavan Techniques for partition pruning
CN103177061A (en) * 2011-12-23 2013-06-26 Sap股份公司 Unique value estimation in partitioned tables
CN104685496A (en) * 2012-09-28 2015-06-03 甲骨文国际公司 Pruning disk blocks of a clustered table in a relational database management system
US20150286681A1 (en) * 2012-09-28 2015-10-08 Oracle International Corporation Techniques for partition pruning based on aggregated zone map information

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111695289A (en) * 2020-05-13 2020-09-22 中国东方电气集团有限公司 Fault diagnosis method and platform of full-power converter
CN116775698A (en) * 2023-08-23 2023-09-19 本原数据(北京)信息技术有限公司 Partition clipping method and device for database, computer equipment and storage medium
CN116775698B (en) * 2023-08-23 2023-11-24 本原数据(北京)信息技术有限公司 Partition clipping method and device for database, computer equipment and storage medium

Also Published As

Publication number Publication date
CN110019238B (en) 2023-01-06

Similar Documents

Publication Publication Date Title
US9804954B2 (en) Automatic cognitive adaptation of development assets according to requirement changes
JP5990192B2 (en) Filtering query data in the data store
CN107430611B (en) Filtering data lineage graph
US10706598B2 (en) Interface for data analysis
US11546380B2 (en) System and method for creation and implementation of data processing workflows using a distributed computational graph
CN109582289B (en) Method, system, storage medium and processor for processing rule flow in rule engine
US20140101097A1 (en) Template based database analyzer
CN111125107A (en) Data processing method, device, electronic equipment and medium
US20150019284A1 (en) Dynamically modifying business processes based on real-time events
CN110019238A (en) The method, apparatus and storage medium that various dimensions range partition is cut
CN107291835B (en) Search term recommendation method and device
Durón Heatmap centrality: a new measure to identify super-spreader nodes in scale-free networks
CN106598861B (en) Test case generation method and device and electronic equipment
Thom et al. Automated methods to test connectedness and quantify indirectness of evidence in network meta‐analysis
US10353890B2 (en) Automatic enumeration of data analysis options and rapid analysis of statistical models
US11347796B2 (en) Eliminating many-to-many joins between database tables
WO2024021581A1 (en) Metric query method and system, and electronic device and storage medium
CN112925522A (en) Dependency graph generation method, dependency graph generation device, dependency graph generation apparatus, storage medium, and program product
CN109597702B (en) Root cause analysis method, device, equipment and storage medium for message bus abnormity
CN107169115A (en) Add the method and device of self-defined participle
CN109597819B (en) Method and apparatus for updating a database
CN107832347B (en) Data dimension reduction method and system and electronic equipment
CN110019531B (en) Method and device for acquiring similar object set
CN107291923B (en) Information processing method and device
US20160246783A1 (en) Systems and methods for managing data related to network elements from multiple sources

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
REG Reference to a national code

Ref country code: HK

Ref legal event code: DE

Ref document number: 40010889

Country of ref document: HK

GR01 Patent grant
GR01 Patent grant