CN103942341A - Data processing method and device used for multi-dimensional data set

Publication number: CN103942341A
Authority: CN
Grant status: Application
Prior art keywords: query, batch, parameters, derived, parameter
Application number: CN 201410196206
Other languages: Chinese (zh)
Inventor: 洪超
Original Assignee: 北京国双科技有限公司

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00: Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30: Information retrieval; Database structures therefor; File system structures therefor
    • G06F17/30286: Information retrieval; Database structures therefor; File system structures therefor in structured data stores
    • G06F17/30587: Details of specialised database models
    • G06F17/30592: Multi-dimensional databases and data warehouses, e.g. MOLAP, ROLAP

Abstract

The invention discloses a data processing method and device for a multi-dimensional data set. The method includes: batch querying the parameters to be queried in the multi-dimensional data set, where the parameters to be queried are parameters of target objects; batch generating derived values of the parameters to be queried, where the derived values indicate whether the parameters to be queried satisfy a preset condition; and batch counting, according to the derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition. The method and device solve the problem in the related art that querying a multi-dimensional data set consumes a large amount of time.

Description

Data processing method and apparatus for a multi-dimensional data set

Technical Field

[0001] The present invention relates to the field of computers, and in particular to a data processing method and apparatus for a multi-dimensional data set.

Background

[0002] At present, in the related art, querying a multi-dimensional data set usually requires first filtering the data set entry by entry according to a filter condition and then aggregating the filtered results. For example, the data in the multi-dimensional data set is first filtered one entry at a time (filter) to obtain a filter result, and the data in the filter result that satisfies the filter condition is then counted (count). Because this approach is a cell-by-cell query mode, a large number of cells must be queried one by one, so the whole query is time-consuming and performs poorly.

[0003] For example, for products that have already generated orders, to find the number of product types whose order quantity is greater than 5, the multi-dimensional data set is queried with the following statement: Count(Filter([Product].[Product].[Product], [Measures].[Internet Order Quantity] > 5)). Because count(filter...) is a cell-by-cell computation mode, it is time-consuming and performs poorly.
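The cell-by-cell pattern described above can be pictured with a short Python sketch. The dictionary `ORDER_QUANTITY` and the function `count_filter` are illustrative assumptions and are not part of the patent or of any MDX engine; the sketch only mimics how a Count(Filter(...)) style query first builds the filtered set one cell at a time and then counts it.

```python
# Hypothetical stand-in for a cube slice: product name -> Internet order quantity.
ORDER_QUANTITY = {"A": 7, "B": 3, "C": 12, "D": 5}


def count_filter(cells, threshold):
    """Mimic Count(Filter(...)): test each cell in turn, keep the matches, then count them."""
    matched = []
    for product, quantity in cells.items():  # one cell is visited per iteration
        if quantity > threshold:
            matched.append(product)  # intermediate filtered set
    return len(matched)  # the count is taken only after filtering finishes


result = count_filter(ORDER_QUANTITY, 5)
```

The intermediate `matched` list corresponds to the filter result that must be materialized before counting, which is the cost the invention aims to avoid.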

[0004] No effective solution has yet been proposed for the problem in the related art that querying a multi-dimensional data set is time-consuming.

Summary of the Invention

[0005] The main object of the present invention is to provide a data processing method and apparatus for a multi-dimensional data set, so as to solve the problem in the related art that querying a multi-dimensional data set is time-consuming.

[0006] To achieve the above object, according to one aspect of the present invention, a data processing method for a multi-dimensional data set is provided. The method includes: batch querying the parameters to be queried in the multi-dimensional data set, where a parameter to be queried is a parameter of a target object; batch generating derived values of the parameters to be queried, where a derived value indicates whether a parameter to be queried satisfies a preset condition; and batch counting, according to the derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.
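The three steps of the method can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the list `CUBE`, the function names `batch_query`, `batch_derive`, and `batch_count`, and the choice of 1 and 0 as derived values are hypothetical and are not prescribed by the text at this point.

```python
# Hypothetical cube slice: each row is one target object (a product) with its parameters.
CUBE = [
    {"product": "P1", "orders": 9},
    {"product": "P2", "orders": 2},
    {"product": "P3", "orders": 6},
]


def batch_query(cube, parameter):
    """Step 1: fetch the parameter to be queried for every target object in one pass."""
    return [row[parameter] for row in cube]


def batch_derive(values, condition):
    """Step 2: produce one derived value per parameter (1 = satisfies, 0 = does not)."""
    return [1 if condition(v) else 0 for v in values]


def batch_count(derived):
    """Step 3: count the target objects whose derived value marks the condition as met."""
    return sum(derived)


orders = batch_query(CUBE, "orders")
derived = batch_derive(orders, lambda q: q > 5)
count = batch_count(derived)
```

Each step consumes the whole column produced by the previous step, so no intermediate set of matching members has to be built.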

[0007] Further, before batch generating the derived values of the parameters to be queried, the data processing method also includes: batch determining whether the parameters to be queried satisfy the preset condition. If the parameters to be queried are determined to satisfy the preset condition, batch generating the derived values includes batch generating first derived values of the parameters to be queried, and batch counting the target objects according to the derived values includes batch counting, according to the first derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0008] Further, after batch determining whether the parameters to be queried satisfy the preset condition, if the parameters to be queried are determined not to satisfy the preset condition, batch generating the derived values includes batch generating second derived values of the parameters to be queried, and batch counting the target objects according to the derived values includes batch counting, according to the first derived values and the second derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0009] Further, before batch generating the first derived values of the parameters to be queried, the data processing method also includes: batch generating first identifiers indicating that the parameters to be queried satisfy the preset condition. After the first identifiers are batch generated, batch generating the first derived values includes batch converting the first identifiers into the first derived values.

[0010] Further, batch converting the first identifiers into the first derived values includes batch converting the first identifiers into 1, and batch counting according to the first derived values includes accumulating the 1s to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0011] To achieve the above object, according to another aspect of the present invention, a data processing apparatus for a multi-dimensional data set is provided. The apparatus includes: a query unit, configured to batch query the parameters to be queried in the multi-dimensional data set, where a parameter to be queried is a parameter of a target object; a first generating unit, configured to batch generate derived values of the parameters to be queried, where a derived value indicates whether a parameter to be queried satisfies a preset condition; and a counting unit, configured to batch count, according to the derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0012] Further, the data processing apparatus also includes a determining unit, configured to batch determine, before the derived values are batch generated, whether the parameters to be queried satisfy the preset condition. The first generating unit is further configured to batch generate first derived values of the parameters to be queried if they are determined to satisfy the preset condition, and the counting unit is further configured to batch count, according to the first derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0013] Further, the first generating unit is further configured to batch generate second derived values of the parameters to be queried if, after the batch determination, the parameters to be queried are determined not to satisfy the preset condition; and the counting unit is further configured to batch count, according to the first derived values and the second derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0014] Further, the data processing apparatus also includes a second generating unit, configured to batch generate, before the first derived values are batch generated, first identifiers indicating that the parameters to be queried satisfy the preset condition. The first generating unit is further configured to batch convert the first identifiers into the first derived values after the first identifiers are generated.

[0015] Further, the first generating unit is further configured to batch convert the first identifiers into 1; and the counting unit is further configured to accumulate the 1s to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0016] Through the present invention, the parameters to be queried in the multi-dimensional data set are queried in batch, where a parameter to be queried is a parameter of a target object; derived values of the parameters to be queried are generated in batch, where a derived value indicates whether a parameter to be queried satisfies a preset condition; and the target objects corresponding to the parameters to be queried that satisfy the preset condition are counted in batch according to the derived values. Because the cells in the multi-dimensional data set can be processed in batch, the problem in the related art that querying a multi-dimensional data set is time-consuming is solved, and the multi-dimensional data set can be queried quickly.

Brief Description of the Drawings

[0017] The accompanying drawings, which form a part of this application, provide a further understanding of the present invention. The exemplary embodiments of the present invention and their descriptions are used to explain the present invention and do not unduly limit it. In the drawings:

[0018] FIG. 1 is a schematic diagram of a data processing apparatus for a multi-dimensional data set according to an embodiment of the present invention;

[0019] FIG. 2 is a schematic diagram of a data processing apparatus for a multi-dimensional data set according to an optional embodiment of the present invention;

[0020] FIG. 3 is a flowchart of a data processing method for a multi-dimensional data set according to an embodiment of the present invention; and

[0021] FIG. 4 is a flowchart of a data processing method for a multi-dimensional data set according to an optional embodiment of the present invention.

Detailed Description

[0022] It should be noted that the embodiments of this application and the features in the embodiments may be combined with each other provided that they do not conflict. The present invention is described in detail below with reference to the drawings and in conjunction with the embodiments.

[0023] To help those skilled in the art better understand the solution of the present invention, the technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings of the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort shall fall within the protection scope of the present invention.

[0024] It should be noted that the terms "first", "second", and the like in the specification, the claims, and the above drawings of the present invention are used to distinguish similar objects and do not necessarily describe a particular order or sequence. It should be understood that data so used are interchangeable where appropriate, so that the embodiments of the present invention described here can be implemented in orders other than those illustrated or described here. In addition, the terms "include" and "have", and any variants of them, are intended to cover non-exclusive inclusion.

[0025] According to an embodiment of the present invention, a data processing apparatus for a multi-dimensional data set is provided. The apparatus processes the cell data in the multi-dimensional data set in batch to increase the speed at which users query data.

[0026] FIG. 1 is a schematic diagram of a data processing apparatus for a multi-dimensional data set according to an embodiment of the present invention.

[0027] As shown in FIG. 1, the apparatus includes a query unit 10, a first generating unit 20, and a counting unit 30.

[0028] The query unit 10 is configured to batch query the parameters to be queried in the multi-dimensional data set, where a parameter to be queried is a parameter of a target object.

[0029] A parameter to be queried may be the data, that is, the parameter, on which a customer's analysis of the target object depends. For example, it may include the number of orders for a product.

[0030] For example, a multi-dimensional data set records all product types of an enterprise and the number of orders generated for each product type. If a customer wants to determine how many products have an order count exceeding a predetermined value, the product can serve as the target object and the product's order count as the parameter to be queried. Querying the parameters to be queried yields a query result, from which attribute information of the target objects can be analyzed. Specifically, suppose the multi-dimensional data set records a first product, a second product, a third product, and a fourth product whose order counts are 30, 26, 15, and 13 respectively. If the query condition is an order count greater than 20, the query result is the first product and the second product, 2 products in total.
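The worked example in paragraph [0030] can be reproduced with a few lines of Python. The dictionary `ORDERS` and the helper `products_over` are names chosen here for illustration only; the order counts and the threshold of 20 are taken from the paragraph above.

```python
# Order counts from paragraph [0030]; the product labels are used as dictionary keys.
ORDERS = {
    "first product": 30,
    "second product": 26,
    "third product": 15,
    "fourth product": 13,
}


def products_over(orders, threshold):
    """Return the target objects whose order count exceeds the threshold."""
    return [name for name, count in orders.items() if count > threshold]


matching = products_over(ORDERS, 20)
number_of_products = len(matching)
```

Here `matching` holds the first and second products, and `number_of_products` is 2, in agreement with the query result stated in the text.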

[0031] Batch querying the parameters to be queried in the multi-dimensional data set may mean querying multiple parameters at the same time, which avoids the large amount of query time consumed by querying them one by one and makes the query fast. In the above example, batch querying means querying the order counts of the first, second, third, and fourth products simultaneously, rather than querying the order count of each of the four products one by one.

[0032] It should be noted that, besides the order count of a product, the parameters to be queried may include other parameters on which the customer's analysis of the target object depends, such as a product's weight, volume, or even its production time and shelf life.

[0033] The first generating unit 20 is configured to batch generate the derived values of the parameters to be queried.

[0034] A derived value indicates whether a parameter to be queried satisfies the preset condition. The preset condition may be a preset value.

[0035] It should be noted that querying the parameters to be queried identifies which of them satisfy the preset condition and which do not, and the generated derived values distinguish the two groups. Taking again the query of whether a product's order count in the multi-dimensional data set exceeds a preset value: one uniform derived value can be generated for the parameters that satisfy the preset condition, and another uniform derived value for the parameters that do not, the two derived values being different. After the derived values are generated, they can be stored in a derived column.

[0036] In the embodiments of the present invention, the derived values of the parameters to be queried may be batch generated in the following modes:

[0037] Mode 1: one identical derived value is generated for all parameters to be queried that satisfy the preset condition, and another identical derived value is generated for all parameters to be queried that do not. That is, a derived value is generated for every parameter, whether or not it satisfies the preset condition; the derived value of the satisfying parameters differs from that of the non-satisfying parameters, while all satisfying parameters share one derived value and all non-satisfying parameters share another.

[0038] Mode 2: an identical derived value is generated only for the parameters to be queried that satisfy the preset condition; no derived value is generated for those that do not.

[0039] Mode 3: the opposite of Mode 2. An identical derived value is generated only for the parameters to be queried that do not satisfy the preset condition; no derived value is generated for those that do.
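The three generation modes can be compared side by side in a small Python sketch. The function `derive`, the use of `None` to represent "no derived value generated", and the default derived values 1 and 0 are assumptions made for illustration; the text only requires that the two derived values differ.

```python
def derive(values, condition, mode, hit=1, miss=0):
    """Build a derived column; None marks a parameter for which no derived value is generated."""
    if mode == 1:  # derived values for satisfying and non-satisfying parameters
        return [hit if condition(v) else miss for v in values]
    if mode == 2:  # derived values only for satisfying parameters
        return [hit if condition(v) else None for v in values]
    if mode == 3:  # derived values only for non-satisfying parameters
        return [None if condition(v) else miss for v in values]
    raise ValueError("mode must be 1, 2, or 3")


def over_20(order_count):
    """Preset condition from the running example: order count greater than 20."""
    return order_count > 20


orders = [30, 26, 15, 13]
mode1 = derive(orders, over_20, 1)
mode2 = derive(orders, over_20, 2)
mode3 = derive(orders, over_20, 3)
```

With the order counts of the running example, `mode1` is fully populated, while `mode2` and `mode3` each leave half of the column empty.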

[0040] Generating derived values removes the need to re-check, on every query, whether each parameter to be queried satisfies the preset condition, which saves query time for subsequent queries and increases query speed.
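One way to picture this saving is a derived column that is computed once and then reused. The class `DerivedColumn` and its `evaluations` counter are hypothetical names introduced here; the sketch assumes derived values 1 and 0 and is not a description of how any particular OLAP engine stores derived columns.

```python
class DerivedColumn:
    """Stores derived values so the preset condition is evaluated once per parameter."""

    def __init__(self, values, condition):
        self.evaluations = 0  # how many times the condition has been checked

        def checked(value):
            self.evaluations += 1
            return condition(value)

        # Derived values are generated in one pass and kept for later queries.
        self._derived = [1 if checked(v) else 0 for v in values]

    def count(self):
        """Answer a count query from the stored derived values only."""
        return sum(self._derived)


column = DerivedColumn([30, 26, 15, 13], lambda n: n > 20)
first_answer = column.count()
second_answer = column.count()
```

Both calls to `count` return the same answer, and `column.evaluations` stays at 4 because the second query never re-checks the condition.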

[0041] The counting unit 30 is configured to batch count, according to the derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0042] This batch counting may use all or only some of the derived values.

[0043] Specifically, the counting may follow the mode in which the derived values were batch generated and the generated derived values themselves:

[0044] For Mode 1, the target objects corresponding to the parameters to be queried that satisfy the preset condition may be batch counted using both the satisfying and the non-satisfying derived values, or using only the satisfying derived values. Alternatively, the total number of target objects may be counted first, together with the number of target objects whose parameters do not satisfy the preset condition (counted from the non-satisfying derived values); the latter is then subtracted from the former, and the difference is the number of target objects whose parameters satisfy the preset condition.

[0045] For Mode 2, the target objects corresponding to the parameters to be queried that satisfy the preset condition may be batch counted directly from the satisfying derived values.

[0046] For Mode 3, the number of target objects whose parameters do not satisfy the preset condition may first be batch counted directly from the non-satisfying derived values, and the total number of target objects to be analyzed is then counted. Subtracting the former from the latter gives the number of target objects whose parameters satisfy the preset condition.
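The two counting strategies described for the three modes, direct counting and counting by difference, can be sketched as follows. The function names, the literal derived columns, and the use of `None` for a missing derived value are illustrative assumptions.

```python
def count_direct(derived, hit=1):
    """Count target objects directly from the satisfying derived values (Modes 1 and 2)."""
    return sum(1 for d in derived if d == hit)


def count_by_difference(derived, miss=0):
    """Count via total minus non-satisfying derived values (Modes 1 and 3)."""
    total = len(derived)  # total number of target objects
    not_satisfying = sum(1 for d in derived if d == miss)
    return total - not_satisfying


# Derived columns for order counts 30, 26, 15, 13 with the condition "greater than 20".
mode1 = [1, 1, 0, 0]        # both derived values present
mode2 = [1, 1, None, None]  # only satisfying derived values present
mode3 = [None, None, 0, 0]  # only non-satisfying derived values present

direct_mode1 = count_direct(mode1)
direct_mode2 = count_direct(mode2)
difference_mode1 = count_by_difference(mode1)
difference_mode3 = count_by_difference(mode3)
```

All four results agree, which reflects the statement that the same count can be obtained from all or only part of the derived values.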

[0047] In the embodiments of the present invention, cells in the multi-dimensional data set may be queried in batch (Bulk) by means of Multi-Dimensional Expressions (MDX). Specifically, the Bulk query mode can be implemented by combining iif and sum. For example, to generate derived value m for products whose order quantity is greater than k and derived value n for products whose order quantity is not greater than k, the following query statement can be used: Sum([product], [product].[product], Iif([Measures].[Internet Order Quantity] > k, m, n)).
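The effect of combining iif and sum can be imitated in Python. The function `sum_iif` is a hypothetical analogue of the query above and does not call any MDX engine. The second function, `count_from_sum`, is an inference added here and is not stated in the text: if c members satisfy the condition out of N, the aggregated sum is S = m*c + n*(N - c), so c = (S - n*N) / (m - n) whenever m differs from n.

```python
def sum_iif(quantities, k, m, n):
    """Python analogue of summing IIf(quantity > k, m, n) over all members."""
    return sum(m if q > k else n for q in quantities)


def count_from_sum(total, size, m, n):
    """Recover the number of members with quantity > k from the aggregated sum.

    Uses total = m*c + n*(size - c); the derived values m and n must differ.
    """
    if m == n:
        raise ValueError("m and n must be different derived values")
    return (total - n * size) // (m - n)


quantities = [30, 26, 15, 13]
total = sum_iif(quantities, 20, 5, 2)  # m = 5, n = 2 are arbitrary integer derived values
count = count_from_sum(total, len(quantities), 5, 2)
simple_count = sum_iif(quantities, 20, 1, 0)  # with m = 1, n = 0 the sum is the count
```

Choosing m = 1 and n = 0 makes the recovery step unnecessary, because the aggregated sum is already the number of matching members.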

[0048] Through the embodiments of the present invention, batch querying the parameters to be queried and generating their derived values avoids the drawback of querying the parameters one by one and counting the query results in real time, which saves query time and increases query speed.

[0049] FIG. 2 is a schematic diagram of a data processing apparatus for a multi-dimensional data set according to an optional embodiment of the present invention.

[0050] As shown in FIG. 2, this embodiment may serve as a preferred implementation of the embodiment shown in FIG. 1. In addition to the query unit 10, the first generating unit 20, and the counting unit 30 of the first embodiment, the data processing apparatus of this embodiment includes a determining unit 40.

[0051] The query unit 10 functions as in the first embodiment and is not described again here.

[0052] The determining unit 40 is configured to batch determine whether the parameters to be queried satisfy the preset condition.

[0053] Specifically, the parameters to be queried may be compared with the preset condition in batch to determine whether they satisfy it. If the parameters to be queried are determined to satisfy the preset condition, their derived values are batch generated. If they are determined not to satisfy the preset condition, no derived values are generated. After the derived values are generated, they can be stored in a derived column.

[0054] Generating derived values only when the parameters to be queried satisfy the preset condition saves the storage space used to store derived values.

[0055] The first generating unit 20 is further configured to batch generate the first derived values of the parameters to be queried.

[0056] If the parameters to be queried are determined to satisfy the preset condition, their first derived values are batch generated. If they are determined not to satisfy the preset condition, no processing is performed.

[0057] The counting unit 30 is further configured to batch count, according to the first derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0058] The first derived value may be integer data. Because each parameter to be queried that satisfies the preset condition has one first derived value, accumulating the first derived values themselves, or transformations of them, gives the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0059] Through this embodiment of the present invention, batch determining whether the parameters to be queried satisfy the preset condition means that derived values are generated only for the parameters that satisfy it, which simplifies the generation of derived values and saves storage space.
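The cooperation of the four units in FIG. 2 can be sketched as a set of small Python classes. The class and method names, the wrapper class `Apparatus`, and the sample data are hypothetical; the sketch assumes the first derived value is 1 and uses `None` where no derived value is generated.

```python
class QueryUnit:
    def query(self, cube, parameter):
        """Batch query the parameter to be queried for every target object."""
        return [row[parameter] for row in cube]


class DeterminingUnit:
    def __init__(self, condition):
        self.condition = condition

    def determine(self, values):
        """Batch determine whether each parameter satisfies the preset condition."""
        return [self.condition(v) for v in values]


class FirstGeneratingUnit:
    def generate(self, flags):
        """Generate the first derived value 1 only where the condition is satisfied."""
        return [1 if flag else None for flag in flags]


class CountingUnit:
    def count(self, derived):
        """Accumulate the first derived values that were actually generated."""
        return sum(d for d in derived if d is not None)


class Apparatus:
    """Wires the units together in the order described for FIG. 2."""

    def __init__(self, condition):
        self.query_unit = QueryUnit()
        self.determining_unit = DeterminingUnit(condition)
        self.first_generating_unit = FirstGeneratingUnit()
        self.counting_unit = CountingUnit()

    def run(self, cube, parameter):
        values = self.query_unit.query(cube, parameter)
        flags = self.determining_unit.determine(values)
        derived = self.first_generating_unit.generate(flags)
        return self.counting_unit.count(derived)


cube = [{"orders": 30}, {"orders": 26}, {"orders": 15}, {"orders": 13}]
count = Apparatus(lambda n: n > 20).run(cube, "orders")
```

Keeping each unit behind its own method mirrors the division of responsibilities in the text, so a second derived value could be added by changing only the generating and counting units.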

[0060] Optionally, in the embodiments of the present invention, after batch determining whether the parameters to be queried satisfy the preset condition, in addition to batch generating first derived values when the parameters are determined to satisfy the preset condition, the first generating unit 20 may also batch generate second derived values when the parameters are determined not to satisfy the preset condition. The first derived value and the second derived value are different. The counting unit 30 is then further configured to batch count, according to the first derived values and the second derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition. Both derived values may be integer data. Because each parameter to be queried that satisfies the preset condition has one first derived value and each parameter that does not has one second derived value, accumulating the first derived values and the second derived values themselves, or transformations of them, gives the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0061] Optionally, in the embodiments of the present invention, the data processing apparatus may also include a second generating unit. The second generating unit is configured to batch generate, before the first derived values are batch generated, first identifiers indicating that the parameters to be queried satisfy the preset condition. A first identifier may be a keyword, and the keyword may be a character string. The first generating unit 20 may then batch convert the first identifiers into the first derived values after the first identifiers are generated.

[0062] Further optionally, the first generating unit 20 may batch convert the first identifiers into 1, that is, convert every first identifier into 1. The counting unit 30 may then accumulate all of the first derived values 1 to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0063] Similarly, the data processing device may further include a third generating unit. The third generating unit is configured to generate in batch, before the second derived values of the parameters to be queried are generated in batch, a second identifier for each parameter to be queried, the second identifier indicating that the parameter does not satisfy the preset condition. The second identifier may be a keyword, and the keyword may be a character string. It should be noted that the first identifier and the second identifier are different. The first generating unit 20 may further be configured to convert the second identifiers in batch into second derived values after the second identifiers have been generated. Further, the first generating unit 20 may be configured to convert the second identifiers in batch into 0, that is, to convert every second identifier into 0. The statistics unit 30 is then further configured to accumulate all of the first derived values of 1 and all of the second derived values of 0 to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition. For example, to generate the derived value 1 for products whose order quantity is greater than k and the derived value 0 for products whose order quantity is not greater than k, the following query statement may be used: Sum([product].[product].[product], Iif([Measures].[Internet Order Quantity] > k, 1, 0)). In this way the derived values do not need to be transformed; the existing derived values can be accumulated directly to obtain the number of target objects that satisfy the preset condition, which simplifies the counting process.

[0064] According to an embodiment of the present invention, a data processing method for a multi-dimensional data set is provided. The method processes cell data in the multi-dimensional data set in batch in order to speed up user queries. The method may run on a computer processing device. It should be noted that the data processing method provided by the embodiments of the present invention may be performed by the data processing device of the embodiments of the present invention, and that the device may likewise be used to perform the method.

[0065] FIG. 3 is a flowchart of a data processing method for a multi-dimensional data set according to an embodiment of the present invention.

[0066] As shown in FIG. 3, the method includes the following steps S302 to S306:

[0067] Step S302: query in batch the parameters to be queried in the multi-dimensional data set.

[0068] A parameter to be queried is data on which a customer's analysis of a target object depends; for example, it may be the number of orders for a product.

[0069] For example, suppose a multi-dimensional data set records all product types of an enterprise and the number of orders generated for each product type. If a customer wants to know how many products have more orders than a predetermined value, the products serve as the target objects and the number of orders for each product serves as the parameter to be queried. Querying the parameters to be queried yields a query result, from which attribute information of the target objects can be analyzed. Specifically, suppose the data set records a first product, a second product, a third product and a fourth product whose numbers of orders are 30, 26, 15 and 13 respectively. If the query condition is that the number of orders is greater than 20, the query result is the first product and the second product, 2 products in total.

[0070] Querying the parameters to be queried in batch may mean querying multiple parameters at the same time. This avoids the large amount of query time consumed by querying the parameters one by one and achieves a fast query. In the example above, a batch query retrieves the numbers of orders for all four products simultaneously instead of querying them one at a time.
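The four-product example above can be sketched as follows. This is only an illustration under stated assumptions, not the implementation described in this application: the dictionary `cube` and the function `batch_query` are hypothetical stand-ins for a multi-dimensional data set and its query interface.

```python
# Hypothetical stand-in for a multi-dimensional data set:
# target object (product) -> parameter to be queried (number of orders).
cube = {
    "first product": 30,
    "second product": 26,
    "third product": 15,
    "fourth product": 13,
}


def batch_query(cube, targets):
    """Fetch the parameter to be queried for all target objects in one call."""
    return {target: cube[target] for target in targets}


# One batch query returns the order counts of all four products together.
orders = batch_query(cube, list(cube))

# Apply the query condition "number of orders greater than 20".
matching = [product for product, count in orders.items() if count > 20]

print(matching)       # ['first product', 'second product']
print(len(matching))  # 2
```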

[0071] It should be noted that, besides the number of orders, the parameters to be queried may include other parameters on which the customer's analysis of the target objects depends, for example a product's weight, volume, production time or shelf life.

[0072] Step S304: generate in batch derived values for the parameters to be queried.

[0073] A derived value indicates whether a parameter to be queried satisfies the preset condition. The preset condition may be a preset value.

[0074] It should be noted that querying the parameters to be queried determines which of them satisfy the preset condition and which do not, and the generated derived values distinguish the two groups. Taking again the example of checking whether a product's number of orders in the multi-dimensional data set exceeds a preset value, one uniform derived value may be generated for the parameters that satisfy the preset condition and another uniform derived value for the parameters that do not; the two derived values may be different. After the derived values are generated, they may be stored in a derived column.

[0075] In this embodiment of the present invention, the derived values of the parameters to be queried may be generated in batch in any of the following ways:

[0076] Mode 1: the same derived value is generated for all parameters to be queried that satisfy the preset condition, and another derived value is generated for all parameters that do not satisfy it. That is, every parameter to be queried receives a derived value; all satisfying parameters share one value, all non-satisfying parameters share another, and the two values differ.

[0077] Mode 2: the same derived value is generated only for the parameters to be queried that satisfy the preset condition; no derived value is generated for the parameters that do not satisfy it.

[0078] Mode 3: the reverse of Mode 2. The same derived value is generated only for the parameters to be queried that do not satisfy the preset condition; no derived value is generated for the parameters that satisfy it.
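The three modes can be illustrated with a short sketch. The function name `derive`, the use of `None` to represent "no derived value generated", and the concrete values 1 and 0 are assumptions made for illustration; the description only requires that the two derived values differ.

```python
def derive(values, threshold, mode, hit=1, miss=0):
    """Return one entry per parameter to be queried.

    hit  -- derived value for parameters that satisfy the condition (assumed 1)
    miss -- derived value for parameters that do not (assumed 0)
    None -- placeholder meaning that the mode generates no derived value
    """
    derived = []
    for value in values:
        satisfied = value > threshold
        if mode == 1:    # Mode 1: every parameter gets a derived value
            derived.append(hit if satisfied else miss)
        elif mode == 2:  # Mode 2: only satisfying parameters get one
            derived.append(hit if satisfied else None)
        elif mode == 3:  # Mode 3: only non-satisfying parameters get one
            derived.append(None if satisfied else miss)
        else:
            raise ValueError(f"unknown mode: {mode}")
    return derived


orders = [30, 26, 15, 13]
print(derive(orders, 20, mode=1))  # [1, 1, 0, 0]
print(derive(orders, 20, mode=2))  # [1, 1, None, None]
print(derive(orders, 20, mode=3))  # [None, None, 0, 0]
```

Each returned list plays the role of the derived column mentioned above; Modes 2 and 3 leave entries empty, which is where the storage saving would come from.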

[0079] Generating derived values removes the need to re-evaluate, on every query, whether each parameter to be queried satisfies the preset condition, which saves time on subsequent queries and increases query speed.

[0080] Step S306: count in batch, according to the derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0081] The counting may use all of the derived values or only some of them.

[0082] Specifically, the counting may depend on the way in which the derived values were generated in the step above and on the derived values themselves:

[0083] For Mode 1, the target objects corresponding to the parameters to be queried that satisfy the preset condition may be counted in batch using both the derived values for satisfying parameters and the derived values for non-satisfying parameters, or using only the derived values for satisfying parameters. Alternatively, the total number of target objects and the number of target objects whose parameters do not satisfy the preset condition (obtained from the corresponding derived values) may be counted first, and the latter subtracted from the former; the difference is the number of target objects whose parameters satisfy the preset condition.

[0084] For Mode 2, the target objects corresponding to the parameters to be queried that satisfy the preset condition can be counted in batch directly from the derived values for satisfying parameters.

[0085] For Mode 3, the number of target objects whose parameters do not satisfy the preset condition is first counted in batch directly from the derived values for non-satisfying parameters, and the total number of target objects to be analyzed is then counted. The former is subtracted from the latter, and the difference is the number of target objects whose parameters satisfy the preset condition.
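The counting rules for the three modes can be sketched as below. As before, this is an illustration rather than the claimed implementation: `count_satisfying` is a hypothetical name, `None` marks a parameter for which no derived value was generated, and the derived values 1 and 0 are assumed.

```python
def count_satisfying(derived, mode, hit=1, miss=0):
    """Count target objects whose parameter satisfies the preset condition."""
    total = len(derived)  # total number of target objects analyzed
    if mode in (1, 2):
        # Modes 1 and 2: count the "satisfied" derived values directly.
        return sum(1 for d in derived if d == hit)
    if mode == 3:
        # Mode 3: count the "not satisfied" values, then subtract from the total.
        not_satisfied = sum(1 for d in derived if d == miss)
        return total - not_satisfied
    raise ValueError(f"unknown mode: {mode}")


print(count_satisfying([1, 1, 0, 0], mode=1))        # 2
print(count_satisfying([1, 1, None, None], mode=2))  # 2
print(count_satisfying([None, None, 0, 0], mode=3))  # 2
```

For Mode 1 the subtraction route gives the same answer: four target objects in total minus two "not satisfied" values leaves two.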

[0086] In this embodiment of the present invention, the batch (Bulk) query of cells in the multi-dimensional data set may be performed using Multi-Dimensional Expressions (MDX). Specifically, the Bulk query mode may be implemented by combining Iif and Sum. For example, to generate the derived value m for products whose order quantity is greater than k and the derived value n for products whose order quantity is not greater than k, the following query statement may be used: Sum([product].[product].[product], Iif([Measures].[Internet Order Quantity] > k, m, n)).
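The arithmetic performed by the Sum and Iif combination can be mimicked outside MDX. The sketch below is a Python analogue, not MDX itself, and the helper `count_from_sum` is an assumption: it shows one possible way to recover the count from the accumulated sum when m and n are arbitrary distinct integers. If c of N products satisfy the condition, the sum is S = m*c + n*(N - c), so c = (S - n*N) / (m - n).

```python
def sum_iif(quantities, k, m, n):
    """Python analogue of Sum(<set>, Iif(<quantity> > k, m, n))."""
    return sum(m if q > k else n for q in quantities)


def count_from_sum(total_sum, size, m, n):
    """Recover the number of satisfying items from the accumulated sum.

    Derived from S = m*c + n*(size - c); requires m != n.
    """
    if m == n:
        raise ValueError("m and n must differ to distinguish the two groups")
    return (total_sum - n * size) // (m - n)


orders = [30, 26, 15, 13]

# With m = 1 and n = 0 the sum is already the count.
print(sum_iif(orders, 20, 1, 0))  # 2

# With other derived values the sum must be transformed back into a count.
s = sum_iif(orders, 20, 5, 2)
print(s)                                   # 14
print(count_from_sum(s, len(orders), 5, 2))  # 2
```

Choosing m = 1 and n = 0 makes the transformation unnecessary, which is the simplification the description attributes to those values.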

[0087] Through this embodiment of the present invention, querying the parameters to be queried in batch and generating derived values for them avoids the drawback of querying the parameters one by one and counting the query results in real time, which saves query time and increases query speed.

[0088] FIG. 4 is a flowchart of a data processing method for a multi-dimensional data set according to an optional embodiment of the present invention.

[0089] As shown in FIG. 4, the data processing method for a multi-dimensional data set includes the following steps S402 to S408. This embodiment may serve as a preferred implementation of the embodiment shown in FIG. 3.

[0090] Step S402: the same as step S302 of the embodiment shown in FIG. 3; details are not repeated here.

[0091] Step S404: determine in batch whether the parameters to be queried satisfy the preset condition.

[0092] The parameters to be queried are parameters of the target objects. Specifically, the parameters to be queried may be compared in batch with the preset condition to determine whether they satisfy it. If it is determined in batch that a parameter to be queried satisfies the preset condition, a derived value is generated for it in batch; if it does not satisfy the preset condition, no derived value is generated. After the derived values are generated, they may be stored in a derived column.

[0093] Because derived values are generated only for parameters to be queried that satisfy the preset condition, the storage space used for derived values is reduced.

[0094] Step S406: if it is determined in batch that the parameters to be queried satisfy the preset condition, generate in batch first derived values for those parameters.

[0095] If it is determined in batch that the parameters to be queried satisfy the preset condition, first derived values are generated for them in batch. If it is determined in batch that the parameters to be queried do not satisfy the preset condition, no processing is performed.

[0096] Step S408: count in batch, according to the first derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0097] The first derived value may be integer data. Because each parameter to be queried that satisfies the preset condition has one first derived value, the number of target objects corresponding to the parameters that satisfy the preset condition can be obtained by accumulating the first derived values themselves or transformed versions of them.

[0098] Through this embodiment of the present invention, determining in batch whether the parameters to be queried satisfy the preset condition means that derived values are generated only for the parameters that satisfy it, which simplifies the generation of derived values and saves storage space.

[0099] Optionally, in this embodiment of the present invention, after it has been determined in batch whether the parameters to be queried satisfy the preset condition, in addition to generating in batch a first derived value for each parameter to be queried that satisfies the preset condition, the method may further include the step of generating in batch a second derived value for each parameter to be queried that does not satisfy the preset condition. The first derived value and the second derived value are different values. Accordingly, counting in batch, according to the derived values, the target objects corresponding to the parameters to be queried that satisfy the preset condition may include: counting those target objects in batch according to the first derived values and the second derived values. Both the first derived value and the second derived value may be integer data. Because each parameter to be queried that satisfies the preset condition has one first derived value and each parameter that does not satisfy it has one second derived value, the number of target objects corresponding to the parameters that satisfy the preset condition can be obtained by accumulating the first derived values (or transformed versions of them) together with the second derived values (or transformed versions of them).

[0100] Optionally, in this embodiment of the present invention, before the first derived values of the parameters to be queried are generated in batch, the data processing method may further include: generating in batch a first identifier for each parameter to be queried, the first identifier indicating that the parameter satisfies the preset condition. The first identifier may be a keyword, and the keyword may be a character string. After the first identifiers have been generated, generating in batch the first derived values of the parameters to be queried may include: converting the first identifiers in batch into first derived values.

[0101] Further optionally, converting the first identifiers in batch into first derived values may include: converting the first identifiers in batch into 1, that is, converting every first identifier into 1. Counting in batch, according to the first derived values, the parameters to be queried that satisfy the preset condition may then include: accumulating all of the first derived values of 1 to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.

[0102] Similarly, before the second derived values of the parameters to be queried are generated in batch, the data processing method may further include: generating in batch a second identifier for each parameter to be queried, the second identifier indicating that the parameter does not satisfy the preset condition. The second identifier may be a keyword, and the keyword may be a character string. It should be noted that the first identifier and the second identifier are different. After the second identifiers have been generated, generating in batch the second derived values of the parameters to be queried may include: converting the second identifiers in batch into second derived values. Further, converting the second identifiers in batch into second derived values may include: converting the second identifiers in batch into 0, that is, converting every second identifier into 0. Counting in batch, according to the first derived values and the second derived values, the parameters to be queried that satisfy the preset condition may then include: accumulating all of the first derived values of 1 and all of the second derived values of 0 to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition. For example, to generate the derived value 1 for products whose order quantity is greater than k and the derived value 0 for products whose order quantity is not greater than k, the following query statement may be used: Sum([product].[product].[product], Iif([Measures].[Internet Order Quantity] > k, 1, 0)). In this way the derived values do not need to be transformed; the existing derived values can be accumulated directly to obtain the number of target objects that satisfy the preset condition, which simplifies the counting process.
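The two-stage variant, in which string identifiers are generated first and then converted into 1 and 0, can be sketched as follows. The keyword strings `"satisfied"` and `"not_satisfied"` and the function names are hypothetical; the description only states that the identifiers may be character strings and that the two identifiers differ.

```python
FIRST_ID = "satisfied"       # hypothetical keyword for the first identifier
SECOND_ID = "not_satisfied"  # hypothetical keyword for the second identifier


def tag(quantities, k):
    """Generate an identifier for every parameter to be queried in one pass."""
    return [FIRST_ID if q > k else SECOND_ID for q in quantities]


def to_derived(identifiers):
    """Convert identifiers in batch: first identifier -> 1, second -> 0."""
    mapping = {FIRST_ID: 1, SECOND_ID: 0}
    return [mapping[identifier] for identifier in identifiers]


identifiers = tag([30, 26, 15, 13], 20)
derived = to_derived(identifiers)

print(identifiers)   # ['satisfied', 'satisfied', 'not_satisfied', 'not_satisfied']
print(derived)       # [1, 1, 0, 0]
print(sum(derived))  # 2
```

Because the derived values are 1 and 0, the plain sum is already the number of target objects that satisfy the condition.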

[0103] From the above description it can be seen that the present invention can query the cells of a multi-dimensional data set simultaneously in batch. The time consumed by the query then equals the query time of the slowest cell rather than the sum of the query times of all cells, which saves query time and improves query efficiency.
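The timing argument can be illustrated with a small simulation. The description does not say how simultaneous querying is implemented, so the thread pool used here is purely an assumption, as are the simulated latencies and the names `CELLS`, `query_cell`, `query_one_by_one` and `query_in_batch`.

```python
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical cells: product -> (number of orders, simulated latency in seconds).
CELLS = {
    "first product": (30, 0.04),
    "second product": (26, 0.01),
    "third product": (15, 0.03),
    "fourth product": (13, 0.02),
}


def query_cell(product):
    """Simulate querying a single cell, including its latency."""
    orders, latency = CELLS[product]
    time.sleep(latency)
    return product, orders


def query_one_by_one(products):
    """Sequential queries: elapsed time is roughly the sum of all latencies."""
    return dict(query_cell(p) for p in products)


def query_in_batch(products):
    """Concurrent queries: elapsed time is roughly the slowest single latency."""
    with ThreadPoolExecutor(max_workers=len(products)) as pool:
        return dict(pool.map(query_cell, products))


products = list(CELLS)

start = time.perf_counter()
sequential = query_one_by_one(products)
print(f"one by one: {time.perf_counter() - start:.3f} s")

start = time.perf_counter()
batched = query_in_batch(products)
print(f"in batch:   {time.perf_counter() - start:.3f} s")

# Both strategies return the same data; only the elapsed time differs.
print(sequential == batched)  # True
```

The printed timings depend on the machine and are therefore not shown; with these simulated latencies the sequential run should take about the sum (0.10 s) and the batch run about the maximum (0.04 s), plus scheduling overhead.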

[0104] It should be noted that the steps shown in the flowcharts of the drawings may be executed in a computer system, for example as a set of computer-executable instructions, and that, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be executed in a different order.

[0105] Obviously, those skilled in the art should understand that the modules or steps of the present invention described above may be implemented with general-purpose computing devices. They may be concentrated on a single computing device or distributed over a network formed by multiple computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by a computing device; or they may each be made into an individual integrated circuit module, or several of the modules or steps may be made into a single integrated circuit module. The present invention is therefore not limited to any particular combination of hardware and software.

[0106] The above are merely preferred embodiments of the present invention and are not intended to limit it. For those skilled in the art, the present invention may have various modifications and variations. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within the scope of protection of the present invention.

Claims (10)

  1. 1.一种用于多维数据集的数据处理方法,其特征在于,包括: 批量查询多维数据集中的待查询参数,其中,所述待查询参数为目标对象的参数;批量生成所述待查询参数的派生值,其中,所述派生值用于表示所述待查询参数是否满足预设条件;以及根据所述派生值批量统计满足所述预设条件的所述待查询参数对应的所述目标对象。 A data processing method for the cube, characterized in that, comprising: a bulk cube query parameters to be queried, wherein said parameter is a parameter query to be the target object; batch production of said parameters to be queried the derived value, wherein the derived values ​​to be used to represent the query parameter meets a preset condition; and the preset condition of the target object based on the derived value meets the batch counter corresponding to the query parameters to be .
  2. 2.根据权利要求1所述的数据处理方法,其特征在于, 在批量生成所述待查询参数的派生值之前,所述数据处理方法还包括:批量判断所述待查询参数是否满足所述预设条件, 其中,如果批量判断出所述待查询参数满足所述预设条件,则批量生成所述待查询参数的派生值包括:批量生成所述待查询参数的第一派生值, 根据所述派生值批量统计满足所述预设条件的所述待查询参数对应的目标对象包括:根据所述第一派生值批量统计满足所述预设条件的所述待查询参数对应的所述目标对象。 The data processing method according to claim 1, wherein, prior to generating the derived values ​​of the batch parameters to be queried, the data processing method further comprises: determining whether the batch satisfies the query parameters to be pre set conditions, wherein, if the bulk is judged to be a query parameter meets the preset condition, then the bulk of the value to be queried generate derived parameters comprises: generating a first derivative value of the bulk query parameter to be based on the derived batch statistical value satisfies the preset condition to be the target object corresponding to the query parameters comprising: the preset condition of the parameters to be queried according to the target object corresponding to said first derivative value batch counter satisfied.
  3. 3.根据权利要求2所述的数据处理方法,其特征在于, 在批量判断所述待查询参数是否满足所述预设条件之后,如果批量判断出所述待查询参数不满足所述预设条件, 则批量生成所述待查询参数的派生值包括:批量生成所述待查询参数的第二派生值, 根据所述派生值批量统计满足所述预设条件的所述待查询参数对应的目标对象包括:根据所述第一派生值和所述第二派生值批量统计满足所述预设条件的所述待查询参数对应的所述目标对象。 The data processing method according to claim 2, characterized in that, after the batch is determined to be the predetermined condition is satisfied query parameter, if the bulk is judged to be a query parameter does not satisfy the preset condition , the bulk of the derived value to be generated query parameters comprises: generating a second derivative value of the quantities to be queried parameter, said preset condition based on the derived value meets the batch counter corresponding to the parameter to be queried audiences comprising: said predetermined condition of said parameter corresponds to query the target object based on the first derivative value and the second derivative value batch counter satisfied.
  4. 4.根据权利要求2所述的数据处理方法,其特征在于, 在批量生成所述待查询参数的第一派生值之前,所述数据处理方法还包括:批量生成所述待查询参数的用于表示所述待查询参数满足所述预设条件的第一标识, 其中,在批量生成所述待查询参数的用于表示所述待查询参数满足预设条件的第一标识之后,批量生成所述待查询参数的第一派生值包括:批量将所述第一标识转化为所述第一派生值。 A data processing method according to claim 2, wherein, prior to generating a first derivative value of the batch to be queried parameter, said data processing method further comprising: generating the bulk of the query parameters to be used after be representing a first identification of the query parameter meets the preset condition, wherein, in the batch to be generated for a query parameter representing a first identification of the query parameters to be meeting a preset condition, the batch production the first query parameter to be derived value comprises: identifying the first batch into said first derivative value.
  5. 5.根据权利要求4所述的数据处理方法,其特征在于, 批量将所述第一标识转化为所述第一派生值包括:批量将所述第一标识转化为1, 根据所述第一派生值批量统计满足所述预设条件的所述待查询参数包括:累加1,得到满足所述预设条件的所述待查询参数对应的所述目标对象的数量。 The data processing method according to claim 4, characterized in that the first batch identifier into said first derivative value comprises: identifying the first batch into 1 according to the first derived batch statistical value satisfies the preset condition to be the query parameter comprising: accumulating 1, to obtain the number of the target object satisfies the query condition to be corresponding to the preset parameters.
  6. 6.一种用于多维数据集的数据处理装置,其特征在于,包括: 查询单元,用于批量查询多维数据集中的待查询参数,其中,所述待查询参数为目标对象的参数; 第一生成单元,用于批量生成所述待查询参数的派生值,其中,所述派生值用于表示所述待查询参数是否满足预设条件;以及统计单元,用于根据所述派生值批量统计满足所述预设条件的所述待查询参数对应的目标对象。 A data processing apparatus cube, characterized in that, comprising: a query unit configured to query the batch to be cube query parameter, wherein the parameter is a parameter query to be the target object; first generating means for generating the bulk of the query parameter to be derived value, wherein the derived values ​​to be used to represent the query parameter meets a preset condition; and a counting unit, according to the derived statistical value satisfies batch the preset condition to be the target object corresponding to the query parameters.
  7. The data processing apparatus according to claim 6, further comprising: a judging unit configured to determine in batch, before the derived values of the parameters to be queried are generated in batch, whether the parameters to be queried satisfy the preset condition, wherein: the first generating unit is further configured to generate in batch a first derived value of the parameters to be queried if it is determined in batch that the parameters to be queried satisfy the preset condition; and the counting unit is further configured to count in batch, according to the first derived value, the target objects corresponding to the parameters to be queried that satisfy the preset condition.
  8. The data processing apparatus according to claim 7, wherein the first generating unit is further configured to, after it is determined in batch whether the parameters to be queried satisfy the preset condition, generate in batch a second derived value of the parameters to be queried if it is determined in batch that the parameters to be queried do not satisfy the preset condition; and the counting unit is further configured to count in batch, according to the first derived value and the second derived value, the target objects corresponding to the parameters to be queried that satisfy the preset condition.
  9. The data processing apparatus according to claim 7, further comprising: a second generating unit configured to generate in batch, before the first derived value of the parameters to be queried is generated in batch, a first identifier indicating that the parameters to be queried satisfy the preset condition, wherein the first generating unit is further configured to convert in batch the first identifier into the first derived value after the first identifier has been generated in batch.
  10. The data processing apparatus according to claim 9, wherein the first generating unit is further configured to convert in batch the first identifier into 1; and the counting unit is further configured to accumulate the 1s to obtain the number of target objects corresponding to the parameters to be queried that satisfy the preset condition.
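The flow recited in claims 7 to 10 can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the sample data, and the choice of `None` as the second derived value are all assumptions made for illustration. Only the conversion of the first identifier into 1 and the accumulation of the 1s come from claim 10.

```python
from typing import Callable, List, Optional, Sequence

FIRST_DERIVED = 1      # claim 10: the first identifier is converted into 1
SECOND_DERIVED = None  # assumption: stand-in for "condition not satisfied"


def judge_batch(params: Sequence[float],
                condition: Callable[[float], bool]) -> List[bool]:
    """Evaluate the preset condition over the whole batch (hypothetical judging unit)."""
    return [condition(p) for p in params]


def derive_batch(flags: Sequence[bool]) -> List[Optional[int]]:
    """Convert each identifier (flag) into its derived value in one pass."""
    return [FIRST_DERIVED if flag else SECOND_DERIVED for flag in flags]


def count_batch(derived: Sequence[Optional[int]]) -> int:
    """Accumulate the 1s; entries holding the second derived value are skipped."""
    return sum(d for d in derived if d is not None)


def count_matching(params: Sequence[float],
                   condition: Callable[[float], bool]) -> int:
    """Judge, derive, then count, each step applied to the whole batch."""
    return count_batch(derive_batch(judge_batch(params, condition)))


# Hypothetical data: visit counts for five target objects; condition is "more than 2".
visits = [5, 1, 3, 0, 7]
print(count_matching(visits, lambda v: v > 2))  # prints 3
```

Each stage consumes and produces a whole list, which mirrors the batch wording of the claims: the final count is obtained by summing the derived values rather than by testing each target object inside the counting step.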
CN 201410196206 2014-05-09 2014-05-09 Data processing method and device used for multi-dimensional data set CN103942341A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201410196206 CN103942341A (en) 2014-05-09 2014-05-09 Data processing method and device used for multi-dimensional data set

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201410196206 CN103942341A (en) 2014-05-09 2014-05-09 Data processing method and device used for multi-dimensional data set

Publications (1)

Publication Number Publication Date
CN103942341A (en) 2014-07-23

Family

ID=51190009

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201410196206 CN103942341A (en) 2014-05-09 2014-05-09 Data processing method and device used for multi-dimensional data set

Country Status (1)

Country Link
CN (1) CN103942341A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104408169A (en) * 2014-12-09 2015-03-11 北京国双科技有限公司 Multi-dimensional expression language based dimension query method and device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120109878A1 (en) * 2010-10-27 2012-05-03 Crazy Development Debugging system for multidimensional database query expressions on a processing server
CN103605651A (en) * 2013-08-28 2014-02-26 杭州顺网科技股份有限公司 Data processing showing method based on on-line analytical processing (OLAP) multi-dimensional analysis

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HONKCAL: "MDX大部分常用函数" (Most commonly used MDX functions), 《HTTP://WWW.CNBLOGS.COM/HONKCAL/ARCHIVE/2011/10/30/2229595.HTML》 *
MOSHA: "Microsoft OLAP by Mosha Pasumansky:Optimizing MDX aggregation functions", 《HTTP://SQLBLOG.COM/BLOGS/MOSHA/ARCHIVE/2008/10/22/OPTIMIZING-MDX-AGGREGATION-FUNCTIONS.ASPX》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104408169A (en) * 2014-12-09 2015-03-11 北京国双科技有限公司 Multi-dimensional expression language based dimension query method and device
CN104408169B (en) * 2014-12-09 2018-02-02 北京国双科技有限公司 Multi-dimensional expression language based dimension query method and device

Similar Documents

Publication Publication Date Title
US20090225082A1 (en) Generating distributed dataflow graphs
Zhang et al. Multiple aggregations over data streams
US20060225055A1 (en) Method, system, and device for indexing and processing of expressions
Parker et al. Comparing nosql mongodb to an sql db
Park et al. Parallel computation of skyline and reverse skyline queries using mapreduce
Papadakis et al. A blocking framework for entity resolution in highly heterogeneous information spaces
US20110218978A1 (en) Operating on time sequences of data
CN102663116A (en) Multi-dimensional OLAP (On Line Analytical Processing) inquiry processing method facing column storage data warehouse
US20140156633A1 (en) Scalable Multi-Query Optimization for SPARQL
US20080082520A1 (en) Methods and apparatuses for information analysis on shared and distributed computing systems
US20120166447A1 (en) Filtering queried data on data stores
US20130311454A1 (en) Data source analytics
US20080104070A1 (en) Pattern-based filtering of query input
US20130232133A1 (en) Systems and methods for performing a nested join operation
US20150310061A1 (en) Query relationship management
CN102289436A (en) Determining a weight value search word method and apparatus, method and apparatus for generating search results
US20080040317A1 (en) Decomposed query conditions
US20120047124A1 (en) Database query optimizations
US20150220529A1 (en) Split elimination in mapreduce systems
Chaves et al. Towards materialized view selection for distributed databases
Papadakis et al. Eliminating the redundancy in blocking-based entity resolution methods
De Raedt et al. Compressing probabilistic Prolog programs
CN103310011A (en) Analytical method for data query under cluster database system environment
Roschke et al. A flexible and efficient alert correlation platform for distributed ids
Surana et al. An efficient approach to mine periodic-frequent patterns in transactional databases

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination