CN103559413B - A kind of data processing method and device - Google Patents

A kind of data processing method and device Download PDF

Info

Publication number
CN103559413B
CN103559413B CN201310573974.4A CN201310573974A CN103559413B CN 103559413 B CN103559413 B CN 103559413B CN 201310573974 A CN201310573974 A CN 201310573974A CN 103559413 B CN103559413 B CN 103559413B
Authority
CN
China
Prior art keywords
binary number
property parameters
numerical value
binary
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310573974.4A
Other languages
Chinese (zh)
Other versions
CN103559413A (en
Inventor
曹艳白
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd
Original Assignee
BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd filed Critical BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd
Priority to CN201310573974.4A priority Critical patent/CN103559413B/en
Publication of CN103559413A publication Critical patent/CN103559413A/en
Application granted granted Critical
Publication of CN103559413B publication Critical patent/CN103559413B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention provides a kind of data processing method and device, the method includes: obtain the property parameters that data to be analyzed have;The property parameters having according to data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of this binary number is identical with the number of the property parameters that data to be analyzed have, and everybody of this binary number represents a property parameters of these data to be analyzed respectively;Generate all described binary number with this figure place, and from the multiple binary numbers generated, choose and include the binary number that predetermined number position is the first appointment numerical value;For each binary number selected, everybody the represented property parameters composition property parameters in binary number being the first appointment numerical value is combined;Based on the property parameters combination obtained, treat analytical data and carry out the statistics of predetermined number dimension.The method can improve the precision to data analysis statistical.

Description

A kind of data processing method and device
Technical field
The present invention relates to technical field of data processing, a kind of data processing method and device.
Background technology
In data statistics, it is often necessary to relate to re-scheduling and calculate.It is exactly to unite from data to be counted that so-called re-scheduling calculates Count out the data record of specified type, to get rid of the data record being not belonging to this specified type.Such as, with data to be counted for certain As a example by the sales data in individual supermarket, then this sales data includes many data record, and in every data record, tool contains and sells Sell the attribute informations such as the trade name of commodity, production firm, selling time, if the commodity A selling this month carries out re-scheduling meter After calculation, the most only can count selling time is this month, and the data record of trade name commodity A, and other data records Then can be excluded.
In actual applications, data to be analyzed typically have multiple property parameters, it may be necessary to be based respectively on multiple difference The incompatible re-scheduling carrying out multiple dimension of set of properties calculate, so, then need the number of dimensions artificially according to required statistics, list Possible combinations of attributes situation, is based respectively on possible property parameters combination the most again and carries out re-scheduling calculating.
As being still introduced with above example, this sales data correspond to trade name, production firm, selling time this Individual three property parameters, these three property parameters can be combined into 8 kinds of different dimensions combinations, i.e. these 8 kinds possible dimension groups Conjunction comprises a three dimensionality combination, three two-dimensions combinations, three dimension combinations and a zero dimension degree combination.Wherein, this three The three-dimensional arrangement being combined as being combined by trade name, production firm and selling time these three property parameters of dimension;These three The combination of two dimensions is respectively as follows: the two-dimensional combination of the two-dimensional combination of trade name and production firm, trade name and selling time, Production firm and the two-dimensional combination of selling time;The combination of these three dimension is trade name, production firm and pin the most respectively Selling any one property parameters in the time is an one-dimensional combination;Zero dimension degree is exactly not consider that arbitrary property parameters combines.On The commodity A selling this month that face is mentioned carries out re-scheduling calculating and is actually based on sale title and selling time the two attribute The re-scheduling of a kind of two-dimensions of parameter combination calculates.
When the quantity of the property parameters that data have is n, the total quantity of property parameters based on different dimensions combination is then It it is the n power of 2.Along with the increase of data complexity, the quantity of the property parameters that data have increases the most accordingly.When data have Property parameters quantity bigger time, possible increases the most accordingly, so, has enumerated possible dimension and combined by the way of artificial Through becoming impossible, and artificially enumerate the combination the most often occurring omitting some property parameters so that the dimension group obtained Close not comprehensive, had influence on re-scheduling calculating, and then reduced the precision of data statistic analysis.
Summary of the invention
In view of this, the present invention provides a kind of data processing method and device, to improve the attribute utilizing data to be analyzed The accuracy of parameter determination dimension combination, and then improve the precision of data statistic analysis.
For achieving the above object, the present invention provides following technical scheme: a kind of data processing method, including:
Obtain the property parameters that data to be analyzed have;
The property parameters having according to described data to be analyzed, determines the figure place of binary number to be generated, wherein, described The figure place of binary number is identical with the number of the property parameters that described data to be analyzed have, and every point of described binary number Do not represent a property parameters of described data to be analyzed;
Generate all described binary number with described figure place, and from the plurality of binary number generated, choose Including the binary number that predetermined number position is the first appointment numerical value, wherein, described first appointment numerical value is 0 or 1;
For each described binary number selected, will described binary number be the every of described first appointment numerical value Represented property parameters composition property parameters combination;
Based on the described property parameters combination obtained, described data to be analyzed are carried out the system of described predetermined number dimension Meter.
Preferably, described generation has all described binary number of described figure place, and enters from the plurality of two generated In number processed, choose and include the binary number that predetermined number position is the first appointment numerical value, including:
A: generate and have described figure place, and every initial binary number being described second appointment numerical value, at the beginning of described Beginning binary number is as the first binary number, and wherein, the second appointment numerical value is 0 or 1;
B: according to preset rules and described first binary number, generates the second binary number, described second binary number with The absolute value of described first binary difference is 1;
C: if having the numerical value on predetermined number position in described second binary number is the first appointment numerical value, then select institute State the second binary number;
C: judge that everybody of described second binary number is whether for being the 3rd appointment numerical value, if it is, perform generation The operation of described property parameters combination;If it is not, then using current described second binary number as described first binary number, And return described step B;
Wherein, described 3rd appointment numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Preferably, when described second to specify numerical value be 0, described generation has a described figure place, and every is described the The two initial binary numbers specifying numerical value, including:
Generation has a described figure place, and every be 0 initial binary number
Described according to preset rules with described first binary number, generate the second binary number, described second binary number It is 1 with the absolute value of described first binary difference, including:
The lowest order of described first binary number is added one, obtains the second binary number.
Preferably, when described second to specify numerical value be 1, described generation has a described figure place, and every is described the The two initial binary numbers specifying numerical value, including:
Generation has a described figure place, and every be 1 initial binary;
Described according to preset rules with described first binary number, generate the second binary number, described second binary number It is 1 with the absolute value of described first binary difference, including:
The lowest order of described first binary number is subtracted one, obtains the second binary number.
Preferably, described for each described binary number selected, will described binary number be described first finger Everybody represented property parameters composition property parameters of fixed number value combines, including:
For each described binary number selected, according to the rule of the true value in first appointment numerical value correspondence boolean's array Then, everybody value of described binary number is converted to the element value in boolean's array successively, so that in described boolean's array Each element value respectively corresponding described property parameters;
Extract the property parameters that in described boolean's array, true value is corresponding, the property parameters composition property parameters that will extract Combination.
On the other hand, present invention also offers a kind of data processing equipment, including:
Acquiring unit, for obtaining the property parameters that data to be analyzed have;
Relation determination unit, for the property parameters having according to described data to be analyzed, determines binary system to be generated The figure place of number, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and institute State binary number everybody represent a property parameters of described data to be analyzed respectively;
Binary number processing unit, for the figure place determined according to described relation determination unit, generates and has institute's rheme The all described binary number of number, and from the plurality of binary number generated, chooses that to include predetermined number position be first Specifying the binary number of numerical value, wherein, described first appointment numerical value is 0 or 1;
Property parameters assembled unit, for each described binary system selected for described binary number processing unit Number, will be the described first every represented property parameters composition property parameters combination specifying numerical value in described binary number;
Computing unit, for the described property parameters combination obtained based on described property parameters assembled unit, treats described Analytical data carries out the statistics of described predetermined number dimension.
Preferably, described binary number processing unit, including:
Initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place, And every initial binary number being described second appointment numerical value, using described initial binary number as the first binary number, Wherein, the second appointment numerical value is 0 or 1;
Mediant signal generating unit, for according to preset rules and described first binary number, generates the second binary number, institute The absolute value stating the second binary number and described first binary difference is 1;
Binary number chooses unit, if in described second binary number that number signal generating unit processed generates in the middle of described Having the numerical value on predetermined number position is described first appointment numerical value, then select described second binary number;
Judging unit, for everybody judging described second binary number that described mediant signal generating unit generates be whether It is the 3rd appointment numerical value, performs described property parameters assembled unit if it is, trigger;If it is not, then by described in current Second binary number is as described first binary number, and returns the described intermediate binary number signal generating unit of execution;Wherein, described 3rd appointment numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Preferably, when described second appointment numerical value is 0, described initial number signal generating unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having Figure place, and every be 0 initial binary number
Described mediant signal generating unit, including:
First mediant signal generating unit, for adding one by the lowest order of described first binary number, obtains the second binary system Number.
Preferably, when described second appointment numerical value is 1, described initial number signal generating unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having Figure place, and every be 1 initial binary;
Described mediant signal generating unit, including:
First mediant signal generating unit, for subtracting one by the lowest order of described first binary number, obtains the second binary system Number.
Preferably, described property parameters assembled unit, including:
Boolean's array converting unit, for for each described binary number selected, specifying numerical value pair according to first Answer the rule of true value in boolean's array, everybody value of described binary number is converted to the element in boolean's array successively Value, so that the most corresponding described property parameters of each element value in described boolean's array;
Parameter group zygote unit, for extracting the property parameters that in described boolean's array, true value is corresponding, by extract Property parameters composition property parameters combination.
Understand via above-mentioned technical scheme, the quantity of the property parameters that the present invention has according to these data to be analyzed, raw Become the binary number of figure place identical with the quantity of this property parameters, the binary number of generation everybody represent this number to be analyzed respectively According to a property parameters, owing to binary number is made up of 0 and 1, so can set 0 or 1 is the first appointment numerical value, and recognizes For binary number is the first appointment numerical value position corresponding to property parameters participate in statistical computation, so, the binary system of generation Number actually includes property parameters in these data to be analyzed and carries out each combining form of combination in any.Dimension when statistical analysis When the number of degrees are predetermined number, select, from the binary number generated, the binary number that predetermined number position is the first appointment numerical value After, for each binary number selected, will this binary number be every represented attribute of this first appointment numerical value Parameter composition property parameters combination, just can be met all of property parameters combination of this number of dimensions, it is to avoid omit full The combination of property parameters of this number of dimensions of foot, and then improve and treat, based on this number of dimensions, the precision that analytical data carries out adding up.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing In having technology to describe, the required accompanying drawing used is briefly described, it should be apparent that, the accompanying drawing in describing below is only this Inventive embodiment, for those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to according to The accompanying drawing provided obtains other accompanying drawing.
Fig. 1 shows the schematic flow sheet of the present invention one embodiment of a kind of data processing method;
Fig. 2 shows the schematic flow sheet of the present invention another embodiment of a kind of data processing method;
Fig. 3 shows the structural representation of the present invention one embodiment of a kind of data processing equipment;
Fig. 4 shows the binary number processing unit one composition structural representation of a kind of data processing equipment of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Describe, it is clear that described embodiment is only a part of embodiment of the present invention rather than whole embodiments wholely.Based on Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under not making creative work premise Embodiment, broadly falls into the scope of protection of the invention.
See Fig. 1, it is shown that the schematic flow sheet of the present invention one embodiment of a kind of data processing method, the side of the present invention Method can apply to calculate in node arbitrarily, and the method for the present embodiment may include that
S101, obtains the property parameters that data to be analyzed have.
Wherein, these data to be analyzed can be to need to carry out many data of statistical analysis, such as merchandise sales record, network Bandwidth uses record etc..
The property parameters of these data to be analyzed is the ginseng describing this object, classification or feature representated by data to be analyzed Number.The property parameters of data as to be analyzed in this can be each in name of the information as indicated in these data to be analyzed, data to be analyzed The generation time etc. of data.When being communication use data such as data to be analyzed, property parameters can include in data to be analyzed Telephone number corresponding to each bar call record, operator, type of call, the parameter such as duration of call.
It is understood that in data statistics field, the property parameters of data to be analyzed is referred to as data to be analyzed Dimension, a property parameters of data to be analyzed is a dimension of these data to be analyzed.Carry out treating analytical data During statistics, the one or more dimensions that can choose these data to be analyzed as required carry out statistical analysis.For example, it is possible to based on This communication is used data to carry out statistical analysis by telephone number and two dimensions of operator.
S102, the property parameters having according to these data to be analyzed, determine the figure place of binary number to be generated.
Wherein, the figure place of this binary number is identical with the number of the property parameters that these data to be analyzed have, to be generated Everybody of binary number represents a property parameters of these data to be analyzed respectively.
The figure place needing the binary number generated is determined by the quantity of the property parameters of these data to be analyzed, and needs to generate Every of this binary number all to should a property parameters of data to be analyzed, and different property parameters is to should be to be generated The not coordination of the binary number become.It is to say, set up binary everybody and these data to be analyzed that have that these needs generate Corresponding relation between property parameters.
As, when the property parameters that these data to be analyzed have is 3, then may determine that binary number to be generated is 3 Binary number, and this binary number every represents a property parameters in these 3 property parameters, and binary number to be generated The property parameters that characterized of not coordination different.Assume the property parameters of these data to be analyzed be respectively telephone number, operator, During type of call, then can be need in the binary number that generates from high to low first to should telephone number, second Corresponding operator, the 3rd corresponding type of call.
S103, generates all binary numbers with this figure place, and from the multiple binary numbers generated, chooses and include Predetermined number position is the binary number of the first appointment numerical value.
According to step 102 is determined need generate binary number figure place after, then can generate and there is corresponding figure place All possible binary number.Figure place as determined is 3, then need to generate all of triad number, i.e. generate Binary number includes 000,001,010,011,100,101,110 and 111.
Due to everybody corresponding property parameters of the most prespecified binary number to be generated, therefore, For any one binary number generated, every of each binary number is all to should an attribute of data to be analyzed Parameter.
Wherein, this predetermined number can set as required, typically carries out required number of dimensions treating analytical data Amount determines.As, when carrying out data analysis, needing to carry out the analysis of 3 dimensions, then this predetermined number can be set as 3.Its In, dimension describes and a data object is analyzed required number of parameters.If desired for carrying out three dimensionality analysis, then need Determine all of property parameters combination being made up of three property parameters of these data to be analyzed.
Owing to binary number is made up of 0 or 1 number, in this binary number, everybody is only 0 or 1, therefore, and should First appointment numerical value is 0 or 1.This first appointment numerical value can be set as a numerical value in 0 and 1, specifically can be as required Set.
In the embodiment of the present application, this first specify numerical value represent participate in statistical computation, therefore, if in binary number certain The numerical value of position is this first appointment numerical value, then it represents that this corresponding property parameters participates in statistical computation.Accordingly, binary system In certain in numerical value be not this first specify numerical value, then this corresponding property parameters is not involved in treating analytical data In statistical computation.
If be the first appointment numerical value due to the numerical value on binary number certain, then it represents that this institute of this binary number Corresponding property parameters participates in statistical computation, accordingly, it is determined that go out, these data to be analyzed is carried out the number of dimensions needed for statistical analysis, After i.e. determining that many small number the property parameters of needs are combined, in order to which there is meet the attribute ginseng of this number of dimensions The combination of number, then can choose and include the binary system that predetermined number position is the first appointment numerical value from the binary number generated Number.
Having the numerical value on predetermined number position in the binary number selected is the first appointment numerical value, and selects for any one For the binary number taken out, this binary number is everybody corresponding property parameters of the first appointment numerical value is grouped together The property parameters combination obtained, is the combination of a kind of property parameters meeting this number of dimensions determined.
Such as, with data to be analyzed, there are 3 property parameters and be introduced, the binary number of generation includes 000,001, 010,011,100,101,110 and 111, specify numerical value for 1 with first, as a example by needing to carry out the analysis of 2 dimensions, then need choosing Taking out and have the binary number that numerical value is 1 on two, the binary number selected includes 011,101 and 110.
S104, for each binary number selected, by this binary number for this first every institute specifying numerical value The property parameters composition property parameters combination represented.
In order to according to the binary number selected, determine the property parameters that can participate in calculating that this binary number is corresponding Combination, then need to determine respectively everybody represented property parameters that this binary number is the first appointment numerical value, then will The property parameters determined is combined, and obtains property parameters combination.Owing to selecting multiple binary number, each binary system Number the most corresponding property parameters combination, then can obtain the combination of multiple property parameters.
Such as, still with in previously described binary number from high to low first to should telephone number, second pair Answering operator, the 3rd corresponding type of call, then as a example by needing to generate 3 bits, it will again be assumed that treats analytical data and carries out 2 The analysis of individual dimension, and first to specify numerical value be 1, then from generating binary number, the binary number selected is 011,101 and 110, and binary number 011 is 1 everybody be respectively second and the 3rd, property parameters corresponding to second is for running Business, the 3rd corresponding property parameters is type of call, and therefore the property parameters of this binary number 011 correspondence is combined as operator With the combination of type of call, i.e. based on the two dimension, these data to be analyzed are carried out statistical analysis.Accordingly, binary number The telephone number of 101 correspondences and the combination of type of call the two property parameters, the corresponding telephone number of binary number 110 and operation The combination of business's the two property parameters.
S105, based on the property parameters combination obtained, treats analytical data and carries out the statistics of predetermined number dimension.
After obtaining the combination of all of property parameters, the combination of each property parameters can be based respectively on and carry out the respective dimension number of degrees Statistical analysis.If predetermined number is 2, and certain property parameters is combined as comprising the combination of telephone number and operator, then may be used Carry out the statistics of two-dimensions treating analytical data based on the two property parameters.
Wherein, combination based on the property parameters obtained, treat analytical data and carry out the statistics of respective dimensions, with existing Mode is similar, does not repeats them here.
It is understood that in actual applications, can need to be based respectively on multiple different dimension and carry out data analysis, Therefore, this predetermined number can set multiple.For example, it is desired to during the analysis of 2 dimensions and 3-dimensional degree, then predetermined number can be 2 Hes 3.But for each predetermined number, choosing binary number, and determining that the anabolic process of property parameters is all identical.
In the present embodiment, according to the quantity of the property parameters that these data to be analyzed have, generate and this property parameters The binary number of the identical figure place of quantity, the binary number of generation everybody represent a property parameters of these data to be analyzed respectively, Owing to binary number is made up of 0 and 1, so can set 0 or 1 is the first appointment numerical value, and thinks in binary number to be One specifies the property parameters corresponding to position of numerical value to participate in statistical computation, and so, the binary number of generation actually includes this In data to be analyzed, property parameters carries out each combining form of combination in any.It is predetermined number when needing the number of dimensions carried out Time, after selecting, from the binary number generated, the binary number that predetermined number position is the first appointment numerical value, for select Each binary number, forms property parameters by everybody the represented property parameters in this binary number being this first appointment numerical value Combination, just can be met all of property parameters combination of this number of dimensions, it also avoid and omit the genus meeting this number of dimensions The combination of property parameter, and then improve and treat analytical data and carry out the precision of statistical computation.
Meanwhile, the method is to utilize computer to utilize binary mode to determine the property parameters combination meeting this number of dimensions Provide possibility, and then can improve and treat, based on this number of dimensions, the convenience that analytical data carries out adding up.
See Fig. 2, it is shown that the schematic flow sheet of the present invention another embodiment of a kind of data processing method, the present embodiment Method may include that
S201, obtains the property parameters that data to be analyzed have.
S202, the property parameters having according to these data to be analyzed, determine the figure place of binary number to be generated.
Wherein, the figure place of this binary number is identical with the number of the property parameters that these data to be analyzed have, to be generated It is binary that everybody represents a property parameters of these data to be analyzed respectively.
S203, generates and has figure place, and every initial binary number being the second appointment numerical value, by this initial binary Number is as the first binary number.
Wherein, the second appointment numerical value is 0 or 1, and one in only 0 and 1 determines numerical value.
When this second appointment numerical value difference, the initial binary number of generation is the most different.If this second appointment numerical value is 1 Time, everybody of this initial binary number is 1;As this second to specify numerical value be 0 time, everybody of this initial binary number is 0.
Such as, these data to be analyzed have 3 property parameters, and when this second appointment numerical value is 0, then initial two generated System number is 000.
S204, according to preset rules and the first binary number, generates the second binary number.
Wherein, the second binary number of generation is 1 with the absolute value of this first binary difference.
Wherein, being 0 or 1 according to this second appointment numerical value, the mode generating this second binary number is the most different, but is both needed to Ensure that this second binary number is different from this first binary number, and this second binary number that current time generates is before The binary number not generated.
As, when this second appointment numerical value is 0, then the mode generating this second binary number is: by this first binary system The lowest order of number adds one, obtains the second binary number.Wherein, such as, when being 000 with initial binary number, if current time 000 is the first binary number, then the second binary number is 001.
And for example, when this second appointment numerical value is 1, then the mode generating this second binary number is: enter the two or two The lowest order of number processed subtracts one, obtains the second binary number.Such as, when this first binary number is initial binary number 111, then give birth to The second binary number become is 110.
S205, if having the numerical value on predetermined number position in this second binary number is this first appointment numerical value, then chooses Go out this second binary number.
Often generate second binary number, be required to judge whether this second binary number has the number on predetermined number position Value is the first appointment numerical value, if having the numerical value on predetermined number position in this second binary number is the first appointment numerical value, then protects Deposit this second binary number;If the numerical value on the position of Non-precondition quantity is the first appointment numerical value in this second binary number, The most directly carry out step 206.
Certainly, if needing to carry out statistical analysis based on multiple dimension, then multiple predetermined number can be set, therefore, as Really this second binary number is that the position of the first numerical value quantity reaches any one predetermined number, all selects current the two or two and enters Number processed.
Wherein, this first appointment numerical value is 0 or 1.
S206, it is judged that whether everybody of this second binary number be for being the 3rd appointment numerical value, if it is, perform step 208;If it does not, perform step 207.
S207, using current described second binary number as described first binary number, and returns described step 204.
Wherein, the 3rd appointment numerical value is a numerical value in 0 and 1, and the 3rd specifies numerical value to be different from the second appointment Numerical value.
When everybody in this second binary number is the 3rd appointment numerical value, then illustrate to have generated all this figure places Binary number, if the binary number that there is also this figure place is not generated, then this second binary number is entered as the one or two Number processed, returns this step 204, continues to generate next second binary number, until having binary number all quilts of this figure place Generate.
In the present embodiment, with one as step value, progressively generate the second binary number, e.g., use the side of increasing or decreasing Formula, is increased or decreased the numerical value of binary number, such that it is able to obtain the binary number of this figure places all, it is to avoid binary number something lost Leakage or repetition.
S208, for each binary number selected, by this binary number for this first every institute specifying numerical value The property parameters composition property parameters combination represented.
S209, based on the property parameters combination obtained, treats analytical data and carries out the statistics of predetermined number dimension.
Wherein, this step 207 is similar to the related introduction of preceding embodiment with the operation of step 208, does not repeats them here.
It is understood that the operation of this step 205 and step 206 is not limited to shown in Fig. 2, this step 205 and step 206 Can also carry out simultaneously.
In the present embodiment, the numerical value that this first appointment numerical value is similarly in 0 and 1.This first appointment numeric representation When any one corresponding property parameters of binary number participates in statistical computation, the numerical value in corresponding positions.As, if first refers to When fixed number value is 1, if binary number is 000, then it represents that three corresponding property parameters of this binary number are all not involved in system Meter is analyzed, the property parameters combination being combined as 0 dimension of the property parameters that this binary number is corresponding;If binary number is 011, then it represents that rear two corresponding property parameters of this binary number can participate in statistical analysis, and rear the two of this binary number The property parameters of position correspondence is combined into the property parameters combination of 2 dimensions.
This second appointment numerical value is used for limiting initial binary number, and generates subsequent binary based on initial binary number The mode of number.Therefore, first specifies numerical value and second to specify the meaning difference of numerical value, refers to setting the first appointment numerical value and second During fixed number value, this first appointment numerical value and second can be set and specify numerical value identical, as being all 0;This first finger can also be set Fixed number value specifies numerical value different with second, and if this second appointment numerical value is 0, and this second appointment numerical value is 1.
For the ease of understanding the scheme of the present embodiment, below as a example by data to be analyzed are for merchandise sales record, it is assumed that business The property parameters that product sales record includes has trade name, manufacturer and selling time, and to need to carry out the analysis of 2 dimensions As a example by be described in detail.Assume that the first appointment numerical value is 1 to be introduced, say, that binary number numerical value on certain is 1 represents that this corresponding property parameters participates in statistical analysis.Setting second specifies numerical value as 0, and the 3rd appointment numerical value is 1.Then generating this initial binary number is 000, using this 000 as first binary number perform step 204, this by 000 minimum Position adds one, obtains the second binary number 001.This has the numerical value on 1 in 001 is 1, then preserve out this 001.Simultaneously, it is judged that This 001 everybody be not all of being 1, then this is 001 as the first binary number, returns and performs this step 204, by 001 minimum Position adds 1, obtains the second binary number 010, the like, until three of the second binary number 111,111 generated are 1, Then perform subsequent step 207.
Finally, the binary number selected has 011,101 and 110, wherein, 011 corresponding manufacturer and the two of selling time Dimension combines, 101 corresponding goods titles and two dimension combinations of selling time, 110 corresponding goods titles and the two of manufacturer The combination of individual dimension.So, combine based on these three two dimension, corresponding statistical analysis can be carried out respectively.
Being only used in embodiments of the present invention describe conveniently, the property parameters having with data to be analyzed is 3 or 4 As a example by be described, but it is understood that, the property parameters that data the most to be analyzed have can have very Many, when the property parameters that data to be analyzed have is the biggest, the method for the application present invention more embodies it and avoids omitting attribute Parameter combines, and then improves the advantage such as precision of statistical analysis.
Further, in one embodiment of any of the above, for the ease of judging which position in the binary number selected Corresponding property parameters participates in statistical analysis, after selecting binary number, for each binary number selected, according to First rule specifying the true value in numerical value correspondence boolean's array, is converted to Bolean number successively by everybody value of this binary number Element value in group, so that the most corresponding property parameters of each element value in boolean's array.
It is to say, this binary number is the value in the boolean value corresponding to position of the first appointment numerical value be true value, And for this, first to specify the value in the boolean value corresponding to position of numerical value be false in this binary number, wherein, this binary number In everybody corresponding property parameters, be corresponding in turn to each element of this boolean's array.Such as, the first appointment numerical value is 1, and two enter When number processed is 101, then first element value during the highest order of this binary number is converted to boolean's array is ture, this binary system Second element value that the second of number is converted in boolean's array is false, and the value of the lowest order of this binary number is converted to The 3rd element value in boolean value is ture, therefore, boolean's array that this binary number is changed out be ture, False, ture}, wherein, the property parameters that in this boolean's array, first element is corresponding is the highest in this binary number 101 The property parameters that position is corresponding, the second that property parameters is this binary number 101 that in this boolean's array, second element is corresponding Corresponding property parameters, in this boolean's array, the 3rd property parameters that element is corresponding is the lowest order that this binary system counts 101 Corresponding property parameters.
After obtaining boolean's array, extract the property parameters that true value ture in boolean value is corresponding, the attribute that will extract Parameter composition property parameters combination.Such as, boolean's array is { when ture, false, ture}, to extract this Bolean number the most respectively In group, extract two property parameters are combined into property parameters by first element and the 3rd property parameters that element is corresponding Combination.
The data processing method of the corresponding present invention, present invention also offers a kind of data processing equipment, sees Fig. 3, it is shown that The structural representation of the present invention one embodiment of a kind of data processing equipment, the device of the present embodiment may include that acquisition is single Unit 301, relation determination unit 302, binary number processing unit 303, property parameters assembled unit 304 and computing unit 305.
Wherein, acquiring unit 301, for obtaining the property parameters that data to be analyzed have;
Relation determination unit 302, for the property parameters having according to described data to be analyzed, determines that to be generated two enter The figure place of number processed, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and Everybody of described binary number represents a property parameters of described data to be analyzed respectively;
Binary number processing unit 303, for the figure place determined according to described relation determination unit, generates described in having The all described binary number of figure place, and from the plurality of binary number generated, chooses that to include predetermined number position be the One binary number specifying numerical value, wherein, described first appointment numerical value is 0 or 1;
Property parameters assembled unit 304, each described two for selecting for described binary number processing unit enter Number processed, will be the described first every represented property parameters composition property parameters group specifying numerical value in described binary number Close;
Computing unit 305, for the described property parameters combination obtained based on described property parameters assembled unit, to described Data to be analyzed carry out the statistics of described predetermined number dimension.
In the present embodiment, the quantity of the property parameters that relation determination unit has according to data to be analyzed, determine and treat The figure place of binary number generated, and binary number to be generated everybody represent an attribute ginseng of these data to be analyzed respectively Number, owing to binary number is made up of 0 and 1, so can set 0 or 1 is the first appointment numerical value, and thinks in binary number and be First specifies the property parameters corresponding to position of numerical value to participate in statistical computation, and so, this binary number processing unit is according to this pass System determines the figure place that unit is determined, all binary numbers generating corresponding figure place actually include in these data to be analyzed Property parameters carries out each combining form of combination in any.
And this binary number processing unit is from the binary number generated, and to select predetermined number position be the first appointment numerical value Binary number after, for each binary number selected, by this binary number for this first every institute specifying numerical value The property parameters composition property parameters combination represented, just can be met and all of genus of this predetermined number identical dimensional number Property parameter combination, it is to avoid omit the combination of the property parameters meeting this number of dimensions, and then improve and treat based on this number of dimensions Analytical data carries out the precision of statistical computation.
Wherein, binary number processing unit generates according to the figure place that described relation determination unit is determined and has this figure place The mode of binary number can have multiple.See Fig. 4, it is shown that in a kind of data processing equipment of the present invention, binary number processes The structural representation of a kind of implementation of unit, in the present embodiment, this binary number processing unit 303, including:
Initial number signal generating unit 3031, for the figure place determined according to described relation determination unit, generates described in having Figure place, and every initial binary number being described second appointment numerical value, enter described initial binary number as the one or two Number processed, wherein, the second appointment numerical value is 0 or 1;
Mediant signal generating unit 3032, for according to preset rules and described first binary number, generates the second binary system Number, described second binary number is 1 with the absolute value of described first binary difference;
Binary number chooses unit 3033, if described second binary number generated for described mediant signal generating unit In have the numerical value on predetermined number position be described first specify numerical value, then select described second binary number;
Judging unit 3034, for everybody judging described second binary number that described mediant signal generating unit generates be No for being the 3rd appointment numerical value, perform described property parameters assembled unit if it is, trigger;If it is not, then by current Described second binary number is as described first binary number, and returns the described mediant signal generating unit of execution;Wherein, described Three appointment numerical value are 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Wherein, this first appointment numerical value can specify numerical value identical with second, it is also possible to is and the 3rd appointment numerical value phase With.
Wherein, this judging unit 3034 can be chosen unit at this binary number and determines whether to choose this second binary system After number, then perform to judge whether everybody of this this second binary number is the 3rd operation specifying numerical value.This judging unit Can also be to choose while unit carries out selection operation to this second binary number at this binary number, perform corresponding to judge Operation.
When this second appointment numerical value is 0, this initial number signal generating unit, may include that
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having Figure place, and every be 0 initial binary number
Accordingly, this mediant signal generating unit, may include that
First mediant signal generating unit, for adding one by the lowest order of described first binary number, obtains the second binary system Number.
When described second appointment numerical value is 1, described initial number signal generating unit, may include that
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having Figure place, and every be 1 initial binary;
Described mediant signal generating unit, including:
First mediant signal generating unit, for subtracting one by the lowest order of described first binary number, obtains the second binary system Number.
Further, in one embodiment of any of the above, this property parameters assembled unit, may include that
Boolean's array converting unit, for for each described binary number selected, specifying numerical value pair according to first Answer the rule of true value in boolean's array, everybody value of described binary number is converted to the element in boolean's array successively Value, so that the most corresponding described property parameters of each element value in described boolean's array;
Parameter group zygote unit, for extracting the property parameters that in described boolean's array, true value is corresponding, by extract Property parameters composition property parameters combination.
In this specification, each embodiment uses the mode gone forward one by one to describe, and what each embodiment stressed is and other The difference of embodiment, between each embodiment, identical similar portion sees mutually.For device disclosed in embodiment For, owing to it corresponds to the method disclosed in Example, so describe is fairly simple, relevant part sees method part and says Bright.
Described above to the disclosed embodiments, makes professional and technical personnel in the field be capable of or uses the present invention. Multiple amendment to these embodiments will be apparent from for those skilled in the art, as defined herein General Principle can realize without departing from the spirit or scope of the present invention in other embodiments.Therefore, the present invention It is not intended to be limited to the embodiments shown herein, and is to fit to and principles disclosed herein and features of novelty phase one The widest scope caused.

Claims (10)

1. a data processing method, it is characterised in that including:
Obtain the property parameters that the data to be analyzed of pending statistical analysis have;
The property parameters having according to described data to be analyzed, determines the figure place of binary number to be generated, and wherein, described two enter The figure place of number processed is identical with the number of the property parameters that described data to be analyzed have, and every table respectively of described binary number Show a property parameters of described data to be analyzed;
Generate and there is all described binary number of described figure place, and from the multiple binary numbers generated, choose include pre- If the binary number that number of bits is the first appointment numerical value, wherein, described first appointment numerical value is 0 or 1, and described predetermined number represents The dimension that described data to be analyzed are analyzed;
For each described binary number selected, by described binary number for the described first every institute table specifying numerical value The property parameters composition property parameters combination shown, wherein, in described binary number represented by the described first position specifying numerical value Property parameters be participate in statistical computation property parameters;
Based on the described property parameters combination obtained, described data to be analyzed are carried out the statistics of described predetermined number dimension.
Method the most according to claim 1, it is characterised in that described generation has all described binary system of described figure place Number, and from the plurality of binary number generated, choose and include the binary number that predetermined number position is the first appointment numerical value, Including:
A: generate and there is described figure place, and every initial binary number being the second appointment numerical value, by described initial binary Number is as the first binary number, and wherein, the second appointment numerical value is 0 or 1;
B: according to preset rules and described first binary number, generates the second binary number, and described second binary number is with described The absolute value of first binary difference is 1;
C: if having the numerical value on predetermined number position in described second binary number is the first appointment numerical value, then select described Two binary numbers;
C: judge everybody of described second binary number whether for being the 3rd appointment numerical value, if it is, perform to generate described The operation of property parameters combination;If it is not, then using current described second binary number as described first binary number, and return Return described step B;
Wherein, described 3rd appointment numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Method the most according to claim 2, it is characterised in that when described second appointment numerical value is 0, described generation has Described figure place, and every initial binary number being described second appointment numerical value, including:
Generation has a described figure place, and every be 0 initial binary number
Described according to preset rules with described first binary number, generate the second binary number, described second binary number and institute The absolute value stating first binary difference is 1, including:
The lowest order of described first binary number is added one, obtains the second binary number.
Method the most according to claim 2, it is characterised in that when described second appointment numerical value is 1, described generation has Described figure place, and every initial binary number being described second appointment numerical value, including:
Generation has a described figure place, and every be 1 initial binary;
Described according to preset rules with described first binary number, generate the second binary number, described second binary number and institute The absolute value stating first binary difference is 1, including:
The lowest order of described first binary number is subtracted one, obtains the second binary number.
5. according to the method described in any one of Claims 1-4, it is characterised in that described for each select described two System number, will be the described first every represented property parameters composition property parameters group specifying numerical value in described binary number Close, including:
For each described binary number selected, according to the rule of the true value in first appointment numerical value correspondence boolean's array, Everybody value of described binary number is converted to the element value in boolean's array successively, so that every in described boolean's array The most corresponding described property parameters of individual element value;
Extract the property parameters that in described boolean's array, true value is corresponding, the property parameters composition property parameters group that will extract Close.
6. a data processing equipment, it is characterised in that including:
Acquiring unit, for obtaining the property parameters that the data to be analyzed of pending statistical analysis have;
Relation determination unit, for the property parameters having according to described data to be analyzed, determines binary number to be generated Figure place, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and described two Everybody of system number represents a property parameters of described data to be analyzed respectively;
Binary number processing unit, for the figure place determined according to described relation determination unit, generates and has described figure place All described binary numbers, and from the multiple binary numbers generated, choose that to include predetermined number position be the first appointment numerical value Binary number, wherein, described first to specify numerical value be 0 or 1, and described predetermined number represents and carries out described data to be analyzed point The dimension of analysis;
Property parameters assembled unit, for each described binary number selected for described binary number processing unit, will Described binary number is the described first every represented property parameters composition property parameters combination specifying numerical value, wherein, In described binary number, the property parameters represented by the described first position specifying numerical value is the property parameters participating in statistical computation;
Computing unit, for the described property parameters combination obtained based on described property parameters assembled unit, to described to be analyzed Data carry out the statistics of described predetermined number dimension.
Device the most according to claim 6, it is characterised in that described binary number processing unit, including:
Initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place, and often Position is the initial binary number of the second appointment numerical value, using described initial binary number as the first binary number, wherein, second Specifying numerical value is 0 or 1;
Mediant signal generating unit, for according to preset rules and described first binary number, generates the second binary number, and described the Two binary numbers are 1 with the absolute value of described first binary difference;
Binary number chooses unit, if having pre-in described second binary number that number signal generating unit processed generates in the middle of described If the numerical value in number of bits is described first appointment numerical value, then select described second binary number;
Judging unit, for judging that whether everybody of described second binary number that described mediant signal generating unit generates be for being 3rd specifies numerical value, performs described property parameters assembled unit if it is, trigger;If it is not, then by current described second Binary number is as described first binary number, and returns the described intermediate binary number signal generating unit of execution;Wherein, the described 3rd Specifying numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Device the most according to claim 7, it is characterised in that when described second appointment numerical value is 0, described initial number is raw Become unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place, And every be 0 initial binary number
Described mediant signal generating unit, including:
First mediant signal generating unit, for adding one by the lowest order of described first binary number, obtains the second binary number.
Device the most according to claim 7, it is characterised in that when described second appointment numerical value is 1, described initial number is raw Become unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place, And every be 1 initial binary;
Described mediant signal generating unit, including:
First mediant signal generating unit, for subtracting one by the lowest order of described first binary number, obtains the second binary number.
10. according to the device described in any one of claim 6 to 9, it is characterised in that described property parameters assembled unit, including:
Boolean's array converting unit, for for each described binary number selected, specifying numerical value correspondence cloth according to first The rule of the true value in your array, is converted to the element value in boolean's array successively by everybody value of described binary number, with Make the most corresponding described property parameters of each element value in described boolean's array;
Parameter group zygote unit, for extracting the property parameters that in described boolean's array, true value is corresponding, the attribute that will extract Parameter composition property parameters combination.
CN201310573974.4A 2013-11-15 2013-11-15 A kind of data processing method and device Active CN103559413B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310573974.4A CN103559413B (en) 2013-11-15 2013-11-15 A kind of data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310573974.4A CN103559413B (en) 2013-11-15 2013-11-15 A kind of data processing method and device

Publications (2)

Publication Number Publication Date
CN103559413A CN103559413A (en) 2014-02-05
CN103559413B true CN103559413B (en) 2016-11-02

Family

ID=50013659

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310573974.4A Active CN103559413B (en) 2013-11-15 2013-11-15 A kind of data processing method and device

Country Status (1)

Country Link
CN (1) CN103559413B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107153651B (en) * 2016-03-03 2021-04-02 阿里巴巴集团控股有限公司 Multidimensional cross data processing method and apparatus
CN108461153B (en) * 2018-02-02 2022-03-15 上海市针灸经络研究所 Test data management method/system, computer readable storage medium and device
CN109840080B (en) * 2018-12-28 2022-08-26 东软集团股份有限公司 Character attribute comparison method and device, storage medium and electronic equipment
CN117829851A (en) * 2023-09-21 2024-04-05 江苏州际数码印花有限公司 Inspection system for textile shipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5301284A (en) * 1991-01-16 1994-04-05 Walker-Estes Corporation Mixed-resolution, N-dimensional object space method and apparatus
US6804664B1 (en) * 2000-10-10 2004-10-12 Netzero, Inc. Encoded-data database for fast queries
CN101730892A (en) * 2007-01-24 2010-06-09 迈可菲公司 Web reputation scoring

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000224585A (en) * 1999-02-01 2000-08-11 Ricoh Co Ltd Encoding and decoding device
US6904114B2 (en) * 2003-04-25 2005-06-07 J. Barry Shackleford Ones counter employing two dimensional cellular array

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5301284A (en) * 1991-01-16 1994-04-05 Walker-Estes Corporation Mixed-resolution, N-dimensional object space method and apparatus
US6804664B1 (en) * 2000-10-10 2004-10-12 Netzero, Inc. Encoded-data database for fast queries
CN101730892A (en) * 2007-01-24 2010-06-09 迈可菲公司 Web reputation scoring

Also Published As

Publication number Publication date
CN103559413A (en) 2014-02-05

Similar Documents

Publication Publication Date Title
Viloria et al. Improvements for determining the number of clusters in k-means for innovation databases in SMEs
Van Exel et al. The impact of crowdsourcing on spatial data quality indicators
Parmigiani et al. Complementarity, capabilities, and the boundaries of the firm: the impact of within‐firm and interfirm expertise on concurrent sourcing of complementary components
CN103559413B (en) A kind of data processing method and device
CN107609217B (en) Processing method and device for collision check data
CN105893561A (en) Ordering method and device
CN111611236A (en) Data analysis method and system
CN110009502B (en) Financial data analysis method, device, computer equipment and storage medium
CN106411587A (en) Simulation architecture suitable for performance evaluation of satellite communications network
CN104182544B (en) The dimension method for decomposing and device of analytical database
CN106651513B (en) Quotation method and device for circuit board orders
CN109933771B (en) Report automatic merging method, device, equipment and storage medium
US9251609B1 (en) Timelined spider diagrams
Li et al. A game model of supply chain management based on fractal analysis of time series
CN106022833A (en) Commodity customized method based on big data processing
CN103631832A (en) Service object ordering method, service object searching method and related device
Zhang et al. Bounded and discrete data in data envelopment analysis with assurance regions: application to design performance evaluation of gear shaping machines
CN106250565A (en) Querying method based on burst relevant database and system
Haruna et al. Effect of advanced manufacturing technology (AMT) on the product output of manufacturing small and medium scale enterprises in Nigeria
CN113077538B (en) Method and device for establishing three-dimensional temperature and humidity cloud picture of machine room and terminal equipment
CN110737704B (en) Data display method and device
KR102163595B1 (en) Positive Response Index Calculation Method
CN110110176B (en) Data display method and device
Mazhara et al. Limitations of Stock Price Valuation by Classical Methods: Critics of their Reliability and Influence of Behavioral Finance
CN113806223A (en) Software evaluation method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant