CN103559413B - A kind of data processing method and device - Google Patents
A kind of data processing method and device Download PDFInfo
- Publication number
- CN103559413B CN103559413B CN201310573974.4A CN201310573974A CN103559413B CN 103559413 B CN103559413 B CN 103559413B CN 201310573974 A CN201310573974 A CN 201310573974A CN 103559413 B CN103559413 B CN 103559413B
- Authority
- CN
- China
- Prior art keywords
- binary number
- property parameters
- numerical value
- binary
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Abstract
The invention provides a kind of data processing method and device, the method includes: obtain the property parameters that data to be analyzed have;The property parameters having according to data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of this binary number is identical with the number of the property parameters that data to be analyzed have, and everybody of this binary number represents a property parameters of these data to be analyzed respectively;Generate all described binary number with this figure place, and from the multiple binary numbers generated, choose and include the binary number that predetermined number position is the first appointment numerical value;For each binary number selected, everybody the represented property parameters composition property parameters in binary number being the first appointment numerical value is combined;Based on the property parameters combination obtained, treat analytical data and carry out the statistics of predetermined number dimension.The method can improve the precision to data analysis statistical.
Description
Technical field
The present invention relates to technical field of data processing, a kind of data processing method and device.
Background technology
In data statistics, it is often necessary to relate to re-scheduling and calculate.It is exactly to unite from data to be counted that so-called re-scheduling calculates
Count out the data record of specified type, to get rid of the data record being not belonging to this specified type.Such as, with data to be counted for certain
As a example by the sales data in individual supermarket, then this sales data includes many data record, and in every data record, tool contains and sells
Sell the attribute informations such as the trade name of commodity, production firm, selling time, if the commodity A selling this month carries out re-scheduling meter
After calculation, the most only can count selling time is this month, and the data record of trade name commodity A, and other data records
Then can be excluded.
In actual applications, data to be analyzed typically have multiple property parameters, it may be necessary to be based respectively on multiple difference
The incompatible re-scheduling carrying out multiple dimension of set of properties calculate, so, then need the number of dimensions artificially according to required statistics, list
Possible combinations of attributes situation, is based respectively on possible property parameters combination the most again and carries out re-scheduling calculating.
As being still introduced with above example, this sales data correspond to trade name, production firm, selling time this
Individual three property parameters, these three property parameters can be combined into 8 kinds of different dimensions combinations, i.e. these 8 kinds possible dimension groups
Conjunction comprises a three dimensionality combination, three two-dimensions combinations, three dimension combinations and a zero dimension degree combination.Wherein, this three
The three-dimensional arrangement being combined as being combined by trade name, production firm and selling time these three property parameters of dimension;These three
The combination of two dimensions is respectively as follows: the two-dimensional combination of the two-dimensional combination of trade name and production firm, trade name and selling time,
Production firm and the two-dimensional combination of selling time;The combination of these three dimension is trade name, production firm and pin the most respectively
Selling any one property parameters in the time is an one-dimensional combination;Zero dimension degree is exactly not consider that arbitrary property parameters combines.On
The commodity A selling this month that face is mentioned carries out re-scheduling calculating and is actually based on sale title and selling time the two attribute
The re-scheduling of a kind of two-dimensions of parameter combination calculates.
When the quantity of the property parameters that data have is n, the total quantity of property parameters based on different dimensions combination is then
It it is the n power of 2.Along with the increase of data complexity, the quantity of the property parameters that data have increases the most accordingly.When data have
Property parameters quantity bigger time, possible increases the most accordingly, so, has enumerated possible dimension and combined by the way of artificial
Through becoming impossible, and artificially enumerate the combination the most often occurring omitting some property parameters so that the dimension group obtained
Close not comprehensive, had influence on re-scheduling calculating, and then reduced the precision of data statistic analysis.
Summary of the invention
In view of this, the present invention provides a kind of data processing method and device, to improve the attribute utilizing data to be analyzed
The accuracy of parameter determination dimension combination, and then improve the precision of data statistic analysis.
For achieving the above object, the present invention provides following technical scheme: a kind of data processing method, including:
Obtain the property parameters that data to be analyzed have;
The property parameters having according to described data to be analyzed, determines the figure place of binary number to be generated, wherein, described
The figure place of binary number is identical with the number of the property parameters that described data to be analyzed have, and every point of described binary number
Do not represent a property parameters of described data to be analyzed;
Generate all described binary number with described figure place, and from the plurality of binary number generated, choose
Including the binary number that predetermined number position is the first appointment numerical value, wherein, described first appointment numerical value is 0 or 1;
For each described binary number selected, will described binary number be the every of described first appointment numerical value
Represented property parameters composition property parameters combination;
Based on the described property parameters combination obtained, described data to be analyzed are carried out the system of described predetermined number dimension
Meter.
Preferably, described generation has all described binary number of described figure place, and enters from the plurality of two generated
In number processed, choose and include the binary number that predetermined number position is the first appointment numerical value, including:
A: generate and have described figure place, and every initial binary number being described second appointment numerical value, at the beginning of described
Beginning binary number is as the first binary number, and wherein, the second appointment numerical value is 0 or 1;
B: according to preset rules and described first binary number, generates the second binary number, described second binary number with
The absolute value of described first binary difference is 1;
C: if having the numerical value on predetermined number position in described second binary number is the first appointment numerical value, then select institute
State the second binary number;
C: judge that everybody of described second binary number is whether for being the 3rd appointment numerical value, if it is, perform generation
The operation of described property parameters combination;If it is not, then using current described second binary number as described first binary number,
And return described step B;
Wherein, described 3rd appointment numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Preferably, when described second to specify numerical value be 0, described generation has a described figure place, and every is described the
The two initial binary numbers specifying numerical value, including:
Generation has a described figure place, and every be 0 initial binary number
Described according to preset rules with described first binary number, generate the second binary number, described second binary number
It is 1 with the absolute value of described first binary difference, including:
The lowest order of described first binary number is added one, obtains the second binary number.
Preferably, when described second to specify numerical value be 1, described generation has a described figure place, and every is described the
The two initial binary numbers specifying numerical value, including:
Generation has a described figure place, and every be 1 initial binary;
Described according to preset rules with described first binary number, generate the second binary number, described second binary number
It is 1 with the absolute value of described first binary difference, including:
The lowest order of described first binary number is subtracted one, obtains the second binary number.
Preferably, described for each described binary number selected, will described binary number be described first finger
Everybody represented property parameters composition property parameters of fixed number value combines, including:
For each described binary number selected, according to the rule of the true value in first appointment numerical value correspondence boolean's array
Then, everybody value of described binary number is converted to the element value in boolean's array successively, so that in described boolean's array
Each element value respectively corresponding described property parameters;
Extract the property parameters that in described boolean's array, true value is corresponding, the property parameters composition property parameters that will extract
Combination.
On the other hand, present invention also offers a kind of data processing equipment, including:
Acquiring unit, for obtaining the property parameters that data to be analyzed have;
Relation determination unit, for the property parameters having according to described data to be analyzed, determines binary system to be generated
The figure place of number, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and institute
State binary number everybody represent a property parameters of described data to be analyzed respectively;
Binary number processing unit, for the figure place determined according to described relation determination unit, generates and has institute's rheme
The all described binary number of number, and from the plurality of binary number generated, chooses that to include predetermined number position be first
Specifying the binary number of numerical value, wherein, described first appointment numerical value is 0 or 1;
Property parameters assembled unit, for each described binary system selected for described binary number processing unit
Number, will be the described first every represented property parameters composition property parameters combination specifying numerical value in described binary number;
Computing unit, for the described property parameters combination obtained based on described property parameters assembled unit, treats described
Analytical data carries out the statistics of described predetermined number dimension.
Preferably, described binary number processing unit, including:
Initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place,
And every initial binary number being described second appointment numerical value, using described initial binary number as the first binary number,
Wherein, the second appointment numerical value is 0 or 1;
Mediant signal generating unit, for according to preset rules and described first binary number, generates the second binary number, institute
The absolute value stating the second binary number and described first binary difference is 1;
Binary number chooses unit, if in described second binary number that number signal generating unit processed generates in the middle of described
Having the numerical value on predetermined number position is described first appointment numerical value, then select described second binary number;
Judging unit, for everybody judging described second binary number that described mediant signal generating unit generates be whether
It is the 3rd appointment numerical value, performs described property parameters assembled unit if it is, trigger;If it is not, then by described in current
Second binary number is as described first binary number, and returns the described intermediate binary number signal generating unit of execution;Wherein, described
3rd appointment numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Preferably, when described second appointment numerical value is 0, described initial number signal generating unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having
Figure place, and every be 0 initial binary number
Described mediant signal generating unit, including:
First mediant signal generating unit, for adding one by the lowest order of described first binary number, obtains the second binary system
Number.
Preferably, when described second appointment numerical value is 1, described initial number signal generating unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having
Figure place, and every be 1 initial binary;
Described mediant signal generating unit, including:
First mediant signal generating unit, for subtracting one by the lowest order of described first binary number, obtains the second binary system
Number.
Preferably, described property parameters assembled unit, including:
Boolean's array converting unit, for for each described binary number selected, specifying numerical value pair according to first
Answer the rule of true value in boolean's array, everybody value of described binary number is converted to the element in boolean's array successively
Value, so that the most corresponding described property parameters of each element value in described boolean's array;
Parameter group zygote unit, for extracting the property parameters that in described boolean's array, true value is corresponding, by extract
Property parameters composition property parameters combination.
Understand via above-mentioned technical scheme, the quantity of the property parameters that the present invention has according to these data to be analyzed, raw
Become the binary number of figure place identical with the quantity of this property parameters, the binary number of generation everybody represent this number to be analyzed respectively
According to a property parameters, owing to binary number is made up of 0 and 1, so can set 0 or 1 is the first appointment numerical value, and recognizes
For binary number is the first appointment numerical value position corresponding to property parameters participate in statistical computation, so, the binary system of generation
Number actually includes property parameters in these data to be analyzed and carries out each combining form of combination in any.Dimension when statistical analysis
When the number of degrees are predetermined number, select, from the binary number generated, the binary number that predetermined number position is the first appointment numerical value
After, for each binary number selected, will this binary number be every represented attribute of this first appointment numerical value
Parameter composition property parameters combination, just can be met all of property parameters combination of this number of dimensions, it is to avoid omit full
The combination of property parameters of this number of dimensions of foot, and then improve and treat, based on this number of dimensions, the precision that analytical data carries out adding up.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing
In having technology to describe, the required accompanying drawing used is briefly described, it should be apparent that, the accompanying drawing in describing below is only this
Inventive embodiment, for those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to according to
The accompanying drawing provided obtains other accompanying drawing.
Fig. 1 shows the schematic flow sheet of the present invention one embodiment of a kind of data processing method;
Fig. 2 shows the schematic flow sheet of the present invention another embodiment of a kind of data processing method;
Fig. 3 shows the structural representation of the present invention one embodiment of a kind of data processing equipment;
Fig. 4 shows the binary number processing unit one composition structural representation of a kind of data processing equipment of the present invention.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Describe, it is clear that described embodiment is only a part of embodiment of the present invention rather than whole embodiments wholely.Based on
Embodiment in the present invention, it is every other that those of ordinary skill in the art are obtained under not making creative work premise
Embodiment, broadly falls into the scope of protection of the invention.
See Fig. 1, it is shown that the schematic flow sheet of the present invention one embodiment of a kind of data processing method, the side of the present invention
Method can apply to calculate in node arbitrarily, and the method for the present embodiment may include that
S101, obtains the property parameters that data to be analyzed have.
Wherein, these data to be analyzed can be to need to carry out many data of statistical analysis, such as merchandise sales record, network
Bandwidth uses record etc..
The property parameters of these data to be analyzed is the ginseng describing this object, classification or feature representated by data to be analyzed
Number.The property parameters of data as to be analyzed in this can be each in name of the information as indicated in these data to be analyzed, data to be analyzed
The generation time etc. of data.When being communication use data such as data to be analyzed, property parameters can include in data to be analyzed
Telephone number corresponding to each bar call record, operator, type of call, the parameter such as duration of call.
It is understood that in data statistics field, the property parameters of data to be analyzed is referred to as data to be analyzed
Dimension, a property parameters of data to be analyzed is a dimension of these data to be analyzed.Carry out treating analytical data
During statistics, the one or more dimensions that can choose these data to be analyzed as required carry out statistical analysis.For example, it is possible to based on
This communication is used data to carry out statistical analysis by telephone number and two dimensions of operator.
S102, the property parameters having according to these data to be analyzed, determine the figure place of binary number to be generated.
Wherein, the figure place of this binary number is identical with the number of the property parameters that these data to be analyzed have, to be generated
Everybody of binary number represents a property parameters of these data to be analyzed respectively.
The figure place needing the binary number generated is determined by the quantity of the property parameters of these data to be analyzed, and needs to generate
Every of this binary number all to should a property parameters of data to be analyzed, and different property parameters is to should be to be generated
The not coordination of the binary number become.It is to say, set up binary everybody and these data to be analyzed that have that these needs generate
Corresponding relation between property parameters.
As, when the property parameters that these data to be analyzed have is 3, then may determine that binary number to be generated is 3
Binary number, and this binary number every represents a property parameters in these 3 property parameters, and binary number to be generated
The property parameters that characterized of not coordination different.Assume the property parameters of these data to be analyzed be respectively telephone number, operator,
During type of call, then can be need in the binary number that generates from high to low first to should telephone number, second
Corresponding operator, the 3rd corresponding type of call.
S103, generates all binary numbers with this figure place, and from the multiple binary numbers generated, chooses and include
Predetermined number position is the binary number of the first appointment numerical value.
According to step 102 is determined need generate binary number figure place after, then can generate and there is corresponding figure place
All possible binary number.Figure place as determined is 3, then need to generate all of triad number, i.e. generate
Binary number includes 000,001,010,011,100,101,110 and 111.
Due to everybody corresponding property parameters of the most prespecified binary number to be generated, therefore,
For any one binary number generated, every of each binary number is all to should an attribute of data to be analyzed
Parameter.
Wherein, this predetermined number can set as required, typically carries out required number of dimensions treating analytical data
Amount determines.As, when carrying out data analysis, needing to carry out the analysis of 3 dimensions, then this predetermined number can be set as 3.Its
In, dimension describes and a data object is analyzed required number of parameters.If desired for carrying out three dimensionality analysis, then need
Determine all of property parameters combination being made up of three property parameters of these data to be analyzed.
Owing to binary number is made up of 0 or 1 number, in this binary number, everybody is only 0 or 1, therefore, and should
First appointment numerical value is 0 or 1.This first appointment numerical value can be set as a numerical value in 0 and 1, specifically can be as required
Set.
In the embodiment of the present application, this first specify numerical value represent participate in statistical computation, therefore, if in binary number certain
The numerical value of position is this first appointment numerical value, then it represents that this corresponding property parameters participates in statistical computation.Accordingly, binary system
In certain in numerical value be not this first specify numerical value, then this corresponding property parameters is not involved in treating analytical data
In statistical computation.
If be the first appointment numerical value due to the numerical value on binary number certain, then it represents that this institute of this binary number
Corresponding property parameters participates in statistical computation, accordingly, it is determined that go out, these data to be analyzed is carried out the number of dimensions needed for statistical analysis,
After i.e. determining that many small number the property parameters of needs are combined, in order to which there is meet the attribute ginseng of this number of dimensions
The combination of number, then can choose and include the binary system that predetermined number position is the first appointment numerical value from the binary number generated
Number.
Having the numerical value on predetermined number position in the binary number selected is the first appointment numerical value, and selects for any one
For the binary number taken out, this binary number is everybody corresponding property parameters of the first appointment numerical value is grouped together
The property parameters combination obtained, is the combination of a kind of property parameters meeting this number of dimensions determined.
Such as, with data to be analyzed, there are 3 property parameters and be introduced, the binary number of generation includes 000,001,
010,011,100,101,110 and 111, specify numerical value for 1 with first, as a example by needing to carry out the analysis of 2 dimensions, then need choosing
Taking out and have the binary number that numerical value is 1 on two, the binary number selected includes 011,101 and 110.
S104, for each binary number selected, by this binary number for this first every institute specifying numerical value
The property parameters composition property parameters combination represented.
In order to according to the binary number selected, determine the property parameters that can participate in calculating that this binary number is corresponding
Combination, then need to determine respectively everybody represented property parameters that this binary number is the first appointment numerical value, then will
The property parameters determined is combined, and obtains property parameters combination.Owing to selecting multiple binary number, each binary system
Number the most corresponding property parameters combination, then can obtain the combination of multiple property parameters.
Such as, still with in previously described binary number from high to low first to should telephone number, second pair
Answering operator, the 3rd corresponding type of call, then as a example by needing to generate 3 bits, it will again be assumed that treats analytical data and carries out 2
The analysis of individual dimension, and first to specify numerical value be 1, then from generating binary number, the binary number selected is 011,101 and
110, and binary number 011 is 1 everybody be respectively second and the 3rd, property parameters corresponding to second is for running
Business, the 3rd corresponding property parameters is type of call, and therefore the property parameters of this binary number 011 correspondence is combined as operator
With the combination of type of call, i.e. based on the two dimension, these data to be analyzed are carried out statistical analysis.Accordingly, binary number
The telephone number of 101 correspondences and the combination of type of call the two property parameters, the corresponding telephone number of binary number 110 and operation
The combination of business's the two property parameters.
S105, based on the property parameters combination obtained, treats analytical data and carries out the statistics of predetermined number dimension.
After obtaining the combination of all of property parameters, the combination of each property parameters can be based respectively on and carry out the respective dimension number of degrees
Statistical analysis.If predetermined number is 2, and certain property parameters is combined as comprising the combination of telephone number and operator, then may be used
Carry out the statistics of two-dimensions treating analytical data based on the two property parameters.
Wherein, combination based on the property parameters obtained, treat analytical data and carry out the statistics of respective dimensions, with existing
Mode is similar, does not repeats them here.
It is understood that in actual applications, can need to be based respectively on multiple different dimension and carry out data analysis,
Therefore, this predetermined number can set multiple.For example, it is desired to during the analysis of 2 dimensions and 3-dimensional degree, then predetermined number can be 2 Hes
3.But for each predetermined number, choosing binary number, and determining that the anabolic process of property parameters is all identical.
In the present embodiment, according to the quantity of the property parameters that these data to be analyzed have, generate and this property parameters
The binary number of the identical figure place of quantity, the binary number of generation everybody represent a property parameters of these data to be analyzed respectively,
Owing to binary number is made up of 0 and 1, so can set 0 or 1 is the first appointment numerical value, and thinks in binary number to be
One specifies the property parameters corresponding to position of numerical value to participate in statistical computation, and so, the binary number of generation actually includes this
In data to be analyzed, property parameters carries out each combining form of combination in any.It is predetermined number when needing the number of dimensions carried out
Time, after selecting, from the binary number generated, the binary number that predetermined number position is the first appointment numerical value, for select
Each binary number, forms property parameters by everybody the represented property parameters in this binary number being this first appointment numerical value
Combination, just can be met all of property parameters combination of this number of dimensions, it also avoid and omit the genus meeting this number of dimensions
The combination of property parameter, and then improve and treat analytical data and carry out the precision of statistical computation.
Meanwhile, the method is to utilize computer to utilize binary mode to determine the property parameters combination meeting this number of dimensions
Provide possibility, and then can improve and treat, based on this number of dimensions, the convenience that analytical data carries out adding up.
See Fig. 2, it is shown that the schematic flow sheet of the present invention another embodiment of a kind of data processing method, the present embodiment
Method may include that
S201, obtains the property parameters that data to be analyzed have.
S202, the property parameters having according to these data to be analyzed, determine the figure place of binary number to be generated.
Wherein, the figure place of this binary number is identical with the number of the property parameters that these data to be analyzed have, to be generated
It is binary that everybody represents a property parameters of these data to be analyzed respectively.
S203, generates and has figure place, and every initial binary number being the second appointment numerical value, by this initial binary
Number is as the first binary number.
Wherein, the second appointment numerical value is 0 or 1, and one in only 0 and 1 determines numerical value.
When this second appointment numerical value difference, the initial binary number of generation is the most different.If this second appointment numerical value is 1
Time, everybody of this initial binary number is 1;As this second to specify numerical value be 0 time, everybody of this initial binary number is 0.
Such as, these data to be analyzed have 3 property parameters, and when this second appointment numerical value is 0, then initial two generated
System number is 000.
S204, according to preset rules and the first binary number, generates the second binary number.
Wherein, the second binary number of generation is 1 with the absolute value of this first binary difference.
Wherein, being 0 or 1 according to this second appointment numerical value, the mode generating this second binary number is the most different, but is both needed to
Ensure that this second binary number is different from this first binary number, and this second binary number that current time generates is before
The binary number not generated.
As, when this second appointment numerical value is 0, then the mode generating this second binary number is: by this first binary system
The lowest order of number adds one, obtains the second binary number.Wherein, such as, when being 000 with initial binary number, if current time
000 is the first binary number, then the second binary number is 001.
And for example, when this second appointment numerical value is 1, then the mode generating this second binary number is: enter the two or two
The lowest order of number processed subtracts one, obtains the second binary number.Such as, when this first binary number is initial binary number 111, then give birth to
The second binary number become is 110.
S205, if having the numerical value on predetermined number position in this second binary number is this first appointment numerical value, then chooses
Go out this second binary number.
Often generate second binary number, be required to judge whether this second binary number has the number on predetermined number position
Value is the first appointment numerical value, if having the numerical value on predetermined number position in this second binary number is the first appointment numerical value, then protects
Deposit this second binary number;If the numerical value on the position of Non-precondition quantity is the first appointment numerical value in this second binary number,
The most directly carry out step 206.
Certainly, if needing to carry out statistical analysis based on multiple dimension, then multiple predetermined number can be set, therefore, as
Really this second binary number is that the position of the first numerical value quantity reaches any one predetermined number, all selects current the two or two and enters
Number processed.
Wherein, this first appointment numerical value is 0 or 1.
S206, it is judged that whether everybody of this second binary number be for being the 3rd appointment numerical value, if it is, perform step
208;If it does not, perform step 207.
S207, using current described second binary number as described first binary number, and returns described step 204.
Wherein, the 3rd appointment numerical value is a numerical value in 0 and 1, and the 3rd specifies numerical value to be different from the second appointment
Numerical value.
When everybody in this second binary number is the 3rd appointment numerical value, then illustrate to have generated all this figure places
Binary number, if the binary number that there is also this figure place is not generated, then this second binary number is entered as the one or two
Number processed, returns this step 204, continues to generate next second binary number, until having binary number all quilts of this figure place
Generate.
In the present embodiment, with one as step value, progressively generate the second binary number, e.g., use the side of increasing or decreasing
Formula, is increased or decreased the numerical value of binary number, such that it is able to obtain the binary number of this figure places all, it is to avoid binary number something lost
Leakage or repetition.
S208, for each binary number selected, by this binary number for this first every institute specifying numerical value
The property parameters composition property parameters combination represented.
S209, based on the property parameters combination obtained, treats analytical data and carries out the statistics of predetermined number dimension.
Wherein, this step 207 is similar to the related introduction of preceding embodiment with the operation of step 208, does not repeats them here.
It is understood that the operation of this step 205 and step 206 is not limited to shown in Fig. 2, this step 205 and step 206
Can also carry out simultaneously.
In the present embodiment, the numerical value that this first appointment numerical value is similarly in 0 and 1.This first appointment numeric representation
When any one corresponding property parameters of binary number participates in statistical computation, the numerical value in corresponding positions.As, if first refers to
When fixed number value is 1, if binary number is 000, then it represents that three corresponding property parameters of this binary number are all not involved in system
Meter is analyzed, the property parameters combination being combined as 0 dimension of the property parameters that this binary number is corresponding;If binary number is
011, then it represents that rear two corresponding property parameters of this binary number can participate in statistical analysis, and rear the two of this binary number
The property parameters of position correspondence is combined into the property parameters combination of 2 dimensions.
This second appointment numerical value is used for limiting initial binary number, and generates subsequent binary based on initial binary number
The mode of number.Therefore, first specifies numerical value and second to specify the meaning difference of numerical value, refers to setting the first appointment numerical value and second
During fixed number value, this first appointment numerical value and second can be set and specify numerical value identical, as being all 0;This first finger can also be set
Fixed number value specifies numerical value different with second, and if this second appointment numerical value is 0, and this second appointment numerical value is 1.
For the ease of understanding the scheme of the present embodiment, below as a example by data to be analyzed are for merchandise sales record, it is assumed that business
The property parameters that product sales record includes has trade name, manufacturer and selling time, and to need to carry out the analysis of 2 dimensions
As a example by be described in detail.Assume that the first appointment numerical value is 1 to be introduced, say, that binary number numerical value on certain is
1 represents that this corresponding property parameters participates in statistical analysis.Setting second specifies numerical value as 0, and the 3rd appointment numerical value is
1.Then generating this initial binary number is 000, using this 000 as first binary number perform step 204, this by 000 minimum
Position adds one, obtains the second binary number 001.This has the numerical value on 1 in 001 is 1, then preserve out this 001.Simultaneously, it is judged that
This 001 everybody be not all of being 1, then this is 001 as the first binary number, returns and performs this step 204, by 001 minimum
Position adds 1, obtains the second binary number 010, the like, until three of the second binary number 111,111 generated are 1,
Then perform subsequent step 207.
Finally, the binary number selected has 011,101 and 110, wherein, 011 corresponding manufacturer and the two of selling time
Dimension combines, 101 corresponding goods titles and two dimension combinations of selling time, 110 corresponding goods titles and the two of manufacturer
The combination of individual dimension.So, combine based on these three two dimension, corresponding statistical analysis can be carried out respectively.
Being only used in embodiments of the present invention describe conveniently, the property parameters having with data to be analyzed is 3 or 4
As a example by be described, but it is understood that, the property parameters that data the most to be analyzed have can have very
Many, when the property parameters that data to be analyzed have is the biggest, the method for the application present invention more embodies it and avoids omitting attribute
Parameter combines, and then improves the advantage such as precision of statistical analysis.
Further, in one embodiment of any of the above, for the ease of judging which position in the binary number selected
Corresponding property parameters participates in statistical analysis, after selecting binary number, for each binary number selected, according to
First rule specifying the true value in numerical value correspondence boolean's array, is converted to Bolean number successively by everybody value of this binary number
Element value in group, so that the most corresponding property parameters of each element value in boolean's array.
It is to say, this binary number is the value in the boolean value corresponding to position of the first appointment numerical value be true value,
And for this, first to specify the value in the boolean value corresponding to position of numerical value be false in this binary number, wherein, this binary number
In everybody corresponding property parameters, be corresponding in turn to each element of this boolean's array.Such as, the first appointment numerical value is 1, and two enter
When number processed is 101, then first element value during the highest order of this binary number is converted to boolean's array is ture, this binary system
Second element value that the second of number is converted in boolean's array is false, and the value of the lowest order of this binary number is converted to
The 3rd element value in boolean value is ture, therefore, boolean's array that this binary number is changed out be ture,
False, ture}, wherein, the property parameters that in this boolean's array, first element is corresponding is the highest in this binary number 101
The property parameters that position is corresponding, the second that property parameters is this binary number 101 that in this boolean's array, second element is corresponding
Corresponding property parameters, in this boolean's array, the 3rd property parameters that element is corresponding is the lowest order that this binary system counts 101
Corresponding property parameters.
After obtaining boolean's array, extract the property parameters that true value ture in boolean value is corresponding, the attribute that will extract
Parameter composition property parameters combination.Such as, boolean's array is { when ture, false, ture}, to extract this Bolean number the most respectively
In group, extract two property parameters are combined into property parameters by first element and the 3rd property parameters that element is corresponding
Combination.
The data processing method of the corresponding present invention, present invention also offers a kind of data processing equipment, sees Fig. 3, it is shown that
The structural representation of the present invention one embodiment of a kind of data processing equipment, the device of the present embodiment may include that acquisition is single
Unit 301, relation determination unit 302, binary number processing unit 303, property parameters assembled unit 304 and computing unit 305.
Wherein, acquiring unit 301, for obtaining the property parameters that data to be analyzed have;
Relation determination unit 302, for the property parameters having according to described data to be analyzed, determines that to be generated two enter
The figure place of number processed, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and
Everybody of described binary number represents a property parameters of described data to be analyzed respectively;
Binary number processing unit 303, for the figure place determined according to described relation determination unit, generates described in having
The all described binary number of figure place, and from the plurality of binary number generated, chooses that to include predetermined number position be the
One binary number specifying numerical value, wherein, described first appointment numerical value is 0 or 1;
Property parameters assembled unit 304, each described two for selecting for described binary number processing unit enter
Number processed, will be the described first every represented property parameters composition property parameters group specifying numerical value in described binary number
Close;
Computing unit 305, for the described property parameters combination obtained based on described property parameters assembled unit, to described
Data to be analyzed carry out the statistics of described predetermined number dimension.
In the present embodiment, the quantity of the property parameters that relation determination unit has according to data to be analyzed, determine and treat
The figure place of binary number generated, and binary number to be generated everybody represent an attribute ginseng of these data to be analyzed respectively
Number, owing to binary number is made up of 0 and 1, so can set 0 or 1 is the first appointment numerical value, and thinks in binary number and be
First specifies the property parameters corresponding to position of numerical value to participate in statistical computation, and so, this binary number processing unit is according to this pass
System determines the figure place that unit is determined, all binary numbers generating corresponding figure place actually include in these data to be analyzed
Property parameters carries out each combining form of combination in any.
And this binary number processing unit is from the binary number generated, and to select predetermined number position be the first appointment numerical value
Binary number after, for each binary number selected, by this binary number for this first every institute specifying numerical value
The property parameters composition property parameters combination represented, just can be met and all of genus of this predetermined number identical dimensional number
Property parameter combination, it is to avoid omit the combination of the property parameters meeting this number of dimensions, and then improve and treat based on this number of dimensions
Analytical data carries out the precision of statistical computation.
Wherein, binary number processing unit generates according to the figure place that described relation determination unit is determined and has this figure place
The mode of binary number can have multiple.See Fig. 4, it is shown that in a kind of data processing equipment of the present invention, binary number processes
The structural representation of a kind of implementation of unit, in the present embodiment, this binary number processing unit 303, including:
Initial number signal generating unit 3031, for the figure place determined according to described relation determination unit, generates described in having
Figure place, and every initial binary number being described second appointment numerical value, enter described initial binary number as the one or two
Number processed, wherein, the second appointment numerical value is 0 or 1;
Mediant signal generating unit 3032, for according to preset rules and described first binary number, generates the second binary system
Number, described second binary number is 1 with the absolute value of described first binary difference;
Binary number chooses unit 3033, if described second binary number generated for described mediant signal generating unit
In have the numerical value on predetermined number position be described first specify numerical value, then select described second binary number;
Judging unit 3034, for everybody judging described second binary number that described mediant signal generating unit generates be
No for being the 3rd appointment numerical value, perform described property parameters assembled unit if it is, trigger;If it is not, then by current
Described second binary number is as described first binary number, and returns the described mediant signal generating unit of execution;Wherein, described
Three appointment numerical value are 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Wherein, this first appointment numerical value can specify numerical value identical with second, it is also possible to is and the 3rd appointment numerical value phase
With.
Wherein, this judging unit 3034 can be chosen unit at this binary number and determines whether to choose this second binary system
After number, then perform to judge whether everybody of this this second binary number is the 3rd operation specifying numerical value.This judging unit
Can also be to choose while unit carries out selection operation to this second binary number at this binary number, perform corresponding to judge
Operation.
When this second appointment numerical value is 0, this initial number signal generating unit, may include that
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having
Figure place, and every be 0 initial binary number
Accordingly, this mediant signal generating unit, may include that
First mediant signal generating unit, for adding one by the lowest order of described first binary number, obtains the second binary system
Number.
When described second appointment numerical value is 1, described initial number signal generating unit, may include that
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates described in having
Figure place, and every be 1 initial binary;
Described mediant signal generating unit, including:
First mediant signal generating unit, for subtracting one by the lowest order of described first binary number, obtains the second binary system
Number.
Further, in one embodiment of any of the above, this property parameters assembled unit, may include that
Boolean's array converting unit, for for each described binary number selected, specifying numerical value pair according to first
Answer the rule of true value in boolean's array, everybody value of described binary number is converted to the element in boolean's array successively
Value, so that the most corresponding described property parameters of each element value in described boolean's array;
Parameter group zygote unit, for extracting the property parameters that in described boolean's array, true value is corresponding, by extract
Property parameters composition property parameters combination.
In this specification, each embodiment uses the mode gone forward one by one to describe, and what each embodiment stressed is and other
The difference of embodiment, between each embodiment, identical similar portion sees mutually.For device disclosed in embodiment
For, owing to it corresponds to the method disclosed in Example, so describe is fairly simple, relevant part sees method part and says
Bright.
Described above to the disclosed embodiments, makes professional and technical personnel in the field be capable of or uses the present invention.
Multiple amendment to these embodiments will be apparent from for those skilled in the art, as defined herein
General Principle can realize without departing from the spirit or scope of the present invention in other embodiments.Therefore, the present invention
It is not intended to be limited to the embodiments shown herein, and is to fit to and principles disclosed herein and features of novelty phase one
The widest scope caused.
Claims (10)
1. a data processing method, it is characterised in that including:
Obtain the property parameters that the data to be analyzed of pending statistical analysis have;
The property parameters having according to described data to be analyzed, determines the figure place of binary number to be generated, and wherein, described two enter
The figure place of number processed is identical with the number of the property parameters that described data to be analyzed have, and every table respectively of described binary number
Show a property parameters of described data to be analyzed;
Generate and there is all described binary number of described figure place, and from the multiple binary numbers generated, choose include pre-
If the binary number that number of bits is the first appointment numerical value, wherein, described first appointment numerical value is 0 or 1, and described predetermined number represents
The dimension that described data to be analyzed are analyzed;
For each described binary number selected, by described binary number for the described first every institute table specifying numerical value
The property parameters composition property parameters combination shown, wherein, in described binary number represented by the described first position specifying numerical value
Property parameters be participate in statistical computation property parameters;
Based on the described property parameters combination obtained, described data to be analyzed are carried out the statistics of described predetermined number dimension.
Method the most according to claim 1, it is characterised in that described generation has all described binary system of described figure place
Number, and from the plurality of binary number generated, choose and include the binary number that predetermined number position is the first appointment numerical value,
Including:
A: generate and there is described figure place, and every initial binary number being the second appointment numerical value, by described initial binary
Number is as the first binary number, and wherein, the second appointment numerical value is 0 or 1;
B: according to preset rules and described first binary number, generates the second binary number, and described second binary number is with described
The absolute value of first binary difference is 1;
C: if having the numerical value on predetermined number position in described second binary number is the first appointment numerical value, then select described
Two binary numbers;
C: judge everybody of described second binary number whether for being the 3rd appointment numerical value, if it is, perform to generate described
The operation of property parameters combination;If it is not, then using current described second binary number as described first binary number, and return
Return described step B;
Wherein, described 3rd appointment numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Method the most according to claim 2, it is characterised in that when described second appointment numerical value is 0, described generation has
Described figure place, and every initial binary number being described second appointment numerical value, including:
Generation has a described figure place, and every be 0 initial binary number
Described according to preset rules with described first binary number, generate the second binary number, described second binary number and institute
The absolute value stating first binary difference is 1, including:
The lowest order of described first binary number is added one, obtains the second binary number.
Method the most according to claim 2, it is characterised in that when described second appointment numerical value is 1, described generation has
Described figure place, and every initial binary number being described second appointment numerical value, including:
Generation has a described figure place, and every be 1 initial binary;
Described according to preset rules with described first binary number, generate the second binary number, described second binary number and institute
The absolute value stating first binary difference is 1, including:
The lowest order of described first binary number is subtracted one, obtains the second binary number.
5. according to the method described in any one of Claims 1-4, it is characterised in that described for each select described two
System number, will be the described first every represented property parameters composition property parameters group specifying numerical value in described binary number
Close, including:
For each described binary number selected, according to the rule of the true value in first appointment numerical value correspondence boolean's array,
Everybody value of described binary number is converted to the element value in boolean's array successively, so that every in described boolean's array
The most corresponding described property parameters of individual element value;
Extract the property parameters that in described boolean's array, true value is corresponding, the property parameters composition property parameters group that will extract
Close.
6. a data processing equipment, it is characterised in that including:
Acquiring unit, for obtaining the property parameters that the data to be analyzed of pending statistical analysis have;
Relation determination unit, for the property parameters having according to described data to be analyzed, determines binary number to be generated
Figure place, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and described two
Everybody of system number represents a property parameters of described data to be analyzed respectively;
Binary number processing unit, for the figure place determined according to described relation determination unit, generates and has described figure place
All described binary numbers, and from the multiple binary numbers generated, choose that to include predetermined number position be the first appointment numerical value
Binary number, wherein, described first to specify numerical value be 0 or 1, and described predetermined number represents and carries out described data to be analyzed point
The dimension of analysis;
Property parameters assembled unit, for each described binary number selected for described binary number processing unit, will
Described binary number is the described first every represented property parameters composition property parameters combination specifying numerical value, wherein,
In described binary number, the property parameters represented by the described first position specifying numerical value is the property parameters participating in statistical computation;
Computing unit, for the described property parameters combination obtained based on described property parameters assembled unit, to described to be analyzed
Data carry out the statistics of described predetermined number dimension.
Device the most according to claim 6, it is characterised in that described binary number processing unit, including:
Initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place, and often
Position is the initial binary number of the second appointment numerical value, using described initial binary number as the first binary number, wherein, second
Specifying numerical value is 0 or 1;
Mediant signal generating unit, for according to preset rules and described first binary number, generates the second binary number, and described the
Two binary numbers are 1 with the absolute value of described first binary difference;
Binary number chooses unit, if having pre-in described second binary number that number signal generating unit processed generates in the middle of described
If the numerical value in number of bits is described first appointment numerical value, then select described second binary number;
Judging unit, for judging that whether everybody of described second binary number that described mediant signal generating unit generates be for being
3rd specifies numerical value, performs described property parameters assembled unit if it is, trigger;If it is not, then by current described second
Binary number is as described first binary number, and returns the described intermediate binary number signal generating unit of execution;Wherein, the described 3rd
Specifying numerical value is 0 or 1, and the described 3rd specifies numerical value to be different from described second appointment numerical value.
Device the most according to claim 7, it is characterised in that when described second appointment numerical value is 0, described initial number is raw
Become unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place,
And every be 0 initial binary number
Described mediant signal generating unit, including:
First mediant signal generating unit, for adding one by the lowest order of described first binary number, obtains the second binary number.
Device the most according to claim 7, it is characterised in that when described second appointment numerical value is 1, described initial number is raw
Become unit, including:
First initial number signal generating unit, for the figure place determined according to described relation determination unit, generates and has described figure place,
And every be 1 initial binary;
Described mediant signal generating unit, including:
First mediant signal generating unit, for subtracting one by the lowest order of described first binary number, obtains the second binary number.
10. according to the device described in any one of claim 6 to 9, it is characterised in that described property parameters assembled unit, including:
Boolean's array converting unit, for for each described binary number selected, specifying numerical value correspondence cloth according to first
The rule of the true value in your array, is converted to the element value in boolean's array successively by everybody value of described binary number, with
Make the most corresponding described property parameters of each element value in described boolean's array;
Parameter group zygote unit, for extracting the property parameters that in described boolean's array, true value is corresponding, the attribute that will extract
Parameter composition property parameters combination.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310573974.4A CN103559413B (en) | 2013-11-15 | 2013-11-15 | A kind of data processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310573974.4A CN103559413B (en) | 2013-11-15 | 2013-11-15 | A kind of data processing method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103559413A CN103559413A (en) | 2014-02-05 |
CN103559413B true CN103559413B (en) | 2016-11-02 |
Family
ID=50013659
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310573974.4A Active CN103559413B (en) | 2013-11-15 | 2013-11-15 | A kind of data processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103559413B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107153651B (en) * | 2016-03-03 | 2021-04-02 | 阿里巴巴集团控股有限公司 | Multidimensional cross data processing method and apparatus |
CN108461153B (en) * | 2018-02-02 | 2022-03-15 | 上海市针灸经络研究所 | Test data management method/system, computer readable storage medium and device |
CN109840080B (en) * | 2018-12-28 | 2022-08-26 | 东软集团股份有限公司 | Character attribute comparison method and device, storage medium and electronic equipment |
CN117829851A (en) * | 2023-09-21 | 2024-04-05 | 江苏州际数码印花有限公司 | Inspection system for textile shipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5301284A (en) * | 1991-01-16 | 1994-04-05 | Walker-Estes Corporation | Mixed-resolution, N-dimensional object space method and apparatus |
US6804664B1 (en) * | 2000-10-10 | 2004-10-12 | Netzero, Inc. | Encoded-data database for fast queries |
CN101730892A (en) * | 2007-01-24 | 2010-06-09 | 迈可菲公司 | Web reputation scoring |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2000224585A (en) * | 1999-02-01 | 2000-08-11 | Ricoh Co Ltd | Encoding and decoding device |
US6904114B2 (en) * | 2003-04-25 | 2005-06-07 | J. Barry Shackleford | Ones counter employing two dimensional cellular array |
-
2013
- 2013-11-15 CN CN201310573974.4A patent/CN103559413B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5301284A (en) * | 1991-01-16 | 1994-04-05 | Walker-Estes Corporation | Mixed-resolution, N-dimensional object space method and apparatus |
US6804664B1 (en) * | 2000-10-10 | 2004-10-12 | Netzero, Inc. | Encoded-data database for fast queries |
CN101730892A (en) * | 2007-01-24 | 2010-06-09 | 迈可菲公司 | Web reputation scoring |
Also Published As
Publication number | Publication date |
---|---|
CN103559413A (en) | 2014-02-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Viloria et al. | Improvements for determining the number of clusters in k-means for innovation databases in SMEs | |
Van Exel et al. | The impact of crowdsourcing on spatial data quality indicators | |
Parmigiani et al. | Complementarity, capabilities, and the boundaries of the firm: the impact of within‐firm and interfirm expertise on concurrent sourcing of complementary components | |
CN103559413B (en) | A kind of data processing method and device | |
CN107609217B (en) | Processing method and device for collision check data | |
CN105893561A (en) | Ordering method and device | |
CN111611236A (en) | Data analysis method and system | |
CN110009502B (en) | Financial data analysis method, device, computer equipment and storage medium | |
CN106411587A (en) | Simulation architecture suitable for performance evaluation of satellite communications network | |
CN104182544B (en) | The dimension method for decomposing and device of analytical database | |
CN106651513B (en) | Quotation method and device for circuit board orders | |
CN109933771B (en) | Report automatic merging method, device, equipment and storage medium | |
US9251609B1 (en) | Timelined spider diagrams | |
Li et al. | A game model of supply chain management based on fractal analysis of time series | |
CN106022833A (en) | Commodity customized method based on big data processing | |
CN103631832A (en) | Service object ordering method, service object searching method and related device | |
Zhang et al. | Bounded and discrete data in data envelopment analysis with assurance regions: application to design performance evaluation of gear shaping machines | |
CN106250565A (en) | Querying method based on burst relevant database and system | |
Haruna et al. | Effect of advanced manufacturing technology (AMT) on the product output of manufacturing small and medium scale enterprises in Nigeria | |
CN113077538B (en) | Method and device for establishing three-dimensional temperature and humidity cloud picture of machine room and terminal equipment | |
CN110737704B (en) | Data display method and device | |
KR102163595B1 (en) | Positive Response Index Calculation Method | |
CN110110176B (en) | Data display method and device | |
Mazhara et al. | Limitations of Stock Price Valuation by Classical Methods: Critics of their Reliability and Influence of Behavioral Finance | |
CN113806223A (en) | Software evaluation method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |