CN103559413A - Data processing method and device - Google Patents

Data processing method and device Download PDF

Info

Publication number
CN103559413A
CN103559413A CN201310573974.4A CN201310573974A CN103559413A CN 103559413 A CN103559413 A CN 103559413A CN 201310573974 A CN201310573974 A CN 201310573974A CN 103559413 A CN103559413 A CN 103559413A
Authority
CN
China
Prior art keywords
binary number
numerical value
property parameters
binary
place
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310573974.4A
Other languages
Chinese (zh)
Other versions
CN103559413B (en
Inventor
曹艳白
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd
Original Assignee
BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd filed Critical BEIJING SOUFUN SCIENCE & TECHNOLOGY DEVELOPMENT Co Ltd
Priority to CN201310573974.4A priority Critical patent/CN103559413B/en
Publication of CN103559413A publication Critical patent/CN103559413A/en
Application granted granted Critical
Publication of CN103559413B publication Critical patent/CN103559413B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a data processing method and device. The method comprises following steps: attribute parameters of to-be-analyzed data are acquired; the digit number of a to-be-generated binary number is determined according to the attribute parameters of the to-be-analyzed data, wherein the digit number of the binary number is the same as the number of the attribute parameters of the to-be-analyzed data, and each digit of the binary number represents one attribute parameter of the to-be-analyzed data; all binary numbers with the digit number are generated, and binary numbers which contain a preset quantity digit as a first assigned value are selected from the multiple generated binary numbers; for each selected binary number, the attribute parameters represented by each digit of the first assigned value in each binary number constitute an attribute parameter combination; and preset quantity dimensionality statistics is performed on the to-be-analyzed data based on the obtained attribute parameter combination. The method can improve data analysis statistical accuracy.

Description

A kind of data processing method and device
Technical field
The present invention relates to technical field of data processing, relate in particular a kind of data processing method and device.
Background technology
In data statistics, often need to relate to re-scheduling and calculate.So-called re-scheduling is calculated and from treat statistics, is counted exactly the data recording of specified type, to get rid of the data recording that does not belong to this specified type.For example, take and treat that the sales data that statistics is certain supermarket is example, this sales data comprises many data recording, in every data recording, tool has comprised the attribute information such as the trade name of merchandising, production firm, selling time, if the commodity A that this month is sold carries out after re-scheduling calculating, only can count selling time is this month, and trade name is the data recording of commodity A, and other data recording can be excluded.
In actual applications, data to be analyzed generally have a plurality of property parameters, may need based on the incompatible re-scheduling of carrying out a plurality of dimensions of a plurality of different set of properties, to calculate respectively, like this, need artificial according to the number of dimensions of required statistics, list possible combinations of attributes situation, and then re-scheduling calculating is carried out in the combination of the property parameters based on possible respectively.
As being still introduced with example above, this sales data correspondence trade name, production firm, these three property parameters of selling time, these three property parameters can be combined into 8 kinds of different dimension combinations, i.e. these 8 kinds possible dimension combinations comprise a three dimensionality combination, three two-dimensions combinations, three dimension combinations and a zero dimension degree combination.Wherein, this three dimensionality is combined as the three-dimensional array by this three property parameters combination of trade name, production firm and selling time; The combination of these three two dimensions is respectively: the two dimension combination of trade name and production firm, the two dimension combination of trade name and selling time, the two dimension combination of production firm and selling time; The combination of these three dimensions is respectively that in trade name, production firm and selling time, any one property parameters is an one dimension combination; Zero dimension degree is exactly not consider property parameters combination arbitrarily.The commodity A that this month is sold above-mentioned carries out re-scheduling and calculates the re-scheduling calculating being actually based on selling a kind of two-dimensions of these two property parameters combinations of title and selling time.
When the quantity of the property parameters having when data is n, the total quantity of the property parameters combination based on different dimensions is 2 n power.Along with the increase of data complexity, the quantity of the property parameters that data have is corresponding increasing also.When the property parameters quantity that has when data is larger; possible also corresponding increasing; like this; by artificial mode, enumerate possible dimension combination and become impossible; and people enumerates also often to there will be the combination of omitting some property parameters; the dimension combination that makes to obtain is not comprehensive, has had influence on re-scheduling calculating, and then has reduced the precision of data statistic analysis.
Summary of the invention,
In view of this, the invention provides a kind of data processing method and device, to improve, utilize the property parameters of data to be analyzed to determine the accuracy of dimension combination, and then improve the precision of data statistic analysis.
For achieving the above object, the invention provides following technical scheme: a kind of data processing method, comprising:
Obtain the property parameters that data to be analyzed have;
The property parameters having according to described data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and everybody of described binary number represents respectively a property parameters of described data to be analyzed;
Generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, chooses and include the binary number that predetermined number position is the first appointment numerical value, and wherein, described the first appointment numerical value is 0 or 1;
The described binary number selecting for each, forms property parameters combination by the every represented property parameters that in described binary number is described the first appointment numerical value;
Described property parameters based on obtaining combines, and described data to be analyzed is carried out to the statistics of a described predetermined number dimension.
Preferably, described generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, chooses and include the binary number that predetermined number position is the first appointment numerical value, comprising:
A: generate and there is described figure place, and every initial binary number that is described the second appointment numerical value, using described initial binary number as the first binary number, wherein, the second appointment numerical value is 0 or 1;
B: according to preset rules and described the first binary number, generate the second binary number, the absolute value of described the second binary number and described first binary difference is 1;
C: be the first appointment numerical value if there is the numerical value on predetermined number position in described the second binary number, select described the second binary number;
C: judge described the second binary number everybody whether for being the 3rd, specify numerical value, if so, carry out the operation that generates described property parameters combination; If not, using current described the second binary number as described the first binary number, and return to described step B;
Wherein, described the 3rd appointment numerical value is 0 or 1, and the described the 3rd specifies numerical value to be different from described the second appointment numerical value.
Preferably, when described second specifies numerical value to be 0, described generation has described figure place, and every initial binary number that is described the second appointment numerical value, comprising:
Generation has described figure place, and every is 0 initial binary number
Describedly according to preset rules and described the first binary number, generate the second binary number, the absolute value of described the second binary number and described first binary difference is 1, comprising:
The lowest order of described the first binary number is added to one, obtain the second binary number.
Preferably, when described second specifies numerical value to be 1, described generation has described figure place, and every initial binary number that is described the second appointment numerical value, comprising:
Generation has described figure place, and every is 1 initial scale-of-two;
Describedly according to preset rules and described the first binary number, generate the second binary number, the absolute value of described the second binary number and described first binary difference is 1, comprising:
The lowest order of described the first binary number is subtracted to one, obtain the second binary number.
Preferably, the described described binary number selecting for each, forms property parameters combination by the every represented property parameters that in described binary number is described the first appointment numerical value, comprising:
The described binary number selecting for each, according to the rule of the true value in the corresponding boolean's array of the first appointment numerical value, every value of described binary number is converted to the element value in boolean's array successively, so that the respectively corresponding described property parameters of each element value in described boolean's array;
Extract property parameters corresponding to true value in described boolean's array, the property parameters extracting is formed to property parameters combination.
On the other hand, the present invention also provides a kind of data processing equipment, comprising:
Acquiring unit, the property parameters having for obtaining data to be analyzed;
Be related to determining unit, for the property parameters having according to described data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and everybody of described binary number represents respectively a property parameters of described data to be analyzed;
Binary number processing unit, be used for according to the described figure place that is related to that determining unit is determined, generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, choose and include the binary number that predetermined number position is the first appointment numerical value, wherein, described the first appointment numerical value is 0 or 1;
Property parameters assembled unit, for for described binary number processing unit, select each described in binary number, by described binary number being, described first specify every the represented property parameters of numerical value to form property parameters to combine;
Computing unit, for the described property parameters combination obtaining based on described property parameters assembled unit, carries out the statistics of a described predetermined number dimension to described data to be analyzed.
Preferably, described binary number processing unit, comprising:
Initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and has described figure place, and every initial binary number that is described the second appointment numerical value, using described initial binary number as the first binary number, wherein, the second appointment numerical value is 0 or 1;
Mediant generation unit, for according to preset rules and described the first binary number, generates the second binary number, and the absolute value of described the second binary number and described first binary difference is 1;
Binary number is chosen unit, if having the numerical value on predetermined number position for described the second binary number that in the middle of described, number generation unit processed generates, is described the first appointment numerical value, selects described the second binary number;
Whether judging unit, specify numerical value for being the 3rd for everybody who judges described the second binary number that described mediant generation unit generates, if so, triggers and carry out described property parameters assembled unit; If not, using current described the second binary number as described the first binary number, and return and carry out described middle binary number generation unit; Wherein, described the 3rd appointment numerical value is 0 or 1, and the described the 3rd specifies numerical value to be different from described the second appointment numerical value.
Preferably, when described second specifies numerical value to be 0, described initial number generation unit, comprising:
The first initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and have described figure place, and every is 0 initial binary number
Described mediant generation unit, comprising:
The first mediant generation unit, for the lowest order of described the first binary number is added to one, obtains the second binary number.
Preferably, when described second specifies numerical value to be 1, described initial number generation unit, comprising:
The first initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and have described figure place, and every is 1 initial scale-of-two;
Described mediant generation unit, comprising:
The first mediant generation unit, for the lowest order of described the first binary number is subtracted to one, obtains the second binary number.
Preferably, described property parameters assembled unit, comprising:
Boolean's array converting unit, for the described binary number selecting for each, according to the rule of the true value in the corresponding boolean's array of the first appointment numerical value, every value of described binary number is converted to the element value in boolean's array successively, so that the respectively corresponding described property parameters of each element value in described boolean's array;
Parameter combinations subelement, for extracting property parameters corresponding to described boolean's array true value, forms property parameters combination by the property parameters extracting.
Known via above-mentioned technical scheme, the quantity of the property parameters that the present invention has according to these data to be analyzed, generated the binary number with the identical figure place of quantity of this property parameters, everybody represents respectively a property parameters of these data to be analyzed the binary number generating, because binary number forms by 0 and 1, can set like this 0 or 1 is the first appointment numerical value, and think in binary number to be that the corresponding property parameters in position of the first appointment numerical value participates in statistical computation, like this, in fact the binary number generating has comprised that property parameters in these data to be analyzed carries out each array configuration of combination in any.When the number of dimensions of statistical study is predetermined number, from the binary number generating, select after the binary number that predetermined number position is the first appointment numerical value, for each binary number selecting, the every represented property parameters that in this binary number is this first appointment numerical value is formed to property parameters combination, just can be met all property parameters combinations of this number of dimensions, avoided omitting the combination of the property parameters that meets this number of dimensions, and then improved the precision of data to be analyzed being added up based on this number of dimensions.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, to the accompanying drawing of required use in embodiment or description of the Prior Art be briefly described below, apparently, accompanying drawing in the following describes is only embodiments of the invention, for those of ordinary skills, do not paying under the prerequisite of creative work, other accompanying drawing can also be provided according to the accompanying drawing providing.
Fig. 1 shows the schematic flow sheet of an embodiment of a kind of data processing method of the present invention;
Fig. 2 shows the schematic flow sheet of a kind of another embodiment of data processing method of the present invention;
Fig. 3 shows the structural representation of an embodiment of a kind of data processing equipment of the present invention;
Fig. 4 shows a kind of composition structural representation of binary number processing unit of a kind of data processing equipment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the present invention's part embodiment, rather than whole embodiment.Embodiment based in the present invention, those of ordinary skills, not making the every other embodiment obtaining under creative work prerequisite, belong to the scope of protection of the invention.
Referring to Fig. 1, show the schematic flow sheet of an embodiment of a kind of data processing method of the present invention, method of the present invention can be applied to arbitrarily in computing node, and the method for the present embodiment can comprise:
S101, obtains the property parameters that data to be analyzed have.
Wherein, these data to be analyzed can be for carrying out many data of statistical study, as merchandise sales record, the network bandwidth are used record etc.
The property parameters of these data to be analyzed is parameters of describing object, classification or the feature of this data representative to be analyzed.If the property parameters of these data to be analyzed can be the rise time of pieces of data in the name of the information as indicated in these data to be analyzed, data to be analyzed etc.While being communication usage data as data to be analyzed, property parameters can comprise the corresponding telephone number of each call record, operator, type of call in data to be analyzed, the parameters such as the duration of call.
Be understandable that, in data statistics field, the property parameters of data to be analyzed also can be called the dimension of data to be analyzed, and a property parameters of data to be analyzed is a dimension of these data to be analyzed.When data to be analyzed are added up, can choose as required one or more dimensions of these data to be analyzed and carry out statistical study.For example, can to this communication usage data, carry out statistical study based on telephone number and two dimensions of operator.
S102, the property parameters having according to these data to be analyzed, determines the figure place of binary number to be generated.
Wherein, the figure place of this binary number is identical with the number of the property parameters that these data to be analyzed have, and everybody of binary number to be generated represents respectively a property parameters of these data to be analyzed.
Need the figure place of the binary number of generation to be determined by the quantity of the property parameters of these data to be analyzed, and every of this binary number need generating all to a property parameters that should data to be analyzed, and the not coordination of different property parameters to binary number that should be to be generated.That is to say, set up the corresponding relation between the property parameters that has these binary every and these data to be analyzed that need to generate.
As, when the property parameters that these data to be analyzed have is 3, can determine that binary number to be generated is 3 bits, and a property parameters in these 3 property parameters of the every bit representation of this binary number, and the property parameters that the not coordination of binary number to be generated characterizes is different.When the property parameters of supposing these data to be analyzed is respectively telephone number, operator, type of call, can be in the binary number that needs to generate from high to low first to should telephone number, the corresponding operator of second, the 3rd corresponding type of call.
S103, generates all binary numbers with this figure place, and from a plurality of binary numbers that generate, chooses and include the binary number that predetermined number position is the first appointment numerical value.
According to determining in step 102 after the figure place of the binary number that needs generation, can generate all possible binary number with corresponding figure place.As the figure place of determining is 3, need to generate all triad numbers, the binary number generating comprises 000,001,010,011,100,101,110 and 111.
Owing to having predetermined the every corresponding property parameters of binary number to be generated in step 102, therefore, for any one binary number generating, every of each binary number all to a property parameters that should data to be analyzed.
Wherein, this predetermined number can be set as required, generally so that data to be analyzed are carried out to needed number of dimensions, determines.As, when carrying out data analysis, need to carry out the analysis of 3 dimensions, this predetermined number can be set as 3.Wherein, dimension has been described a data object has been analyzed to required number of parameters.As needs carry out three dimensionality analysis, need to determine all property parameters that three property parameters by these data to be analyzed form and combine.
Because binary number is by 0 or 1 digital composition, in this binary number, everybody is only 0 or 1, and therefore, this first appointment numerical value is 0 or 1.This first appointment numerical value can be set as a numerical value in 0 and 1, specifically can set as required.
In the embodiment of the present application, this first appointment numerical value representative participates in statistical computation, therefore, if the numerical value of certain is this first appointment numerical value in binary number, represents that this corresponding property parameters participates in statistical computation.Accordingly, the numerical value in certain in scale-of-two is not that this first specifies numerical value, and this corresponding property parameters does not participate in the statistical computation of data to be analyzed.
If while being the first appointment numerical value due to the numerical value on certain of binary number, this the corresponding property parameters that represents this binary number participates in statistical computation, therefore, determine these data to be analyzed are carried out to the required number of dimensions of statistical study, after determining and needing how much quantity property parameters to combine, for which can exist meet the combination of the property parameters of this number of dimensions, can, from the binary number generating, choose and include the binary number that predetermined number position is the first appointment numerical value.
In the binary number selecting, having the numerical value on predetermined number position is the first appointment numerical value, and for the binary number selecting for any one, in this binary number, being the property parameters combination that the every corresponding property parameters of the first appointment numerical value is grouped together and obtains, is a kind of combination that meets the property parameters of this number of dimensions of determining.
For example, with data to be analyzed, having 3 property parameters is introduced, the binary number generating comprises 000,001,010,011,100,101,110 and 111, the first appointment numerical value of take is 1, the analysis that need to carry out 2 dimensions is example, need to select the binary number that the numerical value on two is 1, the binary number selecting comprises 011,101 and 110.
S104, the binary number selecting for each, forms property parameters combination by the every represented property parameters that in this binary number is this first appointment numerical value.
For according to the binary number that selects, determine the combination of the property parameters that can participate in calculating that this binary number is corresponding, need to determine respectively the every represented property parameters that this binary number is the first appointment numerical value, then the property parameters of determining is combined, obtain property parameters combination.Owing to selecting a plurality of binary numbers, each binary number is corresponding property parameters combination all, can obtain a plurality of property parameters combinations.
For example, still with in the binary number of introducing above from high to low first to should telephone number, the corresponding operator of second, the 3rd corresponding type of call, needing to generate 3 bits is example, still suppose data to be analyzed to carry out the analysis of 2 dimensions, and the first appointment numerical value is 1, from generating binary number, the binary number selecting is 011, 101 and 110, and in binary number 011, be 1 everybody be respectively second and the 3rd, the property parameters that second is corresponding is operator, the 3rd corresponding property parameters is type of call, therefore the property parameters of this binary number 011 correspondence is combined as the combination of operator and type of call, based on these two dimensions, these data to be analyzed are carried out to statistical study.Accordingly, the combination of the telephone number of binary number 101 correspondences and these two property parameters of type of call, the combination of the corresponding telephone number of binary number 110 and these two property parameters of operator.
S105, the property parameters based on obtaining combines, and data to be analyzed is carried out to the statistics of predetermined number dimension.
Obtain after all property parameters combinations, can based on each property parameters combination, carry out the statistical study of respective dimensions number respectively.If predetermined number is 2, and certain property parameters is combined as the combination that comprises telephone number and operator, can to data to be analyzed, carry out the statistics of two-dimensions based on these two property parameters.
Wherein, the combination of the property parameters based on obtaining, carries out the statistics of respective dimensions to data to be analyzed, similar to existing mode, does not repeat them here.
Be understandable that, in actual applications, can need based on multiple different dimension, to carry out data analysis respectively, therefore, this predetermined number can be set a plurality of.For example, while needing the analysis of 2 dimensions and 3 dimensions, predetermined number can be 2 and 3.But for each predetermined number, choosing binary number, and the anabolic process of definite property parameters is all identical.
In the present embodiment, the quantity of the property parameters having according to these data to be analyzed, generated the binary number with the identical figure place of quantity of this property parameters, everybody represents respectively a property parameters of these data to be analyzed the binary number generating, because binary number forms by 0 and 1, can set like this 0 or 1 is the first appointment numerical value, and think in binary number to be that the corresponding property parameters in position of the first appointment numerical value participates in statistical computation, like this, in fact the binary number generating has comprised that property parameters in these data to be analyzed carries out each array configuration of combination in any.When the number of dimensions of carrying out when needs is predetermined number, from the binary number generating, select after the binary number that predetermined number position is the first appointment numerical value, for each binary number selecting, the every represented property parameters that in this binary number is this first appointment numerical value is formed to property parameters combination, just can be met all property parameters combinations of this number of dimensions, also avoided omitting the combination of the property parameters that meets this number of dimensions, and then improved the precision of data to be analyzed being carried out to statistical computation.
Meanwhile, the method meets this number of dimensions property parameters combination for utilizing computing machine to utilize binary mode to determine provides possibility, and then can improve the convenience of data to be analyzed being added up based on this number of dimensions.
Referring to Fig. 2, show the schematic flow sheet of a kind of another embodiment of data processing method of the present invention, the method for the present embodiment can comprise:
S201, obtains the property parameters that data to be analyzed have.
S202, the property parameters having according to these data to be analyzed, determines the figure place of binary number to be generated.
Wherein, the figure place of this binary number is identical with the number of the property parameters that these data to be analyzed have, to be generated binary everybody represent respectively a property parameters of these data to be analyzed.
S203, generates and has figure place, and every initial binary number that is the second appointment numerical value, using this initial binary number as the first binary number.
Wherein, the second appointment numerical value is 0 or 1, is only a definite numerical value in 0 and 1.
When this second appointment numerical value is different, the initial binary number of generation is also different.As this, second to specify numerical value be 1 o'clock, and everybody of this initial binary number is 1; As this, second to specify numerical value be 0 o'clock, and everybody of this initial binary number is 0.
For example, these data to be analyzed have 3 property parameters, and this second to specify numerical value be 0 o'clock, the initial binary number generating is 000.
S204, according to preset rules and the first binary number, generates the second binary number.
Wherein, the second binary number of generation and the absolute value of this first binary difference are 1.
Wherein, according to this second appointment numerical value, be 0 or 1, the mode that generates this second binary number is also different, but all needs to guarantee that this second binary number is different from this first binary number, and this second binary number that current time generates is the binary number not generated before.
As, when this second appointment numerical value is 0, the mode that generates this second binary number is: the lowest order of this first binary number is added to one, obtain the second binary number.Wherein, for example, take initial binary number as 000 o'clock, if current time 000 is the first binary number, the second binary number is 001.
And for example, when this second appointment numerical value is 1, the mode that generates this second binary number is: the lowest order of this second binary number is subtracted to one, obtain the second binary number.For example, when this first binary number is initial binary number 111, the second binary number generating is 110.
S205, is this first appointment numerical value if there is the numerical value on predetermined number position in this second binary number, selects this second binary number.
Second binary number of every generation, all need to judge that whether this second binary number has the numerical value on predetermined number position is the first appointment numerical value, if having the numerical value on predetermined number position in this second binary number is the first appointment numerical value, preserve this second binary number; If the numerical value in this second binary number on the position of Non-precondition quantity is the first appointment numerical value, directly carry out step 206.
Certainly, if need to carry out statistical study based on multiple dimension, can set a plurality of predetermined numbers, therefore, if the position that this second binary number is the first numerical value quantity reaches any one predetermined number, all select the second current binary number.
Wherein, this first appointment numerical value is 0 or 1.
S206, judge this second binary number everybody whether for being the 3rd, specify numerical value, if so, perform step 208; If not, perform step 207.
S207, using current described the second binary number as described the first binary number, and returns to described step 204.
Wherein, the 3rd appointment numerical value is a numerical value in 0 and 1, and the 3rd specifies numerical value to be different from the second appointment numerical value.
Everybody in this second binary number is the 3rd while specifying numerical value, explanation has generated the binary number of whole these figure places, if also exist the binary number of this figure place not to be generated, using this second binary number as the first binary number, return to this step 204, continue to generate next the second binary number, until there is the binary number of this figure place, be all generated.
In the present embodiment, take one as step value, progressively generate the second binary number, as, the mode of employing increasing or decreasing, increases or reduces the numerical value of binary number, thereby can obtain the binary number of all these figure places, has avoided binary number omit or repeat.
S208, the binary number selecting for each, forms property parameters combination by the every represented property parameters that in this binary number is this first appointment numerical value.
S209, the property parameters based on obtaining combines, and data to be analyzed is carried out to the statistics of predetermined number dimension.
Wherein, it is similar that the operation of this step 207 and step 208 and embodiment above relevant introduced, and do not repeat them here.
Be understandable that, the operation of this step 205 and step 206 is not limited to shown in Fig. 2, and this step 205 and step 206 also can be carried out simultaneously.
In the present embodiment, this first appointment numerical value is similarly a numerical value in 0 and 1.When any one corresponding property parameters of this first designation number value representation binary number participates in statistical computation, the numerical value in corresponding positions.As, if the first appointment numerical value is, if binary number is 000, represent that three corresponding property parameters of this binary number all do not participate in statistical study at 1 o'clock, the property parameters combination that is combined as 0 dimension of the property parameters that this binary number is corresponding; If binary number is 011, represent that rear two corresponding property parameters of this binary number can participate in statistical study, and rear two corresponding property parameters of this binary number are combined into the property parameters combination of 2 dimensions.
This second appointment numerical value is used for limiting initial binary number, and the mode that generates subsequent binary number based on initial binary number.Therefore, the first appointment numerical value and second specifies the meaning of numerical value different, when setting the first appointment numerical value and the second appointment numerical value, can set this first appointment numerical value and specify numerical value identical with second, as be all 0; Also can set this first appointment numerical value and specify numerical value different with second, if this second appointment numerical value is 0, and this second appointment numerical value is 1.
For the ease of understanding the scheme of the present embodiment, the data to be analyzed of take are below recorded as example as merchandise sales, suppose that the property parameters that merchandise sales records comprises has trade name, manufacturer and selling time, and take the analysis that need to carry out 2 dimensions and be described in detail as example.Supposing first, to specify numerical value be 1 to be introduced, and that is to say that the numerical value of binary number on certain is 1 and represents that this corresponding property parameters participates in statistical study.Setting the second appointment numerical value is that 0, the three appointment numerical value is 1.Generating this initial binary number is 000, and using this, 000 as the first binary number execution step 204, this lowest order by 000 adds one, obtains the second binary number 001.This has the numerical value on 1 in 001 is 1, preserve out this 001.Simultaneously, judge this 001 everybody be not to be all 1, this is 001 as the first binary number, return and carry out this step 204,001 lowest order is added to 1, obtain the second binary number 010, the like, until three of the second binary number 111,111 generating are 1, carry out subsequent step 207.
Finally, the binary number selecting has 011,101 and 110, wherein, and two dimension combinations of 011 corresponding manufacturer and selling time, two dimension combinations of 101 corresponding goods titles and selling time, the combination of two dimensions of 110 corresponding goods titles and manufacturer.Like this, based on these three two dimension combinations, can carry out respectively corresponding statistical study.
It is convenient to be only used in embodiments of the present invention describe, the property parameters that the data to be analyzed of take have is described as example as 3 or 4, but be understandable that, the property parameters that data to be analyzed have in actual applications can have a lot, when the property parameters that has when data to be analyzed is larger, apply method of the present invention and more embody it and avoid omitting property parameters combination, and then improve the advantages such as precision of statistical study.
Further, in above any one embodiment, in the binary number selecting for the ease of judgement, which corresponding property parameters participates in statistical study, after selecting binary number, the binary number selecting for each, rule according to the true value in the corresponding boolean's array of the first appointment numerical value, is converted to the element value in boolean's array successively by every value of this binary number, so that the respectively corresponding property parameters of each element value in boolean's array.
That is to say, in this binary number, be that the position of the first appointment numerical value value in corresponding boolean's numerical value is true value, and for this, first to specify the value in boolean's numerical value corresponding to the position of numerical value be false in this binary number, wherein, everybody corresponding property parameters in this binary number, corresponds to each element of this boolean's array successively.For example, the first appointment numerical value is 1, binary number is 101 o'clock, to be converted to first element value in boolean's array be ture to the most significant digit of this binary number, second element value that the second of this binary number is converted in boolean's array is false, the 3rd element value that the value of the lowest order of this binary number is converted in boolean's numerical value is ture, therefore, boolean's array that this binary number is changed out is { ture, false, ture}, wherein, the property parameters that in this boolean's array, first element is corresponding is property parameters corresponding to most significant digit in this binary number 101, second property parameters corresponding to second that property parameters corresponding to element is this binary number 101 in this boolean's array, in this boolean's array, the 3rd property parameters that element is corresponding goes out 101 property parameters corresponding to lowest order for this binary number.
Obtain after boolean's array, extract property parameters corresponding to true value ture in boolean's numerical value, the property parameters extracting is formed to property parameters combination.For example, boolean's array is that { ture, false, during ture}, extract respectively first element and the 3rd property parameters corresponding to element in this boolean's array, and two property parameters that extract are combined into property parameters combination.
Corresponding data processing method of the present invention, the present invention also provides a kind of data processing equipment, referring to Fig. 3, the structural representation that shows an embodiment of a kind of data processing equipment of the present invention, the device of the present embodiment can comprise: acquiring unit 301, be related to determining unit 302, binary number processing unit 303, property parameters assembled unit 304 and computing unit 305.
Wherein, acquiring unit 301, the property parameters having for obtaining data to be analyzed;
Be related to determining unit 302, for the property parameters having according to described data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and everybody of described binary number represents respectively a property parameters of described data to be analyzed;
Binary number processing unit 303, be used for according to the described figure place that is related to that determining unit is determined, generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, choose and include the binary number that predetermined number position is the first appointment numerical value, wherein, described the first appointment numerical value is 0 or 1;
Property parameters assembled unit 304, for for described binary number processing unit, select each described in binary number, by described binary number being, described first specify every the represented property parameters of numerical value to form property parameters to combine;
Computing unit 305, for the described property parameters combination obtaining based on described property parameters assembled unit, carries out the statistics of described predetermined number dimension to described data to be analyzed.
In the present embodiment, the quantity that is related to the property parameters that determining unit has according to data to be analyzed, determine the figure place of binary number to be generated, and everybody represents respectively a property parameters of these data to be analyzed binary number to be generated, because binary number forms by 0 and 1, can set like this 0 or 1 is the first appointment numerical value, and think in binary number to be that the corresponding property parameters in position of the first appointment numerical value participates in statistical computation, like this, this binary number processing unit is related to according to this figure place that determining unit is determined, in fact all binary numbers that generate corresponding figure place have comprised that property parameters in these data to be analyzed carries out each array configuration of combination in any.
And this binary number processing unit selects after the binary number that predetermined number position is the first appointment numerical value from the binary number generating, for each binary number selecting, the every represented property parameters that in this binary number is this first appointment numerical value is formed to property parameters combination, just can be met all property parameters combinations with this predetermined number identical dimensional number, avoided omitting the combination of the property parameters that meets this number of dimensions, and then improved and based on this number of dimensions, data to be analyzed have been carried out the precision of statistical computation.
Wherein, the mode that binary number processing unit generates the binary number with this figure place according to the described figure place that is related to that determining unit is determined can have multiple.Referring to Fig. 4, show the structural representation of a kind of implementation of binary number processing unit in a kind of data processing equipment of the present invention, in the present embodiment, this binary number processing unit 303, comprising:
Initial number generation unit 3031, for according to the described figure place that is related to that determining unit is determined, generates and has described figure place, and every initial binary number that is described the second appointment numerical value, using described initial binary number as the first binary number, wherein, the second appointment numerical value is 0 or 1;
Mediant generation unit 3032, for according to preset rules and described the first binary number, generates the second binary number, and the absolute value of described the second binary number and described first binary difference is 1;
Binary number is chosen unit 3033, if described the second binary number generating for described mediant generation unit has the numerical value on predetermined number position, is described the first appointment numerical value, selects described the second binary number;
Whether judging unit 3034, specify numerical value for being the 3rd for everybody who judges described the second binary number that described mediant generation unit generates, if so, triggers and carry out described property parameters assembled unit; If not, using current described the second binary number as described the first binary number, and return and carry out described mediant generation unit; Wherein, described the 3rd appointment numerical value is 0 or 1, and the described the 3rd specifies numerical value to be different from described the second appointment numerical value.
Wherein, this first appointment numerical value can be identical with the second appointment numerical value, can be also to specify numerical value identical with the 3rd.
Wherein, this judging unit 3034 can be chosen after unit determines and whether choose this second binary number at this binary number, then carries out everybody that judge this this second binary number and whether be the operation of the 3rd appointment numerical value.This judging unit can be also when this binary number is chosen unit this second binary number is carried out to selection operation, carries out corresponding decision operation.
When this second appointment numerical value is 0, this initial number generation unit, can comprise:
The first initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and have described figure place, and every is 0 initial binary number
Accordingly, this mediant generation unit, can comprise:
The first mediant generation unit, for the lowest order of described the first binary number is added to one, obtains the second binary number.
When described second specifies numerical value to be 1, described initial number generation unit, can comprise:
The first initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and have described figure place, and every is 1 initial scale-of-two;
Described mediant generation unit, comprising:
The first mediant generation unit, for the lowest order of described the first binary number is subtracted to one, obtains the second binary number.
Further, in above any one embodiment, this property parameters assembled unit, can comprise:
Boolean's array converting unit, for the described binary number selecting for each, according to the rule of the true value in the corresponding boolean's array of the first appointment numerical value, every value of described binary number is converted to the element value in boolean's array successively, so that the respectively corresponding described property parameters of each element value in described boolean's array;
Parameter combinations subelement, for extracting property parameters corresponding to described boolean's array true value, forms property parameters combination by the property parameters extracting.
In this instructions, each embodiment adopts the mode of going forward one by one to describe, and each embodiment stresses is the difference with other embodiment, between each embodiment identical similar part mutually referring to.For the disclosed device of embodiment, because it corresponds to the method disclosed in Example, so description is fairly simple, relevant part partly illustrates referring to method.
Above-mentioned explanation to the disclosed embodiments, makes professional and technical personnel in the field can realize or use the present invention.To the multiple modification of these embodiment, will be apparent for those skilled in the art, General Principle as defined herein can, in the situation that not departing from the spirit or scope of the present invention, realize in other embodiments.Therefore, the present invention will can not be restricted to these embodiment shown in this article, but will meet the widest scope consistent with principle disclosed herein and features of novelty.

Claims (10)

1. a data processing method, is characterized in that, comprising:
Obtain the property parameters that data to be analyzed have;
The property parameters having according to described data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and everybody of described binary number represents respectively a property parameters of described data to be analyzed;
Generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, chooses and include the binary number that predetermined number position is the first appointment numerical value, and wherein, described the first appointment numerical value is 0 or 1;
The described binary number selecting for each, forms property parameters combination by the every represented property parameters that in described binary number is described the first appointment numerical value;
Described property parameters based on obtaining combines, and described data to be analyzed is carried out to the statistics of a described predetermined number dimension.
2. method according to claim 1, is characterized in that, described generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, chooses and include the binary number that predetermined number position is the first appointment numerical value, comprising:
A: generate and there is described figure place, and every initial binary number that is described the second appointment numerical value, using described initial binary number as the first binary number, wherein, the second appointment numerical value is 0 or 1;
B: according to preset rules and described the first binary number, generate the second binary number, the absolute value of described the second binary number and described first binary difference is 1;
C: be the first appointment numerical value if there is the numerical value on predetermined number position in described the second binary number, select described the second binary number;
C: judge described the second binary number everybody whether for being the 3rd, specify numerical value, if so, carry out the operation that generates described property parameters combination; If not, using current described the second binary number as described the first binary number, and return to described step B;
Wherein, described the 3rd appointment numerical value is 0 or 1, and the described the 3rd specifies numerical value to be different from described the second appointment numerical value.
3. method according to claim 2, is characterized in that, when described second specifies numerical value to be 0, described generation has described figure place, and every initial binary number that is described the second appointment numerical value, comprising:
Generation has described figure place, and every is 0 initial binary number
Describedly according to preset rules and described the first binary number, generate the second binary number, the absolute value of described the second binary number and described first binary difference is 1, comprising:
The lowest order of described the first binary number is added to one, obtain the second binary number.
4. method according to claim 2, is characterized in that, when described second specifies numerical value to be 1, described generation has described figure place, and every initial binary number that is described the second appointment numerical value, comprising:
Generation has described figure place, and every is 1 initial scale-of-two;
Describedly according to preset rules and described the first binary number, generate the second binary number, the absolute value of described the second binary number and described first binary difference is 1, comprising:
The lowest order of described the first binary number is subtracted to one, obtain the second binary number.
5. according to the method described in claim 1 to 4 any one, it is characterized in that, the described described binary number selecting for each, forms property parameters combination by the every represented property parameters that in described binary number is described the first appointment numerical value, comprising:
The described binary number selecting for each, according to the rule of the true value in the corresponding boolean's array of the first appointment numerical value, every value of described binary number is converted to the element value in boolean's array successively, so that the respectively corresponding described property parameters of each element value in described boolean's array;
Extract property parameters corresponding to true value in described boolean's array, the property parameters extracting is formed to property parameters combination.
6. a data processing equipment, is characterized in that, comprising:
Acquiring unit, the property parameters having for obtaining data to be analyzed;
Be related to determining unit, for the property parameters having according to described data to be analyzed, determine the figure place of binary number to be generated, wherein, the figure place of described binary number is identical with the number of the property parameters that described data to be analyzed have, and everybody of described binary number represents respectively a property parameters of described data to be analyzed;
Binary number processing unit, be used for according to the described figure place that is related to that determining unit is determined, generation has all described binary number of described figure place, and from the described a plurality of binary numbers that generate, choose and include the binary number that predetermined number position is the first appointment numerical value, wherein, described the first appointment numerical value is 0 or 1;
Property parameters assembled unit, for for described binary number processing unit, select each described in binary number, by described binary number being, described first specify every the represented property parameters of numerical value to form property parameters to combine;
Computing unit, for the described property parameters combination obtaining based on described property parameters assembled unit, carries out the statistics of a described predetermined number dimension to described data to be analyzed.
7. device according to claim 6, is characterized in that, described binary number processing unit, comprising:
Initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and has described figure place, and every initial binary number that is described the second appointment numerical value, using described initial binary number as the first binary number, wherein, the second appointment numerical value is 0 or 1;
Mediant generation unit, for according to preset rules and described the first binary number, generates the second binary number, and the absolute value of described the second binary number and described first binary difference is 1;
Binary number is chosen unit, if having the numerical value on predetermined number position for described the second binary number that in the middle of described, number generation unit processed generates, is described the first appointment numerical value, selects described the second binary number;
Whether judging unit, specify numerical value for being the 3rd for everybody who judges described the second binary number that described mediant generation unit generates, if so, triggers and carry out described property parameters assembled unit; If not, using current described the second binary number as described the first binary number, and return and carry out described middle binary number generation unit; Wherein, described the 3rd appointment numerical value is 0 or 1, and the described the 3rd specifies numerical value to be different from described the second appointment numerical value.
8. device according to claim 7, is characterized in that, when described second specifies numerical value to be 0, described initial number generation unit, comprising:
The first initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and have described figure place, and every is 0 initial binary number
Described mediant generation unit, comprising:
The first mediant generation unit, for the lowest order of described the first binary number is added to one, obtains the second binary number.
9. device according to claim 7, is characterized in that, when described second specifies numerical value to be 1, described initial number generation unit, comprising:
The first initial number generation unit, for according to the described figure place that is related to that determining unit is determined, generates and have described figure place, and every is 1 initial scale-of-two;
Described mediant generation unit, comprising:
The first mediant generation unit, for the lowest order of described the first binary number is subtracted to one, obtains the second binary number.
10. according to the device described in claim 6 to 9 any one, it is characterized in that, described property parameters assembled unit, comprising:
Boolean's array converting unit, for the described binary number selecting for each, according to the rule of the true value in the corresponding boolean's array of the first appointment numerical value, every value of described binary number is converted to the element value in boolean's array successively, so that the respectively corresponding described property parameters of each element value in described boolean's array;
Parameter combinations subelement, for extracting property parameters corresponding to described boolean's array true value, forms property parameters combination by the property parameters extracting.
CN201310573974.4A 2013-11-15 2013-11-15 A kind of data processing method and device Active CN103559413B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310573974.4A CN103559413B (en) 2013-11-15 2013-11-15 A kind of data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310573974.4A CN103559413B (en) 2013-11-15 2013-11-15 A kind of data processing method and device

Publications (2)

Publication Number Publication Date
CN103559413A true CN103559413A (en) 2014-02-05
CN103559413B CN103559413B (en) 2016-11-02

Family

ID=50013659

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310573974.4A Active CN103559413B (en) 2013-11-15 2013-11-15 A kind of data processing method and device

Country Status (1)

Country Link
CN (1) CN103559413B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107153651A (en) * 2016-03-03 2017-09-12 阿里巴巴集团控股有限公司 A kind of multidimensional intersects data processing method and processing device
CN108461153A (en) * 2018-02-02 2018-08-28 上海市针灸经络研究所 Management method/system, computer readable storage medium and the equipment of test data
CN109840080A (en) * 2018-12-28 2019-06-04 东软集团股份有限公司 Character attibute comparative approach, device, storage medium and electronic equipment
CN114003593A (en) * 2021-11-02 2022-02-01 北京搜房科技发展有限公司 Method and device for clearing cache data, storage medium and electronic equipment
CN116957612A (en) * 2023-09-21 2023-10-27 江苏州际数码印花有限公司 Inspection system for textile shipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5301284A (en) * 1991-01-16 1994-04-05 Walker-Estes Corporation Mixed-resolution, N-dimensional object space method and apparatus
JP2000224585A (en) * 1999-02-01 2000-08-11 Ricoh Co Ltd Encoding and decoding device
US6804664B1 (en) * 2000-10-10 2004-10-12 Netzero, Inc. Encoded-data database for fast queries
US20040223580A1 (en) * 2003-04-25 2004-11-11 J. Barry Shackleford Ones counter employing two dimensional cellular array
CN101730892A (en) * 2007-01-24 2010-06-09 迈可菲公司 Web reputation scoring

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5301284A (en) * 1991-01-16 1994-04-05 Walker-Estes Corporation Mixed-resolution, N-dimensional object space method and apparatus
JP2000224585A (en) * 1999-02-01 2000-08-11 Ricoh Co Ltd Encoding and decoding device
US6804664B1 (en) * 2000-10-10 2004-10-12 Netzero, Inc. Encoded-data database for fast queries
US20040223580A1 (en) * 2003-04-25 2004-11-11 J. Barry Shackleford Ones counter employing two dimensional cellular array
CN101730892A (en) * 2007-01-24 2010-06-09 迈可菲公司 Web reputation scoring

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107153651A (en) * 2016-03-03 2017-09-12 阿里巴巴集团控股有限公司 A kind of multidimensional intersects data processing method and processing device
CN108461153A (en) * 2018-02-02 2018-08-28 上海市针灸经络研究所 Management method/system, computer readable storage medium and the equipment of test data
CN108461153B (en) * 2018-02-02 2022-03-15 上海市针灸经络研究所 Test data management method/system, computer readable storage medium and device
CN109840080A (en) * 2018-12-28 2019-06-04 东软集团股份有限公司 Character attibute comparative approach, device, storage medium and electronic equipment
CN114003593A (en) * 2021-11-02 2022-02-01 北京搜房科技发展有限公司 Method and device for clearing cache data, storage medium and electronic equipment
CN116957612A (en) * 2023-09-21 2023-10-27 江苏州际数码印花有限公司 Inspection system for textile shipment
CN116957612B (en) * 2023-09-21 2023-12-22 江苏州际数码印花有限公司 Inspection system for textile shipment

Also Published As

Publication number Publication date
CN103559413B (en) 2016-11-02

Similar Documents

Publication Publication Date Title
CN103559413A (en) Data processing method and device
CN107609217B (en) Processing method and device for collision check data
US20160173122A1 (en) System That Reconfigures Usage of a Storage Device and Method Thereof
JP4736713B2 (en) Systems and methods to support the selection of project members
CN111611236A (en) Data analysis method and system
JP6642090B2 (en) Quality control equipment and quality control program
CN106548035B (en) A kind of diagnostic method and device of data exception
CN107909216B (en) Method for predicting actual production cycle of part
CN107634901A (en) Session expression pushing method and device and terminal equipment
US20070142948A1 (en) Manufacturing analysis using a part-process matrix
JP2020052663A (en) Process design support device and process design support method
CN112148942A (en) Business index data classification method and device based on data clustering
US7689952B2 (en) System and method for determining and visualizing tradeoffs between yield and performance in electrical circuit designs
CN104317913B (en) The screening technique of combinations of attributes and the screening plant of combinations of attributes
Arnuphaptrairong Early Stage Software Effort Estimation Using
CN116263911A (en) Material definition method, medium, equipment and system
CN104462139A (en) User behavior clustering method and system
CN114756731A (en) Advertisement channel data processing method and device, storage medium and electronic equipment
CN104796478A (en) Resource recommending method and device
JP6034646B2 (en) Market price calculation program, market price calculation computer, market price calculation device
JP2007109029A (en) Questionnaire/hearing result factor analysis question item selection device
EP4369264A1 (en) Adjustable event logs
JP2024009227A (en) Database generation apparatus, database generation method, and database generation program
Mencaroni et al. A quantitative framework to support the decision between traditional, selective, and hybrid assembly
KR20160139307A (en) Method and apparatus for setting relationship line with reverse data modeling

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant