CN109145059A - Data processing method for data statistics, server, and storage medium - Google Patents


Info

Publication number
CN109145059A
Authority
CN
China
Prior art keywords
index
code
dimension
attribute
priority
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810711224.1A
Other languages
Chinese (zh)
Inventor
陈炳贵
邬向春
王国彬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Bincent Technology Co Ltd
Original Assignee
Shenzhen Bincent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Bincent Technology Co Ltd
Priority to CN201810711224.1A
Publication of CN109145059A


Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a data processing method for data statistics, a server, and a storage medium. The method includes: extracting user behavior attributes from an event log and defining corresponding dimensional attributes according to the user behavior attributes; setting dimension codes according to the dimensional attributes and forming a dimensional attribute table; presetting index classifications, setting index classification codes, and forming an index classification table; combining the dimension codes with the index classification codes to form index dimensional relationships and forming an index dimensional relationship table from them; and rejecting invalid index dimension result tables and storing the data generated from the valid index dimension result tables. In the invention, the user sets codes for the index classifications and dimensional attributes, and the codes are combined according to certain rules to form the index dimensional relationships. This reduces the difficulty of defining the relationships between index classifications and dimensional attributes directly, improves the efficiency of defining index dimensional relationships, and reduces storage pressure.

Description

Data processing method for data statistics, server, and storage medium
Technical field
The present invention relates to the field of data processing, and in particular to a data processing method for data statistics, a server, and a storage medium.
Background
With the development of mobile networks, traditional performance statistics objects can no longer meet enterprise users' requirements for refined operations. User behavior analysis has emerged in response and has become both a focus of enterprise users and a basis for improving profitability. User behavior analysis can compile statistics on users and media message content from event logs. The content of these event logs and media messages far exceeds that of traditional performance statistics objects, so statistics and analysis based on event logs and media messages allow in-depth analysis of a series of indexes such as system performance and user behavior, yielding more valuable information.
In user behavior analysis applications, enterprise users need to analyze user behavior across multiple dimensions or combinations of dimensions, and across multiple indexes.
In the prior art, dimensional attributes and index classifications are combined directly to form index dimensional relationships. The descriptive data formed for the dimensions and the classification data formed for the index classifications are very large, which both slows down statistics and lowers analysis efficiency.
Summary of the invention
The purpose of the present invention is to address the above defects of the prior art by providing a data processing method for data statistics, a server, and a storage medium.
The technical solution adopted by the present invention is first to provide a data processing method for data statistics, the method including:
Extracting user behavior attributes from an event log, and defining corresponding dimensional attributes according to the user behavior attributes;
Setting dimension codes according to the dimensional attributes, and forming a dimensional attribute table;
Presetting index classifications, setting index classification codes, and forming an index classification table;
Combining the dimension codes with the index classification codes to form index dimensional relationships, and forming an index dimensional relationship table according to the index dimensional relationships;
Forming index dimension result tables according to the index dimensional relationship table;
Rejecting invalid index dimension result tables, and storing the data generated from the valid index dimension result tables.
Preferably, extracting user behavior attributes from the event log and defining corresponding dimensional attributes according to the user behavior attributes includes:
Extracting from the event log the user's behavior attributes within a certain time period, and configuring a priority for each behavior attribute according to the number of times it occurred in that period;
Defining each user behavior attribute as a dimensional attribute of the corresponding priority according to the priority of the user behavior attribute, and at the same time defining the dimensional attributes with the lowest priorities as invalid dimensional attributes.
Extracting the user's behavior attributes within a certain time period from the event log makes the dimensional attributes defined from those attributes more accurate, and configuring priorities for the user's behavior attributes means that the dimensional attributes are also assigned priorities.
Preferably, setting dimension codes according to the dimensional attributes and forming a dimensional attribute table includes:
Setting, for each dimensional attribute, a dimension code of the corresponding priority, and forming a dimensional attribute table of the corresponding priority;
The dimensional attribute table includes the invalid dimensional attributes.
Because the dimension codes are set according to the priorities of the dimensional attributes, the dimension codes also carry priorities, which makes the resulting dimensional attribute table more orderly.
Preferably, the dimensional attribute table includes a granularity sub-table, which is set according to the priority of the granularity. The granularity sub-table describes the distribution of an index under the dimension.
Preferably, presetting index classifications, setting index classification codes, and forming an index classification table includes:
Classifying indexes by the index data types in the data warehouse, and setting corresponding index classification codes;
Forming an index classification table according to the index classification codes, and building a lookup directory for the index classification codes. Setting index classification codes makes it easy to pair them with the dimension codes, and building the lookup directory makes it easy to look up and extract the classification codes.
Preferably, classifying indexes by the index data types in the data warehouse and setting corresponding index classification codes includes:
Setting priorities according to how frequently each index data type is extracted, and setting the corresponding code for each index classification according to priority. Setting priorities by extraction frequency matches the user's usage habits.
Preferably, combining the dimension codes with the index classification codes to form index dimensional relationships includes:
Performing a traversal combination of the dimension codes and the index classification codes in priority order to form index dimension codes;
Determining the index dimensional relationships according to the index dimension codes, and setting priorities for the index dimension codes;
Performing a traversal combination of the invalid dimension codes and the index classification codes in priority order to form invalid index dimension codes;
Determining invalid index dimensional relationships according to the invalid index dimension codes. Because the dimension codes and the index classification codes are combined by traversal, no dimension code or index classification code is omitted.
Preferably, performing the traversal combination of the dimension codes and the index classification codes in priority order to form index dimension codes includes:
Ordering the dimension codes by priority, and ordering the index classification codes by priority;
The index dimension code includes index code bits formed by the index classification code and dimension code bits formed by the dimension code;
The priority of an index dimension code is determined by the rank of the sum of its index code and dimension code.
Ordering the dimension codes and the index classification codes by priority makes it more convenient to combine them by traversal, and also gives the resulting index dimension codes a defined order.
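The traversal combination described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `combine_codes`, the two-bit code width, the placement of the index code bits before the dimension code bits, and the tie-breaking rule for equal sums are all hypothetical choices that the text does not specify.

```python
from itertools import product

def combine_codes(index_codes, dimension_codes, width=2):
    """Combine every index classification code with every dimension code.

    Codes are small integers where 1 is the highest priority (assumption).
    The combined code is the index code bits followed by the dimension code bits.
    """
    combos = []
    for i, d in product(index_codes, dimension_codes):
        code = format(i, f"0{width}b") + format(d, f"0{width}b")
        # The priority key is the sum of the two codes; ties are broken by
        # index code, then dimension code (an assumption, not from the text).
        combos.append((i + d, i, d, code))
    combos.sort()
    return [code for _, _, _, code in combos]

ordered = combine_codes([1, 2], [1, 2])
print(ordered)  # ['0101', '0110', '1001', '1010']
```

Because `product` visits every pair, no dimension code or index classification code is skipped, which is the property the traversal is meant to guarantee.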
Secondly, a server is provided. The server includes a processor and a memory, the memory stores at least one instruction, at least one program, a code set, or an instruction set, and the at least one instruction, the at least one program, the code set, or the instruction set is loaded and executed by the processor to implement the data processing method for data statistics described in any of the foregoing.
Finally, a computer-readable storage medium is provided. The storage medium stores at least one instruction, at least one program, a code set, or an instruction set, and the at least one instruction, the at least one program, the code set, or the instruction set is loaded and executed by a processor to implement the data processing method for data statistics described in any of the foregoing.
Compared with the prior art, the present invention has at least the following advantages: the user sets codes for the index classifications and dimensional attributes, and the codes are combined according to certain rules so that the index dimensional relationships are formed and preset. This reduces the difficulty of defining the relationships between index classifications and dimensional attributes directly and improves the efficiency of defining index dimensional relationships; at the same time, rejecting invalid index dimension result tables reduces storage pressure.
Brief description of the drawings
Fig. 1 is a schematic diagram of the implementation environment of an embodiment of the present invention;
Fig. 2 is a flowchart of the method of an embodiment of the present invention;
Fig. 3 is a flowchart of the method for defining dimensional attributes in an embodiment of the present invention;
Fig. 4 is a schematic diagram of forming the index classification table in an embodiment of the present invention;
Fig. 5 is a schematic diagram of forming the dimensional relationships in an embodiment of the present invention;
Fig. 6 is a schematic diagram of the traversal combination method in an embodiment of the present invention.
Detailed description of the embodiments
The present invention is further described below with reference to the accompanying drawings and embodiments.
As shown in Fig. 1, the present invention first provides a data processing method for data statistics. To better illustrate the intent of the invention, an implementation environment suited to the method for defining index dimensional relationships is designed. The implementation environment includes a terminal, which may be a smart device such as a smartphone, an intelligent robot, a tablet, or a computer; it should be noted that the terminal is not limited to these smart devices. In addition to the terminal, the implementation environment includes a data warehouse 1b that provides the data foundation, a data mart 2b formed from the data in the data warehouse 1b, an application layer 3b for requesting and computing data, and a presentation layer 4b for displaying data.
To further illustrate the intent of the embodiment, the implementation environment may specifically be enterprise report presentation: when department staff present reports to their superiors, the reports can be displayed on the terminal (such as a mobile phone). The terminal can extract relevant data from the data warehouse 1b set up by the enterprise, build tables from the dimension and index data, and display the tables on the terminal.
In one possible implementation environment, the terminal can also extract relevant data from a cloud database 5b as a data source, build the enterprise data warehouse 1b from the extracted data source, and then build the data mart 2b from the data in the data warehouse 1b.
As shown in Fig. 2, the data processing method for data statistics includes the steps:
S11, extracting user behavior attributes from an event log, and defining corresponding dimensional attributes according to the user behavior attributes. Further, the extracted user behavior attributes are analyzed. To make the analysis results better match the user's behavioral habits, the tables the user created, or the tables the user requested from the data warehouse, can be analyzed to determine the share accounted for by each dimension in those tables, thereby obtaining the corresponding dimensional attributes.
In some possible embodiments, the user periodically generates logs of the same type of event. For example, the user creates various reports every Monday, or requests specific reports from the data warehouse, which generates event logs. Here the user's behavior attribute is creating or requesting reports; the dimensional attributes in these reports can be extracted, and priorities can be configured according to when the reports were created or requested.
Further, consider logs of long-cycle events. For example, the user creates an A table only once a year, in January, and in February wants to extract the relevant index dimension result tables on a monthly cycle, for instance a B table created once a month or a C table created once a week. When the user behavior attributes are extracted, the A dimensional attribute from the A table created in January is extracted as well. If the A dimensional attribute is not needed in the B table or the C table, the dimensional attribute associated with the A-table behavior attribute in the event log can be defined as an invalid dimensional attribute and rejected. If the A dimensional attribute is partly relevant to the B table or the C table, it can be defined as a corresponding dimensional attribute and the method proceeds to step S12.
S12, setting dimension codes according to the dimensional attributes, and forming a dimensional attribute table. Further, at the data warehouse level the dimension codes can be binary codes, which makes the dimension code data easy to store and read. It should be noted that a dimension code represents the attribute of its dimension, and different dimensional attributes have different dimension codes.
In one possible embodiment, a reverse priority is configured for each of the user's behavior attributes according to the number of times it occurred in the period, and each user behavior attribute is defined as a dimensional attribute of the corresponding reverse priority. It should be noted that the reverse priority is not simply the exact inverse of the priority. For example, the user creates an A table only once a year, in January, and in February wants to extract the relevant index dimension result tables on a monthly cycle, for instance a B table created once a month or a C table created once a week. When the user behavior attributes are extracted, the A dimensional attribute from the A table created in January is extracted as well; as a dimensional attribute that is very likely to be rejected, it can be given a reverse priority instead of a priority. With reverse priorities set, a portion of them can be taken to serve as the rejection comparison table.
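One way to picture taking "a portion" of the reverse priorities as rejection candidates is sketched below. It assumes, purely for illustration, that the reverse priority ranks the least frequent behavior attribute first; the text notes that the reverse priority need not be an exact inverse, so the ranking rule, the function name `rejection_candidates`, and the cutoff `k` are hypothetical.

```python
from collections import Counter

def rejection_candidates(events, k=1):
    """Return the k behavior attributes most likely to be rejected.

    Assumption: the least frequently occurring attribute gets the top
    reverse priority; ties are broken alphabetically for determinism.
    """
    counts = Counter(events)
    reverse_order = sorted(counts, key=lambda attr: (counts[attr], attr))
    return reverse_order[:k]

# A table made once a year, B table monthly, C table weekly.
yearly_log = ["A"] * 1 + ["B"] * 12 + ["C"] * 52
print(rejection_candidates(yearly_log))  # ['A']
```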
In some possible embodiments, the dimension codes can be decimal codes at the presentation level. Most people who prepare enterprise reports are not programmers, and decimal codes match their habits when compiling codes, which makes the definition of dimension codes more intuitive.
Further, the dimension codes are written into a dimension table in the data warehouse; the dimension table stores the descriptive data for the indexes.
In some possible embodiments, to make lookups in the dimension table faster, the dimension codes can be kept separately as a dimension code table that is mapped to the dimension table through a mapping relationship. The descriptive data in the dimension table is thereby encoded, so that it can also be distinguished externally by code.
S13, presetting index classifications, setting index classification codes, and forming an index classification table. Further, the index classification is performed in the data warehouse by index data type, and at the data warehouse level the index classification codes are binary codes, which makes the index classification code data easy to store and read. It should be noted that an index contains metric information, which can be divided into absolute-number metrics and relative-number metrics. Absolute-number metrics reflect indexes of scale, such as population, GDP, income, and number of users, while relative-number metrics mainly reflect indexes of quality, such as profit margin, retention rate, and coverage rate. Equivalently, indexes can be divided into absolute-number indexes and relative-number indexes. An absolute-number index is aggregated data, for example population, GDP, income, or number of users aggregated over a time, place, or range. A relative-number index is obtained by further processing the aggregated data of absolute-number indexes, for example profit margin, retention rate, or coverage rate. In the formula profit margin = profit ÷ cost × 100%, profit is an absolute-number index and cost is also an absolute-number index, and the profit margin data is derived from the profit data and the cost data.
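As a small worked example of the distinction above, the sketch below first aggregates two absolute-number indexes (profit and cost) and then derives the relative-number index from them using the formula profit margin = profit ÷ cost × 100%. The record layout and the function name `profit_margin` are assumptions made for illustration.

```python
def profit_margin(records):
    """Aggregate profit and cost, then derive the profit margin in percent."""
    total_profit = sum(r["profit"] for r in records)  # absolute-number index
    total_cost = sum(r["cost"] for r in records)      # absolute-number index
    if total_cost == 0:
        raise ValueError("cost must be non-zero to compute a margin")
    # Relative-number index derived from the two aggregates.
    return total_profit / total_cost * 100

sales = [{"profit": 15, "cost": 50}, {"profit": 5, "cost": 30}]
print(profit_margin(sales))  # 20 / 80 * 100 = 25.0
```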
In some possible embodiments, the index classification codes can be decimal codes at the presentation level. Most people who prepare enterprise reports are not programmers, and decimal codes match their habits when compiling codes, which makes the definition of index classification codes more intuitive.
Further, the index classification codes are written into an index fact table in the data warehouse; the index fact table stores the fact data of the indexes.
In some possible embodiments, to make lookups in the index classification table faster, the index classification codes can be kept separately as an index classification code table that is mapped to the index fact table through a mapping relationship. The fact data in the index fact table is thereby encoded, so that it can also be distinguished externally by code.
S14, combining the dimension codes with the index classification codes to form index dimensional relationships, and forming an index dimensional relationship table according to the index dimensional relationships.
Further, combining the dimension codes with the index classification codes proceeds as follows. First, an index dimension code table is formed; it stores the index dimension codes, each of which is a combination of a dimension code and an index classification code. Second, combining a dimension code with an index classification code also combines the corresponding dimension and index, forming an index dimensional relationship. Third, the index dimensional relationships are associated with the index dimension code table so that each index dimensional relationship corresponds to an index dimension code in that table; this turns the index dimensional relationships into data and makes them easy to store. Finally, the index dimensional relationship table is formed according to the index dimensional relationships, the index dimensional relationships are stored in it, and a lookup index list is configured for it.
In some possible embodiments, the lookup index list can be configured separately within the index dimensional relationship table, or the index dimension code table can serve as the lookup index list of the index dimensional relationship table. In the latter case the index dimension code table is associated with the index dimensional relationship table, so an index dimensional relationship can be found by looking it up in the index dimension code table. Using the index dimension code table in place of the index dimensional relationship table avoids the large amount of descriptive data that would otherwise be generated when building the index dimensional relationship table, and reduces the storage required for the index dimensional relationships.
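The idea of using the combined code as the lookup key for a relationship can be sketched with a plain dictionary. The code strings, the names in the sample tables, and the function `build_relation_table` are hypothetical; the concatenation order (index classification code first, dimension code second) is also an assumption.

```python
def build_relation_table(index_codes, dimension_codes):
    """Key each index-dimension relationship by its combined code."""
    return {
        f"{icode}{dcode}": (index_name, dim_name)
        for icode, index_name in index_codes.items()
        for dcode, dim_name in dimension_codes.items()
    }

# Hypothetical code tables: code -> name.
index_codes = {"01": "number_of_users", "10": "profit_margin"}
dimension_codes = {"01": "time", "10": "region"}

relations = build_relation_table(index_codes, dimension_codes)
print(relations["1001"])  # ('profit_margin', 'time')
```

Storing only the short combined codes and resolving names on demand is what lets the code table stand in for a relationship table full of descriptive data.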
S15, forming index dimension result tables according to the index dimensional relationship table.
It should be noted that the data generated from an index dimension result table is the preset data of that result table; when the index dimension result table is needed, it can be requested and extracted directly.
To speed up extraction of the index dimension result tables, the generated data can be stored in the data mart, and the index dimension result tables are associated with the index dimension code table through the index dimensional relationship table.
S16, rejecting invalid index dimension result tables, and storing the data generated from the valid index dimension result tables.
In one possible embodiment, when invalid index dimension result tables are rejected, the invalid tables are displayed so that the user knows exactly which index dimension result tables are being rejected. Specifically, the invalid index dimension result tables are placed in a pre-rejection column configured in advance, and the pre-rejection column is shown to the user. Depending on the user's action, an index dimension result table in the pre-rejection column is either restored as a valid index dimension result table, or the pre-rejection column is rejected quickly as a whole.
In one possible embodiment, to improve the efficiency of rejecting invalid index dimension result tables, a rejection comparison table can be configured in the data warehouse. The rejection comparison table contains multiple rejection targets, each of which is an invalid index dimensional relationship. The index dimensional relationships that form the index dimension result tables are compared against the invalid index dimensional relationships, and those identical to an invalid index dimensional relationship are marked as invalid. The index dimension result tables whose index dimensional relationships carry the invalid mark are rejected, and the resulting valid index dimension result tables are saved in the data warehouse.
Further, the rejection comparison table can be preset in the data warehouse, or it can be generated temporarily in the data warehouse according to the user's behavior attributes. In the latter case, the dimensional attributes configured with reverse priorities are mapped into the data warehouse to form the rejection comparison table.
In one possible embodiment, to further improve the efficiency of rejecting invalid index dimension result tables, a rejection comparison table can be configured in the data warehouse, containing multiple rejection targets, each of which is an invalid index dimensional relationship. The index dimensional relationship table is compared against the invalid index dimensional relationships, the index dimensional relationships identical to an invalid one are rejected, the remaining valid index dimensional relationships form the valid index dimension result tables, and the valid index dimension result tables are saved in the data warehouse.
In another possible embodiment, to further improve the efficiency of rejecting invalid index dimension result tables, the invalid index dimensional relationships can be associated with the index dimension code table and the invalid index dimension codes identified among the index dimension codes, forming an invalid index dimension code table. The invalid index dimension code table is compared with the index dimension code table, and the codes recorded in the invalid index dimension code table are rejected from the index dimension code table, leaving the valid index dimension codes. The valid index dimensional relationships are then identified from the valid index dimension codes and used to form the valid index dimension result tables, which are saved in the data warehouse.
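The code-based rejection in the last embodiment amounts to a set difference between the index dimension code table and the invalid index dimension code table. The sketch below shows that comparison on in-memory dictionaries; the function name `reject_invalid` and the sample codes are hypothetical, and a real data warehouse would perform the comparison with its own tables.

```python
def reject_invalid(code_table, invalid_codes):
    """Drop every entry whose code appears in the invalid code table."""
    invalid = set(invalid_codes)  # set membership keeps the comparison cheap
    return {code: rel for code, rel in code_table.items() if code not in invalid}

# Hypothetical index dimension code table: combined code -> relationship.
code_table = {
    "0101": ("number_of_users", "time"),
    "0110": ("number_of_users", "region"),
    "0111": ("number_of_users", "yearly_A_dimension"),
}
invalid_code_table = ["0111"]

valid = reject_invalid(code_table, invalid_code_table)
print(sorted(valid))  # ['0101', '0110']
```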
As shown in Fig. 3, in an embodiment of the present invention, extracting user behavior attributes from the event log and defining corresponding dimensional attributes according to the user behavior attributes includes the steps:
S21, extracting from the event log the user's behavior attributes within a certain time period, and configuring a priority for each behavior attribute according to the number of times it occurred in that period;
The step of extracting the user's behavior attributes within a certain time period from the event log is specifically: setting an extraction time interval for extracting behavior attributes from the event log. The time interval should be a period running back from the current time, for example the past month; the time interval can of course be selected and set by the user on the terminal.
In some possible embodiments, to narrow the extraction range and reduce the analysis workload, the user behavior attributes at a specific point in time can be extracted from the event log; for example, the user behavior attributes of a Monday are extracted and analyzed.
In other possible embodiments, to narrow the extraction range and reduce the analysis workload while still ensuring the accuracy of the analysis, the point in time can be placed within a time period, and the user behavior attributes at multiple such points in time are extracted from the event log; for example, the user behavior attributes of every Monday in the last quarter are extracted and analyzed.
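The time-window extraction above, including the "every Monday within a period" variant, can be sketched as a simple filter. The event log is assumed here to be a list of (timestamp, behavior attribute) pairs; that layout, the function name `extract_in_window`, and the sample data are illustrative assumptions.

```python
from datetime import datetime, timedelta

def extract_in_window(log, now, days=30, weekday=None):
    """Return behavior attributes logged in the `days` before `now`.

    If `weekday` is given (0 = Monday), only events on that weekday are kept.
    """
    start = now - timedelta(days=days)
    return [
        attr
        for ts, attr in log
        if start <= ts <= now and (weekday is None or ts.weekday() == weekday)
    ]

now = datetime(2018, 7, 2)
event_log = [
    (datetime(2018, 6, 25), "time_table"),    # a Monday, inside the window
    (datetime(2018, 6, 27), "region_table"),  # a Wednesday, inside the window
    (datetime(2018, 5, 1), "time_table"),     # outside the 30-day window
]
print(extract_in_window(event_log, now))             # ['time_table', 'region_table']
print(extract_in_window(event_log, now, weekday=0))  # ['time_table']
```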
The step of configuring a priority for each behavior attribute according to the number of times it occurred in the period specifically includes: the more times a behavior attribute of the user occurs, the higher the priority configured for it. For example, if within a given month the user created or requested a time table six times and created or requested a region table five times, the behavior of creating or requesting the time table is given a higher priority than the behavior of creating or requesting the region table.
Of course, the case where a behavior attribute that occurs more often is given a lower priority is not excluded. In one embodiment, the user can choose between these two ways of configuring priority by selecting forward or reverse order.
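The count-based priority rule, together with the forward/reverse option, can be sketched as follows, using the time table (six occurrences) and region table (five occurrences) from the example. The function name `configure_priorities`, the `reverse` flag, and the alphabetical tie-break are assumptions made for illustration.

```python
from collections import Counter

def configure_priorities(events, reverse=False):
    """Rank behavior attributes by occurrence count; rank 1 is the highest priority.

    Forward order (default): more occurrences means higher priority.
    Reverse order: more occurrences means lower priority.
    """
    counts = Counter(events)
    ordered = sorted(
        counts,
        key=lambda attr: (counts[attr] if reverse else -counts[attr], attr),
    )
    return {attr: rank for rank, attr in enumerate(ordered, start=1)}

month_log = ["time_table"] * 6 + ["region_table"] * 5
print(configure_priorities(month_log))
# {'time_table': 1, 'region_table': 2}
print(configure_priorities(month_log, reverse=True))
# {'region_table': 1, 'time_table': 2}
```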
S22, according to the priority of user behavior attribute, be the dimension category of corresponding priority by user behavior attribute definition Property, meanwhile, the dimensional attribute of priority rearward is defined as invalid dimensional attribute.Specifically, to the user's row extracted It is analyzed for attribute, it, can be in the user behavior attribute to make the result analyzed more meet the behavioural habits of user Tabulation or analyzed to the table of the data warehouse application, in the table for determining the tabulation or application in the user behavior The accounting of some dimension number, to obtain corresponding dimensional attribute;In addition, described extract user for the moment from event log Between behavior property in section the step of specifically: be arranged from event log and extract the extraction time section of behavior property, it is described Time interval should be a period of time continued forward at that time, such as in one month, and certainly, the time interval can be by user Selection setting is carried out at the terminal;The number according to the generation of the behavior property of user in the period is the behavior category of user Property configuration preference level the step of specifically include: the number that the behavior property of the user occurs is more, for the priority of its configuration It is higher, for example, within some month in the behavior property of user, manufacture or the number of application time table is six times, manufactures or Shen Please the number of regional table be five times, then being manufactured for user or the behavior of application time table configures one and is higher than when manufacturing or applying Between table priority;It is associated with user behavior attribute by the obtained dimensional attribute, make the user behavior attribute Priority can be defined the priority of the dimensional attribute.
From the behavior property extracted in user's certain time period in event log, the user behavior attribute definition can be made Dimensional attribute it is more accurate, by the behavior property configuration preference level to the user, be configured the dimensional attribute also Priority.
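The frequency-based prioritization of step S22 can be sketched as follows. This is a minimal illustration and not the patent's implementation: the event log shape (pairs of timestamp and attribute name), the function name `rank_behavior_attributes`, and the parameters `window_days`, `invalid_tail` and `descending` are all assumptions introduced here. In particular, the source does not say how many trailing priorities become invalid, so `invalid_tail` is a free parameter.

```python
from collections import Counter
from datetime import datetime, timedelta


def rank_behavior_attributes(events, now, window_days=30, invalid_tail=1, descending=True):
    """Rank behavior attributes by how often they occur in a recent time window.

    events: iterable of (timestamp, attribute) pairs (hypothetical log shape).
    invalid_tail: how many of the last-ranked attributes to mark invalid (assumed).
    descending: True gives frequent attributes a higher priority; False reverses it.
    """
    start = now - timedelta(days=window_days)
    # Count only the events that fall inside the extraction time interval.
    counts = Counter(attr for ts, attr in events if start <= ts <= now)
    sign = -1 if descending else 1
    # Ties are broken by attribute name so the ranking is deterministic.
    ordered = sorted(counts.items(), key=lambda kv: (sign * kv[1], kv[0]))
    cutoff = len(ordered) - invalid_tail
    return [
        {"attribute": attr, "count": n, "priority": rank, "valid": rank <= cutoff}
        for rank, (attr, n) in enumerate(ordered, start=1)
    ]
```

With six time-table events and five region-table events in the window, the time table receives priority 1 and the region table priority 2, matching the example in the text. Passing `descending=False` selects the alternative ordering mentioned above.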
In an embodiment of the present invention, the step of setting dimension codes according to the dimensional attributes and forming a dimensional attribute table includes:
Set a dimension code of the corresponding priority for each dimensional attribute, and form a dimensional attribute table ordered by priority;
The dimensional attribute table includes the invalid dimensional attributes. Because the dimension codes are set according to the priorities of the dimensional attributes, the dimension codes also carry priorities, which makes the resulting dimensional attribute table better ordered.
Further, each dimensional attribute is associated with its dimension code, and the priority of the dimensional attribute defines the priority of the dimension code. At the data warehouse level the dimension code can be a binary code, which makes the dimension code data easier to store and read. Note that a dimension code represents the attribute of a dimension, and different dimensional attributes have different dimension codes.
In some possible embodiments, the dimension code can be a decimal code at the display level. Most people who prepare enterprise reports are not programmers, and decimal codes match how such users habitually write codes, which makes the definition of dimension codes more intuitive.
For example, in decimal the dimension codes can follow priority order, with the highest priority coded 1, the second priority coded 2, and so on. In binary the dimension codes can likewise follow priority order, for example 01 for the highest priority, 10 for the second priority and 11 for the third priority.
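As a rough sketch of the coding scheme just described, the snippet below assigns each dimensional attribute a decimal code for display and a fixed-width binary code for storage. The function name `assign_dimension_codes` and the field names are hypothetical. Padding every binary code to the width of the largest code is also an assumption, chosen because it reproduces the 01, 10, 11 example for three attributes.

```python
def assign_dimension_codes(attributes_by_priority):
    """Assign codes to dimensional attributes listed from highest to lowest priority."""
    # Width of the largest code in bits, so that all stored codes have equal length.
    width = max(1, len(attributes_by_priority).bit_length())
    return [
        {
            "attribute": attribute,
            "priority": priority,
            "display_code": str(priority),                 # decimal, for report authors
            "stored_code": format(priority, f"0{width}b"),  # binary, for the warehouse
        }
        for priority, attribute in enumerate(attributes_by_priority, start=1)
    ]
```

For the list `["time", "region", "channel"]` the display codes are 1, 2, 3 and the stored codes are 01, 10, 11.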
In an embodiment of the present invention, the dimensional attribute table includes a granularity sub-table, which is arranged according to the priority of each granularity. The granularity sub-table describes how an index is distributed under the dimension. Note that granularity is the unit in which data is calculated under a dimension; it mainly concerns the range over which index data is computed. Taking the location dimension as an example, a statistics department may count the population data item per block or per community. The more refined the population data, the smaller the granularity level: counting population per community is a larger granularity than counting it per residential building. Conversely, the less refined the data, the larger the granularity level.
Specifically, the extracted user behavior attributes are analyzed. So that the result better matches the user's habits, the analysis can cover the tables the user has created, or the tables the user has requested from the data warehouse, as recorded in the user behavior attributes. The share of occurrences of each granularity among those created or requested tables is determined, and the corresponding granularity attribute is obtained from it.
The step of extracting the user's behavior attributes within a time period from the event log is specifically: set an extraction time interval for extracting behavior attributes from the event log. The time interval should be a period extending backward from the current time, for example the most recent month; the user can of course select and set the interval at the terminal.
The step of configuring priorities for the user's behavior attributes according to how often they occur within the period specifically includes: the more often a behavior attribute occurs, the higher the priority configured for it. For example, if within a given month the user created or requested a time table six times and a region table five times, then the behavior of creating or requesting a time table is given a higher priority than the behavior of creating or requesting a region table.
The resulting granularity attribute is associated with the user behavior attribute, so that the priority of the user behavior attribute defines the priority of the granularity attribute.
As shown in Figure 4, in an embodiment of the present invention, the step of presetting index classifications, setting index classification codes and forming an index classification table includes:
S31: classify the indexes by the index data types in the data warehouse, and set a corresponding index classification code for each classification;
S32: form the index classification table from the index classification codes, and build a lookup catalog of the index classification codes. Setting index classification codes makes it easy to pair them with the dimension codes, and building the lookup catalog makes it easy to retrieve the classification codes by lookup.
Further, the indexes are classified by priority, and each index classification is associated with its index classification code, so that the priority of the index classification defines the priority of the index classification code. At the data warehouse level the index classification code can be a binary code, which makes the index classification code data easier to store and read. Note that an index classification code represents the data type of the index, and different index classifications have different index classification codes.
In some possible embodiments, the index classification code can be a decimal code at the display level. Most people who prepare enterprise reports are not programmers, and decimal codes match how such users habitually write codes, which makes the definition of index classification codes more intuitive.
For example, in decimal the index classification codes can follow priority order, with the highest priority coded 1, the second priority coded 2, and so on. In binary the index classification codes can likewise follow priority order, for example 01 for the highest priority, 10 for the second priority, 11 for the third priority, and so on.
In an embodiment of the present invention, the step of classifying the indexes by the index data types in the data warehouse and setting corresponding index classification codes includes:
Set priorities according to how frequently each index data type is extracted (its popularity), and set the corresponding code for each index classification in priority order. Setting priorities by popularity matches the way users actually use the data.
Specifically, a priority is configured for each index according to how frequently its index data type is extracted from the data warehouse, and the priority of the index is applied to the index classification code, so that the index classification codes are also ordered by priority.
In some possible embodiments, to extract data faster, a data mart can be set up as the data terminal. The popularity with which each data type is extracted from the data mart can then be analyzed and a corresponding priority configured, and the priority of the index can likewise be applied to the index classification code.
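One possible reading of steps S31 and S32 combined with popularity-based priority is sketched below. It assumes the extraction history is available as a flat list of index data type names, which the source does not specify; `build_index_classification` and the field names are hypothetical.

```python
from collections import Counter


def build_index_classification(extraction_log):
    """Build an index classification table and a lookup catalog keyed by code.

    extraction_log: list of index data type names, one entry per extraction
    from the data warehouse or data mart (assumed input shape).
    """
    popularity = Counter(extraction_log)
    # Most frequently extracted type first; ties broken by name for determinism.
    ranked = sorted(popularity.items(), key=lambda kv: (-kv[1], kv[0]))
    table = [
        {"code": rank, "data_type": data_type, "popularity": n}
        for rank, (data_type, n) in enumerate(ranked, start=1)
    ]
    # The catalog lets a classification be retrieved directly by its code.
    catalog = {row["code"]: row for row in table}
    return table, catalog
```

A data type extracted five times receives code 1, ahead of a type extracted three times, and `catalog[1]` returns the row for the most popular classification.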
As shown in Figure 5, in an embodiment of the present invention, the step of combining the dimension codes and the index classification codes to form index dimension relationships includes:
S41: perform traversal combination on the dimension codes and the index classification codes in priority order to form index dimension codes;
S42: determine the index dimension relationships from the index dimension codes, and set priorities by the index dimension codes. Because the dimension codes and the index classification codes are combined by traversal, no dimension code or index classification code is omitted.
Further, the traversal combination of dimension codes and index classification codes proceeds as follows. First, an index dimension code table is formed; it stores the index dimension codes, each of which is formed by combining a dimension code with an index classification code. Second, through the combination of dimension codes and index classification codes, the corresponding dimensions and indexes are combined as well, forming index dimension relationships. Third, the index dimension relationships are associated with the index dimension code table, so that each index dimension relationship corresponds to an index dimension code in that table; this turns the relationships into data and makes them easy to store. Finally, an index dimension relationship table is formed from the index dimension relationships, the relationships are stored in that table, and a lookup index list is configured for the table.
In some possible embodiments, the lookup index list can be configured separately within the index dimension relationship table, or the index dimension code table can itself serve as the lookup index list of the index dimension relationship table. In the latter case the index dimension code table is associated with the index dimension relationship table, so that looking up a code in the index dimension code table locates the corresponding index dimension relationship in the index dimension relationship table.
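The second arrangement, where the code table doubles as the lookup index for the relationship table, might look like the sketch below. Representing the relationship table as a dictionary keyed by index dimension code is an assumption made for illustration, as are the function name, the input shape (code-to-name mappings) and the "-" separator.

```python
def build_relationship_table(dimensions, indexes, sep="-"):
    """Return (code_table, relationship_table) for every dimension/index pairing.

    dimensions, indexes: dicts mapping a code to a name (assumed shape),
    listed in priority order.
    """
    code_table = []          # ordered list of index dimension codes
    relationship_table = {}  # code -> the dimension/index relationship it stands for
    for d_code, d_name in dimensions.items():
        for i_code, i_name in indexes.items():
            code = f"{d_code}{sep}{i_code}"
            code_table.append(code)
            relationship_table[code] = {"dimension": d_name, "index": i_name}
    return code_table, relationship_table
```

Looking up a code from `code_table` in `relationship_table` then returns the stored relationship, so no separate lookup index list is needed in this variant.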
As shown in Figure 6, in an embodiment of the present invention, the step of performing traversal combination on the dimension codes and index classification codes in priority order to form index dimension codes includes:
S51: the dimension codes are ordered by priority, and the index classification codes are ordered by priority. Further, the index dimension codes can be formed by traversing the index classification codes with the dimension codes in priority order.
Of course, as another possible embodiment, the index dimension codes can instead be formed by traversing the dimension codes with the index classification codes in priority order.
S52: the index dimension code includes index code bits made up of the index classification code and dimension code bits made up of the dimension code. Specifically, the index dimension code includes at least two parts: one part is the index classification code and the other is the dimension code.
Further, to better distinguish the index classification code from the dimension code, the index dimension code can also include a separator character.
Further, the index dimension code can be a binary code at the data warehouse level, which makes the index dimension code data easier to store and read. Note that an index dimension code represents an index dimension relationship, and different index dimension relationships correspond to different index dimension codes.
In some possible embodiments, the index dimension code can be a decimal code at the display level. Most people who prepare enterprise reports are not programmers, and decimal codes match how such users habitually write codes, which makes the definition of index dimension codes more intuitive.
For example, in decimal the index dimension codes can follow priority order, with the highest priority defined as 1, the second priority as 2, and so on. In binary the index dimension codes can likewise follow priority order, for example 01 for the highest priority, 10 for the second priority, 11 for the third priority, and so on.
As a more specific example, the dimension codes are set by priority to 1, 2, 3, 4, ..., the index classification codes are set by priority to 1, 2, 3, 4, ..., and the separator is "-". Taking the case where the dimension codes traverse the index classification codes in priority order, the traversal combination yields the index dimension codes 1-1, 1-2, 1-3, 1-4, ..., 2-1, 2-2, 2-3, 2-4, ..., 3-1, 3-2, 3-3, 3-4, ..., 4-1, 4-2, 4-3, 4-4, ...; each item, such as 1-1, is one index dimension code.
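The traversal combination in this example amounts to a Cartesian product of the two priority-ordered code lists. The sketch below uses `itertools.product` from the Python standard library. The function name `traverse_codes` and the `dimension_outer` flag are hypothetical, and writing the dimension code before the separator is an assumption, since the source example does not say which part comes first.

```python
from itertools import product


def traverse_codes(dimension_codes, index_codes, sep="-", dimension_outer=True):
    """Combine every dimension code with every index classification code.

    dimension_outer=True walks all index codes for each dimension code in turn;
    False walks all dimension codes for each index code (the alternative in S51).
    """
    if dimension_outer:
        pairs = product(dimension_codes, index_codes)
    else:
        pairs = ((d, i) for i, d in product(index_codes, dimension_codes))
    # Assumed layout: dimension code, separator, index classification code.
    return [f"{d}{sep}{i}" for d, i in pairs]
```

Calling `traverse_codes([1, 2], [1, 2, 3])` returns `['1-1', '1-2', '1-3', '2-1', '2-2', '2-3']`. Every pairing appears exactly once, which is why the traversal omits no code.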
S53: the priority of an index dimension code is ordered by the sum of its index classification code and its dimension code. Continuing the example above: for index dimension code 1-1, the sum of the dimension code and the index classification code is 1 + 1 = 2, so its priority is defined as 2A; for 1-2 and 2-1, 1 + 2 = 3 and 2 + 1 = 3, so their priority is defined as 3A; for 1-3, 2-2 and 3-1, 1 + 3 = 4, 2 + 2 = 4 and 3 + 1 = 4, so their priority is defined as 4A; for 1-4, 2-3, 3-2 and 4-1 the sum is 5, so their priority is defined as 5A. The priorities of the index dimension codes are obtained in this way. Because both the dimension codes and the index classification codes are ordered by priority according to a fixed rule, the priorities of the index dimension codes also follow a fixed rule.
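The sum rule of S53 is simple enough to check directly. The following sketch labels each code with the sum of its two parts followed by "A", as in the example, and groups codes that share a label. The function names are hypothetical, and the code assumes decimal parts joined by a single "-" separator.

```python
def priority_label(code, sep="-"):
    """Return the priority label of an index dimension code, e.g. '1-2' -> '3A'."""
    left, right = code.split(sep)
    return f"{int(left) + int(right)}A"


def group_by_priority(codes, sep="-"):
    """Group index dimension codes by priority label, smallest sum first."""
    groups = {}
    for code in codes:
        groups.setdefault(priority_label(code, sep), []).append(code)
    # Sort by the numeric part of the label so that '10A' comes after '9A'.
    return dict(sorted(groups.items(), key=lambda kv: int(kv[0][:-1])))
```

For the ten codes whose parts sum to at most 5, the groups are 2A: 1-1; 3A: 1-2, 2-1; 4A: 1-3, 2-2, 3-1; 5A: 1-4, 2-3, 3-2, 4-1, which agrees with the worked example.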
Ordering the dimension codes by priority and ordering the index classification codes by priority makes traversal combination of the two more convenient, and also gives the resulting index dimension codes a well-defined order.
Secondly, a server is also provided. The server includes a processor and a memory; the memory stores at least one instruction, at least one program, a code set or an instruction set, and the at least one instruction, at least one program, code set or instruction set is loaded and executed by the processor to implement the data processing method for data statistics described in any of the foregoing embodiments.
The processor in the server can be a computing chip that computes and processes the aggregation of dimension data and index data in the database. The memory can be any storage device capable of storing program code, such as a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk or an optical disc.
Finally, a computer readable storage medium is also provided. The storage medium stores at least one instruction, at least one program, a code set or an instruction set, and the at least one instruction, at least one program, code set or instruction set is loaded and executed by the processor to implement the data processing method for data statistics described in any of the foregoing embodiments.
The computer readable storage medium includes any medium capable of storing program code, such as a USB flash drive, read-only memory (ROM), random access memory (RAM), a removable hard disk, a magnetic disk or an optical disc.
The above embodiments merely illustrate specific implementations of the present invention. It should be noted that a person of ordinary skill in the art can make several modifications and variations without departing from the inventive concept, and all such modifications and variations fall within the protection scope of the present invention.

Claims (10)

1. A data processing method for data statistics, characterized in that it is used to preset an index dimension result table, the method comprising:
extracting user behavior attributes from an event log, and defining corresponding dimensional attributes according to the user behavior attributes;
setting dimension codes according to the dimensional attributes, and forming a dimensional attribute table;
presetting index classifications, setting index classification codes, and forming an index classification table;
combining the dimension codes and the index classification codes to form index dimension relationships, and forming an index dimension relationship table according to the index dimension relationships;
forming index dimension result tables according to the index dimension relationship table;
rejecting invalid index dimension result tables, and storing the data generated by the valid index dimension result tables.
2. The data processing method for data statistics according to claim 1, characterized in that extracting user behavior attributes from the event log and defining corresponding dimensional attributes according to the user behavior attributes comprises:
extracting the user's behavior attributes within a time period from the event log, and configuring priorities for the user's behavior attributes according to the number of times each behavior attribute occurs within the period;
defining, according to the priorities of the user behavior attributes, each user behavior attribute as a dimensional attribute of the corresponding priority, and defining the dimensional attributes whose priorities rank last as invalid dimensional attributes.
3. The data processing method for data statistics according to claim 2, characterized in that setting dimension codes according to the dimensional attributes and forming a dimensional attribute table comprises:
setting a dimension code of the corresponding priority for each dimensional attribute, and forming a dimensional attribute table of the corresponding priorities;
wherein the dimensional attribute table includes the invalid dimensional attributes.
4. The data processing method for data statistics according to claim 3, characterized in that the dimensional attribute table includes a granularity sub-table, and the granularity sub-table is arranged according to the priority of each granularity.
5. The data processing method for data statistics according to any one of claims 1 to 4, characterized in that presetting index classifications, setting index classification codes and forming an index classification table comprises:
classifying the indexes by the index data types in a data warehouse, and setting corresponding index classification codes;
forming the index classification table according to the index classification codes, and building a lookup catalog of the index classification codes.
6. The data processing method for data statistics according to claim 5, characterized in that classifying the indexes by the index data types in the data warehouse and setting corresponding index classification codes comprises:
setting priorities according to how frequently each index data type is extracted, and setting the corresponding code for each index classification in priority order.
7. The data processing method for data statistics according to claim 6, characterized in that combining the dimension codes and the index classification codes to form index dimension relationships comprises:
performing traversal combination on the dimension codes and the index classification codes in priority order to form index dimension codes;
determining the index dimension relationships according to the index dimension codes, and setting priorities by the index dimension codes;
performing traversal combination on the invalid dimension codes and the index classification codes in priority order to form invalid index dimension codes;
determining invalid index dimension relationships according to the invalid index dimension codes.
8. The data processing method for data statistics according to claim 7, characterized in that performing traversal combination on the dimension codes and the index classification codes in priority order to form index dimension codes comprises:
ordering the dimension codes by priority, and ordering the index classification codes by priority;
wherein the index dimension code includes index code bits made up of the index classification code and dimension code bits made up of the dimension code;
and the priority of the index dimension code is ordered by the sum of the index code and the dimension code.
9. A server, characterized by comprising a processor and a memory, wherein the memory stores at least one instruction, at least one program, a code set or an instruction set, and the at least one instruction, at least one program, code set or instruction set is loaded and executed by the processor to implement the data processing method for data statistics according to any one of claims 1 to 8.
10. A computer readable storage medium, characterized in that the storage medium stores at least one instruction, at least one program, a code set or an instruction set, and the at least one instruction, at least one program, code set or instruction set is loaded and executed by the processor to implement the data processing method for data statistics according to any one of claims 1 to 8.
CN201810711224.1A 2018-06-29 2018-06-29 For the data processing method of data statistics, server and storage medium Pending CN109145059A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810711224.1A CN109145059A (en) 2018-06-29 2018-06-29 For the data processing method of data statistics, server and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810711224.1A CN109145059A (en) 2018-06-29 2018-06-29 For the data processing method of data statistics, server and storage medium

Publications (1)

Publication Number Publication Date
CN109145059A true CN109145059A (en) 2019-01-04

Family

ID=64799579

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810711224.1A Pending CN109145059A (en) 2018-06-29 2018-06-29 For the data processing method of data statistics, server and storage medium

Country Status (1)

Country Link
CN (1) CN109145059A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112307041A (en) * 2020-10-29 2021-02-02 山东浪潮通软信息科技有限公司 Index dimension modeling method and device and computer readable medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7571151B1 (en) * 2005-12-15 2009-08-04 Gneiss Software, Inc. Data analysis tool for analyzing data stored in multiple text files
CN102855241A (en) * 2011-06-28 2013-01-02 上海迈辉信息技术有限公司 Multi-index expert suggestion system and realization method thereof
CN103136335A (en) * 2013-01-31 2013-06-05 北京千分点信息科技有限公司 Data control method based on data platforms
CN104142986A (en) * 2014-07-24 2014-11-12 中国软件与技术服务股份有限公司 Big data situation analysis early warning method and system based on clustering
CN104408179A (en) * 2014-12-15 2015-03-11 北京国双科技有限公司 Method and device for processing data from data table
CN105989076A (en) * 2015-02-10 2016-10-05 腾讯科技(深圳)有限公司 Data statistical method and device
CN106250543A (en) * 2016-08-10 2016-12-21 深圳市彬讯科技有限公司 A kind of automation data inquiry synchronous storage method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 518000 R & D building 3501, block a, building 7, Vanke Cloud City Phase I, Xingke 1st Street, Xili community, Xili street, Nanshan, Shenzhen, Guangdong

Applicant after: Tubatu Group Co.,Ltd.

Address before: 1001-a, 10th floor, bike technology building, No.9, Keke Road, high tech Zone, Nanshan District, Shenzhen, Guangdong 518000

Applicant before: SHENZHEN BINCENT TECHNOLOGY Co.,Ltd.

RJ01 Rejection of invention patent application after publication

Application publication date: 20190104