CN110795426B - Data generation method, device and computer readable storage medium - Google Patents

Data generation method, device and computer readable storage medium Download PDF

Info

Publication number
CN110795426B
CN110795426B CN201810878545.0A CN201810878545A CN110795426B CN 110795426 B CN110795426 B CN 110795426B CN 201810878545 A CN201810878545 A CN 201810878545A CN 110795426 B CN110795426 B CN 110795426B
Authority
CN
China
Prior art keywords
data
information
standard data
standard
data table
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810878545.0A
Other languages
Chinese (zh)
Other versions
CN110795426A (en
Inventor
钟强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Xiaoyu Data Technology Co ltd
Original Assignee
Shanghai Xiaoyu Data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Xiaoyu Data Technology Co ltd filed Critical Shanghai Xiaoyu Data Technology Co ltd
Priority to CN201810878545.0A priority Critical patent/CN110795426B/en
Publication of CN110795426A publication Critical patent/CN110795426A/en
Application granted granted Critical
Publication of CN110795426B publication Critical patent/CN110795426B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a data generation method, a data generation device and a computer readable storage medium. The method comprises the following steps: acquiring a target data record for recording an object; determining first dimension information, characteristic information and time information in a target data record, wherein the first dimension information is first information used for representing an object in the target data record, the characteristic information is information capable of metering the first dimension information, and the time information is generation time, release time or user acquisition time related to the first dimension information; and generating standard data including the first dimension information, the feature information and the time information, wherein when the number of the target data records is multiple, the data structures of the plurality of standard data corresponding to the plurality of target data records are the same, so that the data calculation can be performed by using the plurality of standard data.

Description

Data generation method, device and computer readable storage medium
Technical Field
The present application relates to the field of computer technologies, and in particular, to a data generation method, an apparatus, and a computer-readable storage medium.
Background
In order to analyze a certain behavior, event or state, a lot of data need to be acquired, and then data calculation is performed on the acquired data, so that the event is analyzed by using big data, and the analyzed result is relatively accurate.
Disclosure of Invention
The present application is directed to a data generation method, apparatus, and computer-readable storage medium, so as to unify data acquired through multiple ways into the same data structure.
In a first aspect, to achieve the above object, the present application provides a data generating method, including:
obtaining a target data record for recording an object, wherein the object comprises a behavior, an event or a state;
determining first dimension information, feature information and time information in the target data record, wherein the first dimension information is first information used for representing the object in the target data record, the feature information is information capable of measuring the first dimension information, and the time information is generation time, release time or user acquisition time related to the first dimension information; and
Generating standard data including the first dimension information, the feature information, and the time information.
Optionally, after generating the standard data including the first dimension information, the feature information, and the time information, the method further includes:
detecting whether first standard data and second standard data exist in standard data belonging to the same data table, wherein time information in the first standard data is the same as time information in the second standard data, and first dimension information in the first standard data is the same as first dimension information in the second standard data; and
if not, performing data calculation by using the first standard data and the second standard data.
Optionally, after detecting whether the first standard data and the second standard data exist in the standard data belonging to the same data table, the method further includes:
if the standard data exists, adding second dimension information in a target data record corresponding to each standard data into the standard data, wherein the second dimension information is second information used for representing the object in the target data record;
Detecting whether second dimension information of the first standard data is the same as second dimension information of the second standard data; and
and if not, performing data calculation by using the standard data added with the second dimension information in the data table.
Optionally, after detecting whether the second dimension information of the first standard data is the same as the second dimension information of the second standard data, the method further includes:
if the standard data are the same, adding third dimension information in a target data record corresponding to each standard data into the standard data, wherein the third dimension information is third information used for representing the object in the target data record; and
and detecting whether the third dimension information of the first standard data is the same as that of the second standard data or not, and repeating the steps until the standard data in the data table meet the condition for data calculation.
Optionally, after each standard datum in the data table satisfies a condition for performing data calculation, the method further includes:
judging whether the time information of each standard data in the same data table accords with a preset time acquisition rule or not; and
If the time information of the standard data in the data table is not matched with the time format corresponding to the time information of the standard data in the data table, extracting the time information of the standard data in the data table according to the time format corresponding to the time information of the standard data in the data table;
and the time unit of the time information of each piece of standard data in the extracted data table is larger than the time unit of the time information of each piece of original standard data in the data table.
Optionally, if yes, before calculating the standard data in the two data tables, when the number of the standard data in the first data table is not the same as the number of the standard data in the second data table within the first specified time period, the method further comprises:
and combining or splitting each standard data in the first table data according to the sequence of the time information by taking the specified acquisition frequency within a second specified duration as a standard, and combining or splitting each standard data in the second table data according to the sequence of the time information so as to enable the time information of each standard data after splitting or combining in the first table data and the second table data to meet the specified acquisition frequency within the second specified duration.
Alternatively,
after the first table data are merged, the characteristic information of original standard data merged into one standard data in the first table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data;
after the second table data are merged, the characteristic information of the original standard data merged into one standard data in the second table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data;
after the standard data in the first data table are split, the split standard data in the first data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data;
after the standard data in the second data table are split, the split standard data in the second data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data.
Optionally, when the time information of each standard data in the same data table meets a preset time collecting rule and before the standard data in the two data tables are calculated, when the number of the standard data in the first data table is the same as the number of the standard data in the second data table within the first specified time period, the method further comprises:
comparing time information included in a first data table with time information included in a second data table to determine first time information only appearing in the first data table but not appearing in the second data table, and to determine second time information only appearing in the second data table but not appearing in the first data table; and
adding the first time information and the first dimension information corresponding to the first time information to the second data table, and adding the second time information and the first dimension information corresponding to the second time information to the first data table.
Optionally, after adding the first time information and the first dimension information corresponding to the first time information to the second data table, and adding the second time information and the first dimension information corresponding to the second time information to the first data table, the method further includes:
Adding dimension information that only appears in the first data table but does not appear in the second data table to the second data table when the number of kinds of dimension information included in the first data table is greater than the number of kinds of dimension information included in the second data table;
adding dimension information that appears only in the second data but not in the first data table to the first data table when the number of kinds of dimension information included in the second data table is greater than the number of kinds of dimension information included in the first data table.
Optionally, after the number of kinds of dimension information included in the first data table is equal to the number of kinds of dimension information included in the second data table, before calculating the standard data in the two data tables, the method further includes:
detecting whether each standard data in each data table comprises the time information; and
and when at least one standard data in the data table does not comprise the time information, not using the standard data in the data table for data calculation.
In a second aspect, to achieve the above object, the present application provides a data generating apparatus, comprising:
an acquisition unit, configured to acquire a target data record for recording an object, where the object includes a behavior, an event, or a state;
a determining unit, configured to determine first dimension information, feature information, and time information in the target data record, where the first dimension information is first information in the target data record, the feature information is information that can measure the first dimension information, and the time information is generation time, release time, or user acquisition time related to the first dimension information;
a generating unit configured to generate standard data including the first dimension information, the feature information, and the time information.
Optionally, the apparatus further comprises:
a detecting unit, configured to detect whether there is first standard data and second standard data in standard data belonging to the same data table after generating standard data including the first dimension information, the feature information, and the time information, where the time information in the first standard data is the same as the time information in the second standard data, and the first dimension information in the first standard data is the same as the first dimension information in the second standard data;
And the statistical unit is used for performing data calculation by using the first standard data and the second standard data if the first standard data and the second standard data do not exist.
Optionally, the apparatus further comprises:
an adding unit, configured to add, after detecting whether first standard data and second standard data exist in standard data belonging to the same data table, second dimension information in a target data record corresponding to each standard data to the standard data if the first standard data and the second standard data exist, where the second dimension information is second information used for representing the object in the target data record;
the detection unit is further configured to detect whether second dimension information of the first standard data is the same as second dimension information of the second standard data;
and the statistical unit is also used for calculating data by using each standard data added with the second dimension information in the data table if the standard data are different.
Alternatively, the first and second liquid crystal display panels may be,
the adding unit is further configured to, after detecting whether second dimension information of the first standard data is the same as second dimension information of the second standard data, if the second dimension information of the first standard data is the same as the second dimension information of the second standard data, add third dimension information in a target data record corresponding to each standard data to the standard data, where the third dimension information is third information used for representing the object in the target data record;
The detection unit is further configured to detect whether third dimension information of the first standard data is the same as third dimension information of the second standard data, and so on until each standard data in the data table meets a condition for performing data calculation.
Optionally, the apparatus further comprises:
the judging unit is used for judging whether the time information of each standard data in the same data table accords with a preset time acquisition rule or not after each standard data in the data table meets the condition of data calculation;
the extraction unit is used for extracting the time information of each standard data in the data table according to a time format corresponding to the time information of each standard data in the data table if the time information of each standard data in the data table is not matched with the time format of each standard data in the data table;
and the time unit of the time information of each piece of standard data in the extracted data table is larger than the time unit of the time information of each piece of original standard data in the data table.
Optionally, the apparatus further comprises:
and the frequency unifying unit is used for merging or splitting the standard data in the first table data according to the sequence of the time information by taking the specified acquisition frequency in the second specified duration as a standard before calculating the standard data in the two data tables, and merging or splitting the standard data in the second table data according to the sequence of the time information so that the time information of the standard data after being split or merged in the first table data and the second table data meets the specified acquisition frequency in the second specified duration.
Alternatively, the first and second liquid crystal display panels may be,
the frequency unifying unit is used for correspondingly combining and calculating the characteristic information of original standard data combined into one standard data in the first table data according to the characteristic of the first dimension information of the original standard data after the first table data are combined; after the second table data is merged, the feature information of the original standard data merged into one standard data in the second table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data; when each standard data in the first data table is split, each split standard data in the first data table respectively comprises time information and first dimension information, and the characteristic information of the original standard data is added to one standard data in the standard data split from the original standard data; and after splitting each standard data in the second data table, each split standard data in the second data table respectively comprises time information and first dimension information, and the feature information of the original standard data is added to one standard data in the standard data split from the original standard data.
Optionally, the apparatus further comprises:
a comparison unit, configured to, when the time information of each standard data belonging to the same data table meets a preset time acquisition rule and before calculating the standard data in the two data tables, compare the time information included in the first data table with the time information included in the second data table when the number of the standard data in the first data table is the same as the number of the standard data in the second data table within the first specified time period, to determine first time information that is present only in the first data table but not in the second data table, and determine second time information that is present only in the second data table but not in the first data table;
a supplementing unit, configured to add the first time information and the first dimension information corresponding to the first time information to the second data table, and add the second time information and the first dimension information corresponding to the second time information to the first data table.
Alternatively,
the supplementing unit is further configured to, after adding the first time information and the first dimension information corresponding to the first time information to the second data table and adding the second time information and the first dimension information corresponding to the second time information to the first data table, add dimension information that is present only in the first data and that is not present in the second data table to the second data table when the number of kinds of dimension information included in the first data table is greater than the number of kinds of dimension information included in the second data table; and adding dimension information, which is present only in the second data but not in the first data table, to the first data table when the number of kinds of dimension information included in the second data table is greater than the number of kinds of dimension information included in the first data table.
Alternatively, the first and second liquid crystal display panels may be,
the detection unit is further configured to detect whether each piece of standard data in each data table includes the time information before calculating the standard data in the two data tables after the number of types of dimension information included in the first data table is equal to the number of types of dimension information included in the second data table; and when at least one standard data in the data table does not comprise the time information, not using the standard data in the data table to perform data calculation.
In a third aspect, to achieve the above object, the present application provides a computer-readable storage medium storing a computer program which, when executed in a computer processor, implements the steps of the method as described in any one of the above.
The technical scheme provided by the embodiment of the application can have the following beneficial effects:
in the application, after a target data record for recording an object is obtained, first dimension information, feature information and time information in the target data record are determined, and then standard data including the first dimension information, the feature information and the time information are generated, wherein the first dimension information, the feature information and the time information can describe one target data record in detail, so that the generated standard data can completely express the target data record, and when the number of the target data records is multiple, the data structures of a plurality of standard data corresponding to the plurality of target data records are the same, so that data calculation can be performed by using the plurality of standard data.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this application, serve to provide a further understanding of the application and to enable other features, objects, and advantages of the application to be more apparent. The drawings and their description illustrate the embodiments of the invention and do not limit it. In the drawings:
fig. 1 is a schematic flow chart of a data generation method provided in the present application;
FIG. 2 is a schematic diagram of a data table provided herein;
FIG. 3 is a schematic flow chart diagram of another data generation method provided herein;
FIG. 4 is a schematic flow chart diagram of another data generation method provided herein;
FIG. 5 is a schematic diagram of another data table provided herein;
FIG. 6 is a schematic diagram of another data table provided herein;
FIG. 7 is a schematic diagram of another data table provided herein;
FIG. 8 is a schematic flow chart diagram of another data generation method provided herein;
FIG. 9 is a schematic diagram of another data table provided herein;
FIG. 10 is a schematic illustration of another data table provided herein;
FIG. 11 is a schematic flow chart diagram of another data generation method provided herein;
FIG. 12 is a schematic diagram of another data table provided herein;
FIG. 13 is a schematic illustration of another data table provided herein;
FIG. 14 is a schematic illustration of another data table provided herein;
FIG. 15 is a schematic illustration of another data table provided herein;
FIG. 16 is a schematic illustration of another data table provided herein;
FIG. 17 is a schematic illustration of another data table provided herein;
FIG. 18 is a schematic illustration of another data table provided herein;
FIG. 19 is a schematic flow chart diagram of another data generation method provided herein;
FIG. 20 is a schematic illustration of another data table provided herein;
FIG. 21 is a schematic illustration of another data table provided herein;
FIG. 22 is a schematic illustration of another data table provided herein;
FIG. 23 is a schematic illustration of another data table provided herein;
FIG. 24 is a schematic illustration of another data table provided herein;
FIG. 25 is a schematic illustration of another data table provided herein;
FIG. 26 is a schematic flow chart diagram of another data generation method provided herein;
fig. 27 is a schematic structural diagram of a data generating apparatus provided in the present application;
FIG. 28 is a schematic diagram of another data generating apparatus provided in the present application;
fig. 29 is a schematic structural diagram of another data generating apparatus provided in the present application;
FIG. 30 is a schematic diagram of another data generating apparatus provided in the present application;
FIG. 31 is a schematic diagram of another data generating apparatus provided in the present application; and
fig. 32 is a schematic structural diagram of another data generation apparatus provided in the present application.
Detailed Description
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments in the present application without making any creative effort shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It should be understood that the data so used may be interchanged under appropriate circumstances in order to facilitate the description of the embodiments of the application herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
In the present application, the terms "upper", "lower", "left", "right", "front", "rear", "top", "bottom", "inner", "outer", "middle", "vertical", "horizontal", "lateral", "longitudinal", and the like indicate an orientation or positional relationship based on the orientation or positional relationship shown in the drawings. These terms are used primarily to better describe the present application and its embodiments, and are not used to limit the indicated devices, elements or components to a particular orientation or to be constructed and operated in a particular orientation.
Moreover, some of the above terms may be used to indicate other meanings besides the orientation or positional relationship, for example, the term "on" may also be used to indicate some kind of attachment or connection relationship in some cases. The specific meaning of these terms in this application will be understood by those of ordinary skill in the art as the case may be.
Furthermore, the terms "mounted," "disposed," "provided," "connected," and "sleeved" are to be construed broadly. For example, it may be a fixed connection, a removable connection, or a unitary construction; can be a mechanical connection, or an electrical connection; may be directly connected, or indirectly connected through intervening media, or may be in internal communication between two devices, elements or components. The specific meaning of the above terms in the present application can be understood by those of ordinary skill in the art as the case may be.
It should be noted that, in the present application, the embodiments and features of the embodiments may be combined with each other without conflict. The present application will be described in detail below with reference to the accompanying drawings in conjunction with embodiments.
It should be noted in advance that the behaviors, events or states mentioned in this application may include events, financial events, personal physical index change events (such as physical index parameters during weight loss), object state changes (such as changes of water from solid state to liquid state to gas state), production behaviors (such as clothes production), and the like, and may also include other behaviors, events or states, where the behaviors, events or states that can be recorded by data may be used as objects, and the specific behaviors, events or states are not specifically limited herein.
It should be noted that the target data record for recording the object may be historical data, or may also be data recorded currently in real time, and the manner and the type of the obtained target data record may be set according to actual needs, which is not limited herein.
It should be noted again that the following examples are only used for assisting the description of the present application, and do not limit the present application, and the specific target data records, the standard data, and the standard data forming data table may be set according to actual needs, and are not limited in detail herein.
Fig. 1 is a schematic flow chart of a data generation method provided in the present application, and as shown in fig. 1, the method includes the following steps:
101. a target data record for a recording object is obtained.
Wherein the object comprises a behavior, an event, or a state.
102. First dimension information, characteristic information, and time information in the target data record are determined.
The first dimension information is first information used for representing the object in the target data record, the characteristic information is information capable of measuring the first dimension information, and the time information is generation time, release time or user acquisition time related to the first dimension information.
Specifically, after the target data record is acquired, first information in the target data record for indicating an object of the target data record may be designated as first dimension information, second dimension information in the following embodiments may designate second information in the target data record for indicating the object of the target data record as the second dimension information, and so on, third dimension information in the following embodiments may designate third information in the target data record for indicating the object of the target data record as the third dimension information, the feature information is information capable of measuring the first dimension information (i.e., information capable of forming a quantized record), and the time information is a generation time, a release time, or a user acquisition time related to the first dimension information.
For example, when a target data record includes the following contents: the suit produced by 7, 10 and 2018 is 345 suits, the brand of the suit is ABC, the shipment time of the suit fabric is 2018, 5, 31 and the time, the product (suit) can be designated as the first dimension information of the target data record and the brand (ABC) can be used as the second dimension information of the target data record, the shipment time of the suit fabric (2018, 5, 31 and 2018), the production time of the product (2018, 7, 10 and the time) and the yield of the product (345) can be used as the third dimension information and the time information.
It should be noted that when a plurality of target data records are formed for the same object, since they are for the same object, regardless of whether the data structures of the plurality of target data records are the same, the primary content included in the plurality of target data records is the same, and thus may be specified as being included in all of the plurality of target data records, and first information indicating the object is taken as first dimension information, second information indicating the object and included in the plurality of target data records is taken as second dimension information, third information indicating the object, which is included in each of the plurality of target data records, is set as third dimension information, information capable of measuring the first dimension information is set as feature information, and generation time, distribution time, or user acquisition time associated with the first dimension information is set as time information.
Take two target data records as an example, wherein the content of the first target data record includes: 345 suits of western-style clothes produced in 7 and 10 months in 2018, ABC of the brand of the western-style clothes, 5 and 31 months in 2018 for making fabrics, and the content recorded by the second target data record comprises: the western-style clothes produced by 24/7/2018 are 341 suits, the brand of the western-style clothes is CDE, the shipment time of the fabric for manufacturing the western-style clothes is 2018, 6/30/2018, ABC can be used as the first dimension information recorded by the two target data, CDE can be used as the second dimension information recorded by the second target data, 31/2018, 5/31/2018 can be used as the third dimension information recorded by the first target data, 30/6/2018 is used as the third dimension information recorded by the second target data, 345 is used as the feature information recorded by the first target data, 341 is used as the feature information recorded by the second target data, 10/7/2018 is used as the time information recorded by the first target data, and 24/7/2018 is used as the time information recorded by the second target data.
It should be noted that the first dimension information, the second dimension information, the third dimension information, the time information, and the feature information about a specific target data record or a plurality of target data records formed for the same object may be formulated according to actual needs, and are not specifically limited herein.
103. And generating standard data comprising the first dimension information, the characteristic information and the time information.
In the application, after a target data record for recording an object is acquired, first dimension information, feature information and time information in the target data record are determined, and then standard data including the first dimension information, the feature information and the time information are generated, wherein the first dimension information, the feature information and the time information can describe one target data record in detail, so that the generated standard data can completely express the target data record, and when the number of the target data records is multiple, the data structures of a plurality of standard data corresponding to the plurality of target data records are the same, so that data calculation can be performed by using the plurality of standard data.
Taking western-style clothes production as an example, the target data record is the western-style clothes, the content of the target data record is the production quantity of the western-style clothes in each day of the 7 th month in 2018, after the standard data corresponding to the target data record in each day is generated, as shown in fig. 2, fig. 2 is a schematic diagram of a data table provided by the application, as shown in fig. 2, each standard data can form a data table for producing the western-style clothes in the 7 th month in 2018, the data formats of each standard data in the data table are the same, and the production of the western-style clothes in the 7 th month in 2018 can be calculated statistically according to the data table.
Fig. 3 is a schematic flow chart of another data generation method provided in the present application, and as shown in fig. 3, after generating standard data including the first dimension information, the feature information, and the time information, the method further includes:
301. whether the first standard data and the second standard data exist in the standard data belonging to the same data table is detected, and if not, step 302 is executed.
The time information in the first standard data is the same as the time information in the second standard data, and the first dimension information in the first standard data is the same as the first dimension information in the second standard data.
In particular, since the contents of the standard data records belonging to the same data table are different contents of the same object, as the specific case may refer to fig. 2, therefore, there must be a difference in the contents of the standard data records belonging to the same data table, and in order to make each standard data belonging to the same data table unique, it is necessary to detect whether the time information of the first standard data and the second standard data belonging to the same data table is the same, and detecting whether first dimension information of the first standard data and the second standard data belonging to the same data table is the same, if they are the same, it indicates that the same standard data exists in the data table (the data table has non-unique standard data), and if they are not the same (refer to the data table shown in fig. 2), it indicates that the same standard data exists in the data table (each standard data in the data table has uniqueness).
302. Performing data calculation using the first standard data and the second standard data.
Specifically, when the first standard data and the second standard data have uniqueness, the data calculation may be performed by using the standard data in the data table, and it should be noted that how to perform the data calculation by using the standard data in the data table may be set according to actual needs, and is not limited herein.
In a possible embodiment, fig. 4 is a schematic flowchart of another data generation method provided by the present application, based on fig. 3, if there is a difference, that is, time information in the first standard data is the same as time information in the second standard data, and first dimension information in the first standard data is the same as first dimension information in the second standard data, taking fig. 5 as an example, fig. 5 is a schematic diagram of another data table provided by the present application, as shown in fig. 5, time information and first dimension information of two standard data marked by black frames are the same, and in order to make the standard data in the data table unique, as shown in fig. 4, the method further includes the following steps:
401. and adding the second dimension information in the target data record corresponding to each standard data into the standard data.
Wherein the second dimension information is second information in the target data record for representing the object.
As to the second dimension information, detailed description is given above, and details are not repeated here, on the basis of fig. 5, a data table to which standard data after the second dimension information is added is shown in fig. 6 or fig. 7, fig. 6 is a schematic diagram of another data table provided by the present application, and fig. 7 is a schematic diagram of another data table provided by the present application, where the standard data in the data table shown in fig. 6 has uniqueness, and the standard data in the data table shown in fig. 7 does not have uniqueness, where as shown in fig. 6 and fig. 7, the fabric stocking time is used as the second dimension information, and of course, other second information used for representing the object in the target data record may also be used as the second dimension information, and the specific second dimension information may be determined according to actual needs, and is not specifically limited herein.
402. Detecting whether the second dimension information of the first standard data is the same as the second dimension information of the second standard data, if not, executing step 403.
Taking fig. 6 or fig. 7 as an example, the time information and the first dimension information of the two standard data marked by black line frames have been determined to be the same before, in order to further determine whether the two standard data have uniqueness, only the second dimension data needs to be compared at this time, if the second dimension information of the two standard data is the same, it is described that the two standard data have no uniqueness (as shown in fig. 7), and if the second dimension information of the two standard data is not the same, it is described that the two standard data have uniqueness (as shown in fig. 6).
403. And performing data calculation by using each standard data added with the second dimension information in the data table.
Taking fig. 6 as an example, each standard data in the data table shown in fig. 6 has uniqueness, and in this case, data calculation can be performed by using the standard data in the data table, and it should be noted that how to perform data calculation by using the standard data in the data table can be set according to actual needs, and is not limited in particular here.
In a possible embodiment, fig. 8 is a schematic flowchart of another data generation method provided by the present application, and on the basis of fig. 4, if the two standard data are the same, that is, the second dimension information in the first standard data is the same as the second dimension information in the second standard data, specifically as shown in fig. 7, the time information, the first dimension information, and the second dimension information of the two standard data marked by black line frames are all the same, as shown in fig. 8, in order to make the standard data in the data table have uniqueness, the method further includes the following steps:
801. and adding the third dimension information in the target data record corresponding to each standard data into the standard data.
Wherein the third dimension information is third information used for representing the object in the target data record.
As to the third dimension information, detailed descriptions are given above, and details are not repeated here, on the basis of fig. 7, a data table to which standard data after the third dimension information is added is shown in fig. 9 or fig. 10, fig. 9 is a schematic diagram of another data table provided by the present application, and fig. 10 is a schematic diagram of another data table provided by the present application, where the standard data in the data table shown in fig. 9 has uniqueness, and the standard data in the data table shown in fig. 10 does not have uniqueness, where as shown in fig. 9 and fig. 10, a western suit brand is used as the third dimension information, and of course, other third information used for representing the object in the target data record may also be used as the third dimension information, and specific third dimension information may be determined according to actual needs, and is not specifically limited herein.
802. And detecting whether the third dimension information of the first standard data is the same as that of the second standard data or not, and repeating the steps until the standard data in the data table meet the condition for data calculation.
Taking fig. 9 or fig. 10 as an example, when the data table after adding the third dimension information is shown in fig. 9, the time information, the first dimension information, and the second dimension information of the two standard data marked by the black frame are all the same, but the third dimension information of the two standard data is different, and at this time, the two standard data have uniqueness in the data table, so that data calculation can be performed on the data table, and when the data table after adding the third dimension information is shown in fig. 10, the time information, the first dimension information, the second dimension information, and the third dimension information of the two standard data marked by the black frame are all the same, and at this time, the two standard data still have no uniqueness in the data table, and it is necessary to add the other dimension information in the target data record corresponding to the two standard data to the data table shown in fig. 10, for example, the other dimension information can be a sales channel, particularly whether western style clothes are self-operated or electric business sold, and data calculation can not be carried out by using the data table until the two standard data have different dimension information.
It should be noted that other dimension information added on the basis of fig. 10 may be set according to actual needs, and is not limited in particular herein.
In a possible embodiment, fig. 11 is a schematic flowchart of another data generation method provided in the present application, and as shown in fig. 11, after each standard data in the data table satisfies a condition for performing data calculation, the method further includes the following steps:
1101. and judging whether the time information of each standard data in the same data table accords with a preset time acquisition rule, if not, executing step 1102, and if so, directly using the data table to perform data calculation.
Specifically, the preset time collection rule may be collection according to a specified collection rule, that is, the time information of each standard data in one data table has a certain rule, so as to facilitate subsequent data calculation.
For example, fig. 12 is a schematic diagram of another data table provided by the present application, and fig. 13 is a schematic diagram of another data table provided by the present application, where time information of each standard data in the data table shown in fig. 12 has a certain rule, and therefore meets a preset time acquisition rule, and time information of each standard data in the data table shown in fig. 13 does not have a certain rule, and therefore does not meet the preset time acquisition rule, it should be noted that the rule shown in fig. 12 is merely an exemplary description, and certainly, the rule of the time information may be in other forms, and a specific acquisition rule may be set according to actual needs, but time information with a certain regular time interval all meets the preset time acquisition rule.
1102. And extracting the time information of each standard data in the data table according to a time format corresponding to the time information of each standard data in the data table.
And the time unit of the time information of each piece of standard data in the extracted data table is larger than the time unit of the time information of each piece of original standard data in the data table.
Taking fig. 13 as an example, the time unit of the production time of the western-style clothes includes year, month, day, hour, minute and second, the time format corresponding to the time unit may be year, month and day, after the time information in fig. 13 is extracted, the obtained data table is shown in fig. 14, fig. 14 is a schematic diagram of another data table provided by the present application, as shown in fig. 14, the time information of each standard data in the data table has a certain rule, and conforms to the preset time acquisition rule.
If the time unit of the production time of the western-style clothes includes year, month and day, the time format corresponding to the time unit can be year and month so as to enable the time information to conform to a certain rule, and the specific extraction mode can be extracted according to the actual time unit, which is not limited specifically herein.
It should be noted that, taking fig. 13 as an example, the extraction in this application refers to directly deleting time minutes and seconds, or acquiring the year, month and day in the time information and then replacing the original time information with the acquired year, month and day, and specifically, what manner is adopted may be set according to actual needs, and is not specifically limited herein.
In one possible embodiment, the time information of each standard data belonging to the same data table conforms to the preset time acquisition rule, before the standard data in the two data tables are calculated, when the number of the standard data in the first data table is different from that in the second data table in the first specified time period, the acquisition frequency specified in the second specified time period can be used as a standard, merging or splitting each standard data in the first table data according to the sequence of the time information, and merging or splitting each standard data in the second table data according to the sequence of the time information, so that the time information of each standard data after being split or combined in the first table data and the second table data meets the acquisition frequency specified within the second specified time length.
For example, fig. 15 is a schematic diagram of another data table provided by the present application, and fig. 16 is a schematic diagram of another data table provided by the present application, as shown in fig. 15 and fig. 16, the first specified time duration is day, the number of standard data in the first data table shown in fig. 15 is 1 in one day, and the number of standard data in the first data table shown in fig. 16 is 2.
When the standard is acquired once in a second specified time period of one day, the set acquisition mode is the same as the acquisition mode shown in fig. 15, so the data table shown in fig. 15 may be unchanged, the standard data in the data table shown in fig. 16 needs to be merged, and the rainfall amount may be superposed, so the data table after merging the standard data is shown in fig. 17, and fig. 17 is a schematic diagram of another data table provided by the present application.
When the standard is that the standard is acquired twice within a second specified time period of one day, because the set acquisition mode is the same as the acquisition mode shown in fig. 16, the data table shown in fig. 16 may not be changed, and the standard data in the data table shown in fig. 15 needs to be split, so the data table after the standard data are split is shown in fig. 18, and fig. 18 is a schematic diagram of another data table provided by the present application.
It should be noted that the acquisition frequency specified in the second specified duration is only an exemplary description, and does not limit the present application, and the specific acquisition frequency specified in the second specified duration may be set according to actual needs, and after the acquisition frequency specified in the second specified duration is set, the standard data in the first table data and the second table data need to be split and combined according to the acquisition frequency specified in the set second specified duration, and details are not described in detail herein.
In a possible embodiment, as shown in fig. 15 to fig. 18, after the first table data is merged, the feature information of the original standard data merged into one standard data in the first table data is correspondingly merged according to the characteristic of the first dimension information of the original standard data; after the second table data are merged, the characteristic information of the original standard data merged into one standard data in the second table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data; after the standard data in the first data table are split, the split standard data in the first data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data; after the standard data in the second data table are split, the split standard data in the second data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data.
In a possible embodiment, fig. 19 is a schematic flow chart of another data generation method provided in the present application, where before calculating the standard data in the two data tables when the time information of each standard data in the same data table meets a preset time collection rule, and when the number of standard data in the first data table is the same as the number of standard data in the second data table within the first specified time period, as shown in fig. 19, the method further includes:
1901. comparing the time information included in the first data table with the time information included in the second data table to determine first time information only appearing in the first data table but not appearing in the second data table, and determining second time information only appearing in the second data table but not appearing in the first data table.
1902. Adding the first time information and the first dimension information corresponding to the first time information to the second data table, and adding the second time information and the first dimension information corresponding to the second time information to the first data table.
Taking fig. 15 and 17 as an example, in the first data table shown in fig. 15, the number of standard data is the same as that in the second data table shown in fig. 17, but in the first data table shown in fig. 15, the suit production time is 2018, month 7, month 19 to 2018, month 7, month 30, and is not present in the second data table shown in fig. 17, the weather forecast time is 2018, month 7, month 10 and 2018, month 7, month 11, and is not present in the first data table shown in fig. 15, in order to enable data calculation of the first data table shown in fig. 15 and the second data table shown in fig. 17, the first data table shown in fig. 15 and the second data table shown in fig. 17 need to be adjusted so that the same time information is included in the first data table shown in fig. 15 and the second data table shown in fig. 17, fig. 20 may be shown after the first data table shown in fig. 15 is adjusted, fig. 21 may be shown after the second data table shown in fig. 17 is adjusted, fig. 20 is a schematic diagram of another data table provided in the present application, and fig. 21 is a schematic diagram of another data table provided in the present application.
Taking fig. 16 and 18 as an example, the number of standard data in the first data table shown in fig. 16 and the number of standard data in the second data table shown in fig. 18 are the same in one day, but only the time information shown in fig. 16 but not the time information shown in fig. 18 and only the time information shown in fig. 18 but not the time information shown in fig. 16 are shown in fig. 16 and 18, in order to perform data calculation on the first data table shown in fig. 16 and the second data table shown in fig. 18, it is necessary to adjust the first data table shown in fig. 16 and the second data table shown in fig. 18 so that the first data table shown in fig. 16 and the second data table shown in fig. 18 have the same time information, the first data table shown in fig. 16 may be adjusted as shown in fig. 22, and the second data table shown in fig. 18 may be adjusted as shown in fig. 23, fig. 22 is a schematic diagram of another data table provided in the present application, and fig. 23 is a schematic diagram of another data table provided in the present application.
It should be noted that fig. 15 to 18 and fig. 20 to 23 are only exemplary illustrations and do not limit the present application.
In a possible embodiment, after adding the first time information and the first dimension information corresponding to the first time information to the second data table and adding the second time information and the first dimension information corresponding to the second time information to the first data table, when the number of kinds of dimension information included in the first data table is greater than the number of kinds of dimension information included in the second data table, adding dimension information that only appears in the first data table but does not appear in the second data table to the second data table; adding dimension information that appears only in the second data but not in the first data table to the first data table when the number of kinds of dimension information included in the second data table is greater than the number of kinds of dimension information included in the first data table.
Specifically, in order to make the standard data in one data table unique, it may be necessary to add second dimension information and third dimension information to the standard data, and fourth dimension information may also be added, regarding the reason for adding the second dimension information and the third dimension information, even the fourth dimension information, in the above detailed description, details are not repeated herein, and in order to perform data calculation on two data tables, it is necessary to make the types of dimension information in the two data tables the same, that is, when one data table has only the first dimension information and the second dimension information, the other data table also needs to have only the first dimension information and the second dimension information, if one data table has only the first dimension information and the second dimension information and the other data table also has only the first dimension information, the above-mentioned second dimension information needs to be added to the standard data of the other data table.
For example, fig. 24 is a schematic diagram of another data table provided by the present application, as shown in fig. 2 and fig. 24, fig. 2 is a data table for western costume production, fig. 24 is a data table for western costume sales, and fig. 2 only includes the first dimension information, and fig. 24 includes not only the first dimension information but also the second dimension information, at this time, it is necessary to add the second dimension information in the data table shown in fig. 24 to the data table shown in fig. 2, the data table of the data table shown in fig. 2 after adding the second dimension information may be as shown in fig. 25, and fig. 25 is a schematic diagram of another data table provided by the present application.
It should be noted that, when each standard data belonging to the same data table is unique, and when the time information of each standard data belonging to the same data table meets the preset time acquisition rule, and the number of standard data in the first data table is the same as the number of standard data in the second data table within the first specified time period, the dimension information is added to the first data table according to the corresponding relationship between each standard data in the first data table and each standard data in the second data table, or when the dimension information is added to the second data table, the dimension information is added according to the corresponding relationship between each standard data in the second data table and each standard data in the first data table.
In a possible embodiment, fig. 26 is a schematic flow chart of another data generation method provided in the present application, after the number of types of dimension information included in the first data table is equal to the number of types of dimension information included in the second data table, and before the standard data in the two data tables are calculated, in order to ensure that the standard data in each data table can be subjected to data calculation, it is required to determine that each standard data in the two data tables has time information, and in order to achieve the above purpose, as shown in fig. 26, the method further includes the following steps:
2601. it is detected whether the time information is included in each standard data in each data table.
2602. When at least one standard data in the data table does not include the time information, the standard data in the data table is not used for data calculation.
Specifically, when the standard data of each data table has time information, data calculation may be performed using the two data tables, and when at least one standard data does not have time information, data calculation may not be performed using the standard data in the data table.
Fig. 27 is a schematic structural diagram of a data generating apparatus provided in the present application, and as shown in fig. 27, the apparatus includes:
an obtaining unit 2701, configured to obtain a target data record for recording an object, where the object includes a behavior, an event, or a state;
a determining unit 2702, configured to determine first dimension information, feature information and time information in the target data record, where the first dimension information is first information used for representing the object in the target data record, the feature information is information capable of measuring the first dimension information, and the time information is generation time, release time or user acquisition time related to the first dimension information;
a generating unit 2703, configured to generate standard data including the first dimension information, the feature information, and the time information.
In a possible embodiment, fig. 28 is a schematic structural diagram of another data generating apparatus provided in the present application, and as shown in fig. 28, the apparatus further includes:
a detecting unit 2704, configured to detect whether there are first standard data and second standard data in standard data belonging to the same data table after generating standard data including the first dimension information, the feature information, and the time information, where the time information in the first standard data is the same as the time information in the second standard data, and the first dimension information in the first standard data is the same as the first dimension information in the second standard data;
A statistical unit 2705, configured to perform data calculation using the first standard data and the second standard data if the first standard data and the second standard data do not exist.
In a possible embodiment, fig. 29 is a schematic structural diagram of another data generating apparatus provided in the present application, and as shown in fig. 29, the apparatus further includes:
an adding unit 2706, configured to, after detecting whether there are first standard data and second standard data in standard data belonging to the same data table, if there are first standard data and second standard data, add second dimension information in a target data record corresponding to each standard data to the standard data, where the second dimension information is second information used for representing the object in the target data record;
the detecting unit 2704 is further configured to detect whether the second dimension information of the first standard data is the same as the second dimension information of the second standard data;
the statistical unit 2705 is further configured to perform data calculation using each standard data added with the second dimension information in the data table if the standard data are not the same.
In one possible embodiment, as shown in figure 29,
the adding unit 2706 is further configured to, after detecting whether the second dimension information of the first standard data is the same as the second dimension information of the second standard data, if the second dimension information of the first standard data is the same as the second dimension information of the second standard data, add third dimension information in a target data record corresponding to each standard data to the standard data, where the third dimension information is third information used for representing the object in the target data record;
The detecting unit 2704 is further configured to detect whether third dimension information of the first standard data is the same as third dimension information of the second standard data, and so on until each standard data in the data table meets a condition for performing data calculation.
In a possible embodiment, fig. 30 is a schematic structural diagram of another data generating apparatus provided in the present application, and as shown in fig. 30, the apparatus further includes:
the judging unit 2707 is configured to judge whether time information of each standard data belonging to the same data table meets a preset time acquisition rule after each standard data in the data table meets a condition for performing data calculation;
an extracting unit 2708, configured to, if the time information of each standard data in the data table does not match the time format of the time format corresponding to the time information of each standard data in the data table, extract the time information of each standard data in the data table;
and the time unit of the time information of each piece of standard data in the extracted data table is larger than the time unit of the time information of each piece of original standard data in the data table.
In a possible embodiment, fig. 31 is a schematic structural diagram of another data generating apparatus provided in the present application, and as shown in fig. 31, the apparatus further includes:
A frequency unifying unit 2709, configured to, if the standard data in the two data tables are met, when the number of the standard data in the first data table is different from the number of the standard data in the second data table within a first specified time period before the standard data in the two data tables are calculated, merge or split the standard data in the first table data according to the sequence of the time information by using the acquisition frequency specified within a second specified time period as a standard, and merge or split the standard data in the second table data according to the sequence of the time information, so that the time information of the standard data after being split or merged in the first table data and the second table data satisfies the acquisition frequency specified within the second specified time period.
In a possible embodiment, the frequency unification unit 2709 is configured to, after merging the first table data, perform corresponding merging calculation on feature information of original standard data merged into one standard data in the first table data according to a characteristic of first dimension information of the original standard data; after the second table data are merged, the feature information of the original standard data merged into one standard data in the second table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data; when each standard data in the first data table is split, each split standard data in the first data table respectively comprises time information and first dimension information, and the characteristic information of the original standard data is added to one standard data in the standard data split from the original standard data; and after splitting each standard data in the second data table, each split standard data in the second data table respectively comprises time information and first dimension information, and the feature information of the original standard data is added to one standard data in the standard data split from the original standard data.
In a possible embodiment, fig. 32 is a schematic structural diagram of another data generating apparatus provided in the present application, and as shown in fig. 32, the apparatus further includes:
a comparing unit 2710, configured to, when the time information of each standard data belonging to the same data table meets a preset time acquisition rule and before calculating the standard data in the two data tables, compare the time information included in the first data table with the time information included in the second data table when the number of the standard data in the first data table is the same as the number of the standard data in the second data table within the first specified time period, to determine first time information that is present only in the first data table but not in the second data table, and determine second time information that is present only in the second data table but not in the first data table;
a supplementing unit 2711, configured to add the first time information and the first dimension information corresponding to the first time information to the second data table, and add the second time information and the first dimension information corresponding to the second time information to the first data table.
In a possible embodiment, the supplementing unit 2711 is further configured to, after adding the first time information and the first dimension information corresponding to the first time information to the second data table and adding the second time information and the first dimension information corresponding to the second time information to the first data table, add the dimension information that is present only in the first data and not present in the second data table to the second data table when the number of kinds of the dimension information included in the first data table is greater than the number of kinds of the dimension information included in the second data table; and adding dimension information, which is present only in the second data but not in the first data table, to the first data table when the number of kinds of dimension information included in the second data table is greater than the number of kinds of dimension information included in the first data table.
In a possible embodiment, the detecting unit 2704 is further configured to detect whether the time information is included in each standard data in each data table before calculating the standard data in the two data tables after the number of kinds of dimension information included in the first data table is equal to the number of kinds of dimension information included in the second data table; and when at least one standard data in the data table does not include the time information, not using the standard data in the data table for data calculation.
The specific manner in which each unit in the above embodiments performs the operation has been described in detail in the embodiments related to the method, and will not be described in detail here.
In the application, after a target data record for recording an object is obtained, first dimension information, feature information and time information in the target data record are determined, and then standard data including the first dimension information, the feature information and the time information are generated, wherein the first dimension information, the feature information and the time information can describe one target data record in detail, so that the generated standard data can completely express the target data record, and when the number of the target data records is multiple, the data structures of a plurality of standard data corresponding to the plurality of target data records are the same, so that data calculation can be performed by using the plurality of standard data.
The present application also provides a computer-readable storage medium storing a computer program which, when executed in a computer processor, performs the steps of any of the methods described above.
The above description is only a preferred embodiment of the present application and is not intended to limit the present application, and various modifications and changes may be made to the present application by those skilled in the art. Any modification, equivalent replacement, improvement and the like made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (15)

1. A method of generating data, comprising:
obtaining a target data record for recording an object, wherein the object comprises a behavior, an event or a state;
determining first dimension information, feature information and time information in the target data record, wherein the first dimension information is first information used for representing the object in the target data record, the feature information is information capable of measuring the first dimension information, and the time information is generation time, release time or user acquisition time related to the first dimension information; and
generating standard data including the first dimension information, the feature information, and the time information;
after generating the standard data including the first dimension information, the feature information, and the time information, the method further includes:
Detecting whether first standard data and second standard data exist in standard data belonging to the same data table, wherein time information in the first standard data is the same as time information in the second standard data, and first dimension information in the first standard data is the same as first dimension information in the second standard data; and
if not, performing data calculation by using the first standard data and the second standard data;
after detecting whether the first standard data and the second standard data exist in the standard data belonging to the same data table, the method further includes:
if the standard data exists, adding second dimension information in a target data record corresponding to each standard data into the standard data, wherein the second dimension information is second information used for representing the object in the target data record;
detecting whether second dimension information of the first standard data is the same as second dimension information of the second standard data; and
if not, performing data calculation by using each standard data added with the second dimension information in the data table;
after detecting whether the second dimension information of the first standard data is the same as the second dimension information of the second standard data, the method further includes:
If the standard data are the same, adding third dimension information in a target data record corresponding to each standard data into the standard data, wherein the third dimension information is third information used for representing the object in the target data record; and
and detecting whether the third dimension information of the first standard data is the same as that of the second standard data or not, and repeating the steps until the standard data in the data table meet the condition for data calculation.
2. The method of claim 1, wherein after each criterion data in the data table satisfies a condition for performing a data calculation, the method further comprises:
judging whether the time information of each standard data in the same data table accords with a preset time acquisition rule or not; and
if not, extracting the time information of each standard data in the data table according to a time format corresponding to the time information of each standard data in the data table;
and the time unit of the time information of each piece of standard data in the extracted data table is larger than the time unit of the time information of each piece of original standard data in the data table.
3. The method of claim 2, wherein if true, prior to calculating the standard data in the two data tables, when the number of standard data in the first data table is not the same as the number of standard data in the second data table for a first specified length of time, the method further comprises:
and combining or splitting each standard data in the first table data according to the sequence of the time information by taking the specified acquisition frequency within a second specified duration as a standard, and combining or splitting each standard data in the second table data according to the sequence of the time information so as to enable the time information of each standard data after splitting or combining in the first table data and the second table data to meet the specified acquisition frequency within the second specified duration.
4. The method of claim 3,
after the first table data are merged, performing corresponding merging calculation on the feature information of the original standard data merged into one standard data in the first table data according to the characteristic of the first dimension information of the original standard data;
after the second table data are merged, the characteristic information of the original standard data merged into one standard data in the second table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data;
After the standard data in the first data table are split, the split standard data in the first data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data;
after the standard data in the second data table are split, the split standard data in the second data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data.
5. The method of claim 4, wherein when the time information of each standard data belonging to the same data table meets a preset time acquisition rule and before the standard data in the two data tables are calculated, the number of standard data in the first data table is the same as the number of standard data in the second data table within the first specified time period, the method further comprises:
comparing time information included in a first data table with time information included in a second data table to determine first time information only appearing in the first data table but not appearing in the second data table and determine second time information only appearing in the second data table but not appearing in the first data table; and
Adding the first time information and the first dimension information corresponding to the first time information to the second data table, and adding the second time information and the first dimension information corresponding to the second time information to the first data table.
6. The method of claim 5, wherein after adding the first time information and the first dimension information corresponding to the first time information to the second data table and adding the second time information and the first dimension information corresponding to the second time information to the first data table, the method further comprises:
adding dimension information, which only appears in the first data table but does not appear in the second data table, to the second data table when the number of kinds of dimension information included in the first data table is greater than the number of kinds of dimension information included in the second data table;
adding dimension information that only appears in the second data table but does not appear in the first data table to the first data table when the number of kinds of dimension information included in the second data table is greater than the number of kinds of dimension information included in the first data table.
7. The method of claim 6, wherein after the number of categories of dimension information included in the first data table is equal to the number of categories of dimension information included in the second data table, prior to calculating the standard data in the two data tables, the method further comprises:
detecting whether each standard data in each data table comprises the time information; and
when at least one standard data in the data table does not include the time information, the standard data in the data table is not used for data calculation.
8. An apparatus for generating data, the apparatus comprising:
an acquisition unit, configured to acquire a target data record for recording an object, where the object includes a behavior, an event, or a state;
a determining unit, configured to determine first dimension information, feature information, and time information in the target data record, where the first dimension information is first information in the target data record, the feature information is information capable of measuring the first dimension information, and the time information is generation time, release time, or user acquisition time related to the first dimension information;
A generating unit configured to generate standard data including the first dimension information, the feature information, and the time information;
the device further comprises:
a detection unit configured to detect whether there is first standard data and second standard data in standard data belonging to the same data table after generating standard data including the first dimension information, the feature information, and the time information, wherein the time information in the first standard data is the same as the time information in the second standard data, and the first dimension information in the first standard data is the same as the first dimension information in the second standard data;
a statistical unit for performing data calculation using the first standard data and the second standard data if not present;
the device further comprises:
an adding unit, configured to add, after detecting whether first standard data and second standard data exist in standard data belonging to the same data table, second dimension information in a target data record corresponding to each standard data to the standard data if the first standard data and the second standard data exist, where the second dimension information is second information used for representing the object in the target data record;
The detection unit is further configured to detect whether second dimension information of the first standard data is the same as second dimension information of the second standard data;
the statistical unit is also used for calculating data by using each standard data added with second dimension information in the data table if the standard data are different;
the adding unit is further configured to, after detecting whether second dimension information of the first standard data is the same as second dimension information of the second standard data, if the second dimension information of the first standard data is the same as the second dimension information of the second standard data, add third dimension information in a target data record corresponding to each standard data to the standard data, where the third dimension information is third information used for representing the object in the target data record;
the detection unit is further configured to detect whether third dimension information of the first standard data is the same as third dimension information of the second standard data, and so on until each standard data in the data table meets a condition for performing data calculation.
9. The apparatus of claim 8, wherein the apparatus further comprises:
the judging unit is used for judging whether the time information of each standard data in the same data table accords with a preset time acquisition rule or not after each standard data in the data table meets the condition of data calculation;
The extracting unit is used for extracting the time information of each standard data in the data table according to a time format corresponding to the time information of each standard data in the data table if the time information of each standard data in the data table is not matched with the time format of each standard data in the data table;
and the time unit of the time information of each piece of standard data in the extracted data table is larger than the time unit of the time information of each piece of original standard data in the data table.
10. The apparatus of claim 9, wherein the apparatus further comprises:
and the frequency unifying unit is used for merging or splitting the standard data in the first table data according to the sequence of the time information by taking the specified acquisition frequency in the second specified duration as a standard before calculating the standard data in the two data tables, and merging or splitting the standard data in the second table data according to the sequence of the time information so that the time information of the standard data after being split or merged in the first table data and the second table data meets the specified acquisition frequency in the second specified duration.
11. The apparatus of claim 10,
the frequency unifying unit is used for correspondingly combining and calculating the characteristic information of original standard data combined into one standard data in the first table data according to the characteristic of the first dimension information of the original standard data after the first table data are combined; after the second table data is merged, the feature information of the original standard data merged into one standard data in the second table data is correspondingly merged and calculated according to the characteristic of the first dimension information of the original standard data; when each standard data in the first data table is split, each split standard data in the first data table respectively comprises time information and first dimension information, and the characteristic information of the original standard data is added to one standard data in the standard data split from the original standard data; and when the standard data in the second data table are split, the split standard data in the second data table respectively comprise time information and first dimension information, and the feature information of the original standard data is added to one standard data in the split standard data of the original standard data.
12. The apparatus of claim 11, wherein the apparatus further comprises:
a comparison unit, configured to, when the time information of each standard data in the same data table meets a preset time collection rule and before the standard data in the two data tables are calculated, compare the time information included in the first data table with the time information included in the second data table when the number of the standard data in the first data table is the same as the number of the standard data in the second data table within the first specified time period, to determine first time information that is only present in the first data table but not present in the second data table, and determine second time information that is only present in the second data table but not present in the first data table;
a supplementing unit, configured to add the first time information and the first dimension information corresponding to the first time information to the second data table, and add the second time information and the first dimension information corresponding to the second time information to the first data table.
13. The apparatus of claim 12,
the supplementing unit is further configured to, after adding the first time information and the first dimension information corresponding to the first time information to the second data table and adding the second time information and the first dimension information corresponding to the second time information to the first data table, add dimension information that only appears in the first data table but does not appear in the second data table to the second data table when the number of kinds of dimension information included in the first data table is greater than the number of kinds of dimension information included in the second data table; and adding dimension information, which only appears in the second data table but does not appear in the first data table, to the first data table when the number of kinds of dimension information included in the second data table is greater than the number of kinds of dimension information included in the first data table.
14. The apparatus of claim 13,
the detection unit is further configured to detect whether each piece of standard data in each data table includes the time information before calculating the standard data in the two data tables after the number of types of dimension information included in the first data table is equal to the number of types of dimension information included in the second data table; and when at least one standard data in the data table does not comprise the time information, not using the standard data in the data table to perform data calculation.
15. A computer-readable storage medium, in which a computer program is stored which, when being executed in a computer processor, carries out the steps of the method according to any one of claims 1 to 7.
CN201810878545.0A 2018-08-03 2018-08-03 Data generation method, device and computer readable storage medium Active CN110795426B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810878545.0A CN110795426B (en) 2018-08-03 2018-08-03 Data generation method, device and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810878545.0A CN110795426B (en) 2018-08-03 2018-08-03 Data generation method, device and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN110795426A CN110795426A (en) 2020-02-14
CN110795426B true CN110795426B (en) 2022-07-19

Family

ID=69425977

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810878545.0A Active CN110795426B (en) 2018-08-03 2018-08-03 Data generation method, device and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN110795426B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5893125A (en) * 1995-01-27 1999-04-06 Borland International, Inc. Non-modal database system with methods for incremental maintenance
WO1999023585A1 (en) * 1997-11-05 1999-05-14 Amazon.Com, Inc. System and method for selecting rows from dimensional databases
US5970490A (en) * 1996-11-05 1999-10-19 Xerox Corporation Integration platform for heterogeneous databases
CN101799807A (en) * 2009-02-10 2010-08-11 中国移动通信集团公司 Heterogeneous data table merging method and system thereof

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE0004831L (en) * 2000-03-22 2001-09-23 Soeren Molin A data storage procedure and a data storage structure
US20030130894A1 (en) * 2001-11-30 2003-07-10 Alison Huettner System for converting and delivering multiple subscriber data requests to remote subscribers
CN100558045C (en) * 2006-08-07 2009-11-04 华为技术有限公司 A kind of system and method that generates communication customer description information
US8838549B2 (en) * 2008-07-07 2014-09-16 Chandra Bodapati Detecting duplicate records
GB0920644D0 (en) * 2009-11-25 2010-01-13 Geniedb System for improved record consistency and availability
US20110218937A1 (en) * 2010-03-03 2011-09-08 Jeremy Elser System and Methods for Management of Real Property
US10057349B2 (en) * 2015-11-12 2018-08-21 Facebook, Inc. Data stream consolidation in a social networking system for near real-time analysis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5893125A (en) * 1995-01-27 1999-04-06 Borland International, Inc. Non-modal database system with methods for incremental maintenance
US5970490A (en) * 1996-11-05 1999-10-19 Xerox Corporation Integration platform for heterogeneous databases
WO1999023585A1 (en) * 1997-11-05 1999-05-14 Amazon.Com, Inc. System and method for selecting rows from dimensional databases
CN101799807A (en) * 2009-02-10 2010-08-11 中国移动通信集团公司 Heterogeneous data table merging method and system thereof

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Research of the Access and Integration of Grid Database";Xiufen Fu 等;《2006 10th International Conference on Computer Supported Cooperative Work in Design》;20061204;全文 *
MySQL在健康相关数据清理中的应用;宋杰等;《中国卫生统计》;20170625(第03期);全文 *
基于ICG算法的有效数据生成方法;金鑫;《计算机工程》;20060220(第04期);全文 *

Also Published As

Publication number Publication date
CN110795426A (en) 2020-02-14

Similar Documents

Publication Publication Date Title
CN104216349B (en) Utilize the yield analysis system and method for the sensing data of manufacturing equipment
CN106845731A (en) A kind of potential renewal user based on multi-model fusion has found method
CN102216908B (en) Support the system, the method and apparatus that perform the action corresponding to detection event
US20150046249A1 (en) Method and system for measuring web advertising effect based on multiple-contact attribution model
KR101925506B1 (en) Method and apparatus for predicting the spread of an infectious disease
CN110457867A (en) A kind of time series data based on machine learning is filled up and restoring method
JP2017010277A (en) Work analysis system and work analysis method
CN115293498A (en) Data collection system for enterprise asset arrangement based on block chain
CN110795426B (en) Data generation method, device and computer readable storage medium
CN104318344A (en) Consumption characteristic-based product production assistant decision making method and system
CN105493050B (en) Data processing equipment and data processing method
CN107688476A (en) The methods of exhibiting and device of info web
CN104572747A (en) Cross option push method and cross option push system
KR101346868B1 (en) Method for estimating election based on population statistics of selected voter by hour
CN115730284A (en) Method, device, equipment and storage medium for controlling authority of report data
CN106476422B (en) Printing quality monitoring system with responsibility judgment function
CN112989988A (en) Information integration method, device, equipment, readable storage medium and program product
CN108337323B (en) Instrument metering data AR comparison system and method based on full terminal
CN108153863B (en) Video information representation method and device
CN112529182A (en) Model training method and device based on edge calculation
KR20170055379A (en) Purchase price forecasting methods for new developments utilizing the ERP database
CN111596619A (en) Computer system and facility monitoring method
JP2021022199A5 (en)
CN109117456A (en) Scoring abnormal distribution detection method based on tensor Robust Principal Component Analysis
CN109710649A (en) Circulation material recommended method and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant