CN110147352A - A kind of data processing method and device - Google Patents

A kind of data processing method and device Download PDF

Info

Publication number
CN110147352A
CN110147352A CN201710904814.1A CN201710904814A CN110147352A CN 110147352 A CN110147352 A CN 110147352A CN 201710904814 A CN201710904814 A CN 201710904814A CN 110147352 A CN110147352 A CN 110147352A
Authority
CN
China
Prior art keywords
data
dimension
file
combination
report
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710904814.1A
Other languages
Chinese (zh)
Inventor
葛婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201710904814.1A priority Critical patent/CN110147352A/en
Publication of CN110147352A publication Critical patent/CN110147352A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/174Form filling; Merging

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention provides a kind of data processing method and device, method includes: to receive at least two data files;The data at least two data file are polymerize based at least one dimension combination, generate data report corresponding with the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.It reports it can be seen that the application can be automatically generated based on different data files with dimension combination corresponding data, without artificial statistics, improves work efficiency.

Description

A kind of data processing method and device
Technical field
This application involves technical field of data processing, more particularly relate to a kind of data processing method and device.
Background technique
Currently, in data processing field, many data are stored in the form of a file, such as with excel text Part carrys out storing data, and accordingly, there exist the demands for counting the data in different files.
For example, in the technical field of search engine marketing (Search Engine Marketing, SEM), for a certain The front end data and Back end data store that task generates are in different files, when generating data report, need based on front end Data and Back end data are counted.Wherein, front end data is mainly the data obtained from media side, such as Baidu, 360, today Top news etc..Back end data mainly passes through the data that monitoring obtains.
And in the prior art, it needs manually to count the data from different files, very labor intensive and time, reduces Working efficiency.
Summary of the invention
In view of the above problems, it proposes on the present invention overcomes the above problem or at least be partially solved in order to provide one kind State a kind of data processing method and device of problem.
A kind of data processing method, comprising:
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with The corresponding data report of the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
Preferably, at least two data file includes identical data dimension;
This method further include:
Based on data dimension preassigned in the identical data dimension from least two data file will it is described to Few two data file mergencess are a data file;
Correspondingly, described carry out the data at least two data file based at least one dimension combination Polymerization generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, generate with it is described The corresponding data report of dimension combination.
Preferably, the dimension combination includes the first data dimension and the second data dimension according to third data dimension Combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, it is raw At data report corresponding with the dimension combination, comprising:
Based at least two data file, the data under first data dimension, second data dimension are established The corresponding relationship of the data under data and the third data dimension under degree;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described at least two data file Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated The data report of dimension and second data dimension about the third data dimension.
Preferably, further includes:
Generate first data dimension and second data dimension summarizes data report.
A kind of data processing equipment, comprising:
File unit is received, for receiving at least two data files;
Generate reporting unit, for based at least one dimension combination to the number at least two data file According to being polymerize, data report corresponding with the dimension combination is generated;Wherein, the dimension combination includes difference The combination of data dimension.
Preferably, at least two data file includes identical data dimension;The device further include:
File mergences unit, for based on preassigned from the identical data dimension of at least two data file At least two data file is merged into a data file by data dimension;
Correspondingly, the generation reporting unit is specifically used for based at least one dimension combination to the data after merging Data in file are polymerize, and data report corresponding with the dimension combination is generated.
Preferably, the dimension combination includes the first data dimension and the second data dimension according to third data dimension Combination;
The generation reporting unit, comprising:
First establishes module, for being based at least two data file, establishes the number under first data dimension According to the corresponding relationship of the data under data and the third data dimension under second data dimension;Wherein, each correspondence Relationship has uniqueness;
First generation module is used under each corresponding relationship, described to being different from least two data file Data under other data dimensions of first data dimension, second data dimension and the third data dimension are converged Always, the data report of first data dimension and second data dimension about the third data dimension is generated.
Preferably, the generation reporting unit further include:
Second generation module summarizes datagram for generate first data dimension and second data dimension It accuses.
A kind of storage medium, the storage medium include the program of storage, wherein in described program operation described in control Equipment where storage medium executes a kind of as above described in any item data processing methods.
A kind of processor, the processor execute as above described in any item for running program, when described program is run A kind of data processing method.
By above-mentioned technical proposal, the present invention provides a kind of data processing methods, comprising: receives at least two data text Part polymerize the data at least two data file based on dimension combination, and generation is combined with the dimension The corresponding data report of mode;Wherein, which includes the combination of different data dimension;It can be seen that the application It can be automatically generated based on different data files and be reported with dimension combination corresponding data, without artificial statistics, improved Working efficiency.
The above description is only an overview of the technical scheme of the present invention, in order to better understand the technical means of the present invention, And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can It is clearer and more comprehensible, the followings are specific embodiments of the present invention.
Detailed description of the invention
By reading the following detailed description of the preferred embodiment, various other advantages and benefits are common for this field Technical staff will become clear.The drawings are only for the purpose of illustrating a preferred embodiment, and is not considered as to the present invention Limitation.And throughout the drawings, use is identicalExamining symbol indicates identical component.In the accompanying drawings:
Fig. 1 shows a kind of flow diagram of data processing method disclosed in one embodiment of the invention;
Fig. 2 shows another embodiment of the present invention provides a kind of data processing method flow diagram;
Fig. 3 shows a kind of flow diagram of data processing method of further embodiment of this invention offer;
Fig. 4 shows a kind of structural schematic diagram of data processing equipment provided by one embodiment of the present invention;
Fig. 5 show another embodiment of the present invention provides a kind of data processing equipment structural schematic diagram;
Fig. 6 shows a kind of structural schematic diagram of data processing equipment of further embodiment of this invention offer.
Specific embodiment
Exemplary embodiments of the present disclosure are described in more detail below with reference to accompanying drawings.Although showing the disclosure in attached drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure It is fully disclosed to those skilled in the art.
One embodiment of the invention discloses a kind of data processing method, as shown in Figure 1, method includes the following steps:
Step 101: receiving at least two data files;
Each data file is stored with data, the data that different data file is stored can part it is identical, can also be with It is entirely different.
Optionally, the data stored under each data file can be carried out according to data dimension and the corresponding relationship of data Storage.Wherein, the attribute that data dimension may refer to as data.The format present invention of the data file does not limit, such as can be with For Word file, Excel file etc..
By taking excel data file storing data as an example, as shown in table 1, the data dimension which has be the date, Keyword and click volume all have corresponding data under each data dimension.
Table 1
Date Keyword Click volume
September 1 day After-sun recovery 7
September 2 days Face modifies after shining 5
September 3 days Skin after-sun recovery 8
Optionally, at least two data files are received, comprising: receive first data file and tool with the first data There is the second data file of the second data.
With SEM (Search Engine Marketing, media research marketing) data instance, the first data file is tool There is the file of front end data, the second data file is the file with Back end data.
It should be noted that front end data is the data obtained from media search engine, such as from Baidu, 360, search dog, today The data that the search engines such as top news obtain.Front end data is typically stored in excel file, including structure of accounts, equipment end, exhibition At least one of data of data dimensions such as occurrence number, click volume, spending limit.
Wherein, structure of accounts includes keyword, name on account, plan title, unit title etc..Equipment end include the end PC, Mobile phone terminal etc..
Back end data is the data by monitoring or receiving, and Back end data is typically stored in excel file, including At least one of data of data dimensions such as structure of accounts, equipment end, the order amount of money, order volume.Wherein, structure of accounts includes Keyword, name on account, plan title, unit title etc..Equipment end includes the end PC, mobile phone terminal etc..
Step 102: the data at least two data file being gathered based at least one dimension combination It closes, generates data report corresponding with the dimension combination.
Wherein, dimension combination includes the combination of different data dimension, the combination of at least two kinds data dimensions.Example Such as, a kind of dimension combination is the combination of name on account and date, and another dimension combination is name on account, sets The combination at standby end and date.
A kind of corresponding data report of dimension combination, when generating at least two data reports, two data Report can be respectively stored in different data files, also be stored in same data file certainly.It is number with data report For report, each data sheet can correspond to an excel file, alternatively, multiple data reports are stored in an excel text In different sheet in part.By taking data report is data file as an example, each data file can correspond to a word document, or Person, multiple data files are stored in the not same page in a word document.
Present embodiment discloses a kind of data processing methods, comprising: receives at least two data files, is combined based on dimension Mode polymerize the data at least two data file, generates datagram corresponding with the dimension combination It accuses;Wherein, which includes the combination of different data dimension;It can be seen that the application can be based on different numbers It automatically generates according to file and is reported with dimension combination corresponding data, without artificial statistics, improved work efficiency.
Another embodiment of the present invention discloses a kind of data processing method, as shown in Fig. 2, method includes the following steps:
Step 201: receiving at least two data files;
Wherein, at least two data file includes identical data dimension.For example, receiving the comprising the first data One data file and the second data file comprising the second data, the first data and the second data that the first data file is stored The second data that file is stored all have the identical data dimension such as name on account, date and equipment end.
Step 202: based on the preassigned data dimension from the identical data dimension of at least two data file At least two data file is merged into a data file;
In the present invention, preassigned data dimension includes at least one, it is preferred that can be by least two number Preassigned data dimension is used as according to the identical data dimension of file.The preassigned data dimension can be by system Default setting, or be customized by the user, i.e., it is specified when system brings into operation by user, due to being from least two data text Preassigned data dimension in identical data dimension in part, therefore the specified number is all had at least two data files According to dimension.
It optionally, will based on data dimension preassigned from the identical data dimension of at least two data file It includes: the data dimension that at least two data documents is fixed that at least two data file, which merges into a data file, Merge into a data dimension, and other unappropriated data dimensions are still separately as a data dimension, to generate packet Another data file containing the data at least two data file.Wherein, what the data file after merging included is specified Data dimension under data can't repeat.
For example, the data dimension all having in the first data file and the second data file is name on account, the first data The data dimension that file also has is click volume, and the data dimension that the second data file also has is order volume, then, it is being based on After name on account merges the first data file and the second data file, in newly-generated data file, the first data file A name on account is merged into the name on account of the second data file, and click volume is ordered still separately as a data dimension Single amount is also separately as a data dimension.Assuming that generating third number after the first data file and the second data file are merged According to file, then the data dimension for including in third data file has name on account, click volume and order volume, due to the first number Be according to file and the second data file merged according to name on account, therefore in third data file only have one about The data dimension of name on account.
By merging above-mentioned at least two data files received, so that system can be based on the data after merging Data in file are polymerize, to generate data report.Specifically, merging corresponding with document format data can be used Mode merges above-mentioned at least two data file.When data file is excel file, can use and vlookup function Similar function merges at least two data file.
It it should be noted that the data file after merging is not shown in front end, and is only system on backstage by least two Data file merge after when carrying out data aggregate used data file.If with the demand that front end is shown, in this hair In bright another embodiment, this method can also include: to show the data file after merging in front end.
Step 203: the data in the data file after merging are polymerize based at least one dimension combination, it is raw At data report corresponding with the dimension combination.
Wherein, the dimension combination includes the combination of different data dimension.
In the present embodiment, by receiving at least two data files, based on from the identical of at least two data file At least two data file is merged into a data file by preassigned data dimension in data dimension, based at least A kind of dimension combination polymerize the data in the data file after merging, generates corresponding with the dimension combination Data report, wherein the dimension combination includes the combination of different data dimension;It can be seen that the application can be based on Different data files is automatically generated to be reported with dimension combination corresponding data, without artificial statistics, is improved work efficiency.
Further embodiment of this invention discloses a kind of data processing method, as shown in figure 3, method includes the following steps:
Step 301: receiving at least two data files;
Step 302: it is based at least two data file, establishes the data under first data dimension, described the The corresponding relationship of the data under data and the third data dimension under two data dimensions;
Wherein, each corresponding relationship has uniqueness.It is understood that since the corresponding relationship is the first data dimension Under data, under the data under the second data dimension and third data dimension data corresponding relationship, therefore the uniqueness refers to It is unique to exist simultaneously three kinds of identical data in different corresponding relationships.But in different corresponding relationships, Ke Yicun In one or two identical data.It can be explained hereinafter with specific example.
It should be noted that if will two data file mergencess be at least a data before carrying out data aggregate File, then establishing the data under first data dimension, second data dimension in direct data file after merging The corresponding relationship of the data under data and the third data dimension under degree.
Step 303: under each corresponding relationship, to being different from first data dimension at least two data file Data under other data dimensions of degree, second data dimension and the third data dimension are summarized, described in generation The data report of first data dimension and second data dimension about the third data dimension.
Wherein, dimension combination includes the combination of the first data dimension and the second data dimension according to the date.
In the present embodiment, dimension combination is the first data dimension, the second data dimension and third data dimension Combined mode.So, the first data dimension and the second data dimension can be generated about third based on the dimension combination The data report of data dimension.
For front end data and Back end data described in the embodiment above, the first data dimension can for equipment end, Second data dimension can be name on account, and third data dimension is the date, i.e., generating device end-name on account is about the date Divide day data report.Alternatively, it can be name on account, third data that the first data dimension, which can be part of speech, the second data dimension, Dimension is the date, i.e., generation part of speech-name on account divides day data report about the date.
It should be noted that if will two data file mergencess be at least a data before carrying out data aggregate File, then under each corresponding relationship, directly to being different from first data dimension, described in the data file after merging Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated The data report of dimension and second data dimension about the third data dimension.
In an alternative embodiment of the invention, this method further include:
Step 304: generate first data dimension and second data dimension summarizes data report.
It should be noted that when generating data report, it can only summarize and be different from described the in the data file after merging Numeric data under other data dimensions of one data dimension, second data dimension and the third data dimension, and simultaneously Other categorical datas are not summarized.
By taking the first data dimension can be name on account for equipment end, the second data dimension as an example, i.e. generating device End-name on account summarizes data report.Alternatively, it can be account that the first data dimension, which can be part of speech, the second data dimension, Title, i.e. generation part of speech-name on account summarize data report.
For ease of understanding, the present invention is illustrated by taking table 2- table 5 as an example, and table 2- table 5 is as follows:
Table 2
Table 3
Table 4
Table 5
Summarize data
Equipment Account Show It clicks
MOB baidu-001 23 51
PC baidu-002 21 65
Specifically, a kind of data processing method includes following procedure:
(1) data file corresponding to table 2 and table 3 is received;
The data dimension having in the data file corresponding to table 2 includes: date, keyword, unit title, plan name Claim, show number, click volume, name on account and equipment end.
The data dimension having in the data file corresponding to table 3 includes: name on account, date, plan title, unit Title, keyword, equipment end and click volume.
(2) by data file corresponding to table 2 and table 3 according to date, name on account, equipment end, plan title, unit name Claim, keyword merges.
(3) in data file after merging, the data under equipment end are established, the data under the name on account with it is described The corresponding relationship of data under date.
The corresponding relationship has uniqueness, the number it can be seen from the generation of table 4 under equipment end, name on account and date In, MOB, baidu-001 and 2017/8/1 corresponding relationship, MOB, baidu-001 and 2017/8/2 corresponding relationship, MOB, baidu-001 and 2017/8/3 corresponding relationship, PC, baidu-001 and 2017/8/1 corresponding relationship, PC, The corresponding relationship of baidu-001 and 2017/8/2, PC, baidu-001 and 2017/8/3 corresponding relationship tool are uniquely to deposit ?.
(4) under each corresponding relationship, the data under showing, clicking in the data file after merging are summarized, Generating device end and name on account divide day data report about the date.It is specific as shown in table 4.
(5) generating device end-name on account summarizes data report, specific as shown in table 5.
It is corresponding with a kind of above-mentioned data processing method, the invention also discloses a kind of data processing equipment, below by way of Several embodiments are illustrated, specific:
One embodiment of the invention discloses a kind of data processing equipment, as shown in figure 4, the device includes: reception file Unit 401 and generation reporting unit 402;Wherein:
File unit 401 is received, for receiving at least two data files;
Each data file is stored with data, the data of storage described in different data file can part it is identical, can also With entirely different.
Wherein, the data stored under each data file can be deposited according to the corresponding relationship of data dimension and data Storage.Wherein, the attribute that data dimension may refer to as data.The format present invention of the data file does not limit, and such as can be Word file, Excel file etc..
Optionally, at least two data files are received, comprising: receive first data file and tool with the first data There is the second data file of the second data.
With SEM (Search Engine Marketing, media research marketing) data instance, the first data file is tool There is the file of front end data, the second data file is the file with Back end data.
It should be noted that front end data is the data obtained from media search engine, such as from Baidu, 360, search dog, today The data that the search engines such as top news obtain.Front end data is typically stored in excel file, including structure of accounts, equipment end, exhibition At least one of data of data dimensions such as occurrence number, click volume, spending limit.
Wherein, structure of accounts includes keyword, name on account, plan title, unit title etc..Equipment end include the end PC, Mobile phone terminal etc..
Back end data is the data by monitoring or receiving, and Back end data is typically stored in excel file, including At least one of data of data dimensions such as structure of accounts, equipment end, the order amount of money, order volume.Wherein, structure of accounts includes Keyword, name on account, plan title, unit title etc..Equipment end includes the end PC, mobile phone terminal etc..
Reporting unit 402 is generated, for being based at least one dimension combination at least two data file Data polymerize, generate corresponding with dimension combination data report.
Wherein, dimension combination includes the combination of different data dimension, the combination of at least two kinds data dimensions.Example Such as, a kind of dimension combination is the combination of name on account and date, and a kind of dimension combination is name on account, equipment The combination at end and date.
A kind of corresponding data report of dimension combination, when generating at least two data reports, two data Report can be respectively stored in different data files, also be stored in same data file certainly.It is number with data report For report, each data sheet can correspond to an excel file, alternatively, multiple data reports are stored in an excel text In different sheet in part.By taking data report is data file as an example, each data file can correspond to a word document, or Person, multiple data files are stored in the not same page in a word document.
In the present embodiment, by receiving at least two data files, based on dimension combination at least two number It is polymerize according to the data in file, to generate data report corresponding with the dimension combination;Wherein, which combines Mode includes the combination of different data dimension;It can be seen that the application can be automatically generated and be tieed up based on different data files The report of combination corresponding data is spent, without artificial statistics, is improved work efficiency.
Another embodiment of the present invention discloses a kind of data processing equipment, as shown in figure 5, the device includes: reception file Unit 501, file mergences unit 502 and generation reporting unit 503;It is specific:
File unit 501 is received, for receiving at least two data files;
Wherein, at least two data files include identical data dimension.For example, receiving file unit for receiving packet The first data file containing the first data and the second data file comprising the second data, the first data file stored first The second data that data and the second data file are stored all have the identical data dimension such as name on account, date and equipment end Degree.
File mergences unit 503, for based on preparatory from the identical data dimension of at least two data file At least two data file is merged into a data file by specified data dimension;
In the present invention, preassigned data dimension includes at least one, it is preferred that can be by least two number Preassigned data dimension is used as according to the identical data dimension of file.The preassigned data dimension can be by system Default setting, or be customized by the user, i.e., when system brings into operation by the specified combined data dimension of user, due to be from The identical preassigned data dimension of data dimension at least two data files, therefore at least two data files With the specified data dimension.
Optionally, file mergences unit specifically can be used for merging the fixed data dimension of at least two data documents For a data dimension, and other unappropriated data dimensions wrap institute to generate still separately as a data dimension State another data file of the data at least two data files.Wherein, the specified number that the data file after merging includes It can't be repeated according to the data under dimension.
It it should be noted that the data file after merging is not shown in front end, and is only system on backstage by least two Data file merge after when carrying out data aggregate used data file.If with the demand that front end is shown, in this hair In bright another embodiment, which can also include: file display unit, show for the data file after merging preceding End.
Generate reporting unit 504, for based at least one dimension combination to the number in the data file after merging According to being polymerize, data report corresponding with the dimension combination is generated.
In the present embodiment, by receiving at least two data files, based on from the identical of at least two data file At least two data file is merged into a data file by preassigned data dimension in data dimension, based at least A kind of dimension combination polymerize the data in the data file after merging, generates corresponding with the dimension combination Data report, wherein the dimension combination includes the combination of different data dimension;It can be seen that the application can be based on Different data files is automatically generated to be reported with dimension combination corresponding data, without artificial statistics, is improved work efficiency.
Further embodiment of this invention discloses a kind of data processing equipment, as shown in fig. 6, the device includes: reception file Unit 601 and generation reporting unit 602, wherein generate reporting unit 602: module 6021 and the first life are established including first At module 6022;It is specific:
File unit 601 is received, for receiving at least two data files;
First establishes module 6021, for being based at least two data file, establishes under first data dimension Data, the corresponding relationship of the data under data and the third data dimension under second data dimension;
Wherein, each corresponding relationship has uniqueness.It is understood that since the corresponding relationship is the first data dimension Under data, under the data under the second data dimension and third data dimension data corresponding relationship, therefore the uniqueness refers to It is unique to exist simultaneously three kinds of identical data in different corresponding relationships.But in different corresponding relationships, Ke Yicun In one or two identical data.
Dimension combination includes the combination of the first data dimension and the second data dimension according to third data dimension.
It should be noted that first, which establishes module, specifically can be used for if the device includes file mergences unit The data under first data dimension are established in data file after merging, the data under second data dimension with it is described The corresponding relationship of data under third data dimension.
First generation module 6022, under each corresponding relationship, to being different from least two data file Data under other data dimensions of first data dimension, second data dimension and the third data dimension carry out Summarize, generates the data report of first data dimension and second data dimension about the third data dimension.
Wherein, dimension combination includes the combination of the first data dimension and the second data dimension according to the date.
In the present embodiment, dimension combination is the first data dimension, the second data dimension and third data dimension Combined mode.So, the first data dimension and the second data dimension can be generated about third based on the dimension combination The data report of data dimension.
For front end data and Back end data described in the embodiment above, the first data dimension can for equipment end, Second data dimension can be name on account, and third data dimension is the date, i.e., generating device end-name on account is about the date Divide day data report.Alternatively, it can be name on account, third data that the first data dimension, which can be part of speech, the second data dimension, Dimension is the date, i.e., generation part of speech-name on account divides day data report about the date.
It should be noted that the first generation module specifically can be used for if the device includes file mergences unit Under each corresponding relationship, to be different from the data file after merging first data dimension, second data dimension and Data under other data dimensions of the third data dimension are summarized, and first data dimension and described second are generated Data report of the data dimension about the third data dimension.
In still another embodiment of the process, generating reporting unit further includes the second generation module 6023, described for generating First data dimension and second data dimension summarize data report.
It should be noted that being generated in the data file that reporting unit can only summarize after merging when generating data report The number being different under other data dimensions of first data dimension, second data dimension and the third data dimension Value Data, and do not summarize other categorical datas.
A kind of data processing equipment includes processor and memory, above-mentioned reception file unit, file mergences unit, It generates reporting unit and is used as program unit storage in memory, above procedure stored in memory is executed by processor Unit realizes corresponding function.
Include kernel in processor, is gone in memory to transfer corresponding program unit by kernel.Kernel can be set one Or more, it reports to automatically generate with dimension combination corresponding data by adjusting kernel parameter, without artificial statistics, improves Working efficiency.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/ Or the forms such as Nonvolatile memory, if read-only memory (ROM) or flash memory (flash RAM), memory include that at least one is deposited Store up chip.
The embodiment of the invention provides a kind of storage mediums, are stored thereon with program, real when which is executed by processor A kind of existing data processing method.Specifically, storage medium includes the program of storage, wherein run time control in described program Equipment executes a kind of data processing method where making the storage medium
The embodiment of the invention provides a kind of processor, the processor is for running program, wherein described program operation A kind of data processing method described in Shi Zhihang.
The embodiment of the invention provides a kind of equipment, equipment include processor, memory and storage on a memory and can The program run on a processor, processor perform the steps of when executing program
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with The corresponding data report of the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
Optionally, at least two data file includes identical data dimension;Processor is also realized when executing program Following steps:
Based on data dimension preassigned in the identical data dimension from least two data file will it is described to Few two data file mergencess are a data file;
Correspondingly, described carry out the data at least two data file based at least one dimension combination Polymerization generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, generate with it is described The corresponding data report of dimension combination.
Optionally, the dimension combination includes the first data dimension and the second data dimension according to third data dimension Combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, it is raw At data report corresponding with the dimension combination, comprising:
Based at least two data file, the data under first data dimension, second data dimension are established The corresponding relationship of the data under data and the third data dimension under degree;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described at least two data file Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated The data report of dimension and second data dimension about the third data dimension.
Optionally, it is also performed the steps of when processor executes program
Generate first data dimension and second data dimension summarizes data report.
Equipment herein can be server, PC, PAD, mobile phone etc..
Present invention also provides a kind of computer program products, when executing on data processing equipment, are adapted for carrying out just The program of beginningization there are as below methods step:
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with The corresponding data report of the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
Optionally, at least two data file includes identical data dimension;Also with the journey of following method and step Sequence:
Based on data dimension preassigned in the identical data dimension from least two data file will it is described to Few two data file mergencess are a data file;
Correspondingly, described carry out the data at least two data file based at least one dimension combination Polymerization generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, generate with it is described The corresponding data report of dimension combination.
Optionally, the dimension combination includes the first data dimension and the second data dimension according to third data dimension Combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, it is raw At data report corresponding with the dimension combination, comprising:
Based at least two data file, the data under first data dimension, second data dimension are established The corresponding relationship of the data under data and the third data dimension under degree;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described at least two data file Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated The data report of dimension and second data dimension about the third data dimension.
Optionally, also with the program of following method and step:
Generate first data dimension and second data dimension summarizes data report.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The application is referring to method, the process of equipment (system) and computer program product according to the embodiment of the present application Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
In a typical configuration, calculating equipment includes one or more processors (CPU), input/output interface, net Network interface and memory.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/ Or the forms such as Nonvolatile memory, such as read-only memory (ROM) or flash memory (flash RAM).Memory is computer-readable Jie The example of matter.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be by any method Or technology come realize information store.Information can be computer readable instructions, data structure, the module of program or other data. The example of the storage medium of computer includes, but are not limited to phase change memory (PRAM), static random access memory (SRAM), moves State random access memory (DRAM), other kinds of random access memory (RAM), read-only memory (ROM), electric erasable Programmable read only memory (EEPROM), flash memory or other memory techniques, read-only disc read only memory (CD-ROM) (CD-ROM), Digital versatile disc (DVD) or other optical storage, magnetic cassettes, tape magnetic disk storage or other magnetic storage devices Or any other non-transmission medium, can be used for storage can be accessed by a computing device information.As defined in this article, it calculates Machine readable medium does not include temporary computer readable media (transitory media), such as the data-signal and carrier wave of modulation.
It should also be noted that, the terms "include", "comprise" or its any other variant are intended to nonexcludability It include so that the process, method, commodity or the equipment that include a series of elements not only include those elements, but also to wrap Include other elements that are not explicitly listed, or further include for this process, method, commodity or equipment intrinsic want Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including element There is also other identical elements in process, method, commodity or equipment.
It will be understood by those skilled in the art that embodiments herein can provide as method, system or computer program product. Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the application Form.It is deposited moreover, the application can be used to can be used in the computer that one or more wherein includes computer usable program code The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) Formula.
The above is only embodiments herein, are not intended to limit this application.To those skilled in the art, Various changes and changes are possible in this application.It is all within the spirit and principles of the present application made by any modification, equivalent replacement, Improve etc., it should be included within the scope of the claims of this application.

Claims (10)

1. a kind of data processing method characterized by comprising
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with it is described The corresponding data report of dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
2. the method according to claim 1, wherein at least two data file includes identical data dimension Degree;
This method further include:
Based on data dimension preassigned from the identical data dimension of at least two data file at least two by described in A data file mergences is a data file;
Correspondingly, described gather the data at least two data file based at least one dimension combination It closes, generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, are generated and the dimension The corresponding data report of combination.
3. the method according to claim 1, wherein the dimension combination includes the first data dimension and the Two data dimensions according to third data dimension combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, generate with The corresponding data report of the dimension combination, comprising:
Based at least two data file, the data under first data dimension are established, under second data dimension Data and the third data dimension under data corresponding relationship;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described second at least two data file Data under other data dimensions of data dimension and the third data dimension are summarized, and first data dimension is generated Data report with second data dimension about the third data dimension.
4. according to the method described in claim 3, it is characterized by further comprising:
Generate first data dimension and second data dimension summarizes data report.
5. a kind of data processing equipment characterized by comprising
File unit is received, for receiving at least two data files;
Generate reporting unit, for based at least one dimension combination to the data at least two data file into Row polymerization, generates data report corresponding with the dimension combination;Wherein, the dimension combination includes different data The combination of dimension.
6. device according to claim 5, which is characterized in that at least two data file includes identical data dimension Degree;The device further include:
File mergences unit, for based on the preassigned data from the identical data dimension of at least two data file At least two data file is merged into a data file by dimension;
Correspondingly, the generation reporting unit is specifically used for based at least one dimension combination to the data file after merging In data polymerize, generate corresponding with dimension combination data report.
7. device according to claim 5, which is characterized in that the dimension combination includes the first data dimension and the Two data dimensions according to third data dimension combination;
The generation reporting unit, comprising:
First establishes module, for being based at least two data file, establishes the data under first data dimension, institute State the corresponding relationship of the data under the second data dimension and the data under the third data dimension;Wherein, each corresponding relationship With uniqueness;
First generation module, under each corresponding relationship, to being different from described first at least two data file Data under other data dimensions of data dimension, second data dimension and the third data dimension are summarized, raw Data report at first data dimension and second data dimension about the third data dimension.
8. device according to claim 5, which is characterized in that the generation reporting unit further include:
Second generation module summarizes data report for generate first data dimension and second data dimension.
9. a kind of storage medium, which is characterized in that the storage medium includes the program of storage, wherein run in described program When control the storage medium where equipment execute a kind of such as data processing method of any of claims 1-4.
10. a kind of processor, which is characterized in that the processor executes such as right for running program when described program is run It is required that a kind of data processing method described in any one of 1-4.
CN201710904814.1A 2017-09-29 2017-09-29 A kind of data processing method and device Pending CN110147352A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710904814.1A CN110147352A (en) 2017-09-29 2017-09-29 A kind of data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710904814.1A CN110147352A (en) 2017-09-29 2017-09-29 A kind of data processing method and device

Publications (1)

Publication Number Publication Date
CN110147352A true CN110147352A (en) 2019-08-20

Family

ID=67588028

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710904814.1A Pending CN110147352A (en) 2017-09-29 2017-09-29 A kind of data processing method and device

Country Status (1)

Country Link
CN (1) CN110147352A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020158A (en) * 2012-11-26 2013-04-03 中兴通讯股份有限公司 Report form creation method, device and system
CN104484398A (en) * 2014-12-12 2015-04-01 北京国双科技有限公司 Method and device for aggregation of data in datasheet
US20150161185A1 (en) * 2013-12-09 2015-06-11 Linkedin Corporation Enabling and performing count-distinct queries on a large set of data
CN106528511A (en) * 2016-09-30 2017-03-22 东软集团股份有限公司 Form analysis method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103020158A (en) * 2012-11-26 2013-04-03 中兴通讯股份有限公司 Report form creation method, device and system
US20150161185A1 (en) * 2013-12-09 2015-06-11 Linkedin Corporation Enabling and performing count-distinct queries on a large set of data
CN104484398A (en) * 2014-12-12 2015-04-01 北京国双科技有限公司 Method and device for aggregation of data in datasheet
CN106528511A (en) * 2016-09-30 2017-03-22 东软集团股份有限公司 Form analysis method and device

Similar Documents

Publication Publication Date Title
WO2015148159A1 (en) Determining a temporary transaction limit
JP6779231B2 (en) Data processing method and system
US20130006996A1 (en) Clustering E-Mails Using Collaborative Information
CN107391532B (en) Data filtering method and device
CN110188100A (en) Data processing method, device and computer storage medium
WO2016101811A1 (en) Information arrangement method and apparatus
WO2017133568A1 (en) Mining method and device for target characteristic data
CN106815254A (en) A kind of data processing method and device
CN110457182A (en) A kind of load balancing cluster example operating index monitoring system
CN103235811A (en) Data storage method and device
US11144793B2 (en) Incremental clustering of a data stream via an orthogonal transform based indexing
EP3437060A1 (en) Rule based hierarchical configuration
CN102982112A (en) Ranking list generation method and journal generation method and server
CN106815274A (en) Daily record data method for digging and system based on Hadoop
CN102932416A (en) Intermediate data storage method, processing method and device in information flow task
CN110069453A (en) Operation/maintenance data treating method and apparatus
CN106570005A (en) Database cleaning method and device
US20170235625A1 (en) Data mining using categorical attributes
CN107391533A (en) Generate the method and device of graphic data base Query Result
CN111143546A (en) Method and device for obtaining recommendation language and electronic equipment
CN110147352A (en) A kind of data processing method and device
CN109582476A (en) Data processing method, apparatus and system
Prakashbhai et al. Inference patterns from Big Data using aggregation, filtering and tagging-A survey
CN106708845A (en) Data processing method and device for Internet account
CN109068286A (en) A kind of method, medium and the equipment of information parsing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
CB02 Change of applicant information

Address after: 100080 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.

Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A

Applicant before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd.

CB02 Change of applicant information
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190820

RJ01 Rejection of invention patent application after publication