CN110147352A - A kind of data processing method and device - Google Patents
A kind of data processing method and device Download PDFInfo
- Publication number
- CN110147352A CN110147352A CN201710904814.1A CN201710904814A CN110147352A CN 110147352 A CN110147352 A CN 110147352A CN 201710904814 A CN201710904814 A CN 201710904814A CN 110147352 A CN110147352 A CN 110147352A
- Authority
- CN
- China
- Prior art keywords
- data
- dimension
- file
- combination
- report
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/16—File or folder operations, e.g. details of user interfaces specifically adapted to file systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
- G06F40/174—Form filling; Merging
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Human Computer Interaction (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The present invention provides a kind of data processing method and device, method includes: to receive at least two data files;The data at least two data file are polymerize based at least one dimension combination, generate data report corresponding with the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.It reports it can be seen that the application can be automatically generated based on different data files with dimension combination corresponding data, without artificial statistics, improves work efficiency.
Description
Technical field
This application involves technical field of data processing, more particularly relate to a kind of data processing method and device.
Background technique
Currently, in data processing field, many data are stored in the form of a file, such as with excel text
Part carrys out storing data, and accordingly, there exist the demands for counting the data in different files.
For example, in the technical field of search engine marketing (Search Engine Marketing, SEM), for a certain
The front end data and Back end data store that task generates are in different files, when generating data report, need based on front end
Data and Back end data are counted.Wherein, front end data is mainly the data obtained from media side, such as Baidu, 360, today
Top news etc..Back end data mainly passes through the data that monitoring obtains.
And in the prior art, it needs manually to count the data from different files, very labor intensive and time, reduces
Working efficiency.
Summary of the invention
In view of the above problems, it proposes on the present invention overcomes the above problem or at least be partially solved in order to provide one kind
State a kind of data processing method and device of problem.
A kind of data processing method, comprising:
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with
The corresponding data report of the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
Preferably, at least two data file includes identical data dimension;
This method further include:
Based on data dimension preassigned in the identical data dimension from least two data file will it is described to
Few two data file mergencess are a data file;
Correspondingly, described carry out the data at least two data file based at least one dimension combination
Polymerization generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, generate with it is described
The corresponding data report of dimension combination.
Preferably, the dimension combination includes the first data dimension and the second data dimension according to third data dimension
Combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, it is raw
At data report corresponding with the dimension combination, comprising:
Based at least two data file, the data under first data dimension, second data dimension are established
The corresponding relationship of the data under data and the third data dimension under degree;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described at least two data file
Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated
The data report of dimension and second data dimension about the third data dimension.
Preferably, further includes:
Generate first data dimension and second data dimension summarizes data report.
A kind of data processing equipment, comprising:
File unit is received, for receiving at least two data files;
Generate reporting unit, for based at least one dimension combination to the number at least two data file
According to being polymerize, data report corresponding with the dimension combination is generated;Wherein, the dimension combination includes difference
The combination of data dimension.
Preferably, at least two data file includes identical data dimension;The device further include:
File mergences unit, for based on preassigned from the identical data dimension of at least two data file
At least two data file is merged into a data file by data dimension;
Correspondingly, the generation reporting unit is specifically used for based at least one dimension combination to the data after merging
Data in file are polymerize, and data report corresponding with the dimension combination is generated.
Preferably, the dimension combination includes the first data dimension and the second data dimension according to third data dimension
Combination;
The generation reporting unit, comprising:
First establishes module, for being based at least two data file, establishes the number under first data dimension
According to the corresponding relationship of the data under data and the third data dimension under second data dimension;Wherein, each correspondence
Relationship has uniqueness;
First generation module is used under each corresponding relationship, described to being different from least two data file
Data under other data dimensions of first data dimension, second data dimension and the third data dimension are converged
Always, the data report of first data dimension and second data dimension about the third data dimension is generated.
Preferably, the generation reporting unit further include:
Second generation module summarizes datagram for generate first data dimension and second data dimension
It accuses.
A kind of storage medium, the storage medium include the program of storage, wherein in described program operation described in control
Equipment where storage medium executes a kind of as above described in any item data processing methods.
A kind of processor, the processor execute as above described in any item for running program, when described program is run
A kind of data processing method.
By above-mentioned technical proposal, the present invention provides a kind of data processing methods, comprising: receives at least two data text
Part polymerize the data at least two data file based on dimension combination, and generation is combined with the dimension
The corresponding data report of mode;Wherein, which includes the combination of different data dimension;It can be seen that the application
It can be automatically generated based on different data files and be reported with dimension combination corresponding data, without artificial statistics, improved
Working efficiency.
The above description is only an overview of the technical scheme of the present invention, in order to better understand the technical means of the present invention,
And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage can
It is clearer and more comprehensible, the followings are specific embodiments of the present invention.
Detailed description of the invention
By reading the following detailed description of the preferred embodiment, various other advantages and benefits are common for this field
Technical staff will become clear.The drawings are only for the purpose of illustrating a preferred embodiment, and is not considered as to the present invention
Limitation.And throughout the drawings, use is identicalExamining symbol indicates identical component.In the accompanying drawings:
Fig. 1 shows a kind of flow diagram of data processing method disclosed in one embodiment of the invention;
Fig. 2 shows another embodiment of the present invention provides a kind of data processing method flow diagram;
Fig. 3 shows a kind of flow diagram of data processing method of further embodiment of this invention offer;
Fig. 4 shows a kind of structural schematic diagram of data processing equipment provided by one embodiment of the present invention;
Fig. 5 show another embodiment of the present invention provides a kind of data processing equipment structural schematic diagram;
Fig. 6 shows a kind of structural schematic diagram of data processing equipment of further embodiment of this invention offer.
Specific embodiment
Exemplary embodiments of the present disclosure are described in more detail below with reference to accompanying drawings.Although showing the disclosure in attached drawing
Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here
It is limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure
It is fully disclosed to those skilled in the art.
One embodiment of the invention discloses a kind of data processing method, as shown in Figure 1, method includes the following steps:
Step 101: receiving at least two data files;
Each data file is stored with data, the data that different data file is stored can part it is identical, can also be with
It is entirely different.
Optionally, the data stored under each data file can be carried out according to data dimension and the corresponding relationship of data
Storage.Wherein, the attribute that data dimension may refer to as data.The format present invention of the data file does not limit, such as can be with
For Word file, Excel file etc..
By taking excel data file storing data as an example, as shown in table 1, the data dimension which has be the date,
Keyword and click volume all have corresponding data under each data dimension.
Table 1
Date | Keyword | Click volume |
September 1 day | After-sun recovery | 7 |
September 2 days | Face modifies after shining | 5 |
September 3 days | Skin after-sun recovery | 8 |
Optionally, at least two data files are received, comprising: receive first data file and tool with the first data
There is the second data file of the second data.
With SEM (Search Engine Marketing, media research marketing) data instance, the first data file is tool
There is the file of front end data, the second data file is the file with Back end data.
It should be noted that front end data is the data obtained from media search engine, such as from Baidu, 360, search dog, today
The data that the search engines such as top news obtain.Front end data is typically stored in excel file, including structure of accounts, equipment end, exhibition
At least one of data of data dimensions such as occurrence number, click volume, spending limit.
Wherein, structure of accounts includes keyword, name on account, plan title, unit title etc..Equipment end include the end PC,
Mobile phone terminal etc..
Back end data is the data by monitoring or receiving, and Back end data is typically stored in excel file, including
At least one of data of data dimensions such as structure of accounts, equipment end, the order amount of money, order volume.Wherein, structure of accounts includes
Keyword, name on account, plan title, unit title etc..Equipment end includes the end PC, mobile phone terminal etc..
Step 102: the data at least two data file being gathered based at least one dimension combination
It closes, generates data report corresponding with the dimension combination.
Wherein, dimension combination includes the combination of different data dimension, the combination of at least two kinds data dimensions.Example
Such as, a kind of dimension combination is the combination of name on account and date, and another dimension combination is name on account, sets
The combination at standby end and date.
A kind of corresponding data report of dimension combination, when generating at least two data reports, two data
Report can be respectively stored in different data files, also be stored in same data file certainly.It is number with data report
For report, each data sheet can correspond to an excel file, alternatively, multiple data reports are stored in an excel text
In different sheet in part.By taking data report is data file as an example, each data file can correspond to a word document, or
Person, multiple data files are stored in the not same page in a word document.
Present embodiment discloses a kind of data processing methods, comprising: receives at least two data files, is combined based on dimension
Mode polymerize the data at least two data file, generates datagram corresponding with the dimension combination
It accuses;Wherein, which includes the combination of different data dimension;It can be seen that the application can be based on different numbers
It automatically generates according to file and is reported with dimension combination corresponding data, without artificial statistics, improved work efficiency.
Another embodiment of the present invention discloses a kind of data processing method, as shown in Fig. 2, method includes the following steps:
Step 201: receiving at least two data files;
Wherein, at least two data file includes identical data dimension.For example, receiving the comprising the first data
One data file and the second data file comprising the second data, the first data and the second data that the first data file is stored
The second data that file is stored all have the identical data dimension such as name on account, date and equipment end.
Step 202: based on the preassigned data dimension from the identical data dimension of at least two data file
At least two data file is merged into a data file;
In the present invention, preassigned data dimension includes at least one, it is preferred that can be by least two number
Preassigned data dimension is used as according to the identical data dimension of file.The preassigned data dimension can be by system
Default setting, or be customized by the user, i.e., it is specified when system brings into operation by user, due to being from least two data text
Preassigned data dimension in identical data dimension in part, therefore the specified number is all had at least two data files
According to dimension.
It optionally, will based on data dimension preassigned from the identical data dimension of at least two data file
It includes: the data dimension that at least two data documents is fixed that at least two data file, which merges into a data file,
Merge into a data dimension, and other unappropriated data dimensions are still separately as a data dimension, to generate packet
Another data file containing the data at least two data file.Wherein, what the data file after merging included is specified
Data dimension under data can't repeat.
For example, the data dimension all having in the first data file and the second data file is name on account, the first data
The data dimension that file also has is click volume, and the data dimension that the second data file also has is order volume, then, it is being based on
After name on account merges the first data file and the second data file, in newly-generated data file, the first data file
A name on account is merged into the name on account of the second data file, and click volume is ordered still separately as a data dimension
Single amount is also separately as a data dimension.Assuming that generating third number after the first data file and the second data file are merged
According to file, then the data dimension for including in third data file has name on account, click volume and order volume, due to the first number
Be according to file and the second data file merged according to name on account, therefore in third data file only have one about
The data dimension of name on account.
By merging above-mentioned at least two data files received, so that system can be based on the data after merging
Data in file are polymerize, to generate data report.Specifically, merging corresponding with document format data can be used
Mode merges above-mentioned at least two data file.When data file is excel file, can use and vlookup function
Similar function merges at least two data file.
It it should be noted that the data file after merging is not shown in front end, and is only system on backstage by least two
Data file merge after when carrying out data aggregate used data file.If with the demand that front end is shown, in this hair
In bright another embodiment, this method can also include: to show the data file after merging in front end.
Step 203: the data in the data file after merging are polymerize based at least one dimension combination, it is raw
At data report corresponding with the dimension combination.
Wherein, the dimension combination includes the combination of different data dimension.
In the present embodiment, by receiving at least two data files, based on from the identical of at least two data file
At least two data file is merged into a data file by preassigned data dimension in data dimension, based at least
A kind of dimension combination polymerize the data in the data file after merging, generates corresponding with the dimension combination
Data report, wherein the dimension combination includes the combination of different data dimension;It can be seen that the application can be based on
Different data files is automatically generated to be reported with dimension combination corresponding data, without artificial statistics, is improved work efficiency.
Further embodiment of this invention discloses a kind of data processing method, as shown in figure 3, method includes the following steps:
Step 301: receiving at least two data files;
Step 302: it is based at least two data file, establishes the data under first data dimension, described the
The corresponding relationship of the data under data and the third data dimension under two data dimensions;
Wherein, each corresponding relationship has uniqueness.It is understood that since the corresponding relationship is the first data dimension
Under data, under the data under the second data dimension and third data dimension data corresponding relationship, therefore the uniqueness refers to
It is unique to exist simultaneously three kinds of identical data in different corresponding relationships.But in different corresponding relationships, Ke Yicun
In one or two identical data.It can be explained hereinafter with specific example.
It should be noted that if will two data file mergencess be at least a data before carrying out data aggregate
File, then establishing the data under first data dimension, second data dimension in direct data file after merging
The corresponding relationship of the data under data and the third data dimension under degree.
Step 303: under each corresponding relationship, to being different from first data dimension at least two data file
Data under other data dimensions of degree, second data dimension and the third data dimension are summarized, described in generation
The data report of first data dimension and second data dimension about the third data dimension.
Wherein, dimension combination includes the combination of the first data dimension and the second data dimension according to the date.
In the present embodiment, dimension combination is the first data dimension, the second data dimension and third data dimension
Combined mode.So, the first data dimension and the second data dimension can be generated about third based on the dimension combination
The data report of data dimension.
For front end data and Back end data described in the embodiment above, the first data dimension can for equipment end,
Second data dimension can be name on account, and third data dimension is the date, i.e., generating device end-name on account is about the date
Divide day data report.Alternatively, it can be name on account, third data that the first data dimension, which can be part of speech, the second data dimension,
Dimension is the date, i.e., generation part of speech-name on account divides day data report about the date.
It should be noted that if will two data file mergencess be at least a data before carrying out data aggregate
File, then under each corresponding relationship, directly to being different from first data dimension, described in the data file after merging
Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated
The data report of dimension and second data dimension about the third data dimension.
In an alternative embodiment of the invention, this method further include:
Step 304: generate first data dimension and second data dimension summarizes data report.
It should be noted that when generating data report, it can only summarize and be different from described the in the data file after merging
Numeric data under other data dimensions of one data dimension, second data dimension and the third data dimension, and simultaneously
Other categorical datas are not summarized.
By taking the first data dimension can be name on account for equipment end, the second data dimension as an example, i.e. generating device
End-name on account summarizes data report.Alternatively, it can be account that the first data dimension, which can be part of speech, the second data dimension,
Title, i.e. generation part of speech-name on account summarize data report.
For ease of understanding, the present invention is illustrated by taking table 2- table 5 as an example, and table 2- table 5 is as follows:
Table 2
Table 3
Table 4
Table 5
Summarize data | |||
Equipment | Account | Show | It clicks |
MOB | baidu-001 | 23 | 51 |
PC | baidu-002 | 21 | 65 |
Specifically, a kind of data processing method includes following procedure:
(1) data file corresponding to table 2 and table 3 is received;
The data dimension having in the data file corresponding to table 2 includes: date, keyword, unit title, plan name
Claim, show number, click volume, name on account and equipment end.
The data dimension having in the data file corresponding to table 3 includes: name on account, date, plan title, unit
Title, keyword, equipment end and click volume.
(2) by data file corresponding to table 2 and table 3 according to date, name on account, equipment end, plan title, unit name
Claim, keyword merges.
(3) in data file after merging, the data under equipment end are established, the data under the name on account with it is described
The corresponding relationship of data under date.
The corresponding relationship has uniqueness, the number it can be seen from the generation of table 4 under equipment end, name on account and date
In, MOB, baidu-001 and 2017/8/1 corresponding relationship, MOB, baidu-001 and 2017/8/2 corresponding relationship,
MOB, baidu-001 and 2017/8/3 corresponding relationship, PC, baidu-001 and 2017/8/1 corresponding relationship, PC,
The corresponding relationship of baidu-001 and 2017/8/2, PC, baidu-001 and 2017/8/3 corresponding relationship tool are uniquely to deposit
?.
(4) under each corresponding relationship, the data under showing, clicking in the data file after merging are summarized,
Generating device end and name on account divide day data report about the date.It is specific as shown in table 4.
(5) generating device end-name on account summarizes data report, specific as shown in table 5.
It is corresponding with a kind of above-mentioned data processing method, the invention also discloses a kind of data processing equipment, below by way of
Several embodiments are illustrated, specific:
One embodiment of the invention discloses a kind of data processing equipment, as shown in figure 4, the device includes: reception file
Unit 401 and generation reporting unit 402;Wherein:
File unit 401 is received, for receiving at least two data files;
Each data file is stored with data, the data of storage described in different data file can part it is identical, can also
With entirely different.
Wherein, the data stored under each data file can be deposited according to the corresponding relationship of data dimension and data
Storage.Wherein, the attribute that data dimension may refer to as data.The format present invention of the data file does not limit, and such as can be
Word file, Excel file etc..
Optionally, at least two data files are received, comprising: receive first data file and tool with the first data
There is the second data file of the second data.
With SEM (Search Engine Marketing, media research marketing) data instance, the first data file is tool
There is the file of front end data, the second data file is the file with Back end data.
It should be noted that front end data is the data obtained from media search engine, such as from Baidu, 360, search dog, today
The data that the search engines such as top news obtain.Front end data is typically stored in excel file, including structure of accounts, equipment end, exhibition
At least one of data of data dimensions such as occurrence number, click volume, spending limit.
Wherein, structure of accounts includes keyword, name on account, plan title, unit title etc..Equipment end include the end PC,
Mobile phone terminal etc..
Back end data is the data by monitoring or receiving, and Back end data is typically stored in excel file, including
At least one of data of data dimensions such as structure of accounts, equipment end, the order amount of money, order volume.Wherein, structure of accounts includes
Keyword, name on account, plan title, unit title etc..Equipment end includes the end PC, mobile phone terminal etc..
Reporting unit 402 is generated, for being based at least one dimension combination at least two data file
Data polymerize, generate corresponding with dimension combination data report.
Wherein, dimension combination includes the combination of different data dimension, the combination of at least two kinds data dimensions.Example
Such as, a kind of dimension combination is the combination of name on account and date, and a kind of dimension combination is name on account, equipment
The combination at end and date.
A kind of corresponding data report of dimension combination, when generating at least two data reports, two data
Report can be respectively stored in different data files, also be stored in same data file certainly.It is number with data report
For report, each data sheet can correspond to an excel file, alternatively, multiple data reports are stored in an excel text
In different sheet in part.By taking data report is data file as an example, each data file can correspond to a word document, or
Person, multiple data files are stored in the not same page in a word document.
In the present embodiment, by receiving at least two data files, based on dimension combination at least two number
It is polymerize according to the data in file, to generate data report corresponding with the dimension combination;Wherein, which combines
Mode includes the combination of different data dimension;It can be seen that the application can be automatically generated and be tieed up based on different data files
The report of combination corresponding data is spent, without artificial statistics, is improved work efficiency.
Another embodiment of the present invention discloses a kind of data processing equipment, as shown in figure 5, the device includes: reception file
Unit 501, file mergences unit 502 and generation reporting unit 503;It is specific:
File unit 501 is received, for receiving at least two data files;
Wherein, at least two data files include identical data dimension.For example, receiving file unit for receiving packet
The first data file containing the first data and the second data file comprising the second data, the first data file stored first
The second data that data and the second data file are stored all have the identical data dimension such as name on account, date and equipment end
Degree.
File mergences unit 503, for based on preparatory from the identical data dimension of at least two data file
At least two data file is merged into a data file by specified data dimension;
In the present invention, preassigned data dimension includes at least one, it is preferred that can be by least two number
Preassigned data dimension is used as according to the identical data dimension of file.The preassigned data dimension can be by system
Default setting, or be customized by the user, i.e., when system brings into operation by the specified combined data dimension of user, due to be from
The identical preassigned data dimension of data dimension at least two data files, therefore at least two data files
With the specified data dimension.
Optionally, file mergences unit specifically can be used for merging the fixed data dimension of at least two data documents
For a data dimension, and other unappropriated data dimensions wrap institute to generate still separately as a data dimension
State another data file of the data at least two data files.Wherein, the specified number that the data file after merging includes
It can't be repeated according to the data under dimension.
It it should be noted that the data file after merging is not shown in front end, and is only system on backstage by least two
Data file merge after when carrying out data aggregate used data file.If with the demand that front end is shown, in this hair
In bright another embodiment, which can also include: file display unit, show for the data file after merging preceding
End.
Generate reporting unit 504, for based at least one dimension combination to the number in the data file after merging
According to being polymerize, data report corresponding with the dimension combination is generated.
In the present embodiment, by receiving at least two data files, based on from the identical of at least two data file
At least two data file is merged into a data file by preassigned data dimension in data dimension, based at least
A kind of dimension combination polymerize the data in the data file after merging, generates corresponding with the dimension combination
Data report, wherein the dimension combination includes the combination of different data dimension;It can be seen that the application can be based on
Different data files is automatically generated to be reported with dimension combination corresponding data, without artificial statistics, is improved work efficiency.
Further embodiment of this invention discloses a kind of data processing equipment, as shown in fig. 6, the device includes: reception file
Unit 601 and generation reporting unit 602, wherein generate reporting unit 602: module 6021 and the first life are established including first
At module 6022;It is specific:
File unit 601 is received, for receiving at least two data files;
First establishes module 6021, for being based at least two data file, establishes under first data dimension
Data, the corresponding relationship of the data under data and the third data dimension under second data dimension;
Wherein, each corresponding relationship has uniqueness.It is understood that since the corresponding relationship is the first data dimension
Under data, under the data under the second data dimension and third data dimension data corresponding relationship, therefore the uniqueness refers to
It is unique to exist simultaneously three kinds of identical data in different corresponding relationships.But in different corresponding relationships, Ke Yicun
In one or two identical data.
Dimension combination includes the combination of the first data dimension and the second data dimension according to third data dimension.
It should be noted that first, which establishes module, specifically can be used for if the device includes file mergences unit
The data under first data dimension are established in data file after merging, the data under second data dimension with it is described
The corresponding relationship of data under third data dimension.
First generation module 6022, under each corresponding relationship, to being different from least two data file
Data under other data dimensions of first data dimension, second data dimension and the third data dimension carry out
Summarize, generates the data report of first data dimension and second data dimension about the third data dimension.
Wherein, dimension combination includes the combination of the first data dimension and the second data dimension according to the date.
In the present embodiment, dimension combination is the first data dimension, the second data dimension and third data dimension
Combined mode.So, the first data dimension and the second data dimension can be generated about third based on the dimension combination
The data report of data dimension.
For front end data and Back end data described in the embodiment above, the first data dimension can for equipment end,
Second data dimension can be name on account, and third data dimension is the date, i.e., generating device end-name on account is about the date
Divide day data report.Alternatively, it can be name on account, third data that the first data dimension, which can be part of speech, the second data dimension,
Dimension is the date, i.e., generation part of speech-name on account divides day data report about the date.
It should be noted that the first generation module specifically can be used for if the device includes file mergences unit
Under each corresponding relationship, to be different from the data file after merging first data dimension, second data dimension and
Data under other data dimensions of the third data dimension are summarized, and first data dimension and described second are generated
Data report of the data dimension about the third data dimension.
In still another embodiment of the process, generating reporting unit further includes the second generation module 6023, described for generating
First data dimension and second data dimension summarize data report.
It should be noted that being generated in the data file that reporting unit can only summarize after merging when generating data report
The number being different under other data dimensions of first data dimension, second data dimension and the third data dimension
Value Data, and do not summarize other categorical datas.
A kind of data processing equipment includes processor and memory, above-mentioned reception file unit, file mergences unit,
It generates reporting unit and is used as program unit storage in memory, above procedure stored in memory is executed by processor
Unit realizes corresponding function.
Include kernel in processor, is gone in memory to transfer corresponding program unit by kernel.Kernel can be set one
Or more, it reports to automatically generate with dimension combination corresponding data by adjusting kernel parameter, without artificial statistics, improves
Working efficiency.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/
Or the forms such as Nonvolatile memory, if read-only memory (ROM) or flash memory (flash RAM), memory include that at least one is deposited
Store up chip.
The embodiment of the invention provides a kind of storage mediums, are stored thereon with program, real when which is executed by processor
A kind of existing data processing method.Specifically, storage medium includes the program of storage, wherein run time control in described program
Equipment executes a kind of data processing method where making the storage medium
The embodiment of the invention provides a kind of processor, the processor is for running program, wherein described program operation
A kind of data processing method described in Shi Zhihang.
The embodiment of the invention provides a kind of equipment, equipment include processor, memory and storage on a memory and can
The program run on a processor, processor perform the steps of when executing program
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with
The corresponding data report of the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
Optionally, at least two data file includes identical data dimension;Processor is also realized when executing program
Following steps:
Based on data dimension preassigned in the identical data dimension from least two data file will it is described to
Few two data file mergencess are a data file;
Correspondingly, described carry out the data at least two data file based at least one dimension combination
Polymerization generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, generate with it is described
The corresponding data report of dimension combination.
Optionally, the dimension combination includes the first data dimension and the second data dimension according to third data dimension
Combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, it is raw
At data report corresponding with the dimension combination, comprising:
Based at least two data file, the data under first data dimension, second data dimension are established
The corresponding relationship of the data under data and the third data dimension under degree;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described at least two data file
Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated
The data report of dimension and second data dimension about the third data dimension.
Optionally, it is also performed the steps of when processor executes program
Generate first data dimension and second data dimension summarizes data report.
Equipment herein can be server, PC, PAD, mobile phone etc..
Present invention also provides a kind of computer program products, when executing on data processing equipment, are adapted for carrying out just
The program of beginningization there are as below methods step:
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with
The corresponding data report of the dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
Optionally, at least two data file includes identical data dimension;Also with the journey of following method and step
Sequence:
Based on data dimension preassigned in the identical data dimension from least two data file will it is described to
Few two data file mergencess are a data file;
Correspondingly, described carry out the data at least two data file based at least one dimension combination
Polymerization generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, generate with it is described
The corresponding data report of dimension combination.
Optionally, the dimension combination includes the first data dimension and the second data dimension according to third data dimension
Combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, it is raw
At data report corresponding with the dimension combination, comprising:
Based at least two data file, the data under first data dimension, second data dimension are established
The corresponding relationship of the data under data and the third data dimension under degree;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described at least two data file
Data under other data dimensions of second data dimension and the third data dimension are summarized, and first data are generated
The data report of dimension and second data dimension about the third data dimension.
Optionally, also with the program of following method and step:
Generate first data dimension and second data dimension summarizes data report.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program
Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application
Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more,
The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces
The form of product.
The application is referring to method, the process of equipment (system) and computer program product according to the embodiment of the present application
Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions
The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs
Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce
A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real
The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates,
Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or
The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting
Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or
The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one
The step of function of being specified in a box or multiple boxes.
In a typical configuration, calculating equipment includes one or more processors (CPU), input/output interface, net
Network interface and memory.
Memory may include the non-volatile memory in computer-readable medium, random access memory (RAM) and/
Or the forms such as Nonvolatile memory, such as read-only memory (ROM) or flash memory (flash RAM).Memory is computer-readable Jie
The example of matter.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be by any method
Or technology come realize information store.Information can be computer readable instructions, data structure, the module of program or other data.
The example of the storage medium of computer includes, but are not limited to phase change memory (PRAM), static random access memory (SRAM), moves
State random access memory (DRAM), other kinds of random access memory (RAM), read-only memory (ROM), electric erasable
Programmable read only memory (EEPROM), flash memory or other memory techniques, read-only disc read only memory (CD-ROM) (CD-ROM),
Digital versatile disc (DVD) or other optical storage, magnetic cassettes, tape magnetic disk storage or other magnetic storage devices
Or any other non-transmission medium, can be used for storage can be accessed by a computing device information.As defined in this article, it calculates
Machine readable medium does not include temporary computer readable media (transitory media), such as the data-signal and carrier wave of modulation.
It should also be noted that, the terms "include", "comprise" or its any other variant are intended to nonexcludability
It include so that the process, method, commodity or the equipment that include a series of elements not only include those elements, but also to wrap
Include other elements that are not explicitly listed, or further include for this process, method, commodity or equipment intrinsic want
Element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that including element
There is also other identical elements in process, method, commodity or equipment.
It will be understood by those skilled in the art that embodiments herein can provide as method, system or computer program product.
Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the application
Form.It is deposited moreover, the application can be used to can be used in the computer that one or more wherein includes computer usable program code
The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.)
Formula.
The above is only embodiments herein, are not intended to limit this application.To those skilled in the art,
Various changes and changes are possible in this application.It is all within the spirit and principles of the present application made by any modification, equivalent replacement,
Improve etc., it should be included within the scope of the claims of this application.
Claims (10)
1. a kind of data processing method characterized by comprising
Receive at least two data files;
The data at least two data file are polymerize based at least one dimension combination, generate with it is described
The corresponding data report of dimension combination;Wherein, the dimension combination includes the combination of different data dimension.
2. the method according to claim 1, wherein at least two data file includes identical data dimension
Degree;
This method further include:
Based on data dimension preassigned from the identical data dimension of at least two data file at least two by described in
A data file mergences is a data file;
Correspondingly, described gather the data at least two data file based at least one dimension combination
It closes, generates data report corresponding with the dimension combination, comprising:
The data in the data file after merging are polymerize based at least one dimension combination, are generated and the dimension
The corresponding data report of combination.
3. the method according to claim 1, wherein the dimension combination includes the first data dimension and the
Two data dimensions according to third data dimension combination;
It is described that the data at least two data file are polymerize based at least one dimension combination, generate with
The corresponding data report of the dimension combination, comprising:
Based at least two data file, the data under first data dimension are established, under second data dimension
Data and the third data dimension under data corresponding relationship;Wherein, each corresponding relationship has uniqueness;
Under each corresponding relationship, to being different from first data dimension, described second at least two data file
Data under other data dimensions of data dimension and the third data dimension are summarized, and first data dimension is generated
Data report with second data dimension about the third data dimension.
4. according to the method described in claim 3, it is characterized by further comprising:
Generate first data dimension and second data dimension summarizes data report.
5. a kind of data processing equipment characterized by comprising
File unit is received, for receiving at least two data files;
Generate reporting unit, for based at least one dimension combination to the data at least two data file into
Row polymerization, generates data report corresponding with the dimension combination;Wherein, the dimension combination includes different data
The combination of dimension.
6. device according to claim 5, which is characterized in that at least two data file includes identical data dimension
Degree;The device further include:
File mergences unit, for based on the preassigned data from the identical data dimension of at least two data file
At least two data file is merged into a data file by dimension;
Correspondingly, the generation reporting unit is specifically used for based at least one dimension combination to the data file after merging
In data polymerize, generate corresponding with dimension combination data report.
7. device according to claim 5, which is characterized in that the dimension combination includes the first data dimension and the
Two data dimensions according to third data dimension combination;
The generation reporting unit, comprising:
First establishes module, for being based at least two data file, establishes the data under first data dimension, institute
State the corresponding relationship of the data under the second data dimension and the data under the third data dimension;Wherein, each corresponding relationship
With uniqueness;
First generation module, under each corresponding relationship, to being different from described first at least two data file
Data under other data dimensions of data dimension, second data dimension and the third data dimension are summarized, raw
Data report at first data dimension and second data dimension about the third data dimension.
8. device according to claim 5, which is characterized in that the generation reporting unit further include:
Second generation module summarizes data report for generate first data dimension and second data dimension.
9. a kind of storage medium, which is characterized in that the storage medium includes the program of storage, wherein run in described program
When control the storage medium where equipment execute a kind of such as data processing method of any of claims 1-4.
10. a kind of processor, which is characterized in that the processor executes such as right for running program when described program is run
It is required that a kind of data processing method described in any one of 1-4.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710904814.1A CN110147352A (en) | 2017-09-29 | 2017-09-29 | A kind of data processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710904814.1A CN110147352A (en) | 2017-09-29 | 2017-09-29 | A kind of data processing method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110147352A true CN110147352A (en) | 2019-08-20 |
Family
ID=67588028
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710904814.1A Pending CN110147352A (en) | 2017-09-29 | 2017-09-29 | A kind of data processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110147352A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103020158A (en) * | 2012-11-26 | 2013-04-03 | 中兴通讯股份有限公司 | Report form creation method, device and system |
CN104484398A (en) * | 2014-12-12 | 2015-04-01 | 北京国双科技有限公司 | Method and device for aggregation of data in datasheet |
US20150161185A1 (en) * | 2013-12-09 | 2015-06-11 | Linkedin Corporation | Enabling and performing count-distinct queries on a large set of data |
CN106528511A (en) * | 2016-09-30 | 2017-03-22 | 东软集团股份有限公司 | Form analysis method and device |
-
2017
- 2017-09-29 CN CN201710904814.1A patent/CN110147352A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103020158A (en) * | 2012-11-26 | 2013-04-03 | 中兴通讯股份有限公司 | Report form creation method, device and system |
US20150161185A1 (en) * | 2013-12-09 | 2015-06-11 | Linkedin Corporation | Enabling and performing count-distinct queries on a large set of data |
CN104484398A (en) * | 2014-12-12 | 2015-04-01 | 北京国双科技有限公司 | Method and device for aggregation of data in datasheet |
CN106528511A (en) * | 2016-09-30 | 2017-03-22 | 东软集团股份有限公司 | Form analysis method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2015148159A1 (en) | Determining a temporary transaction limit | |
JP6779231B2 (en) | Data processing method and system | |
US20130006996A1 (en) | Clustering E-Mails Using Collaborative Information | |
CN107391532B (en) | Data filtering method and device | |
CN110188100A (en) | Data processing method, device and computer storage medium | |
WO2016101811A1 (en) | Information arrangement method and apparatus | |
WO2017133568A1 (en) | Mining method and device for target characteristic data | |
CN106815254A (en) | A kind of data processing method and device | |
CN110457182A (en) | A kind of load balancing cluster example operating index monitoring system | |
CN103235811A (en) | Data storage method and device | |
US11144793B2 (en) | Incremental clustering of a data stream via an orthogonal transform based indexing | |
EP3437060A1 (en) | Rule based hierarchical configuration | |
CN102982112A (en) | Ranking list generation method and journal generation method and server | |
CN106815274A (en) | Daily record data method for digging and system based on Hadoop | |
CN102932416A (en) | Intermediate data storage method, processing method and device in information flow task | |
CN110069453A (en) | Operation/maintenance data treating method and apparatus | |
CN106570005A (en) | Database cleaning method and device | |
US20170235625A1 (en) | Data mining using categorical attributes | |
CN107391533A (en) | Generate the method and device of graphic data base Query Result | |
CN111143546A (en) | Method and device for obtaining recommendation language and electronic equipment | |
CN110147352A (en) | A kind of data processing method and device | |
CN109582476A (en) | Data processing method, apparatus and system | |
Prakashbhai et al. | Inference patterns from Big Data using aggregation, filtering and tagging-A survey | |
CN106708845A (en) | Data processing method and device for Internet account | |
CN109068286A (en) | A kind of method, medium and the equipment of information parsing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
CB02 | Change of applicant information |
Address after: 100080 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing Applicant after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd. Address before: 100086 Beijing city Haidian District Shuangyushu Area No. 76 Zhichun Road cuigongfandian 8 layer A Applicant before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd. |
|
CB02 | Change of applicant information | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190820 |
|
RJ01 | Rejection of invention patent application after publication |