CN106407233B - A kind of data processing method and equipment - Google Patents

A kind of data processing method and equipment Download PDF

Info

Publication number
CN106407233B
CN106407233B CN201510468507.4A CN201510468507A CN106407233B CN 106407233 B CN106407233 B CN 106407233B CN 201510468507 A CN201510468507 A CN 201510468507A CN 106407233 B CN106407233 B CN 106407233B
Authority
CN
China
Prior art keywords
data
business datum
target service
tables
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510468507.4A
Other languages
Chinese (zh)
Other versions
CN106407233A (en
Inventor
吴磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201510468507.4A priority Critical patent/CN106407233B/en
Publication of CN106407233A publication Critical patent/CN106407233A/en
Application granted granted Critical
Publication of CN106407233B publication Critical patent/CN106407233B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

This application discloses a kind of data processing method and equipment, it include: the multi-group data information associated with target service for obtaining and being stored in the first tables of data, the generation time of the business datum comprising target service and the first data content of business datum in each group of data information;When in the first tables of data data wander occurs for the business datum for determining target service, the second data content associated with the business datum of target service that data wander occurs is obtained from the second tables of data;First data content of the business datum that will acquire and the second data content of business datum merge, data cleansing operation is executed to the data content after merging, thus effectively avoids because data wander causes to occur omitting cumulative problem in business datum merging process, effectively improve the accuracy of the business datum stored in data warehouse, the Data Warehouse method of synchronization is simplified simultaneously, Data Warehouse treatment effeciency is effectively promoted.

Description

A kind of data processing method and equipment
Technical field
This application involves internet information processing technique more particularly to a kind of data processing methods and equipment.
Background technique
Data warehouse be subject-oriented (English: Subject Oriented), it is integrated (English: Integrated), the number of metastable (English: Non-Volatile), reflecting history variation (English: Time Variant) According to set.The data file of structuring is mapped as a tables of data in data warehouse.
Data in data warehouse are to execute the bases such as data pick-up, data scrubbing to the data in original dispersion database It processes, summarize and arranges by system on plinth operation, can guarantee that the data in data warehouse eliminate source in this way The inconsistency of data.
In practical applications, the corresponding multiple business datums of a business may occur in multiple and different systems, and Between system the time difference of interaction may cause again time appearance that the business datum stores in the tables of data of not homologous ray across It the case where, this phenomenon are known as data wander.Such as: a business is an order business, then generating one group in table 1 Data related with the order business: generation time, the order information of the order business and payment events information, corresponding production The raw time is No. 1 23:59:00;One group of data related with the order business: generation time, the order business are generated in table 2 Order information and payment amount information, corresponding generation time be No. 2 00:00:00, it can be seen that, the order business Data wander occurs for business datum.
However that data wander occurs is inevitable for business datum between system, then by the business number in not homologous ray According to being synchronized in data warehouse, how to be effectively prevented from because data wander causes to occur omitting in business datum cumulative process to tire out The problem of addition is urgent need to resolve.Make in data warehouse just because of data wander through the standard of cumulative obtained business datum True property is lower.
Summary of the invention
In view of this, the embodiment of the present application provides a kind of data processing method and equipment, for solving in the prior art It is existing how to be effectively prevented from because data wander causes to occur omitting cumulative problem in business datum cumulative process.
A kind of data processing method, comprising:
Obtain the multi-group data information associated with target service stored in the first tables of data, wherein described in each group The generation time of business datum in data information comprising the target service and the first data content of the business datum;
When in first tables of data data wander occurs for the business datum for determining the target service, from the second number The second data content associated with the business datum of the target service of data wander occurs according to acquisition in table, In, first tables of data is different from second tables of data;
First data content of the business datum that will acquire is closed with the second data content of the business datum And data cleansing operation is executed to the data content after merging.
A kind of data processing equipment, comprising:
Acquiring unit, for obtaining the multi-group data information associated with target service stored in the first tables of data, In, it include the generation time and the business datum of the business datum of the target service in data information described in each group First data content;
The acquiring unit is also used to occur in first tables of data in the business datum for determining the target service When data wander, is obtained from the second tables of data and the associated with the business datum of the target service of data wander occurs The second data content, wherein first tables of data is different from second tables of data;
Processing unit, the first data content of the business datum for will acquire and the second number of the business datum It is merged according to content, data cleansing operation is executed to the data content after merging.
The application has the beneficial effect that:
The embodiment of the present application obtains the multi-group data information associated with target service stored in the first tables of data, each First number of the generation time of the business datum comprising the target service and the business datum in the group data information According to content;When in first tables of data data wander occurs for the business datum for determining the target service, from the second number It is described according to the second data content associated with the business datum of the target service for obtaining generation data wander in table First tables of data is different from second tables of data;First data content of the business datum that will acquire and the business number According to the second data content merge, to after merging data content execute data cleansing operation, in this way, data warehouse into Before row data cleansing, judge whether the business datum obtained occurs data wander, and is determining business datum generation data drift When shifting, the data content that the business datum of data wander occurs is obtained, and then merge to the data content of business datum, had It avoids to effect because data wander causes to occur omitting cumulative problem in business datum merging process, effectively improves data The accuracy of the business datum stored in warehouse, while the Data Warehouse method of synchronization is simplified, number is effectively promoted According to data-handling efficiency in warehouse.
Detailed description of the invention
In order to more clearly explain the technical solutions in the embodiments of the present application, make required in being described below to embodiment Attached drawing is briefly introduced, it should be apparent that, the drawings in the following description are only some examples of the present application, for this For the those of ordinary skill in field, without any creative labor, it can also be obtained according to these attached drawings His attached drawing.
Fig. 1 is a kind of flow diagram of data processing method provided by the embodiments of the present application;
Fig. 2 is a kind of structural schematic diagram of data processing equipment provided by the embodiments of the present application.
Specific embodiment
In order to realize the purpose of the embodiment of the present application, the embodiment of the present application provides a kind of data processing method and equipment, The multi-group data information associated with target service stored in the first tables of data is obtained, includes in data information described in each group First data content of the generation time of the business datum of the target service and the business datum;Determining the target When data wander occurs in first tables of data for the business datum of business, is obtained from the second tables of data and data wander occurs The second data content associated with the business datum of the target service, first tables of data and it is described second number According to table difference;First data content of the business datum that will acquire is closed with the second data content of the business datum And data cleansing operation is executed to the data content after merging, in this way, data warehouse is before carrying out data cleansing, judgement is obtained Whether the business datum taken occurs data wander, and when determining that data wander occurs for business datum, obtains and data wander occurs Business datum data content, and then the data content of business datum is merged, is efficiently avoided because data are floated Moving causes to occur in business datum merging process to omit cumulative problem, effectively improves the business datum stored in data warehouse Accuracy, while simplifying the Data Warehouse method of synchronization, Data Warehouse treatment effeciency be effectively promoted.
It should be noted that data cleansing described in the embodiment of the present application refer to data warehouse to the data being drawn into Row cleaning, finds and corrects mistake present in data.Generally comprise check data consistency, to occur invalid value or lack The data of mistake value are handled.Here processing may include deletion.
The embodiment of the present application can be applied to for multistage business, such as: installment business, or need to hold The business etc. of row multi-pass operation.
The each embodiment of the application is described in further detail with reference to the accompanying drawings of the specification.Obviously, described Embodiment is only a part of the embodiment of the application, instead of all the embodiments.Based on the embodiment in the application, ability Domain those of ordinary skill all other embodiment obtained without making creative work belongs to the application guarantor The range of shield.
Fig. 1 is a kind of flow diagram of data processing method provided by the embodiments of the present application.The method can be as follows It is described.The executing subject of the embodiment of the present application can be data warehouse.
Step 101: obtaining the multi-group data information associated with target service stored in the first tables of data.
Wherein, the generation time of business datum in data information described in each group comprising the target service and described First data content of business datum.
In a step 101, since data warehouse has the ability being managed to mass data, each decentralized system acquisition To business datum need in specified data to be synchronized to data warehouse synchronization time, to realize data warehouse to mass data Management.
The function of data warehouse can realize by some tools, such as: open data processing service (English: Open Data Processing Service;Abbreviation: ODPS);Hive tool etc..
It should be noted that Hive is a kind of open source Tool for Data Warehouse based on Hadoop, it can be by the data of structuring File Mapping is a tables of data, and is capable of providing simple SQL query function, SQL statement can also be converted to Map Reduce task is run.
Data warehouse is generally required when completion data are synchronous by data pick-up and the two stages of data cleansing.Its In, data pick-up refers to that data warehouse acquires the business datum that each system within a specified time acquires from decentralized system.
It should be noted that specified time can also can set, example according to system requirements determine according to actual needs Such as: daily 00:00:00~23:59:59.
The data warehouse execution data synchronous time can be timing, be also possible to periodically, such as: it is set as every Its 00:00:00~00:30:00;Or it is set as 00:00:00~00:30:00 etc. on every Mondays.Assuming that data warehouse executes The data synchronous time is set as daily 00:00:00~00:30:00, then within this period, data warehouse from point The business datum acquired within the previous day is extracted in the system of dissipating.Such as: in No. 2 00:00:00~00:30:00, data warehouse The business datum acquired at No. 1 is extracted from decentralized system.
Usual decentralized system stores the business datum of acquisition in one day by the way of table.
In this way, data warehouse obtains associated with target service more when execution data are synchronous from the first tables of data Group data information.
In the first tables of data, for different business, data are generated for each business datum that each business generates The business datum generated in information, the i.e. service identification comprising business, the generation time and the generation time of business datum Data content etc..
Due to the case where in practical applications, will appear across day generation due to the data content of business datum, lead to business There is a phenomenon where data wanders for data content, that is, are directed to the business datum of target service, and the change time of business datum occurs 1 Number 23:59:59;But occur for the corresponding data content of the change in No. 2 00:00:00.In systems, for No. 2 00: There is a possibility that being considered as invalid data in the data content that 00:00 is generated, when executing data cleansing, which will be clear It washes, causes the business datum of target service imperfect in this way.
Step 102: for wherein one group of data information, judging the business datum of the target service whether described first Data wander occurs in tables of data;If data wander occurs, 103 are thened follow the steps;If data wander not yet occurs, according to existing There is technical solution to carry out data pick-up.
In a step 102, for wherein one group of data information, according to the target service for including in the data information Business datum generation time, when judging whether the generation time of the business datum of the target service is included in default first Between within the scope of.
Wherein, the default first time range extracts business datum from different system databases according to data warehouse Time determine.
If the generation time that judging result is the business datum of the target service be included in default first time range it It is interior, it is determined that in first tables of data data wander occurs for the business datum of the target service.
Specifically, for one group of data information in the first tables of data, it is assumed that business datum content in one group of data information The business number in the data information is further determined that at this time according to the generation time of the business datum in the data information for sky According to generation time whether be included in default first time within the scope of, if the generation time of the business datum in the data information Within the scope of default first time, then it can determine that the business datum in the data information occurs in the first tables of data Data wander.
Such as: the time that data warehouse extracts business datum from different system databases be determined as 00:00:00~ 00:30:00, then default first time range can determine are as follows: 23:59:50~23:59:59, once the target service The generation time of business datum is included within 23:59:50~23:59:59, it is determined that the business datum of the target service exists Data wander occurs in first tables of data.
Step 103: when in first tables of data data wander occurs for the business datum for determining the target service, The second data associated with the business datum of the target service that data wander occurs are obtained from the second tables of data Content.
Wherein, first tables of data is different from second tables of data.
In step 103, after due to data wander, the data content of business datum is possibly stored in another data In table, the associated with the business datum of the target service of data wander occurs then obtaining from the second tables of data Second data content.
Specifically, generated in default second time range from searching in the second tables of data, and with the target service Associated data content, wherein default second time range is used for characterize data warehouse from different system databases Middle extraction business datum;When determining that the data content searched is associated with the business datum of the target service, will look into The data content found is as the second data associated with the business datum of the target service that data wander occurs Content.
It should be noted that the default first time range and default second time range are different, but preset the Time difference between one time range and default second time range meets given threshold.
The given threshold can also can be determined according to the characteristic of data wander determine according to actual needs.
The tables of data of the service identification comprising target service is searched first from other tables of data (it is assumed that being second Tables of data);
Secondly, generated in default second time range from searching in the second tables of data, and with the target service phase Associated data content determines that generation time is included in that is, according to the generation time for the business datum for including in the second tables of data Business datum in default second time range, and from determining that data occur with the first tables of data in determining business datum The data content of drift.
As shown in table 1, it is the schematic table of the first tables of data and the second tables of data:
Table 1
Step 104: in the first data content of the business datum that will acquire and the second data of the business datum Appearance merges, and executes data cleansing operation to the data content after merging.
At step 104, for the business datum being drawn into, by the first data content of the business datum and the industry Second data content of business data merges, and obtains the partial data content of the business datum.
In another embodiment of the application, data warehouse needs to update historical data after completing data pick-up, Therefore, data warehouse obtains the historical data content of the business datum of the target service again;And by the history number It is closed according to the second data content of content, the first data content of the business datum of acquisition and the business datum And.
In another embodiment of the application, data warehouse is right in the data information being drawn into the first tables of data It, can first will be in the historical data of the business datum of the target service in the business datum that data wander not yet occurs Hold and is merged with the first data content of the business datum obtained;Secondly by amalgamation result and the industry that gets Second data content of business data merges.
By data processing method provided by the embodiments of the present application, obtain stored in the first tables of data with target service phase Associated multi-group data information, the generation time of the business datum in data information described in each group comprising the target service with And the first data content of the business datum;It is sent out in first tables of data in the business datum for determining the target service When raw data wander, is obtained from the second tables of data and the related to the business datum of the target service of data wander occurs Second data content of connection, first tables of data are different from second tables of data;The of the business datum that will acquire One data content and the second data content of the business datum merge, and execute data cleansing to the data content after merging Operation, in this way, data warehouse before carrying out data cleansing, judges whether the business datum of acquisition occurs data wander, and When determining that data wander occurs for business datum, the data content that the business datum of data wander occurs is obtained, and then to business number According to data content merge, efficiently avoid because data wander cause to occur omitting in business datum merging process it is tired The problem of adding, effectively improves the accuracy of the business datum stored in data warehouse.
Such as: for target service, there are following groups data informations, as shown in table 2:
Table 2
The service identification of target service Generation time Business datum Data content
1111 No. 1 11:59:59 Payment 10
1111 No. 2 23:59:59 Payment It is empty
1111 No. 3 00:00:00 It is empty 20
If the time that data warehouse extracts business datum is No. 2 00:00:00~00:30:00, due to the production of business datum The raw time is No. 1 11:59:59, is not included within default first time range (23:59:50~23:59:59), then extracting Data content to the business datum of target service is 10;If the time that data warehouse extracts business datum is No. 3 00:00:00 ~00:30:00, since the generation time of business datum is No. 2 23:59:59, be included in default first time range (23:59: 50~23:59:59) within, then it is determined that data wander occurs for the business datum, need at this time further from when presetting second Between the data content that the business datum of data wander occurs is determined within range (00:00:00~00:15:00), that is, get 20, in this way, data warehouse can the relatively accurate business datum to the target service, will not because of in data information because lack It loses content and causes the data information invalid, efficiently avoid because data wander causes to occur in business datum merging process Cumulative problem is omitted, the accuracy of the business datum stored in data warehouse is effectively improved.
Fig. 2 is a kind of structural schematic diagram of data processing equipment provided by the embodiments of the present application.The data processing equipment It include: acquiring unit 21 and processing unit 22, in which:
Acquiring unit 21, for obtaining the multi-group data information associated with target service stored in the first tables of data, Wherein, the generation time and the business datum of the business datum in data information described in each group comprising the target service The first data content;
The acquiring unit 21 is also used to send out in first tables of data in the business datum for determining the target service When raw data wander, is obtained from the second tables of data and the related to the business datum of the target service of data wander occurs Second data content of connection, wherein first tables of data is different from second tables of data;
Processing unit 22, the first data content of the business datum for will acquire and the second of the business datum Data content merges, and executes data cleansing operation to the data content after merging.
Specifically, the acquiring unit 21 determines that the business datum of the target service occurs in first tables of data Data wander, comprising:
For wherein one group of data information, according to the business datum for the target service for including in the data information Generation time, judges whether the generation time of the business datum of the target service was included within the scope of default first time, Wherein, the default first time range extracted from different system databases according to data warehouse business datum time it is true It is fixed;
If the generation time that judging result is the business datum of the target service be included in default first time range it It is interior, it is determined that in first tables of data data wander occurs for the business datum of the target service.
Specifically, the acquiring unit 21 obtained from the second tables of data occur data wander with the target service Associated second data content of business datum, comprising:
It is generated in default second time range from lookup in the second tables of data, and associated with the target service Data content, wherein default second time range extracts industry from different system databases for characterize data warehouse Business data;
When determining that the data content searched is associated with the business datum of the target service, the number that will find According to content as the second data content associated with the business datum of the target service that data wander occurs.
Specifically, the first data content of the business datum that the processing unit 22 will acquire and the business datum The second data content merge, comprising:
Obtain the historical data content of the business datum of the target service;
By the historical data content, the first data content of the business datum of acquisition and the business datum Second data content merges.
It should be noted that equipment provided by the embodiments of the present application can be realized by hardware mode, it can also be by soft Part mode realizes, here without limitation,
The equipment judges whether the business datum obtained occurs data wander before carrying out data cleansing, and true When determining business datum generation data wander, the data content that the business datum of data wander occurs is obtained, and then to business datum Data content merge, efficiently avoid because data wander cause to occur omitting in business datum merging process it is cumulative The problem of, effectively improve the accuracy of the business datum stored in data warehouse.
It will be understood by those skilled in the art that embodiments herein can provide as method, apparatus (equipment) or computer Program product.Therefore, in terms of the application can be used complete hardware embodiment, complete software embodiment or combine software and hardware Embodiment form.Moreover, it wherein includes the meter of computer usable program code that the application, which can be used in one or more, The computer journey implemented in calculation machine usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) The form of sequence product.
The application is flow chart of the reference according to method, apparatus (equipment) and computer program product of the embodiment of the present application And/or block diagram describes.It should be understood that each process in flowchart and/or the block diagram can be realized by computer program instructions And/or the combination of the process and/or box in box and flowchart and/or the block diagram.It can provide these computer programs to refer to Enable the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to generate One machine so that by the instruction that the processor of computer or other programmable data processing devices executes generate for realizing The device for the function of being specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although the preferred embodiment of the application has been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the application range.
Obviously, those skilled in the art various changes and modifications can be made to the invention without departing from the application model It encloses.In this way, if these modifications and variations of the application belong within the scope of the claim of this application and its equivalent technologies, then The application is also intended to include these modifications and variations.

Claims (8)

1. a kind of data processing method characterized by comprising
Obtain the multi-group data information associated with target service stored in the first tables of data, wherein data described in each group The generation time of business datum in information comprising the target service and the first data content of the business datum;
When in first tables of data data wander occurs for the business datum for determining the target service, from the second tables of data It is middle to obtain the second data content associated with the business datum of the target service that data wander occurs, wherein institute It is different from second tables of data to state the first tables of data;
First data content of the business datum that will acquire is merged with the second data content of the business datum, right Data content after merging executes data cleansing operation.
2. data processing method as described in claim 1, which is characterized in that determine the business datum of the target service in institute It states in the first tables of data and data wander occurs, comprising:
For wherein one group of data information, according to the generation of the business datum for the target service for including in the data information Time, judge whether the generation time of the business datum of the target service was included within the scope of default first time, wherein The time that the default first time range extracts business datum according to data warehouse from different system databases determines;
If the generation time that judging result is the business datum of the target service was included within the scope of default first time, Determine that in first tables of data data wander occurs for the business datum of the target service.
3. data processing method as claimed in claim 2, which is characterized in that obtained from the second tables of data and data wander occurs The second data content associated with the business datum of the target service, comprising:
It is generated in default second time range from lookup in the second tables of data, and data associated with the target service Content, wherein default second time range extracts business number for characterize data warehouse from different system databases It is different from default second time range according to, the default first time range, the default first time range with it is described Time difference between default second time range meets given threshold;
It, will be in the data that found when determining that the data content searched is associated with the business datum of the target service Hold as the second data content associated with the business datum of the target service that data wander occurs.
4. data processing method as described in any one of claims 1 to 3, which is characterized in that the business datum that will acquire The first data content and the second data content of the business datum merge, comprising:
Obtain the historical data content of the business datum of the target service;
By the second of the historical data content, the first data content of the business datum of acquisition and the business datum Data content merges.
5. a kind of data processing equipment characterized by comprising
Acquiring unit, for obtaining the multi-group data information associated with target service stored in the first tables of data, wherein every In data information described in one group comprising the target service business datum generation time and the business datum first Data content;
The acquiring unit is also used to that data occur in first tables of data in the business datum for determining the target service When drift, is obtained from the second tables of data and occur associated with the business datum of the target service the of data wander Two data contents, wherein first tables of data is different from second tables of data;
In processing unit, the first data content of the business datum for will acquire and the second data of the business datum Appearance merges, and executes data cleansing operation to the data content after merging.
6. data processing equipment as claimed in claim 5, which is characterized in that the acquiring unit determines the target service In first tables of data data wander occurs for business datum, comprising:
For wherein one group of data information, according to the generation of the business datum for the target service for including in the data information Time, judge whether the generation time of the business datum of the target service was included within the scope of default first time, wherein The time that the default first time range extracts business datum according to data warehouse from different system databases determines;
If the generation time that judging result is the business datum of the target service was included within the scope of default first time, Determine that in first tables of data data wander occurs for the business datum of the target service.
7. data processing equipment as claimed in claim 6, which is characterized in that the acquiring unit is obtained from the second tables of data The second data content associated with the business datum of the target service of data wander occurs, comprising:
It is generated in default second time range from lookup in the second tables of data, and data associated with the target service Content, wherein default second time range extracts business number for characterize data warehouse from different system databases It is different from default second time range according to, the default first time range, the default first time range with it is described Time difference between default second time range meets given threshold;
It, will be in the data that found when determining that the data content searched is associated with the business datum of the target service Hold as the second data content associated with the business datum of the target service that data wander occurs.
8. such as the described in any item data processing equipments of claim 5 to 7, which is characterized in that the processing unit will acquire First data content of the business datum is merged with the second data content of the business datum, comprising:
Obtain the historical data content of the business datum of the target service;
By the second of the historical data content, the first data content of the business datum of acquisition and the business datum Data content merges.
CN201510468507.4A 2015-08-03 2015-08-03 A kind of data processing method and equipment Active CN106407233B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510468507.4A CN106407233B (en) 2015-08-03 2015-08-03 A kind of data processing method and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510468507.4A CN106407233B (en) 2015-08-03 2015-08-03 A kind of data processing method and equipment

Publications (2)

Publication Number Publication Date
CN106407233A CN106407233A (en) 2017-02-15
CN106407233B true CN106407233B (en) 2019-08-02

Family

ID=58008148

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510468507.4A Active CN106407233B (en) 2015-08-03 2015-08-03 A kind of data processing method and equipment

Country Status (1)

Country Link
CN (1) CN106407233B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107025266B (en) * 2017-02-28 2020-10-30 创新先进技术有限公司 Service data processing method and device
US10606828B2 (en) * 2017-10-19 2020-03-31 Jpmorgan Chase Bank, N.A. Storage correlation engine
CN107943840B (en) * 2017-10-30 2022-01-11 深圳前海微众银行股份有限公司 Data processing method, system and computer readable storage medium
CN108897818B (en) * 2018-06-20 2020-12-01 北京三快在线科技有限公司 Method and device for determining aging state of data processing process and readable storage medium
CN112069193A (en) * 2020-08-27 2020-12-11 上海上讯信息技术股份有限公司 Correlation method and device based on asynchronous correlation

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101964100A (en) * 2010-09-28 2011-02-02 北京正邦高科信息技术有限公司 Method and system for calculating incoming lines of media
CN104360997A (en) * 2014-04-01 2015-02-18 芜湖齐创自动化系统有限公司 Big data drifting technology based on structured database
CN104462082A (en) * 2013-09-12 2015-03-25 深圳中科金证科技有限公司 Data warehouse based medical data integration method and system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101964100A (en) * 2010-09-28 2011-02-02 北京正邦高科信息技术有限公司 Method and system for calculating incoming lines of media
CN104462082A (en) * 2013-09-12 2015-03-25 深圳中科金证科技有限公司 Data warehouse based medical data integration method and system
CN104360997A (en) * 2014-04-01 2015-02-18 芜湖齐创自动化系统有限公司 Big data drifting technology based on structured database

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"农信社建设数据仓库的几点思考";任伯群;《金融电子化》;20150520(第5期);第78-79页
"基于产业链协作平台的商务智能架构及数据挖掘技术探讨";穆俊;《鸡西大学学报》;20150515;第15卷(第5期);第25-28页

Also Published As

Publication number Publication date
CN106407233A (en) 2017-02-15

Similar Documents

Publication Publication Date Title
CN106407233B (en) A kind of data processing method and equipment
CN108153784B (en) Synchronous data processing method and device
US9323809B2 (en) System and methods for rapid data analysis
CN106453437B (en) equipment identification code acquisition method and device
WO2017107853A1 (en) Data monitoring management method, and data monitoring method and system
CN103577474B (en) The update method and system of a kind of database
CN107016018B (en) Database index creation method and device
CN109145003B (en) Method and device for constructing knowledge graph
CN108628972B (en) Data table processing method and device and storage medium
CN106970929A (en) Data lead-in method and device
CN106648839B (en) Data processing method and device
CN107748752A (en) A kind of data processing method and device
CN110555108B (en) Event context generation method, device, equipment and storage medium
US20190087470A1 (en) System and method for mining user cycle mode
JP2020123320A (en) Method, apparatus, device and storage medium for managing index
US10970295B2 (en) Collecting statistics in unconventional database environments
US20180011945A1 (en) Inferring graph topologies
CN112307151B (en) Navigation data processing method and device
CN113672692A (en) Data processing method, data processing device, computer equipment and storage medium
CN105354224A (en) Knowledge data processing method and apparatus
CN105243277A (en) Computer-aided medical data processing system and method
CN111125090B (en) Data access method and device
CN112559493A (en) Data blood relationship analysis method, computer device, and storage medium
CN116432185B (en) Abnormality detection method and device, readable storage medium and electronic equipment
CN110083602B (en) Method and device for data storage and data processing based on hive table

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200922

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Patentee after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Patentee before: Advanced innovation technology Co.,Ltd.

Effective date of registration: 20200922

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman, British Islands

Patentee after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: Alibaba Group Holding Ltd.

TR01 Transfer of patent right