CN104462604A - Data processing method and system - Google Patents
Data processing method and system Download PDFInfo
- Publication number
- CN104462604A CN104462604A CN201410855040.4A CN201410855040A CN104462604A CN 104462604 A CN104462604 A CN 104462604A CN 201410855040 A CN201410855040 A CN 201410855040A CN 104462604 A CN104462604 A CN 104462604A
- Authority
- CN
- China
- Prior art keywords
- data
- module
- processing
- database
- verification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/0703—Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention relates to the technical field of computer information processing and discloses a data processing method and system. The data processing method and system aim to solve the problem that data are difficult to integrate and utilize in the data processing process. The data processing method mainly comprises the steps of collecting data from a data source, processing and disposing the data, verifying the processed and disposed data and outputting the verified data according to application demands. According to the technical scheme, data are cleaned, data formats are regulated, and the application range of the data is improved through secondary processing.
Description
Technical field
The present invention relates to technical field of computer information processing, particularly a kind of data processing method and system.
Background technology
Along with large data association area is fast-developing, the data of different system, disparate databases are integrated, and be applied in new market demand software or the degree of depth of carrying out data excavates the developing direction having become large data fields.At present, difficult point maximum in data mart modeling is that timing node that is different due to Data Source, data application demand that is different, data is different, causes the unitarity of data, integrality incomplete, affects integration and the utilization of data.
Summary of the invention
[technical matters that will solve]
The object of this invention is to provide a kind of data processing method and system, utilize the problem of difficulty to solve Data Integration in data mart modeling process.
[technical scheme]
The present invention is achieved by the following technical solutions.
The present invention relates to a kind of data processing method, the method comprises the following steps:
The data transformations of collection is that unified form is stored in the first database by steps A: from data source image data;
Step B: processing and sorting is carried out to the data in the first database, described processing and sorting specifically comprises raw data cleaning, providing data formatting process, Data Comparison, data correlation process, data secondary processing;
Step C: the data after processing and sorting are verified;
Step D: export the data after verification to second database according to application demand, described second database is the database of operation system.
As one preferred embodiment, described data source is Database Systems and/or internet.
As another preferred embodiment, from the method for Database Systems image data be: utilize data transformations instrument the batch data Database Systems to be imported in the first database.
As another preferred embodiment, comprise from the method for internet image data: targeted website is located; Webpage source code is analyzed; Website data modeling; Data grabber.
As another preferred embodiment, in described step C be verified as data uniqueness verification, data layout verification or data plausibility check.
As another preferred embodiment, the verification of described data uniqueness specifically comprises to be carried out unicity retrieval to tables of data field or carries out unicity retrieval to the combination of multiple field;
Described data layout verification comprises to be retrieved the type of data;
The plausibility check of described data comprises and judging date, character length, type.
As another preferred embodiment, described raw data cleaning specifically comprises apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting process comprises deletion and the replacement of special character;
Described Data Comparison comprises and the data in different pieces of information source being contrasted according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation process comprises carries out index by the data be associated in different pieces of information table, and sets up index relative;
Data directory is set up in the data mining that described data secondary processing comprises for raw data.
The invention still further relates to a kind of NC manufacturing system, this system comprises data acquisition module, data mart modeling module, data check module and statistical conversion module,
Described data acquisition module is used for from data source image data, and is that unified form is stored in the first database by the data transformations of collection;
Described data mart modeling module is used for carrying out processing and sorting to the data in the first database, and described processing module specifically comprises raw data cleaning module, providing data formatting processing module, Data Comparison module, data correlation processing module, data secondary processing module;
Described data check module is used for verifying the data after processing and sorting;
Described statistical conversion module is used for the data after by verification and exports the second database to according to application demand, and described second database is the database of operation system.
As one preferred embodiment, described data check module specifically comprises data uniqueness correction verification module, data layout correction verification module or data plausibility check module,
Described data uniqueness correction verification module is used for carrying out unicity retrieval to tables of data field or carrying out unicity retrieval to the combination of multiple field;
Described data layout correction verification module is used for retrieving the type of data;
Described data plausibility check module is used for judging date, character length, type.
As another preferred embodiment, described raw data cleaning module is used for apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting processing module is used for deletion and the replacement of special character;
Described Data Comparison module is used for the data in different pieces of information source to contrast according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation processing module is used for the data be associated in different pieces of information table to carry out index, and sets up index relative;
Described data secondary processing module is used for setting up data directory for the data mining of raw data.
[beneficial effect]
The needs that unified for the data in different pieces of information source processing process can utilize to meet Data Integration by the technical scheme that the present invention proposes.
Accompanying drawing explanation
The structured flowchart of the NC manufacturing system that Fig. 1 provides for embodiments of the invention one;
The process flow diagram of the data processing method that Fig. 2 provides for embodiments of the invention two;
The process flow diagram of the data processing method that Fig. 3 provides for embodiments of the invention three.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, below in conjunction with accompanying drawing, clear, complete description is carried out to the specific embodiment of the present invention, obviously, described embodiment is a part of embodiment of the present invention, instead of whole embodiment, neither limitation of the present invention.Based on embodiments of the invention, those of ordinary skill in the art, not paying the every other embodiment obtained under creative work prerequisite, belong to protection scope of the present invention.
The structural drawing of the NC manufacturing system that Fig. 1 provides for the embodiment of the present invention one, as shown in Figure 1, this system comprises data acquisition module 101, data mart modeling module 102, data check module 103 and statistical conversion module 104,
The data transformations of collection for from data source image data, and is that unified form is stored in the first database by data acquisition module 101.
Data mart modeling module 102 is for carrying out processing and sorting to the data in the first database, and described processing module specifically comprises raw data cleaning module, providing data formatting processing module, Data Comparison module, data correlation processing module, data secondary processing module.Wherein raw data cleaning module is used for apparent error data processing, repeating data process and data merging treatment; Providing data formatting processing module is used for deletion and the replacement of special character; Data Comparison module is used for the data in different pieces of information source to contrast according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node; Data correlation processing module is used for the data be associated in different pieces of information table to carry out index, and sets up index relative; Data secondary processing module is used for setting up data directory for the data mining of raw data.
Data check module 103 is for verifying the data after processing and sorting.In the present embodiment, data check module specifically comprises data uniqueness correction verification module, data layout correction verification module or data plausibility check module, and data uniqueness correction verification module is used for carrying out unicity retrieval to tables of data field or carrying out unicity retrieval to the combination of multiple field; Data layout verification is used for retrieving the type of data; The plausibility check of data is used for judging date, character length, type.
Statistical conversion module 104 is for exporting the data after verification to second database according to application demand, and wherein the second database is the database of operation system.
Adopting the NC manufacturing system that provides of embodiment one to carry out the method for data acquisition can with reference to following concrete grammar embodiment.
The process flow diagram of the business data job operation that Fig. 2 provides for the embodiment of the present invention two.As shown in Figure 2,201 to step that the method comprising the steps of 211, is described to step 211 step 201 below respectively.
Step 201: be saved to data collection data storehouse from enterprise database system acquisition raw data.
Particularly, by utilizing the statistical conversion instrument of SQL Server, the business data be stored in SQL Server database being derived, then raw data is imported in the data collection data storehouse for data mart modeling.
Step 202: raw data is cleared up.
Data in step 202 pair data acquisition database are cleared up, and specifically comprise:
Repeating data process: combing is carried out to identical business data such as enterprise name, address or operation licenses number, carries out after comparison one by one merging or deleting;
Delete special data: mainly for the misdata in raw data, repeating data carry out deleting, union operation.As deleted enterprise name for " XXX test data ".
Process for some ragged array: such as NULL and empty data are unified into NULL; Correct for the data of apparent error in date field, such as, be that the date of " 2031-5-8 " is processed into " 2013-5-8 " by the Date of Incorporation, be less than the data scrubbing that 1949 are greater than 2014 years the Date of Incorporation.
Step 203: formatted data.
Step 203, for the deletion of special character and replacement, specifically comprises:
Special character in format legal person, enterprise name: such as the character of band * is replaced with sky;
Format registered enterprise fund: calculate registered capital (Currency Type of such as some registered enterprise funds is dollar, yen etc.) according to Currency Type and the current exchange rate, and be unified into Renminbi.
Step 204: Data Comparison.
Particularly, the raw data of the business data existed in NC manufacturing system and collection is contrasted, there is the part field that same enterprise then upgrades the business data in NC manufacturing system, as upgraded enterprise address, website field etc.
Step 205: raw data association process.
Raw data in step 205 pair data acquisition database carries out association process, specifically comprise the data field one_to_one corresponding enterprise stored in raw data enterprise literary name section and NC manufacturing system shown, then during the table data importing of original enterprise is shown to the enterprise that NC manufacturing system is corresponding.
Step 206: data secondary processing.
Step 206 mainly comprises: the industry etc. of the unique identification of the data processing of the enterprise that stops doing business, more new spectra, more new spectra modification information, renewal enterprise area field, more industry, the more new spectra of new spectra.
The data processing of enterprise of stopping doing business in step 206 specifically divides three kinds of situations:
For brand-name enterprise, then extract trademark application people and mate with original business data, by the match is successful and enterprise is not the open for business data of enterprise adds in enterprise storehouse;
For the enterprise having patent, then extract patent applicant and mate with original business data, by the match is successful and enterprise is not the open for business data of enterprise adds in enterprise storehouse;
For having trade mark or having patent and do not working as the enterprise in the secondary business data provided, but the enterprise had in the former business data provided, the data of this enterprise are added in enterprise storehouse.
In step 206, more the unique identification of new spectra mainly writes unique identifier to the business data in NC manufacturing system.
In step 206, more new spectra modification information specifically comprises:
By incremental data and raw data comparison, for operation license or organization mechanism code identical, but the different enterprise of enterprise name carries out index, adds original undertaking's title in enterprise's history information table, and the enterprise name upgrading enterprise's master meter is title after changing;
By carrying out fuzzy matching to enterprise name, legal representative, address, then pairing approximation enterprise carries out artificial treatment, determines latest company's title, and title will write enterprise's history information table before changing.
Upgrade enterprise's area field in step 206 specifically to comprise:
By registration authority's code update city fields and district field;
Upgrade enterprise region by enterprise address, such as " Mianyang City XXX company ", area update becomes Mianyang City;
According to enterprise name, region is verified, such as " Chengdu XXX company ", if Region dividing is not or not Chengdu, then carry out manual verification.
In step 206, more the industry of new spectra specifically comprises:
Trade division is carried out by the industry code (98 class industry code) of enterprises registration;
Upgrade trade mark in step 206 to mate with trademark application people title mainly through enterprise name with associating of enterprise, set up associating of business data and branding data.
Step 207: the data after processing and sorting are verified.
Particularly, the verification in step 207 comprises data uniqueness verification, data layout verification or and data plausibility check.
Step 208: the data after verification are exported according to application demand.
Particularly, the data after verification are exported to the database of operation system by step 208 according to application demand.
The process flow diagram of the branding data job operation that Fig. 3 provides for the embodiment of the present invention two, as shown in Figure 2,301 to step that the method comprising the steps of 311, is described to step 311 step 301 below respectively.
Step 301: gather branding data and be saved to data collection data storehouse.
The present embodiment specifically gathers the branding data in Sichuan Province, particularly, by carrying out source code analysis to branding data website, obtain branding data structural code, set up data grabber rule according to branding data structure, capture branding data stored in data collection data storehouse.In order to avoid capturing unwanted data, before data grabber, need the rule first formulating image data, such as, only gather the branding data belonging to the address in Sichuan Province, specifically comprise: trade mark address comprises " Chengdu ", " Sichuan Province " etc.
Step 302: raw data is cleared up.
Step 302 specifically comprises:
The data that cleaning gathers, delete the data outside acquisition range, such as, only gather the branding data in Sichuan Province, contrast one by one according to regional address table to the data gathered, delete data outside region.
Step 303: formatted data.
Step 303 specifically comprises:
Correct special applicant's title.Such as, comprise " J in applicant; 1 ", should be updated in " river ";
Correct private right start-stop day.Such as the private right starting date of " 1900-01-01 " is set to null character (NUL).
Step 304: Data Comparison.
Particularly, the raw data of the branding data existed in NC manufacturing system and collection is contrasted, if there is same enterprise, upgrade the part field of the branding data in NC manufacturing system, as upgraded trade mark address, commodity and service project etc.
Step 305: raw data association process.
Raw data in step 305 pair data acquisition database carries out association process, specifically comprise the data field one_to_one corresponding of the trade mark table stored in the raw data trade mark literary name section of collection and NC manufacturing system, then by the original trade mark table data importing that gathers in trade mark table corresponding to NC manufacturing system.
Step 306: data secondary processing, comprises the district, renewal trade mark industry, renewal trade mark statutory status etc. that upgrade city fields and city.
The renewal city fields of step 306 and the district in city specifically comprise:
Collect the address in each city, Sichuan Province as rule, upgrade city fields; Collect each address, district in city, Sichuan Province as rule, the more district of new town.
Upgrade trade mark industry in step 306 specifically to comprise:
Set up trade mark and national economy to classify contrast relationship rule list, particularly, the commodity and service project described in the first commodity and service project of trade mark being classified with national economy is carried out corresponding, is then that branding data adds profession identity according to this rule.
Upgrade trade mark statutory status in step 306 specifically to comprise: arrange the trade mark statutory status of the raw data gathered and divide, set up statutory status rule list, the statutory status of trade mark is updated to effectively, in careful, invalid three kinds of states.Such as statutory status is " objection ruling % objection is false ", is updated to " invalid-objection ".
Step 307: the data after processing and sorting are verified.
Particularly, the verification in step 307 comprises data uniqueness verification, data layout School Affairs data plausibility check.
Step 308: the data after verification are exported according to application demand.
Particularly, the data after verification are exported to the database of operation system by step 308 according to application demand.
As can be seen from the above embodiments, the embodiment of the present invention solves Data Integration in data mart modeling process and utilizes the problem of difficulty, particularly, the business data job operation provided by embodiment, data are cleared up, specification data layout, improves the scope of application of data by secondary processing.
Claims (10)
1. a data processing method, is characterized in that comprising the steps:
The data transformations of collection is that unified form is stored in the first database by steps A: from data source image data;
Step B: processing and sorting is carried out to the data in the first database, described processing and sorting specifically comprises raw data cleaning, providing data formatting process, Data Comparison, data correlation process, data secondary processing;
Step C: the data after processing and sorting are verified;
Step D: export the data after verification to second database according to application demand, described second database is the database of operation system.
2. data processing method according to claim 1, is characterized in that described data source is Database Systems and/or internet.
3. data processing method according to claim 2, it is characterized in that from the method for Database Systems image data be: utilize data transformations instrument the batch data Database Systems to be imported in the first database.
4. the method for the data mart modeling according to Claims 2 or 3, is characterized in that comprising from the method for internet image data: targeted website is located; Webpage source code is analyzed; Website data modeling; Data grabber.
5. data processing method according to claim 1, it is characterized in that in described step C be verified as data uniqueness verification, data layout verification or data plausibility check.
6. data processing method according to claim 5, is characterized in that:
The verification of described data uniqueness specifically comprises to be carried out unicity retrieval to tables of data field or carries out unicity retrieval to the combination of multiple field;
Described data layout verification comprises to be retrieved the type of data;
The plausibility check of described data comprises and judging date, character length, type.
7. data processing method according to claim 1, is characterized in that:
Described raw data cleaning specifically comprises apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting process comprises deletion and the replacement of special character;
Described Data Comparison comprises and the data in different pieces of information source being contrasted according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation process comprises carries out index by the data be associated in different pieces of information table, and sets up index relative;
Data directory is set up in the data mining that described data secondary processing comprises for raw data.
8. a NC manufacturing system, is characterized in that comprising data acquisition module, data mart modeling module, data check module and statistical conversion module,
Described data acquisition module is used for from data source image data, and is that unified form is stored in the first database by the data transformations of collection;
Described data mart modeling module is used for carrying out processing and sorting to the data in the first database, and described processing module specifically comprises raw data cleaning module, providing data formatting processing module, Data Comparison module, data correlation processing module, data secondary processing module;
Described data check module is used for verifying the data after processing and sorting;
Described statistical conversion module is used for the data after by verification and exports the second database to according to application demand, and described second database is the database of operation system.
9. NC manufacturing system according to claim 8, is characterized in that described data check module specifically comprises data uniqueness correction verification module, data layout correction verification module or data plausibility check module,
Described data uniqueness correction verification module is used for carrying out unicity retrieval to tables of data field or carrying out unicity retrieval to the combination of multiple field;
Described data layout correction verification module is used for retrieving the type of data;
Described data plausibility check module is used for judging date, character length, type.
10. NC manufacturing system according to claim 8, is characterized in that:
Described raw data cleaning module is used for apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting processing module is used for deletion and the replacement of special character;
Described Data Comparison module is used for the data in different pieces of information source to contrast according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation processing module is used for the data be associated in different pieces of information table to carry out index, and sets up index relative;
Described data secondary processing module is used for setting up data directory for the data mining of raw data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410855040.4A CN104462604B (en) | 2014-12-31 | 2014-12-31 | Data processing method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410855040.4A CN104462604B (en) | 2014-12-31 | 2014-12-31 | Data processing method and system |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104462604A true CN104462604A (en) | 2015-03-25 |
CN104462604B CN104462604B (en) | 2017-10-31 |
Family
ID=52908639
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410855040.4A Active CN104462604B (en) | 2014-12-31 | 2014-12-31 | Data processing method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104462604B (en) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106095837A (en) * | 2016-06-01 | 2016-11-09 | 杭州中奥科技有限公司 | It is applicable to data system and the data processing method of industrial and commercial all kinds of business |
WO2016184192A1 (en) * | 2015-05-21 | 2016-11-24 | 中兴通讯股份有限公司 | Data processing method and device |
CN106294492A (en) * | 2015-06-08 | 2017-01-04 | 深圳中兴网信科技有限公司 | Data cleaning method and cleaning engine |
CN106293977A (en) * | 2015-05-15 | 2017-01-04 | 阿里巴巴集团控股有限公司 | A kind of data verification method and equipment |
CN106289794A (en) * | 2015-05-25 | 2017-01-04 | 上海汽车集团股份有限公司 | The data processing method of car load test and device |
CN106326321A (en) * | 2015-07-10 | 2017-01-11 | 中兴通讯股份有限公司 | Big data exchange method and device |
CN106599193A (en) * | 2016-12-14 | 2017-04-26 | 云南电网有限责任公司电力科学研究院 | Data cleaning method and system |
CN107066531A (en) * | 2017-03-01 | 2017-08-18 | 苏州朗动网络科技有限公司 | A kind of business data radar monitoring method and system based on enterprise's big data platform |
CN107229662A (en) * | 2016-03-25 | 2017-10-03 | 阿里巴巴集团控股有限公司 | Data cleaning method and device |
CN108073694A (en) * | 2017-12-08 | 2018-05-25 | 国云科技股份有限公司 | A kind of enterprise attributes standardized system and its implementation based on biradical standard |
CN109446157A (en) * | 2018-10-18 | 2019-03-08 | 武汉虹旭信息技术有限责任公司 | System and method are looked into a kind of data format school based on format data |
CN109558466A (en) * | 2018-11-29 | 2019-04-02 | 成都天衡智造科技有限公司 | A kind of source data configuring management method in manufacturing industry data mining |
CN111258998A (en) * | 2020-01-16 | 2020-06-09 | 北京字节跳动网络技术有限公司 | Data verification method, device, medium and electronic equipment |
CN111914015A (en) * | 2020-08-25 | 2020-11-10 | 河北时代电子有限公司 | Multisource data gateway data analysis early warning system based on industrial protocol |
CN112506580A (en) * | 2020-12-16 | 2021-03-16 | 青岛海尔科技有限公司 | Data fusion method and device, storage medium and electronic device |
CN113297177A (en) * | 2021-05-31 | 2021-08-24 | 深圳市资道智能科技有限公司 | Non-marketing enterprise standardized database construction and application method |
WO2021169574A1 (en) * | 2020-02-24 | 2021-09-02 | 京东方科技集团股份有限公司 | Data exchange method and apparatus, and readable storage medium, and data exchange system |
CN113792039A (en) * | 2021-03-15 | 2021-12-14 | 北京京东振世信息技术有限公司 | Data processing method and device, electronic equipment and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103719460A (en) * | 2012-10-16 | 2014-04-16 | 钱国祥 | Radix ophiopogonis lung atrophy health-care tea |
CN103761217A (en) * | 2014-01-07 | 2014-04-30 | 成都市卓睿科技有限公司 | Automatic multifunctional-statement generation method |
US20140181413A1 (en) * | 2012-12-20 | 2014-06-26 | Advanced Micro Devices, Inc. | Method and system for shutting down active core based caches |
-
2014
- 2014-12-31 CN CN201410855040.4A patent/CN104462604B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103719460A (en) * | 2012-10-16 | 2014-04-16 | 钱国祥 | Radix ophiopogonis lung atrophy health-care tea |
US20140181413A1 (en) * | 2012-12-20 | 2014-06-26 | Advanced Micro Devices, Inc. | Method and system for shutting down active core based caches |
CN103761217A (en) * | 2014-01-07 | 2014-04-30 | 成都市卓睿科技有限公司 | Automatic multifunctional-statement generation method |
Non-Patent Citations (1)
Title |
---|
刘喜文: "构建数据仓库过程中的数据清洗研究", 《图书与情报》 * |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106293977B (en) * | 2015-05-15 | 2019-04-05 | 阿里巴巴集团控股有限公司 | A kind of data verification method and equipment |
CN106293977A (en) * | 2015-05-15 | 2017-01-04 | 阿里巴巴集团控股有限公司 | A kind of data verification method and equipment |
WO2016184192A1 (en) * | 2015-05-21 | 2016-11-24 | 中兴通讯股份有限公司 | Data processing method and device |
CN106296498A (en) * | 2015-05-21 | 2017-01-04 | 中兴通讯股份有限公司 | Data processing method and device |
CN106289794A (en) * | 2015-05-25 | 2017-01-04 | 上海汽车集团股份有限公司 | The data processing method of car load test and device |
CN106289794B (en) * | 2015-05-25 | 2019-08-13 | 上海汽车集团股份有限公司 | The data processing method and device of vehicle test |
CN106294492A (en) * | 2015-06-08 | 2017-01-04 | 深圳中兴网信科技有限公司 | Data cleaning method and cleaning engine |
CN106326321A (en) * | 2015-07-10 | 2017-01-11 | 中兴通讯股份有限公司 | Big data exchange method and device |
CN106326321B (en) * | 2015-07-10 | 2022-01-28 | 中兴通讯股份有限公司 | Big data exchange method and device |
CN107229662B (en) * | 2016-03-25 | 2022-02-25 | 阿里巴巴集团控股有限公司 | Data cleaning method and device |
CN107229662A (en) * | 2016-03-25 | 2017-10-03 | 阿里巴巴集团控股有限公司 | Data cleaning method and device |
CN106095837A (en) * | 2016-06-01 | 2016-11-09 | 杭州中奥科技有限公司 | It is applicable to data system and the data processing method of industrial and commercial all kinds of business |
CN106599193A (en) * | 2016-12-14 | 2017-04-26 | 云南电网有限责任公司电力科学研究院 | Data cleaning method and system |
CN107066531A (en) * | 2017-03-01 | 2017-08-18 | 苏州朗动网络科技有限公司 | A kind of business data radar monitoring method and system based on enterprise's big data platform |
CN108073694A (en) * | 2017-12-08 | 2018-05-25 | 国云科技股份有限公司 | A kind of enterprise attributes standardized system and its implementation based on biradical standard |
CN109446157B (en) * | 2018-10-18 | 2021-10-29 | 武汉虹旭信息技术有限责任公司 | Data format checking system and method based on formatted data |
CN109446157A (en) * | 2018-10-18 | 2019-03-08 | 武汉虹旭信息技术有限责任公司 | System and method are looked into a kind of data format school based on format data |
CN109558466A (en) * | 2018-11-29 | 2019-04-02 | 成都天衡智造科技有限公司 | A kind of source data configuring management method in manufacturing industry data mining |
CN111258998A (en) * | 2020-01-16 | 2020-06-09 | 北京字节跳动网络技术有限公司 | Data verification method, device, medium and electronic equipment |
WO2021169574A1 (en) * | 2020-02-24 | 2021-09-02 | 京东方科技集团股份有限公司 | Data exchange method and apparatus, and readable storage medium, and data exchange system |
CN111914015A (en) * | 2020-08-25 | 2020-11-10 | 河北时代电子有限公司 | Multisource data gateway data analysis early warning system based on industrial protocol |
CN112506580A (en) * | 2020-12-16 | 2021-03-16 | 青岛海尔科技有限公司 | Data fusion method and device, storage medium and electronic device |
CN113792039A (en) * | 2021-03-15 | 2021-12-14 | 北京京东振世信息技术有限公司 | Data processing method and device, electronic equipment and storage medium |
CN113792039B (en) * | 2021-03-15 | 2024-03-01 | 北京京东振世信息技术有限公司 | Data processing method and device, electronic equipment and storage medium |
CN113297177A (en) * | 2021-05-31 | 2021-08-24 | 深圳市资道智能科技有限公司 | Non-marketing enterprise standardized database construction and application method |
Also Published As
Publication number | Publication date |
---|---|
CN104462604B (en) | 2017-10-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104462604A (en) | Data processing method and system | |
He et al. | A database linking Chinese patents to China’s census firms | |
CN110383319B (en) | Large scale heterogeneous data ingestion and user resolution | |
CN106022902A (en) | Accounting method and device | |
CN103177068A (en) | Systems and methods for merging source records in accordance with survivorship rules | |
CN105677625B (en) | Processing method is made a report in the collaboration of multi-layer collect statistics report | |
CN105099729A (en) | User ID (Identification) recognition method and device | |
CN102346901A (en) | Internet medicine trading subject credit assessment system and method | |
CN101071445A (en) | Classified sample set optimizing method and content-related advertising server | |
CN102521713B (en) | Data processing equipment and data processing method | |
CN107168937A (en) | Financial cloud accounting element particle and assemble method based on XBRL | |
CN102208061A (en) | Data cancel after verification processing device and method | |
CN114187082A (en) | Financial accounting and reimbursement method and system | |
CN106485579A (en) | A kind of tax declaration method | |
WO2008127443A1 (en) | Image data extraction automation process | |
CN103914540A (en) | Processing method for flexibly configuring ammeter specifications for ammeter reading | |
CN101320394A (en) | Data acquisition method supporting multiple file types | |
CN112463985A (en) | Government affair map model construction method, device, equipment and computer readable medium | |
Eberle et al. | Record linkage of the linked employer-employee survey of the socio-economic panel study (SOEP-LEE) and the establishment history panel (BHP) | |
Bruntink | An initial quality analysis of the ohloh software evolution data | |
CN112115271B (en) | Knowledge graph construction method and device | |
CN109324963A (en) | The method and terminal device of automatic test profitable result | |
CN1588405A (en) | Data processing device and method for risk control system | |
CN112613853A (en) | Data aggregation method and device, computer equipment and readable storage medium | |
CN113011831B (en) | System for data auditing and data auditing method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |