CN104462604A - Data processing method and system - Google Patents

Data processing method and system Download PDF

Info

Publication number
CN104462604A
CN104462604A CN201410855040.4A CN201410855040A CN104462604A CN 104462604 A CN104462604 A CN 104462604A CN 201410855040 A CN201410855040 A CN 201410855040A CN 104462604 A CN104462604 A CN 104462604A
Authority
CN
China
Prior art keywords
data
module
processing
database
verification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410855040.4A
Other languages
Chinese (zh)
Other versions
CN104462604B (en
Inventor
吴锦德
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Zhuorui Technology Co Ltd
Original Assignee
Chengdu Zhuorui Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chengdu Zhuorui Technology Co Ltd filed Critical Chengdu Zhuorui Technology Co Ltd
Priority to CN201410855040.4A priority Critical patent/CN104462604B/en
Publication of CN104462604A publication Critical patent/CN104462604A/en
Application granted granted Critical
Publication of CN104462604B publication Critical patent/CN104462604B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of computer information processing and discloses a data processing method and system. The data processing method and system aim to solve the problem that data are difficult to integrate and utilize in the data processing process. The data processing method mainly comprises the steps of collecting data from a data source, processing and disposing the data, verifying the processed and disposed data and outputting the verified data according to application demands. According to the technical scheme, data are cleaned, data formats are regulated, and the application range of the data is improved through secondary processing.

Description

Data processing method and system
Technical field
The present invention relates to technical field of computer information processing, particularly a kind of data processing method and system.
Background technology
Along with large data association area is fast-developing, the data of different system, disparate databases are integrated, and be applied in new market demand software or the degree of depth of carrying out data excavates the developing direction having become large data fields.At present, difficult point maximum in data mart modeling is that timing node that is different due to Data Source, data application demand that is different, data is different, causes the unitarity of data, integrality incomplete, affects integration and the utilization of data.
Summary of the invention
[technical matters that will solve]
The object of this invention is to provide a kind of data processing method and system, utilize the problem of difficulty to solve Data Integration in data mart modeling process.
[technical scheme]
The present invention is achieved by the following technical solutions.
The present invention relates to a kind of data processing method, the method comprises the following steps:
The data transformations of collection is that unified form is stored in the first database by steps A: from data source image data;
Step B: processing and sorting is carried out to the data in the first database, described processing and sorting specifically comprises raw data cleaning, providing data formatting process, Data Comparison, data correlation process, data secondary processing;
Step C: the data after processing and sorting are verified;
Step D: export the data after verification to second database according to application demand, described second database is the database of operation system.
As one preferred embodiment, described data source is Database Systems and/or internet.
As another preferred embodiment, from the method for Database Systems image data be: utilize data transformations instrument the batch data Database Systems to be imported in the first database.
As another preferred embodiment, comprise from the method for internet image data: targeted website is located; Webpage source code is analyzed; Website data modeling; Data grabber.
As another preferred embodiment, in described step C be verified as data uniqueness verification, data layout verification or data plausibility check.
As another preferred embodiment, the verification of described data uniqueness specifically comprises to be carried out unicity retrieval to tables of data field or carries out unicity retrieval to the combination of multiple field;
Described data layout verification comprises to be retrieved the type of data;
The plausibility check of described data comprises and judging date, character length, type.
As another preferred embodiment, described raw data cleaning specifically comprises apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting process comprises deletion and the replacement of special character;
Described Data Comparison comprises and the data in different pieces of information source being contrasted according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation process comprises carries out index by the data be associated in different pieces of information table, and sets up index relative;
Data directory is set up in the data mining that described data secondary processing comprises for raw data.
The invention still further relates to a kind of NC manufacturing system, this system comprises data acquisition module, data mart modeling module, data check module and statistical conversion module,
Described data acquisition module is used for from data source image data, and is that unified form is stored in the first database by the data transformations of collection;
Described data mart modeling module is used for carrying out processing and sorting to the data in the first database, and described processing module specifically comprises raw data cleaning module, providing data formatting processing module, Data Comparison module, data correlation processing module, data secondary processing module;
Described data check module is used for verifying the data after processing and sorting;
Described statistical conversion module is used for the data after by verification and exports the second database to according to application demand, and described second database is the database of operation system.
As one preferred embodiment, described data check module specifically comprises data uniqueness correction verification module, data layout correction verification module or data plausibility check module,
Described data uniqueness correction verification module is used for carrying out unicity retrieval to tables of data field or carrying out unicity retrieval to the combination of multiple field;
Described data layout correction verification module is used for retrieving the type of data;
Described data plausibility check module is used for judging date, character length, type.
As another preferred embodiment, described raw data cleaning module is used for apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting processing module is used for deletion and the replacement of special character;
Described Data Comparison module is used for the data in different pieces of information source to contrast according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation processing module is used for the data be associated in different pieces of information table to carry out index, and sets up index relative;
Described data secondary processing module is used for setting up data directory for the data mining of raw data.
[beneficial effect]
The needs that unified for the data in different pieces of information source processing process can utilize to meet Data Integration by the technical scheme that the present invention proposes.
Accompanying drawing explanation
The structured flowchart of the NC manufacturing system that Fig. 1 provides for embodiments of the invention one;
The process flow diagram of the data processing method that Fig. 2 provides for embodiments of the invention two;
The process flow diagram of the data processing method that Fig. 3 provides for embodiments of the invention three.
Embodiment
For making the object, technical solutions and advantages of the present invention clearly, below in conjunction with accompanying drawing, clear, complete description is carried out to the specific embodiment of the present invention, obviously, described embodiment is a part of embodiment of the present invention, instead of whole embodiment, neither limitation of the present invention.Based on embodiments of the invention, those of ordinary skill in the art, not paying the every other embodiment obtained under creative work prerequisite, belong to protection scope of the present invention.
The structural drawing of the NC manufacturing system that Fig. 1 provides for the embodiment of the present invention one, as shown in Figure 1, this system comprises data acquisition module 101, data mart modeling module 102, data check module 103 and statistical conversion module 104,
The data transformations of collection for from data source image data, and is that unified form is stored in the first database by data acquisition module 101.
Data mart modeling module 102 is for carrying out processing and sorting to the data in the first database, and described processing module specifically comprises raw data cleaning module, providing data formatting processing module, Data Comparison module, data correlation processing module, data secondary processing module.Wherein raw data cleaning module is used for apparent error data processing, repeating data process and data merging treatment; Providing data formatting processing module is used for deletion and the replacement of special character; Data Comparison module is used for the data in different pieces of information source to contrast according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node; Data correlation processing module is used for the data be associated in different pieces of information table to carry out index, and sets up index relative; Data secondary processing module is used for setting up data directory for the data mining of raw data.
Data check module 103 is for verifying the data after processing and sorting.In the present embodiment, data check module specifically comprises data uniqueness correction verification module, data layout correction verification module or data plausibility check module, and data uniqueness correction verification module is used for carrying out unicity retrieval to tables of data field or carrying out unicity retrieval to the combination of multiple field; Data layout verification is used for retrieving the type of data; The plausibility check of data is used for judging date, character length, type.
Statistical conversion module 104 is for exporting the data after verification to second database according to application demand, and wherein the second database is the database of operation system.
Adopting the NC manufacturing system that provides of embodiment one to carry out the method for data acquisition can with reference to following concrete grammar embodiment.
The process flow diagram of the business data job operation that Fig. 2 provides for the embodiment of the present invention two.As shown in Figure 2,201 to step that the method comprising the steps of 211, is described to step 211 step 201 below respectively.
Step 201: be saved to data collection data storehouse from enterprise database system acquisition raw data.
Particularly, by utilizing the statistical conversion instrument of SQL Server, the business data be stored in SQL Server database being derived, then raw data is imported in the data collection data storehouse for data mart modeling.
Step 202: raw data is cleared up.
Data in step 202 pair data acquisition database are cleared up, and specifically comprise:
Repeating data process: combing is carried out to identical business data such as enterprise name, address or operation licenses number, carries out after comparison one by one merging or deleting;
Delete special data: mainly for the misdata in raw data, repeating data carry out deleting, union operation.As deleted enterprise name for " XXX test data ".
Process for some ragged array: such as NULL and empty data are unified into NULL; Correct for the data of apparent error in date field, such as, be that the date of " 2031-5-8 " is processed into " 2013-5-8 " by the Date of Incorporation, be less than the data scrubbing that 1949 are greater than 2014 years the Date of Incorporation.
Step 203: formatted data.
Step 203, for the deletion of special character and replacement, specifically comprises:
Special character in format legal person, enterprise name: such as the character of band * is replaced with sky;
Format registered enterprise fund: calculate registered capital (Currency Type of such as some registered enterprise funds is dollar, yen etc.) according to Currency Type and the current exchange rate, and be unified into Renminbi.
Step 204: Data Comparison.
Particularly, the raw data of the business data existed in NC manufacturing system and collection is contrasted, there is the part field that same enterprise then upgrades the business data in NC manufacturing system, as upgraded enterprise address, website field etc.
Step 205: raw data association process.
Raw data in step 205 pair data acquisition database carries out association process, specifically comprise the data field one_to_one corresponding enterprise stored in raw data enterprise literary name section and NC manufacturing system shown, then during the table data importing of original enterprise is shown to the enterprise that NC manufacturing system is corresponding.
Step 206: data secondary processing.
Step 206 mainly comprises: the industry etc. of the unique identification of the data processing of the enterprise that stops doing business, more new spectra, more new spectra modification information, renewal enterprise area field, more industry, the more new spectra of new spectra.
The data processing of enterprise of stopping doing business in step 206 specifically divides three kinds of situations:
For brand-name enterprise, then extract trademark application people and mate with original business data, by the match is successful and enterprise is not the open for business data of enterprise adds in enterprise storehouse;
For the enterprise having patent, then extract patent applicant and mate with original business data, by the match is successful and enterprise is not the open for business data of enterprise adds in enterprise storehouse;
For having trade mark or having patent and do not working as the enterprise in the secondary business data provided, but the enterprise had in the former business data provided, the data of this enterprise are added in enterprise storehouse.
In step 206, more the unique identification of new spectra mainly writes unique identifier to the business data in NC manufacturing system.
In step 206, more new spectra modification information specifically comprises:
By incremental data and raw data comparison, for operation license or organization mechanism code identical, but the different enterprise of enterprise name carries out index, adds original undertaking's title in enterprise's history information table, and the enterprise name upgrading enterprise's master meter is title after changing;
By carrying out fuzzy matching to enterprise name, legal representative, address, then pairing approximation enterprise carries out artificial treatment, determines latest company's title, and title will write enterprise's history information table before changing.
Upgrade enterprise's area field in step 206 specifically to comprise:
By registration authority's code update city fields and district field;
Upgrade enterprise region by enterprise address, such as " Mianyang City XXX company ", area update becomes Mianyang City;
According to enterprise name, region is verified, such as " Chengdu XXX company ", if Region dividing is not or not Chengdu, then carry out manual verification.
In step 206, more the industry of new spectra specifically comprises:
Trade division is carried out by the industry code (98 class industry code) of enterprises registration;
Upgrade trade mark in step 206 to mate with trademark application people title mainly through enterprise name with associating of enterprise, set up associating of business data and branding data.
Step 207: the data after processing and sorting are verified.
Particularly, the verification in step 207 comprises data uniqueness verification, data layout verification or and data plausibility check.
Step 208: the data after verification are exported according to application demand.
Particularly, the data after verification are exported to the database of operation system by step 208 according to application demand.
The process flow diagram of the branding data job operation that Fig. 3 provides for the embodiment of the present invention two, as shown in Figure 2,301 to step that the method comprising the steps of 311, is described to step 311 step 301 below respectively.
Step 301: gather branding data and be saved to data collection data storehouse.
The present embodiment specifically gathers the branding data in Sichuan Province, particularly, by carrying out source code analysis to branding data website, obtain branding data structural code, set up data grabber rule according to branding data structure, capture branding data stored in data collection data storehouse.In order to avoid capturing unwanted data, before data grabber, need the rule first formulating image data, such as, only gather the branding data belonging to the address in Sichuan Province, specifically comprise: trade mark address comprises " Chengdu ", " Sichuan Province " etc.
Step 302: raw data is cleared up.
Step 302 specifically comprises:
The data that cleaning gathers, delete the data outside acquisition range, such as, only gather the branding data in Sichuan Province, contrast one by one according to regional address table to the data gathered, delete data outside region.
Step 303: formatted data.
Step 303 specifically comprises:
Correct special applicant's title.Such as, comprise " J in applicant; 1 ", should be updated in " river ";
Correct private right start-stop day.Such as the private right starting date of " 1900-01-01 " is set to null character (NUL).
Step 304: Data Comparison.
Particularly, the raw data of the branding data existed in NC manufacturing system and collection is contrasted, if there is same enterprise, upgrade the part field of the branding data in NC manufacturing system, as upgraded trade mark address, commodity and service project etc.
Step 305: raw data association process.
Raw data in step 305 pair data acquisition database carries out association process, specifically comprise the data field one_to_one corresponding of the trade mark table stored in the raw data trade mark literary name section of collection and NC manufacturing system, then by the original trade mark table data importing that gathers in trade mark table corresponding to NC manufacturing system.
Step 306: data secondary processing, comprises the district, renewal trade mark industry, renewal trade mark statutory status etc. that upgrade city fields and city.
The renewal city fields of step 306 and the district in city specifically comprise:
Collect the address in each city, Sichuan Province as rule, upgrade city fields; Collect each address, district in city, Sichuan Province as rule, the more district of new town.
Upgrade trade mark industry in step 306 specifically to comprise:
Set up trade mark and national economy to classify contrast relationship rule list, particularly, the commodity and service project described in the first commodity and service project of trade mark being classified with national economy is carried out corresponding, is then that branding data adds profession identity according to this rule.
Upgrade trade mark statutory status in step 306 specifically to comprise: arrange the trade mark statutory status of the raw data gathered and divide, set up statutory status rule list, the statutory status of trade mark is updated to effectively, in careful, invalid three kinds of states.Such as statutory status is " objection ruling % objection is false ", is updated to " invalid-objection ".
Step 307: the data after processing and sorting are verified.
Particularly, the verification in step 307 comprises data uniqueness verification, data layout School Affairs data plausibility check.
Step 308: the data after verification are exported according to application demand.
Particularly, the data after verification are exported to the database of operation system by step 308 according to application demand.
As can be seen from the above embodiments, the embodiment of the present invention solves Data Integration in data mart modeling process and utilizes the problem of difficulty, particularly, the business data job operation provided by embodiment, data are cleared up, specification data layout, improves the scope of application of data by secondary processing.

Claims (10)

1. a data processing method, is characterized in that comprising the steps:
The data transformations of collection is that unified form is stored in the first database by steps A: from data source image data;
Step B: processing and sorting is carried out to the data in the first database, described processing and sorting specifically comprises raw data cleaning, providing data formatting process, Data Comparison, data correlation process, data secondary processing;
Step C: the data after processing and sorting are verified;
Step D: export the data after verification to second database according to application demand, described second database is the database of operation system.
2. data processing method according to claim 1, is characterized in that described data source is Database Systems and/or internet.
3. data processing method according to claim 2, it is characterized in that from the method for Database Systems image data be: utilize data transformations instrument the batch data Database Systems to be imported in the first database.
4. the method for the data mart modeling according to Claims 2 or 3, is characterized in that comprising from the method for internet image data: targeted website is located; Webpage source code is analyzed; Website data modeling; Data grabber.
5. data processing method according to claim 1, it is characterized in that in described step C be verified as data uniqueness verification, data layout verification or data plausibility check.
6. data processing method according to claim 5, is characterized in that:
The verification of described data uniqueness specifically comprises to be carried out unicity retrieval to tables of data field or carries out unicity retrieval to the combination of multiple field;
Described data layout verification comprises to be retrieved the type of data;
The plausibility check of described data comprises and judging date, character length, type.
7. data processing method according to claim 1, is characterized in that:
Described raw data cleaning specifically comprises apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting process comprises deletion and the replacement of special character;
Described Data Comparison comprises and the data in different pieces of information source being contrasted according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation process comprises carries out index by the data be associated in different pieces of information table, and sets up index relative;
Data directory is set up in the data mining that described data secondary processing comprises for raw data.
8. a NC manufacturing system, is characterized in that comprising data acquisition module, data mart modeling module, data check module and statistical conversion module,
Described data acquisition module is used for from data source image data, and is that unified form is stored in the first database by the data transformations of collection;
Described data mart modeling module is used for carrying out processing and sorting to the data in the first database, and described processing module specifically comprises raw data cleaning module, providing data formatting processing module, Data Comparison module, data correlation processing module, data secondary processing module;
Described data check module is used for verifying the data after processing and sorting;
Described statistical conversion module is used for the data after by verification and exports the second database to according to application demand, and described second database is the database of operation system.
9. NC manufacturing system according to claim 8, is characterized in that described data check module specifically comprises data uniqueness correction verification module, data layout correction verification module or data plausibility check module,
Described data uniqueness correction verification module is used for carrying out unicity retrieval to tables of data field or carrying out unicity retrieval to the combination of multiple field;
Described data layout correction verification module is used for retrieving the type of data;
Described data plausibility check module is used for judging date, character length, type.
10. NC manufacturing system according to claim 8, is characterized in that:
Described raw data cleaning module is used for apparent error data processing, repeating data process and data merging treatment;
Described providing data formatting processing module is used for deletion and the replacement of special character;
Described Data Comparison module is used for the data in different pieces of information source to contrast according to data field, to be then integrated into by homogeneous data in tables of data and to form data history table according to timing node;
Described data correlation processing module is used for the data be associated in different pieces of information table to carry out index, and sets up index relative;
Described data secondary processing module is used for setting up data directory for the data mining of raw data.
CN201410855040.4A 2014-12-31 2014-12-31 Data processing method and system Active CN104462604B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410855040.4A CN104462604B (en) 2014-12-31 2014-12-31 Data processing method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410855040.4A CN104462604B (en) 2014-12-31 2014-12-31 Data processing method and system

Publications (2)

Publication Number Publication Date
CN104462604A true CN104462604A (en) 2015-03-25
CN104462604B CN104462604B (en) 2017-10-31

Family

ID=52908639

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410855040.4A Active CN104462604B (en) 2014-12-31 2014-12-31 Data processing method and system

Country Status (1)

Country Link
CN (1) CN104462604B (en)

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106095837A (en) * 2016-06-01 2016-11-09 杭州中奥科技有限公司 It is applicable to data system and the data processing method of industrial and commercial all kinds of business
WO2016184192A1 (en) * 2015-05-21 2016-11-24 中兴通讯股份有限公司 Data processing method and device
CN106294492A (en) * 2015-06-08 2017-01-04 深圳中兴网信科技有限公司 Data cleaning method and cleaning engine
CN106293977A (en) * 2015-05-15 2017-01-04 阿里巴巴集团控股有限公司 A kind of data verification method and equipment
CN106289794A (en) * 2015-05-25 2017-01-04 上海汽车集团股份有限公司 The data processing method of car load test and device
CN106326321A (en) * 2015-07-10 2017-01-11 中兴通讯股份有限公司 Big data exchange method and device
CN106599193A (en) * 2016-12-14 2017-04-26 云南电网有限责任公司电力科学研究院 Data cleaning method and system
CN107066531A (en) * 2017-03-01 2017-08-18 苏州朗动网络科技有限公司 A kind of business data radar monitoring method and system based on enterprise's big data platform
CN107229662A (en) * 2016-03-25 2017-10-03 阿里巴巴集团控股有限公司 Data cleaning method and device
CN108073694A (en) * 2017-12-08 2018-05-25 国云科技股份有限公司 A kind of enterprise attributes standardized system and its implementation based on biradical standard
CN109446157A (en) * 2018-10-18 2019-03-08 武汉虹旭信息技术有限责任公司 System and method are looked into a kind of data format school based on format data
CN109558466A (en) * 2018-11-29 2019-04-02 成都天衡智造科技有限公司 A kind of source data configuring management method in manufacturing industry data mining
CN111258998A (en) * 2020-01-16 2020-06-09 北京字节跳动网络技术有限公司 Data verification method, device, medium and electronic equipment
CN111914015A (en) * 2020-08-25 2020-11-10 河北时代电子有限公司 Multisource data gateway data analysis early warning system based on industrial protocol
CN112506580A (en) * 2020-12-16 2021-03-16 青岛海尔科技有限公司 Data fusion method and device, storage medium and electronic device
CN113297177A (en) * 2021-05-31 2021-08-24 深圳市资道智能科技有限公司 Non-marketing enterprise standardized database construction and application method
WO2021169574A1 (en) * 2020-02-24 2021-09-02 京东方科技集团股份有限公司 Data exchange method and apparatus, and readable storage medium, and data exchange system
CN113792039A (en) * 2021-03-15 2021-12-14 北京京东振世信息技术有限公司 Data processing method and device, electronic equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103719460A (en) * 2012-10-16 2014-04-16 钱国祥 Radix ophiopogonis lung atrophy health-care tea
CN103761217A (en) * 2014-01-07 2014-04-30 成都市卓睿科技有限公司 Automatic multifunctional-statement generation method
US20140181413A1 (en) * 2012-12-20 2014-06-26 Advanced Micro Devices, Inc. Method and system for shutting down active core based caches

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103719460A (en) * 2012-10-16 2014-04-16 钱国祥 Radix ophiopogonis lung atrophy health-care tea
US20140181413A1 (en) * 2012-12-20 2014-06-26 Advanced Micro Devices, Inc. Method and system for shutting down active core based caches
CN103761217A (en) * 2014-01-07 2014-04-30 成都市卓睿科技有限公司 Automatic multifunctional-statement generation method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘喜文: "构建数据仓库过程中的数据清洗研究", 《图书与情报》 *

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106293977B (en) * 2015-05-15 2019-04-05 阿里巴巴集团控股有限公司 A kind of data verification method and equipment
CN106293977A (en) * 2015-05-15 2017-01-04 阿里巴巴集团控股有限公司 A kind of data verification method and equipment
WO2016184192A1 (en) * 2015-05-21 2016-11-24 中兴通讯股份有限公司 Data processing method and device
CN106296498A (en) * 2015-05-21 2017-01-04 中兴通讯股份有限公司 Data processing method and device
CN106289794A (en) * 2015-05-25 2017-01-04 上海汽车集团股份有限公司 The data processing method of car load test and device
CN106289794B (en) * 2015-05-25 2019-08-13 上海汽车集团股份有限公司 The data processing method and device of vehicle test
CN106294492A (en) * 2015-06-08 2017-01-04 深圳中兴网信科技有限公司 Data cleaning method and cleaning engine
CN106326321A (en) * 2015-07-10 2017-01-11 中兴通讯股份有限公司 Big data exchange method and device
CN106326321B (en) * 2015-07-10 2022-01-28 中兴通讯股份有限公司 Big data exchange method and device
CN107229662B (en) * 2016-03-25 2022-02-25 阿里巴巴集团控股有限公司 Data cleaning method and device
CN107229662A (en) * 2016-03-25 2017-10-03 阿里巴巴集团控股有限公司 Data cleaning method and device
CN106095837A (en) * 2016-06-01 2016-11-09 杭州中奥科技有限公司 It is applicable to data system and the data processing method of industrial and commercial all kinds of business
CN106599193A (en) * 2016-12-14 2017-04-26 云南电网有限责任公司电力科学研究院 Data cleaning method and system
CN107066531A (en) * 2017-03-01 2017-08-18 苏州朗动网络科技有限公司 A kind of business data radar monitoring method and system based on enterprise's big data platform
CN108073694A (en) * 2017-12-08 2018-05-25 国云科技股份有限公司 A kind of enterprise attributes standardized system and its implementation based on biradical standard
CN109446157B (en) * 2018-10-18 2021-10-29 武汉虹旭信息技术有限责任公司 Data format checking system and method based on formatted data
CN109446157A (en) * 2018-10-18 2019-03-08 武汉虹旭信息技术有限责任公司 System and method are looked into a kind of data format school based on format data
CN109558466A (en) * 2018-11-29 2019-04-02 成都天衡智造科技有限公司 A kind of source data configuring management method in manufacturing industry data mining
CN111258998A (en) * 2020-01-16 2020-06-09 北京字节跳动网络技术有限公司 Data verification method, device, medium and electronic equipment
WO2021169574A1 (en) * 2020-02-24 2021-09-02 京东方科技集团股份有限公司 Data exchange method and apparatus, and readable storage medium, and data exchange system
CN111914015A (en) * 2020-08-25 2020-11-10 河北时代电子有限公司 Multisource data gateway data analysis early warning system based on industrial protocol
CN112506580A (en) * 2020-12-16 2021-03-16 青岛海尔科技有限公司 Data fusion method and device, storage medium and electronic device
CN113792039A (en) * 2021-03-15 2021-12-14 北京京东振世信息技术有限公司 Data processing method and device, electronic equipment and storage medium
CN113792039B (en) * 2021-03-15 2024-03-01 北京京东振世信息技术有限公司 Data processing method and device, electronic equipment and storage medium
CN113297177A (en) * 2021-05-31 2021-08-24 深圳市资道智能科技有限公司 Non-marketing enterprise standardized database construction and application method

Also Published As

Publication number Publication date
CN104462604B (en) 2017-10-31

Similar Documents

Publication Publication Date Title
CN104462604A (en) Data processing method and system
He et al. A database linking Chinese patents to China’s census firms
CN110383319B (en) Large scale heterogeneous data ingestion and user resolution
CN106022902A (en) Accounting method and device
CN103177068A (en) Systems and methods for merging source records in accordance with survivorship rules
CN105677625B (en) Processing method is made a report in the collaboration of multi-layer collect statistics report
CN105099729A (en) User ID (Identification) recognition method and device
CN102346901A (en) Internet medicine trading subject credit assessment system and method
CN101071445A (en) Classified sample set optimizing method and content-related advertising server
CN102521713B (en) Data processing equipment and data processing method
CN107168937A (en) Financial cloud accounting element particle and assemble method based on XBRL
CN102208061A (en) Data cancel after verification processing device and method
CN114187082A (en) Financial accounting and reimbursement method and system
CN106485579A (en) A kind of tax declaration method
WO2008127443A1 (en) Image data extraction automation process
CN103914540A (en) Processing method for flexibly configuring ammeter specifications for ammeter reading
CN101320394A (en) Data acquisition method supporting multiple file types
CN112463985A (en) Government affair map model construction method, device, equipment and computer readable medium
Eberle et al. Record linkage of the linked employer-employee survey of the socio-economic panel study (SOEP-LEE) and the establishment history panel (BHP)
Bruntink An initial quality analysis of the ohloh software evolution data
CN112115271B (en) Knowledge graph construction method and device
CN109324963A (en) The method and terminal device of automatic test profitable result
CN1588405A (en) Data processing device and method for risk control system
CN112613853A (en) Data aggregation method and device, computer equipment and readable storage medium
CN113011831B (en) System for data auditing and data auditing method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant