CN103473375A - Data cleaning method and data cleaning system - Google Patents
Data cleaning method and data cleaning system Download PDFInfo
- Publication number
- CN103473375A CN103473375A CN2013104563951A CN201310456395A CN103473375A CN 103473375 A CN103473375 A CN 103473375A CN 2013104563951 A CN2013104563951 A CN 2013104563951A CN 201310456395 A CN201310456395 A CN 201310456395A CN 103473375 A CN103473375 A CN 103473375A
- Authority
- CN
- China
- Prior art keywords
- data
- field
- value
- record
- numeric
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004140 cleaning Methods 0.000 title claims abstract description 58
- 238000000034 method Methods 0.000 title claims abstract description 45
- 241001269238 Data Species 0.000 claims description 15
- 238000012937 correction Methods 0.000 claims description 11
- 230000001143 conditioned effect Effects 0.000 claims description 7
- 230000008901 benefit Effects 0.000 abstract description 2
- 230000008569 process Effects 0.000 description 11
- 238000010586 diagram Methods 0.000 description 8
- 238000010606 normalization Methods 0.000 description 6
- 230000008878 coupling Effects 0.000 description 4
- 238000010168 coupling process Methods 0.000 description 4
- 238000005859 coupling reaction Methods 0.000 description 4
- 230000008034 disappearance Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000012790 confirmation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
Images
Landscapes
- Medical Treatment And Welfare Office Work (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013104563951A CN103473375A (en) | 2013-09-29 | 2013-09-29 | Data cleaning method and data cleaning system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2013104563951A CN103473375A (en) | 2013-09-29 | 2013-09-29 | Data cleaning method and data cleaning system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103473375A true CN103473375A (en) | 2013-12-25 |
Family
ID=49798223
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2013104563951A Pending CN103473375A (en) | 2013-09-29 | 2013-09-29 | Data cleaning method and data cleaning system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103473375A (en) |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104504021A (en) * | 2014-12-11 | 2015-04-08 | 北京国双科技有限公司 | Data matching method and device |
CN104572946A (en) * | 2014-12-30 | 2015-04-29 | 小米科技有限责任公司 | Method and device for processing data of yellow pages |
CN104699796A (en) * | 2015-03-18 | 2015-06-10 | 浪潮集团有限公司 | Data cleaning method based on data warehouse |
CN104993958A (en) * | 2015-06-29 | 2015-10-21 | 北京京东尚科信息技术有限公司 | Method and system for generating user master data |
CN105447126A (en) * | 2015-11-17 | 2016-03-30 | 苏州蜗牛数字科技股份有限公司 | Game prop personalized recommendation method |
CN105468658A (en) * | 2014-09-26 | 2016-04-06 | 中国移动通信集团湖北有限公司 | Data cleaning method and apparatus |
CN106230890A (en) * | 2016-07-15 | 2016-12-14 | 中电长城网际系统应用有限公司 | A kind of message normalization processing method and system |
CN106294492A (en) * | 2015-06-08 | 2017-01-04 | 深圳中兴网信科技有限公司 | Data cleaning method and cleaning engine |
CN106446125A (en) * | 2016-09-19 | 2017-02-22 | 广东中标数据科技股份有限公司 | Method and device for improving data quality |
CN106933992A (en) * | 2017-02-24 | 2017-07-07 | 北京华安普惠高新技术有限公司 | Distributed data purging system and method based on data analysis |
CN107103048A (en) * | 2017-03-31 | 2017-08-29 | 苏州艾隆信息技术有限公司 | Medicine information matching process and system |
CN107229662A (en) * | 2016-03-25 | 2017-10-03 | 阿里巴巴集团控股有限公司 | Data cleaning method and device |
CN107408268A (en) * | 2015-01-28 | 2017-11-28 | 环联公司 | System and method for retrieving and processing credit data for centralized review |
CN108073591A (en) * | 2016-11-10 | 2018-05-25 | 北京宸信征信有限公司 | The integration storage system and method for a kind of multi-source data with identity attribute |
CN109241363A (en) * | 2018-06-04 | 2019-01-18 | 平安科技(深圳)有限公司 | List cleaning method, system, computer equipment and storage medium |
WO2019080427A1 (en) * | 2017-10-27 | 2019-05-02 | 平安科技(深圳)有限公司 | Medical data cleaning method, electronic apparatus and storage medium |
CN109947751A (en) * | 2018-12-29 | 2019-06-28 | 医渡云(北京)技术有限公司 | A kind of medical data processing method, device, readable medium and electronic equipment |
CN111581182A (en) * | 2020-04-21 | 2020-08-25 | 北京龙云科技有限公司 | Data cleaning method and device |
CN111949641A (en) * | 2020-08-06 | 2020-11-17 | 武汉理工光科股份有限公司 | Method and system for cleaning and synchronizing data between multi-stage platforms |
CN113535518A (en) * | 2021-07-23 | 2021-10-22 | 北京八分量信息科技有限公司 | Distributed real-time dynamic monitoring method and system for user behaviors |
CN113821503A (en) * | 2021-09-23 | 2021-12-21 | 北京金山云网络技术有限公司 | Medical data processing method and device and edge server |
CN115098478A (en) * | 2022-06-23 | 2022-09-23 | 中电通商数字技术(上海)有限公司 | Resident main index generation method, device and medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110055252A1 (en) * | 2003-03-28 | 2011-03-03 | Dun & Bradstreet, Inc. | System and method for data cleansing |
CN102156893A (en) * | 2011-03-24 | 2011-08-17 | 大连海事大学 | Cleaning system and method thereof for data acquired by RFID device under network |
CN102411569A (en) * | 2010-09-20 | 2012-04-11 | 上海众融信息技术有限公司 | Database conversion and cleaning information processing method |
-
2013
- 2013-09-29 CN CN2013104563951A patent/CN103473375A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110055252A1 (en) * | 2003-03-28 | 2011-03-03 | Dun & Bradstreet, Inc. | System and method for data cleansing |
CN102411569A (en) * | 2010-09-20 | 2012-04-11 | 上海众融信息技术有限公司 | Database conversion and cleaning information processing method |
CN102156893A (en) * | 2011-03-24 | 2011-08-17 | 大连海事大学 | Cleaning system and method thereof for data acquired by RFID device under network |
Non-Patent Citations (4)
Title |
---|
包从剑: "数据清洗的若干关键技术研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
叶振春: "实兵对抗演习评估系统中数据清理方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
杨宏娜: "基于数据仓库的数据清洗技术研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
陈伟: "数据清理关键技术及其软件平台的研究与应用", 《中国优秀博硕士学位论文全文数据库 (博士) 信息科技辑》 * |
Cited By (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105468658B (en) * | 2014-09-26 | 2020-04-03 | 中国移动通信集团湖北有限公司 | Data cleaning method and device |
CN105468658A (en) * | 2014-09-26 | 2016-04-06 | 中国移动通信集团湖北有限公司 | Data cleaning method and apparatus |
CN104504021A (en) * | 2014-12-11 | 2015-04-08 | 北京国双科技有限公司 | Data matching method and device |
CN104572946A (en) * | 2014-12-30 | 2015-04-29 | 小米科技有限责任公司 | Method and device for processing data of yellow pages |
CN104572946B (en) * | 2014-12-30 | 2018-07-06 | 小米科技有限责任公司 | Yellow page data processing method and processing device |
CN107408268A (en) * | 2015-01-28 | 2017-11-28 | 环联公司 | System and method for retrieving and processing credit data for centralized review |
CN104699796A (en) * | 2015-03-18 | 2015-06-10 | 浪潮集团有限公司 | Data cleaning method based on data warehouse |
CN106294492A (en) * | 2015-06-08 | 2017-01-04 | 深圳中兴网信科技有限公司 | Data cleaning method and cleaning engine |
CN104993958A (en) * | 2015-06-29 | 2015-10-21 | 北京京东尚科信息技术有限公司 | Method and system for generating user master data |
CN105447126A (en) * | 2015-11-17 | 2016-03-30 | 苏州蜗牛数字科技股份有限公司 | Game prop personalized recommendation method |
CN107229662B (en) * | 2016-03-25 | 2022-02-25 | 阿里巴巴集团控股有限公司 | Data cleaning method and device |
CN107229662A (en) * | 2016-03-25 | 2017-10-03 | 阿里巴巴集团控股有限公司 | Data cleaning method and device |
CN106230890A (en) * | 2016-07-15 | 2016-12-14 | 中电长城网际系统应用有限公司 | A kind of message normalization processing method and system |
CN106446125A (en) * | 2016-09-19 | 2017-02-22 | 广东中标数据科技股份有限公司 | Method and device for improving data quality |
CN106446125B (en) * | 2016-09-19 | 2019-12-24 | 广东中标数据科技股份有限公司 | Method and device for improving data quality |
CN108073591A (en) * | 2016-11-10 | 2018-05-25 | 北京宸信征信有限公司 | The integration storage system and method for a kind of multi-source data with identity attribute |
CN108073591B (en) * | 2016-11-10 | 2021-10-12 | 北京宸信征信有限公司 | Integrated storage system and method of multi-source data with identity attribute |
CN106933992A (en) * | 2017-02-24 | 2017-07-07 | 北京华安普惠高新技术有限公司 | Distributed data purging system and method based on data analysis |
CN106933992B (en) * | 2017-02-24 | 2018-02-06 | 北京华安普惠高新技术有限公司 | Distributed data purging system and method based on data analysis |
CN107103048B (en) * | 2017-03-31 | 2021-04-20 | 苏州艾隆信息技术有限公司 | Medicine information matching method and system |
CN107103048A (en) * | 2017-03-31 | 2017-08-29 | 苏州艾隆信息技术有限公司 | Medicine information matching process and system |
WO2019080427A1 (en) * | 2017-10-27 | 2019-05-02 | 平安科技(深圳)有限公司 | Medical data cleaning method, electronic apparatus and storage medium |
WO2019232952A1 (en) * | 2018-06-04 | 2019-12-12 | 平安科技(深圳)有限公司 | List clearing method, system, computer device, and storage medium |
CN109241363A (en) * | 2018-06-04 | 2019-01-18 | 平安科技(深圳)有限公司 | List cleaning method, system, computer equipment and storage medium |
CN109947751A (en) * | 2018-12-29 | 2019-06-28 | 医渡云(北京)技术有限公司 | A kind of medical data processing method, device, readable medium and electronic equipment |
CN111581182A (en) * | 2020-04-21 | 2020-08-25 | 北京龙云科技有限公司 | Data cleaning method and device |
CN111949641A (en) * | 2020-08-06 | 2020-11-17 | 武汉理工光科股份有限公司 | Method and system for cleaning and synchronizing data between multi-stage platforms |
CN111949641B (en) * | 2020-08-06 | 2023-07-14 | 武汉理工光科股份有限公司 | Method and system for cleaning and synchronizing data among multiple stages of platforms |
CN113535518A (en) * | 2021-07-23 | 2021-10-22 | 北京八分量信息科技有限公司 | Distributed real-time dynamic monitoring method and system for user behaviors |
CN113535518B (en) * | 2021-07-23 | 2023-12-05 | 北京八分量信息科技有限公司 | Distributed real-time dynamic monitoring method and system for user behaviors |
CN113821503A (en) * | 2021-09-23 | 2021-12-21 | 北京金山云网络技术有限公司 | Medical data processing method and device and edge server |
CN115098478A (en) * | 2022-06-23 | 2022-09-23 | 中电通商数字技术(上海)有限公司 | Resident main index generation method, device and medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103473375A (en) | Data cleaning method and data cleaning system | |
Langley et al. | A decision tree for nonmetric sex assessment from the skull | |
CN112365987B (en) | Diagnostic data abnormality detection method, diagnostic data abnormality detection device, computer device, and storage medium | |
US10275828B2 (en) | Expanded data processing for improved entity matching | |
CN103530334B (en) | Based on the data matching system and method for comparing template | |
CN110378347B (en) | Method and device for extracting key information of medical examination sheet | |
CN107194167A (en) | A kind of doctors and patients' data management system and method | |
CN103473373A (en) | Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method | |
CN101727535A (en) | Cross indexing method for patients crossing system and system thereof | |
CN111785341A (en) | Patient main index data merging method and device based on similarity | |
US20200013491A1 (en) | Interoperable Record Matching Process | |
Kamnikar et al. | Intraobserver error in macromorphoscopic trait data | |
CN109448811B (en) | Prescription auditing improvement method and device, electronic equipment and storage medium | |
CN109545319B (en) | Prescription alarm method based on knowledge relation analysis and terminal equipment | |
CN104063567A (en) | Establishment method of patient identity source cross reference | |
KR20190118618A (en) | Information processing apparatus, information processing method and recording medium | |
CN107480299B (en) | Information processing method and device | |
CN113221541A (en) | Data extraction method and device | |
CN108320779A (en) | Medical data processing method and processing device | |
CN108388610B (en) | Data ETL processing method and device | |
KR101456189B1 (en) | Method for evaluating patents using engine and evaluation server | |
CN116206767A (en) | Disease knowledge mining method, device, electronic equipment and storage medium | |
CN115293915A (en) | Service data verification method, device, equipment and storage medium | |
CN115640376A (en) | Text labeling method and device, electronic equipment and computer-readable storage medium | |
CN103489051A (en) | Method for checking and normalizing customer information in multiple information systems of fund company |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: PKU HEALTHCARE IT CO., LTD. Free format text: FORMER OWNER: FOUNDER INTERNATIONAL CO., LTD. Effective date: 20150203 Free format text: FORMER OWNER: FOUNDER INTERNATIONAL (BEIJING) CO., LTD. Effective date: 20150203 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 215123 SUZHOU, JIANGSU PROVINCE TO: 100080 HAIDIAN, BEIJING |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20150203 Address after: 100080, No. 19, No. 52 West Fourth Ring Road, Beijing, Haidian District Applicant after: Medical information Technology Co., Ltd. of Beijing University Address before: Suzhou City, Jiangsu Province, Suzhou Industrial Park 215123 Xinghu Street No. 328 Creative Industry Park founder International Building Applicant before: Founder International Co., Ltd. Applicant before: Founder international software (Beijing) Co., Ltd. |
|
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20131225 |