CN103530334A - System and method for data matching based on comparison module - Google Patents
- Publication number
- CN103530334A (application CN201310456767.0A)
- Authority
- CN
- China
- Prior art keywords
- data
- threshold
- similarity
- data record
- matching
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/22—Social work or social welfare, e.g. community support activities or counselling services
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Data Mining & Analysis (AREA)
- Tourism & Hospitality (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Strategic Management (AREA)
- General Business, Economics & Management (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- General Health & Medical Sciences (AREA)
- Child & Adolescent Psychology (AREA)
- Economics (AREA)
- Primary Health Care (AREA)
- Quality & Reliability (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
ID | A | B | C | D |
---|---|---|---|---|
1 | a1 | b1 | c1 | d1 |
2 | a2 | b2 | c2 | d2 |
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310456767.0A CN103530334B (en) | 2013-09-29 | 2013-09-29 | Data matching system and method based on comparison template |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310456767.0A CN103530334B (en) | 2013-09-29 | 2013-09-29 | Data matching system and method based on comparison template |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103530334A (en) | 2014-01-22 |
CN103530334B (en) | 2018-01-23 |
Family
ID=49932343
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310456767.0A Active CN103530334B (en) | Data matching system and method based on comparison template | 2013-09-29 | 2013-09-29 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103530334B (en) |
2013
- 2013-09-29: CN application CN201310456767.0A, patent CN103530334B, status active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101739414A (en) * | 2008-11-25 | 2010-06-16 | 华中师范大学 | Ontological concept mapping method |
EP2592575A3 (en) * | 2011-11-08 | 2013-07-31 | Comcast Cable Communications, LLC | Content descriptor |
CN103186427A (en) * | 2011-12-31 | 2013-07-03 | 中国银联股份有限公司 | System and method for analyzing data record set |
CN102542262A (en) * | 2012-01-04 | 2012-07-04 | 东南大学 | Waveform identification method based on operating-characteristic working condition waveform library of high-speed rail |
CN103257961A (en) * | 2012-02-15 | 2013-08-21 | 北大方正集团有限公司 | Method, device and system of bibliography repeat removal |
Non-Patent Citations (5)
Title |
---|
THINKPHPER: ""大数据量的分表方法"", 《BLOG.SINA.COM.CN/S/BLOG_64492FE10100QI3I.HTML》 * |
ZHAO HAO ET AL.: ""Adaptive threshold backtracking matching pursuit for compressive sensing"", 《IET INTERNATIONAL RADAR CONFERENCE 2013》 * |
洪圆等: ""一种使用双阀值的数据仓库环境下重复记录消除算法"", 《计算机工程与应用》 * |
陈波: ""征信系统中实体匹配方法及应用研究"", 《中国博士学位论文全文数据库 经济与管理科学辑》 * |
齐为华: ""不同应用系统相关数据的匹配检测与借用"", 《2007年CAD/CAM学术交流会议论文集》 * |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104809141A (en) * | 2014-01-29 | 2015-07-29 | 携程计算机技术(上海)有限公司 | Matching system and method of hotel data |
CN105096028A (en) * | 2014-11-20 | 2015-11-25 | 北京航天金盾科技有限公司 | Intelligent matching method of population data |
CN106681524A (en) * | 2015-11-10 | 2017-05-17 | 阿里巴巴集团控股有限公司 | Method and device for processing information |
CN107291672A (en) * | 2016-03-31 | 2017-10-24 | 阿里巴巴集团控股有限公司 | Method and apparatus for processing data tables |
CN106021526A (en) * | 2016-05-25 | 2016-10-12 | 东软集团股份有限公司 | News classification method and device |
CN106021526B (en) * | 2016-05-25 | 2019-09-27 | 东软集团股份有限公司 | News classification method and device |
WO2018166343A1 (en) * | 2017-03-13 | 2018-09-20 | 腾讯科技(深圳)有限公司 | Data fusion method and device, storage medium and electronic device |
CN108664497B (en) * | 2017-03-30 | 2020-11-03 | 大有秦鼎(北京)科技有限公司 | Data matching method and device |
CN108664497A (en) * | 2017-03-30 | 2018-10-16 | 大有秦鼎(北京)科技有限公司 | Data matching method and device |
CN107193860B (en) * | 2017-03-31 | 2021-03-02 | 苏州艾隆信息技术有限公司 | Medicine information multidimensional identification method and system |
CN107193860A (en) * | 2017-03-31 | 2017-09-22 | 苏州艾隆信息技术有限公司 | Medicine information multidimensional identification method and system |
CN107103048B (en) * | 2017-03-31 | 2021-04-20 | 苏州艾隆信息技术有限公司 | Medicine information matching method and system |
CN107103048A (en) * | 2017-03-31 | 2017-08-29 | 苏州艾隆信息技术有限公司 | Medicine information matching method and system |
CN107203686A (en) * | 2017-03-31 | 2017-09-26 | 苏州艾隆信息技术有限公司 | Medicine information difference processing method and system |
CN108038504A (en) * | 2017-12-11 | 2018-05-15 | 深圳房讯通信息技术有限公司 | Method for parsing property ownership certificate photo content |
CN108920601B (en) * | 2018-06-27 | 2020-12-01 | 中国联合网络通信集团有限公司 | Data matching method and device |
CN108920601A (en) * | 2018-06-27 | 2018-11-30 | 中国联合网络通信集团有限公司 | Data matching method and device |
CN109063178B (en) * | 2018-08-22 | 2019-12-24 | 四川新网银行股份有限公司 | Method and device for automatically expanding self-help analysis report |
CN109063178A (en) * | 2018-08-22 | 2018-12-21 | 四川新网银行股份有限公司 | Method and device for automatically expanding self-help analysis report |
CN113535943A (en) * | 2020-04-14 | 2021-10-22 | 阿里巴巴集团控股有限公司 | Medical record classification method and device and data record classification method and device |
CN111737533A (en) * | 2020-06-19 | 2020-10-02 | 东软集团股份有限公司 | Processing method and device for inspection items, storage medium and equipment |
CN111737533B (en) * | 2020-06-19 | 2024-02-09 | 东软集团股份有限公司 | Method, device, storage medium and equipment for processing inspection items |
CN112732703A (en) * | 2021-03-23 | 2021-04-30 | 中国信息通信研究院 | Metadata processing method, metadata processing apparatus, and readable storage medium |
CN113434584A (en) * | 2021-06-28 | 2021-09-24 | 国网北京市电力公司 | Data processing method and device for power equipment and electronic equipment |
CN113434584B (en) * | 2021-06-28 | 2022-10-14 | 国网北京市电力公司 | Data processing method and device for power equipment and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN103530334B (en) | 2018-01-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103530334A (en) | System and method for data matching based on comparison module | |
CN103473375A (en) | Data cleaning method and data cleaning system | |
CN103473373A (en) | Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method | |
US20150356128A1 (en) | Index key generating device, index key generating method, and search method | |
US9177020B2 (en) | Gathering index statistics using sampling | |
JP2013536492A (en) | Data analysis using multiple systems | |
CN108038130A (en) | Automatic cleaning method, device, equipment and the storage medium of fictitious users | |
CN103714086A (en) | Method and device used for generating non-relational data base module | |
WO2022222942A1 (en) | Method and apparatus for generating question and answer record, electronic device, and storage medium | |
Ji et al. | Anthropometry and classification of auricular concha for the ergonomic design of earphones | |
US20190108270A1 (en) | Data convergence | |
CN110909168A (en) | Knowledge graph updating method and device, storage medium and electronic device | |
CN111160855A (en) | Report sheet automatic auditing method, device, equipment and storage medium | |
CN113111063A (en) | Medical patient main index discovery method applied to multiple data sources | |
CN113743477A (en) | Histogram data publishing method based on differential privacy | |
CN111640517B (en) | Medical record coding method and device, storage medium and electronic equipment | |
CN109346146A (en) | Checking prescription distribution method, device, electronic equipment and storage medium | |
CN106961508A (en) | Communication means and device based on Sex criminals | |
CN116150632A (en) | Internet of things equipment identification method based on local sensitive hash in intelligent home | |
CN112163127B (en) | Relationship graph construction method and device, electronic equipment and storage medium | |
CN110175220B (en) | Document similarity measurement method and system based on keyword position structure distribution | |
CN112991131A (en) | Government affair data processing method suitable for electronic government affair platform | |
CN108846543B (en) | Computing method and device for non-overlapping community set quality metric index | |
CN111209284A (en) | Metadata-based table dividing method and device | |
CN113449102A (en) | Text clustering method, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right | Owner name: PKU HEALTHCARE IT CO., LTD. Free format text: FORMER OWNER: FOUNDER INTERNATIONAL CO., LTD. Effective date: 20150203 Free format text: FORMER OWNER: FOUNDER INTERNATIONAL (BEIJING) CO., LTD. Effective date: 20150203 |
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data | Free format text: CORRECT: ADDRESS; FROM: 215123 SUZHOU, JIANGSU PROVINCE TO: 100080 HAIDIAN, BEIJING |
TA01 | Transfer of patent application right | Effective date of registration: 20150203 Address after: 100080, No. 19, No. 52 West Fourth Ring Road, Beijing, Haidian District Applicant after: Peking University Medical Information Technology Co.,Ltd. Address before: Suzhou City, Jiangsu Province, Suzhou Industrial Park 215123 Xinghu Street No. 328 Creative Industry Park founder International Building Applicant before: FOUNDER INTERNATIONAL Co.,Ltd. Applicant before: Founder International Co.,Ltd. (Beijing) |
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PP01 | Preservation of patent right | Effective date of registration: 20240202 Granted publication date: 20180123 |
PP01 | Preservation of patent right |