Connect public, paid and private patent data with Google Patents Public Datasets

基于元数据去除重复对象的方法

Info

Publication number
CN100576207C
CN100576207C CN 200710106024 CN200710106024A CN100576207C CN 100576207 C CN100576207 C CN 100576207C CN 200710106024 CN200710106024 CN 200710106024 CN 200710106024 A CN200710106024 A CN 200710106024A CN 100576207 C CN100576207 C CN 100576207C
Authority
CN
Grant status
Grant
Patent type
Prior art keywords
data
value
similarity
meta
comparison
Prior art date
Application number
CN 200710106024
Other languages
English (en)
Chinese (zh)
Other versions
CN101286156A (zh )
Inventor
飞 高
Original Assignee
北大方正集团有限公司;北京方正阿帕比技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Grant date

Links

CN 200710106024 2007-05-29 2007-05-29 基于元数据去除重复对象的方法 CN100576207C (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 200710106024 CN100576207C (zh) 2007-05-29 2007-05-29 基于元数据去除重复对象的方法

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200710106024 CN100576207C (zh) 2007-05-29 2007-05-29 基于元数据去除重复对象的方法

Publications (2)

Publication Number Publication Date
CN101286156A true CN101286156A (zh) 2008-10-15
CN100576207C true CN100576207C (zh) 2009-12-30

Family

ID=40058367

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200710106024 CN100576207C (zh) 2007-05-29 2007-05-29 基于元数据去除重复对象的方法

Country Status (1)

Country Link
CN (1) CN100576207C (de)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102236635A (zh) * 2010-04-22 2011-11-09 上海百果信息科技有限公司 一种通过捕捉比对关键元素实现多系统信息关联的方法
CN102609419B (zh) * 2011-01-21 2015-02-18 北京世纪读秀技术有限公司 相似数据排重方法
CN102609418B (zh) * 2011-01-21 2015-02-04 北京世纪读秀技术有限公司 数据质量级别判断方法
US9223511B2 (en) 2011-04-08 2015-12-29 Micron Technology, Inc. Data deduplication
CN102325347A (zh) * 2011-09-14 2012-01-18 中兴通讯股份有限公司 一种lte系统中的传输流模板匹配方法及装置
US9489133B2 (en) * 2011-11-30 2016-11-08 International Business Machines Corporation Optimizing migration/copy of de-duplicated data
CN103166917B (zh) * 2011-12-12 2016-02-10 阿里巴巴集团控股有限公司 网络设备身份识别方法及系统
CN103257961B (zh) * 2012-02-15 2016-08-10 北大方正集团有限公司 书目消重的方法、装置及系统
CN103425711B (zh) * 2012-05-25 2017-08-25 株式会社理光 基于多对象实例的对象值对齐方法
CN103729369B (zh) * 2012-10-15 2017-06-13 金蝶软件(中国)有限公司 自动处理撞单的方法及装置
US20150032609A1 (en) * 2013-07-29 2015-01-29 International Business Machines Corporation Correlation of data sets using determined data types
CN103473654A (zh) * 2013-09-23 2013-12-25 国家电网公司 一种用于电力erp系统的资产数据清理辅助方法及系统
CN104899408A (zh) * 2014-03-05 2015-09-09 孙宝文 有趣项集获取方法和装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000073960A1 (en) 1999-05-28 2000-12-07 Goto.Com, Inc. System and method for influencing a position on a search result list generated by a computer network search engine
CN1333640A (zh) 2000-07-11 2002-01-30 三洋电机株式会社 移动终端机
CN1416644A (zh) 2000-11-09 2003-05-07 皇家菲利浦电子有限公司 基于内容过滤以限制重复出现的方法和系统

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000073960A1 (en) 1999-05-28 2000-12-07 Goto.Com, Inc. System and method for influencing a position on a search result list generated by a computer network search engine
CN1333640A (zh) 2000-07-11 2002-01-30 三洋电机株式会社 移动终端机
CN1416644A (zh) 2000-11-09 2003-05-07 皇家菲利浦电子有限公司 基于内容过滤以限制重复出现的方法和系统

Also Published As

Publication number Publication date Type
CN101286156A (zh) 2008-10-15 application

Similar Documents

Publication Publication Date Title
Kumar et al. Extracting large-scale knowledge bases from the web
Sparck Jones Automatic indexing
Wang et al. Mining correlated bursty topic patterns from coordinated text streams
US20090043797A1 (en) System And Methods For Clustering Large Database of Documents
US20040205524A1 (en) Spreadsheet data processing system
US20090222395A1 (en) Systems, methods, and software for entity extraction and resolution coupled with event and relationship extraction
US20090049062A1 (en) Method for Organizing Structurally Similar Web Pages from a Web Site
US20130268526A1 (en) Discovery engine
US20090226098A1 (en) Character string updated degree evaluation program
Hasan Dalip et al. Automatic quality assessment of content created collaboratively by web communities: a case study of wikipedia
Wu et al. Webiq: Learning from the web to match deep-web query interfaces
Simmons et al. Memes Online: Extracted, Subtracted, Injected, and Recollected.
US20120036130A1 (en) Systems, methods, software and interfaces for entity extraction and resolution and tagging
CN101127042A (zh) 一种基于语言模型的情感分类方法
Wang et al. Bootstrapping both product features and opinion words from chinese customer reviews with cross-inducing
CN101377777A (zh) 一种自动问答方法和系统
Das Sarma et al. Dynamic relationship and event discovery
CN102254014A (zh) 一种网页特征自适应的信息抽取方法
US20120036125A1 (en) Method and system for integrating web-based systems with local document processing applications
CN102360383A (zh) 一种面向文本的领域术语与术语关系抽取方法
Cui CharaParser for fine‐grained semantic annotation of organism morphological descriptions
CN102279894A (zh) 基于语义的查找、集成和提供评论信息的方法及搜索系统
Ji et al. A source code linearization technique for detecting plagiarized programs
CN1158460A (zh) 一种跨语种语料自动分类与检索方法
Blohm et al. Using the web to reduce data sparseness in pattern-based information extraction

Legal Events

Date Code Title Description
C06 Publication
C10 Request of examination as to substance
C14 Granted
C41 Transfer of the right of patent application or the patent right
ASS Succession or assignment of patent right

Free format text: FORMER OWNER: PEKING UNIVERSITY FOUNDER GROUP CORP.

Owner name: LIDE TECHNOLOGY DEVELOPMENT CO., LTD.

Effective date: 20120823

COR Bibliographic change or correction in the description

Free format text: CORRECT: ADDRESS; FROM: 100871 HAIDIAN, BEIJING TO: 409000 QIANJIANG, CHONGQING