CN1728147B - Method and system for determining similarity of objects based on heterogeneous relationships - Google Patents

Method and system for determining similarity of objects based on heterogeneous relationships

Publication number
CN1728147B
Authority
CN
China
Prior art keywords
similarity
target
type
osculant
relation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2005100922448A
Other languages
English (en)
Chinese (zh)
Other versions
CN1728147A (zh)
Inventor
B·章
G·薛
H-J·曾
马维英
陈正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2004-05-14
Filing date
2005-05-16
Publication date
2010-09-08
Application filed by Microsoft Corp
Publication of CN1728147A
Application granted
Publication of CN1728147B
Anticipated expiration
Expired - Fee Related

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00 Digital computers in general; Data processing equipment in general
    • G06F15/16 Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/955 Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9558 Details of hyperlinks; Management of linked annotations
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99932 Access augmentation or optimizing
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Computer Hardware Design (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Stored Programmes (AREA)
  • Complex Calculations (AREA)
CN2005100922448A 2004-05-14 2005-05-16 Method and system for determining similarity of objects based on heterogeneous relationships Expired - Fee Related CN1728147B (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/846,949 2004-05-14
US10/846,949 US7376643B2 (en) 2004-05-14 2004-05-14 Method and system for determining similarity of objects based on heterogeneous relationships

Publications (2)

Publication Number Publication Date
CN1728147A (zh) 2006-02-01
CN1728147B (zh) 2010-09-08

Family

ID=34939829

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2005100922448A Expired - Fee Related CN1728147B (zh) 2004-05-14 2005-05-16 Method and system for determining similarity of objects based on heterogeneous relationships

Country Status (10)

Country Link
US (1) US7376643B2
EP (1) EP1596314B1
JP (1) JP5147162B2
KR (1) KR101130533B1
CN (1) CN1728147B
AU (1) AU2005202016A1
BR (1) BRPI0503220A
CA (1) CA2507365A1
MX (1) MXPA05005219A
RU (1) RU2419857C2

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8135698B2 (en) * 2004-06-25 2012-03-13 International Business Machines Corporation Techniques for representing relationships between queries
US7779001B2 (en) * 2004-10-29 2010-08-17 Microsoft Corporation Web page ranking with hierarchical considerations
US8762280B1 (en) * 2004-12-02 2014-06-24 Google Inc. Method and system for using a network analysis system to verify content on a website
US7509320B2 (en) * 2005-12-14 2009-03-24 Siemens Aktiengesellschaft Methods and apparatus to determine context relevant information
US8332386B2 (en) * 2006-03-29 2012-12-11 Oracle International Corporation Contextual search of a collaborative environment
US8768932B1 (en) * 2007-05-14 2014-07-01 Google Inc. Method and apparatus for ranking search results
US20090198666A1 (en) * 2008-02-01 2009-08-06 Winston Ronald H Affinity matching system and method
US8321803B2 (en) * 2008-06-19 2012-11-27 International Business Machines Corporation Aggregating service components
CN101615178B (zh) * 2008-06-26 2013-01-09 日电(中国)有限公司 Method and system for establishing an object hierarchy
US20100211533A1 (en) * 2009-02-18 2010-08-19 Microsoft Corporation Extracting structured data from web forums
US9443209B2 (en) * 2009-04-30 2016-09-13 Paypal, Inc. Recommendations based on branding
US9286411B2 (en) * 2009-06-25 2016-03-15 International Business Machines Corporation Retrieval of relevant objects in a similarity
CN102341802B (zh) * 2009-06-30 2014-05-28 国际商业机器公司 System and method for calculating graph similarity
US8266149B2 (en) * 2010-12-10 2012-09-11 Yahoo! Inc. Clustering with similarity-adjusted entropy
US9460390B1 (en) * 2011-12-21 2016-10-04 Emc Corporation Analyzing device similarity
CN103218358A (zh) * 2012-01-18 2013-07-24 百度在线网络技术(北京)有限公司 Diff scoring method and system
US9292793B1 (en) * 2012-03-31 2016-03-22 Emc Corporation Analyzing device similarity
US20140067443A1 (en) * 2012-08-28 2014-03-06 International Business Machines Corporation Business process transformation recommendation generation
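CN108738036B (zh) * 2017-04-14 2021-06-18 广州杰赛科技股份有限公司 Method and system for extracting key users in mobile communications
CN107766498B (zh) * 2017-10-19 2022-01-07 北京百度网讯科技有限公司 Method and apparatus for generating information
CN108256070B (zh) * 2018-01-17 2022-07-15 北京百度网讯科技有限公司 Method and apparatus for generating information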
CA3096119A1 (en) * 2019-10-07 2021-04-07 Royal Bank Of Canada System and method for link prediction with semantic analysis
TWI742446B (zh) * 2019-10-08 2021-10-11 東方線上股份有限公司 Phrase library expansion system and method
CN118018269B (zh) * 2024-01-31 2024-12-24 北京亚鸿世纪科技发展有限公司 Data security analysis method and system

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6421675B1 (en) 1998-03-16 2002-07-16 S. L. I. Systems, Inc. Search engine
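JP2001160067A (ja) * 1999-09-22 2001-06-12 Ddi Corp Similar document retrieval method and recommended article notification service system using the similar document retrieval method
JP3678985B2 (ja) 2000-08-25 2005-08-03 日本電信電話株式会社 Method and apparatus for automatically determining similarity between web pages, and medium recording the program therefor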
US7440943B2 (en) 2000-12-22 2008-10-21 Xerox Corporation Recommender system and method
US7251648B2 (en) 2002-06-28 2007-07-31 Microsoft Corporation Automatically ranking answers to database queries
WO2005008526A1 (en) * 2003-07-23 2005-01-27 University College Dublin, National University Of Ireland, Dublin Information retrieval

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
John A. Tomlin. A new paradigm for ranking pages on the World Wide Web. Proceedings of the 12th international conference on World Wide Web, ACM. 2003, 350-352 <http://portal.acm.org/citation.cfm?id=775202&coll=ACM&dl=ACM&CFID=54162463&CFTOKEN=34995373>. *
Taher H. Haveliwala. Topic-sensitive PageRank: A context-sensitive ranking algorithm for web search. Proceedings of the 11th international conference on World Wide Web, ACM. 2002, 517-526 <http://portal.acm.org/citation.cfm?id=511513&coll=ACM&dl=ACM&CFID=54162463&CFTOKEN=34995373>. *
Wen J-R et al. Query clustering using user logs. ACM Transactions on Information Systems. 2002, 20(1), 62-78 <http://portal.acm.org/citation.cfm?id=503108&coll=ACM&dl=ACM&CFID=54162463&CFTOKEN=34995373>. *

Also Published As

Publication number Publication date
AU2005202016A1 (en) 2005-12-22
EP1596314A1 (en) 2005-11-16
US20050256833A1 (en) 2005-11-17
EP1596314B1 (en) 2013-07-17
US7376643B2 (en) 2008-05-20
KR20060047858A (ko) 2006-05-18
CA2507365A1 (en) 2005-11-14
JP5147162B2 (ja) 2013-02-20
BRPI0503220A (pt) 2006-01-10
KR101130533B1 (ko) 2012-04-12
RU2419857C2 (ru) 2011-05-27
CN1728147A (zh) 2006-02-01
MXPA05005219A (es) 2005-12-06
JP2005327299A (ja) 2005-11-24
RU2005114666A (ru) 2006-11-20
AU2005202016A8 (en) 2005-12-22

Similar Documents

Publication Publication Date Title
CN1728147B (zh) Method and system for determining similarity of objects based on heterogeneous relationships
RU2517271C2 (ru) Document length as a static relevance feature for ranking search results
Nasraoui et al. A web usage mining framework for mining evolving user profiles in dynamic web sites
Johnson et al. Collective, hierarchical clustering from distributed, heterogeneous data
US7953723B1 (en) Federation for parallel searching
US7117206B1 (en) Method for ranking hyperlinked pages using content and connectivity analysis
CN1716259B (zh) Method and system for ranking objects based on intra-type and inter-type relationships
US8688682B2 (en) Query expression evaluation using sample based projected selectivity
Xing et al. Efficient data mining for web navigation patterns
US20050131929A1 (en) Computer-implemented multidimensional database processing method and system
Gong et al. Business information query expansion through semantic network
Li et al. DSM-PLW: Single-pass mining of path traversal patterns over streaming Web click-sequences
Lahiri et al. Identifying correlated heavy-hitters in a two-dimensional data stream
Skhiri et al. Large graph mining: recent developments, challenges and potential solutions
US20030018623A1 (en) System and method of query processing of time variant objects
Wu et al. Approxrank: Estimating rank for a subgraph
Lieberam-Schmidt Analyzing and influencing search engine results: business and technology impacts on web information retrieval
Tsoukanara et al. Skyline-based temporal graph exploration
Agarwal et al. Semantic methods and tools for information portals
Tsoukanara et al. Skyline-based exploration of temporal property graphs
Wu et al. Automatic topics discovery from hyperlinked documents
AnjanKumar et al. Probabilistic classification techniques to perform geographical labeling of web objects
Bai et al. Index-based top k α-maximal-clique enumeration over uncertain graphs
Kennedy et al. Detecting the temporal context of queries
Li et al. Mining unexpected Web usage behaviors

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20100908

Termination date: 20140516