JP2008519328A5 - - Google Patents

Download PDF

Info

Publication number
JP2008519328A5
JP2008519328A5 JP2007539077A JP2007539077A JP2008519328A5 JP 2008519328 A5 JP2008519328 A5 JP 2008519328A5 JP 2007539077 A JP2007539077 A JP 2007539077A JP 2007539077 A JP2007539077 A JP 2007539077A JP 2008519328 A5 JP2008519328 A5 JP 2008519328A5
Authority
JP
Japan
Prior art keywords
documents
hit
forming
metric
measure
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2007539077A
Other languages
English (en)
Japanese (ja)
Other versions
JP2008519328A (ja
JP4908422B2 (ja
Filing date
Publication date
Priority claimed from US11/198,471 external-priority patent/US7533092B2/en
Application filed filed Critical
Publication of JP2008519328A publication Critical patent/JP2008519328A/ja
Publication of JP2008519328A5 publication Critical patent/JP2008519328A5/ja
Application granted granted Critical
Publication of JP4908422B2 publication Critical patent/JP4908422B2/ja
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

JP2007539077A 2004-10-28 2005-10-26 リンクベースのスパム検出 Expired - Lifetime JP4908422B2 (ja)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US62329504P 2004-10-28 2004-10-28
US60/623,295 2004-10-28
US11/198,471 2005-08-04
US11/198,471 US7533092B2 (en) 2004-10-28 2005-08-04 Link-based spam detection
PCT/US2005/038619 WO2006049996A2 (en) 2004-10-28 2005-10-26 Link-based spam detection

Publications (3)

Publication Number Publication Date
JP2008519328A JP2008519328A (ja) 2008-06-05
JP2008519328A5 true JP2008519328A5 (enExample) 2008-09-11
JP4908422B2 JP4908422B2 (ja) 2012-04-04

Family

ID=35705210

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2007539077A Expired - Lifetime JP4908422B2 (ja) 2004-10-28 2005-10-26 リンクベースのスパム検出

Country Status (6)

Country Link
US (1) US7533092B2 (enExample)
EP (1) EP1817697A2 (enExample)
JP (1) JP4908422B2 (enExample)
KR (1) KR101230687B1 (enExample)
CN (1) CN101180624B (enExample)
WO (1) WO2006049996A2 (enExample)

Families Citing this family (83)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7466663B2 (en) * 2000-10-26 2008-12-16 Inrotis Technology, Limited Method and apparatus for identifying components of a network having high importance for network integrity
US7743045B2 (en) * 2005-08-10 2010-06-22 Google Inc. Detecting spam related and biased contexts for programmable search engines
US20070038614A1 (en) * 2005-08-10 2007-02-15 Guha Ramanathan V Generating and presenting advertisements based on context data for programmable search engines
US7716199B2 (en) * 2005-08-10 2010-05-11 Google Inc. Aggregating context data for programmable search engines
US7693830B2 (en) * 2005-08-10 2010-04-06 Google Inc. Programmable search engine
US8125922B2 (en) * 2002-10-29 2012-02-28 Searchbolt Limited Method and apparatus for generating a ranked index of web pages
US7505964B2 (en) 2003-09-12 2009-03-17 Google Inc. Methods and systems for improving a search ranking using related queries
US7606793B2 (en) 2004-09-27 2009-10-20 Microsoft Corporation System and method for scoping searches using index keys
US20060069667A1 (en) * 2004-09-30 2006-03-30 Microsoft Corporation Content evaluation
US7533092B2 (en) * 2004-10-28 2009-05-12 Yahoo! Inc. Link-based spam detection
US20060123478A1 (en) * 2004-12-02 2006-06-08 Microsoft Corporation Phishing detection, prevention, and notification
US7634810B2 (en) * 2004-12-02 2009-12-15 Microsoft Corporation Phishing detection, prevention, and notification
US20110197114A1 (en) * 2004-12-08 2011-08-11 John Martin Electronic message response and remediation system and method
US7962510B2 (en) * 2005-02-11 2011-06-14 Microsoft Corporation Using content analysis to detect spam web pages
US8086605B2 (en) * 2005-06-28 2011-12-27 Yahoo! Inc. Search engine with augmented relevance ranking by community participation
US20070078939A1 (en) * 2005-09-26 2007-04-05 Technorati, Inc. Method and apparatus for identifying and classifying network documents as spam
US20090299819A1 (en) * 2006-03-04 2009-12-03 John Stannard Davis, III Behavioral Trust Rating Filtering System
US7580931B2 (en) * 2006-03-13 2009-08-25 Microsoft Corporation Topic distillation via subsite retrieval
WO2007123416A1 (en) * 2006-04-24 2007-11-01 Telenor Asa Method and device for efficiently ranking documents in a similarity graph
US7634476B2 (en) * 2006-07-25 2009-12-15 Microsoft Corporation Ranking of web sites by aggregating web page ranks
US20080033797A1 (en) * 2006-08-01 2008-02-07 Microsoft Corporation Search query monetization-based ranking and filtering
US20080126331A1 (en) * 2006-08-25 2008-05-29 Xerox Corporation System and method for ranking reference documents
US8661029B1 (en) 2006-11-02 2014-02-25 Google Inc. Modifying search result ranking based on implicit user feedback
US20080114753A1 (en) * 2006-11-15 2008-05-15 Apmath Ltd. Method and a device for ranking linked documents
US20080147669A1 (en) * 2006-12-14 2008-06-19 Microsoft Corporation Detecting web spam from changes to links of web sites
US7885952B2 (en) * 2006-12-20 2011-02-08 Microsoft Corporation Cloaking detection utilizing popularity and market value
US7693833B2 (en) * 2007-02-01 2010-04-06 John Nagle System and method for improving integrity of internet search
US8595204B2 (en) * 2007-03-05 2013-11-26 Microsoft Corporation Spam score propagation for web spam detection
US7680851B2 (en) 2007-03-07 2010-03-16 Microsoft Corporation Active spam testing system
US8938463B1 (en) 2007-03-12 2015-01-20 Google Inc. Modifying search result ranking based on implicit user feedback and a model of presentation bias
US8694374B1 (en) * 2007-03-14 2014-04-08 Google Inc. Detecting click spam
US7756987B2 (en) * 2007-04-04 2010-07-13 Microsoft Corporation Cybersquatter patrol
US20080270549A1 (en) * 2007-04-26 2008-10-30 Microsoft Corporation Extracting link spam using random walks and spam seeds
US7930303B2 (en) * 2007-04-30 2011-04-19 Microsoft Corporation Calculating global importance of documents based on global hitting times
US9092510B1 (en) 2007-04-30 2015-07-28 Google Inc. Modifying search result ranking based on a temporal element of user feedback
US7853589B2 (en) * 2007-04-30 2010-12-14 Microsoft Corporation Web spam page classification using query-dependent data
US7941391B2 (en) * 2007-05-04 2011-05-10 Microsoft Corporation Link spam detection using smooth classification function
US7788254B2 (en) * 2007-05-04 2010-08-31 Microsoft Corporation Web page analysis using multiple graphs
US9430577B2 (en) * 2007-05-31 2016-08-30 Microsoft Technology Licensing, Llc Search ranger system and double-funnel model for search spam analyses and browser protection
US7873635B2 (en) * 2007-05-31 2011-01-18 Microsoft Corporation Search ranger system and double-funnel model for search spam analyses and browser protection
US8667117B2 (en) * 2007-05-31 2014-03-04 Microsoft Corporation Search ranger system and double-funnel model for search spam analyses and browser protection
US8244737B2 (en) * 2007-06-18 2012-08-14 Microsoft Corporation Ranking documents based on a series of document graphs
US8438189B2 (en) * 2007-07-23 2013-05-07 Microsoft Corporation Local computation of rank contributions
US8694511B1 (en) 2007-08-20 2014-04-08 Google Inc. Modifying search result ranking based on populations
US8041338B2 (en) * 2007-09-10 2011-10-18 Microsoft Corporation Mobile wallet and digital payment
US8909655B1 (en) 2007-10-11 2014-12-09 Google Inc. Time based ranking
US9348912B2 (en) 2007-10-18 2016-05-24 Microsoft Technology Licensing, Llc Document length as a static relevance feature for ranking search results
US20090177690A1 (en) * 2008-01-03 2009-07-09 Sinem Guven Determining an Optimal Solution Set Based on Human Selection
US8219549B2 (en) * 2008-02-06 2012-07-10 Microsoft Corporation Forum mining for suspicious link spam sites detection
US8010482B2 (en) * 2008-03-03 2011-08-30 Microsoft Corporation Locally computable spam detection features and robust pagerank
US8812493B2 (en) 2008-04-11 2014-08-19 Microsoft Corporation Search results ranking using editing distance and document information
US20090307191A1 (en) * 2008-06-10 2009-12-10 Li Hong C Techniques to establish trust of a web page to prevent malware redirects from web searches or hyperlinks
EP2169568A1 (en) 2008-09-17 2010-03-31 OGS Search Limited Method and apparatus for generating a ranked index of web pages
US7974970B2 (en) * 2008-10-09 2011-07-05 Yahoo! Inc. Detection of undesirable web pages
US8396865B1 (en) 2008-12-10 2013-03-12 Google Inc. Sharing search engine relevance data between corpora
US9009146B1 (en) 2009-04-08 2015-04-14 Google Inc. Ranking search results based on similar queries
US8447760B1 (en) 2009-07-20 2013-05-21 Google Inc. Generating a related set of documents for an initial set of documents
US8498974B1 (en) 2009-08-31 2013-07-30 Google Inc. Refining search results
US8972391B1 (en) 2009-10-02 2015-03-03 Google Inc. Recent interest based relevance scoring
US8874555B1 (en) 2009-11-20 2014-10-28 Google Inc. Modifying scoring data based on historical changes
US8615514B1 (en) 2010-02-03 2013-12-24 Google Inc. Evaluating website properties by partitioning user feedback
US8924379B1 (en) 2010-03-05 2014-12-30 Google Inc. Temporal-based score adjustments
US8959093B1 (en) 2010-03-15 2015-02-17 Google Inc. Ranking search results based on anchors
US8738635B2 (en) * 2010-06-01 2014-05-27 Microsoft Corporation Detection of junk in search result ranking
US9623119B1 (en) 2010-06-29 2017-04-18 Google Inc. Accentuating search results
US8832083B1 (en) 2010-07-23 2014-09-09 Google Inc. Combining user feedback
US8707441B1 (en) * 2010-08-17 2014-04-22 Symantec Corporation Techniques for identifying optimized malicious search engine results
US8874566B2 (en) 2010-09-09 2014-10-28 Disney Enterprises, Inc. Online content ranking system based on authenticity metric values for web elements
US9002867B1 (en) 2010-12-30 2015-04-07 Google Inc. Modifying ranking data based on document changes
CN102222115B (zh) * 2011-07-12 2013-09-11 厦门大学 基于关键词共现的研究热点边连通度分析方法
CN102214245B (zh) * 2011-07-12 2013-09-11 厦门大学 基于关键词共现的研究热点图论分析方法
CN102571768B (zh) * 2011-12-26 2014-11-26 北京大学 一种钓鱼网站检测方法
CN102591965B (zh) * 2011-12-30 2014-07-09 奇智软件(北京)有限公司 一种黑链检测的方法及装置
US9495462B2 (en) 2012-01-27 2016-11-15 Microsoft Technology Licensing, Llc Re-ranking search results
US9002832B1 (en) 2012-06-04 2015-04-07 Google Inc. Classifying sites as low quality sites
US9183499B1 (en) 2013-04-19 2015-11-10 Google Inc. Evaluating quality based on neighbor features
CN103345499A (zh) * 2013-06-28 2013-10-09 宇龙计算机通信科技(深圳)有限公司 一种搜索引擎的搜索结果处理方法及装置
CN103412922B (zh) * 2013-08-12 2017-02-08 曙光信息产业股份有限公司 一种数据查询处理方法
US20170046376A1 (en) * 2015-04-03 2017-02-16 Yahoo! Inc. Method and system for monitoring data quality and dependency
CN105373598B (zh) * 2015-10-27 2017-03-15 广州神马移动信息科技有限公司 作弊站点识别方法及装置
CN108304395B (zh) * 2016-02-05 2022-09-06 北京迅奥科技有限公司 网页作弊检测
CN108984630B (zh) * 2018-06-20 2021-08-24 天津大学 复杂网络中节点重要性在垃圾网页检测中的应用方法
US12235952B2 (en) * 2021-07-21 2025-02-25 Y.E. Hub Armenia LLC Method and system for prioritizing web-resources for malicious data assessment

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4167652A (en) * 1974-10-17 1979-09-11 Telefonaktiebolaget L M Ericsson Method and apparatus for the interchanges of PCM word
US7082426B2 (en) * 1993-06-18 2006-07-25 Cnet Networks, Inc. Content aggregation method and apparatus for an on-line product catalog
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
US6728752B1 (en) * 1999-01-26 2004-04-27 Xerox Corporation System and method for information browsing using multi-modal features
US6678681B1 (en) * 1999-03-10 2004-01-13 Google Inc. Information extraction from a database
US6985431B1 (en) * 1999-08-27 2006-01-10 International Business Machines Corporation Network switch and components and method of operation
US6404752B1 (en) * 1999-08-27 2002-06-11 International Business Machines Corporation Network switch using network processor and methods
US6865575B1 (en) * 2000-07-06 2005-03-08 Google, Inc. Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query
US6529903B2 (en) * 2000-07-06 2003-03-04 Google, Inc. Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query
US20040193503A1 (en) * 2000-10-04 2004-09-30 Eder Jeff Scott Interactive sales performance management system
US7197470B1 (en) * 2000-10-11 2007-03-27 Buzzmetrics, Ltd. System and method for collection analysis of electronic discussion methods
US20040236673A1 (en) * 2000-10-17 2004-11-25 Eder Jeff Scott Collaborative risk transfer system
CA2323883C (en) * 2000-10-19 2016-02-16 Patrick Ryan Morin Method and device for classifying internet objects and objects stored oncomputer-readable media
US8509086B2 (en) * 2001-06-20 2013-08-13 Arbor Networks, Inc. Detecting network misuse
US7089252B2 (en) * 2002-04-25 2006-08-08 International Business Machines Corporation System and method for rapid computation of PageRank
US20040002988A1 (en) * 2002-06-26 2004-01-01 Praveen Seshadri System and method for modeling subscriptions and subscribers as data
CN1536483A (zh) * 2003-04-04 2004-10-13 陈文中 网络信息抽取及处理的方法及系统
US7346839B2 (en) * 2003-09-30 2008-03-18 Google Inc. Information retrieval based on historical data
US20050210008A1 (en) * 2004-03-18 2005-09-22 Bao Tran Systems and methods for analyzing documents over a network
US7343374B2 (en) * 2004-03-29 2008-03-11 Yahoo! Inc. Computation of page authority weights using personalized bookmarks
US20060064411A1 (en) * 2004-09-22 2006-03-23 William Gross Search engine using user intent
US20060085391A1 (en) * 2004-09-24 2006-04-20 Microsoft Corporation Automatic query suggestions
CA2584466A1 (en) * 2004-10-18 2006-04-27 Bioveris Corporation Systems and methods for obtaining, storing, processing and utilizing immunologic information of an individual or population
US7533092B2 (en) * 2004-10-28 2009-05-12 Yahoo! Inc. Link-based spam detection

Similar Documents

Publication Publication Date Title
JP2008519328A5 (enExample)
US11216503B1 (en) Clustering search results
KR100963623B1 (ko) 시맨틱 웹 자원의 랭킹처리방법
CN103020212B (zh) 一种基于用户查询日志实时发现热点视频的方法和装置
TWI615723B (zh) 網路搜尋方法及設備
CN105718582B (zh) 一种E-learning平台下学习资源个性化推荐系统及方法
CN101493819A (zh) 一种搜索引擎作弊检测的优化方法
AU2005201824A1 (en) Method and system for ranking documents of a search result to improve diversity and information richness
CN111095293A (zh) 图像美学处理方法及电子设备
CN101388024B (zh) 一种基于复杂网络的压缩空间高效搜索方法
CN105975596A (zh) 一种搜索引擎查询扩展的方法及系统
CN102456016B (zh) 一种对搜索结果进行排序的方法及装置
CN101075942A (zh) 基于专家值传播算法的社会网络专家信息处理系统及方法
US20130054566A1 (en) Acceleration of ranking algorithms using a graphics processing unit
JP2013522720A5 (enExample)
CN101556603A (zh) 一种用于对检索结果重新排序的协同检索方法
JP2005276225A5 (enExample)
US7765204B2 (en) Method of finding candidate sub-queries from longer queries
CN104281619A (zh) 搜索结果排序系统及方法
CN108280127A (zh) 一种海量相似新闻查重甄选方法、系统及装置
CN105956195B (zh) 简历搜索方法和装置
CN106095982A (zh) 简历搜索方法及装置
CN101604340B (zh) 一种获得查询的时新性的方法
CN103309962A (zh) 一种基于内容相关度和社交影响力的微博服务专家定位方法
Liang et al. Finding knowledgeable groups in enterprise corpora