HK1115930A1 - Link-based spam detection - Google Patents
Link-based spam detection
- Publication number
- HK1115930A1
- Authority
- HK
- Hong Kong
- Prior art keywords
- link
- spam detection
- based spam
- detection
- spam
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99935—Query augmenting and refining, e.g. inexact access
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99937—Sorting
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99943—Generating database or data structure, e.g. via user interface
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US62329504P | 2004-10-28 | 2004-10-28 | |
US11/198,471 US7533092B2 (en) | 2004-10-28 | 2005-08-04 | Link-based spam detection |
PCT/US2005/038619 WO2006049996A2 (en) | 2004-10-28 | 2005-10-26 | Link-based spam detection |
Publications (1)
Publication Number | Publication Date |
---|---|
HK1115930A1 (en) | 2008-12-12 |
Family
ID=35705210
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
HK08111675.1A HK1115930A1 (en) | 2004-10-28 | 2008-10-23 | Link-based spam detection |
Country Status (7)
Country | Link |
---|---|
US (1) | US7533092B2 (zh) |
EP (1) | EP1817697A2 (zh) |
JP (1) | JP4908422B2 (zh) |
KR (1) | KR101230687B1 (zh) |
CN (1) | CN101180624B (zh) |
HK (1) | HK1115930A1 (zh) |
WO (1) | WO2006049996A2 (zh) |
Families Citing this family (82)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7466663B2 (en) * | 2000-10-26 | 2008-12-16 | Inrotis Technology, Limited | Method and apparatus for identifying components of a network having high importance for network integrity |
US7743045B2 (en) * | 2005-08-10 | 2010-06-22 | Google Inc. | Detecting spam related and biased contexts for programmable search engines |
US7693830B2 (en) | 2005-08-10 | 2010-04-06 | Google Inc. | Programmable search engine |
US7716199B2 (en) * | 2005-08-10 | 2010-05-11 | Google Inc. | Aggregating context data for programmable search engines |
US20070038614A1 (en) * | 2005-08-10 | 2007-02-15 | Guha Ramanathan V | Generating and presenting advertisements based on context data for programmable search engines |
US8125922B2 (en) * | 2002-10-29 | 2012-02-28 | Searchbolt Limited | Method and apparatus for generating a ranked index of web pages |
US7505964B2 (en) | 2003-09-12 | 2009-03-17 | Google Inc. | Methods and systems for improving a search ranking using related queries |
US7606793B2 (en) | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US20060069667A1 (en) * | 2004-09-30 | 2006-03-30 | Microsoft Corporation | Content evaluation |
US7533092B2 (en) * | 2004-10-28 | 2009-05-12 | Yahoo! Inc. | Link-based spam detection |
US7634810B2 (en) * | 2004-12-02 | 2009-12-15 | Microsoft Corporation | Phishing detection, prevention, and notification |
US20060123478A1 (en) * | 2004-12-02 | 2006-06-08 | Microsoft Corporation | Phishing detection, prevention, and notification |
US20110197114A1 (en) * | 2004-12-08 | 2011-08-11 | John Martin | Electronic message response and remediation system and method |
US7962510B2 (en) * | 2005-02-11 | 2011-06-14 | Microsoft Corporation | Using content analysis to detect spam web pages |
WO2007002820A2 (en) * | 2005-06-28 | 2007-01-04 | Yahoo! Inc. | Search engine with augmented relevance ranking by community participation |
US20070078939A1 (en) * | 2005-09-26 | 2007-04-05 | Technorati, Inc. | Method and apparatus for identifying and classifying network documents as spam |
US20090299819A1 (en) * | 2006-03-04 | 2009-12-03 | John Stannard Davis, III | Behavioral Trust Rating Filtering System |
US7580931B2 (en) * | 2006-03-13 | 2009-08-25 | Microsoft Corporation | Topic distillation via subsite retrieval |
US7752198B2 (en) * | 2006-04-24 | 2010-07-06 | Telenor Asa | Method and device for efficiently ranking documents in a similarity graph |
US7634476B2 (en) * | 2006-07-25 | 2009-12-15 | Microsoft Corporation | Ranking of web sites by aggregating web page ranks |
US20080033797A1 (en) * | 2006-08-01 | 2008-02-07 | Microsoft Corporation | Search query monetization-based ranking and filtering |
US20080126331A1 (en) * | 2006-08-25 | 2008-05-29 | Xerox Corporation | System and method for ranking reference documents |
US8661029B1 (en) | 2006-11-02 | 2014-02-25 | Google Inc. | Modifying search result ranking based on implicit user feedback |
US20080114753A1 (en) * | 2006-11-15 | 2008-05-15 | Apmath Ltd. | Method and a device for ranking linked documents |
US20080147669A1 (en) * | 2006-12-14 | 2008-06-19 | Microsoft Corporation | Detecting web spam from changes to links of web sites |
US7885952B2 (en) * | 2006-12-20 | 2011-02-08 | Microsoft Corporation | Cloaking detection utilizing popularity and market value |
US7693833B2 (en) * | 2007-02-01 | 2010-04-06 | John Nagle | System and method for improving integrity of internet search |
US20080222725A1 (en) * | 2007-03-05 | 2008-09-11 | Microsoft Corporation | Graph structures and web spam detection |
US7680851B2 (en) | 2007-03-07 | 2010-03-16 | Microsoft Corporation | Active spam testing system |
US8938463B1 (en) | 2007-03-12 | 2015-01-20 | Google Inc. | Modifying search result ranking based on implicit user feedback and a model of presentation bias |
US8694374B1 (en) * | 2007-03-14 | 2014-04-08 | Google Inc. | Detecting click spam |
US7756987B2 (en) * | 2007-04-04 | 2010-07-13 | Microsoft Corporation | Cybersquatter patrol |
US20080270549A1 (en) * | 2007-04-26 | 2008-10-30 | Microsoft Corporation | Extracting link spam using random walks and spam seeds |
US9092510B1 (en) | 2007-04-30 | 2015-07-28 | Google Inc. | Modifying search result ranking based on a temporal element of user feedback |
US7930303B2 (en) * | 2007-04-30 | 2011-04-19 | Microsoft Corporation | Calculating global importance of documents based on global hitting times |
US7853589B2 (en) * | 2007-04-30 | 2010-12-14 | Microsoft Corporation | Web spam page classification using query-dependent data |
US7941391B2 (en) | 2007-05-04 | 2011-05-10 | Microsoft Corporation | Link spam detection using smooth classification function |
US7788254B2 (en) * | 2007-05-04 | 2010-08-31 | Microsoft Corporation | Web page analysis using multiple graphs |
US8667117B2 (en) * | 2007-05-31 | 2014-03-04 | Microsoft Corporation | Search ranger system and double-funnel model for search spam analyses and browser protection |
US7873635B2 (en) * | 2007-05-31 | 2011-01-18 | Microsoft Corporation | Search ranger system and double-funnel model for search spam analyses and browser protection |
US9430577B2 (en) * | 2007-05-31 | 2016-08-30 | Microsoft Technology Licensing, Llc | Search ranger system and double-funnel model for search spam analyses and browser protection |
US8244737B2 (en) * | 2007-06-18 | 2012-08-14 | Microsoft Corporation | Ranking documents based on a series of document graphs |
US8438189B2 (en) * | 2007-07-23 | 2013-05-07 | Microsoft Corporation | Local computation of rank contributions |
US8694511B1 (en) | 2007-08-20 | 2014-04-08 | Google Inc. | Modifying search result ranking based on populations |
US8041338B2 (en) * | 2007-09-10 | 2011-10-18 | Microsoft Corporation | Mobile wallet and digital payment |
US8909655B1 (en) | 2007-10-11 | 2014-12-09 | Google Inc. | Time based ranking |
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US20090177690A1 (en) * | 2008-01-03 | 2009-07-09 | Sinem Guven | Determining an Optimal Solution Set Based on Human Selection |
US8219549B2 (en) * | 2008-02-06 | 2012-07-10 | Microsoft Corporation | Forum mining for suspicious link spam sites detection |
US8010482B2 (en) * | 2008-03-03 | 2011-08-30 | Microsoft Corporation | Locally computable spam detection features and robust pagerank |
US8812493B2 (en) | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US20090307191A1 (en) * | 2008-06-10 | 2009-12-10 | Li Hong C | Techniques to establish trust of a web page to prevent malware redirects from web searches or hyperlinks |
EP2169568A1 (en) | 2008-09-17 | 2010-03-31 | OGS Search Limited | Method and apparatus for generating a ranked index of web pages |
US7974970B2 (en) * | 2008-10-09 | 2011-07-05 | Yahoo! Inc. | Detection of undesirable web pages |
US8396865B1 (en) | 2008-12-10 | 2013-03-12 | Google Inc. | Sharing search engine relevance data between corpora |
US9009146B1 (en) | 2009-04-08 | 2015-04-14 | Google Inc. | Ranking search results based on similar queries |
US8447760B1 (en) | 2009-07-20 | 2013-05-21 | Google Inc. | Generating a related set of documents for an initial set of documents |
US8498974B1 (en) | 2009-08-31 | 2013-07-30 | Google Inc. | Refining search results |
US8972391B1 (en) | 2009-10-02 | 2015-03-03 | Google Inc. | Recent interest based relevance scoring |
US8874555B1 (en) | 2009-11-20 | 2014-10-28 | Google Inc. | Modifying scoring data based on historical changes |
US8615514B1 (en) | 2010-02-03 | 2013-12-24 | Google Inc. | Evaluating website properties by partitioning user feedback |
US8924379B1 (en) | 2010-03-05 | 2014-12-30 | Google Inc. | Temporal-based score adjustments |
US8959093B1 (en) | 2010-03-15 | 2015-02-17 | Google Inc. | Ranking search results based on anchors |
US8738635B2 (en) * | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US9623119B1 (en) | 2010-06-29 | 2017-04-18 | Google Inc. | Accentuating search results |
US8832083B1 (en) | 2010-07-23 | 2014-09-09 | Google Inc. | Combining user feedback |
US8707441B1 (en) * | 2010-08-17 | 2014-04-22 | Symantec Corporation | Techniques for identifying optimized malicious search engine results |
US8874566B2 (en) | 2010-09-09 | 2014-10-28 | Disney Enterprises, Inc. | Online content ranking system based on authenticity metric values for web elements |
US9002867B1 (en) | 2010-12-30 | 2015-04-07 | Google Inc. | Modifying ranking data based on document changes |
CN102222115B (zh) * | 2011-07-12 | 2013-09-11 | 厦门大学 | 基于关键词共现的研究热点边连通度分析方法 |
CN102214245B (zh) * | 2011-07-12 | 2013-09-11 | 厦门大学 | 基于关键词共现的研究热点图论分析方法 |
CN102571768B (zh) * | 2011-12-26 | 2014-11-26 | 北京大学 | 一种钓鱼网站检测方法 |
CN102591965B (zh) * | 2011-12-30 | 2014-07-09 | 奇智软件(北京)有限公司 | 一种黑链检测的方法及装置 |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
US9002832B1 (en) | 2012-06-04 | 2015-04-07 | Google Inc. | Classifying sites as low quality sites |
US9183499B1 (en) | 2013-04-19 | 2015-11-10 | Google Inc. | Evaluating quality based on neighbor features |
CN103345499A (zh) * | 2013-06-28 | 2013-10-09 | 宇龙计算机通信科技(深圳)有限公司 | 一种搜索引擎的搜索结果处理方法及装置 |
CN103412922B (zh) * | 2013-08-12 | 2017-02-08 | 曙光信息产业股份有限公司 | 一种数据查询处理方法 |
WO2016155007A1 (en) * | 2015-04-03 | 2016-10-06 | Yahoo! Inc. | Method and system for monitoring data quality and dependency |
CN105373598B (zh) * | 2015-10-27 | 2017-03-15 | 广州神马移动信息科技有限公司 | 作弊站点识别方法及装置 |
CN108304395B (zh) * | 2016-02-05 | 2022-09-06 | 北京迅奥科技有限公司 | 网页作弊检测 |
CN108984630B (zh) * | 2018-06-20 | 2021-08-24 | 天津大学 | 复杂网络中节点重要性在垃圾网页检测中的应用方法 |
Family Cites Families (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4167652A (en) * | 1974-10-17 | 1979-09-11 | Telefonaktiebolaget L M Ericsson | Method and apparatus for the interchanges of PCM word |
US7082426B2 (en) * | 1993-06-18 | 2006-07-25 | Cnet Networks, Inc. | Content aggregation method and apparatus for an on-line product catalog |
US6285999B1 (en) * | 1997-01-10 | 2001-09-04 | The Board Of Trustees Of The Leland Stanford Junior University | Method for node ranking in a linked database |
US6728752B1 (en) * | 1999-01-26 | 2004-04-27 | Xerox Corporation | System and method for information browsing using multi-modal features |
US6678681B1 (en) * | 1999-03-10 | 2004-01-13 | Google Inc. | Information extraction from a database |
US6404752B1 (en) * | 1999-08-27 | 2002-06-11 | International Business Machines Corporation | Network switch using network processor and methods |
US6985431B1 (en) * | 1999-08-27 | 2006-01-10 | International Business Machines Corporation | Network switch and components and method of operation |
US6529903B2 (en) * | 2000-07-06 | 2003-03-04 | Google, Inc. | Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query |
US6865575B1 (en) * | 2000-07-06 | 2005-03-08 | Google, Inc. | Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query |
US20040193503A1 (en) * | 2000-10-04 | 2004-09-30 | Eder Jeff Scott | Interactive sales performance management system |
US7197470B1 (en) * | 2000-10-11 | 2007-03-27 | Buzzmetrics, Ltd. | System and method for collection analysis of electronic discussion methods |
US20040236673A1 (en) * | 2000-10-17 | 2004-11-25 | Eder Jeff Scott | Collaborative risk transfer system |
CA2323883C (en) * | 2000-10-19 | 2016-02-16 | Patrick Ryan Morin | Method and device for classifying internet objects and objects stored on computer-readable media |
US8509086B2 (en) * | 2001-06-20 | 2013-08-13 | Arbor Networks, Inc. | Detecting network misuse |
US7089252B2 (en) * | 2002-04-25 | 2006-08-08 | International Business Machines Corporation | System and method for rapid computation of PageRank |
US20040002988A1 (en) * | 2002-06-26 | 2004-01-01 | Praveen Seshadri | System and method for modeling subscriptions and subscribers as data |
CN1536483A (zh) * | 2003-04-04 | 2004-10-13 | 陈文中 | 网络信息抽取及处理的方法及系统 |
US7346839B2 (en) * | 2003-09-30 | 2008-03-18 | Google Inc. | Information retrieval based on historical data |
US20050210008A1 (en) * | 2004-03-18 | 2005-09-22 | Bao Tran | Systems and methods for analyzing documents over a network |
US7343374B2 (en) * | 2004-03-29 | 2008-03-11 | Yahoo! Inc. | Computation of page authority weights using personalized bookmarks |
US20060064411A1 (en) * | 2004-09-22 | 2006-03-23 | William Gross | Search engine using user intent |
US20060085391A1 (en) * | 2004-09-24 | 2006-04-20 | Microsoft Corporation | Automatic query suggestions |
WO2006045004A2 (en) * | 2004-10-18 | 2006-04-27 | Bioveris Corporation | System and method for obtaining, storing, and processing immunologic information of individuals and populations |
US7533092B2 (en) * | 2004-10-28 | 2009-05-12 | Yahoo! Inc. | Link-based spam detection |
2005
- 2005-08-04: US application US11/198,471, publication US7533092B2 (en); not active (Expired - Fee Related)
- 2005-10-26: WO application PCT/US2005/038619, publication WO2006049996A2 (en); active (Application Filing)
- 2005-10-26: CN application CN2005800372291A, publication CN101180624B (zh); not active (Expired - Fee Related)
- 2005-10-26: JP application JP2007539077A, publication JP4908422B2 (ja); active
- 2005-10-26: EP application EP05821001A, publication EP1817697A2 (en); not active (Ceased)
- 2005-10-26: KR application KR1020077011999A, publication KR101230687B1 (ko); active (IP Right Grant)
2008
- 2008-10-23: HK application HK08111675.1A, publication HK1115930A1; not active (IP Right Cessation)
Also Published As
Publication number | Publication date |
---|---|
CN101180624B (zh) | 2012-05-09 |
WO2006049996A2 (en) | 2006-05-11 |
US7533092B2 (en) | 2009-05-12 |
US20060095416A1 (en) | 2006-05-04 |
JP4908422B2 (ja) | 2012-04-04 |
WO2006049996A3 (en) | 2007-09-27 |
CN101180624A (zh) | 2008-05-14 |
JP2008519328A (ja) | 2008-06-05 |
KR20070085477A (ko) | 2007-08-27 |
EP1817697A2 (en) | 2007-08-15 |
KR101230687B1 (ko) | 2013-02-07 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
HK1115930A1 (en) | Link-based spam detection | |
GB2400667B (en) | Attention detection | |
IL181842A0 (en) | Multi-threat detection system | |
EP1780643A4 (en) | QUARANTINE SYSTEM | |
GB2444664B (en) | Fire detection | |
GB0519968D0 (en) | Object detection | |
GB0328736D0 (en) | Object detection | |
EP1811920A4 (en) | WIRELESS BREAK DETECTION | |
GB0411064D0 (en) | Nodule boundary detection | |
EP1812604A4 (en) | DETECTION ASSAY IN ONE STEP | |
GB0412026D0 (en) | Object detection | |
GB0512743D0 (en) | Range detection | |
GB0514688D0 (en) | Smoke detection | |
GB0328735D0 (en) | Object detection | |
GB0328741D0 (en) | Object detection | |
GB0503638D0 (en) | Detection arrangement | |
GB0328743D0 (en) | Object detection | |
GB0328739D0 (en) | Object detection | |
GB2414295B (en) | Nodule detection | |
GB0404299D0 (en) | Improved can processing | |
GB0404740D0 (en) | Tool detection | |
GB0424127D0 (en) | Supercomputing | |
GB0514706D0 (en) | Fire detection | |
GB0509497D0 (en) | Detection system | |
GB0424125D0 (en) | Supercomputing |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PC | Patent ceased (i.e. patent has lapsed due to the failure to pay the renewal fee) | Effective date: 20211026 |