CN103778217A - Current webpage list-based method and system for recommendation - Google Patents
Current webpage list-based method and system for recommendation Download PDFInfo
- Publication number
- CN103778217A CN103778217A CN201410024821.9A CN201410024821A CN103778217A CN 103778217 A CN103778217 A CN 103778217A CN 201410024821 A CN201410024821 A CN 201410024821A CN 103778217 A CN103778217 A CN 103778217A
- Authority
- CN
- China
- Prior art keywords
- url
- collected
- webpage
- module
- web page
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000012545 processing Methods 0.000 claims abstract description 5
- 241000270322 Lepidosauria Species 0.000 claims description 14
- 238000004458 analytical method Methods 0.000 claims description 6
- 230000009193 crawling Effects 0.000 claims description 6
- 230000008569 process Effects 0.000 claims description 5
- 230000006870 function Effects 0.000 description 13
- 238000001514 detection method Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000003780 insertion Methods 0.000 description 3
- 230000037431 insertion Effects 0.000 description 3
- 230000007547 defect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410024821.9A CN103778217A (en) | 2014-01-20 | 2014-01-20 | Current webpage list-based method and system for recommendation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410024821.9A CN103778217A (en) | 2014-01-20 | 2014-01-20 | Current webpage list-based method and system for recommendation |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103778217A true CN103778217A (en) | 2014-05-07 |
Family
ID=50570452
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410024821.9A Pending CN103778217A (en) | 2014-01-20 | 2014-01-20 | Current webpage list-based method and system for recommendation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103778217A (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150356196A1 (en) * | 2014-06-04 | 2015-12-10 | International Business Machines Corporation | Classifying uniform resource locators |
CN105630980A (en) * | 2015-12-25 | 2016-06-01 | 北京奇虎科技有限公司 | Game recommending strategy obtaining method and device |
CN106126648A (en) * | 2016-06-23 | 2016-11-16 | 华南理工大学 | A kind of based on the distributed merchandise news reptile method redo log |
CN109472637A (en) * | 2018-10-18 | 2019-03-15 | 微梦创科网络科技(中国)有限公司 | A method and device for optimizing user scheduled advertising |
CN110020058A (en) * | 2017-12-30 | 2019-07-16 | 中国移动通信集团贵州有限公司 | Information processing method, device, equipment and medium |
CN110781386A (en) * | 2019-10-10 | 2020-02-11 | 支付宝(杭州)信息技术有限公司 | Information recommendation method and device, and bloom filter creation method and device |
CN110968578A (en) * | 2018-09-28 | 2020-04-07 | 中建水务环保有限公司 | Sewage treatment process recommendation method and device |
CN111209458A (en) * | 2018-11-22 | 2020-05-29 | 顺丰科技有限公司 | Data processing system and method for web crawler |
-
2014
- 2014-01-20 CN CN201410024821.9A patent/CN103778217A/en active Pending
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150356196A1 (en) * | 2014-06-04 | 2015-12-10 | International Business Machines Corporation | Classifying uniform resource locators |
US20160179929A1 (en) * | 2014-06-04 | 2016-06-23 | International Business Machines Corporation | Classifying uniform resource locators |
US9569522B2 (en) * | 2014-06-04 | 2017-02-14 | International Business Machines Corporation | Classifying uniform resource locators |
US9582565B2 (en) * | 2014-06-04 | 2017-02-28 | International Business Machines Corporation | Classifying uniform resource locators |
US9928292B2 (en) * | 2014-06-04 | 2018-03-27 | International Business Machines Corporation | Classifying uniform resource locators |
US9928301B2 (en) * | 2014-06-04 | 2018-03-27 | International Business Machines Corporation | Classifying uniform resource locators |
CN105630980A (en) * | 2015-12-25 | 2016-06-01 | 北京奇虎科技有限公司 | Game recommending strategy obtaining method and device |
CN105630980B (en) * | 2015-12-25 | 2019-05-28 | 北京奇虎科技有限公司 | Game recommdation strategy acquisition methods and device |
CN106126648A (en) * | 2016-06-23 | 2016-11-16 | 华南理工大学 | A kind of based on the distributed merchandise news reptile method redo log |
CN106126648B (en) * | 2016-06-23 | 2019-04-09 | 华南理工大学 | A distributed commodity information crawler method based on redo log |
CN110020058A (en) * | 2017-12-30 | 2019-07-16 | 中国移动通信集团贵州有限公司 | Information processing method, device, equipment and medium |
CN110968578A (en) * | 2018-09-28 | 2020-04-07 | 中建水务环保有限公司 | Sewage treatment process recommendation method and device |
CN110968578B (en) * | 2018-09-28 | 2023-04-25 | 中建生态环境集团有限公司 | Sewage treatment process recommendation method and device |
CN109472637A (en) * | 2018-10-18 | 2019-03-15 | 微梦创科网络科技(中国)有限公司 | A method and device for optimizing user scheduled advertising |
CN111209458A (en) * | 2018-11-22 | 2020-05-29 | 顺丰科技有限公司 | Data processing system and method for web crawler |
CN110781386A (en) * | 2019-10-10 | 2020-02-11 | 支付宝(杭州)信息技术有限公司 | Information recommendation method and device, and bloom filter creation method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103778217A (en) | Current webpage list-based method and system for recommendation | |
US20240111818A1 (en) | Method for training isolation forest, and method for recognizing web crawler | |
CN102760138B (en) | Classification method and device for user network behaviors and search method and device for user network behaviors | |
TWI497325B (en) | Method for classification of objects in a graph data stream | |
CN105404699A (en) | Method, device and server for searching articles of finance and economics | |
CN102567494B (en) | Website classification method and device | |
CN110602045B (en) | Malicious webpage identification method based on feature fusion and machine learning | |
CN101814083A (en) | Automatic webpage classification method and system | |
CN106528847A (en) | Multi-dimensional processing method and system for massive data | |
CN105224636A (en) | A kind of data access method and device | |
US8799237B2 (en) | Identification disambiguation in databases | |
CN108241867B (en) | Classification method and device | |
CN104503891A (en) | Method and device for online monitoring JVM (Java Virtual Machine) thread | |
CN1716259A (en) | Method and system for ranking objects based on intra-type and inter-type relationships | |
CN104077286A (en) | Commodity information search method and system | |
CN105183873A (en) | Malicious clicking behavior detection method and device | |
US20220019742A1 (en) | Situational awareness by fusing multi-modal data with semantic model | |
US20140358867A1 (en) | De-duplication deployment planning | |
CN103186666A (en) | Method, device and equipment for searching based on favorites | |
US20090259649A1 (en) | System and method for detecting templates of a website using hyperlink analysis | |
US20150269138A1 (en) | Publication Scope Visualization and Analysis | |
CN110546633A (en) | Named entity based category tag addition for documents | |
US10147095B2 (en) | Chain understanding in search | |
CN103455491A (en) | Method and device for classifying search terms | |
CN103605744A (en) | Method and device for analyzing website searching engine traffic data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C53 | Correction of patent for invention or patent application | ||
CB03 | Change of inventor or designer information |
Inventor after: Cui Jingjing Inventor after: Lin Jiajie Inventor after: Wu Peng Inventor after: Ma Zhanguo Inventor after: Li Chunhua Inventor before: Cui Jingjing Inventor before: Lin Jiajie Inventor before: Wu Peng Inventor before: Ma Zhanguo Inventor before: Li Chunhua Inventor before: Liu Lina |
|
COR | Change of bibliographic data |
Free format text: CORRECT: INVENTOR; FROM: CUI JINGJING LIN JIAJIE WU PENG MA ZHANGUO LI CHUNHUA LIU LINA TO: CUI JINGJING LIN JIAJIE WU PENG MA ZHANGUO LI CHUNHUA |
|
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140507 |
|
RJ01 | Rejection of invention patent application after publication |