CN108804540A - search engine link analysis system and analysis method - Google Patents
search engine link analysis system and analysis method Download PDFInfo
- Publication number
- CN108804540A CN108804540A CN201810431864.7A CN201810431864A CN108804540A CN 108804540 A CN108804540 A CN 108804540A CN 201810431864 A CN201810431864 A CN 201810431864A CN 108804540 A CN108804540 A CN 108804540A
- Authority
- CN
- China
- Prior art keywords
- website
- chain
- information
- link
- data information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/07—Responding to the occurrence of a fault, e.g. fault tolerance
- G06F11/14—Error detection or correction of the data by redundancy in operation
- G06F11/1402—Saving, restoring, recovering or retrying
- G06F11/1446—Point-in-time backing up or restoration of persistent data
- G06F11/1458—Management of the backup or restore process
- G06F11/1464—Management of the backup or restore process for networked environments
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Quality & Reliability (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Present invention is disclosed a kind of search engine link analysis system and analysis method, system includes internet cloud platform unit, information scratching unit, information memory cell, information operation processing unit and client feedback unit;Method includes internet cloud platform step, information scratching step, information storing step member, information operation processing step and client feedback step.The present invention judges the reliability in search result source, and in this, as foundation, redistributed to the weight in search result analytic process, to improve the accuracy and reliability of search result by the confirmation to searching for information source.Meanwhile the present invention can monitor the keyword ranking in all kinds of websites in real time, reduce the influence of all kinds of bursts, abnormal conditions to search result, avoid the adverse effect that artificial malicious link brings network search engines.
Description
Technical field
The present invention relates to a kind of analysis system and analysis methods, and in particular to a kind of search engine link analysis system and point
Analysis method, belongs to field of Internet search.
Background technology
With the universal of internet, the continuous development of network search engines, people increasingly incline when consulting various information
Use search engine, network search engines utilization rate in people's daily life and popularity rate also higher and higher in selection.
Also just because of such development trend, the ranking system of network search engines also comes into being.In general, net
The ranking system of network search engine can be automatic to tie according to information such as the click volumes of keyword in the volumes of searches of keyword, website
The ranking of search result is calculated, and is presented to user in the form of from high to low.
But in actual application process, technical staff has found, current existing search engine ranking system is easy to
Influenced by the malice of all kinds of illegal websites, informal forum etc. in network, be especially embodied in blog group, forum mass-sending with
And several aspects such as station group.For station group and blog group, can in a short time it be copied by modes such as replication links greatly
The keyword of amount, and for forum mass-sends, it can also be big for keyword manufacture by way of voting to target keyword
The click volume of amount.Both above-mentioned ways all can generate malice to the ranking system of search engine to be influenced, and search result is caused
Accuracy substantially reduced with reliability.
In conclusion how to provide a kind of search engine link analysis system and analysis method, drawn with improving web search
The accuracy of search result is held up, those skilled in that art institute urgent problem to be solved is just become.
Invention content
In view of the prior art there are drawbacks described above, the purpose of the present invention is to propose to a kind of search engine link analysis system and
Analysis method.
The purpose of the present invention will be achieved by the following technical programs:
A kind of search engine link analysis system, including:
Internet cloud platform unit obtains the data information in WWW for establishing data connection with WWW;
Information scratching unit obtains data information for the operation requests according to user in WWW, and to data information into
Row is reprinted and is issued;
Information memory cell, the data information grabbed for receiving information scratching unit, and data information is stored
Backup;
Information operation processing unit, the operation requests for receiving user, and obtained in information memory cell according to operation requests
Data information is taken, and carries out calculation process;
Client feedback unit, the operation requests for keying in user, and the handling result of information operation processing unit is fed back
To user.
Preferably, described information placement unit includes:
Crawler server, for capturing data information in WWW;
Website server, the operation requests for receiving user complete data information crawl according to operation and control crawler server,
And the data information grabbed reprinting is issued.
Preferably, described information operation processing unit includes:
Network segment enquiry module, for the network segment belonging to query web IP;
Inquiry of the domain name module is used for nslookup IP and domain name owner information;
Threshold setting module, for anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical link amount threshold to be arranged
And the amount threshold that interlinks, handle foundation as judgement;
Anti-chain number rate of climb judgment module, the rate of climb for detecting website anti-chain number are simultaneously compared, when detection website
When the anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
Outer chain growth speed judgment module, growth rate for detecting website exterior chain are simultaneously compared, when detection website exterior chain
When growth rate is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
First content comparing module, for comparing anchor file and linking content of pages, when anchor file is unrelated with content of pages is linked
When, drop power operation is carried out to exterior chain;
Secondary content comparing module, for comparing the website anti-chain page and link content of pages, when the website anti-chain page and link
When content of pages is unrelated, drop power operation is carried out to exterior chain;
Website exterior chain analysis module compares website exterior chain content for detecting, existing in acquisition website to link identical anti-chain
Quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis module in website compares website url linked contents for detecting, and obtains mutual chain between url link similar websites
The quantity connect carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
Preferably, the Anchor Text is the contextual information where link.
Preferably, the client feedback unit includes App clients or Web client.
A kind of search engine link analysis method, including:
S1, internet cloud platform step establish data connection with WWW, obtain the data information in WWW;
S2, information scratching step, the operation requests according to user obtain data information in WWW, and are carried out to data information
Reprinting issues;
S3, information storing step receive the data information that information scratching unit has grabbed, and to data information store standby
Part;
S4, information operation processing step receive the operation requests of user, and are obtained in information memory cell according to operation requests
Data information, and carry out calculation process;
S5, client feedback step key in the operation requests of user, and the handling result of information operation processing unit are fed back to
User.
Preferably, described information crawl step includes:
Sub-step is arranged in S21, crawler server, and crawler server is arranged, data information is captured in WWW;
Sub-step is arranged in S22, Website server, and Website server is arranged, and receives the operation requests of user, is climbed according to operation and control
Worm server completes data information crawl, and the data information grabbed reprinting is issued.
Preferably, described information operation processing step includes:
S41, the network segment inquire sub-step, the network segment belonging to query web IP;
S42, inquiry of the domain name sub-step, nslookup IP and domain name owner information;
S43, threshold value set sub-step, setting anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical number of links threshold
It is worth and the amount threshold that interlinks, as judging to handle foundation;
S44, the anti-chain number rate of climb judge sub-step, detect the rate of climb of website anti-chain number and are compared, when detection net
When the anti-chain number rate of climb of standing is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
S45, outer chain growth speed judge sub-step, detect the growth rate of website exterior chain and are compared, when outside detection website
When chain growth speed is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
S46, first content compare sub-step, compare and anchor file and link content of pages, when anchor file with link content of pages without
Guan Shi carries out drop power operation to exterior chain;
S47, secondary content compare sub-step, the comparison website anti-chain page and link content of pages, when the website anti-chain page and chain
Connect content of pages it is unrelated when, to exterior chain carry out drop power operation;
Link analysis sub-step outside S48, website, detection compare website exterior chain content, and it is identical anti-to obtain existing link in website
Chain quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis sub-step in S49, website detects and compares website url linked contents, between acquisition url link similar websites mutually
The quantity of link carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
Preferably, the Anchor Text is the contextual information where link.
Preferably, the client feedback step includes setting App clients or Web client.
Compared in the prior art, protrusion effect of the invention is as follows:
The present invention judges the reliability in search result source by the multiple confirmation to searching for information source, and in this, as foundation,
Weight in search result analytic process is redistributed, to improve the accuracy and reliability of search result.
Meanwhile the present invention can monitor the keyword ranking in all kinds of websites in real time, reduce all kinds of bursts, abnormal conditions
Influence to search result avoids the adverse effect that artificial malicious link brings network search engines.
In addition, the analysis system and analysis method of the present invention can also be applied in the system of all kinds of close functions, it is each
Arithmetic processing system of the class based on internet big data provides reliable information source, applicability and versatile.
In conclusion the present invention provides effective link analysis system and analysis method, using effect it is good and
It is compatible strong, there is very high use and promotional value.
Just attached drawing in conjunction with the embodiments below, the embodiment of the present invention is described in further detail, so that of the invention
Technical solution is more readily understood, grasps.
Description of the drawings
Fig. 1 is the structure diagram of analysis system in the present invention.
Specific implementation mode
As shown, present invention is disclosed a kind of search engine link analysis system and analysis methods.
Specifically, a kind of search engine link analysis system, including:
Internet cloud platform unit obtains the data information in WWW for establishing data connection with WWW.
Information scratching unit obtains data information for the operation requests according to user in WWW, and logarithm it is believed that
Breath reprint and is issued.
Information memory cell, the data information grabbed for receiving information scratching unit, and data information is carried out
Storage backup.In the present embodiment, described information storage unit is Elasticsearch databases.
Information operation processing unit, the operation requests for receiving user, and according to operation requests in information memory cell
Interior acquisition data information, and carry out calculation process.
Client feedback unit, the operation requests for keying in user, and by the handling result of information operation processing unit
Feed back to user.
Described information placement unit includes:
More crawler servers, for capturing data information in WWW.
An at least Website server, the operation requests for receiving user are completed according to operation and control crawler server
Data information captures, and the data information grabbed reprinting is issued.
Described information operation processing unit includes:
Network segment enquiry module, for the network segment belonging to query web IP.
Inquiry of the domain name module is used for nslookup IP and domain name owner information.
Threshold setting module, for anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical number of links to be arranged
Threshold value and the amount threshold that interlinks handle foundation as judgement.
Anti-chain number rate of climb judgment module, the rate of climb for detecting website anti-chain number are simultaneously compared, and work as detection
When the website anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled.
Outer chain growth speed judgment module, growth rate for detecting website exterior chain are simultaneously compared, when detection website
When outer chain growth speed is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website.
First content comparing module, for comparing anchor file and linking content of pages, when anchor file with link content of pages
When unrelated, drop power operation is carried out to exterior chain.
Secondary content comparing module, for comparing the website anti-chain page and link content of pages, when the website anti-chain page and
When link content of pages is unrelated, drop power operation is carried out to exterior chain.
Website exterior chain analysis module compares website exterior chain content for detecting, and it is identical to obtain existing link in website
Anti-chain quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain.
Link analysis module in website compares website url linked contents for detecting, and obtains phase between url link similar websites
The quantity mutually linked carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
The Anchor Text is the contextual information where link.
The client feedback unit includes App clients or Web client.
Present invention further teaches a kind of search engine link analysis methods, including:
S1, internet cloud platform step establish data connection with WWW, obtain the data information in WWW.
S2, information scratching step, the operation requests according to user obtain data information in WWW, and to data information
Reprint and issues.
S3, information storing step receive the data information that information scratching unit has grabbed, and are deposited to data information
Lay in part.
S4, information operation processing step, receive the operation requests of user, and according to operation requests in information memory cell
Data information is obtained, and carries out calculation process.
S5, client feedback step key in the operation requests of user, and the handling result of information operation processing unit is anti-
Feed user.
Described information crawl step includes:
Sub-step is arranged in S21, crawler server, and crawler server is arranged, data information is captured in WWW.
Sub-step is arranged in S22, Website server, and Website server is arranged, and receives the operation requests of user, according to operation control
Crawler server processed completes data information crawl, and the data information grabbed reprinting is issued.
Described information operation processing step includes:
S41, the network segment inquire sub-step, the network segment belonging to query web IP.
S42, inquiry of the domain name sub-step, nslookup IP and domain name owner information.
S43, threshold value set sub-step, setting anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical link number
Threshold value and the amount threshold that interlinks are measured, foundation is handled as judgement.
S44, the anti-chain number rate of climb judge sub-step, detect the rate of climb of website anti-chain number and are compared, work as inspection
When the survey grid station anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, website is carried out at the processing of drop power or emphasis monitoring
Reason.
S45, outer chain growth speed judge sub-step, detect the growth rate of website exterior chain and are compared, when detection net
When outer chain growth speed of standing is more than outer chain growth speed threshold value, it is believed that be there are a large amount of hair chain advertisements to cause, at this time to website
Exterior chain carry out drop power operation.
S46, first content compare sub-step, compare and anchor file and link content of pages, when anchor file with link in the page
When holding unrelated, drop power operation is carried out to exterior chain.
S47, secondary content compare sub-step, the comparison website anti-chain page and link content of pages, when the website anti-chain page
When unrelated with link content of pages, drop power operation is carried out to exterior chain.
Link analysis sub-step outside S48, website, detection compare website exterior chain content, and it is identical to obtain existing link in website
Anti-chain quantity, when linking identical anti-chain quantity and being more than identical link amount threshold, it is believed that be because of forum, blog group
Hair causes, and carries out drop power operation to website or exterior chain at this time.
Link analysis sub-step in S49, website, detection compare website url linked contents, and acquisition url is linked between similar website
The quantity to interlink, when the quantity to interlink, which is more than, interlinks amount threshold, it is believed that led because establishing station group
It causes, drop power operation is carried out to website or exterior chain at this time.
The Anchor Text is the contextual information where link.
The client feedback step includes setting App clients or Web client.
The present invention judges the reliability in search result source by the multiple confirmation to searching for information source, and in this, as
Foundation redistributes the weight in search result analytic process, to improve the accuracy and reliability of search result.
Meanwhile the present invention can monitor the keyword ranking in all kinds of websites in real time, reduce all kinds of bursts, abnormal conditions
Influence to search result avoids the adverse effect that artificial malicious link brings network search engines.
In addition, the analysis system and analysis method of the present invention can also be applied in the system of all kinds of close functions, it is each
Arithmetic processing system of the class based on internet big data provides reliable information source, applicability and versatile.
In conclusion the present invention provides effective link analysis system and analysis method, using effect it is good and
It is compatible strong, there is very high use and promotional value.
It is obvious to a person skilled in the art that invention is not limited to the details of the above exemplary embodiments, Er Qie
In the case of without departing substantially from spirit and essential characteristics of the invention, the present invention can be realized in other specific forms.Therefore, no matter
From the point of view of which point, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the present invention is by appended power
Profit requires rather than above description limits, it is intended that all by what is fallen within the meaning and scope of the equivalent requirements of the claims
Variation is included within the present invention, and should not be considered as the note of any attached drawing table in claim and be limited the claims involved.
In addition, it should be understood that although this specification is described in terms of embodiments, but not each embodiment is only wrapped
Containing an independent technical solution, this description of the specification is merely for the sake of clarity, and those skilled in the art should
It considers the specification as a whole, the technical solutions in the various embodiments may also be suitably combined, forms those skilled in the art
The other embodiment being appreciated that.
Claims (10)
1. a kind of search engine link analysis system, including:
Internet cloud platform unit obtains the data information in WWW for establishing data connection with WWW;
Information scratching unit obtains data information for the operation requests according to user in WWW, and to data information into
Row is reprinted and is issued;
Information memory cell, the data information grabbed for receiving information scratching unit, and data information is stored
Backup;
Information operation processing unit, the operation requests for receiving user, and obtained in information memory cell according to operation requests
Data information is taken, and carries out calculation process;
Client feedback unit, the operation requests for keying in user, and the handling result of information operation processing unit is fed back
To user.
2. search engine link analysis system according to claim 1, which is characterized in that described information placement unit packet
It includes:
Crawler server, for capturing data information in WWW;
Website server, the operation requests for receiving user complete data information crawl according to operation and control crawler server,
And the data information grabbed reprinting is issued.
3. search engine link analysis system according to claim 1, which is characterized in that described information operation processing unit
Including:
Network segment enquiry module, for the network segment belonging to query web IP;
Inquiry of the domain name module is used for nslookup IP and domain name owner information;
Threshold setting module, for anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical link amount threshold to be arranged
And the amount threshold that interlinks, handle foundation as judgement;
Anti-chain number rate of climb judgment module, the rate of climb for detecting website anti-chain number are simultaneously compared, when detection website
When the anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
Outer chain growth speed judgment module, growth rate for detecting website exterior chain are simultaneously compared, when detection website exterior chain
When growth rate is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
First content comparing module, for comparing anchor file and linking content of pages, when anchor file is unrelated with content of pages is linked
When, drop power operation is carried out to exterior chain;
Secondary content comparing module, for comparing the website anti-chain page and link content of pages, when the website anti-chain page and link
When content of pages is unrelated, drop power operation is carried out to exterior chain;
Website exterior chain analysis module compares website exterior chain content for detecting, existing in acquisition website to link identical anti-chain
Quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis module in website compares website url linked contents for detecting, and obtains mutual chain between url link similar websites
The quantity connect carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
4. search engine link analysis system according to claim 3, it is characterised in that:The Anchor Text is link place
Contextual information.
5. search engine link analysis system according to claim 1, which is characterized in that the client feedback unit packet
Include App clients or Web client.
6. a kind of search engine link analysis method, including:
S1, internet cloud platform step establish data connection with WWW, obtain the data information in WWW;
S2, information scratching step, the operation requests according to user obtain data information in WWW, and are carried out to data information
Reprinting issues;
S3, information storing step receive the data information that information scratching unit has grabbed, and to data information store standby
Part;
S4, information operation processing step receive the operation requests of user, and are obtained in information memory cell according to operation requests
Data information, and carry out calculation process;
S5, client feedback step key in the operation requests of user, and the handling result of information operation processing unit are fed back to
User.
7. search engine link analysis method according to claim 6, which is characterized in that described information crawl step packet
It includes:
Sub-step is arranged in S21, crawler server, and crawler server is arranged, data information is captured in WWW;
Sub-step is arranged in S22, Website server, and Website server is arranged, and receives the operation requests of user, is climbed according to operation and control
Worm server completes data information crawl, and the data information grabbed reprinting is issued.
8. search engine link analysis method according to claim 6, which is characterized in that described information operation processing step
Including:
S41, the network segment inquire sub-step, the network segment belonging to query web IP;
S42, inquiry of the domain name sub-step, nslookup IP and domain name owner information;
S43, threshold value set sub-step, setting anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical number of links threshold
It is worth and the amount threshold that interlinks, as judging to handle foundation;
S44, the anti-chain number rate of climb judge sub-step, detect the rate of climb of website anti-chain number and are compared, when detection net
When the anti-chain number rate of climb of standing is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
S45, outer chain growth speed judge sub-step, detect the growth rate of website exterior chain and are compared, when outside detection website
When chain growth speed is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
S46, first content compare sub-step, compare and anchor file and link content of pages, when anchor file with link content of pages without
Guan Shi carries out drop power operation to exterior chain;
S47, secondary content compare sub-step, the comparison website anti-chain page and link content of pages, when the website anti-chain page and chain
Connect content of pages it is unrelated when, to exterior chain carry out drop power operation;
Link analysis sub-step outside S48, website, detection compare website exterior chain content, and it is identical anti-to obtain existing link in website
Chain quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis sub-step in S49, website detects and compares website url linked contents, between acquisition url link similar websites mutually
The quantity of link carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
9. search engine link analysis method according to claim 8, it is characterised in that:The Anchor Text is link place
Contextual information.
10. search engine link analysis method according to claim 6, it is characterised in that:The client feedback step
Including setting App clients or Web client.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810431864.7A CN108804540B (en) | 2018-05-08 | 2018-05-08 | Search engine link analysis system and analysis method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810431864.7A CN108804540B (en) | 2018-05-08 | 2018-05-08 | Search engine link analysis system and analysis method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108804540A true CN108804540A (en) | 2018-11-13 |
CN108804540B CN108804540B (en) | 2020-12-22 |
Family
ID=64091926
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810431864.7A Active CN108804540B (en) | 2018-05-08 | 2018-05-08 | Search engine link analysis system and analysis method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108804540B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090138464A1 (en) * | 2007-11-28 | 2009-05-28 | James Paul Schneider | Method for removing network effects from search engine results |
CN102663054A (en) * | 2012-03-29 | 2012-09-12 | 奇智软件(北京)有限公司 | Method and device for determining weight of website |
CN103425691A (en) * | 2012-05-22 | 2013-12-04 | 阿里巴巴集团控股有限公司 | Search method and search system |
CN103714149A (en) * | 2013-12-26 | 2014-04-09 | 华中科技大学 | Self-adaptive incremental deep web data source discovery method |
CN104199830A (en) * | 2014-07-31 | 2014-12-10 | 渠成 | Search engine optimization big data management platform |
CN105468729A (en) * | 2015-11-23 | 2016-04-06 | 深圳大粤网络视界有限公司 | Internet mobile vertical search engine |
-
2018
- 2018-05-08 CN CN201810431864.7A patent/CN108804540B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090138464A1 (en) * | 2007-11-28 | 2009-05-28 | James Paul Schneider | Method for removing network effects from search engine results |
CN102663054A (en) * | 2012-03-29 | 2012-09-12 | 奇智软件(北京)有限公司 | Method and device for determining weight of website |
CN103425691A (en) * | 2012-05-22 | 2013-12-04 | 阿里巴巴集团控股有限公司 | Search method and search system |
CN103714149A (en) * | 2013-12-26 | 2014-04-09 | 华中科技大学 | Self-adaptive incremental deep web data source discovery method |
CN104199830A (en) * | 2014-07-31 | 2014-12-10 | 渠成 | Search engine optimization big data management platform |
CN105468729A (en) * | 2015-11-23 | 2016-04-06 | 深圳大粤网络视界有限公司 | Internet mobile vertical search engine |
Non-Patent Citations (1)
Title |
---|
冯亚飞: "基于社区发现的搜索引擎反作弊方法", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Also Published As
Publication number | Publication date |
---|---|
CN108804540B (en) | 2020-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Cooley et al. | Data preparation for mining world wide web browsing patterns | |
CN102710646B (en) | Method and system for collecting phishing websites | |
US8972412B1 (en) | Predicting improvement in website search engine rankings based upon website linking relationships | |
CN102567407B (en) | Method and system for collecting forum reply increment | |
Baeza-Yates et al. | Crawling the infinite Web: five levels are enough | |
CN110019689A (en) | Position matching process and position matching system | |
CN105260469B (en) | A kind of method, apparatus and equipment for handling site maps | |
CN104615627B (en) | A kind of event public feelings information extracting method and system based on microblog | |
WO2021114454A1 (en) | Method and apparatus for detecting crawler request | |
CN108429721A (en) | A kind of recognition methods of web crawlers and device | |
CN104182412A (en) | Webpage crawling method and webpage crawling system | |
CN106202232A (en) | A kind of analysis method and device of power-off event | |
CN106156230A (en) | A kind of method and device generating interior chain | |
CN106126688A (en) | Based on WEB content and the intelligent network information acquisition system of structure excavation, method | |
CN104077293A (en) | Webpage acquisition method and device | |
CN105824880A (en) | Webpage grasping method and device | |
CN103279492B (en) | A kind of method and apparatus capturing webpage | |
CN113656673A (en) | Master-slave distributed content crawling robot for advertisement delivery | |
CN102024042B (en) | Method, device and system for monitoring picture showing effect | |
CN114139048A (en) | Tracking method for user behavior data and page data | |
Poornalatha et al. | Web page prediction by clustering and integrated distance measure | |
CN108804540A (en) | search engine link analysis system and analysis method | |
Ali et al. | An integrated framework for web data preprocessing towards modeling user behavior | |
CN110263283A (en) | Website detection method and device | |
KR101556714B1 (en) | Method, system and computer readable recording medium for providing search results |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: Room 1101, building 1, Rongsheng business center, 135 wangdun Road, Suzhou Industrial Park, Jiangsu Province Applicant after: SUZHOU WENDAO NETWORK TECHNOLOGY Co.,Ltd. Address before: 215123 E-1804 388, Shui Shui Road, Suzhou Industrial Park, Jiangsu. Applicant before: SUZHOU WENDAO NETWORK TECHNOLOGY Co.,Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |