CN108804540A - search engine link analysis system and analysis method - Google Patents

search engine link analysis system and analysis method Download PDF

Info

Publication number
CN108804540A
CN108804540A CN201810431864.7A CN201810431864A CN108804540A CN 108804540 A CN108804540 A CN 108804540A CN 201810431864 A CN201810431864 A CN 201810431864A CN 108804540 A CN108804540 A CN 108804540A
Authority
CN
China
Prior art keywords
website
chain
information
link
data information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810431864.7A
Other languages
Chinese (zh)
Other versions
CN108804540B (en
Inventor
袁学文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Wen Dao Network Polytron Technologies Inc
Original Assignee
Suzhou Wen Dao Network Polytron Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Wen Dao Network Polytron Technologies Inc filed Critical Suzhou Wen Dao Network Polytron Technologies Inc
Priority to CN201810431864.7A priority Critical patent/CN108804540B/en
Publication of CN108804540A publication Critical patent/CN108804540A/en
Application granted granted Critical
Publication of CN108804540B publication Critical patent/CN108804540B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1464Management of the backup or restore process for networked environments

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Present invention is disclosed a kind of search engine link analysis system and analysis method, system includes internet cloud platform unit, information scratching unit, information memory cell, information operation processing unit and client feedback unit;Method includes internet cloud platform step, information scratching step, information storing step member, information operation processing step and client feedback step.The present invention judges the reliability in search result source, and in this, as foundation, redistributed to the weight in search result analytic process, to improve the accuracy and reliability of search result by the confirmation to searching for information source.Meanwhile the present invention can monitor the keyword ranking in all kinds of websites in real time, reduce the influence of all kinds of bursts, abnormal conditions to search result, avoid the adverse effect that artificial malicious link brings network search engines.

Description

Search engine link analysis system and analysis method
Technical field
The present invention relates to a kind of analysis system and analysis methods, and in particular to a kind of search engine link analysis system and point Analysis method, belongs to field of Internet search.
Background technology
With the universal of internet, the continuous development of network search engines, people increasingly incline when consulting various information Use search engine, network search engines utilization rate in people's daily life and popularity rate also higher and higher in selection.
Also just because of such development trend, the ranking system of network search engines also comes into being.In general, net The ranking system of network search engine can be automatic to tie according to information such as the click volumes of keyword in the volumes of searches of keyword, website The ranking of search result is calculated, and is presented to user in the form of from high to low.
But in actual application process, technical staff has found, current existing search engine ranking system is easy to Influenced by the malice of all kinds of illegal websites, informal forum etc. in network, be especially embodied in blog group, forum mass-sending with And several aspects such as station group.For station group and blog group, can in a short time it be copied by modes such as replication links greatly The keyword of amount, and for forum mass-sends, it can also be big for keyword manufacture by way of voting to target keyword The click volume of amount.Both above-mentioned ways all can generate malice to the ranking system of search engine to be influenced, and search result is caused Accuracy substantially reduced with reliability.
In conclusion how to provide a kind of search engine link analysis system and analysis method, drawn with improving web search The accuracy of search result is held up, those skilled in that art institute urgent problem to be solved is just become.
Invention content
In view of the prior art there are drawbacks described above, the purpose of the present invention is to propose to a kind of search engine link analysis system and Analysis method.
The purpose of the present invention will be achieved by the following technical programs:
A kind of search engine link analysis system, including:
Internet cloud platform unit obtains the data information in WWW for establishing data connection with WWW;
Information scratching unit obtains data information for the operation requests according to user in WWW, and to data information into Row is reprinted and is issued;
Information memory cell, the data information grabbed for receiving information scratching unit, and data information is stored Backup;
Information operation processing unit, the operation requests for receiving user, and obtained in information memory cell according to operation requests Data information is taken, and carries out calculation process;
Client feedback unit, the operation requests for keying in user, and the handling result of information operation processing unit is fed back To user.
Preferably, described information placement unit includes:
Crawler server, for capturing data information in WWW;
Website server, the operation requests for receiving user complete data information crawl according to operation and control crawler server, And the data information grabbed reprinting is issued.
Preferably, described information operation processing unit includes:
Network segment enquiry module, for the network segment belonging to query web IP;
Inquiry of the domain name module is used for nslookup IP and domain name owner information;
Threshold setting module, for anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical link amount threshold to be arranged And the amount threshold that interlinks, handle foundation as judgement;
Anti-chain number rate of climb judgment module, the rate of climb for detecting website anti-chain number are simultaneously compared, when detection website When the anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
Outer chain growth speed judgment module, growth rate for detecting website exterior chain are simultaneously compared, when detection website exterior chain When growth rate is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
First content comparing module, for comparing anchor file and linking content of pages, when anchor file is unrelated with content of pages is linked When, drop power operation is carried out to exterior chain;
Secondary content comparing module, for comparing the website anti-chain page and link content of pages, when the website anti-chain page and link When content of pages is unrelated, drop power operation is carried out to exterior chain;
Website exterior chain analysis module compares website exterior chain content for detecting, existing in acquisition website to link identical anti-chain Quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis module in website compares website url linked contents for detecting, and obtains mutual chain between url link similar websites The quantity connect carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
Preferably, the Anchor Text is the contextual information where link.
Preferably, the client feedback unit includes App clients or Web client.
A kind of search engine link analysis method, including:
S1, internet cloud platform step establish data connection with WWW, obtain the data information in WWW;
S2, information scratching step, the operation requests according to user obtain data information in WWW, and are carried out to data information Reprinting issues;
S3, information storing step receive the data information that information scratching unit has grabbed, and to data information store standby Part;
S4, information operation processing step receive the operation requests of user, and are obtained in information memory cell according to operation requests Data information, and carry out calculation process;
S5, client feedback step key in the operation requests of user, and the handling result of information operation processing unit are fed back to User.
Preferably, described information crawl step includes:
Sub-step is arranged in S21, crawler server, and crawler server is arranged, data information is captured in WWW;
Sub-step is arranged in S22, Website server, and Website server is arranged, and receives the operation requests of user, is climbed according to operation and control Worm server completes data information crawl, and the data information grabbed reprinting is issued.
Preferably, described information operation processing step includes:
S41, the network segment inquire sub-step, the network segment belonging to query web IP;
S42, inquiry of the domain name sub-step, nslookup IP and domain name owner information;
S43, threshold value set sub-step, setting anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical number of links threshold It is worth and the amount threshold that interlinks, as judging to handle foundation;
S44, the anti-chain number rate of climb judge sub-step, detect the rate of climb of website anti-chain number and are compared, when detection net When the anti-chain number rate of climb of standing is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
S45, outer chain growth speed judge sub-step, detect the growth rate of website exterior chain and are compared, when outside detection website When chain growth speed is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
S46, first content compare sub-step, compare and anchor file and link content of pages, when anchor file with link content of pages without Guan Shi carries out drop power operation to exterior chain;
S47, secondary content compare sub-step, the comparison website anti-chain page and link content of pages, when the website anti-chain page and chain Connect content of pages it is unrelated when, to exterior chain carry out drop power operation;
Link analysis sub-step outside S48, website, detection compare website exterior chain content, and it is identical anti-to obtain existing link in website Chain quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis sub-step in S49, website detects and compares website url linked contents, between acquisition url link similar websites mutually The quantity of link carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
Preferably, the Anchor Text is the contextual information where link.
Preferably, the client feedback step includes setting App clients or Web client.
Compared in the prior art, protrusion effect of the invention is as follows:
The present invention judges the reliability in search result source by the multiple confirmation to searching for information source, and in this, as foundation, Weight in search result analytic process is redistributed, to improve the accuracy and reliability of search result.
Meanwhile the present invention can monitor the keyword ranking in all kinds of websites in real time, reduce all kinds of bursts, abnormal conditions Influence to search result avoids the adverse effect that artificial malicious link brings network search engines.
In addition, the analysis system and analysis method of the present invention can also be applied in the system of all kinds of close functions, it is each Arithmetic processing system of the class based on internet big data provides reliable information source, applicability and versatile.
In conclusion the present invention provides effective link analysis system and analysis method, using effect it is good and It is compatible strong, there is very high use and promotional value.
Just attached drawing in conjunction with the embodiments below, the embodiment of the present invention is described in further detail, so that of the invention Technical solution is more readily understood, grasps.
Description of the drawings
Fig. 1 is the structure diagram of analysis system in the present invention.
Specific implementation mode
As shown, present invention is disclosed a kind of search engine link analysis system and analysis methods.
Specifically, a kind of search engine link analysis system, including:
Internet cloud platform unit obtains the data information in WWW for establishing data connection with WWW.
Information scratching unit obtains data information for the operation requests according to user in WWW, and logarithm it is believed that Breath reprint and is issued.
Information memory cell, the data information grabbed for receiving information scratching unit, and data information is carried out Storage backup.In the present embodiment, described information storage unit is Elasticsearch databases.
Information operation processing unit, the operation requests for receiving user, and according to operation requests in information memory cell Interior acquisition data information, and carry out calculation process.
Client feedback unit, the operation requests for keying in user, and by the handling result of information operation processing unit Feed back to user.
Described information placement unit includes:
More crawler servers, for capturing data information in WWW.
An at least Website server, the operation requests for receiving user are completed according to operation and control crawler server Data information captures, and the data information grabbed reprinting is issued.
Described information operation processing unit includes:
Network segment enquiry module, for the network segment belonging to query web IP.
Inquiry of the domain name module is used for nslookup IP and domain name owner information.
Threshold setting module, for anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical number of links to be arranged Threshold value and the amount threshold that interlinks handle foundation as judgement.
Anti-chain number rate of climb judgment module, the rate of climb for detecting website anti-chain number are simultaneously compared, and work as detection When the website anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled.
Outer chain growth speed judgment module, growth rate for detecting website exterior chain are simultaneously compared, when detection website When outer chain growth speed is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website.
First content comparing module, for comparing anchor file and linking content of pages, when anchor file with link content of pages When unrelated, drop power operation is carried out to exterior chain.
Secondary content comparing module, for comparing the website anti-chain page and link content of pages, when the website anti-chain page and When link content of pages is unrelated, drop power operation is carried out to exterior chain.
Website exterior chain analysis module compares website exterior chain content for detecting, and it is identical to obtain existing link in website Anti-chain quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain.
Link analysis module in website compares website url linked contents for detecting, and obtains phase between url link similar websites The quantity mutually linked carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
The Anchor Text is the contextual information where link.
The client feedback unit includes App clients or Web client.
Present invention further teaches a kind of search engine link analysis methods, including:
S1, internet cloud platform step establish data connection with WWW, obtain the data information in WWW.
S2, information scratching step, the operation requests according to user obtain data information in WWW, and to data information Reprint and issues.
S3, information storing step receive the data information that information scratching unit has grabbed, and are deposited to data information Lay in part.
S4, information operation processing step, receive the operation requests of user, and according to operation requests in information memory cell Data information is obtained, and carries out calculation process.
S5, client feedback step key in the operation requests of user, and the handling result of information operation processing unit is anti- Feed user.
Described information crawl step includes:
Sub-step is arranged in S21, crawler server, and crawler server is arranged, data information is captured in WWW.
Sub-step is arranged in S22, Website server, and Website server is arranged, and receives the operation requests of user, according to operation control Crawler server processed completes data information crawl, and the data information grabbed reprinting is issued.
Described information operation processing step includes:
S41, the network segment inquire sub-step, the network segment belonging to query web IP.
S42, inquiry of the domain name sub-step, nslookup IP and domain name owner information.
S43, threshold value set sub-step, setting anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical link number Threshold value and the amount threshold that interlinks are measured, foundation is handled as judgement.
S44, the anti-chain number rate of climb judge sub-step, detect the rate of climb of website anti-chain number and are compared, work as inspection When the survey grid station anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, website is carried out at the processing of drop power or emphasis monitoring Reason.
S45, outer chain growth speed judge sub-step, detect the growth rate of website exterior chain and are compared, when detection net When outer chain growth speed of standing is more than outer chain growth speed threshold value, it is believed that be there are a large amount of hair chain advertisements to cause, at this time to website Exterior chain carry out drop power operation.
S46, first content compare sub-step, compare and anchor file and link content of pages, when anchor file with link in the page When holding unrelated, drop power operation is carried out to exterior chain.
S47, secondary content compare sub-step, the comparison website anti-chain page and link content of pages, when the website anti-chain page When unrelated with link content of pages, drop power operation is carried out to exterior chain.
Link analysis sub-step outside S48, website, detection compare website exterior chain content, and it is identical to obtain existing link in website Anti-chain quantity, when linking identical anti-chain quantity and being more than identical link amount threshold, it is believed that be because of forum, blog group Hair causes, and carries out drop power operation to website or exterior chain at this time.
Link analysis sub-step in S49, website, detection compare website url linked contents, and acquisition url is linked between similar website The quantity to interlink, when the quantity to interlink, which is more than, interlinks amount threshold, it is believed that led because establishing station group It causes, drop power operation is carried out to website or exterior chain at this time.
The Anchor Text is the contextual information where link.
The client feedback step includes setting App clients or Web client.
The present invention judges the reliability in search result source by the multiple confirmation to searching for information source, and in this, as Foundation redistributes the weight in search result analytic process, to improve the accuracy and reliability of search result.
Meanwhile the present invention can monitor the keyword ranking in all kinds of websites in real time, reduce all kinds of bursts, abnormal conditions Influence to search result avoids the adverse effect that artificial malicious link brings network search engines.
In addition, the analysis system and analysis method of the present invention can also be applied in the system of all kinds of close functions, it is each Arithmetic processing system of the class based on internet big data provides reliable information source, applicability and versatile.
In conclusion the present invention provides effective link analysis system and analysis method, using effect it is good and It is compatible strong, there is very high use and promotional value.
It is obvious to a person skilled in the art that invention is not limited to the details of the above exemplary embodiments, Er Qie In the case of without departing substantially from spirit and essential characteristics of the invention, the present invention can be realized in other specific forms.Therefore, no matter From the point of view of which point, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the present invention is by appended power Profit requires rather than above description limits, it is intended that all by what is fallen within the meaning and scope of the equivalent requirements of the claims Variation is included within the present invention, and should not be considered as the note of any attached drawing table in claim and be limited the claims involved.
In addition, it should be understood that although this specification is described in terms of embodiments, but not each embodiment is only wrapped Containing an independent technical solution, this description of the specification is merely for the sake of clarity, and those skilled in the art should It considers the specification as a whole, the technical solutions in the various embodiments may also be suitably combined, forms those skilled in the art The other embodiment being appreciated that.

Claims (10)

1. a kind of search engine link analysis system, including:
Internet cloud platform unit obtains the data information in WWW for establishing data connection with WWW;
Information scratching unit obtains data information for the operation requests according to user in WWW, and to data information into Row is reprinted and is issued;
Information memory cell, the data information grabbed for receiving information scratching unit, and data information is stored Backup;
Information operation processing unit, the operation requests for receiving user, and obtained in information memory cell according to operation requests Data information is taken, and carries out calculation process;
Client feedback unit, the operation requests for keying in user, and the handling result of information operation processing unit is fed back To user.
2. search engine link analysis system according to claim 1, which is characterized in that described information placement unit packet It includes:
Crawler server, for capturing data information in WWW;
Website server, the operation requests for receiving user complete data information crawl according to operation and control crawler server, And the data information grabbed reprinting is issued.
3. search engine link analysis system according to claim 1, which is characterized in that described information operation processing unit Including:
Network segment enquiry module, for the network segment belonging to query web IP;
Inquiry of the domain name module is used for nslookup IP and domain name owner information;
Threshold setting module, for anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical link amount threshold to be arranged And the amount threshold that interlinks, handle foundation as judgement;
Anti-chain number rate of climb judgment module, the rate of climb for detecting website anti-chain number are simultaneously compared, when detection website When the anti-chain number rate of climb is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
Outer chain growth speed judgment module, growth rate for detecting website exterior chain are simultaneously compared, when detection website exterior chain When growth rate is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
First content comparing module, for comparing anchor file and linking content of pages, when anchor file is unrelated with content of pages is linked When, drop power operation is carried out to exterior chain;
Secondary content comparing module, for comparing the website anti-chain page and link content of pages, when the website anti-chain page and link When content of pages is unrelated, drop power operation is carried out to exterior chain;
Website exterior chain analysis module compares website exterior chain content for detecting, existing in acquisition website to link identical anti-chain Quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis module in website compares website url linked contents for detecting, and obtains mutual chain between url link similar websites The quantity connect carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
4. search engine link analysis system according to claim 3, it is characterised in that:The Anchor Text is link place Contextual information.
5. search engine link analysis system according to claim 1, which is characterized in that the client feedback unit packet Include App clients or Web client.
6. a kind of search engine link analysis method, including:
S1, internet cloud platform step establish data connection with WWW, obtain the data information in WWW;
S2, information scratching step, the operation requests according to user obtain data information in WWW, and are carried out to data information Reprinting issues;
S3, information storing step receive the data information that information scratching unit has grabbed, and to data information store standby Part;
S4, information operation processing step receive the operation requests of user, and are obtained in information memory cell according to operation requests Data information, and carry out calculation process;
S5, client feedback step key in the operation requests of user, and the handling result of information operation processing unit are fed back to User.
7. search engine link analysis method according to claim 6, which is characterized in that described information crawl step packet It includes:
Sub-step is arranged in S21, crawler server, and crawler server is arranged, data information is captured in WWW;
Sub-step is arranged in S22, Website server, and Website server is arranged, and receives the operation requests of user, is climbed according to operation and control Worm server completes data information crawl, and the data information grabbed reprinting is issued.
8. search engine link analysis method according to claim 6, which is characterized in that described information operation processing step Including:
S41, the network segment inquire sub-step, the network segment belonging to query web IP;
S42, inquiry of the domain name sub-step, nslookup IP and domain name owner information;
S43, threshold value set sub-step, setting anti-chain number rate of climb threshold value, outer chain growth speed threshold value, identical number of links threshold It is worth and the amount threshold that interlinks, as judging to handle foundation;
S44, the anti-chain number rate of climb judge sub-step, detect the rate of climb of website anti-chain number and are compared, when detection net When the anti-chain number rate of climb of standing is more than anti-chain number rate of climb threshold value, the processing of drop power is carried out to website or emphasis monitoring is handled;
S45, outer chain growth speed judge sub-step, detect the growth rate of website exterior chain and are compared, when outside detection website When chain growth speed is more than outer chain growth speed threshold value, drop power operation is carried out to the exterior chain of website;
S46, first content compare sub-step, compare and anchor file and link content of pages, when anchor file with link content of pages without Guan Shi carries out drop power operation to exterior chain;
S47, secondary content compare sub-step, the comparison website anti-chain page and link content of pages, when the website anti-chain page and chain Connect content of pages it is unrelated when, to exterior chain carry out drop power operation;
Link analysis sub-step outside S48, website, detection compare website exterior chain content, and it is identical anti-to obtain existing link in website Chain quantity carries out drop power operation when linking identical anti-chain quantity more than identical link amount threshold to website or exterior chain;
Link analysis sub-step in S49, website detects and compares website url linked contents, between acquisition url link similar websites mutually The quantity of link carries out drop power operation when the quantity to interlink, which is more than, interlinks amount threshold to website or exterior chain.
9. search engine link analysis method according to claim 8, it is characterised in that:The Anchor Text is link place Contextual information.
10. search engine link analysis method according to claim 6, it is characterised in that:The client feedback step Including setting App clients or Web client.
CN201810431864.7A 2018-05-08 2018-05-08 Search engine link analysis system and analysis method Active CN108804540B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810431864.7A CN108804540B (en) 2018-05-08 2018-05-08 Search engine link analysis system and analysis method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810431864.7A CN108804540B (en) 2018-05-08 2018-05-08 Search engine link analysis system and analysis method

Publications (2)

Publication Number Publication Date
CN108804540A true CN108804540A (en) 2018-11-13
CN108804540B CN108804540B (en) 2020-12-22

Family

ID=64091926

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810431864.7A Active CN108804540B (en) 2018-05-08 2018-05-08 Search engine link analysis system and analysis method

Country Status (1)

Country Link
CN (1) CN108804540B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090138464A1 (en) * 2007-11-28 2009-05-28 James Paul Schneider Method for removing network effects from search engine results
CN102663054A (en) * 2012-03-29 2012-09-12 奇智软件(北京)有限公司 Method and device for determining weight of website
CN103425691A (en) * 2012-05-22 2013-12-04 阿里巴巴集团控股有限公司 Search method and search system
CN103714149A (en) * 2013-12-26 2014-04-09 华中科技大学 Self-adaptive incremental deep web data source discovery method
CN104199830A (en) * 2014-07-31 2014-12-10 渠成 Search engine optimization big data management platform
CN105468729A (en) * 2015-11-23 2016-04-06 深圳大粤网络视界有限公司 Internet mobile vertical search engine

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090138464A1 (en) * 2007-11-28 2009-05-28 James Paul Schneider Method for removing network effects from search engine results
CN102663054A (en) * 2012-03-29 2012-09-12 奇智软件(北京)有限公司 Method and device for determining weight of website
CN103425691A (en) * 2012-05-22 2013-12-04 阿里巴巴集团控股有限公司 Search method and search system
CN103714149A (en) * 2013-12-26 2014-04-09 华中科技大学 Self-adaptive incremental deep web data source discovery method
CN104199830A (en) * 2014-07-31 2014-12-10 渠成 Search engine optimization big data management platform
CN105468729A (en) * 2015-11-23 2016-04-06 深圳大粤网络视界有限公司 Internet mobile vertical search engine

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
冯亚飞: "基于社区发现的搜索引擎反作弊方法", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Also Published As

Publication number Publication date
CN108804540B (en) 2020-12-22

Similar Documents

Publication Publication Date Title
Cooley et al. Data preparation for mining world wide web browsing patterns
CN102710646B (en) Method and system for collecting phishing websites
US8972412B1 (en) Predicting improvement in website search engine rankings based upon website linking relationships
CN102567407B (en) Method and system for collecting forum reply increment
Baeza-Yates et al. Crawling the infinite Web: five levels are enough
CN110019689A (en) Position matching process and position matching system
CN105260469B (en) A kind of method, apparatus and equipment for handling site maps
CN104615627B (en) A kind of event public feelings information extracting method and system based on microblog
WO2021114454A1 (en) Method and apparatus for detecting crawler request
CN108429721A (en) A kind of recognition methods of web crawlers and device
CN104182412A (en) Webpage crawling method and webpage crawling system
CN106202232A (en) A kind of analysis method and device of power-off event
CN106156230A (en) A kind of method and device generating interior chain
CN106126688A (en) Based on WEB content and the intelligent network information acquisition system of structure excavation, method
CN104077293A (en) Webpage acquisition method and device
CN105824880A (en) Webpage grasping method and device
CN103279492B (en) A kind of method and apparatus capturing webpage
CN113656673A (en) Master-slave distributed content crawling robot for advertisement delivery
CN102024042B (en) Method, device and system for monitoring picture showing effect
CN114139048A (en) Tracking method for user behavior data and page data
Poornalatha et al. Web page prediction by clustering and integrated distance measure
CN108804540A (en) search engine link analysis system and analysis method
Ali et al. An integrated framework for web data preprocessing towards modeling user behavior
CN110263283A (en) Website detection method and device
KR101556714B1 (en) Method, system and computer readable recording medium for providing search results

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: Room 1101, building 1, Rongsheng business center, 135 wangdun Road, Suzhou Industrial Park, Jiangsu Province

Applicant after: SUZHOU WENDAO NETWORK TECHNOLOGY Co.,Ltd.

Address before: 215123 E-1804 388, Shui Shui Road, Suzhou Industrial Park, Jiangsu.

Applicant before: SUZHOU WENDAO NETWORK TECHNOLOGY Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant