CN106339372A - Search engine optimization method and device - Google Patents

Search engine optimization method and device

Info

Publication number
CN106339372A
Authority
CN
China
Prior art keywords
search engine
analysis result
address
log analysis
daily record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510390418.2A
Other languages
Chinese (zh)
Other versions
CN106339372B (en
Inventor
叶磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201510390418.2A priority Critical patent/CN106339372B/en
Publication of CN106339372A publication Critical patent/CN106339372A/en
Application granted granted Critical
Publication of CN106339372B publication Critical patent/CN106339372B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/95: Retrieval from the web
    • G06F16/951: Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a search engine optimization method and device. The method comprises the following steps: obtaining a log generated when a search engine crawler crawls website content; parsing the log according to a configured log format to obtain the log data contained in the log; obtaining a log analysis result under a selected dimension according to the log data, wherein the selected dimension is one of, or a combination of, the dimensions of the data included in the log data; and performing search engine optimization (SEO) on the website according to the log analysis result. By parsing the log generated when the search engine crawler crawls website content, the method obtains log data and, from it, a log analysis result under the selected dimension, so that SEO can be carried out on the website according to that result. This simplifies SEO operations, improves the usability of SEO, allows SEO problems arising during website update and maintenance to be found promptly, and improves SEO efficiency.

Description

Method and device for search engine optimization
Technical field
The present invention relates to the field of Internet technology, and in particular to a method and device for search engine optimization.
Background
In website search engine optimization (hereinafter: SEO), the crawlers of the various search engines crawl websites and generate a large volume of crawler behavior logs. SEO personnel must analyze these logs to obtain analysis results and then, drawing on their working experience, carry out SEO on the website.
However, analyzing the behavior logs of search engine crawlers requires SEO personnel with professional SEO knowledge. Many companies, teams, and individuals lack that knowledge, so they cannot analyze the crawler behavior logs and therefore cannot carry out SEO. In addition, different SEO personnel consider different dimensions, so omissions are likely. Moreover, in the prior art, SEO personnel must derive analysis results from a large number of logs and then carry out SEO by manual editing, which is complicated to operate and inefficient.
Summary of the invention
The present invention aims to solve, at least to some extent, one of the technical problems in the related art.
To this end, a first object of the present invention is to propose a method of search engine optimization. The method parses the log generated when a search engine crawler crawls website content to obtain log data, and then obtains a log analysis result under a selected dimension according to the log data, so that SEO can be carried out on the website according to the log analysis result. This simplifies SEO operations, improves the usability of SEO, allows SEO problems arising during website update and maintenance to be found promptly, and improves SEO efficiency.
A second object of the present invention is to propose a device for search engine optimization.
To achieve the above objects, the method of search engine optimization according to an embodiment of the first aspect of the present invention comprises: obtaining a log generated when a search engine crawler crawls website content; parsing the log according to a configured log format to obtain the log data contained in the log; obtaining a log analysis result under a selected dimension according to the log data, the selected dimension being one of, or a combination of, the dimensions of the data included in the log data; and performing search engine optimization on the website according to the log analysis result.
The method of search engine optimization of the embodiment of the present invention parses the log generated when a search engine crawler crawls website content to obtain log data, and then obtains a log analysis result under the selected dimension according to the log data, so that SEO can be carried out on the website according to the log analysis result. This simplifies SEO operations, improves the usability of SEO, allows SEO problems arising during website update and maintenance to be found promptly, and improves SEO efficiency.
To achieve the above objects, the device for search engine optimization according to an embodiment of the second aspect of the present invention comprises: an acquisition module, configured to obtain a log generated when a search engine crawler crawls website content; a parsing module, configured to parse the log according to a configured log format to obtain the log data contained in the log; an obtaining module, configured to obtain a log analysis result under a selected dimension according to the log data, the selected dimension being one of, or a combination of, the dimensions of the data included in the log data; and an optimization module, configured to perform search engine optimization on the website according to the log analysis result obtained by the obtaining module.
In the device for search engine optimization of the embodiment of the present invention, the parsing module parses the log generated when a search engine crawler crawls website content to obtain log data, and the obtaining module then obtains a log analysis result under the selected dimension according to the log data, so that the optimization module can carry out SEO on the website according to the log analysis result. This simplifies SEO operations, improves the usability of SEO, allows SEO problems arising during website update and maintenance to be found promptly, and improves SEO efficiency.
Additional aspects and advantages of the present invention will be set forth in part in the description that follows, and in part will become apparent from that description or be learned through practice of the present invention.
Brief description of the drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken in conjunction with the accompanying drawings, in which:
Fig. 1 is a flow chart of one embodiment of the method of search engine optimization of the present invention;
Fig. 2 is a schematic diagram of one embodiment of the log analysis result in the method of search engine optimization of the present invention;
Fig. 3 is a structural schematic diagram of one embodiment of the device for search engine optimization of the present invention.
Detailed description
Embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which identical or similar reference numerals denote, throughout, identical or similar elements or elements having identical or similar functions. The embodiments described below with reference to the drawings are exemplary; they serve only to explain the present invention and are not to be construed as limiting it. On the contrary, the embodiments of the present invention include all changes, modifications, and equivalents falling within the spirit and scope of the appended claims.
Fig. 1 is a flow chart of one embodiment of the method of search engine optimization of the present invention. As shown in Fig. 1, the method of search engine optimization may include:
Step 101: obtain the log generated when a search engine crawler crawls website content.
Step 102: parse the log according to the configured log format to obtain the log data contained in the log.
The present embodiment supports multiple log formats, for example: the Common Log Format (hereinafter: CLF), the common log file format of the National Center for Supercomputing Applications (hereinafter: NCSA), the Combined Log Format, and custom log formats.
After the log format is configured, the log can be parsed according to the configured format to obtain the log data it contains. A log usually contains the data shown in Table 1 below, which are also the data indispensable for SEO.
Table 1
As can be seen from Table 1, the log data include data of at least one of the following dimensions: the IP address of the search engine crawler, the time at which the search engine crawler crawled the website content, the request method of the search engine crawler, the request address of the search engine crawler, the status code of the website response, and the User-Agent (UA) of the search engine crawler.
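As an illustration of step 102, the following is a minimal sketch of parsing one line in the Combined Log Format into the dimensions listed above. The patent does not disclose a concrete parser; the regular expression, the function name `parse_line`, the field names, and the sample line (including its IP address and timestamp) are assumptions made for this example. A different configured log format would be handled by supplying a different pattern.

```python
import re
from datetime import datetime
from typing import Optional

# Assumed pattern for the Combined Log Format (CLF plus referer and User-Agent).
COMBINED_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+)(?: [^"]*)?" '
    r'(?P<status>\d{3}) \S+ "(?P<referer>[^"]*)" "(?P<ua>[^"]*)"'
)

def parse_line(line: str, pattern: re.Pattern = COMBINED_PATTERN) -> Optional[dict]:
    """Parse one log line into named fields; return None if the line does not match."""
    match = pattern.match(line)
    if match is None:
        return None
    record = match.groupdict()
    # Convert the timestamp and status code into typed values for later analysis.
    record["time"] = datetime.strptime(record["time"], "%d/%b/%Y:%H:%M:%S %z")
    record["status"] = int(record["status"])
    return record

# Hypothetical sample line; only the User-Agent string is taken from the text.
sample = (
    '203.0.113.7 - - [06/Jul/2015:10:15:32 +0800] "GET /index.html HTTP/1.1" '
    '200 5120 "-" '
    '"Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"'
)

if __name__ == "__main__":
    print(parse_line(sample))
```

Keeping the pattern as a parameter is one simple way to model a "configured" log format: a custom format only needs its own regular expression with the same group names.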
Step 103: obtain the log analysis result under the selected dimension according to the log data, the selected dimension being one of, or a combination of, the dimensions of the data included in the log data.
Specifically, the behavior of a search engine crawler when crawling website content can be determined from the log data. In Table 1, the crawler UA "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)" appears twice; it is abbreviated here as "Baiduspider". The logs generated by each search engine crawler can be separated out of the log data by crawler UA. For example, the crawler UA "Baiduspider" identifies crawler logs of the Baidu search engine, so the Baidu crawler logs can be filtered out by this UA. These show how the Baidu crawler accessed the various web resources: the time of crawling, the request method, the request address, the status code, and so on. The log analysis result under the selected dimension can thus be obtained from the log data, as shown in Fig. 2, which is a schematic diagram of one embodiment of the log analysis result in the method of search engine optimization of the present invention.
In Fig. 2, 3.1 is the filter for the crawler UA dimension, 3.2 is the filter for the time dimension, and 3.3 is the filter for the status code. In the present embodiment, a log analysis result for a single dimension can be obtained, and a log analysis result for a combination of dimensions can also be obtained.
3.4 is the log analysis result for the Baidu crawler; it shows the crawl count of each request path and the total, together with optimization suggestions for paths at each level.
3.5 is the more detailed log analysis information under the selected request path; it shows, for example, the crawl count of the request address "/index.html", the status code of the website response, and optimization suggestions for each request address.
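The filtering and aggregation of step 103 can be sketched as follows. This is an illustrative sketch only: the record layout (dictionaries with `ua`, `url`, and `status` keys), the marker table `CRAWLER_MARKERS`, and the function names are assumptions, and the Googlebot entry is included only to show a second crawler.

```python
from collections import Counter

# Assumed mapping from a lowercase UA substring to a short crawler name.
CRAWLER_MARKERS = {"baiduspider": "Baiduspider", "googlebot": "Googlebot"}

def crawler_name(ua):
    """Return the short crawler name for a User-Agent string, or None if unknown."""
    lowered = ua.lower()
    for marker, name in CRAWLER_MARKERS.items():
        if marker in lowered:
            return name
    return None

def analyze(records, dimensions, crawler=None):
    """Count records grouped by the selected dimension(s), optionally for one crawler."""
    counts = Counter()
    for record in records:
        if crawler is not None and crawler_name(record["ua"]) != crawler:
            continue
        # A single dimension or a combination both become a tuple key.
        counts[tuple(record[d] for d in dimensions)] += 1
    return counts

BAIDU_UA = "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"
GOOGLE_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

# Hypothetical parsed records.
records = [
    {"ua": BAIDU_UA, "url": "/index.html", "status": 200},
    {"ua": BAIDU_UA, "url": "/index.html", "status": 200},
    {"ua": BAIDU_UA, "url": "/old.html", "status": 404},
    {"ua": GOOGLE_UA, "url": "/index.html", "status": 200},
]

if __name__ == "__main__":
    print(analyze(records, ["url"], crawler="Baiduspider"))
    print(analyze(records, ["url", "status"]))
```

Passing `["url"]` gives a single-dimension result comparable to the per-path crawl counts in 3.4, while `["url", "status"]` gives a combined-dimension result.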
Step 104: perform search engine optimization on the website according to the log analysis result.
In one implementation of the present embodiment, performing search engine optimization on the website according to the log analysis result may be: providing optimization suggestions for the website according to the log analysis result, and carrying out SEO on the website according to those suggestions.
As described above, 3.4 and 3.5 in Fig. 2 give optimization suggestions for the website, so SEO can be carried out on the website according to those suggestions.
In another implementation of the present embodiment, performing search engine optimization on the website according to the log analysis result may be: judging whether the log analysis result triggers an automatic optimization condition and, if so, carrying out automatic SEO on the website.
The automatic optimization condition may include automatically forbidding the crawling of addresses that carry dynamic parameters. In that case, the log analysis result triggering the automatic optimization condition may be: a request address of the search engine crawler in the log analysis result is an address carrying dynamic parameters. Carrying out automatic search engine optimization on the website may then be: automatically adding the address to the robots exclusion standard (robots.txt) file, so as to forbid crawling of that address.
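A minimal sketch of this condition follows. The text does not define how a dynamic parameter is detected or how the rule is written, so two assumptions are made here: an address is treated as carrying dynamic parameters when its URL has a non-empty query string, and the rule added is a prefix `Disallow` on the path followed by `?`. The function names and the sample addresses are hypothetical.

```python
from urllib.parse import urlsplit

def has_dynamic_params(url):
    """Assumption: an address carries dynamic parameters if it has a query string."""
    return bool(urlsplit(url).query)

def disallow_rules(request_urls):
    """Build one Disallow rule per distinct path that was requested with parameters."""
    paths = sorted({urlsplit(u).path for u in request_urls if has_dynamic_params(u)})
    return [f"Disallow: {path}?" for path in paths]

def update_robots(existing, request_urls):
    """Return robots.txt text with any missing Disallow rules appended."""
    lines = existing.splitlines()
    if not any(line.strip().lower().startswith("user-agent:") for line in lines):
        lines.insert(0, "User-agent: *")
    for rule in disallow_rules(request_urls):
        if rule not in lines:  # avoid adding the same rule twice
            lines.append(rule)
    return "\n".join(lines) + "\n"

if __name__ == "__main__":
    current = "User-agent: *\nDisallow: /admin/\n"
    crawled = ["/list.php?page=2", "/index.html", "/list.php?page=3"]
    print(update_robots(current, crawled))
```

The sketch appends rules at the end of the file, so they fall under the last `User-agent` group; a file with several groups would need the rules placed in the intended group.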
The automatic optimization condition may include automatically exporting addresses with abnormal status codes. In that case, the log analysis result triggering the automatic optimization condition may be: the log analysis result includes request addresses with abnormal status codes. Carrying out automatic search engine optimization on the website may then be: automatically exporting the request addresses with abnormal status codes and providing them to SEO personnel for handling.
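The export step could look like the sketch below. The text does not say which status codes count as abnormal or what the export format is; this example assumes that any status code of 400 or above is abnormal and that the export is CSV text. The function names and sample records are hypothetical.

```python
import csv
import io

def abnormal_addresses(records, threshold=400):
    """Count hits per (url, status) for responses at or above the assumed threshold."""
    hits = {}
    for record in records:
        if record["status"] >= threshold:
            key = (record["url"], record["status"])
            hits[key] = hits.get(key, 0) + 1
    return sorted(hits.items())

def export_csv(records, threshold=400):
    """Render the abnormal request addresses as CSV text for SEO personnel."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["url", "status", "hits"])
    for (url, status), count in abnormal_addresses(records, threshold):
        writer.writerow([url, status, count])
    return buffer.getvalue()

if __name__ == "__main__":
    # Hypothetical parsed records.
    demo = [
        {"url": "/a", "status": 200},
        {"url": "/b", "status": 404},
        {"url": "/b", "status": 404},
        {"url": "/c", "status": 500},
    ]
    print(export_csv(demo))
```

The threshold is a parameter so that a site could, for instance, also treat redirects as abnormal by lowering it to 300.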
The automatic optimization condition may include automatically forbidding the crawling of request addresses whose path is deeper than x levels, where x is an integer and x >= 1. In that case, the log analysis result triggering the automatic optimization condition may be: the log analysis result includes request addresses whose path is deeper than x levels. Carrying out automatic search engine optimization on the website may then be: automatically adding those addresses to the robots.txt file, so as to forbid crawling of them. The specific value of x can be set as required in a given implementation; the present embodiment places no limit on the size of x. For example, x may be 5.
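A short sketch of the path-depth check follows. How the levels of a path are counted is not specified in the text; this example assumes that the depth is the number of non-empty segments in the URL path, so "/index.html" has depth 1. The function names and sample addresses are hypothetical.

```python
from urllib.parse import urlsplit

def path_depth(url):
    """Assumed definition: number of non-empty segments in the URL path."""
    return len([segment for segment in urlsplit(url).path.split("/") if segment])

def too_deep(request_urls, x=5):
    """Return the distinct paths deeper than x levels, sorted for stable output."""
    return sorted({urlsplit(u).path for u in request_urls if path_depth(u) > x})

def deep_path_rules(request_urls, x=5):
    """Build robots.txt Disallow rules for the paths deeper than x levels."""
    return [f"Disallow: {path}" for path in too_deep(request_urls, x)]

if __name__ == "__main__":
    crawled = ["/a/b/c/d/e/f.html", "/a/b.html", "/index.html"]
    print(deep_path_rules(crawled, x=5))
```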
Of course, the above automatic optimization conditions are only a few examples and do not limit the present invention. The present invention provides configurable automatic optimization conditions, and these conditions can also be extended. When an automatic optimization condition is triggered, automatic SEO is carried out, which reduces the SEO problems introduced by new website features and lowers the cost of SEO.
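One way to make the automatic optimization conditions configurable and extensible is a small rule table, sketched below. This structure is an assumption for illustration and is not described in the text; the class `AutoRule`, the action labels, and the sample rules are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class AutoRule:
    """One configurable automatic optimization condition (hypothetical structure)."""
    name: str
    matches: Callable[[dict], bool]  # predicate over one parsed log record
    action: str                      # assumed labels: "disallow" or "export"

# Example configuration; new conditions are added by appending to this list.
DEFAULT_RULES = [
    AutoRule("dynamic_params", lambda r: "?" in r["url"], "disallow"),
    AutoRule("abnormal_status", lambda r: r["status"] >= 400, "export"),
]

def run_rules(records, rules=DEFAULT_RULES):
    """Return, per rule name, the distinct request addresses that triggered it."""
    triggered = {rule.name: [] for rule in rules}
    for record in records:
        for rule in rules:
            if rule.matches(record) and record["url"] not in triggered[rule.name]:
                triggered[rule.name].append(record["url"])
    return triggered

if __name__ == "__main__":
    # Hypothetical parsed records.
    demo = [
        {"url": "/a?x=1", "status": 200},
        {"url": "/b", "status": 404},
        {"url": "/c", "status": 200},
    ]
    print(run_rules(demo))
```

A caller would then dispatch on each rule's `action`, for example writing the "disallow" addresses to robots.txt and exporting the "export" addresses for review.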
In the above method of search engine optimization, the log generated when a search engine crawler crawls website content is parsed to obtain log data, and a log analysis result under the selected dimension is then obtained according to the log data, so that SEO can be carried out on the website according to the log analysis result. Teams and individuals without SEO expertise can therefore carry out SEO. This simplifies SEO operations, improves the usability of SEO, allows SEO problems arising during website update and maintenance to be found promptly, and improves SEO efficiency. The automatic SEO function provided at the same time reduces the cost of continuous SEO.
Fig. 3 is a structural schematic diagram of one embodiment of the device for search engine optimization of the present invention. The device in the present embodiment can implement the flow of the embodiment shown in Fig. 1. As shown in Fig. 3, the device for search engine optimization may include: an acquisition module 31, a parsing module 32, an obtaining module 33, and an optimization module 34.
The acquisition module 31 is configured to obtain the log generated when a search engine crawler crawls website content.
The parsing module 32 is configured to parse the log according to the configured log format to obtain the log data contained in the log. The present embodiment supports multiple log formats, for example: the Common Log Format (hereinafter: CLF), the common log file format of the NCSA, the Combined Log Format, and custom log formats. After the log format is configured, the parsing module 32 can parse the log according to the configured format to obtain the log data it contains. A log usually contains the data shown in Table 1, which are also the data indispensable for SEO. As can be seen from Table 1, the log data obtained by the parsing module 32 include data of at least one of the following dimensions: the IP address of the search engine crawler, the time at which the search engine crawler crawled the website content, the request method of the search engine crawler, the request address of the search engine crawler, the status code of the website response, and the UA of the search engine crawler.
The obtaining module 33 is configured to obtain the log analysis result under the selected dimension according to the log data, the selected dimension being one of, or a combination of, the dimensions of the data included in the log data. Specifically, the behavior of a search engine crawler when crawling website content can be determined from the log data. In Table 1, the crawler UA "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)" appears twice; it is abbreviated here as "Baiduspider". The logs generated by each search engine crawler can be separated out of the log data by crawler UA. For example, the crawler UA "Baiduspider" identifies crawler logs of the Baidu search engine, so the Baidu crawler logs can be filtered out by this UA. These show how the Baidu crawler accessed the various web resources: the time of crawling, the request method, the request address, the status code, and so on. The obtaining module 33 can thus obtain the log analysis result under the selected dimension from the log data, as shown in Fig. 2. In Fig. 2, 3.1 is the filter for the crawler UA dimension, 3.2 is the filter for the time dimension, and 3.3 is the filter for the status code. In the present embodiment, a log analysis result for a single dimension can be obtained, and a log analysis result for a combination of dimensions (that is, a combination of two or more dimensions) can also be obtained. 3.4 is the log analysis result for the Baidu crawler; it shows the crawl count of each request path and the total, together with optimization suggestions for paths at each level. 3.5 is the more detailed log analysis information under the selected request path; it shows, for example, the crawl count of the request address "/index.html", the status code of the website response, and optimization suggestions for each request address.
The optimization module 34 is configured to perform search engine optimization on the website according to the log analysis result obtained by the obtaining module 33.
In one implementation of the present embodiment, the optimization module 34 is specifically configured to provide optimization suggestions for the website according to the log analysis result and to carry out SEO on the website according to those suggestions. As described above, 3.4 and 3.5 in Fig. 2 give optimization suggestions for the website, so SEO can be carried out on the website according to those suggestions.
In another implementation of the present embodiment, the optimization module 34 is specifically configured to judge whether the log analysis result triggers an automatic optimization condition and, if so, to carry out automatic SEO on the website.
The automatic optimization condition may include automatically forbidding the crawling of addresses that carry dynamic parameters. In that case, the log analysis result triggering the automatic optimization condition may be: a request address of the search engine crawler in the log analysis result is an address carrying dynamic parameters. The optimization module 34 is then specifically configured to automatically add the address to the robots exclusion standard (robots.txt) file, so as to forbid crawling of that address.
The automatic optimization condition may include automatically exporting addresses with abnormal status codes. In that case, the log analysis result triggering the automatic optimization condition may be: the log analysis result includes request addresses with abnormal status codes. The optimization module 34 is then specifically configured to automatically export the request addresses with abnormal status codes and provide them to SEO personnel for handling.
The automatic optimization condition may include automatically forbidding the crawling of request addresses whose path is deeper than x levels, where x is an integer and x >= 1. In that case, the log analysis result triggering the automatic optimization condition may be: the log analysis result includes request addresses whose path is deeper than x levels. The optimization module 34 is then specifically configured to automatically add those addresses to the robots.txt file, so as to forbid crawling of them. The specific value of x can be set as required in a given implementation; the present embodiment places no limit on the size of x. For example, x may be 5.
Of course, the above automatic optimization conditions are only a few examples and do not limit the present invention. The present invention provides configurable automatic optimization conditions, and these conditions can also be extended. When an automatic optimization condition is triggered, the optimization module 34 carries out automatic SEO, which reduces the SEO problems introduced by new website features and lowers the cost of SEO.
In the above device for search engine optimization, the parsing module 32 parses the log generated when a search engine crawler crawls website content to obtain log data, and the obtaining module 33 then obtains a log analysis result under the selected dimension according to the log data, so that the optimization module 34 can carry out SEO on the website according to the log analysis result. Teams and individuals without SEO expertise can therefore carry out SEO. This simplifies SEO operations, improves the usability of SEO, allows SEO problems arising during website update and maintenance to be found promptly, and improves SEO efficiency. The automatic SEO function provided at the same time reduces the cost of continuous SEO.
It should be noted that, in the description of the present invention, the terms "first", "second", and the like are used for descriptive purposes only and are not to be understood as indicating or implying relative importance. In addition, in the description of the present invention, unless otherwise stated, "multiple" means two or more.
Any process or method description in a flow chart, or otherwise described herein, may be understood as representing a module, fragment, or portion of code that includes one or more executable instructions for implementing a specific logical function or step of the process. The scope of the preferred embodiments of the present invention includes other implementations in which functions may be performed out of the order shown or discussed, including substantially simultaneously or in the reverse order depending on the functions involved, as will be understood by those skilled in the art to which the embodiments of the present invention belong.
It should be understood that each part of the present invention can be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, multiple steps or methods can be implemented in software or firmware that is stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or a combination of the following techniques well known in the art may be used: a discrete logic circuit having logic gates for implementing logic functions on data signals, an application-specific integrated circuit having suitable combinational logic gates, a programmable gate array (hereinafter: PGA), a field programmable gate array (hereinafter: FPGA), and so on.
Those skilled in the art will understand that all or part of the steps of the above method embodiments can be completed by a program instructing the relevant hardware. The program can be stored in a computer-readable storage medium and, when executed, performs one of the steps of the method embodiments or a combination thereof.
In addition, the functional modules in the embodiments of the present invention may be integrated into one processing module, may each exist physically on their own, or two or more of them may be integrated into one module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. If the integrated module is implemented in the form of a software functional module and sold or used as an independent product, it may also be stored in a computer-readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.
In this specification, a description referring to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" means that a specific feature, structure, material, or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present invention. In this specification, such terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples.
Although embodiments of the present invention have been shown and described above, it is to be understood that the above embodiments are exemplary and are not to be construed as limiting the present invention. Those of ordinary skill in the art may change, modify, replace, and vary the above embodiments within the scope of the present invention.

Claims (14)

1. a kind of method of search engine optimization is it is characterised in that include:
Obtain search engine reptile and crawl produced daily record during web site contents;
According to the journal format of configuration, described daily record is parsed, obtain the daily record data that described daily record comprises;
The log analysis result being obtained under selected dimension according to described daily record data, described selected dimension is described daily record The dimension of the data included by data one of or combination;
Engine optimization is scanned for according to described log analysis result to website.
2. method according to claim 1 is it is characterised in that described daily record data includes at least one dimension following Data: the Internet Protocol ip address of described search engine reptile, described search engine reptile crawl web site contents time, The requesting method of described search engine reptile, the described request address of search engine reptile, the conditional code of websites response and described The user agent of search engine reptile.
3. method according to claim 1 and 2 it is characterised in that described according to described log analysis result to website Scan for engine optimization to include:
Optimizing Suggestions to website are provided according to described log analysis result, according to described Optimizing Suggestions, described website is searched Optimization held up in index.
4. method according to claim 1 and 2 it is characterised in that described according to described log analysis result to website Scan for engine optimization to include:
Judge whether described log analysis result triggers Automatic Optimal condition;
If it is, automatic search engine optimization is carried out to website.
5. method according to claim 4 is it is characterised in that described Automatic Optimal condition includes automatically forbidding that crawl is taken Address with dynamic parameter;
Described log analysis result triggering Automatic Optimal condition includes: search engine reptile described in described log analysis result Request address is the address carrying dynamic parameter;
Described automatic search engine optimization is carried out to website include: described address is automatically added to web crawlers exclusion standard literary composition In part, to forbid capturing described address.
6. method according to claim 4 is it is characterised in that described Automatic Optimal condition includes being derived automatically from conditional code Abnormal address;
Described log analysis result triggers Automatic Optimal condition and includes: described log analysis result includes abnormal the asking of conditional code Ask address;
Described automatic search engine optimization is carried out to website include: be derived automatically from the abnormal request address of conditional code, be supplied to and search Index is held up optimization personnel and is processed.
7. The method according to claim 4, characterized in that the automatic optimization condition includes automatically prohibiting the crawling of request addresses deeper than x path levels, where x is an integer and x ≥ 1;
the log analysis result triggering the automatic optimization condition comprises: the log analysis result including a request address deeper than x path levels;
automatically performing search engine optimization on the website comprises: automatically adding the address to the robots exclusion standard file, so as to prohibit crawling of the address.
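The depth check in claim 7 can be sketched as below. The claims do not say how path levels are counted; this sketch assumes the level is the number of non-empty segments in the URL path, ignoring the query string. The helper names are hypothetical.

```python
from urllib.parse import urlsplit

def path_depth(url):
    """Count non-empty path segments, e.g. '/a/b/c.html' has depth 3."""
    return len([segment for segment in urlsplit(url).path.split("/") if segment])

def too_deep(request_urls, x):
    """Return the distinct request addresses whose path is deeper than x levels."""
    if x < 1:
        raise ValueError("x must be an integer >= 1")
    flagged = []
    for url in request_urls:
        if path_depth(url) > x and url not in flagged:
            flagged.append(url)
    return flagged

deep = too_deep(["/a", "/a/b/c", "/a/b/c/d", "/a/b/c/d"], x=3)
```

The addresses returned by `too_deep` are the ones that would then be added to the robots exclusion standard file.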
8. A search engine optimization device, characterized by comprising:
an acquisition module, configured to acquire the logs generated when a search engine crawler crawls website content;
a parsing module, configured to parse the logs according to a configured log format to obtain the log data contained in the logs;
an obtaining module, configured to obtain, from the log data, a log analysis result under a selected dimension, the selected dimension being one or a combination of the dimensions of the data included in the log data;
an optimization module, configured to perform search engine optimization on the website according to the log analysis result obtained by the obtaining module.
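To make the "selected dimension" of the obtaining module concrete, the sketch below counts parsed log records per value of one dimension, or per combination of several. Using a simple count as the "log analysis result" is an assumption; the claims do not fix the kind of statistic. The `analyze` function and the sample records are hypothetical.

```python
from collections import Counter

def analyze(records, dimensions):
    """Count records per combination of the selected dimensions."""
    return Counter(tuple(record[d] for d in dimensions) for record in records)

# Made-up parsed log records, for illustration only.
records = [
    {"user_agent": "Googlebot", "status": 200},
    {"user_agent": "Baiduspider", "status": 404},
    {"user_agent": "Googlebot", "status": 200},
]

by_status = analyze(records, ["status"])                      # a single dimension
by_agent_status = analyze(records, ["user_agent", "status"])  # a combination
```

Keying the result by a tuple keeps the single-dimension and combined cases uniform, so an optimization module could consume either shape in the same way.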
9. The device according to claim 8, characterized in that the log data obtained by the parsing module includes data of at least one of the following dimensions: the Internet Protocol (IP) address of the search engine crawler, the time at which the search engine crawler crawled website content, the request method of the search engine crawler, the request address of the search engine crawler, the status code of the website's response, and the user agent of the search engine crawler.
10. The device according to claim 8 or 9, characterized in that
the optimization module is specifically configured to provide optimization suggestions for the website according to the log analysis result, and to perform search engine optimization on the website according to the optimization suggestions.
11. The device according to claim 8 or 9, characterized in that
the optimization module is specifically configured to determine whether the log analysis result triggers an automatic optimization condition and, if so, to automatically perform search engine optimization on the website.
12. The device according to claim 11, characterized in that the automatic optimization condition includes automatically prohibiting the crawling of addresses that carry dynamic parameters; the log analysis result triggering the automatic optimization condition comprises: a request address of the search engine crawler in the log analysis result being an address that carries dynamic parameters;
the optimization module is specifically configured to automatically add the address to the robots exclusion standard file, so as to prohibit crawling of the address.
13. The device according to claim 11, characterized in that the automatic optimization condition includes automatically exporting addresses with abnormal status codes; the log analysis result triggering the automatic optimization condition comprises: the log analysis result including a request address with an abnormal status code;
the optimization module is specifically configured to automatically export the request addresses with abnormal status codes and provide them to search engine optimization personnel for handling.
14. The device according to claim 11, characterized in that the automatic optimization condition includes automatically prohibiting the crawling of request addresses deeper than x path levels, where x is an integer and x ≥ 1; the log analysis result triggering the automatic optimization condition comprises: the log analysis result including a request address deeper than x path levels;
the optimization module is specifically configured to automatically add the address to the robots exclusion standard file, so as to prohibit crawling of the address.
CN201510390418.2A 2015-07-06 2015-07-06 Method and device for optimizing search engine Active CN106339372B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510390418.2A CN106339372B (en) 2015-07-06 2015-07-06 Method and device for optimizing search engine

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510390418.2A CN106339372B (en) 2015-07-06 2015-07-06 Method and device for optimizing search engine

Publications (2)

Publication Number Publication Date
CN106339372A (en) 2017-01-18
CN106339372B (en) 2020-01-17

Family

ID=57825946

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510390418.2A Active CN106339372B (en) 2015-07-06 2015-07-06 Method and device for optimizing search engine

Country Status (1)

Country Link
CN (1) CN106339372B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110263283A (en) * 2019-06-19 2019-09-20 郑州悉知信息科技股份有限公司 Website detection method and device
CN111143645A (en) * 2018-11-02 2020-05-12 千寻位置网络有限公司 Method and device for automatic search engine optimization (SEO) using a web crawler
CN113238920A (en) * 2021-05-14 2021-08-10 杭州志卓科技股份有限公司 Data analysis system and method for quantitative evaluation of search engine optimization result
CN114295073A (en) * 2021-12-09 2022-04-08 江苏互旦网络科技有限公司 System for search engine automatic optimization

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101833581A (en) * 2010-05-11 2010-09-15 廖达伦 SEO website construction realizing method and system capable of optimizing search engine
CN104199830A (en) * 2014-07-31 2014-12-10 渠成 Search engine optimization big data management platform

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101833581A (en) * 2010-05-11 2010-09-15 廖达伦 SEO website construction realizing method and system capable of optimizing search engine
CN104199830A (en) * 2014-07-31 2014-12-10 渠成 Search engine optimization big data management platform

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111143645A (en) * 2018-11-02 2020-05-12 千寻位置网络有限公司 Method and device for automatic search engine optimization (SEO) using a web crawler
CN110263283A (en) * 2019-06-19 2019-09-20 郑州悉知信息科技股份有限公司 Website detection method and device
CN113238920A (en) * 2021-05-14 2021-08-10 杭州志卓科技股份有限公司 Data analysis system and method for quantitative evaluation of search engine optimization result
CN114295073A (en) * 2021-12-09 2022-04-08 江苏互旦网络科技有限公司 System for search engine automatic optimization
CN114295073B (en) * 2021-12-09 2023-08-08 江苏互旦网络科技有限公司 Automatic optimizing system for search engine

Also Published As

Publication number Publication date
CN106339372B (en) 2020-01-17

Similar Documents

Publication Publication Date Title
US11533357B2 (en) Systems and methods for tag inspection
US8473495B2 (en) Centralized web-based software solution for search engine optimization
CN103605738B (en) Web page access data statistical method and device
Loftus Demonstrating success: Web analytics and continuous improvement
US10083222B1 (en) Automated categorization of web pages
CN106339372A (en) Search engine optimization method and device
EP1958119A2 (en) System and method for appending security information to search engine results
US20190087180A1 (en) Identifying equivalent javascript events
CN106603296A (en) Log processing method and device
US8954413B2 (en) Methods and apparatus for adaptively harvesting pertinent data
EP3133504A2 (en) Method and device for knowledge base construction
CN107784113A (en) Html web page collecting method, device and computer-readable recording medium
CN103617390A (en) Malicious webpage judgment method, device and system
US10291492B2 (en) Systems and methods for discovering sources of online content
CN105653563A (en) Control method for grabbing webpage, dynamical updating method for black list and white list and related apparatus
CN106202368A (en) Prestrain method and apparatus
US20140129490A1 (en) Image url-based junk detection
Falk An ontology for threat intelligence
US20140317006A1 (en) Market specific reporting mechanisms for social content objects
CN103838865B (en) For excavating the method and device of ageing kind of subpage
JP6763433B2 (en) Information gathering system, information gathering method, and program
Zineddine Search engines crawling process optimization: a webserver approach
CN103117892B (en) Add method and the device of website visiting record
CN104504125A (en) Web page data monitoring method and device
Phan Building application powered by web scraping

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant