CN103902597B - The method and apparatus for determining relevance of searches classification corresponding to target keyword - Google Patents

The method and apparatus for determining relevance of searches classification corresponding to target keyword Download PDF

Info

Publication number
CN103902597B
CN103902597B CN201210581476.XA CN201210581476A CN103902597B CN 103902597 B CN103902597 B CN 103902597B CN 201210581476 A CN201210581476 A CN 201210581476A CN 103902597 B CN103902597 B CN 103902597B
Authority
CN
China
Prior art keywords
keyword
target
routing information
classification
searching order
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210581476.XA
Other languages
Chinese (zh)
Other versions
CN103902597A (en
Inventor
孙宇
谭广明
韩彦俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210581476.XA priority Critical patent/CN103902597B/en
Publication of CN103902597A publication Critical patent/CN103902597A/en
Application granted granted Critical
Publication of CN103902597B publication Critical patent/CN103902597B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The object of the present invention is to provide a kind of methods and apparatus for determining relevance of searches classification corresponding to target keyword.Specifically, according to the searching order routing information of target keyword, target critical term clustering belonging to the target keyword is determined from one or more keyword clusterings;According to the target critical term clustering, relevance of searches classification corresponding to the target keyword is determined, to be used for subsequent processing.Compared with prior art, the present invention, which passes through, determines target critical term clustering belonging to target keyword, and then relevance of searches classification corresponding to the target keyword, to be used for subsequent processing, relevance of searches classification corresponding to keyword is effectively determined to realize, and the automatic test to batch keyword data, only Optimizing Search engine search sequence does not provide reference, and improves the testing efficiency to search engine relevance.

Description

The method and apparatus for determining relevance of searches classification corresponding to target keyword
Technical field
The present invention relates to Internet technical fields more particularly to a kind of for determining search phase corresponding to target keyword The technology of closing property classification.
Background technique
Currently, with the development of internet technology and Internet application to user learn, work with life infiltration, people More and more by network acquisition information, keyword is such as inputted by search engine, search engine is by taking certain search Sortord, which determines, returns to the search result that user matches with keyword, however the search result and use of search engine return The matching degree of the search sequence of family input largely affects the accuracy that user obtains information.Correspondingly, if can mention The matching degree of the search sequence of search result and user's input that high search engine returns, will greatly improve user and obtain letter The efficiency of breath.Therefore, it is necessary to the correlations to search engine to carry out effective assessment test, such as according to keyword and search result Matching degree classify to keyword, determine relevance of searches classification corresponding to keyword, effectively determine keyword Corresponding relevance of searches classification, and improve the testing efficiency of search engine relevance.
Summary of the invention
The object of the present invention is to provide a kind of for determining relevance of searches class method for distinguishing corresponding to target keyword With equipment.
According to an aspect of the invention, there is provided a kind of for determining relevance of searches class corresponding to target keyword Method for distinguishing, wherein method includes the following steps:
A determines the mesh according to the searching order routing information of target keyword from one or more keyword clusterings Mark target critical term clustering belonging to keyword;
B determines relevance of searches classification corresponding to the target keyword according to the target critical term clustering, with In subsequent processing.
According to another aspect of the present invention, it additionally provides a kind of for determining the correlation of search corresponding to target keyword The classification of property classification determines equipment, wherein the category determines that equipment includes:
Determining device is clustered, for the searching order routing information according to target keyword, from one or more keywords Target critical term clustering belonging to the target keyword is determined in cluster;
Classification determining device, for determining and being searched corresponding to the target keyword according to the target critical term clustering Rope correlation classification, to be used for subsequent processing.
According to a further aspect of the invention, a kind of computer equipment is additionally provided, wherein the computer equipment includes such as The aforementioned classification for determining relevance of searches classification corresponding to target keyword according to a further aspect of the present invention determines Equipment.
According to a further aspect of the invention, it additionally provides a kind of for determining the correlation of search corresponding to target keyword Property classification search engine, wherein the search engine include as it is aforementioned according to a further aspect of the present invention for determining target The classification of relevance of searches classification corresponding to keyword determines equipment.
According to a further aspect of the invention, it additionally provides a kind of for determining the correlation of search corresponding to target keyword The search engine plug-in unit of property classification, wherein the search engine plug-in unit includes such as aforementioned being used for according to a further aspect of the present invention Determine that the classification of relevance of searches classification corresponding to target keyword determines equipment.
Compared with prior art, the present invention, which passes through, determines target critical term clustering belonging to target keyword, and then described Relevance of searches classification corresponding to target keyword effectively determines keyword institute to realize to be used for subsequent processing Corresponding relevance of searches classification, and to the automatic test of batch keyword data, only Optimizing Search engine search is not arranged Sequence provides reference, and improves the testing efficiency to search engine relevance.Moreover, the present invention may further determine that target keyword Corresponding preferred searching order routing information, to adjust the searching order routing information of the target keyword, thus into one Step realizes the sequence of Optimizing Search engine search, improves user information and obtains efficiency.Further, the present invention may further determine that excellent Change keyword set, the Optimizing Search sequence routing information of one or more of keywords to be optimized is determined, for adjusting The searching order routing information of one or more of keywords to be optimized is searched to further realize Optimizing Search engine Rope sequence improves user information and obtains efficiency.
Detailed description of the invention
By reading a detailed description of non-restrictive embodiments in the light of the attached drawings below, of the invention other Feature, objects and advantages will become more apparent upon:
Fig. 1 show one aspect according to the present invention for determining relevance of searches classification corresponding to target keyword Equipment schematic diagram;
Fig. 2 shows in accordance with a preferred embodiment of the present invention for determining relevance of searches corresponding to target keyword The equipment schematic diagram of classification;
Fig. 3 show according to a further aspect of the present invention for determining relevance of searches classification corresponding to target keyword Method flow diagram;
Fig. 4 show in accordance with a preferred embodiment of the present invention for determining relevance of searches corresponding to target keyword The method flow diagram of classification.
The same or similar appended drawing reference represents the same or similar component in attached drawing.
Specific embodiment
Present invention is further described in detail with reference to the accompanying drawing.
Fig. 1 show one aspect according to the present invention for determining relevance of searches classification corresponding to target keyword Classification determines equipment 1, wherein classification determines that equipment 1 includes cluster determining device 11 and classification determining device 12.Specifically, gather Class determining device 11 according to the searching order routing information of target keyword, determined from one or more keyword clusterings described in Target critical term clustering belonging to target keyword;Classification determining device 12 according to the target critical term clustering, determine described in Relevance of searches classification corresponding to target keyword, to be used for subsequent processing.Here, the meaning of described search correlation refers to The matching degree of keyword and search result.Here, classification determine equipment 1 include but is not limited to the network equipment, user equipment or The network equipment and user equipment are integrated constituted equipment by network.Wherein, the network equipment includes but is not limited to net The cloud that network host, single network server, multiple network server collection or multiple servers are constituted.Here, cloud is by based on cloud The a large amount of hosts or network server for calculating (Cloud Computing) are constituted, wherein and cloud computing is one kind of distributed computing, A super virtual computer consisting of a loosely coupled set of computers.The user equipment its include but is not limited to appoint What one kind can carry out the electronic product of human-computer interaction with user by keyboard, remote controler, touch tablet or voice-operated device, such as count Calculation machine, smart phone, PDA or IPTV etc..The network includes but is not limited to internet, wide area network, Metropolitan Area Network (MAN), local area network, VPN Network, wireless self-organization network (Ad Hoc network) etc..Those skilled in the art will be understood that above-mentioned classification determines that equipment 1 is only Citing, other network equipments or user equipment existing or be likely to occur from now on are such as applicable to the present invention, should also be included in Within the scope of the present invention, and it is incorporated herein by reference.
Specifically, cluster determining device 11 carries out clustering processing to multiple sample keywords first according to pre-defined rule, with Obtain one or more keyword clusterings;Further according to the searching order routing information of target keyword, from one or more keys Target critical term clustering belonging to the target keyword is determined in term clustering.It is searched here, the meaning of described search sequence refers to Index, which is held up, analyzes the understanding of user's input keyword and demand, with certain algorithm, in the predetermined web data extracted It is picked out in library and inputs the webpage that keyword matches with user, and provide it to user comprising but be not limited to such as theme The selection of matching degree result is sorted, good result proposes power sequence, cheating click is suppressed, general rise of prices of the stocks and other securities selected ci poem takes sequence, topic/abstract assembles row Sequence etc., wherein it may include many sub- sequences that the result, which proposes power sequence, and such as: web sites authority proposes power, official website proposes power, page Face richness proposes power, click proposes power etc..Here, described search engine includes but is not limited to that the Google of Google company such as is searched for Engine, baidu search engine of Baidu company etc., and as Google ToolBar of Google company, Baidu company hundred Degree searches the search engines plug-in unit such as MSN ToolBar of despot, Microsoft.Here, described search sequence routing information is for showing The sort algorithm code path information that search engine is passed through in determining candidate search sort result information process can be used and be searched Rope sequence ID of trace route path (Strategy Identifier, SID) and branch's mark (Branch Identifier, BID) carry out table Show, wherein branch's mark is subordinated to searching order ID of trace route path, can be transported when carrying out fine-grained mark to searching order With.Those skilled in the art will be understood that above-mentioned searching order, search engine and searching order routing information are only for example, other Searching order or search engine or searching order routing information existing or be likely to occur from now on are such as applicable to the present invention, It should be included within the scope of the present invention, and be incorporated herein by reference.
Specifically, cluster determining device 11 carries out clustering processing to multiple sample keywords, such as first according to pre-defined rule Using unsupervised learnings methods such as k-means, ISODATA, chain methods, to obtain one or more of keyword clusterings. Wherein, the pre-defined rule includes but is not limited to following at least any one:
It is crucial to the multiple sample according to the corresponding searching order routing information of the multiple sample keyword Word carries out clustering processing, to obtain one or more of keyword clusterings;
Information is recorded according to the historical search of the corresponding search user of the multiple sample keyword, to described more A sample keyword carries out clustering processing, to obtain one or more of keyword clusterings;
Meet the statistics in the content of pages information of predetermined quality degree threshold value according to the multiple each leisure of sample keyword Information carries out clustering processing to the multiple sample keyword, to obtain one or more of keyword clusterings.
For example, when the pre-defined rule includes according to the corresponding searching order path letter of the multiple sample keyword Breath carries out clustering processing to the multiple sample keyword, when obtaining one or more of keyword clusterings, it is assumed that multiple Sample keyword such as sample keyword I to VI, corresponding searching order routing information are as shown in table 1 below, wherein S_* Indicate the searching order ID of trace route path or branch's mark in the searching order path that keyword search request processing is passed through:
Sample keyword Searching order routing information
I S_A→S_C→S_D→S_B→S_E→S_G
II S_A→S_E→S_G→S_C→S_D→S_B
III S_A→S_F→S_C→S_D→S_E→S_G
IV S_A→S_C→S_D→S_F→S_E→S_G
V S_A→S_E→S_G→S_C→S_B→S_D
VI S_M→S_N→S_C→S_B→S_G→S_D
Table 1
Determining device 11 is then clustered according to the corresponding searching order routing information of sample keyword I to VI, using k- The unsupervised learnings methods such as means, ISODATA, chain method carry out clustering processing to sample keyword I to VI, obtain one Or multiple keyword clusterings are such as: sample keyword I, III and VI are such as classified as one kind by 1. the first keyword clustering of cluster1; 2. the second keyword clustering cluster2, is such as classified as one kind for keyword II and V;3. third keyword clustering cluster3, Sample keyword VI is such as classified as one kind;For another example, when the pre-defined rule includes according to the multiple each leisure of sample keyword Meet the statistical information in the content of pages information of predetermined quality degree threshold value, the multiple sample keyword is carried out at cluster Reason, when obtaining one or more of keyword clusterings, then clusters determining device 11 according to each leisure of sample keyword I to VI Meet the statistical information in the content of pages information of predetermined quality degree threshold value, as each comfortable satisfaction of sample keyword I to VI is predetermined The content of pages information of matter metric threshold such as belongs to high confidence and appoints in site page such as http://www.sina.com.cn/ Existing frequency information, using unsupervised learnings methods such as k-means, ISODATA, chain methods, to sample keyword I to VI into Row clustering processing obtains one or more keyword clusterings such as: 1. the first keyword clustering of cluster1, such as by sample key Word I, II and III are classified as one kind;2. sample keyword IV and VI are such as classified as one kind by the second keyword clustering cluster2; 3. third keyword clustering cluster3, is such as classified as one kind for sample keyword V.
Those skilled in the art will be understood that the above-mentioned mode for carrying out clustering processing to the multiple sample keyword is only Citing, other modes for carrying out clustering processing to the multiple sample keyword that are existing or being likely to occur from now on are for example applicable It in the present invention, should also be included within the scope of protection of the present invention, and be incorporated herein by reference.
Then, cluster determining device 11 is closed further according to the searching order routing information of target keyword from one or more Target critical term clustering belonging to the target keyword is determined in keyword cluster.Specifically, cluster determining device 11 is logical first It crosses search engine, browser, provide the application programming interfaces (API) of the third party devices such as target keyword equipment, obtain Target keyword, alternatively, obtaining the target critical that user is inputted by user equipment by dynamic web page techniques such as ASP, JSP Word;Then, cluster determining device 11 is gathered further according to the searching order routing information of target keyword from one or more keywords Target critical term clustering belonging to the target keyword is determined in class.
For example, it is assumed that test man A assessment search engine relevance test process in, in test platform keyword input field Target keyword goal-query is inputted, then clusters determining device 11 by dynamic web page techniques such as ASP, JSP, can get The target keyword goal-query that test man A is inputted by user equipment.
Those skilled in the art will be understood that the mode of above-mentioned acquisition target keyword is only for example, other are existing or modern The mode for the acquisition target keyword being likely to occur afterwards is such as applicable to the present invention, should also be included in the scope of the present invention with It is interior, and be incorporated herein by reference.
Finally, searching order routing information of the cluster determining device 11 further according to target keyword, is closed from one or more Target critical term clustering belonging to the target keyword is determined in keyword cluster.Here, described in cluster determining device 11 is determining The method of target critical term clustering includes but is not limited to following at least any one:
1) by the class searching order path of the searching order routing information of the target keyword and the keyword clustering Information is compared, with target critical term clustering belonging to the determination target keyword.For example, it is assumed that cluster determining device 11 After carrying out clustering processing to sample keyword I to VI as shown in Table 1, obtained each keyword clustering and the characterization key The class searching order routing information of term clustering is as shown in table 2 below:
Keyword clustering Class searching order routing information
cluster1 S_A→S_C→S_D→S_E→S_G→S_F
cluster2 S_A→S_E→S_G→S_C→S_D→S_B
cluster3 S_M→S_N→S_C→S_B→S_G→S_D
Table 2
Assuming that cluster determining device 11 obtain target keyword goal-query searching order routing information be S_A → Sequence in S_C → S_D → S_B → S_E → S_G, with the class searching order routing information of the first keyword clustering cluster1 And searching order ID of trace route path it is all the same routing information it is most, then cluster determining device 11 determine target keyword goal- Target critical term clustering belonging to query is the first keyword clustering cluster1.
2) reference for including in the searching order routing information of the target keyword and the keyword clustering is crucial The searching order routing information of word is compared, with target critical term clustering belonging to the determination target keyword.For example, false If cluster determining device 11 obtain target keyword goal-query searching order routing information be S_A → S_C → S_D → S_B → S_E → S_G, it is identical as the searching order routing information of keyword I in the first keyword clustering cluster1, then gather Class determining device 11 determines that target critical term clustering belonging to target keyword goal-query is the first keyword clustering cluster1。
Those skilled in the art will be understood that the side of target critical term clustering belonging to the above-mentioned determination target keyword Formula is only for example, target critical term clustering belonging to other described target keywords of determination that is existing or being likely to occur from now on Mode is such as applicable to the present invention, should also be included within the scope of protection of the present invention, and is incorporated herein by reference.
Classification determining device 12 can count each right with reference to keyword included by the target critical term clustering first The relevance of searches description information answered determines relevance of searches classification corresponding to target critical term clustering;Then, further according to institute Relevance of searches classification corresponding to target critical term clustering is stated, determines relevance of searches class corresponding to the target keyword Not, to be used for subsequent processing.Here, described search correlation classification includes but is not limited to such as high correlation classification, lower phase Closing property classification, uncorrelated classification, cheating keyword categories etc..Here, the subsequent processing includes but is not limited to such as: 1) to target Keyword carries out Screening Treatment, such as whether as test data etc.;2) the searching order information of optimization aim keyword.This field Technical staff will be understood that above-mentioned relevance of searches classification and subsequent processing mode are only for example, other are existing or from now on may The relevance of searches classification or subsequent processing mode of appearance are such as applicable to the present invention, should also be included in the scope of the present invention with It is interior, and be incorporated herein by reference.
For example, it is assumed that cluster determining device 11 determines that target critical term clustering belonging to target keyword goal-query is First keyword clustering cluster1, and the institute of sample keyword I, II and III included by keyword clustering cluster1 are right The relevance of searches description information answered is respectively that correlation is high, correlation is high, correlation is low, since the high correlation of correlation is retouched State information account for correlation description information total quantity ratio meet be greater than threshold value such as 0.65, then classification determining device 12 determination target Relevance of searches classification corresponding to keyword goal-query is the high classification of correlation.For another example, it is assumed that cluster determining device 11 Determine that target critical term clustering belonging to target keyword goal-query is the first keyword clustering cluster2, and keyword The corresponding relevance of searches description information for clustering sample keyword IV and VI included by cluster2 is respectively correlation It is low, correlation is low, due to the low correlation description information of correlation account for correlation description information total quantity ratio satisfaction be greater than Threshold value such as 0.65, then classification determining device 12 determines that relevance of searches classification corresponding to target keyword goal-query is phase The low classification of closing property.
Those skilled in the art will be understood that relevance of searches classification corresponding to the above-mentioned determination target keyword Mode is only for example, relevance of searches class corresponding to other described target keywords of determination that is existing or being likely to occur from now on It is such as applicable to the present invention otherwise, should also be included within the scope of protection of the present invention, and is contained in by reference herein This.
Classification determines constantly to work between each device of equipment 1.Specifically, cluster determining device 11 continues According to the searching order routing information of target keyword, the target keyword institute is determined from one or more keyword clusterings The target critical term clustering of category;Classification determining device 12 continues to determine the target critical according to the target critical term clustering Relevance of searches classification corresponding to word, to be used for subsequent processing.Here, it should be understood by those skilled in the art that " lasting " refers to Classification determines the determination that each device of equipment 1 constantly carries out target critical term clustering respectively and relevance of searches classification really It is fixed, until classification determines that equipment 1 stops the determination of target critical term clustering in a long time.
Preferably, the keyword clustering includes the class searching order routing information for characterizing the keyword clustering, Cluster determining device 11 includes that comparing unit (not shown), similarity determining unit (not shown) and cluster determination unit (are not shown Out).The preferred embodiment is described below with reference to Fig. 1: comparing unit is by the searching order path of the target keyword Information is compared with class searching order routing information corresponding to one or more of keyword clusterings, to determine State the searching order routing information of target keyword and the smallest edit distance of the class searching order routing information;Similarity is true Order member determines that the searching order routing information of the target keyword and class search are arranged according to the smallest edit distance The sequence similarity of paths of sequence routing information;Determination unit is clustered according to the sequence similarity of paths, determines that the target is closed Keyword cluster.
Specifically, comparing unit carries out clustering processing to multiple sample keywords, such as uses k- first according to pre-defined rule The unsupervised learnings methods such as means, ISODATA, chain method, to obtain described in one or more of keyword clusterings determinations One or more keyword clusterings.Here, mode and cluster that comparing unit obtains one or more of keyword clusterings are really Determine device 11 obtain one or more of keyword clusterings mode it is same or similar, for simplicity, therefore no longer superfluous herein State, and include by reference and this.
Then, comparing unit is by the searching order routing information of the target keyword and one or more of keywords The corresponding class searching order routing information of cluster is compared, with the searching order path of the determination target keyword The smallest edit distance of information and the class searching order routing information.For example, it is assumed that the target keyword that comparing unit obtains The searching order routing information of goal-query is S_A → S_C → S_D → S_B → S_E → S_G, and described in comparing unit determination One or more keyword clusterings are as shown in Table 2 above, then comparing unit is by the searching order of target keyword goal-query Routing information S_A → S_C → S_D → S_B → S_E → S_G is serialized to obtain character string goal-string= " ACDBEG " equally carries out class searching order routing information corresponding to keyword clustering cluster1 to cluster3 Serializing obtains corresponding character string such as cluster1-string=" ACDEGF ", cluster2-string=" AEGSDB ", Cluster3-string=" MNCBGD ", then, comparing unit pass through the smallest edit distances such as Dynamic Programming, matrix method Algorithm calculates separately character string goal-string corresponding to the searching order routing information of target keyword goal-query The corresponding class searching order routing information institute of=" ACDBEG " and keyword clustering cluster1 to cluster3 is right The character string answered such as cluster1-string=" ACDEGF ", cluster2-string=" AEGSDB ", cluster3- The smallest edit distance of string=" MNCBGD ", such as obtains target keyword goal-query and keyword clustering cluster1 To cluster3 pair smallest edit distance be respectively as follows: 2,6 and 6.
Those skilled in the art will be understood that the mode of the above-mentioned determination smallest edit distance is only for example, other are existing Or the mode of the determination smallest edit distance that is likely to occur from now on be such as applicable to the present invention, should also be included in the present invention Within protection scope, and it is incorporated herein by reference.
Similarity determining unit determines the searching order path letter of the target keyword according to the smallest edit distance The sequence similarity of paths of breath and the class searching order routing information.For example, connect example, similarity determining unit is according to comparing The smallest edit distance that unit determines, the sequence similarity of paths is determined by following formula (1):
Wherein, d is smallest edit distance, then similarity determining unit determines target keyword according to above-mentioned formula (1) The searching order routing information of goal-query is searched with the respective class of keyword clustering cluster1 to cluster3 respectively The sequence similarity of paths of rope sequence routing information is respectively as follows: 1/3,1/7 and 1/7.
Preferably, the smallest edit distance that similarity determining unit is determined according to comparing unit, passes through following formula (2) the sequence similarity of paths is determined:
Wherein, α is normalization coefficient,For the average string length of character string corresponding to class searching order routing information, D is smallest edit distance, wherein normalization coefficient α can be calculated by following formula (3):
Wherein, x indicates the statistical length of character string during the test corresponding to class searching order routing information, if α= 0.5, and character string corresponding to the corresponding class searching order routing information of keyword clustering cluster1 to cluster3 Average string length is 6, then similarity determining unit can determine target keyword goal-query's according to above-mentioned formula (2) Searching order routing information is believed with the respective class searching order path keyword clustering cluster1 to cluster3 respectively The sequence similarity of paths of breath is respectively as follows: 1,3/7 and 3/7.
Those skilled in the art will be understood that the mode of the above-mentioned determination sequence similarity of paths is only for example, other are existing The mode of the determination sequence similarity of paths that is having or being likely to occur from now on is such as applicable to the present invention, should also be included in this Within invention protection scope, and it is incorporated herein by reference.
Determination unit is clustered according to the sequence similarity of paths, determines the target critical term clustering, as described in determining The target critical term clustering belonging to target keyword is that the sequence similarity of paths meets predetermined threshold as corresponding to 0.8 Keyword clustering.For example, connecting example, similarity determining unit determines the searching order path of target keyword goal-query The information sequence path with the respective class searching order routing information of keyword clustering cluster1 to cluster3 respectively Similarity is respectively as follows: 1,3/7 and 3/7, then clusters determination unit and determine the target belonging to target keyword goal-query Keyword clustering is cluster1.
Preferably, cluster determining device 11 can also obtain one or more keywords to be measured to be processed first, using as The target keyword;Then, poly- from one or more keywords according to the searching order routing information of the target keyword Target critical term clustering belonging to the target keyword is determined in class;Classification determining device 12 can also be first according to the target Keyword clustering determines relevance of searches classification corresponding to the target keyword;Then, according to described search correlation class Not, Screening Treatment is carried out to the target keyword.
Specifically, cluster determining device 11 can also be set by such as search engine, browser, offer keyword to be measured first The application programming interfaces (API) of standby equal third party devices obtain one or more keywords to be measured to be processed, using as described Target keyword;Then, according to the searching order routing information of the target keyword, from one or more keyword clusterings Determine target critical term clustering belonging to the target keyword.Here, cluster determining device 11 determines the target keyword The mode of affiliated target critical term clustering and aforementioned cluster determining device 11 determine target critical belonging to the target keyword The mode of term clustering is same or similar, and for simplicity, therefore details are not described herein, and include by reference and this.
Then, classification determining device 12 can also determine the target keyword first according to the target critical term clustering Corresponding relevance of searches classification.Here, classification determining device 12 determines that search corresponding to the target keyword is related Property class the side of relevance of searches classification corresponding to the target keyword is determined with aforementioned categories determining device 12 otherwise Formula is same or similar, and for simplicity, therefore details are not described herein, and include by reference and this.
Then, classification determining device 12 carries out at screening the target keyword according to described search correlation classification Reason.For example, it is assumed that the target keyword that cluster determining device 11 obtains include such as query1, query2, query3 and Query4, and classification determining device 12 determines that the target keyword query1, query2, query3 and query4 are corresponding described Relevance of searches classification be respectively correlation is high, in correlation, correlation is low, correlation is high, then 12 basis of classification determining device The corresponding described search correlation classification of target keyword query1, query2, query3 and query4, sieves it Choosing processing, such as the keyword query3 for belonging to the low classification of correlation is screened from keyword set to be measured, with to its into Row later period searching order Advance data quality.
Preferably, classification determines that equipment 1 further includes that set determining device (not shown) and path optimizing determining device (are not shown Out).Specifically, set determining device determines keyword set to be optimized corresponding to the keyword clustering;Path optimizing is true Determine device common search row according to corresponding to the one or more keyword to be optimized that the keyword set to be optimized includes Sequence routing information determines the Optimizing Search sequence routing information of one or more of keywords to be optimized, for adjusting institute State the searching order routing information of one or more keywords to be optimized.
Specifically, the relevance of searches description letter for all keywords that set determining device can include according to keyword clustering Breath, such as relevance of searches is high, relevance of searches is low, finds out the low classification of relevance of searches, using as the keyword clustering Corresponding keyword set to be optimized.Here, the keyword set to be optimized, which corresponds to, belongs to the low classification of relevance of searches Keyword.For example, it is assumed that the keyword for belonging to the low classification of relevance of searches in keyword clustering cluster1 is that sample is crucial The keyword for belonging to the low classification of relevance of searches in word III, keyword clustering cluster2 is sample keyword IV and VI, crucial The keyword for not belonging to the low classification of relevance of searches in term clustering cluster3, then gathering determining device can be by keyword clustering All keywords for belonging to the low classification of relevance of searches that cluster1 includes into cluster3 are as the key to be optimized Set of words such as includes sample keyword III, IV and VI.
Preferably, set determining device can also be by the actual search knot of all keywords included by the keyword clustering Fruit relevant information is compared with system index information, and the keyword set to be optimized is determined from all keywords. Here, the actual search results relevant information includes but is not limited to as returned to search result quantity, obtaining click volume, return station The authority of point, the quality degree of content of pages for returning to website etc..Here, the system index information includes as returned to search knot Fruit quantity, the authority for returning to website etc..For example, set determining device can will cluster the determining keyword of determining device 11 Cluster the letter of all keywords, that is, sample keyword I to VI actual search results correlation included by cluster1 to cluster3 Breath is compared with system index information, the keyword set to be optimized is determined from all keywords, such as by sample The actual search results relevant information of keyword I to VI is unsatisfactory for the keyword of system index information, as the pass to be optimized Keyword set.
Those skilled in the art will be understood that the mode of the above-mentioned determination keyword set to be optimized is only for example, other The mode of the determination keyword set to be optimized that is existing or being likely to occur from now on is such as applicable to the present invention, also should include Within the scope of the present invention, and it is incorporated herein by reference.
Then, the one or more pass to be optimized that path optimizing determining device includes according to the keyword set to be optimized The sequence routing information of common search corresponding to keyword determines the Optimizing Search sequence of one or more of keywords to be optimized Routing information, with the searching order routing information for adjusting one or more of keywords to be optimized.For example, example is connected, Gathering the keyword set to be optimized that determining device determines includes sample keyword III, IV and VI, then path optimizing determines Device can the common search according to corresponding to sample keyword III, IV and VI sort routing information such as S_C → S_D → S-G, make For the Optimizing Search sort routing information, with the searching order path for adjusting one or more of keywords to be optimized Information such as deletes the Optimizing Search sequence path letter for including in the searching order routing information of sample keyword III, IV and VI Breath, alternatively, by the Optimizing Search for including in the searching order routing information of sample keyword III, IV and VI sequence path letter Breath is by other searching order routing information common search as corresponding to the keyword of the high classification of relevance of searches sequence path letter Breath replaces.
Preferably, the above-mentioned classification for being used to determine relevance of searches classification corresponding to target keyword can be determined equipment 1, it is combined with existing search engine, constitutes a kind of new search engine, existing search engine includes but is not limited to such as Google search engine, baidu search engine of Baidu company of Google company etc..
Preferably, the above-mentioned classification for being used to determine relevance of searches classification corresponding to target keyword can be determined equipment 1, it is combined with existing search engine plug-in unit, constitutes a kind of new search engine plug-in unit, existing includes but is not limited to such as Google ToolBar of Google company, the Baidu of Baidu company search the search engines such as MSN ToolBar of despot, Microsoft Plug-in unit.
Fig. 2 shows in accordance with a preferred embodiment of the present invention for determining relevance of searches corresponding to target keyword The equipment schematic diagram of classification, wherein classification determines that equipment 1 includes cluster determining device 11 ', classification determining device 12 ', preferably road Diameter determining device 13 ', adjustment device 14 ' and offer device 15 '.Specifically, cluster determining device 11 ' is according to target keyword Searching order routing information determines that target keyword belonging to the target keyword is poly- from one or more keyword clusterings Class;Classification determining device 12 ' determines relevance of searches corresponding to the target keyword according to the target critical term clustering Classification, to be used for subsequent processing;Preferred path determining device 13 ' determines preferred searching order corresponding to the target keyword Routing information;Device 14 ' is adjusted according to the preferred searching order routing information, adjusts the searching order of the target keyword Routing information;If search sequence matches with the target keyword, device 15 ' is provided according to the target critical adjusted The searching order routing information of word, search result corresponding to the search sequence is supplied to corresponding to the search sequence User.Here, cluster determining device 11 ', classification determining device 12 ' are same or similar with corresponding intrument shown in Fig. 1 respectively, so Place repeats no more, and is incorporated herein by reference.
And specifically, it is preferable to which path determining device 13 ' determines preferred searching order path corresponding to the target keyword Information.Here, preferred path determining device 13 ' determine the preferred searching order routing information mode include but is not limited to Under any one of at least:
1) by the sequence of each common search with reference to corresponding to keyword included by target critical term clustering path Information, as the preferred searching order routing information.For example, it is assumed that cluster determining device 11 ' determines target keyword goal- Target critical term clustering belonging to query is keyword clustering cluster1, then preferred path determining device 13 ' can incite somebody to action Each sequence of the common search with reference to corresponding to keyword, that is, sample keyword I, III and VI path letter included by cluster1 Breath such as S_A → S_C → S_D → S_E → S_G, as the preferred searching order routing information.
2) to each searching order routing information with reference to corresponding to keyword included by the target critical term clustering Processing for statistical analysis, with the determination preferred searching order routing information, as counted the search row for showing that keyword is passed by The frequency is met the path of predetermined threshold, or preferably searched using high frequency searching order path as described by sequence path frequency information Rope sequence routing information.For example, connecting example, preferred path determining device 13 ' can close each reference included by cluster1 Searching order routing information processing for statistical analysis corresponding to keyword, that is, sample keyword I, III and VI, the frequency is met Predetermined threshold such as frequency of occurrence meets 2 searching order path such as S_A → S_C → S_D → S_E → S_G → S_F, as described It is preferred that searching order routing information.
Those skilled in the art will be understood that the mode of the above-mentioned determination preferred searching order routing information is only for example, The mode of other determination preferred searching order routing informations that are existing or being likely to occur from now on is such as applicable to the present invention, It should also be included within the scope of protection of the present invention, and be incorporated herein by reference.
Device 14 ' is adjusted according to the preferred searching order routing information, adjusts the searching order of the target keyword Routing information.For example, it is assumed that the preferred searching order routing information that preferred path determining device 13 ' determines is S_A → S_C → S_D → S_E → S_G, then adjust device 14 ' according to the preferred searching order routing information, by such as adjustment algorithm or Machine learning model such as SVM model adjusts the searching order routing information of target keyword goal-query, such as by target critical The searching order routing information of word goal-query is adjusted to preferentially to execute the preferred searching order routing information S_A → S_C →S_D→S_E→S_G。
Those skilled in the art will be understood that the mode of the above-mentioned determination keyword set to be optimized is only for example, other The mode of the determination keyword set to be optimized that is existing or being likely to occur from now on is such as applicable to the present invention, also should include Within the scope of the present invention, and it is incorporated herein by reference.
If search sequence matches with the target keyword, device 15 ' is provided according to the target critical adjusted The searching order routing information of word, search result corresponding to the search sequence is supplied to corresponding to the search sequence User.Specifically, device 15 ' is provided and obtains search sequence first;Then, judge the search sequence and the target keyword Whether match, if matching, provides device 15 ' according to the searching order routing information of the target keyword adjusted, incite somebody to action Search result corresponding to the search sequence is supplied to user corresponding to the search sequence.Matched contain here, described Including search sequence and the target keyword, completely the same, search sequence is contained in the target keyword to justice.
Specifically, device 15 ' is provided first by dynamic web page techniques such as ASP, JSP, or is provided by search engine Application programming interfaces (API), obtain the search sequence that inputs by user equipment of user.For example, if search user B passes through it PC equipment inputs keyword " fresh flower " in search engine search column, by "enter" key", provides device 15 ' and passes through such as ASP, JSP Or the dynamic web page techniques such as PHP, the keyword " fresh flower " of search user B input can be got.Those skilled in the art should manage The mode for solving above-mentioned acquisition search sequence is only for example, other modes of acquisition search sequence that are existing or being likely to occur from now on It is such as applicable to the present invention, should also be included within the scope of protection of the present invention, and is incorporated herein by reference.
Then, the target keyword that device 15 ' is obtained according to cluster determining device 11 ' is provided, is compared by text Mode, judge whether the search sequence matches with the target keyword.
If matching, device 15 ' is provided according to the searching order routing information of the target keyword adjusted, it will be described Search result corresponding to search sequence is supplied to user corresponding to the search sequence.For example, connecting example, it is assumed that cluster is true The target keyword such as " fresh flower ", " the fresh flower express delivery " for determining the acquisition of device 11 ', then provide device 15 ' and judge search sequence " fresh flower " matches with target keyword such as " fresh flower ", " fresh flower express delivery ", then, provides device 15 ' for search engine according to tune Search result corresponding to the searching order routing information of the target keyword after whole such as " fresh flower Baidu discussion bar ", " fresh flower Picture materials day is off line ", as search result corresponding to search sequence " fresh flower ", and it is dynamic by ASP, JSP or PHP etc. The communication mode of state web technologies or other agreements, such as http or https communication protocol, are supplied to the search sequence Corresponding user, that is, user B browses such as the user equipment of the user for user.
Fig. 3 show according to a further aspect of the present invention for determining relevance of searches classification corresponding to target keyword Method flow diagram.
Specifically, in step sl, classification determines equipment 1 according to the searching order routing information of target keyword, from one Target critical term clustering belonging to the target keyword is determined in a or multiple keyword clusterings;In step s 2, classification is true Locking equipment 1 determines relevance of searches classification corresponding to the target keyword, according to the target critical term clustering to be used for Subsequent processing.Here, the meaning of described search correlation refers to the matching degree of keyword and search result.Here, classification is true Locking equipment 1 includes but is not limited to that the network equipment, user equipment or the network equipment and user equipment are integrated by network and are constituted Equipment.Wherein, the network equipment includes but is not limited to network host, single network server, multiple network server collection Or the cloud that multiple servers are constituted.Here, cloud by based on cloud computing (Cloud Computing) a large amount of hosts or network service Device is constituted, wherein cloud computing is one kind of distributed computing, a super void consisting of a loosely coupled set of computers Quasi- computer.The user equipment its include but is not limited to any one can with user by keyboard, remote controler, touch tablet or The electronic product, such as computer, smart phone, PDA or IPTV etc. of voice-operated device progress human-computer interaction.The network include but It is not limited to internet, wide area network, Metropolitan Area Network (MAN), local area network, VPN network, wireless self-organization network (Ad Hoc network) etc..This field Technical staff will be understood that above-mentioned classification determines that equipment 1 is only for example, other network equipments that are existing or being likely to occur from now on Or user equipment is such as applicable to the present invention, should also be included within the scope of protection of the present invention, and includes by reference herein In this.
Specifically, in step sl, classification determines that equipment 1 according to pre-defined rule, carries out multiple sample keywords first Clustering processing, to obtain one or more keyword clusterings;Further according to the searching order routing information of target keyword, from one Or target critical term clustering belonging to the target keyword is determined in multiple keyword clusterings.Here, described search sequence Meaning refers to that search engine inputs the understanding of keyword to user and demand is analyzed, and with certain algorithm, extracts predetermined Web database in pick out and input the webpage that matches of keyword with user, and do not provide it to user comprising but not Be limited to as theme matching degree result choose sequence, good result mention power sequence, cheating click suppress, general rise of prices of the stocks and other securities selected ci poem takes sequence, topic/ Abstract assembling sequence etc., wherein it may include many sub- sequences that the result, which proposes power sequence, and such as: web sites authority proposes power, official Net proposes power, page richness proposes power, clicks the power of proposing etc..Here, described search engine includes but is not limited to such as Google company Google search engine, baidu search engine of Baidu company etc., and as Google company Google ToolBar, hundred The search engines plug-in unit such as MSN ToolBar of despot, Microsoft are searched by the Baidu of degree company.Here, described search sequence path letter Cease the sort algorithm code path letter passed through in determining candidate search sort result information process for showing search engine Breath can identify (Branch with searching order ID of trace route path (Strategy Identifier, SID) and branch Identifier, BID) indicate, wherein branch's mark is subordinated to searching order ID of trace route path, searching order is carried out it is thin It can be employed when the mark of granularity.Those skilled in the art will be understood that above-mentioned searching order, search engine and searching order road Diameter information is only for example, other searching orders or search engine or searching order routing information existing or be likely to occur from now on It is such as applicable to the present invention, should also be included within the scope of protection of the present invention, and is incorporated herein by reference.
Specifically, in step sl, classification determines that equipment 1 according to pre-defined rule, carries out multiple sample keywords first Clustering processing, such as k-means, ISODATA, chain method unsupervised learning method are used, it is one or more of to obtain Keyword clustering.Wherein, the pre-defined rule includes but is not limited to following at least any one:
It is crucial to the multiple sample according to the corresponding searching order routing information of the multiple sample keyword Word carries out clustering processing, to obtain one or more of keyword clusterings;
Information is recorded according to the historical search of the corresponding search user of the multiple sample keyword, to described more A sample keyword carries out clustering processing, to obtain one or more of keyword clusterings;
Meet the statistics in the content of pages information of predetermined quality degree threshold value according to the multiple each leisure of sample keyword Information carries out clustering processing to the multiple sample keyword, to obtain one or more of keyword clusterings.
For example, when the pre-defined rule includes according to the corresponding searching order path letter of the multiple sample keyword Breath carries out clustering processing to the multiple sample keyword, when obtaining one or more of keyword clusterings, it is assumed that multiple Sample keyword such as sample keyword I to VI, corresponding searching order routing information are as shown in table 3 below, wherein S_* Indicate the searching order ID of trace route path or branch's mark in the searching order path that keyword search request processing is passed through:
Sample keyword Searching order routing information
I S_A→S_C→S_D→S_B→S_E→S_G
II S_A→S_E→S_G→S_C→S_D→S_B
III S_A→S_F→S_C→S_D→S_E→S_G
IV S_A→S_C→S_D→S_F→S_E→S_G
V S_A→S_E→S_G→S_C→S_B→S_D
VI S_M→S_N→S_C→S_B→S_G→S_D
Table 3
Then in step sl, classification determines that equipment 1 is believed according to the corresponding searching order path sample keyword I to VI Breath, using unsupervised learnings methods such as k-means, ISODATA, chain methods, carries out at cluster sample keyword I to VI Reason, obtains one or more keyword clusterings such as: 1. the first keyword clustering of cluster1, such as by sample keyword I, III and VI is classified as one kind;2. the second keyword clustering cluster2, is such as classified as one kind for keyword II and V;3. third keyword is poly- Sample keyword VI is such as classified as one kind by class cluster3;For another example, when the pre-defined rule includes being closed according to the multiple sample Each leisure of keyword meets the statistical information in the content of pages information of predetermined quality degree threshold value, to the multiple sample keyword into Row clustering processing, when obtaining one or more of keyword clusterings, then in step sl, classification determines equipment 1 according to sample Each leisure of this keyword I to VI meets the statistical information in the content of pages information of predetermined quality degree threshold value, such as sample keyword I The content of pages information for meeting predetermined quality degree threshold value to each leisure of VI such as belongs to high confidence and appoints site page such as http: // The frequency information occurred in www.sina.com.cn/, using unsupervised learnings sides such as k-means, ISODATA, chain methods Method carries out clustering processing to sample keyword I to VI, obtains one or more keyword clusterings such as: 1. the first keyword clustering of Sample keyword I, II and III are such as classified as one kind by cluster1;2. the second keyword clustering cluster2, such as closes sample Keyword IV and VI are classified as one kind;3. third keyword clustering cluster3, is such as classified as one kind for sample keyword V.
Those skilled in the art will be understood that the above-mentioned mode for carrying out clustering processing to the multiple sample keyword is only Citing, other modes for carrying out clustering processing to the multiple sample keyword that are existing or being likely to occur from now on are for example applicable It in the present invention, should also be included within the scope of protection of the present invention, and be incorporated herein by reference.
Then, in step sl, classification determines equipment 1 further according to the searching order routing information of target keyword, from one Target critical term clustering belonging to the target keyword is determined in a or multiple keyword clusterings.Specifically, in step sl, Classification determines equipment 1 first by the application of the third party devices such as search engine, browser, offer target keyword equipment Routine interface (API) obtains target keyword, alternatively, obtaining user by dynamic web page techniques such as ASP, JSP and passing through user The target keyword of equipment input;Then, cluster determining device 11 further according to target keyword searching order routing information, from Target critical term clustering belonging to the target keyword is determined in one or more keyword clusterings.
For example, it is assumed that test man A assessment search engine relevance test process in, in test platform keyword input field Input target keyword goal-query, then in step sl, classification determines that equipment 1 passes through the dynamic web page techniques such as ASP, JSP, The target keyword goal-query that test man A is inputted by user equipment can be got.
Those skilled in the art will be understood that the mode of above-mentioned acquisition target keyword is only for example, other are existing or modern The mode for the acquisition target keyword being likely to occur afterwards is such as applicable to the present invention, should also be included in the scope of the present invention with It is interior, and be incorporated herein by reference.
Finally, in step sl, classification determines equipment 1 further according to the searching order routing information of target keyword, from one Target critical term clustering belonging to the target keyword is determined in a or multiple keyword clusterings.Here, in step sl, class Not Que Ding the method that determines the target critical term clustering of equipment 1 include but is not limited to it is following any one of at least:
1) by the class searching order path of the searching order routing information of the target keyword and the keyword clustering Information is compared, with target critical term clustering belonging to the determination target keyword.For example, it is assumed that in step sl, class Not Que Ding after equipment 1 carries out clustering processing to sample keyword I to VI as shown in table 3, obtained each keyword clustering and The class searching order routing information for characterizing the keyword clustering is as shown in table 4 below:
Keyword clustering Class searching order routing information
cluster1 S_A→S_C→S_D→S_E→S_G→S_F
cluster2 S_A→S_E→S_G→S_C→S_D→S_B
cluster3 S_M→S_N→S_C→S_B→S_G→S_D
Table 4
Assuming that in step sl, classification determines the searching order path for the target keyword goal-query that equipment 1 obtains Information is S_A → S_C → S_D → S_B → S_E → S_G, the class searching order path with the first keyword clustering cluster1 In information sequence and searching order ID of trace route path it is all the same routing information it is most, then in step sl, classification determines that equipment 1 is true The target critical term clustering belonging to keyword goal-query that sets the goal is the first keyword clustering cluster1.
2) reference for including in the searching order routing information of the target keyword and the keyword clustering is crucial The searching order routing information of word is compared, with target critical term clustering belonging to the determination target keyword.For example, false If in step sl, classification determine equipment 1 obtain target keyword goal-query searching order routing information be S_A → The searching order routing information of keyword I in S_C → S_D → S_B → S_E → S_G, with the first keyword clustering cluster1 Identical, then in step sl, classification determines that equipment 1 determines that target critical term clustering belonging to target keyword goal-query is First keyword clustering cluster1.
Those skilled in the art will be understood that the side of target critical term clustering belonging to the above-mentioned determination target keyword Formula is only for example, target critical term clustering belonging to other described target keywords of determination that is existing or being likely to occur from now on Mode is such as applicable to the present invention, should also be included within the scope of protection of the present invention, and is incorporated herein by reference.
In step s 2, classification determines that equipment 1 can count each reference included by the target critical term clustering first Relevance of searches description information corresponding to keyword determines relevance of searches classification corresponding to target critical term clustering;So Afterwards, it further according to relevance of searches classification corresponding to the target critical term clustering, determines corresponding to the target keyword Relevance of searches classification, to be used for subsequent processing.Here, described search correlation classification includes but is not limited to such as high correlation Classification, compared with low correlation classification, uncorrelated classification, cheating keyword categories etc..Here, the subsequent processing includes but is not limited to Such as: 1) Screening Treatment being carried out to target keyword, such as whether as test data etc.;2) searching order of optimization aim keyword Information.Those skilled in the art will be understood that above-mentioned relevance of searches classification and subsequent processing mode are only for example, other are existing Or the relevance of searches classification or subsequent processing mode that are likely to occur from now on be such as applicable to the present invention, should also be included in this hair Within bright protection scope, and it is incorporated herein by reference.
For example, it is assumed that in step sl, classification determines that equipment 1 determines that target belonging to target keyword goal-query is closed Keyword cluster be the first keyword clustering cluster1, and sample keyword I, II included by keyword clustering cluster1 and The corresponding relevance of searches description information of III is respectively that correlation is high, correlation is high, correlation is low, due to correlation height Correlation description information account for correlation description information total quantity ratio meet be greater than threshold value such as 0.65, then in step s 2, Classification determines that equipment 1 determines that relevance of searches classification corresponding to target keyword goal-query is the high classification of correlation.Again Such as, it is assumed that in step sl, classification determines that equipment 1 determines that target critical term clustering belonging to target keyword goal-query is First keyword clustering cluster2, and sample keyword IV's and VI included by keyword clustering cluster2 is corresponding Relevance of searches description information is respectively that correlation is low, correlation is low, since the low correlation description information of correlation accounts for correlation Property description information total quantity ratio meet be greater than threshold value such as 0.65, then in step s 2, classification determine equipment 1 determine target pass Relevance of searches classification corresponding to keyword goal-query is the low classification of correlation.
Those skilled in the art will be understood that relevance of searches classification corresponding to the above-mentioned determination target keyword Mode is only for example, relevance of searches class corresponding to other described target keywords of determination that is existing or being likely to occur from now on It is such as applicable to the present invention otherwise, should also be included within the scope of protection of the present invention, and is contained in by reference herein This.
Classification determines constantly to work between each step of equipment 1.Specifically, in step sl, classification is true Locking equipment 1 is persistently according to the searching order routing information of target keyword, from one or more keyword clusterings described in determination Target critical term clustering belonging to target keyword;In step s 2, classification determines that equipment 1 continues according to the target keyword Cluster, determines relevance of searches classification corresponding to the target keyword, to be used for subsequent processing.Here, art technology Personnel should understand that " to continue " to refer to that classification determines that each step of equipment 1 constantly carries out target critical term clustering really respectively Fixed and relevance of searches classification determination, until classification determines that equipment 1 stops target critical term clustering really in a long time It is fixed.
Preferably, the keyword clustering includes the class searching order routing information for characterizing the keyword clustering, Step S1 includes step S11 (not shown), step S12 (not shown) and step S13 (not shown).It is excellent to this below with reference to Fig. 3 Select embodiment to be described: in step s 11, classification determines searching order routing information of the equipment 1 by the target keyword It is compared with class searching order routing information corresponding to one or more of keyword clusterings, with the determination mesh Mark the searching order routing information of keyword and the smallest edit distance of the class searching order routing information;In step s 12, Classification determines equipment 1 according to the smallest edit distance, determine the searching order routing information of the target keyword with it is described The sequence similarity of paths of class searching order routing information;In step s 13, classification determines equipment 1 according to the sequence path Similarity determines the target critical term clustering.
Specifically, in step s 11, classification determines that equipment 1 according to pre-defined rule, carries out multiple sample keywords first Clustering processing, such as k-means, ISODATA, chain method unsupervised learning method are used, it is one or more of to obtain Keyword clustering determines one or more of keyword clusterings.Here, in step s 11, it is described that classification determines that equipment 1 obtains For the mode of one or more keyword clusterings in step sl, classification determines that equipment 1 obtains one or more of keywords The mode of cluster is same or similar, and for simplicity, therefore details are not described herein, and include by reference and this.
Then, in step s 11, classification determines equipment 1 by the searching order routing information of the target keyword and institute It states the class searching order routing information corresponding to one or more keyword clusterings to be compared, be closed with the determination target The smallest edit distance of the searching order routing information of keyword and the class searching order routing information.For example, it is assumed that in step In S11, classification determines that the searching order routing information for the target keyword goal-query that equipment 1 obtains is S_A → S_C → S_ D → S_B → S_E → S_G, and in step s 11, classification determines that equipment 1 determines that one or more of keyword clusterings are as above It states shown in table 2, then in step s 11, classification determines searching order routing information of the equipment 1 by target keyword goal-query S_A → S_C → S_D → S_B → S_E → S_G is serialized to obtain character string goal-string=" ACDBEG ", equally will The class searching order routing information, which is serialized, corresponding to keyword clustering cluster1 to cluster3 is corresponded to Character string such as cluster1-string=" ACDEGF ", cluster2-string=" AEGSDB ", cluster3-string= " MNCBGD ", then, in step s 11, classification determines that equipment 1 passes through the smallest edit distances such as Dynamic Programming, matrix method Algorithm calculates separately character string goal-string corresponding to the searching order routing information of target keyword goal-query The corresponding class searching order routing information institute of=" ACDBEG " and keyword clustering cluster1 to cluster3 is right The character string answered such as cluster1-string=" ACDEGF ", cluster2-string=" AEGSDB ", cluster3- The smallest edit distance of string=" MNCBGD ", such as obtains target keyword goal-query and keyword clustering cluster1 To cluster3 pair smallest edit distance be respectively as follows: 2,6 and 6.
Those skilled in the art will be understood that the mode of the above-mentioned determination smallest edit distance is only for example, other are existing Or the mode of the determination smallest edit distance that is likely to occur from now on be such as applicable to the present invention, should also be included in the present invention Within protection scope, and it is incorporated herein by reference.
In step s 12, classification determines that equipment 1 according to the smallest edit distance, determines searching for the target keyword The sequence similarity of paths of rope sequence routing information and the class searching order routing information.For example, example is connected, in step S12 In, classification determines the smallest edit distance that equipment 1 is determined according to comparing unit, is determined by following formula (4) described Sort similarity of paths:
Wherein, d is smallest edit distance, then similarity determining unit determines target keyword according to above-mentioned formula (4) The searching order routing information of goal-query is searched with the respective class of keyword clustering cluster1 to cluster3 respectively The sequence similarity of paths of rope sequence routing information is respectively as follows: 1/3,1/7 and 1/7.
Preferably, in step s 12, classification determines the smallest edit distance that equipment 1 is determined according to comparing unit, leads to Following formula (5) is crossed to determine the sequence similarity of paths:
Wherein, α is normalization coefficient,For the average string length of character string corresponding to class searching order routing information, D is smallest edit distance, wherein normalization coefficient α can be calculated by following formula (6):
Wherein, x indicates the statistical length of character string during the test corresponding to class searching order routing information, if α= 0.5, and character string corresponding to the corresponding class searching order routing information of keyword clustering cluster1 to cluster3 Average string length is 6, then in step s 12, classification determines that equipment 1 can determine target keyword according to above-mentioned formula (5) The searching order routing information of goal-query is searched with the respective class of keyword clustering cluster1 to cluster3 respectively The sequence similarity of paths of rope sequence routing information is respectively as follows: 1,3/7 and 3/7.
Those skilled in the art will be understood that the mode of the above-mentioned determination sequence similarity of paths is only for example, other are existing The mode of the determination sequence similarity of paths that is having or being likely to occur from now on is such as applicable to the present invention, should also be included in this Within invention protection scope, and it is incorporated herein by reference.
In step s 13, classification determines that equipment 1 according to the sequence similarity of paths, determines that the target keyword is poly- Class, determined, the target critical term clustering belonging to the target keyword meets predetermined threshold for the sequence similarity of paths It is worth the keyword clustering as corresponding to 0.8.For example, connecting example, in step s 12, classification determines that equipment 1 determines target keyword The searching order routing information of goal-query is searched with the respective class of keyword clustering cluster1 to cluster3 respectively The sequence similarity of paths of rope sequence routing information is respectively as follows: 1,3/7 and 3/7, then in step s 13, classification determines that equipment 1 is true The target critical term clustering belonging to keyword goal-query that sets the goal is cluster1.
Preferably, in step sl, classification determines equipment 1 can also obtain one or more key to be measured to be processed first Word, using as the target keyword;Then, according to the searching order routing information of the target keyword, from one or more Target critical term clustering belonging to the target keyword is determined in a keyword clustering;In step s 2, classification determines equipment 1 Also relevance of searches classification corresponding to the target keyword can be determined first according to the target critical term clustering;Then, According to described search correlation classification, Screening Treatment is carried out to the target keyword.
Specifically, in step sl, classification determine equipment 1 can also first by such as search engine, browser, offer to The application programming interfaces (API) for surveying the third party devices such as keyword equipment obtain one or more keyword to be measured to be processed, Using as the target keyword;Then, it according to the searching order routing information of the target keyword, is closed from one or more Target critical term clustering belonging to the target keyword is determined in keyword cluster.Here, in step sl, classification determines equipment 1 determine the mode of target critical term clustering belonging to the target keyword and it is aforementioned in step sl, classification determines that equipment 1 is fixed The mode of target critical term clustering belonging to the target keyword is same or similar, and for simplicity, therefore details are not described herein, And include by reference and this.
Then, in step s 2, classification determine equipment 1 can also first according to the target critical term clustering, determine described in Relevance of searches classification corresponding to target keyword.Here, in step s 2, classification determines that equipment 1 determines that the target is closed Relevance of searches class corresponding to keyword otherwise with it is aforementioned in step s 2, classification determines that equipment 1 determines that the target is closed Relevance of searches class corresponding to keyword is same or similar otherwise, and for simplicity, therefore details are not described herein, and with reference Mode include and this.
Then, in step s 2, classification determines equipment 1 according to described search correlation classification, to the target keyword Carry out Screening Treatment.For example, it is assumed that classification determines that the target keyword that equipment 1 obtains includes such as in step S 1 Query1, query2, query3 and query4, and in step s 2, classification determines that equipment 1 determines the target keyword The corresponding described search correlation classification of query1, query2, query3 and query4 be respectively correlation is high, in correlation, Correlation is low, correlation is high, then in step s 2, classification determine equipment 1 according to target keyword query1, query2, The corresponding described search correlation classification of query3 and query4, carries out Screening Treatment to it, and it is low such as to belong to correlation The keyword query3 of classification is screened from keyword set to be measured, to carry out later period searching order Advance data quality to it.
Preferably, classification determines that equipment 1 further includes step S6 (not shown) and step S7 (not shown).Specifically, in step In rapid S6, classification determines that equipment 1 determines keyword set to be optimized corresponding to the keyword clustering;In the step s 7, class It Que Ding equipment 1 be public according to corresponding to the one or more keywords to be optimized that the keyword set to be optimized includes searches Rope sequence routing information determines the Optimizing Search sequence routing information of one or more of keywords to be optimized, for adjusting The searching order routing information of whole one or more of keywords to be optimized.
Specifically, in step s 6, classification determines the search for all keywords that equipment 1 can include according to keyword clustering Correlation description information, such as relevance of searches is high, relevance of searches is low, finds out the low classification of relevance of searches, using as institute State keyword set to be optimized corresponding to keyword clustering.Here, the keyword set to be optimized, which corresponds to, belongs to search The keyword of the low classification of correlation.For example, it is assumed that belonging to the key of the low classification of relevance of searches in keyword clustering cluster1 Word is sample keyword III, and the keyword that the low classification of relevance of searches is belonged in keyword clustering cluster2 is that sample is crucial The keyword of the low classification of relevance of searches, then in step s 6, class are not belonged in word IV and VI, keyword clustering cluster3 Not Que Ding equipment 1 can include into cluster3 by keyword clustering cluster1 all belong to the low classification of relevance of searches Keyword such as includes sample keyword III, IV and VI as the keyword set to be optimized.
Preferably, in step s 6, classification determines that equipment 1 can also be by all keywords included by the keyword clustering Actual search results relevant information be compared with system index information, from all keywords determination it is described to be optimized Keyword set.Here, the actual search results relevant information includes but is not limited to as returned to search result quantity, obtaining point The amount of hitting, the authority for returning to website, the quality degree of content of pages for returning to website etc..Here, the system index information includes The authority for such as returning to search result quantity, returning to website.For example, in step s 6, classification determines that equipment 1 can be by it in step All keywords, that is, sample keyword I included by the keyword clustering cluster1 to cluster3 determined in rapid S1 is extremely The actual search results relevant information of VI is compared with system index information, is determined from all keywords described to excellent Change keyword set, the actual search results relevant information of sample keyword I to VI is such as unsatisfactory for the pass of system index information Keyword, as the keyword set to be optimized.
Those skilled in the art will be understood that the mode of the above-mentioned determination keyword set to be optimized is only for example, other The mode of the determination keyword set to be optimized that is existing or being likely to occur from now on is such as applicable to the present invention, also should include Within the scope of the present invention, and it is incorporated herein by reference.
Then, in the step s 7, classification determines the one or more that equipment 1 includes according to the keyword set to be optimized The sequence routing information of common search corresponding to keyword to be optimized, determines the optimization of one or more of keywords to be optimized Searching order routing information, with the searching order routing information for adjusting one or more of keywords to be optimized.For example, Example is connected, in step s 6, the keyword set to be optimized that classification determines that equipment 1 determines includes sample keyword III, IV And VI, then in the step s 7, classification determines that equipment 1 can the sequence of the common search according to corresponding to sample keyword III, IV and VI Routing information such as S_C → S_D → S-G, as Optimizing Search sequence routing information, with one or more of for adjusting The searching order routing information of keyword to be optimized, as delete sample keyword III, IV and VI searching order routing information in Including the Optimizing Search sort routing information, alternatively, by the searching order routing information of sample keyword III, IV and VI Including the Optimizing Search sequence routing information by the keyword of the high classification of other searching order routing informations such as relevance of searches Corresponding common search sequence routing information replaces.
Fig. 4 show in accordance with a preferred embodiment of the present invention for determining relevance of searches corresponding to target keyword The method flow diagram of classification.
Wherein, classification determines that equipment 1 includes step S1 ', step S2 ', step S3 ', step S4 ' and step S5 '.Specifically Ground, in step S1 ', classification determines equipment 1 according to the searching order routing information of target keyword, from one or more keys Target critical term clustering belonging to the target keyword is determined in term clustering;In step S2 ', classification determines 1 basis of equipment The target critical term clustering determines relevance of searches classification corresponding to the target keyword, to be used for subsequent processing;? In step S3 ', classification determines that equipment 1 determines preferred searching order routing information corresponding to the target keyword;In step In S4 ', classification determines that equipment 1 according to the preferred searching order routing information, adjusts the searching order of the target keyword Routing information;If search sequence matches with the target keyword, in step S5 ', after classification determines equipment 1 according to adjustment The target keyword searching order routing information, search result corresponding to the search sequence is supplied to described look into Ask user corresponding to sequence.Here, to correspond to step with shown in Fig. 3 respectively same or similar by step S1 ' and step S2 ', so Place repeats no more, and is incorporated herein by reference.
Specifically, in step S3 ', classification determines that equipment 1 determines preferred search row corresponding to the target keyword Sequence routing information.Here, in step S3 ', classification determine the mode of preferred searching order routing information described in equipment 1 include but It is not limited to following at least any one:
1) by the sequence of each common search with reference to corresponding to keyword included by target critical term clustering path Information, as the preferred searching order routing information.For example, it is assumed that classification determines that equipment 1 determines target in step S1 ' Target critical term clustering belonging to keyword goal-query is keyword clustering cluster1, then in step S3 ', classification is true Locking equipment 1 public with reference to corresponding to keyword, that is, sample keyword I, III and VI can be searched each included by cluster1 Rope sequence routing information such as S_A → S_C → S_D → S_E → S_G, as the preferred searching order routing information.
2) to each searching order routing information with reference to corresponding to keyword included by the target critical term clustering Processing for statistical analysis, with the determination preferred searching order routing information, as counted the search row for showing that keyword is passed by The frequency is met the searching order path of predetermined threshold by sequence path frequency information, or using high frequency searching order path as institute State preferred searching order routing information.For example, connecting example, in step S3 ', classification determines that equipment 1 can be by cluster1 packet The each searching order routing information place for statistical analysis with reference to corresponding to keyword i.e. sample keyword I, III and VI included Reason, by the frequency meet predetermined threshold for example frequency of occurrence meet 2 searching order path such as S_A → S_C → S_D → S_E → S_G → S_F, as the preferred searching order routing information.
Those skilled in the art will be understood that the mode of the above-mentioned determination preferred searching order routing information is only for example, The mode of other determination preferred searching order routing informations that are existing or being likely to occur from now on is such as applicable to the present invention, It should also be included within the scope of protection of the present invention, and be incorporated herein by reference.
In step S4 ', classification determines that equipment 1 according to the preferred searching order routing information, adjusts the target and closes The searching order routing information of keyword.For example, it is assumed that classification determines the preferred search row that equipment 1 determines in step S3 ' Sequence routing information is S_A → S_C → S_D → S_E → S_G, then in step S4 ', classification determines equipment 1 according to the preferred search Sort routing information, by such as adjustment algorithm or machine learning model such as SVM model, adjusts target keyword goal- The searching order routing information of target keyword goal-query is such as adjusted to preferential by the searching order routing information of query Execute the preferred searching order routing information S_A → S_C → S_D → S_E → S_G.
Those skilled in the art will be understood that the mode of the above-mentioned determination keyword set to be optimized is only for example, other The mode of the determination keyword set to be optimized that is existing or being likely to occur from now on is such as applicable to the present invention, also should include Within the scope of the present invention, and it is incorporated herein by reference.
If search sequence matches with the target keyword, in step S5 ', after classification determines equipment 1 according to adjustment The target keyword searching order routing information, search result corresponding to the search sequence is supplied to described look into Ask user corresponding to sequence.Specifically, in step S5 ', classification determines that equipment 1 obtains search sequence first;Then, judge Whether the search sequence matches with the target keyword, if matching, in step S5 ', classification determines equipment 1 according to tune Search result corresponding to the search sequence is supplied to institute by the searching order routing information of the target keyword after whole State user corresponding to search sequence.Here, the matched meaning includes search sequence and the target keyword complete one It causes, search sequence is contained in the target keyword.
Specifically, in step S5 ', classification determines equipment 1 first by the dynamic web page techniques such as ASP, JSP, Huo Zhetong The application programming interfaces (API) of search engine offer are provided, the search sequence that user is inputted by user equipment is obtained.For example, if Search user B inputs keyword " fresh flower " by its PC equipment in search engine search column, by "enter" key", in step S5 ' In, classification determines that equipment 1 by dynamic web page techniques such as ASP, JSP or PHP, can get search user B input Keyword " fresh flower ".It will be understood by those skilled in the art that the mode of above-mentioned acquisition search sequence is only for example, other it is existing or The mode for the acquisition search sequence being likely to occur from now on is such as applicable to the present invention, should also be included in the scope of the present invention with It is interior, and be incorporated herein by reference.
Then, in step S5 ', classification determines the target keyword that equipment 1 obtains in step S 1 ' according to it, In such a way that text compares, judge whether the search sequence matches with the target keyword.
If matching, in step S5 ', classification determines equipment 1 according to the searching order of the target keyword adjusted Search result corresponding to the search sequence is supplied to user corresponding to the search sequence by routing information.For example, connecing Upper example, it is assumed that in step S 1 ', classification determines the target keyword such as " fresh flower " of the acquisition of equipment 1, " fresh flower express delivery ", Then in step S5 ', classification determines that equipment 1 judges search sequence " fresh flower " and target keyword such as " fresh flower ", " fresh flower express delivery " Match, then, in step S5 ', classification determines search of the equipment 1 by search engine according to the target keyword adjusted Sort search result corresponding to routing information such as " fresh flower Baidu discussion bar ", " fresh flower picture materials day is off line ", as inquiry Search result corresponding to sequence " fresh flower ", and pass through the dynamic web page techniques such as ASP, JSP or PHP or other agreements Communication mode, such as http or https communication protocol is supplied to user corresponding to the search sequence i.e. user B, such as should The user equipment of user is browsed for user.
It should be noted that the present invention can be carried out in the assembly of software and/or software and hardware, for example, can adopt With specific integrated circuit (ASIC), general purpose computer or any other realized similar to hardware device.In one embodiment In, software program of the invention can be executed to implement the above steps or functions by processor.Similarly, of the invention Software program (including relevant data structure) can be stored in computer readable recording medium, for example, RAM memory, Magnetic or optical driver or floppy disc and similar devices.In addition, some of the steps or functions of the present invention may be implemented in hardware, example Such as, as the circuit cooperated with processor thereby executing each step or function.
In addition, a part of the invention can be applied to computer program product, such as computer program instructions, when its quilt When computer executes, by the operation of the computer, it can call or provide according to the method for the present invention and/or technical solution. And the program instruction of method of the invention is called, it is possibly stored in fixed or moveable recording medium, and/or pass through Broadcast or the data flow in other signal-bearing mediums and transmitted, and/or be stored according to described program instruction operation In the working storage of computer equipment.Here, according to one embodiment of present invention including a device, which includes using Memory in storage computer program instructions and processor for executing program instructions, wherein when the computer program refers to When enabling by processor execution, method and/or skill of the device operation based on aforementioned multiple embodiments according to the present invention are triggered Art scheme.
It is obvious to a person skilled in the art that invention is not limited to the details of the above exemplary embodiments, Er Qie In the case where without departing substantially from spirit or essential attributes of the invention, the present invention can be realized in other specific forms.Therefore, no matter From the point of view of which point, the present embodiments are to be considered as illustrative and not restrictive, and the scope of the present invention is by appended power Benefit requires rather than above description limits, it is intended that all by what is fallen within the meaning and scope of the equivalent elements of the claims Variation is included in the present invention.Any reference signs in the claims should not be construed as limiting the involved claims.This Outside, it is clear that one word of " comprising " does not exclude other units or steps, and odd number is not excluded for plural number.That states in device claim is multiple Unit or device can also be implemented through software or hardware by a unit or device.The first, the second equal words are used to table Show title, and does not indicate any particular order.

Claims (18)

1. a kind of for determining relevance of searches class method for distinguishing corresponding to target keyword, wherein this method includes following Step:
A determines that the target is closed from one or more keyword clusterings according to the searching order routing information of target keyword Target critical term clustering belonging to keyword, wherein described search sequence routing information is for showing that search engine is determining candidate The sort algorithm code path information passed through in search results ranking information process;
B determines relevance of searches classification corresponding to the target keyword, after being used for according to the target critical term clustering Continuous processing, wherein the meaning of relevance of searches refers to the matching degree of keyword and search result, described search correlation classification Including in high correlation classification, low correlation classification, uncorrelated classification, cheating keyword categories any one of at least.
2. according to the method described in claim 1, wherein, the keyword clustering includes for characterizing the keyword clustering Class searching order routing information;
Wherein, the step a includes:
Described in corresponding to the searching order routing information of the target keyword and one or more of keyword clusterings Class searching order routing information is compared, and is searched for the searching order routing information of the determination target keyword and the class The smallest edit distance of sequence routing information;
According to the smallest edit distance, determine that the searching order routing information of the target keyword and class search are arranged The sequence similarity of paths of sequence routing information;
According to the sequence similarity of paths, the target critical term clustering is determined.
3. method according to claim 1 or 2, wherein this method further include:
X determines preferred searching order routing information corresponding to the target keyword;
According to the preferred searching order routing information, the searching order routing information of the target keyword is adjusted;
Wherein, this method further include:
If search sequence matches with the target keyword, according to the searching order road of the target keyword adjusted Search result corresponding to the search sequence is supplied to user corresponding to the search sequence by diameter information.
4. according to the method described in claim 3, wherein, the step x includes:
Each sequence of the common search with reference to corresponding to keyword path letter according to included by the target critical term clustering Breath, determines the preferred searching order routing information.
5. according to the method described in claim 3, wherein, the step x includes:
It unites to each searching order routing information with reference to corresponding to keyword included by the target critical term clustering Analysis processing is counted, with the determination preferred searching order routing information.
6. method according to claim 1 or 2, wherein this method further include:
Y determines keyword set to be optimized corresponding to the keyword clustering;
According to common search sequence corresponding to the one or more keyword to be optimized that the keyword set to be optimized includes Routing information determines the Optimizing Search sequence routing information of one or more of keywords to be optimized, with described for adjusting The searching order routing information of one or more keywords to be optimized.
7. according to the method described in claim 6, wherein, the step y includes:
By the actual search results relevant information of all keywords included by the keyword clustering and system index information into Row compares, and the keyword set to be optimized is determined from all keywords.
8. method according to claim 1 or 2, wherein the step a includes:
One or more keywords to be measured to be processed are obtained, using as the target keyword;
According to the searching order routing information of the target keyword, the mesh is determined from one or more keyword clusterings Mark target critical term clustering belonging to keyword;
Wherein, the step b includes:
According to the target critical term clustering, relevance of searches classification corresponding to the target keyword is determined;
According to described search correlation classification, Screening Treatment is carried out to the target keyword.
9. a kind of classification for determining relevance of searches classification corresponding to target keyword determines equipment, wherein the category Determine that equipment includes:
Determining device is clustered, for the searching order routing information according to target keyword, from one or more keyword clusterings Target critical term clustering belonging to the middle determination target keyword, wherein described search sequence routing information is for showing to search Index holds up the sort algorithm code path information passed through in determining candidate search sort result information process;
Classification determining device, for determining search phase corresponding to the target keyword according to the target critical term clustering Closing property classification, to be used for subsequent processing, wherein the meaning of relevance of searches refers to the matching degree of keyword and search result, Described search correlation classification include high correlation classification, low correlation classification, uncorrelated classification, cheating keyword categories in extremely It is any one of few.
10. classification according to claim 9 determines equipment, wherein the keyword clustering includes for characterizing the pass The class searching order routing information of keyword cluster;
Wherein, the cluster determining device includes:
Comparing unit, for by the searching order routing information of the target keyword and one or more of keyword clusterings The corresponding class searching order routing information is compared, with the searching order routing information of the determination target keyword With the smallest edit distance of the class searching order routing information;
Similarity determining unit, for determining the searching order path of the target keyword according to the smallest edit distance The sequence similarity of paths of information and the class searching order routing information;
Determination unit is clustered, for determining the target critical term clustering according to the sequence similarity of paths.
11. classification according to claim 9 or 10 determines equipment, wherein the category determines equipment further include:
Preferred path determining device, for determining preferred searching order routing information corresponding to the target keyword;
Device is adjusted, for adjusting the searching order road of the target keyword according to the preferred searching order routing information Diameter information;
Wherein, the category determines equipment further include:
Device is provided, if matching for search sequence and the target keyword, according to the target keyword adjusted Searching order routing information, search result corresponding to the search sequence is supplied to use corresponding to the search sequence Family.
12. classification according to claim 11 determines equipment, wherein the preferred path determining device is used for:
Each sequence of the common search with reference to corresponding to keyword path letter according to included by the target critical term clustering Breath, determines the preferred searching order routing information.
13. classification according to claim 11 determines equipment, wherein the preferred path determining device is used for:
It unites to each searching order routing information with reference to corresponding to keyword included by the target critical term clustering Analysis processing is counted, with the determination preferred searching order routing information.
14. classification according to claim 9 or 10 determines equipment, wherein the category determines equipment further include:
Gather determining device, for determining keyword set to be optimized corresponding to the keyword clustering;
Path optimizing determining device, the one or more keyword to be optimized for including according to the keyword set to be optimized Corresponding common search sequence routing information determines the Optimizing Search sequence path of one or more of keywords to be optimized Information, with the searching order routing information for adjusting one or more of keywords to be optimized.
15. classification according to claim 14 determines equipment, wherein the set determining device is used for:
By the actual search results relevant information of all keywords included by the keyword clustering and system index information into Row compares, and the keyword set to be optimized is determined from all keywords.
16. classification according to claim 9 or 10 determines equipment, wherein the cluster determining device is used for:
One or more keywords to be measured to be processed are obtained, using as the target keyword;
According to the searching order routing information of the target keyword, the mesh is determined from one or more keyword clusterings Mark target critical term clustering belonging to keyword;
Wherein, the classification determining device is used for:
According to the target critical term clustering, relevance of searches classification corresponding to the target keyword is determined;
According to described search correlation classification, Screening Treatment is carried out to the target keyword.
17. a kind of for determining the search engine of relevance of searches classification corresponding to target keyword, wherein the search engine Equipment is determined including the classification as described in any one of claim 9 to 16.
18. a kind of for determining the search engine plug-in unit of relevance of searches classification corresponding to target keyword, wherein the search Engine plug-in unit includes that the classification as described in any one of claim 9 to 16 determines equipment.
CN201210581476.XA 2012-12-27 2012-12-27 The method and apparatus for determining relevance of searches classification corresponding to target keyword Active CN103902597B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210581476.XA CN103902597B (en) 2012-12-27 2012-12-27 The method and apparatus for determining relevance of searches classification corresponding to target keyword

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210581476.XA CN103902597B (en) 2012-12-27 2012-12-27 The method and apparatus for determining relevance of searches classification corresponding to target keyword

Publications (2)

Publication Number Publication Date
CN103902597A CN103902597A (en) 2014-07-02
CN103902597B true CN103902597B (en) 2019-03-08

Family

ID=50993925

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210581476.XA Active CN103902597B (en) 2012-12-27 2012-12-27 The method and apparatus for determining relevance of searches classification corresponding to target keyword

Country Status (1)

Country Link
CN (1) CN103902597B (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9892167B2 (en) * 2015-03-31 2018-02-13 Rovi Guides, Inc. Methods and systems for generating cluster-based search results
CN105184616B (en) * 2015-09-29 2020-06-19 北京奇艺世纪科技有限公司 Method and device for directionally delivering business object
CN106610989B (en) * 2015-10-22 2021-06-01 北京国双科技有限公司 Search keyword clustering method and device
CN105976821B (en) * 2016-06-17 2020-02-07 北京小米移动软件有限公司 Animal language identification method and device
US10318562B2 (en) 2016-07-27 2019-06-11 Google Llc Triggering application information
CN106294618A (en) * 2016-08-01 2017-01-04 北京百度网讯科技有限公司 Searching method and device
CN106599027A (en) * 2016-11-01 2017-04-26 四川用联信息技术有限公司 Method for realizing keyword optimization based on improved ant colony algorithm
CN108062337B (en) * 2016-11-09 2021-03-16 北京国双科技有限公司 Method and device for labeling crawler seeds
CN108319626B (en) * 2017-01-18 2022-06-03 阿里巴巴集团控股有限公司 Object classification method and device based on name information
CN107590169B (en) * 2017-04-14 2020-03-06 南方科技大学 Operator gateway data preprocessing method and system
CN107977405B (en) * 2017-11-16 2021-01-22 北京三快在线科技有限公司 Data sorting method, data sorting device, electronic equipment and readable storage medium
WO2020061910A1 (en) * 2018-09-27 2020-04-02 北京字节跳动网络技术有限公司 Method and apparatus used for generating information
CN111309947A (en) * 2018-12-11 2020-06-19 华勤通讯技术有限公司 Picture display method and system of mobile terminal
CN109918624B (en) * 2019-03-18 2022-10-04 北京搜狗科技发展有限公司 Method and device for calculating similarity of webpage texts
CN111507400B (en) * 2020-04-16 2023-10-31 腾讯科技(深圳)有限公司 Application classification method, device, electronic equipment and storage medium
CN111523036B (en) * 2020-04-24 2023-12-19 北京百度网讯科技有限公司 Search behavior mining method and device and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101059805A (en) * 2007-03-29 2007-10-24 复旦大学 Network flow and delaminated knowledge library based dynamic file clustering method
CN101079026A (en) * 2007-07-02 2007-11-28 北京百问百答网络技术有限公司 Text similarity, acceptation similarity calculating method and system and application system
CN101639831A (en) * 2008-07-29 2010-02-03 华为技术有限公司 Search method, search device and search system
CN102411580A (en) * 2010-09-20 2012-04-11 腾讯科技(深圳)有限公司 Retrieval method and device for extensible markup language (XML) files

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101059805A (en) * 2007-03-29 2007-10-24 复旦大学 Network flow and delaminated knowledge library based dynamic file clustering method
CN101079026A (en) * 2007-07-02 2007-11-28 北京百问百答网络技术有限公司 Text similarity, acceptation similarity calculating method and system and application system
CN101639831A (en) * 2008-07-29 2010-02-03 华为技术有限公司 Search method, search device and search system
CN102411580A (en) * 2010-09-20 2012-04-11 腾讯科技(深圳)有限公司 Retrieval method and device for extensible markup language (XML) files

Also Published As

Publication number Publication date
CN103902597A (en) 2014-07-02

Similar Documents

Publication Publication Date Title
CN103902597B (en) The method and apparatus for determining relevance of searches classification corresponding to target keyword
CN107862022B (en) Culture resource recommendation system
CN102792262B (en) Use the method and system of claim analysis sequence intellectual property document
CN100465954C (en) Reinforced clustering of multi-type data objects for search term suggestion
US7739221B2 (en) Visual and multi-dimensional search
US9031944B2 (en) System and method for providing multi-core and multi-level topical organization in social indexes
CN109189904A (en) Individuation search method and system
US20160299981A1 (en) Search engine that applies feedback from users to improve search results
CN103838754B (en) Information retrieval device and method
US20080005105A1 (en) Visual and multi-dimensional search
WO2017000610A1 (en) Webpage classification method and apparatus
CN106156372B (en) A kind of classification method and device of internet site
CN106204156A (en) A kind of advertisement placement method for network forum and device
CN107844533A (en) A kind of intelligent Answer System and analysis method
CN110706015B (en) Feature selection method for advertisement click rate prediction
CN106709037A (en) Movie recommendation method based on heterogeneous information network
CN103150369A (en) Method and device for identifying cheat web-pages
Zhou et al. Relevance feature mapping for content-based multimedia information retrieval
Nasraoui et al. A framework for mining evolving trends in web data streams using dynamic learning and retrospective validation
CN111523055A (en) Collaborative recommendation method and system based on agricultural product characteristic attribute comment tendency
Jaho et al. ISCoDe: a framework for interest similarity-based community detection in social networks
CN104462259B (en) It is a kind of for providing the method and apparatus of timeliness picture search result
CN112488716A (en) Abnormal event detection system
CN103778206A (en) Method for providing network service resources
CN108764324A (en) A kind of text data immediate processing method based on K-Means algorithms and co-occurrence word

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant