WO2012052983A1 - Method for scoring and ranking search engine keywords at a website - Google Patents

Method for scoring and ranking search engine keywords at a website Download PDF

Info

Publication number
WO2012052983A1
WO2012052983A1 PCT/IL2010/000861 IL2010000861W WO2012052983A1 WO 2012052983 A1 WO2012052983 A1 WO 2012052983A1 IL 2010000861 W IL2010000861 W IL 2010000861W WO 2012052983 A1 WO2012052983 A1 WO 2012052983A1
Authority
WO
WIPO (PCT)
Prior art keywords
website
search
keywords
search engine
method
Prior art date
Application number
PCT/IL2010/000861
Other languages
French (fr)
Inventor
Noam Fine
Joseph Marouani
Original Assignee
Simplygen Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Simplygen Ltd. filed Critical Simplygen Ltd.
Priority to US25313510P priority Critical
Priority to US61/253,135 priority
Publication of WO2012052983A1 publication Critical patent/WO2012052983A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3322Query formulation using system suggestions
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Abstract

The invention provides a method for scoring the suitability of one or more keywords to a website search engine. For each of the one or more keywords, a search of the website is performed by the search engine, and a score is calculated indicative of the suitability of the keyword to the website in a calculation involving search results returned by the search engine. The scoring of keywords may be used to rank keywords according to their calculated scores. Ranked keywords may used in autocompletion of data input to a website search engine query box at a website. As a user enters a keyword into a search engine query box at a website, the user is presented with a menu of keywords consistent with the data being input by the user where the keywords are appear in the menu in an order determined by their rank.

Description

2010/000861

1

METHOD FOR SCORING AND RANKING SEARCH ENGINE KEYWORDS

AT A WEBSITE

FIELD OF THE INVENTION

This invention relates to methods for performing searches at a website.

BACKGROUND OF THE INVENTION

An internet website is commonly searched for material of interest to a user by searching the website for the occurrence of one or more keywords. A local, or site specific, search engine is usually provided by the website. A user will input keywords relevant to the material of interest, and the search engine will return a list of search results determined by the specifications of the search engine. It may happen that material of interest to the user in the website is not revealed by the search engine due to inherent limitations of the search engine. Due to these inherent limitations of the search engine, some keywords are more suited for use with the search engine than other keywords.

SUMMARY OF THE INVENTION

In its first aspect, the present invention provides a system and method for scoring the suitability of one or more predetermined keywords for use with a website search engine. Keywords having a relatively high score are more suitable for the search engine and will tend to return a higher proportion of the material of interest in the website than keywords having a low score. In accordance with this aspect of the invention, for each keyword, a search of the website is performed by the website search engine, and a score is assigned to the keyword indicative of the suitability of the keyword for use by the search engine. In one embodiment, a scoring scheme is used that assigns a score indicative of a magnitude of the search results returned by the search engine. A score of 1 may be assigned if the search produced one or more search results, while a score of 0 may be assigned if the search produced no search results. The magnitude may be, for example, the number of search results returned by the search engine, the number of pages of the search results, the number of lines in the search results, or the number of links in the search results. The magnitude may first be normalized to a distribution of search result magnitudes obtained by searching with a plurality of keywords. Determination of the number of page or the number of lines the search results may be determined, for example, by a HTML parser that can count the number of elements on the page. In another embodiment, searches by the website search engine using the keyword initiated by users are logged over a time period. For each search performed using the keyword, the user's response is logged, and the score is calculated based on a cumulative assessment of the responses of the users to the search results. For example, the score may be indicative of the number of times a user utilized the search results (for example by opening a link appearing in the search results) in comparison to the number of times a user ignored the search results (for example, by returning to a previous page or performing another search with another keyword).

In its second aspect, the invention provides a system and method for ranking the relevance of two or more predetermined keywords to a website. In accordance with this aspect of the invention, each of the two or more predetermined keywords is scored for relevance to the website by the method of the invention, and the two or more keywords are then ranked according to their score. In one embodiment, the final ranking of the keywords may involve, in addition to the scores, a preliminary ranking assigned to the keywords.

The ranking of the keywords may be used for example in an autocomplete feature of the search engine. As a user enters into a search engine query box at a website a search keyword, a menu of keywords is presented to the user consistent with the keyword data being input (eg keywords beginning with the letters successively inputed) and ranked by the method of the invention. In this way, the user is presented with keywords in their order of suitability to the search engine. Thus, in another of its aspects, the present invention provides a method for autocompletion which suggests to the user keywords most suitable for the search engine.

The ranking of the keywords may also be used to score the suitability of a search engines to a website. The score of a search engine is calculated from the scores of P T/IL2010/000861

3

searches returned by the search engine for a plurality of predetermined keywords, by the method of the invention. Thus, in another of its aspects, the invention provides a method for scoring the suitability of a search engine to a website. For a given website, the scores calculated for two or more search engines may be used to rank the search engines for suitability to the website. Similarly, for a given search engine, the score calculated for the search engine at two or more different websites may be used to assign an overall score to the search engine.

Thus, in its first aspect, the invention provides a method for scoring the suitability of one or more keywords to a website search engine comprising:

(a) for each of the one or more keywords, performing a search of the website by the search engine;

(b) calculating a score indicative of the suitability of the keyword to the website in a calculation involving search results returned by the search engine.

In the method of scoring according to the invention, the calculation of the suitability score may involve a magnitude of the search results returned by the search engine. In one embodiment, a first value is calculated for the score if the magnitude of the search results is above a threshold, and a second value is calculated if the magmtude of the search results is below the threshold, where the first value is different from the second value. The magnitude may be any one or more of a number of search results returned by the search engine, a number of pages of the search results, a number of lines in the search results, and a number of links in the search results.

The method of scoring keywords in accordance with the invention may further comprise:

(a) logging searches requested by users using the website search engine over a time period,

(b) for each search performed by a user, logging the user's response to the search results;

(c) calculating the score in a calculation involving a cumulative assessment of the responses of the users to the search results.

The score may be indicative of a number of times a user utilized the search results in comparison to a number of times the user did not utilize the search results. For example, utilizing the search results may comprise opening a link appearing in the search results. 861

4

In another of its aspects, the invention provides a method for ranking relevance of two or more predetermined keywords to a website search engine comprising:

(a) for each of the two or more predetermined keywords, scoring the keyword for suitability to the website search engine by the method of the invention;

(b) ranking the two or more keywords according to their scores.

The ranking of the keywords may further involve a preliminary ranking assigned to the keywords.

In another of its aspects, the invention provides a method for autocompletion of data input to a website search engine query box at a website comprising:

(a) as a user enters a keyword into a search engine query box at a website, presenting to the user a menu of keywords consistent with the data being input by the user where the keywords are ranked according to suitability to the website search engine by the method of the invention.

The invention also provides a method for scoring the suitability of one or more predetermined search engines to a website; comprising,:

(a) calculating the suitability of a plurality of predetermined keywords to the search engine by the method of the invention;

(b) calculating a score of the suitability of the of the search engine for the website in a calculation involving the suitability scores of the predetermined keywords.

The invention still further provides a method for ranking two or more search engines for suitability to a predetermined website comprising:

(a) calculating the scores for each of the two or more search engines by the method of the invention;

(b) using the scores calculated in step (a) to rank the two or more search engines.

The invention also provides a method to calculate a score of a search engine comprising:

(a) for each of one or more predetermined websites, calculating a score for suitability of the search engine to the website by the method of the invention;

(b) calculating a score of the search engine based upon the scores calculated in step (a). IL2010/000861

5

BRIEF DESCRIPTION OF THE DRAWINGS

In order to understand the invention and to see how it may be carried out in practice, embodiments will now be described, by way of non-limiting example only, with reference to the accompanying drawings, in which:

Fig. 1 shows a flow chart of a method for scoring the suitability of one or more predetermined keywords to a website search engine, in accordance with one embodiment of the invention;

Fig. 2 shows a webpage of a website showing results of a search of the website using the keyword "shlomo ";

Fig. 3 shows the webpage showing results of a search of the website using the keyword "madonna";

Fig. 4 shows a method for ranking the suitability of two or more predetermined keywords to a search engine in accordance with one embodiment of the invention;

Fig. 5 shows a method for autocompletion of keywords input to a search engine at a website in accordance with one embodiment of the invention;

Fig. 6 shows a method for scoring the suitability of a predetermined search engine to a website in accordance with one embodiment of the invention;

Fig. 7 shows a method for ranking two or more search engines for suitability to a predetermined website; and

Fig. 8 shows a method for assigning a quality score to a search engine.

DETAILED DESCRIPTION OF EMBODIMENTS

Fig. 1 shows a method 2 for scoring the suitability of one or more predetermined keywords to a website search engine, in accordance with one embodiment of the invention. In step 4, the keyword is input to the search engine, and a search of the website is performed for the keyword using the search engine (step 6). The search results are received (step 8) and a score of the suitability of the keyword to the website is calculated based in the search results.

Fig. 2 shows a webpage 50 of a website. The website has a search engine that is actvated when a user inputs a keyword 52 into a search window 52. In the example shown in Fig. 2, a user has input the keyword "shlomo ". The user then activated the 2010/000861

6

website search engine by clicking of the search button 54. Upon conducting a search of the website using the input keyword "shlomo", the search engine displayed on the webpage 50 the the search result 56 "sorry no results for 'shlomo "'.

Fig. 3 shows again the webpage 50. In the example fo Fig. 3, a user has input the keyword "madonna". The user then activated the website search engine by clicking of the search button 54. Upon conducting a search of the website using the input keyword "madonna", the search engine displayed on the webpage 50 the the search results 66. An analysis of the search results would yield the following paramter values:

Number of links : 20

Number of words : 1035

Number of Lines: 80

Page length:3000 pixels

Pattern of "no results" found in page : false

Pattern of "Paging" found in page : true

Number of pages : 10

Thus, keyword "madonna" is a better keyword for the website search engine than the keyword " hlomo. Hence, the keyword "madonna" is assigned a higher score than the keyword " hlomo ".

The calculation of a score of a keyword may be based upon a statistical analysis of keyword searches performed by searching the website and using each of a large number of test keywords, and compiling statistics relating to the search results. For example, the statistical analysis may involve any one or more of the following parameters:

The average number of links in the search results.

The average number of words in the search results.

The average number of lines in the search results.

The average page length: in the search results.

The average number of pages in the search results.

The average of assignging a value of "1" to a search result if the result includes at lelast one hit and assigning a value of "0" of the results did not include any hits.

The average of assignging a value of "1" to a search result if the search results include more than one page and assigning a value of "0" if the search results include exactly one page. The calculation of a score of a keyword may involve a minimum value of any one of or more paramters observed using the test keywords. For example, calculation may involve any one or more of the following paramters.

The minimum number of links in the search results.

The minimum number of words in the search results.

The minimum number of lines in the search results.

The minimum page length: in the search results.

The minimum number of pages in the search results.

The minimum of assignging a value of "1" to a search result if the result includes at lelast one hit and assigning a value of "0" of the results did not include any hits.

The minimum of assignging a value of "1" to a search result if the search results include more than one page and assigning a value of "0" if the search results include exactly one page.

Thus, in one example of the scoring of a keyword, the following algebraic expression is used:

sqrt( umber of links/minimum number of links)*

sqrt(abs(Number of words-number of words in the average case)/min base case number of word)

+ (page length-minimum page length)/l 00

+ (pattern of pages found:number of pages/5;0)

*( pattern of no results found:0;l);

Wherein "sqrt" indicates the square root, and "abs" indicates the absolute value. In Fig. 4 shows a method 12 for ranking the suitability of two or more predetermined keywords to a search engine in accordance with one embodiment of this aspect of the invention. In step 14 each of the keywords is scored for suitability for use by the search engine as described above. Then, in step 16, the keywords are ranked according to their scores. Thus, in the above example, the keyword "madonna " receives a higher ranking than the keyword "shlomo".

Fig. 5 shows a method 20 for autocompletion of keywords input to a search engine at a website in accordance with one embodiment of this aspect of the invention. A predetermined list of keywords is ranked for suitability for the search engine ine (step 22). Then, when a keyword is being input to the search engine by a user, keywords from the predetermined list of keywords consistent with the input letters is displayed to the user in the order of the ranking of the keywords.

Fig. 6 shows a method 26 for scoring the suitability of a predetermined search engine to a website in accordance with one embodiment of the invention. In step 28, for each of one or more keywords, a search of the website is performed using the search engine. The results of each search performed are received (step 30), and a score is calculated for each search (step 32), as explained above. A score of the suitability of the search engine to the website is then calculated in a calculation involving the scores of the searches. The sutability score maybe, for example, the average of the search scores.

Fig. 7 shows a method 34 for ranking two or more search engines for suitability to a predetermined website. In step 36, for each of the two or more search engines, a score of the suitability of the search engine to the website is calculated as explained above. Then in step 38, the two or more search engines are ranked according to their scores

Fig. 8 shows a method 40 for assigning a quality score to a search engine. In step 42, for each of one or more websites, a search of the website is conducted by the search engine using one or more predetermined keywords. For each search performed, a suitability score of the search engine to the website is calculated as described above (step 44). The score calculated for the search engine at the one or more websites is then used to assign quality score to the search engine (step 46). The quality score may be, for example, the average of the suitability scores of the search engines to the one or more websites.

Claims

CLAIMS:
1. A method for scoring the suitability of one or more keywords to a website search engine comprising:
(a) for each of the one or more keywords, performing a search of the website by the search engine;
(b) calculating a score indicative of the suitability of the keyword to the website in a calculation involving search results returned by the search engine.
2. The method according to Claim 1 wherein the calculation of the suitability score involves a magnitude of the search results returned by the search engine.
3. The method according to Claim 2 wherein a first value is calculated for the score if the magnitude of the search results is above a threshold, and a second value is calculated if the magnitude of the search results is below the threshold, the first value being different from the second value.
4. The method according to Claim 2 or 3 wherein the magnitude is a number of search results returned by the search engine.
5. The method according to Claim 2 or 3 wherein the magnitude is a number of pages of the search results.
6. The method according to Claim 2 or 3 wherein the magnitude is a number of lines in the search results.
7. The method according to Claim 2 wherein the magnitude is a number of links in the search results.
8. The method according to any one of the previous claims further comprising:
(a) logging searches requested by users using the website search engine over a time period,
(b) for each search performed by a user, logging the user's response to the search results;
(c) calculating the score in a calculation involving a cumulative assessment of the responses of the users to the search results.
9. The method according to Claim 8 wherein the score is indicative of a number of times a user utilized the search results in comparison to a number of times the user did not utilize the search results.
10. The method according to Claim 9 wherein utilizing the search results comprises opening a link appearing in the search results.
11. A method for ranking relevance of two or more predetermined keywords to a website search engine comprising:
(a) for each of the two or more predetermined keywords, scoring the keyword for suitability to the website search engine by the method of any one of claims 1 to 10;
(b) ranking the two or more keywords according to their scores.
12. The method according to Claim 11 wherein the ranking of the keywords further involves a preliminary ranking assigned to the keywords.
13. A method for autocompletion of data input to a website search engine query box at a website comprising:
(a) as a user enters a keyword into a search engine query box at a website, presenting to the user a menu of keywords consistent with the data being input by the user where the keywords are listed in the menu in an order determined by the ranking of the keywords, wherein the keywords are ranked according to suitability to the website search engine by the method of Claim 11.
14. A method for scoring the suitability of one or more predetermined search engines to a website; comprising,:
(a) calculating the suitability of a plurality of predetermined keywords to the search engine by the method of any one of Claims 1 to 11;
(b) calculating a score of the suitability of the of the search engine for the website in a calculation involving the suitability scores of the predetermined keywords.
15. A method for ranking two or more search engines for suitability to a predetermined website comprising:
(a) calculating the scores for each of the two or more search engines by the method of Claim 14;
(b) using the scores calculated in step (a) to rank the two or more search engines.
16. A method to calculate a score of a search engine comprising: (a) for each of one or more predetermined websites, calculating a score for suitability of the search engine to the website by the method of Claim 14;
(b) calculating a score of the search engine based upon the scores calculated in step (a).
PCT/IL2010/000861 2010-10-20 2010-10-20 Method for scoring and ranking search engine keywords at a website WO2012052983A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US25313510P true 2010-10-20 2010-10-20
US61/253,135 2010-10-20

Publications (1)

Publication Number Publication Date
WO2012052983A1 true WO2012052983A1 (en) 2012-04-26

Family

ID=43608719

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2010/000861 WO2012052983A1 (en) 2010-10-20 2010-10-20 Method for scoring and ranking search engine keywords at a website

Country Status (1)

Country Link
WO (1) WO2012052983A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9031929B1 (en) 2012-01-05 2015-05-12 Google Inc. Site quality score
WO2015174997A1 (en) * 2014-05-13 2015-11-19 Intuit Inc. Ranking autocomplete results based on a business cohort
US10127314B2 (en) 2012-03-21 2018-11-13 Apple Inc. Systems and methods for optimizing search engine performance

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6169986B1 (en) * 1998-06-15 2001-01-02 Amazon.Com, Inc. System and method for refining search queries
US20070244866A1 (en) * 2006-04-18 2007-10-18 Mainstream Advertising, Inc. System and method for responding to a search request
US20080235187A1 (en) * 2007-03-23 2008-09-25 Microsoft Corporation Related search queries for a webpage and their applications
US20080243819A1 (en) * 2007-04-02 2008-10-02 Microsoft Corporation Search macro suggestions relevant to search queries

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6169986B1 (en) * 1998-06-15 2001-01-02 Amazon.Com, Inc. System and method for refining search queries
US20070244866A1 (en) * 2006-04-18 2007-10-18 Mainstream Advertising, Inc. System and method for responding to a search request
US20080235187A1 (en) * 2007-03-23 2008-09-25 Microsoft Corporation Related search queries for a webpage and their applications
US20080243819A1 (en) * 2007-04-02 2008-10-02 Microsoft Corporation Search macro suggestions relevant to search queries

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
None

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9031929B1 (en) 2012-01-05 2015-05-12 Google Inc. Site quality score
US9760641B1 (en) 2012-01-05 2017-09-12 Google Inc. Site quality score
US10127314B2 (en) 2012-03-21 2018-11-13 Apple Inc. Systems and methods for optimizing search engine performance
WO2015174997A1 (en) * 2014-05-13 2015-11-19 Intuit Inc. Ranking autocomplete results based on a business cohort
US10083252B2 (en) 2014-05-13 2018-09-25 Intuit Inc. Ranking autocomplete results based on a business cohort

Similar Documents

Publication Publication Date Title
KR101230687B1 (en) Link-based spam detection
US9116994B2 (en) Search engine optimization for category specific search results
US8407229B2 (en) Systems and methods for aggregating search results
US7685197B2 (en) System and methods for indentifying the potential advertising value of terms found on web pages
JP5431727B2 (en) Relevancy determination method, information collection method, objects organized method and retrieval system
US8452758B2 (en) Methods and systems for improving a search ranking using related queries
US9015176B2 (en) Automatic identification of related search keywords
US8244752B2 (en) Classifying search query traffic
US7873640B2 (en) Semantic analysis documents to rank terms
White et al. Assessing the scenic route: measuring the value of search trails in web logs
US20090182725A1 (en) Determining entity popularity using search queries
US8402021B2 (en) Providing posts to discussion threads in response to a search query
US20130173614A1 (en) Generation of refinement terms for search queries
US8095582B2 (en) Dynamic search engine results employing user behavior
US20080091670A1 (en) Search phrase refinement by search term replacement
US9171088B2 (en) Mining for product classification structures for internet-based product searching
US20050222989A1 (en) Results based personalization of advertisements in a search engine
US9697249B1 (en) Estimating confidence for query revision models
US20140280106A1 (en) Presenting comments from various sources
US8799265B2 (en) Semantically associated text index and the population and use thereof
US8321278B2 (en) Targeted advertisements based on user profiles and page profile
EP2560111A2 (en) Systems and methods for facilitating the gathering of open source intelligence
US8078602B2 (en) Search engine for a computer network
CN102625936B (en) Query suggestions from the document
CN103136329B (en) More integrated query revised model

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 10784588

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase in:

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 10784588

Country of ref document: EP

Kind code of ref document: A1