WO2001016807A1 - An internet search system for tracking and ranking selected records from a previous search - Google Patents

An internet search system for tracking and ranking selected records from a previous search

Info

Publication number
WO2001016807A1
WO2001016807A1 PCT/US2000/024286 US0024286W WO2001016807A1 WO 2001016807 A1 WO2001016807 A1 WO 2001016807A1 US 0024286 W US0024286 W US 0024286W WO 2001016807 A1 WO2001016807 A1 WO 2001016807A1
Authority
WO
Grant status
Application
Patent type
Prior art keywords
search
web
record
system
collection
Prior art date
Application number
PCT/US2000/024286
Other languages
French (fr)
Inventor
Aaron Clyde Crandall
Jason Wayne Bosarge
Louis Hernandez
Original Assignee
Mindcrossing, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor ; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • G06F17/30876Retrieval from the Internet, e.g. browsers by using information identifiers, e.g. encoding URL in specific indicia, browsing history
    • G06F17/30884Bookmark management
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/30Information retrieval; Database structures therefor ; File system structures therefor
    • G06F17/30861Retrieval from the Internet, e.g. browsers
    • G06F17/30864Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems

Abstract

A system and method that enable Internet users to access selected records retrieved from results set (510) that are derived from earlier search queries. The invention tracks and ranks selected records that users deemed valuable to a search query. When an Internet user submits a search query (400), the system creates three distinct but related queries for searching all accessible web sites, a collection database (512) and a rank database (514). The collection database includes a list of collections, i.e., records that other users deem relevant to a search topic and that are selected from result sets derived with earlier search queries; the collection is organized in a folder/file based hierarchical format. The rank database includes Uniform Resource Locators (URLs) for identifying web sites that are bookmarked by other users. Each record in the collection and rank databases has an associated score that is used to organize records retrieved from those databases. Thus, when the user submits a search query to the system, it returns selected records from the collection and rank databases, in addition to other related web sites from the Internet.

Description

AN INTERNET SEARCH SYSTEM FOR TRACKING AND RANKING SELECTED RECORDS FROM A PREVIOUS SEARCH

Field of the Invention This invention relates to a computer search system and more particularly to a search system and method in a computer network for sharing and ranking retrieved preselected records that are organized in a hierarchical format.

Background of the Invention

Advances in computer processing power and network communications have made information from a wide variety sources available to users on computer networks. Computer networking allows network computer users to share information, software applications and hardware devices and internetworking enables a set of physical networks to be connected into a single network such as the

Internet. The World Wide Web (Web), a hypermedia system used on the Internet, enables hypertext linking, whereby documents automatically reference or link other documents located on connected computer networks around the world. Thus, users connected to the Internet have almost instant access to information stored in relatively distant regions.

A page of information on the Web may include references to other Web pages and may include a broad range of multimedia data including textual, graphical, audio, and animation information. Currently, Internet users retrieve information from the Internet, through the Web, by 'visiting' a web site on a computer that is connected to the Internet.

The web site is, in general terms, a server application that displays information stored on a network server computer. The web site accepts connections from client programs, such as Internet browsers, and the client programs allow Internet users to access information displayed on the web site. As the number of physical networks connected to the Internet continue to grow, so too will the number of web sites that are accessible to Internet users. Presently, the only practical way to search the Internet for web sites that are related to a specific topic is to use an Internet search program or engine, such as AltaVista™ or Yahoo™.

In order to use a search engine to search for web sites related to a specific topic, the Internet user submits a search query containing search terms to a software program on a network server computer. The software program executes the search program to retrieve web sites that are related to the specified topic and to store the retrieved web sites in a result set. Search programs typically search all accessible web sites and index and rank retrieved web sites. While search engines typically implement different search techniques for indexing and ranking retrieved web sites, a common technique is for the search engine to count the number of times a search term appears on a web site. Some search engines also consider how often the search term appears in the beginning of a document because documents with search terms in the beginning are more likely to be relevant to the search topic.

Based on the search topic and the number of retrieved web sites, the user may be required to scan through thousands, if not hundreds of thousands, of records in the result set. Moreover, the user may be required to 'click' hyperlinks in each record and wait for the web site associated with the hyperlink to display its contents. It is only after the web site's contents are displayed and examined by the user that the user can determine if the web site is relevant to the information sought. Often, this can be a slow process and users may have to go to a lot of immaterial web sites in order to find relevant web sites. Additionally, the user has no way of determining if another user has performed a search on the same topic and no way of sharing already conducted search and research quickly and easily with other Internet users via an Internet search engine. Summary of the Invention The present invention relates to a system and method that enable Internet users to access selected records retrieved from result sets that are derived from earlier search queries. The invention tracks and ranks selected records that users deemed valuable to a search query. When an Internet user submits a search query, the system creates three distinct but related queries for searching all accessible web sites, a collection database and a rank database. The collection database includes a list of collections, i.e., records that other users deem relevant to a search topic and that are selected from result sets derived with earlier search queries; the collection is organized in a folder/file based hierarchical format. The rank database includes

Uniform Resource Locators (URLs) for identifying web sites that are bookmarked by other users. Each record in the collection and rank databases has an associated score that is used to organize records retrieved from those databases. Thus, when the user submits a search query to the system, it returns selected records from the collection and rank databases, in addition to other related web sites from the

Internet.

Specifically, when a user performs a search on a precise topic, the user selects records consisting of URLs and/or other data, such as files and images, from the retrieved result set, and organizes the selected records in a collection. Collection records containing bookmarks of URLs may also come from other sources, such as web sites that are manually bookmarked by the user or bookmarks that are uploaded through a computer browser. The user may share the collection with other Internet users by designating it as a public collection and submitting it for approval. When a collection is approved, it is assigned a URL, several related keywords, and a description. It is then cleared to show up as a shared resource.

Public collections are searchable by search engines and viewable by other users. Hence, other Internet users performing the same search may benefit from the research of the collection's creator. This system saves time in that other Internet - A - users may not have to scan through unrelated records in a result set in order to locate pertinent web sites and/or information.

It is therefore an object of the present invention to provide a system that combines records retrieved from the three distinct queries into a result set and sorts the records in the result set. The system assigns separate scores to the three queries in order to sort records in the result set that were retrieved through the respective queries. The system also assigns separate scores to each record in the sorted result set to further determine the relevancy of each record to the search query. The system increases the score associated with each record each time a search term occurs in the record and based on the location of the search term in the record. The system may further sort the result set based on the user's sorting choice.

It is another object of the invention to use the first distinct query to locate records containing a phrase made up of all terms in the search query, but excluding a first special character or Boolean term. The system uses the second distinct query to locate records containing all terms in the search query and it uses the third distinct query to locate records containing any search term or variation of the search term.

It is another object of the invention to allow the user to bookmark web pages that contain relevant information, to store URL records corresponding to the bookmark web page in the rank database, and to track and rank the URL records based on the bookmarking activity on the associated web site.

It is another object of the invention to track and rank URL records associated with bookmarked web pages by assigning a score to each URL record and maintaining the assigned score. The system periodically recalculates scores assigned to each URL record in order to prevent the earliest created records from always having the highest scores. The system also utilizes fields in the URL record in a predefined formula in order to periodically recalculate the score associated with each record in the rank database. It is a further object of the invention to maintain keywords for describing the bookmarked web page in the URL record, assigning a score to each keyword, and selecting the highest scoring keywords.

Additional features and advantages of the invention will be set forth in the description that follows, and in part will be apparent from the description, or may be learned by practice of the invention. The objectives and advantages of the invention will be realized and attained by the system particularly pointed out in the written description and claims hereof as well as the appended drawings.

To achieve these and other advantages and in accordance with the purpose of the invention, as embodied and broadly described, the present invention provides a system connected to an expanded network for ranking bookmark web pages, the system comprising a brower for bookmarking selected web sites; a first database for storing records comprising Uniform Resource Locator (URL) associated with each bookmarked web sitemeans for storing each record in the first database; and a software component for searching the first database, individually tracking records associated with bookmarked web pages, assigning a score to each record, and maintaining the assigned score.

Brief Description of the Drawings The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention that together with the description serve to explain the principles of the invention. In the drawings: Fig. 1 illustrates the computer network in which the inventive search and rank system may be incorporated;

Fig. 2 illustrates the TCP/IP Layering Model Protocol used during communications between components on the computer network; Fig. 3 illustrates a preferred embodiment of the search system that is used to search for web sites related to a particular topic;

Fig. 4-A illustrates a search query;

Fig. 4-B illustrates a search query that includes special characters; Fig. 4-C illustrates a search query that includes Boolean characters;

Fig. 5 illustrates how the search engine converts a single search query into three different queries according to a preferred embodiment of the invention;

Fig. 6 illustrates the bookmark hierarchical tree structure according to a preferred embodiment of the invention; Fig. 7 illustrates a URL record that is stored in a collection database and a rank database according to a preferred embodiment of the invention;

Fig. 8 illustrates a flowchart of how a preferred embodiment of the system processes search queries; and

Fig. 9 illustrates a flowchart of how the inventive system tracks and ranks bookmark web pages according to a preferred embodiment of the invention.

Detailed Description Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings. The present invention described below extends the functionality of the inventive search and rank system and methods for utilizing the system.

Fig. 1 is an example of a local area network (LAN) 100 that is configured to transmit information to components inside and outside of LAN 100. It comprises a server 102, four computer systems 104-110, and peripherals, such as printers and other devices 112, that may be shared by components on LAN 100. Computer systems 104- 110 may serve as clients for server 102 and/or as clients and/or servers for each other and/or for other components connected to LAN 100. Components on LAN 100 are preferably connected together by cable media, for example copper or fiber-optic cable, and the network typology may be a token ring topology 114. It should be apparent to those of ordinary skill in the art that other media, for example, wireless media, such as optical and radio frequency, may also connect LAN 100 components. It also should be apparent that other network topologies, such as Ethernet, may be used.

Data may be transferred between components on LAN 100 in packets, i.e., blocks of data that are individually transmitted over LAN 100. Routers 120, 122 create an expanded network by connecting LAN 100 to other computer networks, such as the Internet, other LANs or Wide Area Networks (WAN). Routers are hardware devices that may include a conventional processor, memory, and separate

I/O interface for each network to which it connects. Hence, components on the expanded network may share information and services with each other. In order for communications to occur between components of physically connected networks, all components on the expanded network and the routers that connect them must adhere to a standard protocol. Computer networks connected to the Internet and to other networks typically use TCP/TP Layering Model Protocol. It should be noted that other internetworking protocols may be used.

As illustrated in Fig. 2, TCP/TP Layering Model comprises an application layer or (Layer 5) 202, a transport layer or (Layer 4) 204, an Internet layer or (Layer 3) 206, a network interface layer or (Layer 2) 208, and a physical layer or

(Layer 1) 210. Application layer protocols 202 specify how each software application connected to the network uses the network. Transport layer protocols 204 specify how to ensure reliable transfer among complex protocols. Internet layer protocols 206 specify the format of packets sent across the network as well as mechanisms used to forward packets from a computer through one or more routers to a final destination. Network interface layer protocols 208 specify how to organize data into frames and how a computer transmits frames over the network; and physical layer protocols 210 correspond to the basic network hardware. By using TCP/TP Layering model protocols, any component connected to the network can communicate with any other component connected directly or indirectly to the one of the attached networks.

Presently, an Internet user may log on the Internet through server 102 and a browser application program, such as Microsoft Explorer™ or Netscape™ browser, on client computer system 104-110. The browser displays text and/or graphic information on Web pages through the client computer system 104-110 screen and permits the user to navigate through the Web using a mouse. Furthermore, some of the Web page information may be highlighted to indicate that an item is a hypertext, or pointer to another document. Fig. 3 illustrates an inventive search engine 300 that is used to search for web sites related to a particular topic. Search engine 300 is preferably executed on server 102 although it may be executed on client computer system 104- 110. The user may access search engine 300 through the browser and submit a search query related to a specific topic to it.

Fig. 4-A illustrates a search query 400 that may be submitted to a currently used search engine, such as AltaVista™ search engine. Search query 400 includes multiple query terms 402. The user may modify search query 400 with special characters, such as a plus sign, '+', a minus sign, '-', and left and right parenthesis, '()', and with Boolean terms such as 'AND', 'OR', and 'AND NOT'. For example, when the user enters several query terms 402 in a search query, AltaVista ™ search engine searches for pages containing at least one of those query terms. Adding a plus sign, '+', before a query term 402 ensures that the query term 402 will appear on every page the AltaVista ™ search engine returns. If the minus sign, '-', precedes query term 402, the search engine will only retrieve records that do not contain the query term. The user may also search for exact phrases by enclosing search terms in quotation marks. Fig. 4-B illustrates a search query that includes special characters and Fig. 4-C illustrates a search query that includes Boolean characters. It should be apparent that other modifications may be made to search query 400.

Fig. 5 illustrates how search engine 300 in the inventive system processes search query 400. Specifically in a preferred embodiment of the invention, search engine 300 converts search query 400 into a phrase match query 504, an all- word query 506, and an any- word query 508. Phrase match query 504 is used for locating records containing a phrase made up of all query terms 402, but excluding the first special character or Boolean term. Search engine 300 uses all-word query 506 to locate records containing all search terms 402 and it uses any- word query 508 to search for records containing any search terms 402 entered by the user and to search for records that contain a variation of each search term 402. For example, if the user entered 'record' as a search term, any-word query 508 will retrieve records containing 'record', 'records', 'recording', and 'recorded'. During the query conversion, if search engine 300 determines that multiple queries 504-508 will produce identical results, then only one of the queries is used. Upon executing queries 504-508, search engine 300 also eliminates duplicate records retrieved by multiple queries.

After queries 504-508 are constructed, search engine 300 searches a collection database 512 and a rank database 514 for any records that satisfy the queries. Collection database 512 contains 'collections,' which is a user created folder based on hierarchical format. Each collection contains organized and preselected records, related to a specific topic, that are derived from earlier searches. Rank database 514 contains searchable URL records that are associated with bookmarked web pages. During collection database 512 search, search engine 300 associates a score with each query 504-508 in order to determine the precedence of records retrieved from each query. Query scores may be modified to vary the priority of records retrieved by each query 504-508. Records retrieved from queries 504-508 are combined into one result set 510 and they are sorted according to their associated scores. Result set 510 is typically truncated at a predetermined number to limit the number of records that are transmitted to the user. A record score is associated with each record in result set 510 for further determining the relevancy of each record to the search term entered by the user. Search engine 300 increases a record's score each time a search term occurs in the record and based on the location of the search term in the record. Thereafter, the entire result set 510 is sorted according to the user's sorting choice. Examples of sorting based on the user's choice include sorting based on the record's score; sorting based on the record's creation date; sorting based on the record activity level; sorting based on the rating assigned to each record by other users; sorting based on the late updated date of each record; and sorting based on the number of URLs links associated with each record. After sorting result set 510, it is displayed on the web page for the user to examine.

In the inventive system, the user may also bookmark each web page, in result set 510, that contains relevant information. Moreover, the user may use tools in the inventive system to bookmark pages found through other search engines. The URLs associated with the bookmarked web pages are organized into a hierarchical tree structure similar to a computer file structure. Fig. 6 illustrates a preferred embodiment of the bookmark hierarchical tree structure 600. Structure 600 may be categorized into collections 602-606 and 612 and folders 608.

Collections 602-606, 612 may be designated as public collections 604, 612, collections viewable by any user and collections that may appear in searched result set 510, or designated as private collections 602, 606, collections viewable only by the creator. Upon creating a public collection, the user must identify the collection type either as a "general list" of bookmarks or as a specific topic. An example of a general list identification is "John Smith's Favorite" and example of a specific topic identification is "Computer Gaming Sites". If the user identifies the collection as a general list, the record is stored in rank database 514 and a URL is provided to the user for publishing the collection. Thereafter, other users may access the collection either through a result set returned by search engine 300 or by entering the URL directly into a browser. If the user identifies the collection as a specific topic, the collection must be submitted for approval. The collection is evaluated based on predefined criteria, for example, the number of links in the collection or no empty folders in a collection. Upon approval of the collection, the system prompts the user to enter personal information such as creator's email address and the title, description and keywords associated with the collection. The collection is then stored in collection and rank databases 512, 514 and it is searchable by search engine 300.

The system tracks and ranks each URL based on bookmarking activity performed on the associated web site. Since a score associated with a bookmarked web page is increased each time a user bookmarks the pages, a collection 602-606, 612 may improve search results even though it is designated as a private collection.

In other words, a user casts a vote for a web site by bookmarking the site.

Specifically, a bookmark manager application 516 in the system tracks individually bookmarked web pages, assigns scores to associated URL records and maintains the assigned score. Bookmark manager 516 searches rank database 514 and increases the URL record's score each time the associated web page is bookmarked by a user. Fig. 7 illustrates the URL record 700 associated with bookmarked web pages. Record 700 is stored in collection and rank databases 512, 514 and it includes the URL of an associated bookmarked web page 702, a title of the web page 704, and a collection placement 706. Collection placement 706 is used to specify which collections and/or folders, if any, contain the bookmark.

Record 700 may also include a description of the web page 708 and keywords or phrases 710 that are related to the contents of the web page.

The first time a particular web page is bookmarked by a user, record 700 is transmitted to the rank database 604. If record 700 already exists in database 604, a score associated with it is incremented and if it does not exist, it is inserted into the database and a score is assigned to it. Then the system updates statistical information about the collection. Examples of statistical information include the number of bookmarks in collection 502-506, 512, the number of folders in the collection 502-506, 512, the names of folders that are contained in the root of collection 502-506, 512 and the last update date of collection 502-506, 512.

After record 700 is inserted into 604 rank database, bookmark manager 516 processes any keywords 710 included in record 700 by determining if keyword 710 already exists for that URL in another record. If it does, bookmark manager 516 increments a score associated with that keyword. If it does not exist, bookmark manager 516 assigns a score to the keyword. The top ten keywords 710, keywords with highest scores, are selected and placed in a list where they can be easily searched and associated with the appropriate URLs. In order to prevent URLs associated with earlier bookmarked pages from having higher scores because of an earlier bookmark date and not necessarily because of relevance to a search topic, bookmark manager 516 periodically, for example once a month, recalculates/rebuilds the score associated with each URL record. Each record 700 in rank database 514 includes six additional fields 710-720 for storing six time periods. The first field 712 stores the number of times the associated page was bookmarked since the last rebuild process, the second field 714 stores the number of times the associated page was bookmarked between the last rebuild process and the second last rebuild process and so on. Thus, the sixth field 722 initially stores the number of times between the two earliest rebuild time periods that the associated page was bookmarked. It should be noted that the length of the time between rebuild periods is not as important as consistency between time periods.

Bookmark manager 516 uses the following formula to recalculate the score associated with each URL record:

Score = (Pl*3) + (P2*2.5) + (P3*2) + (P4 * 1.5) + (P5*l) + (P6 *.5)

-PI represents the number stored in the first field; P2 represents the number stored in the second field; P3 represents the number stored in the third field; P4 represents the number stored in the fourth field; P5 represents the number stored in the fifth field; and P6 represents the number in the sixth field.

Upon rebuilding URL record's 700 score, bookmark manager 516 sets the value of first field 712 to zero and shifts the value of fields 714-720. Hence, second field's 714 value is set to the previous value of first field 712; third field's 716 value is set to the previous value of second field 714; fourth field' s 718 value is set to the previous value of third field 716; and fifth field's 720 value is set to the previous value of fourth field 718. The previous value of fifth field 720 is added to the value of the sixth field 722 and the sixth field 722 is used as an accumulator.

Fig. 8 illustrates a flow chart of how system 300 processes search queries 400 entered by the user according to a preferred embodiment. In Step 810, system

300 converts search query 400 into a phrase match query 504, an all-word query

506 and an any-word query 508 and associates a score with each query. In Step

820, search engine 300 searches collection and rank databases 512-514, and searches all accessible web sites for any sites that satisfy the queries. In Step 830, search engine 300 combines records retrieved from each query 504-508 in a result set. In Step 840, search engine 300 assigns a score to each record in the combined result set and adjusts the assigned score in order further determine the relevance of each record to the search topic. In Step 850, search engine 300 sorts the result set

510 based on the user's sorting choice and displays the sorted result set on a computer screen through a browser.

Fig. 9 illustrates a flow chart of how the system processes collections and bookmarked web pages in the collection and rank databases 512-514 according to a preferred embodiment of the invention. In Step 910, a user bookmarks a web page that contains information relevant to a specific topic and places the bookmark in a collection. Thereafter the user designates the collection as a public or private collection. In Step 920, search engine 300 determines if the bookmark is part of a collection and if the collection is a public or private collection and it updates statistical information about the collection. In Step 930, the bookmark manager searches rank database 514 for the bookmark and it maintains the score of the associated URL record 700. If this is the first bookmark of a particular page, URL record 700 is stored in rank database 514 and a score is assigned to record 700, else, the score is incremented. In step 940, the bookmark manager provides a URL for the collection to the user and the user may submit a specific topic-public collection for approval. In Step 950, upon approval of the collection the user updates the collection information for storage in the collection database. In Step 960, the bookmark manager processes any keywords included in URL record 700 and selects the keywords with the highest score. In Step 970, the bookmark manager periodically recalculates the score associated with each record and resets the values of the appropriate fields.

The foregoing description has been directed to specific embodiments of this invention. It will be apparent, however, that other variations and modifications may be made to the described embodiments, with the attainment of some or all of their advantages. Therefore, it is the object of the appended claims to cover all such variations and modifications as come within the true spirit and scope of the invention.

Claims

CLAIMSWhat is claimed is:
1. A system connected to an expanded network for ranking bookmark web pages, the system comprising: a browser for bookmarking selected web sites; a first database for storing records comprising Uniform Resource Locator (URL) associated with each bookmarked web site; and a software component that searches the first database, tracks individual records associated with bookmarked web pages, assigns a score to each record, and maintains the assigned score.
2. The system of claim 1 , wherein the software component increase the record score each time a user bookmarks the web site associated with the record.
3. The system of claim 2 wherein the record in the first database comprises: a URL of the bookmarked web site; a title for identifying the web site; and a placement number for specifying other files where the URL associated with the bookmark web site is stored.
4. The system of claim 3, wherein the record further comprises: a description of the web site; and a plurality of keywords that relate to contents of the web page.
5. The system of claim 4, wherein the record in the first database may be stored in a second database.
6. The system of claim 5, wherein the software component: assigns a score to each of the plurality of keywords when the record is first inserted in the first database; determines if each keyword in the plurality of keywords already exits in another record; and increments the score assigned to each keyword in the plurality of keywords if the keyword exists in another record;
7. The system of claim 6, wherein the software component selects a second plurality of keywords with the highest scores and stores the second plurality of keywords in a table.
8. The system of claim 7, wherein the software component periodically recalculates the score associated with the record.
9. The system of claim 8, wherein the record further comprises a plurality of additional fields for storing time periods.
10. The system of claim 9, wherein a first of the plurality of additional fields is used for storing a number of times the web page was bookmarked since a last recalculation; a second of the plurality of additional fields is used for storing a number of times the web page was bookmarked since a second last recalculation; a third of the plurality of additional fields is used for storing a number of times the web page was bookmarked since a third last recalculation; a fourth of the plurality of additional fields is used for storing a number of times the web page was bookmarked since a fourth last recalculation; a fifth of the plurality of additional fields is used for storing a number of times the web page was bookmarked since a fifth last recalculation; and a sixth of the plurality of additional fields is used for storing a number of times the web page was bookmarked since an initial sixth last recalculation.
11. The system of claim 10 wherein the sixth of the plurality of additional fields is used as an accumulator and upon each recalculation, the software component adds the value in the fifth of the plurality of additional fields to the value in the sixth of the plurality of additional fields.
12. The system of claim 11 , wherein the software component assigns new values to the plurality of additional fields after each periodic recalculation.
13. The system of claim 12, wherein the software component uses a predefined formula and the values of the plurality of additional fields to periodically recalculate the score associated with each record in the first database.
14. The system of claim 13, wherein the predefined formula comprises: a first product of three times a value in the first of the plurality of additional fields; a second product of two point five times a value in the second of the plurality of additional fields; a third product of two times a value in the third of the plurality of additional fields; a fourth product of one point five times a value in the fourth of the plurality of additional fields; a fifth product of one times a value in the fifth of the plurality of additional fields; and a sixth product of point five times a value in the sixth of the plurality of additional fields.
15. The system of claim 14, wherein the first to sixth products are added together in the predefined formula.
16. A system connected to an expanded network for ranking bookmark web pages, the system comprising: a browser for bookmarking selected web sites; a first database for storing records comprising Uniform Resource Locator
(URL) associated with each bookmarked web site; and means for transmitting records associated with bookmarked web sites to the first database, for searching the first database, for tracking individual records associated with bookmarked web pages, for assigning a score to each record, and for maintaining the assigned score.
17. The system of claim 16, further comprising means for increasing the record score each time a user bookmarks the web site associated with the record.
18. The system of claim 17, further comprising means for creating records in the first database, the record comprises: a URL of the bookmarked web site; a title for identifying the web site; and a placement number for specifying other files where the URL associated with the bookmark web site is stored.
19. The system of claim 18, further comprising means for storing a description of the web site and a plurality of keywords that relate to contents of the web page in the record.
20. The system of claim 19, further comprising means for storing the record in the first database in a second database.
21. The system of claim 20, further comprising means for: assigning a score to each of the plurality of keywords when the record is first inserted in the first database; determining if each keyword in the plurality of keywords already exits in another record; and incrementing the score assigned to each keyword in the plurality of keywords if the keyword exists in another record;
22. The system of claim 21, further comprising means for selecting a second plurality of keywords with the highest scores and for storing the second plurality of keywords in a table.
23. The system of claim 22, further comprising means for periodically recalculating the score associated with the record.
24. The system of claim 23, further comprising means for storing time periods in a plurality of additional fields in the record.
25. The system of claim 24, further comprising means for: storing a number of times the web page was bookmarked since a last recalculation in a first of the plurality of additional fields; storing a number of times the web page was bookmarked since a second last recalculation in a second of the plurality of additional fields; storing a number of times the web page was bookmarked since a third last recalculation in a third of the plurality of additional fields; storing a number of times the web page was bookmarked since a fourth last recalculation in a fourth of the plurality of additional fields; storing a number of times the web page was bookmarked since a fifth last recalculation in a fifth of the plurality of additional fields; and storing a number of times the web page was bookmarked since an initial sixth last recalculation in a sixth of the plurality of additional fields.
26. The system of claim 25, further comprising means for using the sixth of the plurality of additional fields as an accumulator upon each recalculation and for adding the value in the fifth of the plurality of additional fields to the value in the sixth of the plurality of additional fields.
27. The system of claim 26, further comprising means for assigning new values to the plurality of additional fields after each periodic recalculation.
28. The system of claim 27, further comprising means for using a predefined formula and the values of the plurality of additional fields to periodically recalculate the score associated with each record in the first database.
29. A method for ranking bookmark web pages in a system that is connected to an expanded network, the method comprises the steps of: bookmarking selected web sites through a browser; storing records comprising Uniform Resource Locator (URL) associated with each bookmarked web site in a first database; and transmitting records associated with bookmarked web sites to the first database, searching the first database, tracking individual records associated with bookmarked web pages, assigning a score to each record, and maintaining the assigned score.
30. The method of claim 29, further comprising the step of increasing the record score each time a user bookmarks the web site associated with the record.
31. The system of claim 30, further comprising the step of storing the record in the first database in a second database.
32. The method of claim 31, further comprising the steps of: assigning a score to each keyword in a plurality of keywords in a record when the record is first inserted in the first database; determining if each keyword in the plurality of keywords already exits in another record; and incrementing the score assigned to each keyword in the plurality of keywords if the keyword exists in another record;
33. The method of claim 32, further comprising the steps of selecting a second plurality of keywords with the highest scores and storing the second plurality of keywords in a table.
34. The method of claim 33, further comprising the step of periodically recalculating the score associated with the record.
35. The method of claim 34, further comprising the step of storing time periods in a plurality of additional fields in the record.
36. The method of claim 35, further comprising the steps of: storing a number of times the web page was bookmarked since a last recalculation in a first of the plurality of additional fields; storing a number of times the web page was bookmarked since a second last recalculation in a second of the plurality of additional fields; storing a number of times the web page was bookmarked since a third last recalculation in a third of the plurality of additional fields; storing a number of times the web page was bookmarked since a fourth last recalculation in a fourth of the plurality of additional fields; storing a number of times the web page was bookmarked since a fifth last recalculation in a fifth of the plurality of additional fields; and storing a number of times the web page was bookmarked since an initial sixth last recalculation in a sixth of the plurality of additional fields.
37. The method of claim 36, further comprising the steps of: using the sixth of the plurality of additional fields as an accumulator after each recalculation; and adding the value in the fifth of the plurality of additional fields to the value in the sixth of the plurality of additional fields.
38. The method of claim 37, further comprising the step of assigning new values to the plurality of additional fields after each periodic recalculation.
39. The method of claim 38, further comprising the step of using a predefined formula and the values of the plurality of additional fields to periodically recalculate the score associated with each record in the first database.
PCT/US2000/024286 1999-08-31 2000-08-31 An internet search system for tracking and ranking selected records from a previous search WO2001016807A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US38635199 true 1999-08-31 1999-08-31
US09/386,351 1999-08-31

Publications (1)

Publication Number Publication Date
WO2001016807A1 true true WO2001016807A1 (en) 2001-03-08

Family

ID=23525239

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/024286 WO2001016807A1 (en) 1999-08-31 2000-08-31 An internet search system for tracking and ranking selected records from a previous search

Country Status (1)

Country Link
WO (1) WO2001016807A1 (en)

Cited By (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001080087A1 (en) * 2000-04-14 2001-10-25 Rightnow Technologies, Inc. Temporal updates of relevancy rating of retrieved information in an information search system
GB2368424A (en) * 2000-06-23 2002-05-01 Ibm Processing unit for sharing queries
EP1348173A1 (en) * 2000-11-21 2003-10-01 Singingfish.Com A system and process for searching a network
US6665655B1 (en) 2000-04-14 2003-12-16 Rightnow Technologies, Inc. Implicit rating of retrieved information in an information search system
WO2005029366A1 (en) * 2003-09-12 2005-03-31 Google Inc. Methods and systems for improving a search ranking using related queries
EP1536350A2 (en) * 2003-11-26 2005-06-01 Nokia Corporation System and method for creating dynamic internet bookmark
WO2005062210A1 (en) * 2003-12-03 2005-07-07 Google, Inc. Methods and systems for personalized network searching
WO2005066842A1 (en) * 2003-12-31 2005-07-21 Google, Inc Systems and methods for unification of search results
WO2006007575A2 (en) * 2004-07-01 2006-01-19 Aol Llc Analyzing a query log for use in managing category-specific electronic content
WO2007134184A2 (en) * 2006-05-10 2007-11-22 Google Inc. Presenting search result information
US7716205B1 (en) 2009-10-29 2010-05-11 Wowd, Inc. System for user driven ranking of web pages
US7761439B1 (en) 2004-06-30 2010-07-20 Google Inc. Systems and methods for performing a directory search
US8135737B2 (en) 2004-12-29 2012-03-13 Aol Inc. Query routing
US8255819B2 (en) 2006-05-10 2012-08-28 Google Inc. Web notebook tools
US8676797B2 (en) 2006-05-10 2014-03-18 Google Inc. Managing and accessing data in web notebooks
US8694511B1 (en) 2007-08-20 2014-04-08 Google Inc. Modifying search result ranking based on populations
US8694374B1 (en) 2007-03-14 2014-04-08 Google Inc. Detecting click spam
US8799273B1 (en) 2008-12-12 2014-08-05 Google Inc. Highlighting notebooked web content
US8874555B1 (en) 2009-11-20 2014-10-28 Google Inc. Modifying scoring data based on historical changes
US8898152B1 (en) 2008-12-10 2014-11-25 Google Inc. Sharing search engine relevance data
US8909655B1 (en) 2007-10-11 2014-12-09 Google Inc. Time based ranking
US8924379B1 (en) 2010-03-05 2014-12-30 Google Inc. Temporal-based score adjustments
US8938463B1 (en) 2007-03-12 2015-01-20 Google Inc. Modifying search result ranking based on implicit user feedback and a model of presentation bias
US8959093B1 (en) 2010-03-15 2015-02-17 Google Inc. Ranking search results based on anchors
US8972391B1 (en) 2009-10-02 2015-03-03 Google Inc. Recent interest based relevance scoring
US8972394B1 (en) 2009-07-20 2015-03-03 Google Inc. Generating a related set of documents for an initial set of documents
US9002867B1 (en) 2010-12-30 2015-04-07 Google Inc. Modifying ranking data based on document changes
US9009146B1 (en) 2009-04-08 2015-04-14 Google Inc. Ranking search results based on similar queries
US9058395B2 (en) 2003-05-30 2015-06-16 Microsoft Technology Licensing, Llc Resolving queries based on automatic determination of requestor geographic location
US9064014B2 (en) * 2010-08-20 2015-06-23 Rakuten, Inc. Information provisioning device, information provisioning method, program, and information recording medium
CN104765619A (en) * 2015-05-07 2015-07-08 苏州首旗信息科技有限公司 Singing recording software
US9092510B1 (en) 2007-04-30 2015-07-28 Google Inc. Modifying search result ranking based on a temporal element of user feedback
US9183499B1 (en) 2013-04-19 2015-11-10 Google Inc. Evaluating quality based on neighbor features
US9235627B1 (en) 2006-11-02 2016-01-12 Google Inc. Modifying search result ranking based on implicit user feedback
US9418104B1 (en) 2009-08-31 2016-08-16 Google Inc. Refining search results
US9623119B1 (en) 2010-06-29 2017-04-18 Google Inc. Accentuating search results

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6041360A (en) * 1997-11-21 2000-03-21 International Business Machines Corporation Web browser support for dynamic update of bookmarks
US6055542A (en) * 1997-10-29 2000-04-25 International Business Machines Corporation System and method for displaying the contents of a web page based on a user's interests

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6055542A (en) * 1997-10-29 2000-04-25 International Business Machines Corporation System and method for displaying the contents of a web page based on a user's interests
US6041360A (en) * 1997-11-21 2000-03-21 International Business Machines Corporation Web browser support for dynamic update of bookmarks

Cited By (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2377300B (en) * 2000-04-14 2004-02-25 Rightnow Tech Inc Temporal updates of relevancy rating of retrieved information in an information search system
US6434550B1 (en) 2000-04-14 2002-08-13 Rightnow Technologies, Inc. Temporal updates of relevancy rating of retrieved information in an information search system
GB2377300A (en) * 2000-04-14 2003-01-08 Rightnow Tech Inc Temporal updates of relevancy rating of retrieved information in an information search system
US6665655B1 (en) 2000-04-14 2003-12-16 Rightnow Technologies, Inc. Implicit rating of retrieved information in an information search system
WO2001080087A1 (en) * 2000-04-14 2001-10-25 Rightnow Technologies, Inc. Temporal updates of relevancy rating of retrieved information in an information search system
GB2368424A (en) * 2000-06-23 2002-05-01 Ibm Processing unit for sharing queries
US7725526B1 (en) 2000-06-23 2010-05-25 International Business Machines Corporation System and method for web based sharing of search engine queries
EP1348173A1 (en) * 2000-11-21 2003-10-01 Singingfish.Com A system and process for searching a network
EP1348173A4 (en) * 2000-11-21 2006-11-29 Thomson Licensing A system and process for searching a network
US9058395B2 (en) 2003-05-30 2015-06-16 Microsoft Technology Licensing, Llc Resolving queries based on automatic determination of requestor geographic location
WO2005029366A1 (en) * 2003-09-12 2005-03-31 Google Inc. Methods and systems for improving a search ranking using related queries
US8024326B2 (en) 2003-09-12 2011-09-20 Google Inc. Methods and systems for improving a search ranking using related queries
EP1536350A2 (en) * 2003-11-26 2005-06-01 Nokia Corporation System and method for creating dynamic internet bookmark
EP1536350A3 (en) * 2003-11-26 2006-10-18 Nokia Corporation System and method for creating dynamic internet bookmark
US8612415B2 (en) 2003-12-03 2013-12-17 Google Inc. Personalized network searching
US8166017B2 (en) 2003-12-03 2012-04-24 Google Inc. Personalized network searching
US8150830B2 (en) 2003-12-03 2012-04-03 Google Inc. Personalized network searching
CN101739467B (en) 2003-12-03 2012-09-05 咕果公司 Personalized network searching method and system
US8015170B2 (en) 2003-12-03 2011-09-06 Google Inc. Personalized network searching
WO2005062210A1 (en) * 2003-12-03 2005-07-07 Google, Inc. Methods and systems for personalized network searching
US7523096B2 (en) 2003-12-03 2009-04-21 Google Inc. Methods and systems for personalized network searching
US9679067B2 (en) 2003-12-03 2017-06-13 Google Inc. Personalized network searching
US8886626B2 (en) 2003-12-03 2014-11-11 Google Inc. Personalized network searching
US7437353B2 (en) 2003-12-31 2008-10-14 Google Inc. Systems and methods for unification of search results
WO2005066842A1 (en) * 2003-12-31 2005-07-21 Google, Inc Systems and methods for unification of search results
US7761439B1 (en) 2004-06-30 2010-07-20 Google Inc. Systems and methods for performing a directory search
WO2006007575A3 (en) * 2004-07-01 2006-05-18 America Online Inc Analyzing a query log for use in managing category-specific electronic content
US8073867B2 (en) 2004-07-01 2011-12-06 Aol Inc. Analyzing a query log for use in managing category-specific electronic content
US8768908B2 (en) 2004-07-01 2014-07-01 Facebook, Inc. Query disambiguation
US9183250B2 (en) 2004-07-01 2015-11-10 Facebook, Inc. Query disambiguation
WO2006007575A2 (en) * 2004-07-01 2006-01-19 Aol Llc Analyzing a query log for use in managing category-specific electronic content
US7379949B2 (en) 2004-07-01 2008-05-27 Aol Llc Analyzing a query log for use in managing category-specific electronic content
JP2008505407A (en) * 2004-07-01 2008-02-21 エーオーエル エルエルシー Analysis of the query log for use in management of category-specific electronic content
US8135737B2 (en) 2004-12-29 2012-03-13 Aol Inc. Query routing
WO2007134184A2 (en) * 2006-05-10 2007-11-22 Google Inc. Presenting search result information
US9256676B2 (en) 2006-05-10 2016-02-09 Google Inc. Presenting search result information
US9852191B2 (en) 2006-05-10 2017-12-26 Google Llc Presenting search result information
WO2007134184A3 (en) * 2006-05-10 2008-01-17 Google Inc Presenting search result information
US8255819B2 (en) 2006-05-10 2012-08-28 Google Inc. Web notebook tools
US8676797B2 (en) 2006-05-10 2014-03-18 Google Inc. Managing and accessing data in web notebooks
US9235627B1 (en) 2006-11-02 2016-01-12 Google Inc. Modifying search result ranking based on implicit user feedback
US9811566B1 (en) 2006-11-02 2017-11-07 Google Inc. Modifying search result ranking based on implicit user feedback
US8938463B1 (en) 2007-03-12 2015-01-20 Google Inc. Modifying search result ranking based on implicit user feedback and a model of presentation bias
US8694374B1 (en) 2007-03-14 2014-04-08 Google Inc. Detecting click spam
US9092510B1 (en) 2007-04-30 2015-07-28 Google Inc. Modifying search result ranking based on a temporal element of user feedback
US8694511B1 (en) 2007-08-20 2014-04-08 Google Inc. Modifying search result ranking based on populations
US8909655B1 (en) 2007-10-11 2014-12-09 Google Inc. Time based ranking
US9152678B1 (en) 2007-10-11 2015-10-06 Google Inc. Time based ranking
US8898152B1 (en) 2008-12-10 2014-11-25 Google Inc. Sharing search engine relevance data
US8799273B1 (en) 2008-12-12 2014-08-05 Google Inc. Highlighting notebooked web content
US9009146B1 (en) 2009-04-08 2015-04-14 Google Inc. Ranking search results based on similar queries
US8972394B1 (en) 2009-07-20 2015-03-03 Google Inc. Generating a related set of documents for an initial set of documents
US8977612B1 (en) 2009-07-20 2015-03-10 Google Inc. Generating a related set of documents for an initial set of documents
US9418104B1 (en) 2009-08-31 2016-08-16 Google Inc. Refining search results
US9697259B1 (en) 2009-08-31 2017-07-04 Google Inc. Refining search results
US8972391B1 (en) 2009-10-02 2015-03-03 Google Inc. Recent interest based relevance scoring
US9390143B2 (en) 2009-10-02 2016-07-12 Google Inc. Recent interest based relevance scoring
US7716205B1 (en) 2009-10-29 2010-05-11 Wowd, Inc. System for user driven ranking of web pages
US7873623B1 (en) 2009-10-29 2011-01-18 Wowd, Inc. System for user driven ranking of web pages
US8898153B1 (en) 2009-11-20 2014-11-25 Google Inc. Modifying scoring data based on historical changes
US8874555B1 (en) 2009-11-20 2014-10-28 Google Inc. Modifying scoring data based on historical changes
US8924379B1 (en) 2010-03-05 2014-12-30 Google Inc. Temporal-based score adjustments
US8959093B1 (en) 2010-03-15 2015-02-17 Google Inc. Ranking search results based on anchors
US9623119B1 (en) 2010-06-29 2017-04-18 Google Inc. Accentuating search results
US9064014B2 (en) * 2010-08-20 2015-06-23 Rakuten, Inc. Information provisioning device, information provisioning method, program, and information recording medium
US9002867B1 (en) 2010-12-30 2015-04-07 Google Inc. Modifying ranking data based on document changes
US9183499B1 (en) 2013-04-19 2015-11-10 Google Inc. Evaluating quality based on neighbor features
CN104765619A (en) * 2015-05-07 2015-07-08 苏州首旗信息科技有限公司 Singing recording software

Similar Documents

Publication Publication Date Title
Bar-Yossef et al. Template detection via data mining and its applications
Schwartz et al. A comparison of Internet resource discovery approaches
US7499940B1 (en) Method and system for URL autocompletion using ranked results
US7225407B2 (en) Resource browser sessions search
Bar-Ilan Data collection methods on the Web for infometric purposes—A review and analysis
US7788274B1 (en) Systems and methods for category-based search
US6574625B1 (en) Real-time bookmarks
Lempel et al. The stochastic approach for link-structure analysis (SALSA) and the TKC effect1
US6647381B1 (en) Method of defining and utilizing logical domains to partition and to reorganize physical domains
US7152064B2 (en) Searching tool and process for unified search using categories and keywords
Bowman et al. Harvest: A scalable, customizable discovery and access system
US6493702B1 (en) System and method for searching and recommending documents in a collection using share bookmarks
US6795820B2 (en) Metasearch technique that ranks documents obtained from multiple collections
Hotho et al. Information retrieval in folksonomies: Search and ranking
US6480837B1 (en) Method, system, and program for ordering search results using a popularity weighting
US6145003A (en) Method of web crawling utilizing address mapping
US6560600B1 (en) Method and apparatus for ranking Web page search results
US5875446A (en) System and method for hierarchically grouping and ranking a set of objects in a query context based on one or more relationships
US6772141B1 (en) Method and apparatus for organizing and using indexes utilizing a search decision table
US7290061B2 (en) System and method for internet content collaboration
US20080082486A1 (en) Platform for user discovery experience
US20040024752A1 (en) Method and apparatus for search ranking using human input and automated ranking
US20030220913A1 (en) Techniques for personalized and adaptive search services
US6947924B2 (en) Group based search engine generating search results ranking based on at least one nomination previously made by member of the user group where nomination system is independent from visitation system
US20050097160A1 (en) Method for providing information about a site to a network cataloger

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP