WO2002095624A2 - A method for organizing an internet search according to user purposeful activities - Google Patents

A method for organizing an internet search according to user purposeful activities Download PDF

Info

Publication number
WO2002095624A2
WO2002095624A2 PCT/IL2002/000385 IL0200385W WO02095624A2 WO 2002095624 A2 WO2002095624 A2 WO 2002095624A2 IL 0200385 W IL0200385 W IL 0200385W WO 02095624 A2 WO02095624 A2 WO 02095624A2
Authority
WO
WIPO (PCT)
Prior art keywords
web site
web
site
list
sites
Prior art date
Application number
PCT/IL2002/000385
Other languages
French (fr)
Other versions
WO2002095624A3 (en
Inventor
Mordechai Chachamu
Original Assignee
The Parity Bit Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by The Parity Bit Ltd. filed Critical The Parity Bit Ltd.
Publication of WO2002095624A2 publication Critical patent/WO2002095624A2/en
Publication of WO2002095624A3 publication Critical patent/WO2002095624A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/954Navigation, e.g. using categorised browsing

Definitions

  • the present invention generally relates to a computerized search method. More specifically, the present invention relates to a method for the categorization of Internet sites according to the intended activity of the user.
  • the present invention provides an Internet search method, based on directory selections of purposeful activities intended by system users. More specifically, said invention relates to a method for organizing a list of Web sites offering online access on an electronic network, said list including categories of purposeful activities for searches by users.
  • the Internet which started as a novel way to share information world wide, grew to become the largest 'virtual library' on earth, with millions of sites, or 'books,' containing over a billion pages.
  • the users require some help to find specific information buried under such a phenomenal amount of data.
  • the need for a comprehensive 'index' is filled by a variety of Search Engines (SE's), e.g. AltaVistaTM, while Directories, e.g. Yahoo!TM, provide the 'book catalogue,' or list of major general categories, related to grouping of site's content.
  • SE's Search Engines
  • Directories e.g. Yahoo!TM
  • MSE Meta Search Engine
  • an online service site is defined as a site that requires or expects the user to be active while visiting the site.
  • searching is an activity performed regularly by almost all Web users.
  • the problem is — users are not happy with their search results, and the situation is getting worse all the time. What's wrong with the existing SE's? Why is it, then, that the SE's fail to provide the users with good search results?
  • the problem has two main sources: The Web's size; and The user's behavior and intention.
  • Unrelated manipulation i.e., an algorithm is used to determine the site's ranking and thereby artificially boost a site to occupy a prominent position, irrespective of the 'true' relevance of that site to the user's search intention.
  • the most popular search site is directory-based Yahoo, and its popularity is increasing unabated.
  • directories are delivering far better search results than SE's. Yet, directories are also failing to fill the search gap for the following reasons:
  • Consistency The Web is in constant flux. The content of a page or a site may change significantly over time or be moved elsewhere. Unlike the SE, which rebuilds its index every time its crawler revisits the Web, the directory has no such automatic update;
  • the actual categorization of a given site is under the sole discretion of the editors. It may not be the right category according to both the expectation of the user and/or the site owner; and
  • the present invention relates to a method for organizing a list of Web sites offering online access on an electronic network, said list including a plurality of categories of purposeful activities for searches by users of said electronic network, and said method including: searching for a plurality of said Web sites that are unlisted; adding said unlisted Web site to said list, if said Web site offers an online service; assigning at least one of said plurality of categories to said Web site that has been added to said list; and constructing a Web site of said electronic network wherein said users have said online access to said list.
  • the method may be that said electronic network is the Internet.
  • the method may include reviewing said Web site for consideration within a specified time period.
  • the method may include charging a fee for reviewing.
  • the method may include assigning sub-categories to said at least one of said plurality of categories.
  • the method may include the option of bidding by said Web site for ranking on said list.
  • the method may include bidding for the first page of rankings.
  • the method may include bidding for ranking.
  • the method may include bidding for ranking in the first five.
  • the method may include bidding for ranking in the first ten.
  • the method may include declining the option of bidding by said Web site.
  • the method may include random listing of said Web site that declines the option of bidding.
  • the method may include performing a periodic quality assurance check on said Web site.
  • the method may include generating a random number by which to select among said Web sites for performing a periodic quality assurance check.
  • the method may include performing a periodic quality assurance check on said Web site which comprises checking for linkrot.
  • the method may include removing said Web site from said list if said Web site fails said periodic quality assurance check.
  • the service provided by the present invention has configured an activity directory to classify all the web sites, or other form of node on an electronic network, such as the Internet, that offer an online service.
  • the user is able to search the activity directory by using naturally constructed phrases, illustrated in the above examples.
  • the directory's topmost classification level has, for example, activities like Buy, Exchange, Bid, Play, Publish, Advertise, Trade, Bank, Order, Bet, Study, Obtain etc.
  • the active directory site generates revenues by selling placement in its search results.
  • the amount paid is determined by a periodical auction among all the partipants in every directory category.
  • all the sites listed under Buy -> Books -> Computers compete in an auction for the first places on the list.
  • the winners have their listing displayed in bold face or otherwise highlighted.
  • the other sites are randomally assigned a lower place or ordered using another criterion, such as alphabetically. Sites may or may not be charged for being listed on the active directory site.
  • the active directory From the subscriber site owners' point of view, the active directory generates the most targeted audience. Clearly, if a user has reached the list of sites classified under Buy -> Books -> Computers, it is because he intends to buy such a book. Capturing the attention of that user is the best advertisement expenditure that any of the listed sites can make.
  • the active directory Web Site involves the following methods and components. Information for more than 50,000 sites is assembled and categorized.
  • the 'front-end' user interface, or browser are written and fully tested, and are designed to be as simple and clean as possible, providing just the information required.
  • the 'back-end,' or database and the database tools are, preferably, very robust and fast.
  • the active directory emphasizes the use of human classifiers, or 'editors for Web site analysis,' in contrast to reliance on software and computers. Thus, accumulation and classification of data requires a substantial effort, which cannot be effectively short-circuited by any computer program.
  • Fig. 1 is a bar chart illustration of the prior art trend in consumer usage of electronic communications media
  • Fig. 2 is a schematic block diagram illustrating a system for active Internet searches by users, constructed and operated in accordance with the principles of the present invention.
  • Fig. 3 is a flow chart of the sequence of processing for a participating Web site, in accordance with the principles of the present invention.
  • Figure 1 is a bar chart illustration of the prior art trend in consumer usage of electronic communications media 100.
  • Figure 2 which is a schematic block diagram 200 illustrating users actively searching the Internet, is constructed in accordance with an exemplary embodiment of the present invention.
  • Internet 230 is shown as having a user/client side 232 and a server side 234.
  • server side 234 are shown online Web site servers 242, 244 and 246, of various service providers, as well as active directory Web site server 250.
  • Servers 242, 244 and 246 may, for example, embody sites that want to sell shares online, offer chats between users and provide exchanges of time-share houses online, respectively.
  • Web site server 250 is, in turn connected to a database and database tools for storage, such as activities 252, sub-categories 254, geographical locations 256, user feedback 258 and bid information 260.
  • Active directory site server 250 embodies a new web directory that is fundamentally different from all prior art directories on the net. It is based on the following principles:
  • the first categorization level of activities, prime activity 252 of the directory is based on action or intention, rather than an abstract concept or object.
  • search results will be related to user's geographical location 256. 5.
  • the first N for example five, places of the search results are given to highest bidders, stored with listed sites/bid information 260 among the sites that comprise each category. The auction takes place on a periodic basis.
  • the User Interface for user interaction and feedback 258 is minimal and functional, allowing speedy operation even on a slow connection, and easy navigation even on small PDA's or WAP phones.
  • active directory Web site server 250 only deals with, and accepts listing from sites, which offer online activity, it removes the heap of static information and clearly maps out the boundaries of the virtual city. Its aim is to become the authoritative guide to it.
  • WAP Wireless Application Protocol
  • Active directory Web site server 250 highest level is the list of the user's possible intentions. In other words, every search starts with the phrase T want to... ' followed by the appropriate verb. This is in contrast with the traditional directory, like Yahoo! TM, which has the following highest level categories: Arts and Humanities, Business and Economy, Computer and Internet, Education, Entertainment, Government, Health, News & Media, Adventure & Sports, Reference, Regional, Science, Social Science, Society & Culture.
  • the directory branches to different directions of sub-categories 254, based on up-front analysis and user's feedback 258. For instance, if the intention was 'to chat', it is logical to assume that there are two main criteria to select a chat room: Subject and Language. Using the combination of the two criteria, the user can find a list of chat providers that exactly match his expectations.
  • active directory Web site server 250 divides the query into three parts:
  • the user is then presented with a list of online shops that deal with programming language books.
  • Fig. 3 is a flow chart of the sequence of processing 300 for a participating Web site, in accordance with the principles of the present invention.
  • a new Web site is located, and is considered for listing.
  • the Web site is analyzed as to whether it offers an online Service 310. If so, as an optional feature of an exemplary embodiment, the site is asked to pay $200 315, for example, to be assigned to have a review within 48 hours 320, and if the fee is not paid, the site waits for review at the end of the review queue 325. In any case, when the site is reviewed, it is assigned a primary activity and sub-categories 330.
  • the site is willing to bid on its ranking in its category 335, and the bid is among the top N in the category 340, the site is listed according to its bid ranking on the Web page in its category 355. If unwilling to bid, or the bid is not among the top five, then the site is listed randomly, or otherwise sorted on the Web page 350.
  • Aquality assurance random number is generated 360, for all listed sites. If the generated number matches the Web site 365, then the site is checked for 'linkrot' 370. If 'linkrot' is found, the site is removed from the list 375. If 'linkrot' is not found, or if the generated number does not match the site, then the number generation cycle is repeated 360.
  • each site is marked with the date it was entered. Every day in every month, the QA process checks all the sites that have been entered on that particular day of any previous month. Thus, no site is ignored for more than a month.
  • Active directory Web site server 250 is preferably built to maximize the user's satisfaction during access of the service. The following guidelines ensure that the user's expectations are met and even superseded:
  • a strict Quality Assurance process guarantees that the categorization of sites is correct. Sites are closely scrutinized to ensure that their content and mode of operation merits inclusion in the listing;
  • the percentage of 'dead links' is preferably kept below 0.5%, with continuous efforts to reduce it further;
  • the site has a clear, concise and functional layout. Note the contrast, for example, between the clean www.google.comTM and the overloaded www.excite.comTM;
  • Advertisement' is the age-old elusive goal of marketing directors. The aim is to promote the service or product to a receptive audience. For example, an advertiser will avoid paying to promote gardening tools in a Motor Boat magazine or fishing equipment in a Home and Garden publication. Yet this is exactly what happens on the Web, because placements have been targeted haphazardly, with insufficient control. UK residents see banner advertisements of US-related products; toy retailers promote themselves on financial sites, etc.
  • banner advertisement effectiveness has declined.
  • the click-through rate (percentage of people that actually click on the banner advertisement) has been falling steadily in the last couple of years, reaching an average of 0.5%.
  • banners are considered a big nuisance for the majority of users, claiming the disreputable honor of being the highest-ranking cause of complaint. They cause slow response time for all the pages in which they appear and unlike TV advertisement they are always present - you cannot switch channels, and sometimes 'lock-in' the user, with no 'back-click' option.
  • Little wonder then that advertising agencies are urging companies to consider very carefully where they actually place those banners. Their advice is simple - find a site where the audience is actually interested in the product. In other words - 'Target Advertisement'.
  • the active directory Web site of the present invention allows participating service provider Web sites to purchase the position, or rank, in which their site appears in the search result page. Users tend to click on the first link that seems suitable for their needs. If all the links are equally relevant, the user's tendency is to click on the first link.
  • the purchase is done in the form an auction.
  • Each week/fortnight/month (or other period) all the companies listed in a given category are asked to bid for their place.
  • the available places are preferably allocated in the strict order of the bidding amount. To maximize revenues, the following measures are taken:
  • the first N e.g. 5, places are visible without the need to scroll, if possible
  • Active directory Web site server 250 has three stages of operation:
  • the active directory Web site signs reciprocal agreements with major Search Engines, so that when the user searches for something that is not available on the site directory, the Search Engine is invoked to provide some results; similarly, if the active directory Web site can best answer the user's query on the affiliated Search Engine - the Search Engine lists the active directory Web site results first.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A method for organizing a list of Web sites offering online access on an electronic network, said list comprising a plurality of categories of purposeful activities for searches by users of said electronic network, and said method comprising; (a) searching for a plurality of said Web sites that are unlisted; (b) adding said unlisted Web site to said list, if said Web site offers an online service; (c) assigning at least one of said plurality of categories to said Web site that has been added to said list; and(d) constructing a Web site of said electronic network wherein said users have said online access to said list.

Description

A METHOD FOR ORGANIZING AN INTERNET SEARCH ACCORDING TO USER PURPOSEFUL ACTIVITIES
FIELD OF THE INVENTION
The present invention generally relates to a computerized search method. More specifically, the present invention relates to a method for the categorization of Internet sites according to the intended activity of the user. The present invention provides an Internet search method, based on directory selections of purposeful activities intended by system users. More specifically, said invention relates to a method for organizing a list of Web sites offering online access on an electronic network, said list including categories of purposeful activities for searches by users.
BACKGROUND OF THE INVENTION
It is well known that the World Wide Web (WWW), or the Internet, is growing phenomenally. Web sites appear faster than ever, showing up at a rate of more than 4,400 sites per day. As of 1999 there were 5.0 million sites. The number of Web pages, which may be the best gauge of size, has also skyrocketed in the last few years. To date there are around 1.5 billion Web pages, an 88 percent increase from 1998. This suggests 1.9 million Web pages are created each day. The expected number is 8 billion in 2002, exceeding the world's population.
The Internet, which started as a novel way to share information world wide, grew to become the largest 'virtual library' on earth, with millions of sites, or 'books,' containing over a billion pages. The users require some help to find specific information buried under such a phenomenal amount of data. The need for a comprehensive 'index' is filled by a variety of Search Engines (SE's), e.g. AltaVista™, while Directories, e.g. Yahoo!™, provide the 'book catalogue,' or list of major general categories, related to grouping of site's content. For the academic community, which comprised the bulk of the Web users in the early 90's, the directory solution was clearly inadequate. When a user wanted to access specific information from a given site, the directory simply did not provide it. The clear analogy of the Web site to a book is very appropriate. One cannot be expected to find a specific word or issue in a book by reading it all - that's the purpose of the index.
Taking full advantage of the nature of the Web, where links serve as the binding material between various pages, programs were written to 'crawl' the Web, i.e. accessing pages by jumping from link to link, and index them. The user then uses keywords to search within the index, retrieving the list of pages containing the searched-for information. Not surprisingly, one of the first programs that were written to perform that task was named 'Webcrawler'™. Such programs are now known as Search Engines (SE). Notable examples are Northern Light™, Alta Vista ™. and Google™.
Once the SE index was searched for a particular keyword and all the reverences to it were found, the results are presented to the user. Here, too, it is a program that decides the order of presentation. This is not an easy task. Clearly, the retrieved references must be sorted somehow - but what criteria should be used? Ideally, the relevance of the page to the user's search criteria should be the primary criterion. The obvious problem is the definition of what is relevant and the means to determine it in the extracted pages. Each SE has its own definition of 'relevance' and its unique method of deciding the degree of relevance for each page. Several forms of search engines have been patented. For example, Wu teaches a search engine applied to a hierarchy to return documents matching a query term, in US Patent 5,991,756, entitled "Information Retrieval from Hierarchal Compound Documents."
The diversity in the results produced by the various SE's can be attributed to the following factors:
The number of indexed pages varies considerably among them;
The number of pages retrieved from any site and the depth of coverage varies;
The syntax used to query the database varies; and The definition of 'relevance' and the algorithm used to implement it is unique to each SE.
Consequently, users have learned to visit more than one SE in order to get results that are more comprehensive. As soon as the trend was observed, new sites were developed that would save the user the time and trouble to actually fire the same query to each SE, the Meta Search Engine (MSE) does it for them. Some notable examples of MSE' s are AskJeeves™, Mamma™ etc. While very useful in its own right, a MSE has no database of its own. It relies almost entirely on the results produced by the 'real' SE.
However, the last few years witnessed a radical shift in the Web's identity. The 'virtual library' is now but one of the many 'buildings' in the Web's 'virtual city' or 'global village'. In it, there are shops, casinos, real estate agents, banks, brokers, theaters, insurance firms, betting shops, doctors, fortunetellers, auction houses, radio and TV stations and much, much more. New companies were formed to take advantage of the new opportunities, e.g. amazon.com™ and existing companies expanded their distribution channels to encompass the Web, e.g., dell.com™.
Thus, all the sites on the Internet can be broadly divided into two categories:
Those that were created to provide information, i.e. a sophisticated 'book'; and Those that provide an online service.
To clarify, an online service site is defined as a site that requires or expects the user to be active while visiting the site.
With the change in the profile of the Internet, a similar shift took place in the Web user's community. While the first wave of surfers were 'techies', academic and eager to be in the forefront of technology, the later joiners were just 'users.' They regard the Web as yet another communication tool, much like a phone, fax or pager. They do want to get information, but they also would like to take advantage of the new and exciting services that the 'virtual city' has to offer. Therefore two type of users are broadly defined as follows:
(a) The 'reader,' for whom the Web is regarded as a 'library,' is interested in reading the latest news, getting a stock quote and learning of the latest gossip about their favorite pop group; and
(b) The 'doer,' for whom the Web is regarded as a 'virtual city' or 'global village', and on the other hand, wants to sell his shares online, chat with other fans and buy a new shirt online. Doers want to do things, rather than just 'browse', yet none of the existing SEs and Directories cater adequately to such users.
Obviously, the same person can be at times a 'reader' and at other times a 'doer'. Actually, one can change his attitude several times while online.
As mentioned hereinabove, searching is an activity performed regularly by almost all Web users. The problem is — users are not happy with their search results, and the situation is getting worse all the time. What's wrong with the existing SE's? Why is it, then, that the SE's fail to provide the users with good search results?
The problem has two main sources: The Web's size; and The user's behavior and intention.
With over five billion pages already and growing at a tremendous rate, the SE's simply cannot hope to cope. Not only are thousands of pages added every day, but also existing pages get modified - both from in content and location. The latter is particularly annoying, when a 'promising looking' link found after many search attempts turns out to be a 'dead link', the phenomenon known as 'linkrot'. A recent study estimates that a staggering "22% of the links in the NorthernLight SE are broken".
Arguably, most users will be quite happy to put up with partial Web coverage and 'linkrot' if only the results delivered to them were in line with their expectations. Unfortunately, this is rarely the case. There are two main reasons for that: The SE is not a mind reader. When a user searches for some keyword(s), the SE cannot determine the context of those terms; and
Unrelated manipulation, i.e., an algorithm is used to determine the site's ranking and thereby artificially boost a site to occupy a prominent position, irrespective of the 'true' relevance of that site to the user's search intention.
The behaviour of both users and providers of Web search facilities has proven that directories are more effective than search engines:
The most popular search site is directory-based Yahoo, and its popularity is increasing unabated; and
In recent years most SE's have added directory features, most recently Google™. A successful SE, Lycos, has even- decided to abandon its crawler based search results and transform itself entirely to a directory.
Clearly, then, directories are delivering far better search results than SE's. Yet, directories are also failing to fill the search gap for the following reasons:
Size: The directories are aiming to categorize the entire Web. There are millions of Web sites, so the task is enormous;
Consistency: The Web is in constant flux. The content of a page or a site may change significantly over time or be moved elsewhere. Unlike the SE, which rebuilds its index every time its crawler revisits the Web, the directory has no such automatic update;
Categorization: To meaningfully categorize the entire Web, the directory tree becomes very deep. For instance, Yahoo! ™ places 'Star War books' under: Home > Entertainment > Movies and Film > Genres > Science Fiction and Fantasy > Series > Star Wars > Books.
Moreover, the actual categorization of a given site is under the sole discretion of the editors. It may not be the right category according to both the expectation of the user and/or the site owner; and
Context: Just like SE's, the directory is not a mind reader. When searching for 'sell a house' in the Open Directory™, the following link was typical: 'BidText.com: A textbook auction house that enables college students to sell their old books...';
Evidently, the results do contain the user's search terms, but its intention is totally lost.
Therefore, there is a need to provide a system for searching the Internet that effectively combines the search engine and directory concepts, and that delivers to its users search results where all the listed sites found are directly relevant to the user's search objective.
SUMMARY OF THE INVENTION
Accordingly, it is a principal object of the present invention to overcome the disadvantages associated with prior art Internet search systems.
It is a further object of the present invention to distinguish all the Web sites that offer an online service, and to classify the type of service offered.
It is yet another object of the present invention to sell placement in its search results.
It is still a further object of the present invention to generate the most targeted audience.
The present invention relates to a method for organizing a list of Web sites offering online access on an electronic network, said list including a plurality of categories of purposeful activities for searches by users of said electronic network, and said method including: searching for a plurality of said Web sites that are unlisted; adding said unlisted Web site to said list, if said Web site offers an online service; assigning at least one of said plurality of categories to said Web site that has been added to said list; and constructing a Web site of said electronic network wherein said users have said online access to said list.
The method may be that said electronic network is the Internet. The method may include reviewing said Web site for consideration within a specified time period. The method may include charging a fee for reviewing. The method may include assigning sub-categories to said at least one of said plurality of categories. The method may include the option of bidding by said Web site for ranking on said list. The method may include bidding for the first page of rankings. The method may include bidding for ranking. The method may include bidding for ranking in the first five. The method may include bidding for ranking in the first ten. The method may include declining the option of bidding by said Web site. The method may include random listing of said Web site that declines the option of bidding. The method may include performing a periodic quality assurance check on said Web site. The method may include generating a random number by which to select among said Web sites for performing a periodic quality assurance check. The method may include performing a periodic quality assurance check on said Web site which comprises checking for linkrot. The method may include removing said Web site from said list if said Web site fails said periodic quality assurance check.
Using human editors, the service provided by the present invention, has configured an activity directory to classify all the web sites, or other form of node on an electronic network, such as the Internet, that offer an online service. The user is able to search the activity directory by using naturally constructed phrases, illustrated in the above examples. The directory's topmost classification level has, for example, activities like Buy, Exchange, Bid, Play, Publish, Advertise, Trade, Bank, Order, Bet, Study, Obtain etc.
The active directory site generates revenues by selling placement in its search results.
The amount paid is determined by a periodical auction among all the partipants in every directory category. Thus, for instance, all the sites listed under Buy -> Books -> Computers compete in an auction for the first places on the list. The winners have their listing displayed in bold face or otherwise highlighted. The other sites are randomally assigned a lower place or ordered using another criterion, such as alphabetically. Sites may or may not be charged for being listed on the active directory site.
From the subscriber site owners' point of view, the active directory generates the most targeted audience. Clearly, if a user has reached the list of sites classified under Buy -> Books -> Computers, it is because he intends to buy such a book. Capturing the attention of that user is the best advertisement expenditure that any of the listed sites can make.
Development of the active directory Web Site involves the following methods and components. Information for more than 50,000 sites is assembled and categorized. The 'front-end' user interface, or browser are written and fully tested, and are designed to be as simple and clean as possible, providing just the information required. The 'back-end,' or database and the database tools are, preferably, very robust and fast. In short, the active directory emphasizes the use of human classifiers, or 'editors for Web site analysis,' in contrast to reliance on software and computers. Thus, accumulation and classification of data requires a substantial effort, which cannot be effectively short-circuited by any computer program.
It is the intention of the active directory Web site to gain a high penetration of the Web community by means of strategic alliances with other search engines, portals and telephone companies. The majority of prior art search engines and directories try to extend the time spent by the user within their site, for example, by providing all sorts of additional information and the use of banner advertisements. By contrast, the active directory site has no incentive to keep the user on its site, and does not allocate resources for that purpose.
Other features and advantages of the invention will become apparent from the following drawings and description. BRIEF DESCRIPTION OF THE DRAWINGS
For a better understanding of the invention with regard to the embodiments thereof, reference is made to the accompanying drawings, in which like numerals designate corresponding elements or sections throughout, and in which:
Fig. 1 is a bar chart illustration of the prior art trend in consumer usage of electronic communications media;
Fig. 2 is a schematic block diagram illustrating a system for active Internet searches by users, constructed and operated in accordance with the principles of the present invention; and
Fig. 3 is a flow chart of the sequence of processing for a participating Web site, in accordance with the principles of the present invention.
It will be appreciated that the preferred embodiments described as follows are cited by way of example, and that the present invention is not limited to what is particularly shown and described. Rather, the scope of the present invention, as defined by appended claims, includes both combinations and sub-combinations of the various features described, as well as variations and modifications thereof, which would occur to persons skilled in the art upon reading the descriptions, and which are not disclosed in the prior art. Detailed Description Of Preferred Embodiments
Reference is now made to Figure 1, which is a bar chart illustration of the prior art trend in consumer usage of electronic communications media 100.
Figure 2, which is a schematic block diagram 200 illustrating users actively searching the Internet, is constructed in accordance with an exemplary embodiment of the present invention.
In figure 2, user 210 and user 220 are sitting at their Web browsers and accessing the Internet 230. Internet 230 is shown as having a user/client side 232 and a server side 234. On server side 234 are shown online Web site servers 242, 244 and 246, of various service providers, as well as active directory Web site server 250. Servers 242, 244 and 246 may, for example, embody sites that want to sell shares online, offer chats between users and provide exchanges of time-share houses online, respectively. Web site server 250 is, in turn connected to a database and database tools for storage, such as activities 252, sub-categories 254, geographical locations 256, user feedback 258 and bid information 260.
Active directory site server 250 embodies a new web directory that is fundamentally different from all prior art directories on the net. It is based on the following principles:
1. Only sites that provide an online service will be included in the active directory. It will cater mainly to users that wish to be active online, the 'doers' in the web users' community.
2. The first categorization level of activities, prime activity 252 of the directory is based on action or intention, rather than an abstract concept or object.
3. The subsequent levels of the directory such as sub-categories 254, is built differently for each action.
4. Whenever appropriate, search results will be related to user's geographical location 256. 5. The first N, for example five, places of the search results are given to highest bidders, stored with listed sites/bid information 260 among the sites that comprise each category. The auction takes place on a periodic basis.
6. The other places are randomly assigned after each auction or otherwise sorted
7. There will be no other advertisement of any kind.
8. Strict Quality Assurance process ensures minimal 'linkrot'.
9. The User Interface for user interaction and feedback 258 is minimal and functional, allowing speedy operation even on a slow connection, and easy navigation even on small PDA's or WAP phones.
From the 5 million WWW sites only a small fraction is related to online activity. To gauge the probable number of such sites, the keyword 'online' was used to search the Yahoo!™ directory. The results indicated that there are about 33,000 sites and 250 categories that use this keyword. The same search in the Open Directory™ produces 500 categories and 65,000 sites. Even quadrupling that number, results in only about 5% of the web. Yet it is precisely those sites of which the Web's virtual city is comprised, and that the 'doers' of the users' community wish to find. Obviously, many of those sites are of crucial importance to their owners too. They rely on the revenues from e-commerce and other online activities to survive and prosper.
Unfortunately, those sites are deeply buried beneath mountains of static information. Since active directory Web site server 250 only deals with, and accepts listing from sites, which offer online activity, it removes the heap of static information and clearly maps out the boundaries of the virtual city. Its aim is to become the authoritative guide to it.
Estimating the exact number of the 'doers' among the user community is also quite difficult. However, it is abundantly clear that a constantly growing percentage of the users are 'doers'. Evidence shows that the number of commercial transactions on the net is growing strongly, and that growth is predicted to accelerate even faster. Latest research claims that, in effect, all the users are 'doers', as they are online to accomplish a task. Although that may stretch our definition for 'online activity' too far, the point argued is that there is a need for sites that facilitate the user's goal seeking activity.
The real boost to the 'doers' is expected in the next few years, with the introduction of Wireless Application Protocol (WAP) enabled mobile phones. With it, users are able to be online anytime and anywhere, but the small size of the screen prevents intensive browsing. What can be done, quit effectively, are two things: Getting a small chunk of information, like a price quote for a share, or performing an online activity.
There can be many forms for that activity, as explained hereinbelow. The point is that the users cannot use the standard search engines or directories in locating the relevant site. With active directory Web site server 250 this task simple and straightforward.
The major distinction that makes finding information on traditional SE's difficult, but easy with active directory Web site server 250 is the user's intention. When a user is looking for the keywords 'sell a house', his intention, as a 'doer', is to find a site in which he can publish the fact that his house is for sale, along with the information related to it. Thus, his intention in this search is to sell. Similarly, users may want to Buy, Exchange, Bid, Play, Publish, Advertise, Trade, Bank, Order, Bet, Study, Obtain, etc.
Active directory Web site server 250 highest level is the list of the user's possible intentions. In other words, every search starts with the phrase T want to... ' followed by the appropriate verb. This is in contrast with the traditional directory, like Yahoo! ™, which has the following highest level categories: Arts and Humanities, Business and Economy, Computer and Internet, Education, Entertainment, Government, Health, News & Media, Recreation & Sports, Reference, Regional, Science, Social Science, Society & Culture.
Such categorization is not very helpful for a user that wishes to buy, for example, a book on the C++ programming language. Should he look for it under Science,
Reference, News and Media, Education, Computer and Internet or perhaps Business and Economy? One can make a case for each of the options.
Using the Yahoo! ™ direct search facility does not help much. The first result of the search for 'buy book on programming language online' was:
"Book TV - official site for C-SPAN2's™ weekend non-fiction book programming features scheduling information, live Webcasting plus video archives, online author chats, teaching guides, links to buy videotapes." None of the other two results was relevant either.
Once the user's intention is established, the directory branches to different directions of sub-categories 254, based on up-front analysis and user's feedback 258. For instance, if the intention was 'to chat', it is logical to assume that there are two main criteria to select a chat room: Subject and Language. Using the combination of the two criteria, the user can find a list of chat providers that exactly match his expectations.
Using the same example presented above ('buy a book on programming language'), active directory Web site server 250 divides the query into three parts:
Intention: BUY;
Object: BOOK; and
Subject: PROGRAMMING LANGUAGE.
The user is then presented with a list of online shops that deal with programming language books.
Another example could be the verb 'exchange'. Users may want to exchange Opinions', which bring them to the chat area mentioned above, or to exchange their 'house', in which case they are presented with a list of companies that facilitate house exchange for holiday purposes.
Some activities are obviously non-territorial, like 'play' or 'chat' Some, however, clearly are. For instance, when the user wishes to 'buy groceries', it is of no value to him to see the list of the UK online grocery retailers when he lives in the US. To ensure that users only get the relevant search result, every service-provider site listed is analyzed in terms of geographical coverage. Users are given the opportunity to register with the site, by providing just two data elements: email address and a general indication of their earth-bound address. In most instances the latter includes only the city, country and the zip code. Using that information, active directory Web site server 250 is able to filter the query results and present the user with the list of sites that service the user's area.
Fig. 3 is a flow chart of the sequence of processing 300 for a participating Web site, in accordance with the principles of the present invention.
In block 305 a new Web site is located, and is considered for listing. The Web site is analyzed as to whether it offers an online Service 310. If so, as an optional feature of an exemplary embodiment, the site is asked to pay $200 315, for example, to be assigned to have a review within 48 hours 320, and if the fee is not paid, the site waits for review at the end of the review queue 325. In any case, when the site is reviewed, it is assigned a primary activity and sub-categories 330.
If the site is willing to bid on its ranking in its category 335, and the bid is among the top N in the category 340, the site is listed according to its bid ranking on the Web page in its category 355. If unwilling to bid, or the bid is not among the top five, then the site is listed randomly, or otherwise sorted on the Web page 350.
Once listed the site is subject to a periodic review for quality assurance. Aquality assurance random number is generated 360, for all listed sites. If the generated number matches the Web site 365, then the site is checked for 'linkrot' 370. If 'linkrot' is found, the site is removed from the list 375. If 'linkrot' is not found, or if the generated number does not match the site, then the number generation cycle is repeated 360.
In an alternative embodiment, each site is marked with the date it was entered. Every day in every month, the QA process checks all the sites that have been entered on that particular day of any previous month. Thus, no site is ignored for more than a month.
Active directory Web site server 250 is preferably built to maximize the user's satisfaction during access of the service. The following guidelines ensure that the user's expectations are met and even superseded:
A strict Quality Assurance process guarantees that the categorization of sites is correct. Sites are closely scrutinized to ensure that their content and mode of operation merits inclusion in the listing;
There are preferably no duplicate entries, i.e. the same site is not listed more than once, in a search result page;
The percentage of 'dead links' is preferably kept below 0.5%, with continuous efforts to reduce it further;
The site has a clear, concise and functional layout. Note the contrast, for example, between the clean www.google.com™ and the overloaded www.excite.com™;
Users have a variety of ways to retrieve information from the service site; they can use phrases, keywords as well as follow the directory branches; and
Extensive use is made of 'related categories'; for instance, a user that wishes to buy a new DVD player would probably like to compare the prices and the terms offered by the online shops.
'Targeted Advertisement' is the age-old elusive goal of marketing directors. The aim is to promote the service or product to a receptive audience. For example, an advertiser will avoid paying to promote gardening tools in a Motor Boat magazine or fishing equipment in a Home and Garden publication. Yet this is exactly what happens on the Web, because placements have been targeted haphazardly, with insufficient control. UK residents see banner advertisements of US-related products; toy retailers promote themselves on financial sites, etc.
That is probably why banner advertisement effectiveness has declined. The click-through rate (percentage of people that actually click on the banner advertisement) has been falling steadily in the last couple of years, reaching an average of 0.5%. In addition, banners are considered a big nuisance for the majority of users, claiming the disreputable honor of being the highest-ranking cause of complaint. They cause slow response time for all the pages in which they appear and unlike TV advertisement they are always present - you cannot switch channels, and sometimes 'lock-in' the user, with no 'back-click' option. Little wonder then that advertising agencies are urging companies to consider very carefully where they actually place those banners. Their advice is simple - find a site where the audience is actually interested in the product. In other words - 'Target Advertisement'.
The active directory Web site of the present invention allows participating service provider Web sites to purchase the position, or rank, in which their site appears in the search result page. Users tend to click on the first link that seems suitable for their needs. If all the links are equally relevant, the user's tendency is to click on the first link.
The purchase is done in the form an auction. Each week/fortnight/month (or other period) all the companies listed in a given category are asked to bid for their place. The available places are preferably allocated in the strict order of the bidding amount. To maximize revenues, the following measures are taken:
Only the first N, e.g. 5, places are available for bidders;
The first N, e.g. 5, places are visible without the need to scroll, if possible
The place of the other sites are randomly assigned, or otherwise sorted,after each auction; and
Every week/fortnight/month (or other period) a detailed statistical report of the click-through rate for the first N', e.g. 20, positions is sent to all the sites in every category, with details of the sums paid by the last auction winners.
The following additional optional charges are available: A relatively small fee payable for each site listed;
Since a human editor must review the entries before being approved for listing, there may be a considerable delay between the time that a site is submitted and the time that it is approved; by paying a nominal sum, companies can ensure that their site is reviewed quickly;
There are many sites on the web that provide a web search option; the intention is to keep the user on their site for as long as possible; such sites are willing to pay for the option to incorporate search results from active directory Web site server
250 into their service; and
In categories where the competition is intense, more than five places are allocated to bidders (up to a maximum of 10).
The experience of well-known directories shows that the editors do not have to chase after new web sites. Actually, the reverse is true. Site owners are anxious to get their site listed in all the major SE's and directories. Doing so is the best, and cheapest, way to generate web traffic directed to their site. Taking full advantage of the situation, some Directories now charge $200 to get the submitted site reviewed by their editors. The payment only guarantees that the site will be visited within 48 hours, not that it will be listed!
Active directory Web site server 250 has three stages of operation:
Initial Screening - searching the net for suitable sites and submitting them for review;
Site Review - reviewing the submitted sites and populating the database; and
Quality Assurance - random check of the accepted sites and responsibility for the integrity of the database.
True to its own vision of the web as a virtual city, all of the active directory Web site's promotions and marketing efforts are through Web channels. The following tools and methods are used to attract visitors and to raise interest among site owners to encourage them to bid for placement of their site on the search results first page:
All site owners that are listed on the active directory Web site receive a weekly summary containing the following information, for each category in which the site appears:
How many searches were conducted on the active directory Web site site;
In how many search results the site was displayed;
How many users have clicked on any of the sites listed;
How many users have clicked on the site's link;
How many users have chosen to click on each of the first 10 entries; and
What were the winners' bids in the last auction; Many portals and other sites provide their users with a Web search functionality within their site; a special interface program is provided, free of charge, to any site that wants to use the active directory Web site directory; the program checks each query submitted by the users, and if it is phrased in such a way that suits the active directory Web site structure, the directory becomes the first point of reference; The active directory Web site is submitted to all the major Search Engines and Directories; no special effort is made to get a high ranking in any of them; For sites that do not wish to participate in the auction, the active directory Web site can offer an improved chance for a high placing in return for creating links to it on the site home page, or other pages; and
The active directory Web site signs reciprocal agreements with major Search Engines, so that when the user searches for something that is not available on the site directory, the Search Engine is invoked to provide some results; similarly, if the active directory Web site can best answer the user's query on the affiliated Search Engine - the Search Engine lists the active directory Web site results first.

Claims

Claims:
1. A method for organizing a list of Web sites offering online access on an electronic network, said list comprising a plurality of categories of purposeful activities for searches by users of said electronic network, and said method comprising;
(a) searching for a plurality of said Web sites that are unlisted;
(b) adding said unlisted Web site to said list, if said Web site offers an online service;
(c) assigning at least one of said plurality of categories to said Web site that has been added to said list; and
(d) constructing a Web site of said electronic network wherein said users have said online access to said list.
2. A method according to claim 1 , wherein said electronic network is the Internet.
3. A method according to claim 1, and further comprising reviewing said Web site for consideration within a specified time period.
4. A method according to claim 3, wherein a fee is charged for said reviewing.
5. A method according to claim 1, wherein said assigning further comprises assigning sub-categories to said at least one of said plurality of categories.
6. A method according to claim 1, wherein said adding further comprises the option of bidding by said Web site for ranking on said list.
7. A method according to claim 6, further comprising declining the option of bidding by said Web site.
8. A method according to claim 6, wherein said bidding is for ranking.
9. A method according to claim 8, wherein said bidding is for the first page of rankings.
10. A method according to claim 8, wherein said bidding is for ranking in the first five rankings.
11. A method according to claim 8, wherein said bidding is for ranking in the first ten rankings.
12. A method according to claim 11, and further comprising random listing of said Web site that declines the option of bidding.
13. A method according to claim 1, and further comprising performing a periodic quality assurance check on said Web site.
14. A method according to claim 13, and further comprising generating a random number by which to select among said Web sites for said performing a periodic quality assurance check.
15. A method according to claim 13, wherein said performing a periodic quality assurance check on said Web site comprises checking for linkrot.
16. A method according to claim 13, and further comprising removing said Web site from said list if said Web site fails said periodic quality assurance check.
PCT/IL2002/000385 2001-05-22 2002-05-19 A method for organizing an internet search according to user purposeful activities WO2002095624A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IL14328801A IL143288A0 (en) 2001-05-22 2001-05-22 A method for organizing an internet search according to user purposeful activities
IL143288 2001-05-22

Publications (2)

Publication Number Publication Date
WO2002095624A2 true WO2002095624A2 (en) 2002-11-28
WO2002095624A3 WO2002095624A3 (en) 2003-10-16

Family

ID=11075428

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IL2002/000385 WO2002095624A2 (en) 2001-05-22 2002-05-19 A method for organizing an internet search according to user purposeful activities

Country Status (2)

Country Link
IL (1) IL143288A0 (en)
WO (1) WO2002095624A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004102925A1 (en) * 2003-05-16 2004-11-25 Docomo Communications Laboratories Europe Gmbh Personalized discovery of services

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0777189A2 (en) * 1995-12-01 1997-06-04 Matsushita Electric Industrial Co., Ltd. Hyper-text document preparing apparatus
WO1999053669A1 (en) * 1998-04-15 1999-10-21 Linkguard Ltd Computer system for managing links and method using said system
EP1014283A1 (en) * 1998-12-17 2000-06-28 Picom Software System Ltd. Intranet-based cataloguing and publishing system and method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0777189A2 (en) * 1995-12-01 1997-06-04 Matsushita Electric Industrial Co., Ltd. Hyper-text document preparing apparatus
WO1999053669A1 (en) * 1998-04-15 1999-10-21 Linkguard Ltd Computer system for managing links and method using said system
EP1014283A1 (en) * 1998-12-17 2000-06-28 Picom Software System Ltd. Intranet-based cataloguing and publishing system and method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
ABDELZAHER T F ET AL: "Web server QoS management by adaptive content delivery" QUALITY OF SERVICE, 1999. IWQOS '99. 1999 SEVENTH INTERNATIONAL WORKSHOP ON LONDON, UK 31 MAY-4 JUNE 1999, PISCATAWAY, NJ, USA,IEEE, US, 31 May 1999 (1999-05-31), pages 216-225, XP010335409 ISBN: 0-7803-5671-3 *
BRIN ET AL.: "The Anatomy of a Large Scale Hypertext Web Search Engine" STANFORD UNIVERSITY CS PUBLICATION, [Online] 1998, XP002247969 Stanford, Ca. Retrieved from the Internet: <URL:seregeyÐcs.stanford.edu> [retrieved on 2003-07-15] *
FERNANDEZ M ET AL: "STRUDEL: a Web-site management system" ACM PROCEEDINGS OF SIGMOD. INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, XX, XX, vol. 26, no. 2, 13 May 1997 (1997-05-13), pages 549-552, XP002181629 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8086658B2 (en) 2003-05-06 2011-12-27 Ntt Docomo, Inc. Personalized discovery of services
WO2004102925A1 (en) * 2003-05-16 2004-11-25 Docomo Communications Laboratories Europe Gmbh Personalized discovery of services
WO2004102417A1 (en) * 2003-05-16 2004-11-25 Docomo Communications Laboratories Europe Gmbh Personalized service selection
CN1771712B (en) * 2003-05-16 2012-08-01 株式会社Ntt都科摩 Synergistic discovery of services

Also Published As

Publication number Publication date
IL143288A0 (en) 2002-04-21
WO2002095624A3 (en) 2003-10-16

Similar Documents

Publication Publication Date Title
US8260786B2 (en) Method and apparatus for categorizing and presenting documents of a distributed database
US7231395B2 (en) Method and apparatus for categorizing and presenting documents of a distributed database
US9076148B2 (en) Dynamic pricing models for digital content
CN102124462B (en) Query identification and association
KR100857049B1 (en) Automatically targeting web-based advertisements
JP4672726B2 (en) Method and system for identifying keywords used when issuing keyword-targeted advertisements
US8082298B1 (en) Selecting an advertising message for presentation on a page of a publisher web site based upon both user history and page context
Clay et al. Search engine optimization all-in-one for dummies
US20050149390A1 (en) Method and system for generating and placing keyword-targeted advertisements
US20040133469A1 (en) System and method of promote website using Cycle Hits and Hits History
US20080228587A1 (en) Sponsored listing recommendation engine
US20110035256A1 (en) Systems and methods for prioritized selection of media properties for providing user profile information used in advertising
US20090112844A1 (en) Representative keyword selection
US20050149388A1 (en) Method and system for placing advertisements based on selection of links that are not prominently displayed
US20140012672A1 (en) Content-based bidding in online advertising
US20070233503A1 (en) Bidding for on-line survey placement
US20140012671A1 (en) Content-based targeted online advertisement
WO2002095624A2 (en) A method for organizing an internet search according to user purposeful activities
CA2550853C (en) Method and system for placing advertisements based on selection of links that are not prominently displayed
Alqahtani et al. Search Engine Strategy Report Case Study University of Dundee
Brown The complete guide to Google advertising: including tips, tricks, & strategies to create a winning advertising plan
Henshaw What next for Internet journals?
Thies How To Prosper With The New Google
McDonald et al. The Internet as a marketing medium: An introduction to Search Engine Optimization
Kassel Powerize: If It's Free, It's for Me, But.

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SD SE SG SI SK SL TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP