WO1997038378A1 - Procede d'organisation d'informations extraites du reseau internet a l'aide d'un systeme de representation expert - Google Patents

Procede d'organisation d'informations extraites du reseau internet a l'aide d'un systeme de representation expert Download PDF

Info

Publication number
WO1997038378A1
WO1997038378A1 PCT/US1997/005782 US9705782W WO9738378A1 WO 1997038378 A1 WO1997038378 A1 WO 1997038378A1 US 9705782 W US9705782 W US 9705782W WO 9738378 A1 WO9738378 A1 WO 9738378A1
Authority
WO
WIPO (PCT)
Prior art keywords
documents
search
concepts
knowledge based
query
Prior art date
Application number
PCT/US1997/005782
Other languages
English (en)
Inventor
Thomas Kirk
Original Assignee
At & T Corp.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by At & T Corp. filed Critical At & T Corp.
Priority to JP9536436A priority Critical patent/JP2000508450A/ja
Priority to EP97921115A priority patent/EP0976062A1/fr
Publication of WO1997038378A1 publication Critical patent/WO1997038378A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles

Definitions

  • This invention relates to the field of accessing information on the Internet and, more particularly, to a method of organizing information retrieved from the Internet using a knowledge based representation system.
  • the Internet is a series of inter-connected networks which facilitate the exchange of information, data, and files. Users connected to the Internet have access to the vast amount of information on these networks.
  • a typical way of getting access to the Internet is through an online service server. Referring to FIG. 1, networks 110, 1 12, and 1 14 are connected to Internet 100 via online service servers 120, 122, and 124, respectively.
  • Another way of getting access to the Internet is through a dial-in Internet provider.
  • P.C personal computer
  • Routers which connect computers and networks, direct traffic in a network and on the Internet. Routers 160, 162. 164. and 166 examine packets of data that travel across the networks and Internet to determine where the data is headed.
  • Online service servers and Internet providers allow users to search the World Wide Web ("Web"), a globally connected network on the Internet, using software programs known as search engines 130, 132, 134, and 154. Search engines are also known as search tools and Web crawlers. These search engines travel across the Web gathering documents by following the hypertext links found in Web (home) pages 140, 142, 144, and 156.
  • Web World Wide Web
  • Keywords One way of searching the Internet is by keywords. For example, a user types in a query string of keywords that describes the information he is looking for.
  • the search engine searches databases on the Internet and results are returned in hypertext markup language ("HTML") pages.
  • HTML hypertext markup language
  • a user can then view a document of interest by "clicking" on a link to that document. Clicking refers to the process of actuating a mouse switch by centering a cursor on the desired item.
  • present search engines provide for searching of keywords on the Internet, the vast amounts of information on the Internet makes getting relevant information difficult.
  • keyword searches typically result in a return of vast amounts of information that the user must browse through in order to retrieve the relevant information. Thus, what is required is a more effective method of retrieving information from the Internet. Summary of the Invention
  • This information retrieval and management system associates a knowledge base with search servers to improve the relevance and precision of search tasks.
  • the knowledge base provides a user profile (topic taxonomy) that reflects the interests and preferences of the user for organizing information.
  • the system uses this knowledge base to organize the results of keyword searches.
  • the system automatically categorizes and segments search results in accordance with the knowledge base to provide for easy searching of relevant information.
  • the system displays the search results over a subset of the knowledge based topic taxonomy, segmenting the results in a way that makes it easy to find the most relevant documents, and filtering out irrelevant results.
  • FIG. 1 illustrates a diagram of computers and networks and their connection to the Internet for discussion of the environment in which the present invention operates;
  • FIG. 2 is a block diagram of an exemplary knowledge based browser displaying a graphical representation of a concept generalization taxonomy in accordance with the principles of the present invention
  • FIG. 2a is an actual screen display of the exemplary knowledge based browser of FIG. 2;
  • FIG. 3 is a block diagram of a search interface in accordance with the principles of the present invention.
  • FIG. 4 shows a flow diagram illustrating the steps required for a user to retrieve information from the Internet and organize it using knowledge based representation.
  • FIG. 1 there is shown an environment for the present invention including exemplary networks 1 10, 1 12, and 1 14 and P.C.'s 158 and 159 which are inter-connected to Internet 100.
  • These networks comprise users who are connected to one another in, for example, a token ring network (network 114) or through an Ethernet network (networks 1 10 and 1 12).
  • Each network further comprises a server 120, 122, and 124.
  • a server is a host computer that allows users to communicate with each other on the network or with users outside the network through the Internet.
  • Users on P.C.'s 158 and 159 may subscribe to Internet Provider 150, which allows users to communicate with each other and other users on the Internet. Any user may search for information available on the Internet.
  • search engines 130, 132, 134, and 154 that allow users to search by keywords.
  • search engines are computer programs which are search-application based programs that run on online service servers 120, 122, 124, and Internet provider 150. Searching by keywords typically results in a return of vast amounts of information that the user must browse through in order to get the desired information.
  • client/server model By client/server model is intended a user running a piece of software on his computer or a shared program of a server-the client ⁇ to use the resources of a distant server computer (other servers connected on the Internet).
  • client/server model is intended a user running a piece of software on his computer or a shared program of a server-the client ⁇ to use the resources of a distant server computer (other servers connected on the Internet).
  • a user on P.C. 1 10a may search for information on online service servers 122 and 124 and Internet provider 150.
  • a user on P.C. 156 may search for information on online service servers 120, 122, and 124.
  • the distant servers e.g., online service servers 120, 122, and 124 and Internet provider 150, are also called hosts because they serve many users of many networks. The hosts allow many different clients to access their resources at the same time; the hosts are not devoted to a single user.
  • indexes present a highly structured way of finding information. Indexes let users browse through information by categories such as arts, computers, entertainment, sports, etc.
  • a user on his P.C. 110a can click on a category by, typically, using his mouse 110b and is presented with a series of subcategories. For example, under sports a user may find baseball, basketball, football, etc. Depending on the size of the index, there may be several layers of subcategories. When the user gets to the subcategory he is interested in, he will be presented with a list of relevant documents. To get to those documents, the user clicks on links to them.
  • “Yahoo!” is the name of a popular index on the Internet. Yahoo! and other indexes also allow users to search through them by typing in words that describe information that the user is looking for. The user then gets a set of search results-links to documents that match his search. To get the information, the user clicks on a link to the document.
  • search engines also known as search tools.
  • Search engines operate on essentially static pre-built indexes, i.e., the indexes are built up from online content and stored in a database on a search server. Web crawlers are used by the search engines for gathering the online content that is retrieved and indexed in the search server's database.
  • Some popular Internet search engines include Lycos, WebCrawler, and Alta Vista.
  • a user types in keywords that describe the information he wants. Results that match the user's search criteria from the search are sent back to the user. From the list of results, the user can retrieve a document by clicking on a link to that document.
  • indexes and search engines allow users to find information on the Internet, the information found is typically large and often difficult to locate relevant information. Therefore, it is desirable to automatically categorize search results found on the Internet so as to allow users to easily browse through the search results to find relevant information.
  • the present invention is directed to a knowledge based information retrieval and management system that enhances searches on any multi-network system such as the Internet.
  • the system provides users with means to superimpose a tailored conceptual organization over the information found on the Internet, thereby enriching the usefulness of and access to that information.
  • the system is integrated with existing Web browsers 130, 132, 134, and 154 to create a seamless environment combining hypertext browsing with conceptual navigation.
  • the system may also be stored on a personal computer, e.g., P.C. 11 Oa, in which case only users with access to that personal computer may use the system.
  • FIG. 2 it illustrates an exemplary knowledge based browser which displays a graphical representation of a concept generalization taxonomy 200 in accordance with the present invention.
  • a taxonomy is a generalization hierarchy which graphically displays relationships between concepts.
  • a concept is an abstract description of an object.
  • Nodes in FIG. 2 correspond to knowledge base concepts (e.g., 210, 220, 230, 212, 214, etc.), and edges (e.g., 210a. 210b, 220a. etc.) connecting the nodes indicate subsumption relationships between the concepts.
  • a feature of the present invention is the system can manage the subsumption relationships automatically based on concepts and instances (270, 280).
  • An instance is a specific realization of a concept, i.e., a concept is an abstract description of something while an instance of that concept is a real object that satisfies that description. For example, when a new document is added to the knowledge based browser as an instance, the system infers all the places it belongs in the taxonomy.
  • FIG. 2a is an actual screen display of the exemplary knowledge based browser of FIG. 2, illustrating the concept generalization taxonomy 200 and the subsumption relationships between concepts and instances.
  • the search interface operates similarly to that of the knowledge base browser.
  • the search interface uses a knowledge base to refine search results by segmenting and categorizing results with respect to a user's concept generalization taxonomy. For example, after results from a keyword search have been combined in a result set for display, the system provides an additional refining step that can further focus the result set.
  • Refining the result set against the knowledge base involves retrieving the documents in the result set and processing them with the knowledge base pattern matchers. Textual patterns associated with concepts in the knowledge base allow the knowledge representation system to categorize and organize these documents within the concept taxonomy. Each pattern in the knowledge base is associated with a concept. Stated another way, each document is compared against these pattern matchers to determine whether there are any concepts that match the document.
  • the output of this comparison process is a set of specific concepts in the knowledge base that have some correspondence to the content of the document.
  • a record of a match between a concept and the document is made in the knowledge base by creating a temporary instance whose description includes the matched concepts.
  • the refined search result is presented graphically over a subset of the knowledge base topic taxonomy. This subset is defined by those concepts having one or more of the temporary instances created during the matching process. This is illustrated in FIG. 3 where only those concepts that match the contents of a document are displayed.
  • the present invention of using a knowledge based representation system in organizing data is especially helpful when a keyword search results in thousands of documents.
  • By running pattern matchers against those documents one can quickly narrow down those documents that are most relevant to the user.
  • the knowledge based representation system (browser and search interface) of the present invention allow users to quickly find relevant information.
  • Another feature of the taxonomy is that by grouping the results according to concepts, a user may zoom in on the pan that he thinks is most relevant. This further enhances searching on the Internet by saving browsing time.
  • the search interface further implements transparent, concurrent access to multiple index servers in order to maximize query coverage and minimize response latency.
  • the query system ensures that only those index servers capable of handling the query are consulted.
  • the user interface provides editors for extending and reorganizing the concept hierarchy.
  • the user interface also provides for a navigation browser that maintains an interactive graphical map of the navigation history.
  • the navigation browser is a tree-structured graphical representation of the user's browsing history. Its function is as follows: as the user browses, he generates an ordered sequence of the web sites he visits, following links from one page to another. As he backtracks and makes new browsing choices, the browsing history becomes a branching tree. The navigation browser keeps track of these choices adding new nodes to the tree for every site/page visited. This tree, besides showing an overview of the browsing history, becomes an alternative way to navigate (by clicking on the node in the tree to return to the associated page).
  • the system architecture separates the knowledge base from the client to allow the user to maintain a consistent view of his information space regardless of the client's location. By keeping the knowledge base in one place, the environment can follow the user from one platform to another.
  • An advantage of the separation is to help ensure continuous availability of the system server since it provides shared access to the knowledge base and performs autonomous monitoring tasks even when the client is inactive or disconnected.
  • the knowledge base may be stored on another server, separated from the client.
  • this flowchart illustrates the steps required for a user to retrieve information from the Internet and organize it using a knowledge based representation system in accordance to the present invention.
  • a user enters a query string of keywords to be searched on his personal computer 110a using a knowledge based Web browser 130 in accordance with the present invention.
  • the knowledge based Web browser is a software that may be installed in either a client 1 10a or server 120.
  • step 403. the query string is pre-processed to determine which search servers are capable of understanding the query syntax. This is done by examining the Universal Resource Locator ("URL") of the query string to determine which server(s) to send the request to.
  • URL Universal Resource Locator
  • the query has to be translated into specific query syntax of the server that the user is requesting information.
  • a query translator is provided with an interface to the server for serving the query.
  • step 405 queries are sent to each server that can handle the expression. Queries may be sent out serially or concurrently. An advantage of sending out the queries concurrently is reduction of latency in both the network and search process. In other words, all servers can work on a query at the same time.
  • step 407 depending on the result size threshold, individual servers may need to be queried repeatedly in order to gather the specified number of matches.
  • Most servers in order to limit the amount of resources that are used for a given query, will break the results coming back into some reasonable sets that are returned. For example, if there is a hundred hits for a search, a server may be set up to return only ten hits at a time. As such, if the specified number of matches is reached, then the procedure proceeds. If the specified number of matches has not been reached, then the servers are repeatedly queried until it has been reached. In step 409, the results that come back from the servers are merged into a single result set.
  • Each item in the result set consists of a reference to a document (a URL) and possibly a single line of descriptive text.
  • step 41 1 if the user desires further refinement of the result set. he can request that the results be compared against the knowledge base pattern matchers. Else, the result set is displayed for the user.
  • step 413 the document for each reference in the result set is retrieved.
  • step 415 the pattern matcher(s) is applied to the document text to determine whether there are any topic concepts that match the text.
  • step 417 a list of topic concepts that match the text of the document are generated.
  • step 419 an instance is created for each document that matches a concept.
  • step 421 the instance for the document is classified in the knowledge base's topic taxonomy.
  • steps 413-421 is parallelized to minimize the effects of network latency in gathering the text, since the result set may contain dozens or hundreds of documents to retrieve.
  • the system incrementally displays the post- processed results graphically over a subset of the topic taxonomy, where the subset is defined by the collection of concepts having one or more instances from the search result. This is done to categorize and segment the search result with respect to concepts that are familiar and meaningful to the user.
  • the search result may be browsed at various levels of detail, depending on how specific one wishes the segments to be. What has been described is merely illustrative of the application of the principles of the present invention. Other arrangements and methods can be implemented by those skilled in the art without departing from the spirit and scope of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Système et procédé d'organisation d'une représentation électronique de documents dans un système de représentation expert. Le système de représentation expert fonctionne dans un environnement où les ordinateurs et les réseaux sont interconnectés et où les documents peuvent être extraits des ordinateurs et des réseaux. Une consultation est créée pour la recherche des documents. Le système détermine quels sont les ordinateurs et les réseaux capables de comprendre la syntaxe de la consultation ou interrogation. Cette dernière est envoyée à chacun des ordinateurs et réseaux pouvant la traiter. Le système reçoit des résultats en rapport avec les documents provenant des ordinateurs et réseaux. On fusionne les résultats pour obtenir un seul ensemble de résultats. Chacun des résultats contient une référence relative à chaque document. Les documents sont ensuite affinés en les comparant à des motifs de correspondance de texte de la base de connaissance. L'affinage s'effectue en extrayant un document pour chacune des références puis en appliquant les motifs de correspondance aux documents. Le système détermine une liste de concepts qui correspondent aux documents. Le système fournit les documents au système de représentation expert sous la forme d'exemples des concepts.
PCT/US1997/005782 1996-04-10 1997-04-08 Procede d'organisation d'informations extraites du reseau internet a l'aide d'un systeme de representation expert WO1997038378A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP9536436A JP2000508450A (ja) 1996-04-10 1997-04-08 インターネットから検索される情報を知識ベース表現を使用して編成する方法
EP97921115A EP0976062A1 (fr) 1996-04-10 1997-04-08 Procede d'organisation d'informations extraites du reseau internet a l'aide d'un systeme de representation expert

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US1791296P 1996-04-10 1996-04-10
US82694097A 1997-04-08 1997-04-08
US60/017,912 1997-04-08
US08/826,940 1997-04-08

Publications (1)

Publication Number Publication Date
WO1997038378A1 true WO1997038378A1 (fr) 1997-10-16

Family

ID=26690501

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1997/005782 WO1997038378A1 (fr) 1996-04-10 1997-04-08 Procede d'organisation d'informations extraites du reseau internet a l'aide d'un systeme de representation expert

Country Status (4)

Country Link
EP (1) EP0976062A1 (fr)
JP (1) JP2000508450A (fr)
CA (1) CA2251043A1 (fr)
WO (1) WO1997038378A1 (fr)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000077690A1 (fr) * 1999-06-15 2000-12-21 Kanisa Inc. Systeme et procede de gestion de documents bases sur plusieurs taxonomies des connaissances
US6418439B1 (en) * 1997-11-12 2002-07-09 Ncr Corporation Computer system and computer implemented method for translation of information into multiple media variations
EP1277117A1 (fr) * 2000-03-20 2003-01-22 Siftology Interface pour presenter des informations
SG97922A1 (en) * 2000-08-21 2003-08-20 Kent Ridge Digital Labs Knowledge discovery system
US6711585B1 (en) 1999-06-15 2004-03-23 Kanisa Inc. System and method for implementing a knowledge management system
KR100434902B1 (ko) * 2000-08-28 2004-06-07 주식회사 에이전트엑스퍼트 지식 기반 맞춤 정보 제공 시스템 및 그 서비스 방법
US6980984B1 (en) 2001-05-16 2005-12-27 Kanisa, Inc. Content provider systems and methods using structured data
US7028250B2 (en) 2000-05-25 2006-04-11 Kanisa, Inc. System and method for automatically classifying text
US7065532B2 (en) * 2002-10-31 2006-06-20 International Business Machines Corporation System and method for evaluating information aggregates by visualizing associated categories
US7206778B2 (en) 2001-12-17 2007-04-17 Knova Software Inc. Text search ordered along one or more dimensions
US7337158B2 (en) 2000-03-06 2008-02-26 Consona Crm Inc. System and method for providing an intelligent multi-step dialog with a user
WO2010006416A1 (fr) * 2008-06-30 2010-01-21 Ali Davar Système et procédé pour interagir avec une pluralité de moteurs de recherche
USRE42262E1 (en) 1999-08-30 2011-03-29 Frederick Monocacy Llc Method and apparatus for representing and navigating search results

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6609124B2 (en) * 2001-08-13 2003-08-19 International Business Machines Corporation Hub for strategic intelligence

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1990008360A1 (fr) * 1989-01-12 1990-07-26 Telebase Systems, Inc. Systeme et procede servant a extraire des informations de plusieurs bases de donnees
JPH06124308A (ja) * 1992-10-14 1994-05-06 Fujitsu Ltd 情報整理処理装置
WO1995012173A2 (fr) * 1993-10-28 1995-05-04 Teltech Resource Network Corporation Synthese d'exploration de bases de donnees a caracteristiques determinees par l'utilisateur

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09259138A (ja) * 1996-03-21 1997-10-03 N T T Data Tsushin Kk 分類情報表示方法及び情報検索装置

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1990008360A1 (fr) * 1989-01-12 1990-07-26 Telebase Systems, Inc. Systeme et procede servant a extraire des informations de plusieurs bases de donnees
JPH06124308A (ja) * 1992-10-14 1994-05-06 Fujitsu Ltd 情報整理処理装置
WO1995012173A2 (fr) * 1993-10-28 1995-05-04 Teltech Resource Network Corporation Synthese d'exploration de bases de donnees a caracteristiques determinees par l'utilisateur

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
BALDAZO R: "NAVIGATING WITH A WEB COMPASS", BYTE, vol. 21, no. 3, 1 March 1996 (1996-03-01), pages 97/98, XP000600179 *
GINSBERG A: "A UNIFIED APPROACH TO AUTOMATIC INDEXING AND INFORMATION RETRIEVAL", IEEE EXPERT, vol. 8, no. 5, 1 October 1993 (1993-10-01), pages 46 - 56, XP000413472 *
JACOBS P S ET AL: "SCISOR: EXTRACTING INFORMATION FROM ON-LINE NEWS", COMMUNICATIONS OF THE ASSOCIATION FOR COMPUTING MACHINERY, vol. 33, no. 11, 1 November 1990 (1990-11-01), pages 88 - 97, XP000173090 *
PATENT ABSTRACTS OF JAPAN vol. 018, no. 417 (P - 1781) 4 August 1994 (1994-08-04) *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6418439B1 (en) * 1997-11-12 2002-07-09 Ncr Corporation Computer system and computer implemented method for translation of information into multiple media variations
US7401087B2 (en) 1999-06-15 2008-07-15 Consona Crm, Inc. System and method for implementing a knowledge management system
US6711585B1 (en) 1999-06-15 2004-03-23 Kanisa Inc. System and method for implementing a knowledge management system
WO2000077690A1 (fr) * 1999-06-15 2000-12-21 Kanisa Inc. Systeme et procede de gestion de documents bases sur plusieurs taxonomies des connaissances
USRE44794E1 (en) 1999-08-30 2014-03-04 Frederick Monocacy Llc Method and apparatus for representing and navigating search results
USRE42262E1 (en) 1999-08-30 2011-03-29 Frederick Monocacy Llc Method and apparatus for representing and navigating search results
US7539656B2 (en) 2000-03-06 2009-05-26 Consona Crm Inc. System and method for providing an intelligent multi-step dialog with a user
US7337158B2 (en) 2000-03-06 2008-02-26 Consona Crm Inc. System and method for providing an intelligent multi-step dialog with a user
EP1277117A4 (fr) * 2000-03-20 2005-08-17 Siftology Interface pour presenter des informations
EP1277117A1 (fr) * 2000-03-20 2003-01-22 Siftology Interface pour presenter des informations
US7028250B2 (en) 2000-05-25 2006-04-11 Kanisa, Inc. System and method for automatically classifying text
SG97922A1 (en) * 2000-08-21 2003-08-20 Kent Ridge Digital Labs Knowledge discovery system
KR100434902B1 (ko) * 2000-08-28 2004-06-07 주식회사 에이전트엑스퍼트 지식 기반 맞춤 정보 제공 시스템 및 그 서비스 방법
US6980984B1 (en) 2001-05-16 2005-12-27 Kanisa, Inc. Content provider systems and methods using structured data
US7206778B2 (en) 2001-12-17 2007-04-17 Knova Software Inc. Text search ordered along one or more dimensions
US7065532B2 (en) * 2002-10-31 2006-06-20 International Business Machines Corporation System and method for evaluating information aggregates by visualizing associated categories
WO2010006416A1 (fr) * 2008-06-30 2010-01-21 Ali Davar Système et procédé pour interagir avec une pluralité de moteurs de recherche

Also Published As

Publication number Publication date
CA2251043A1 (fr) 1997-10-16
EP0976062A1 (fr) 2000-02-02
JP2000508450A (ja) 2000-07-04

Similar Documents

Publication Publication Date Title
EP0958541B1 (fr) Explorateur de reseau intelligent utilisant un indexeur conceptuel incrementiel
US7428533B2 (en) Automatic generation of taxonomies for categorizing queries and search query processing using taxonomies
US6101491A (en) Method and apparatus for distributed indexing and retrieval
US7383299B1 (en) System and method for providing service for searching web site addresses
US6321228B1 (en) Internet search system for retrieving selected results from a previous search
US7054875B2 (en) Method and apparatus for notifying a user of new data entered into an electronic system
US6704729B1 (en) Retrieval of relevant information categories
US6363377B1 (en) Search data processor
US20140344306A1 (en) Information service that gathers information from multiple information sources, processes the information, and distributes the information to multiple users and user communities through an information-service interface
US9275145B2 (en) Electronic document retrieval system with links to external documents
EP1678639A2 (fr) Systemes et procedes de traitement de recherche utilisant des super unites
WO2001016807A1 (fr) Systeme de recherche dans l'internet servant au suivi et au classement d'enregistrements choisis, a partir d'une recherche precedente
US7013300B1 (en) Locating, filtering, matching macro-context from indexed database for searching context where micro-context relevant to textual input by user
WO1997038378A1 (fr) Procede d'organisation d'informations extraites du reseau internet a l'aide d'un systeme de representation expert
KR100445943B1 (ko) 근접 검색식을 이용한 정보 검색 시스템 및 방법
Barifah et al. Exploring usage patterns of a large-scale digital library
Liu et al. Visualizing document classification: A search aid for the digital library
Fafalios et al. Exploratory professional search through semantic post-analysis of search results
Bamboat et al. Web content mining techniques for structured data: A review
Deogun et al. Structural abstractions of hypertext documents for web-based retrieval
Abdulmunim et al. Links Evaluation and Ranking Based on Semantic Metadata Analysis
DeKoven et al. The magazine archive includes every article published in Communications of the ACM for over the past 50 years.
KR20020049694A (ko) 컨셉 랭킹 기법을 이용한 문서 색인 시스템 및 방법
Naing et al. Core: A Search and Browsing Tool for Semantic Instances of Web Sites
Pardakhe et al. Enhancement of web search engine results using keyword frequency based ranking

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): CA JP MX

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
ENP Entry into the national phase

Ref document number: 2251043

Country of ref document: CA

Ref country code: CA

Ref document number: 2251043

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: PA/a/1998/008385

Country of ref document: MX

WWE Wipo information: entry into national phase

Ref document number: 1997921115

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1997921115

Country of ref document: EP

WWW Wipo information: withdrawn in national office

Ref document number: 1997921115

Country of ref document: EP