WO2002027544A1 - Information access - Google Patents
Information access Download PDFInfo
- Publication number
- WO2002027544A1 WO2002027544A1 PCT/GB2001/004248 GB0104248W WO0227544A1 WO 2002027544 A1 WO2002027544 A1 WO 2002027544A1 GB 0104248 W GB0104248 W GB 0104248W WO 0227544 A1 WO0227544 A1 WO 0227544A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- tag
- information
- list
- tags
- user
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
- G06F16/83—Querying
- G06F16/835—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
- G06F16/83—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
Definitions
- This invention relates to information access and finds particular application in locating information contained in documents that have been annotated using a structured markup language.
- search engines of various types have been implemented in software to assist with identifying data sets that contain information of at least some relevance to a user's search criteria.
- search engines are often able to make use of already constructed indexes to particular fields or domains of information, or to exploit summary or keyword data stored within data sets themselves.
- HTML Hypertext Markup Language
- Search engines may search for a ⁇ META> tag within a web page and compare any associated keywords with a user's search criteria to determine whether or not the information in the page is likely to be relevant.
- XML extensible Markup Language
- HTML markup language a mark-up language
- Exploitation of meta-information provided by XML tagging has the potential to dramatically reduce the number of irrelevant hits returned compared with current HTML-based search engines.
- HTML tags are, but for a small core of standard tags, entirely user-definable. To some extent, the usefulness of XML tagging is therefore subject to the skills of a document author.
- XML does allow user communities, from industry groups to single users, to develop an individual mark-up language that best suits their needs.
- OASIS Organisation for the Advancement of Structured Information Standards
- a known XML search engine such as "GoXML" provides a largely conventional keyword-based search facility to locate relevant information in conventional web pages as well as XML tagged documents.
- GoXML compiles and presents a flat list of the tags that mark up document parts within which search keywords were found, together with a conventional list of references to those documents. The user can then explore this list of "hit” tags by selecting a particular tag, causing the document list to be reduced to only those documents where a search keyword was found to occur in a part marked up by the selected tag.
- GoXML does not carry out further analysis of "hit" tags to enable a user to fully exploit the potential contextual information provided by those tags and to navigate the search results more effectively.
- a method of accessing sets of information stored in an information system comprising the steps of:
- step (iv) receiving a selection signal specifying a tag structure from the list output at step (iii); (v) adjusting said list of references from step (ii) to comprise references only to said identified sets of information that contain the tag structure selected at step (iv);
- step (vii) repeating step (iv) in respect of said adjusted list of tag structures, and step (v) to identify a more specific list of references to sets of information.
- apparatus and methods are provided to enable a user to locate and retrieve sets of information relevant to search criteria specified in a search query submitted by the user.
- apparatus and methods are designed to enable the user to exploit contextual information provided within documents that have been annotated using tags defined according to a structured markup language such as XML.
- a structured markup language such as XML.
- embodiments of the present invention enable the user to use XML or other markup language tags, inserted into a document by the author, to help identify those documents from a potentially large set of search results that are most relevant to the original search query or, more particularly, to what the user hoped to find.
- Embodiments of the present invention are largely concerned with analysis of search results, enabling a user to exploit contextual information provided by markup language annotations in documents identified in the search.
- Largely conventional search engines and search techniques may be used to obtain a set of search results on the basis of a user's search query.
- the otherwise conventional search engine or other information retrieval tool must be arranged to not only to locate portions of documents matching a user's search query, but also to identify and return annotating tags associated with those matching portions, according to the particular markup language used.
- the structure of annotating tags used in a particular structured markup language must be identified and returned in the search results, preserving that tag structure for analysis by novel and inventive features of the present invention, to be described in detail below.
- the method of said first aspect includes the steps of: (viii) detecting, following receipt of the selection signal at step (iv), a request for access to a corresponding set of information listed in step (v);
- the method provides a further enhancement to the tag analysis process by monitoring, over a period of time, the selection of tags by users from each presented tag list and monitoring any subsequent access by a user of particular documents listed in the resultant reduced document lists.
- the apparatus records a history of tag selection by users in general, or by a particular user or group of users, and their subsequent document retrieval activity in respect of each distinct tag and/or tag structure. This historical data is then used to weight each of the distinct tags and tag structures according to likelihood that they resulted in a selection of documents relevant to those users.
- the apparatus is then able to present a given tag list in a ranking order of decreasing usefulness for example, when particular tags known from the historical records appear in a set of search results.
- Figure 1 is a diagram showing features of an information searching apparatus according to a preferred embodiment of the present invention.
- Figure 2 is a flow chart showing steps in operation of an information searching apparatus according to a first embodiment of the present invention
- Figure 3 is a flow diagram showing steps in operation of a context analysis module according to a first embodiment of the present invention
- Embodiments of the present invention are largely concerned with analysis of search results, enabling a user to exploit contextual information provided by markup language annotations in documents identified in the search.
- Largely conventional search engines and search techniques may be used to obtain a set of search results on the basis of a user's search query.
- the otherwise conventional search engine or other information retrieval tool must be arranged to not only to locate portions of documents matching a user's search query, but also to identify and return annotating tags associated with those matching portions, according to the particular markup language used.
- the structure of annotating tags used in a particular structured markup language must be identified and returned in the search results, preserving that tag structure for analysis by novel and inventive features of the present invention, to be described in detail below.
- Search results comprise a list of references to documents found by the search engine to have portions matching the search query, for example a list of document URLs if those documents are stored on web servers and accessible over the Internet, together with the respective tag structures associated with each of the matching portions.
- the search results are presented to the user as a list of the identified tags and tag structures together with a list of the identified document references. For example, in a hierarchical tag structure such as that used with XML, the full structure of tags surrounding a matching portion of text will be presented in the tag list, with, optionally, a list of the lowest level tags.
- a user is provided with apparatus having a user interface and facilities to enable the user to navigate through the returned set of search results making use of information provided by returned tags.
- a user may select one or more particular tags or tag structures from the tag list presented at the user interface and, in response to that selection, the apparatus will present at the user interface, from the set of document references, a list of only those documents containing the selected tags or tag structures associated with matching text.
- a user may have selected a particular tag because the words used in those tags were suggestive of a context relevant to the type of information the user was seeking. Now that the document list has been reduced the apparatus then adjusts the displayed tag list to include not only the selected tag or tags, but also any other tags and tag structures associated with matching text from the documents in the reduced list.
- Identification of such additional tags may be highly relevant to the user because they may be suggestive of other contexts that might reveal relevant information, especially as those tags occurred in the same documents as the original tag selection.
- This so called "double filtering" technique may be extended by the user by making a further selection from the adjusted tag list and further restricting or otherwise altering the list of documents being investigated.
- the apparatus provides an enhancement to the tag analysis process by using a thesaurus to identify different tags within the list that may have a similar meaning, or by using clustering techniques to identify tags that may relate to similar contexts. Such related tags may then be grouped together in the tag list presented at the user interface to enable a user to see that such tags may be so related and to providing the opportunity for the user to select the group of related tags rather than individual tags or tag structures in the "double filtering" navigation process outlined above.
- the apparatus provides a further enhancement to the tag analysis process by monitoring, over a period of time, the selection of tags by users from each presented tag list and monitoring any subsequent access by a user of particular documents listed in the resultant reduced document lists.
- the apparatus records a history of tag selection by users in general, or by a particular user or group of users, and their subsequent document retrieval activity in respect of each distinct tag and/or tag structure. This historical data is then used to weight each of the distinct tags and tag structures according to likelihood that they resulted in a selection of documents relevant to those users.
- the apparatus is then able to present a given tag list in a ranking order of decreasing usefulness for example, when particular tags known from the historical records appear in a set of search results.
- the apparatus is arranged, on the basis of previous document access by users, to establish a profile of the typical information content of portions of documents associated with each distinct tag and tag structure.
- Known document summarisers and key term extractors may be used to extract such profile information each time a document is accessed by a user.
- Typical information content of a given tag may then be made available to users as required. This helps to overcome problems in exploiting tags when lack of standardisation has resulted in different document authors using different tags or obscure choices of tag to represent the same or a similar context in particular fields of information.
- an information retrieval apparatus 100 is shown according to preferred embodiments of the present invention, for use in searching for relevant information stored in file servers 105, web servers for example, and accessible over a communications network 1 10 such as the Internet.
- the information searching apparatus is arranged to receive search queries supplied by users from terminal equipment 1 15, typically submitted using a conventional browser product installed on a user's terminal equipment 1 1 5, a web browser for example, and transmitted over the communications network 105 by means of a router 120.
- the information searching apparatus 100 includes a user interface 1 25 for receiving search queries from users (1 1 5) and for returning search results to their terminal equipment 1 1 5, a search engine 130 and a context analysis module 135.
- Context analysis module 135 is arranged in particular to analyse and to present, via the user interface 125, XML tag information enclosing portions of documents that were found by the search engine 130 to match the search query, in a way that enables users to exploit the contextual information provided by those tags.
- processing begins at STEP 200 with receipt of a search query via the user interface 125.
- the search query specifies search criteria, such as a set of keywords or phrases, to be used in identifying potentially relevant sets of information.
- search criteria such as a set of keywords or phrases
- the received search criteria are passed to the search engine 1 30 and the search engine 130 is activated to begin searching for relevant documents stored in file servers 105.
- Search engine 130 may be any one of a number of different types of known search engine arranged to use the supplied search criteria in any appropriate way to identify relevant information.
- a potentially relevant document is located by search engine 130, at STEP 210, then at STEP 21 5 a reference to the located document, for example a URL if the document is a web page located on a web server, is added to search results being compiled by the search engine 130.
- the located document is an XML document
- the located document is analysed to identify a full hierarchy of XML tags enclosing a portion of the located document containing relevant information.
- search engine 130 may be adapted to carry out basic XML tag identification once it has established that the located document is an XML document.
- the context analysis module 135 may identify XML tags by direct access to a document identified by the search engine 130. Any identified XML tags are added to the search results at STEP 230, preserving the tag hierarchy. Processing then moves to STEP 235 to determine whether all accessible documents have been searched.
- processing proceeds directly to STEP 235 to determine whether all accessible documents have been searched.
- the context analysis module 135 may be arranged to provide a number of particularly useful functions, exploiting any contextual information provided by XML tags, to assist a user in navigating and selecting from a set of search results. Such functions are of particular use when search results contain a great many "hits" in response to a particular search query. Further embodiments of the present invention, to be described below, relate to the different levels of functionality that may be provided by the context analysis module 135.
- context analysis module 1 35 provides a basic tag listing and grouping facility, accessible to users via the user interface 1 25, preserving and displaying a hierarchy of tags where more than one level of tagging was detected in a particular document. This enables search results to be grouped and selected by users for further examination according to tag group, the assumption being that tags of the same or a similar name are indicative of a similar information context.
- This basic tag listing and grouping function of context analysis module 135 will now be described with reference to Figure 3.
- context analysis begins at STEP 300 with receipt from the search engine 130 of a set of search results.
- all the XML tags identified in the search results are selected and an ordered list of XML tags is generated, preserving the hierarchical structure of tags where there is more than one level enclosing a relevant section of a document.
- a count is made of the number of document references from the search results in which the same tag or tag hierarchy was identified.
- the ordered tag list and associated document count is presented to the originating user via the user interface 1 25.
- a list of all the identified document references is also presented via the user interface 1 25 in a conventional format, for example including a document address or other reference together with, if enabled, a precis of the relevant section of each document.
- the context analysis module 135 is arranged to accept, via the user interface 1 25, user selection of any tag or group of tags from the displayed tag list, or selection of an option to exit. If the user does not want to exit, at STEP 330, then at STEP 335 the list of document references is adjusted to show references for only those documents in which one or more of the selected tags were found by the search engine 130. So, for example, if there were found at STEP 310 to be 17 documents in which a relevant portion was contained within an XML tag ⁇ PRODUCT TYPE> , then if ⁇ PRODUCT TYPE> were selected from the tag list at STEP 325, the user would then see at STEP 335 those 1 7 document references listed via the user interface.
- the tag list itself is then adjusted at STEP 340 to display only those tags identified in those documents referenced in the adjusted document list.
- This adjustment to the tag list may bring in extra tags that were not selected at STEP 325 because in one or more of the documents containing the selected tag(s), the search engine 130 may have identified more than one potentially relevant portion, each portion being enclosed by different tags.
- This additional tag information can be very useful when navigating through the search results because the adjusted tag list is more likely to contain tags related in context (at least from the point of view of the user submitting the original search query) given that they occurred within documents located using the same search query.
- the user may select one or more of those additional tags and hence, at STEP 335, identify and view any further documents containing potentially relevant portions in the context of those additional tags.
- a user may use the tag list to "drill-down" to those documents most likely to contain relevant information by navigating through tags that appear to suggest the most relevant context.
- Adjustment of the listed document references and, in response, to the listed tags to correspond to the listed documents provides a double filtering mechanism that is particularly effective in helping a user to navigate through search results and select a potentially relevant subset of documents for further investigation, making full use of contextual information provided by XML tags.
- a user may expand analysis of the search results by restoring the full list of displayed tags and selecting another starting point.
- Further basic sorting facilities may be provided by the context analysis module 1 35 according to the first embodiment.
- a so called "stop list” may be used by the context analysis module 1 35 to eliminate particularly basic XML tags from consideration and display in a tag list.
- tags might include ⁇ CHAPTER> , ⁇ SECTION > , ⁇ PARAGRAPH S ⁇ WORDS > and other such tags that provide only structural information about the layout of a document and little about the informational context of a portion identified by the search engine 130.
- tags such as ⁇ SUMMARY > or ⁇ PRECIS > provide useful information about the context, within the document, of a matching word or phrase, suggesting that the matching word or phrase is more likely to be indicative of the primary information content of the document as a whole.
- stop lists are used in relation to a particular information search is preferably an option selectable by a user via the user interface 125.
- XML tags in particular are simply words. Aside from those that are standardised for XML itself, different words may be used in different XML implementations to mean largely the same thing.
- One author might tag part of a document as ⁇ summary > while another might tag the same part of another document as ⁇ precis > ; or a section of one document might be about software agents and tagged ⁇ agents > while in another document the same tag is used to tag a section about estate agents.
- the context analysis module 1 35 is provided with access to a thesaurus for use in identifying synonyms and helping to disambiguate tags.
- a general purpose thesaurus may be used, for example one such as WordNet, as disclosed in "WordNet: An Electronic Lexical Database", edited by Christiane Fellbaum, MIT Press, May 1998., or, for more specialised information searches, a ready-made domain-specific thesaurus may be accessed, or even created using a clustering technique - see below.
- the context analysis module 135 may present tags in a list along with identified synonyms from the thesaurus to help clarify the context of the tag.
- tags found to be related in meaning following reference to the thesaurus, may be grouped together in the presented tag list to enable a user to select the whole group when narrowing down the list of documents to be investigated.
- clustering techniques such as those disclosed in “Clustering Algorithms”, Rasmussen, E., in “Information Retrieval: Data Structures and Algorithms”, edited by Frakes, W. & Baeza-Yates, R., Prentice-Hall, New Jersey, USA, 1992, may be used to identify tags having a similar meaning or used in a similar context in the returned search results.
- a numerical value representative of a measure of the contextual 'similarity' of a pair of tags Ti and Tj returned in the search results may be calculated as:
- [Ti] and [Tj] are the number of documents in the search results in which tags Ti and Tj respectively were identified in relation to relevant information
- Ti n Tj is the number of documents in which Ti and Tj co-occur. This measure of similarity takes a value between 0 and 1 , with 0 meaning that the tags share no similarity of context (no documents contain both the tags) and 1 meaning that all documents in the search results contain both the tags and hence that the two tags are likely to have been used in the same information context.
- a matrix of values for the above measure of context similarity is calculated for the tags and tag structures returned in a given set of search results. This matrix may then be used to identify groups of tags that may be related in context, for example by identifying a set of tags for which each combination of two tags selected from the set has a value of the similarity measure exceeding a predetermined threshold. The most similar tags may then be presented in groups for selection by a user in the tag list.
- Weightings may represent the probability that a given tag or tag structure will result in a selection of documents from the search results that contains documents of relevance to the particular user or group of users.
- the apparatus of the third embodiment is provided with an information access monitor for monitoring selection of tags and access to referenced documents by users.
- the information retrieval monitor is arranged with access to the user interface 1 25 to monitor all tag selections by users and any requests by users to access documents included in corresponding lists.
- the monitor also includes a store for recording history of selection for each distinct tag and tag structure and for recording weightings calculated in respect of each tag.
- the monitor checks for an entry in the store for that particular tag. If not, then an entry is created for the tag. If necessary, certain "low value" words may be removed from the stored tag, or words may be stemmed to render them into a more standardised form. For each tag, a counter is maintained both for the number of times that selection of the tag was selected and for the number of times that selection of t he tag was followed by an access request by the user for a document listed in the resultant reduced document list (see STEPs 325 and 335 of Figure 3). These counters may then be used to calculate, for each tag, a weighting representing a measure of the probability that selection of the tag results in a list containing relevant documents for that user.
- the monitor may be further enhanced to monitor the duration of a document access by a user, providing further information on the relevance of the accesses document to the user. Longer duration access to documents may trigger a double increment, for example, of the second of the two counters mentioned above.
- Operation of the information retrieval monitor described above may be triggered each time a new set of results is returned in response to a search query and the initial tag list is presented at the user interface 1 25. Weightings may be recalculated each time a user accesses a document so that they are immediately available for use in ranking each presented tag list.
- a user profile of keywords or terms may be stored in respect of each user of the apparatus. Such a profile may be used to represent the interests of a user and particularly contextual information of relevance to that user's interests.
- a known measure of relevance may be calculated for each tag in a tag list with respect to the words and terms in the user profile. The measure of relevance may be used to rank the tags in the list in order of relevance to the user profile as a further assistance to a user in selecting tags most likely to result in an efficient navigation of a set of search results leading to a list of the most relevant documents from the search.
- known document summarisers and key term extractors may be used to accumulate a profile of information content typically associated with each of a set of distinct tags, for example the tags stored by the information retrieval monitor of the third embodiment described above.
- key terms indicative of the information content of the matching portion of that document may be extracted and stored in association with the particular tag selection that preceded access of that document. Such terms may be further summarised to build up a profile of a tag for presentation to users as required. This feature provides further assistance to users in understanding the intended meaning of a tag, particularly in the absence of standardised use of tags.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2001290086A AU2001290086A1 (en) | 2000-09-29 | 2001-09-24 | Information access |
EP01969965A EP1323075A1 (en) | 2000-09-29 | 2001-09-24 | Information access |
US10/380,068 US7340459B2 (en) | 2000-09-29 | 2001-09-24 | Information access |
CA002422694A CA2422694A1 (en) | 2000-09-29 | 2001-09-24 | Information access |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB0023938.4 | 2000-09-29 | ||
GBGB0023938.4A GB0023938D0 (en) | 2000-09-29 | 2000-09-29 | Information access |
EP00311274.5 | 2000-12-15 | ||
EP00311274 | 2000-12-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2002027544A1 true WO2002027544A1 (en) | 2002-04-04 |
Family
ID=26073382
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/GB2001/004248 WO2002027544A1 (en) | 2000-09-29 | 2001-09-24 | Information access |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP1323075A1 (en) |
AU (1) | AU2001290086A1 (en) |
CA (1) | CA2422694A1 (en) |
WO (1) | WO2002027544A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2005052810A1 (en) * | 2003-11-28 | 2005-06-09 | Canon Kabushiki Kaisha | Method of constructing preferred views of hierarchical data |
US7401063B2 (en) | 2002-12-16 | 2008-07-15 | General Electric Company | Process for semi-automatic maintenance of a knowledge base using tagged examples |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH08255155A (en) * | 1995-03-16 | 1996-10-01 | Fuji Xerox Co Ltd | Device and method for full-text registered word retrieval |
-
2001
- 2001-09-24 CA CA002422694A patent/CA2422694A1/en not_active Abandoned
- 2001-09-24 WO PCT/GB2001/004248 patent/WO2002027544A1/en active Application Filing
- 2001-09-24 EP EP01969965A patent/EP1323075A1/en not_active Ceased
- 2001-09-24 AU AU2001290086A patent/AU2001290086A1/en not_active Abandoned
Non-Patent Citations (8)
Title |
---|
DANIEL EGNOR & ROBERT LORD: "XYZFind - Searching in Context with XML", INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 24 July 2000 (2000-07-24) - 28 July 2000 (2000-07-28), Athens, GR, pages 1 - 9, XP002167790 * |
JOHN DAVIES: "QUIZXML: An XML Search Engine", INFORMER: BRITISH COMPUTER SOCIETY INFORMATION RETRIEVAL SPECIALIST GROUP, ISSN 0950-4974, vol. 10, no. Winter 2000, 18 December 2000 (2000-12-18), UK, pages 2, XP002167787, Retrieved from the Internet <URL:http://www.irsg.eu.org/Informer/> [retrieved on 20010518] * |
KEVIN D. MUNROE ET AL.: "Blended Browsing and Querying of XML in a Lazy Mediator System", EDBT 2000 - INTERNATIONAL CONFERENCE ON EXTENDING DATABASE TECHNOLOGY, 27 March 2000 (2000-03-27) - 31 March 2001 (2001-03-31), DE, pages 1 - 2, XP002167788, Retrieved from the Internet <URL:http://www.db.ucsd.edu/publications/edbt00-bbq-demo.pdf> [retrieved on 20010516] * |
MUNROE K D ET AL: "BBQ: a visual interface for integrated browsing and querying of XML", ADVANCES IN VISUAL INFORMATION MANAGEMENT. VISUAL DATABASE SYSTEMS. IFIP TC2 WG2.6 FIFTH WORKING CONFERENCE ON VISUAL DATABASE SYSTEMS, PROCEEDINGS OF 5TH IFIP WG2.6 WORKING CONFERENCE ON VISUAL DATABASE SYSTEMS, FUKUOKA, JAPAN, 10-12 MAY 2000, 2000, Norwell, MA, USA, Kluwer Academic Publishers, USA, pages 1 - 15, XP002167785, ISBN: 0-7923-7835-0 * |
See also references of EP1323075A1 * |
WOLFF J E ET AL: "Searching and browsing collections of structural information", PROCEEDINGS IEEE ADVANCES IN DIGITAL LIBRARIES 2000, PROCEEDINGS IEEE ADVANCES IN DIGITAL LIBRARIES 2000, WASHINGTON, DC, USA, 22-24 MAY 2000, 2000, Los Alamitos, CA, USA, IEEE Comput. Soc, USA, pages 141 - 150, XP002167791, ISBN: 0-7695-0659-3 * |
XML GLOBAL TECHNOLOGIES: "GoXML version 2.0", XML GLOBAL TECHNOLOGIES WHITE PAPER, 5 September 2000 (2000-09-05), USA, pages 1 - 8, XP002167789, Retrieved from the Internet <URL:http://www.goxml.com/dev/whitepaper/GoxmlWhitepaper.pdf> [retrieved on 20010517] * |
XYZFIND: "New Technology for Search Over Heterogeneous Structured Data", XYZFIND CORPORATION WHITE PAPER, 25 April 2000 (2000-04-25), USA, pages 1 - 24, XP002167786 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7401063B2 (en) | 2002-12-16 | 2008-07-15 | General Electric Company | Process for semi-automatic maintenance of a knowledge base using tagged examples |
WO2005052810A1 (en) * | 2003-11-28 | 2005-06-09 | Canon Kabushiki Kaisha | Method of constructing preferred views of hierarchical data |
JP2007519086A (en) * | 2003-11-28 | 2007-07-12 | キヤノン株式会社 | Method for building a preferred view of hierarchical data |
US7664727B2 (en) | 2003-11-28 | 2010-02-16 | Canon Kabushiki Kaisha | Method of constructing preferred views of hierarchical data |
AU2004292680B2 (en) * | 2003-11-28 | 2010-04-22 | Canon Kabushiki Kaisha | Method of constructing preferred views of hierarchical data |
JP4637113B2 (en) * | 2003-11-28 | 2011-02-23 | キヤノン株式会社 | Method for building a preferred view of hierarchical data |
Also Published As
Publication number | Publication date |
---|---|
AU2001290086A1 (en) | 2002-04-08 |
CA2422694A1 (en) | 2002-04-04 |
EP1323075A1 (en) | 2003-07-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7340459B2 (en) | Information access | |
US11369792B2 (en) | Personalized network searching | |
US6122647A (en) | Dynamic generation of contextual links in hypertext documents | |
US6519586B2 (en) | Method and apparatus for automatic construction of faceted terminological feedback for document retrieval | |
US10540406B2 (en) | Search system and method with text function tagging | |
US6256623B1 (en) | Network search access construct for accessing web-based search services | |
US8856145B2 (en) | System and method for determining concepts in a content item using context | |
US6516312B1 (en) | System and method for dynamically associating keywords with domain-specific search engine queries | |
JP4857075B2 (en) | Method and computer program for efficiently retrieving dates in a collection of web documents | |
US20070185860A1 (en) | System for searching | |
US20050091203A1 (en) | Method and apparatus for improving the readability of an automatically machine-generated summary | |
US20080294619A1 (en) | System and method for automatic generation of search suggestions based on recent operator behavior | |
US20070022085A1 (en) | Techniques for unsupervised web content discovery and automated query generation for crawling the hidden web | |
US20080097958A1 (en) | Method and Apparatus for Retrieving and Indexing Hidden Pages | |
US20120179667A1 (en) | Searching through content which is accessible through web-based forms | |
US20030120654A1 (en) | Metadata search results ranking system | |
US20040030686A1 (en) | Method and system of searching a database of records | |
US6317740B1 (en) | Method and apparatus for assigning keywords to media objects | |
US20080313178A1 (en) | Determining searchable criteria of network resources based on commonality of content | |
US20050114317A1 (en) | Ordering of web search results | |
WO2001024046A2 (en) | Authoring, altering, indexing, storing and retrieving electronic documents embedded with contextual markup | |
WO1997049048A1 (en) | Hypertext document retrieval system and method | |
EP3159812A1 (en) | System and method for providing definitions | |
Mukherjea et al. | Automatically determining semantics for world wide web multimedia information retrieval | |
WO2002027544A1 (en) | Information access |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2001969965 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10380068 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2422694 Country of ref document: CA |
|
WWP | Wipo information: published in national office |
Ref document number: 2001969965 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
NENP | Non-entry into the national phase |
Ref country code: JP |