EP1282864A2 - System und verfahren um informationen von digitalen quellen zu erfassen und zu verwalten - Google Patents
System und verfahren um informationen von digitalen quellen zu erfassen und zu verwaltenInfo
- Publication number
- EP1282864A2 EP1282864A2 EP00946945A EP00946945A EP1282864A2 EP 1282864 A2 EP1282864 A2 EP 1282864A2 EP 00946945 A EP00946945 A EP 00946945A EP 00946945 A EP00946945 A EP 00946945A EP 1282864 A2 EP1282864 A2 EP 1282864A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- item
- context
- user
- file
- meta
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims abstract description 37
- 238000004891 communication Methods 0.000 claims description 5
- 230000008685 targeting Effects 0.000 claims description 2
- 238000001914 filtration Methods 0.000 claims 1
- 239000012536 storage buffer Substances 0.000 claims 1
- 230000001131 transforming effect Effects 0.000 claims 1
- 239000000872 buffer Substances 0.000 description 12
- 238000010586 diagram Methods 0.000 description 10
- 239000003795 chemical substances by application Substances 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 241000283080 Proboscidea <mammal> Species 0.000 description 5
- 230000002776 aggregation Effects 0.000 description 5
- 238000004220 aggregation Methods 0.000 description 5
- 241000272470 Circus Species 0.000 description 4
- 241000406668 Loxodonta cyclotis Species 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 238000007792 addition Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000009471 action Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 241001422033 Thestylus Species 0.000 description 1
- 230000004931 aggregating effect Effects 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9562—Bookmark management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
- G06F40/35—Discourse or dialogue representation
Definitions
- One basic class of capture techniques is the Standalone capture techniques. For example, if a user desires to save a paragraph on a Web page, he may print the contents of a Web page, save the Web page objects as HTML or text files, copy and paste the Web page contents to a "carrier" document using products such as Microsoft Word or PowerPoint, or simply write down Web page contents of interest on paper.
- Another capture technique uses
- Browser-based page capture tools which include the Favorites or Bookmark tools available in most browsers, which capture and manage links to interesting sites visited, and offline browsers (also known as “page grabbers”), which capture and manage the entire content of
- carrier documents to capture Web objects is extremely labor and system resource-intensive, often results in subtle or even radical alteration in the appearance of the captured object, and, unless extraordinary means such as storing objects one-per-document under a document management system are employed, the use of carrier documents provides only limited and coarse-grained management and retrieval capabilities.
- Browser-based page capture tools also have serious limitations. Capturing links to Web sites and Web pages or capturing the entire content of these sites and pages is, by definition, a coarse-grained action so there will be extraneous information accessed and the user will have no control over the way the captured content is organized. Page capture tools (whether link- or content-based) typically utilize only a name string, URL, and structure information for classification and management. Additional classification tools such as keywords, and notes are generally either unavailable or effectively unusable since coarse- grained, statically aggregated pages containing multiple items that typically cannot be described in specific, meaningful terms.
- high-traffic Web sites such as search portals both for gathering promotions targeting information about customers and for actually delivering the advertising.
- These high-traffic sites typically obtain information about customer interests and buying patterns through two primary techniques.
- sites implicitly gather information through tracking "click trails" to determined what sites a user visits, and what links a user follows.
- sites explicitly gather information through promotions, contests, and site personalization features in which customers voluntarily provide more information about themselves in exchange for additional services, greater online convenience, or a chance to win prizes.
- a major limitation has been the inability of current Internet information gathering techniques to provide specific actionable information about consumers.
- portal site “personalization” has its limits, since users must "opt in” and set up personalized pages, yielding limited amounts of user data.
- the ability to provide a context some underlying, surrounding, or background information is highly desirable in assisting search engines in finding the desired information. For example, a user may be seeking information on the treatment of elephants in Africa, but a query will contain numerous results on the treatment of elephants in circuses.
- the present invention provides a system and method for capturing and managing information from digital sources.
- the present invention allows a user to capture and manage granular information from a digital source, such as a Web site, e-mail or electronic document.
- a digital source such as a Web site, e-mail or electronic document.
- a user highlights the desired information on the Web site and activates a capture control on a user interface such as a browser indicating that the highlighted information is to be captured.
- the captured information hereafter also referred to as an item, is stored in a uniform file format such as enriched HTML/XML which advantageously permits fine-grained access to and retrieval of the captured information.
- Each item is stored in a item access system.
- a item access system is a file system in which items are stored in files.
- the item is also analyzed and organized by assigning meta-data to the captured information.
- meta-data include the captured information's name, category, keywords, source, source description, date and time of capture.
- the user may select the meta-data for a captured item suggested by the present invention or alternatively, the user may enter his or her own meta-data.
- the present invention also manages the items of granular information using a Context
- the Context Database comprising the items, any associated files and the meta-data associated with the items. Once an item of information is captured and stored in a file, its meta-data is integrated within a database, such as an SQL database, to aid in the classification, annotation, structuring, and management of the captured information.
- the Context Database allows a user to dynamically aggregate the captured information. In other words, the user may combine or reorganize different captured information items.
- a Context Database is a WebBase in which the items are stored in a markup language such as HTML / XML so that items are easily managed over the Internet.
- the WebBase embodiment of the present invention brings together the power of a database and the ubiquity of Web pages. This melding of Web pages and a SQL database enables many peripheral features, like sharing and importing/exporting with little additional work.
- the present invention uses the Context Database to deliver targeted promotions for merchandise, services, and content, and to assist the user in finding additional information similar to the information that has already been captured.
- the present invention analyzes the meta-data associated with each captured item and the relationships among the items and the relationships between items and their meta-data stored in the Context Database to create a Summary of the particular user's interest at any given moment.
- the resulting Summary provides a synopsis of the user's interest in a format that can be analyzed by a computer and can then be used to provide targeted and timely promotional links to specific merchandise, services, and content when the user retrieves an item from the WebBase.
- the present invention uses the meta-data associated with the captured information in the Context Database to improve searches performed on digital information systems such as the Internet.
- the present invention analyzes a Summary of the user's interests relevant to the search to determine which words to use in the search and how to use those words in the query.
- the meta-data associated with captured information is used to create a powerful search query which adds information from the user's Summary of interests which can then be passed along to existing search portals.
- the present invention advantageously allows a user to capture and manage granular information from digital sources.
- Figure 1 shows one embodiment of a system for capturing and managing granular information from a digital source in accordance with the present invention.
- Figure 2A shows an embodiment of a user interface employed in a system for capturing granular information in accordance with the present invention.
- Figure 2B shows an embodiment of aggregated granular items of a collection displayed in a browser display.
- Figure 2C shows an embodiment of a pop-up window through which a user may indicate whether the selected item is to be saved in its granular form by selecting "Web
- Figure 2D shows an embodiment of a user interface, a pop-up window, in which metadata is displayed to the user and which accepts the additions or changes to the meta-data from the user.
- Figure 2E shows an embodiment of user interface as a pop-up window which is displayed to the user to allow annotation of an item.
- Figure 2F shows an embodiment of user interface as a pop-up window which is displayed to a user for changing the appearance of the item.
- Figure 3 A illustrates an embodiment of a computer-implemented system for capturing and managing items of both a coarse-grained and granular information from digital sources as embodied within the environment of a browser.
- Figure 3B illustrates an embodiment of a Context Database as a WebBase comprising files, including a file for each item captured, stored in a hierarchy of collections, and meta-data stored as SQL database entries according to table definitions.
- Figure 3C shows an embodiment of a user interface, a pop-up window, for allowing a user to create a Filter of his or her Context Database.
- Figure 3D shows an embodiment of a user interface, a pop-up window in which the filters that apply to a collection are displayed.
- Figure 4 is a flow diagram of an embodiment of an overall method of formatting an item to a uniform file format.
- Figures 5A and 5B show an overall flow diagram of one embodiment of a process of generating meta-data for an item.
- Figure 6A illustrates an embodiment of an item access system as a file system hierarchy for an item and the objects it references.
- Figure 6B illustrates an example of how 3 items with references to external files are stored in the file system embodiment of Figure 6A.
- Figure 7A is a flow diagram of an embodiment of a method for creating a Context
- Figure 7B illustrates an example of applying the method in Figure 7A.
- Figure 8A is a flow diagram of an embodiment of a method of selecting a targeted advertisement using the Context Summary in accordance with the present invention.
- Figure 8B is a flow diagram of an embodiment of a method of enhanced searching.
- Figure 9A illustrates a user's selection of an item in an embodiment of a user interface.
- Figure 9B illustrates an example of an enhanced search query and suggested keywords produced from the method of Figure 8B.
- Figure 9C shows an embodiment of a web page including a targeted ad selected from the method of Figure 8 A.
- Figure 1 shows one embodiment of a system for capturing and managing granular information from a digital source in accordance with the present invention
- a client computer 118' having an operating system 164', a memory area comprising a storage area for a user's Context Database 140', display 108' in which a user interface 160' such as a browser display, input devices keyboard 1 14 and mouse 1 16, a server 128 in communication 142' over a network 159' such as the Internet with the client computer 118', the server hosting a website 130 with which the client computer 118' communicates and a storage area 138 for providing an alternate storage location for a plurality of Context Databases 140', digital sources such as a website 144 and an e-mail or electronic document 148 are also in communication 150, 154 with the client computer 118'.
- a user interface 160' such as a browser display
- input devices keyboard 1 14 and mouse 1 16 input devices keyboard 1 14 and mouse 1
- FIG. 1 Another embodiment of a client computer is also shown in Figure 1 as a personal digital assistant 1 18' having a display 108' and a stylus 126 as a user input device, and an operating system 164'.
- the personal digital assistant (PDA) is also in communication 158 over a network 159' with server 128.
- Figure 3 A shows that the operating system has a clipboard structure 168 and a drag and drop infrastructure 170.
- the PDA embodiment 118' also has a user interface, for example a browser display 160', for displaying information to the user.
- the browser display 160' has a capture control, shown here embodied as a toolbar with three click buttons 120', 122' and 124' and an item access window 162 for displaying items in the user's Context Database.
- highlight click button 120' activates a highlighting feature for selecting information of interest.
- One click capture button 122' is activated by the user to indicate that the information is to be captured.
- a third click button 124' is shown in these examples which will cause the user's granular items to be displayed to the user if they are not visible.
- each display indicates a user's selection 110 of a granular item of information 112' that is selected responsive to a user's operation of an input device, such as the mouse 1 16 or the stylus 126 in accordance with the operating system's drop and drag infrastructure.
- the keyboard 1 14 may also be used as well as any other input devices for selecting the desired granular item of information.
- the user may also click on the highlight button 124', run a cursor over the granular information to be selected using an input device. The user then clicks on the one click capture button 122' to indicate to capture the selected item .
- FIG. 2A illustrates an embodiment of a browser display 200 generated by one version of the invention.
- the browser display 200 has a toolbar 202, a browser window 204, a advertising panel 206, a browser plug-in window 222 comprising a search word entry bar 208, a collection map 212 for displaying the user's items in the structure of the user's Context Database, which is a WebBase in this embodiment, further comprising click buttons including file menu button 214, one click capture button 216, send button 218, Web search button 220, Context Database search button 227, and a tools button 225, and an item list 210.
- a collection entitled PTO 226 comprises a number of items including two items of granular information, each stored in a file identified by the names of "Intellectual Property Symposium of the Americas" 228 and "Some Weil-Known U.S. Trademarks" 230.
- a browser window 204 is displaying the contents 215 of the selected item "Intellectual Property Symposium of the Americas"228.
- a advertising panel 206 is also included in the displayed webpage 200, for displaying relevant advertisements and promotions as selected in one embodiment, based upon matches of keywords of the user's Context Database with keywords of various advertisements and promotions.
- Figure 2B illustrates an aspect of the invention in which granular items are aggregated for the user.
- the browser window 204 displays all the granular items in that collection in one aggregated format display 217.
- items may be dynamically aggregated.
- Figure 2B shows two items, "Intellectual Property Symposium of the Americas" 228 and "Some Well-Known U.S. Trademarks Create One Hundred Years" 230 displayed together when the PTO collection was selected.
- the items of captured information are processed such that previously autonomous documents may be intelligently merged. This facilitates browsing, sharing, exporting, importing, searching and sending collections of related granular items to others.
- many other forms of dynamic aggregation of WebBase items are possible.
- the ability to perform dynamic aggregation means that information can be easily and flexibly reorganized and repurposed in ways that would simply not be possible with traditional document based systems.
- Mixed-type formats include but are not limited to text, images, sounds, third party embeddable objects such as movies, applets, flash, Shockwave, as well as links and other files such as word and excel documents.
- users In capturing items, users have the option of capturing coarse grained information or fine-grained information.
- coarse grained information is a webpage, a document or an encompassing file.
- fine grained or granular information is a sub-component of a document or an encompassing file.
- the user may indicate whether the selected item is to be saved in its granular form, which typically is less than the data of an entire document, by selecting "Web Content" 258 for the item or whether to save the link to the webpage from which the item came by selecting "Web Link" 260.
- a name 262 is provided for the item, a Notes section 264, and a folder 226 is suggested for the item, all of which may be changed or added to by the user.
- a pop-up window 299 as indicated in Figure 2D is presented to the user for presenting a name for the item 242, a category 244, and suggestions for more keywords 246' which the user may select by manipulating the scroll bars 248 and the Add 291 , Manage 293 and OK 253 click buttons.
- Also presented to the user in a window 257 of the box is the source of the item, a description of the source and the date and time of capture.
- a user accepts the item to be inserted in his Context Database by clicking the OK button 253. The user can also adds notes or change the appearance of the item by clicking on the "Notes" 292 button and the "Appearance" 294 button respectively.
- a pop-up window 252 is displayed to the user to allow entry of notes, and a pop-up window, an example of which is shown in Figure 2F, is shown that a user can use to change the appearance of the item.
- Figure 3 A illustrates an embodiment of a computer-implemented system 320 for capturing and managing granular information from digital sources embodied as a browser plug-in to a browser 360 running on client computer 118'.
- the user's selection of a one click capture button 120' or 216 is communicated to a capture control 322.
- the capture control 322 interacts with operating system 164 and application specific access protocols. For example, when a user has selected 1 10' a granular item of information 112' as indicated in Figure 1 and has activated the one click capture button, the item is pasted to the operating system clipboard.
- the capture control 322 in the embodiment shown examines the captured content on the clipboard 168 in the available clipboard formats and selects the one that affords the best translation to a Uniform File Format. The item in the selected format is then passed along to the Uniform File Formatter, 324.
- the capture control 322 examines the browser's internal data structure or document object model and passes along the relevant file formatting information from this data set to the Uniform File Formatter 324.
- the Uniform File Formatter 324 places captured items in a uniform format.
- the Uniform File Formatter transforms all items into an "aggretabable" format so that the items may be dynamically aggregated.
- Figure 2B shows the items in the selected PTO collection 226 aggregated into one displayed HTML document 217 in the browser window 204.
- An item stored in aggregatable format maintains its presentation integrity regardless of its placement in an aggregated display.
- the meta-data generator 326 is shown as a separate element, but its functions and those of the uniform file formatter 324 can be performed together in a single pass of the captured item.
- the meta-data generator 326 associates meta-data with an item. Examples of meta-data include but are not limited to, a name, a category, keywords, a source, and a source description. Meta-data is used to provide context to a user's Context Database 140'.
- the generated meta-data is presented to a user through a user interface as shown in Figure 2D for approval and / or for changes and additions.
- the meta-data generator includes those changes and / or additions that are received from a user 338.
- the meta-data generator communicates the meta-data to the Context Database Manager 330 so that the meta-data can be stored in the user's Context Database 140'. Furthermore, the Uniform File Formatter 324 and the Meta- Data Generator 326 communicate with the Generated File Manager 328.
- the Generated File Manager 328 will store the item, now in its aggregatable format in a file system 354.
- the Generated File Manager dynamically generates a header that is written to a buffer.
- the Generated File Manager will locate the item, or items if a collection is selected, in "aggregateable" format, and append it or them to this header.
- the resulting buffer is written to a file and displayed to the user.
- the Context Database Manager 330 stores, retrieves and searches for requested data for the other components of the system from the Context Database 140'.
- the Context Database comprises the user's formatted files 354, database 356 of meta-data and other attributes, for example a pointer to a location in memory of an aggregatable item, associated with the user's items, and Seed Keywords 358. Examples of Seed Keywords can be found in Appendix C of the Provisional Application.
- Seed Keywords 358 are used by the Context Summarizer 332 under certain conditions in creating a Context Summary.
- the Context Summarizer 332 creates a Context Summary, which in one embodiment is a weighted, keyword-based Summary of the user's Context Database 140 by querying the Database.
- the Context Summarizer 332 comprises a Context Filter 336 which prioritizes the keywords in the Context Summary based on the currently selected item or collection.
- Context Summarizer creates a Context Summary each time a user selects an item or collection and sends the Summary to the Profile Analyzer 342, located on the server 128 in this embodiment.
- the Context Summarizer creates a Context Summary of keywords when requested by the Search Enhancer to which it returns the Summary.
- the Profile Analyzer 342 receives the Context Summary on each selection of an item or collection.
- the Context Summarizer 332 includes the category of an item.
- the Profile Analyzer 342 will obtain the category of the selected item or collection by requesting it from the Context Database Manager 330. The Profile Analyzer then sends the Summary and the category for the current selection to a Promoter Interface 344 located on a server 128.
- the Promoter Interface selects a promotion to be displayed to the user.
- the Promoter Interface compares the keywords of the current selection with keywords 345 associated with the promotions 343 stored on the server in order to find matches. If a match of the keywords is found, those promotions whose keywords match will be selected.
- the Promoter Interface will generate an HTML document for display in an advertising panel 206.
- the Search Enhancer 346 parses a user's search string entered in a search query entry bar 208.
- the Search Enhancer will send a request to the Context Database Manager 330 to search the Context Database to find occurrences of the words in the meta-data and to retrieve items or collections associated with the words. If any items or collections are associated with the words in the search string, the Search Enhancer requests a Context Summary for each item or collection from the Context Summarizer.
- the Search Enhancer creates a Search Summary from the Context Summaries which is sent along with the user's selected search engine to a Query Generator 348 on server 128 in this embodiment.
- the Query Generator 339 will place the words of the Search Summary in the proper form according to Search Engine Grammar Protocols 339.
- Figure 3B illustrates an embodiment of the Context Database as a WebBase 370 comprising an SQL database 376 and the user's files 378.
- the user's files comprises the user's items 372, as well as associated files 374 comprising files for referenced objects 374 such as an HTML frame or an embedded object such as an image or movie.
- the use of an SQL database aids in the classification, annotation, structuring, and management of the captured information.
- the SQL database comprises tables that define relationships so as to provide context to the user in managing the WebBase. Examples of table definitions were included in Appendix B of the Provisional Application, which has been incorporated by reference.
- the tables include an object table 380, a relationship table 382, a Filter table 392, a Filter Relationship table 394, an info map table 388, an info maps expkeys table 390, a tag table 386 and a Tag Relationship table 384.
- an item is represented by an entry in the object table 380.
- Columns include but are not limited to a key for keying into a particular item, the name of the item, notes, its source, source description, a pointer to the file's location, appearance attributes such as background color, and its child order with respect to a parent collection.
- Each item belongs in one or more collections as defined in the relationship table 382. For example, each item can be related as a child of one or more parent collections, and each collection can in turn be related as a child of one or more parent collections.
- keywords, categories, and names for each item are represented by entries in the Tag table 386 which are related to the items in the Object Table by relationships defined in the Tag Relationship table 384.
- the info table 388 is defined to keep the current state of the user's selections and the view the user sees of the collections.
- the Info Map Exp Keys interact with the Relationship table 382 and tracks the current open tree path by recognizing what collections and items are open. When the system is operating, this information will be in random access memory (RAM) can be easily obtained from the RAM rather than performing database queries on the Info Map tables.
- RAM random access memory
- the Filter 392 and Filter Relationship 394 tables assist the user in sifting through his or her Context Database.
- the Filter table 392 comprises several columns defining fields such as a key for indexing into the desired rows, a filter or agent name, annotation, and associated keywords.
- the Filter Relationship table 392 relates the items in the object table that comply with the selected filter to the filter.
- Figure 3C shows a pop-up window 395 as a user interface entitled "Agent Properties" to allow a user to define a filter, in this embodiment is called an agent.
- the Filter table 392 comprises columns corresponding to the user's entry of words in the entry bars 415, 401 , 393, 403 and 405 to be related to the following list: an agent's name, a name of an item, keywords of an item, notes for an item, and a source of an item.
- the user has entered "patent" in the keywords entry bar 393. This will cause the Context Database manager to create a Filter Relationship row for this agent "patent” that relates the items or collections having the word "patent" associated with them with the filter.
- the PTO collection has the patent agent applied to it.
- the "government" agent 41 1 will be applied to the PTO collection also.
- those items or collections associated with the filter will be displayed to the user.
- Figure 4 is a flow diagram of an embodiment of an overall method of formatting an item to a uniform file format.
- Figures 5 A and 5B describe an embodiment of a process of generating meta-data.
- a single pass through the captured item is efficiently employed for both purposes of uniform file formatting and generating meta-data.
- a single format that is able to encompass many other formats is highly desirable as a uniform file format.
- formats examples include markup languages such as HTML and XML, a URL to an ASCII file (including local files), Rich Text Format, proprietary binary formats such as those used with some word processors for embedded files and hyperlinks, local binary files such as those storing images and other multimedia, a URL to a binary file, and references to "self-managed" embedded objects such as a registered component that interprets and displays data and these objects amalgamated with other data such as Microsoft Excel, Microsoft Word, Acrobat PDF, Flash, and ASCII text.
- HTML is used in the following examples as the unifying format. All conversions occur recursively to ensure that the referenced files are appropriately converted as well if necessary.
- Figure 4 illustrates an embodiment of a method for converting an item of information into a uniform aggregatable HTML file format.
- the format of the item is determined 402.
- the Capture Control 322 provided the format.
- a local representation of the current item is generated 404 in HTML.
- file formatting and meta-data generation are done together in a single pass of the capture item.
- a name suggestion buffer and a keyword suggestion buffer are prepared 418.
- the item's characters are converted 422 to a single unifying character set.
- An example of a universal character set is the Unicode or the common Unicode mapping UTF8. This conversion is an example of an "aggregateable" transformation. This allows for the aggregation of two items which were previously incompatible based on prorprietaty character set formats.
- references to other external files are downloaded 408.
- the downloaded files are made 410 HTML compatible. For example, IMG tags are placed around images, and anchor tags around links.
- the references in the item to the external files are resolved 412 to point to the local copies of the files.
- the file is placed in aggregateable format 414 by for example, converting absolutely positioned objects to relatively positioned ones. This allows an item to maintain it's presentation integrity regardless of where it falls in an aggregated display.
- the item's header information can be removed from an item's HTML (if it exists) and stored in the Context Database.
- An example of an attributes that may be saved from the header information to maintain presentation integrity are the colors and fonts in which portions of the item are to be displayed.
- pointers to the file in a unified aggregatable format are stored 416 as meta-data including attributes of the item for later header generation upon dynamic aggregation.
- a captured item may already be in a version of HTML.
- HTML is generated in the version used as the locally stored HTML.
- HTML often references external files to be embedded in the viewable document.
- the referenced information is downloaded and resolved to point to the local copies. Again, this stage is managed recursively to get all the referenced files and their referenced files.
- a Java Script manages all references with a consistent and unique prefix.
- Frames can be converted to IFRAMEs, or alternatively to TABLEs in order to view combined frames captured on pre- HTML 4.0 systems. Again, a recursive approach is used so that frames within frames within frames are acceptable. Images, sound files, and movies may be stored in local or remote image files. The file is downloaded if necessary, and reference is made to the local copy. In capturing Java applets referenced, the applets are downloaded and resolved to reference the local copy. Capture of applet is initiated through the capture of the HTML file in which it resides.
- Some captured items or referenced files have parameters that can be entered.
- the embedded application's data file is downloaded as well as the referenced files from the parameter list. Reference to the local copy must be resolved, and the parameters must be altered so that the content can be displayed without loss of information, formatting or function.
- An example of this would be when capturing a Macromedia Flash file. Flash allows Flash developers to rely on certain parameters such as an input file. When the Flash file is downloaded, this input file must be downloaded as well. The parameter inside the flash tags need to change to reference the new downloaded version.
- the application may be downloaded or executed.
- a heuristic is used to analyze the parameter list by parsing its values. If it has "/" and ".” and extension we assume it's a reference to a file which is downloaded. The reference is resolved to the local copy, and the parameters are altered, so content can be displayed without loss of information, formatting or function.
- URLs are simple references to files (HTML). In one embodiment, the user is given two options. First, capture the file that is referenced, in which the capture process is simply restarted for treating the data as the file type rather than as a URL.
- the other option is to capture only the URL as a bookmark.
- the URL is then wrapped in the appropriate HTML/XML reference tags, for example ⁇ A HREF> ⁇ /A>.
- RTF Rich Text Format
- HTML/XML these formats typically cannot be embedded into an HTML viewer.
- the file is treated as an HTML link, and the reference to the file is wrapped in the ⁇ A HREF> ⁇ /A> tags.
- the file type is analyzed by considering the file header information, file suffix, and the operating system registered suffix. Then based on industry standard suffix, the file is wrapped in an appropriate HTML tag such as but not limited to an "IMG" tag for an image, a "SOUND” tag for a sound file, or an "EMBED" or "OBJECT” tag for a recognized HTML/XML plug-in.
- the remote file may be downloaded and managed as in the case of the local binary file.
- references to "self-managed" embedded objects to be converted to HTML may request to wrap the object in an "OBJECT" tag with the appropriate parameters as determined by a query to the operating system. If not attainable from the operating system, then a predefined dictionary of embeddable objects is consulted. In converting ASCII text to HTML/XML, the text is passed through a conversion filter.
- Formatting is done to maintain the integrity of the text. For example, spaces are converted to “ ”, CRLFs are converted to “ ⁇ BR>", “>” are converted to “>” and " ⁇ " converted to "<”.
- HTML has rules and constraints which the content of the text may trigger or violate.
- the text is passed through a conversion filter that outputs HTML which when viewed in an HTML interpreter (like a Web browser) displays the content that was captured.
- Figures 5A and 5B show an overall flow diagram of one embodiment of a process of generating meta-data for an item as may be executed by the meta-data generator.
- Meta-data is associated with every item the user captures, and as shown in Figure 2D, this meta-data can be viewed and changed by the user.
- the meta-data generated comprises a name, category, keywords, source and source description for each item.
- the name suggestion buffer and the keyword suggestion buffer have been prepared 418.
- the content of the capture item was filtered to provide "relevant" information, meaning information whose formatting tags (bold, italic, position, etc.) has been removed.
- the relevant information has been stored in both buffers.
- relevant information examples include certain description tags, such as HTML's "ALT" tag, which are considered “relevant” to the item's suggested name and would be retained in this example.
- This buffer may be limited at a particular size, for example 5K, so as not to negatively affect the product's capture performance.
- the meta-data generator creates a suggested name of the first N, a preselected number as may be determined by, for example, file system constraints whole words whose summed length including spaces is less than a predetermined number of characters 510, for example, 128.
- a suggested name is determined, it is checked 512 for an inappropriate closing construct. If the suggested name does end in an inappropriate closing construct, the last word in the name is removed 514 recursively until the closing construct is no longer inappropriate. On this event, the suggested name is selected 516 as the name of the item.
- inappropriate constructs are examples of inappropriate constructs:
- a category is assigned to it such as "Business” or "Travel.”
- This collection is the "relevant" context collection from which the item automatically inherits its category if one was not already selected by the user.
- Figure 5B shows that the item's category is assigned based on the first "relevant" context collection with which it was associated 518.
- the source of an item is associated with the item based upon information received from the source application during the transfer of data 520.
- Part of an item's meta-data in this embodiment is a description.
- the description can also be created from the name suggestion buffer of relevant information. For example, if the unified file format is in a parsable markup language 522, and there is a title tag 524, the item's description or source description can be set to the content in the title tag 526. If there is no title tag associated with the item 522, 524, then the item's description can be set to its source 528.
- keywords are associated with the item.
- the relevant information is stored not only in a name suggestion buffer, but also in a keyword suggestion buffer 506.
- a larger keyword buffer, 50K is desired.
- the meta-data generator will draw on keywords from one or more of the following sets of keywords.
- the first set would be the User Keywords which refers to the set of all user keywords, rather than the context-specific set described as Context Keywords.
- a user can enter keywords to be associated with an item or accept keywords suggested for an item. These keywords form the set of user keywords.
- Context Keywords are those keywords associated with items and collections in the path of the item from the top of the collections hierarchy to the item. The path could have many levels of collections or no levels. Limits as to the number of levels, for example three (3) levels, to be searched and a minimum and maximum number of keywords may be desirable.
- An example of such a set of Context Keywords is the union of:
- a capture destination of an item has a category assigned to it 530. If it does, an SQL query of the Object 380, Relationship 382 , Tag 386 and Tag Relationship 384 tables in the user's Context Database, for example a WebBase, is made to capture all the keywords that satisfy the set which is a union of the previous three sets. The result from the query is the Context Keywords set for the particular item.
- an SQL query of the Object 380, Relationship 382 , Tag 386 and Tag Relationship 384 tables in the user's Context Database for example a WebBase
- the destination collection has a category
- the seed keywords associated with its category can also be analyzed for matches.
- a keyword threshold for example 3, for the minimum number of keywords sought is used as well as a maximum keyword suggestions which is a parameter that determines the maximum number of keywords that will be suggested for an item.
- the meta-data generator loops through the Context keyword set and the seed words associated with the category of the destination collection matching each set against the captured content 532.
- the loop is bounded by the Keyword Threshold and the Maximum Keyword Suggestions 532. Additionally, matches made against the earlier set(s) will never be replaced by matches from later sets. If the Keyword Threshold is not satisfied 536 after going through all these keywords, then the selection of possible keywords for matching is broadened as in the case where the destination collection does not have a category. Otherwise, the keywords are assigned to the item as tracked by the meta-data in the user's Context Database 538.
- the keyword sets include the Context Keywords, all User Keywords and all the Seed Keywords.
- the meta-data generator loops through the Context keyword set, all the user keywords entered and all the seed keywords matching each set against the captured content 534. The loop is bounded by the Keyword Threshold and the Maximum Keyword Suggestions 534. The resulting keywords are then assigned to the item in the user's WebBase 538. In the embodiment of Figure 3B, the meta-data generator would send the resulting keywords to the WebBase manager which would integrate these keywords with the user's WebBase as defined by the tables in the WebBase.
- Figure 6A illustrates an example of an item access system, a file system hierarchy for storing an item and the objects it references.
- top directory is an Installation Directory 602.
- the Data Directory 604 which comprises Item HTML Files 606, Generated Display Files 612 and a Item Associated Directory 608 which is at the same level as the Item HTML Files.
- the Files Associated with that directory 610 meaning the objects which the associated Item file references.
- Figure 6B illustrates an example of 3 items formatted to HTML with references to external files using the file system hierarchy of Figure 6A.
- the Main Directory 614 is the topmost directory in the file system hierarchy.
- the first item 618 has been captured from a web site and has been written to the file system as 123.htm (618) in the Data Directory 616.
- the files it referenced were also downloaded into the directory called 123 (624) which is at the same level as 123.htm in the file system hierarchy.
- the relevant references are changed to the 123 directory (624) from references to the World Wide Web.
- the second capture item was from another web page and is stored under the Data Directory 616 as 234.htm (620), and the referenced files to 234 were downloaded and saved in directory 234 (630) which is at the same level as 234.htm in the file system hierarchy.
- the relevant references are changed to the 234 directory (624) from references to the World Wide Web.
- the third item is a text file that was captured as a copy from the file system.
- the uniform file format is HTML so this text file has been converted to HTML.
- the HTML file is saved under the Data Directory 616 as 345.htm (622), and the text file 640 is stored into the directory called 345 (638) which is at the same level as 345.htm (622) in the file system hierarchy.
- Display_Pagel.htm 642 This page contains some standard HTML tags including ⁇ HTML> and ⁇ BODY> and some generated HTML that displays the meta-data and appearance preferences suffixed by the entire 123.htm. This simple, blind concatenation is possible for two reasons. First, when the system creates these granular HTML files, it makes certain that they don't contain any document global information, so they can be strung together endlessly. The system generates them so that they all reference the appropriate subdirectories; there is no need to subsequently re-process the associated file reference, since all the Items and generated files are at the same directory level.
- Figure 7A is a flow diagram of an embodiment of a method for creating a Context Summary based upon the user's Context Database.
- the use of meta-data provides an information context to enhance targeted advertising and searching.
- the method may be embodied as computer instructions, stored in a computer usable medium, that are executed on a processor.
- the Context Summarizer 332 distills "usable" pieces of the information out of a Context Database and creates a succinct Summary of the users interest at any given moment.
- the Context Summarizer 332 comprises a Database Summarizer 334 and a Context Filter 336.
- the Database Summarizer analyzes the meta-data associated with each item and the relationships defined in the Context Database, and is able to generate a weighted, keyword-based Summary of the user's interests. This Summarization method effectively provides a multidimensional synopsis of the user's interests in a format that can efficiently be analyzed by a computer.
- the meta-data specifically the name and keywords associated with each item, are analyzed.
- Each item is represented by an entry in the Object table 380, and a keyword is represented by an entry in the Tag table 392.
- Each item entry has zero or more keywords, as defined in the TagRelationship table 394.
- Each item belongs in one or more collections as defined in the Relationship table 382.
- Each collection has one or more parent collections as defined in the Relationship table 380.
- the Context Summarizer creates a Context Summary based on the input of two types of information or seeds, the first being a selection of an item or collection, as may be indicated by a user click, and the second being one or more words entered by the user.
- the seed is of the second type
- the method comprises the actions used for a selection of an item or collection as described in Figure 7A.
- a current level variable is set to the level of the item or collection selected by the user, the current selection, 702 in the file system hierarchy.
- the level of the current selection is the root level.
- the keywords associated with the item or collection of the current level are obtained 704. All the keywords at this level are weighed with a formula, such as [100 / (level - 0.05)], that prioritizes keywords that are closest to the root level 706.
- a determination is made as to whether the keyword is associated with an item or collection in the current open tree 708.
- the current open tree is the path in the file system hierarchy from the top level collection, for example All Collections in Figure 2 A, to the root level of the item or collection. For example, if the user clicks on the item "Intellectual Property Symposium of the Americas" 228 , the current open tree would include, "Intellectual Property Symposium of the Americas" 228, the PTO collection 226, the Patent collection 224, and the All Collections collection 223.
- a Context Filter is applied to the keyword 710 to provide additional weight to this keyword.
- the Context Filter adds additional weighting to collections and items in the currently active path. This enables the present invention to not only know the user's interests, but also to know the user's specific interest at any given moment.
- the name of the item or collection 712 has a lower weight applied to it 712.
- the weight of the name may be determined by [75/((level)2 - 0.05)]. If it is determined that the item or collection is in the current open tree 714, the Context Filter is also applied to the name 716. If there is another level up in the file hierarchy 718, the current level is set to the next level up 720, and the process repeats until there are no more levels to traverse 722.
- the name/keyword weighting is cumulative; if the same keyword was used to define 3 items in the hierarchy their individual weighting would be added together 719.
- Figure 7B shows an example of creating a Context Summary using the embodiment of Figure 7A.
- the levels of the tree are assigned in ascending order from 1 for the root level 748, 2 for parent level 746, 3 for the grandparent level 744, up to level 5 740.
- the example uses the keyword weighting defined by [100/level-0.05].
- the weight of a keyword should be highest at the root level 748 and should decrease somewhat for each level we go up in the tree.
- the 0.05 is subtracted so that a keyword that shows up twice at level 2 746 will be given a weight that is slightly less than the weight of a keyword at level 1 748.
- the Context Filter in this example is embodied as an additional multiplicative factor. If the item or collection being evaluated is in the current open tree as denoted by the asterisk, the weighting will be multiplied by 2.5. This factor was chosen so that a keyword/title from an item or collection at level 4 in the current open tree would have more weight than a keyword/title from another at level 2 which is not in the current open tree.
- FIG 8A is a flow diagram of an embodiment of a method of selecting a targeted advertisement using the Context Summary in accordance with the present invention.
- a Context Summary and a category is received for an item or collection currently selected by the user 802.
- the category is included with a prioritized list of Context Summary keywords.
- One example of how the keywords can be prioritized is by applying the weighted method described in Figures 7A and 7B.
- the Context Summary keyword having the highest priority is selected 804 as the current keyword.
- the current keyword is compared 806 with keywords that are associated with promotions. If a match 808 occurs, an HTML document is generated 816 that embeds some of the promotions of the matching keywords for display to the user, such as in the Advertising Panel 206 of Figures 2 A and 2B.
- the Context Summary keyword having the next highest priority is selected 812 as the current keyword and the comparison 806 is repeated, along with the checks on matches 808 and the number of keywords 810 until either a match is found or there are no more keywords.
- promotions are selected 814 having the same category as the selected item or collection, and the HTML document is generated 816.
- An example of such a generated document is the advertisement 970 for the lowest fares available in Figure 9C that is brought up by the user's click on Africa 972 in the Travel collection 974.
- Another aspect of the present invention provides "searching in context".
- search searching in context
- the Context Summary is used to synthesize a powerful search query.
- This query which is generally much more sophisticated and complete than the ones a user would typically type, is passed along to existing search portals. Because the query provides the search portals with dramatically more information than most manually entered queries, the search engines typically return results containing more relevant hits.
- Figure 8B is a flow diagram of an embodiment of a method of providing an enhanced search string to a user.
- this method is performed by the Search Enhancer 346.
- a user may initiate a search. The user may click on an item or collection and request a search on it such as by clicking on Web search button 220, or a user may enter a search string in, for example the search query bar 208.
- the user input indicating a search request is received 818, and a determination is made whether a search string 820 has been entered. If not, then an item or collection has been selected, so a Context Summary is created for the selected item or collection 824. If a search string was entered, the search string is parsed 822 into the user's search words. The keywords in the user's Context Database 140 are searched 826 for occurrences of the user's search words. If no occurrences have been found, the method terminates 827 in this embodiment. Otherwise, the items and collections having the user's search words 830. For each item or collection, a Context Summary is created 832. Weights of keywords that appear more than once in all of the Context Summaries are summed 834. Whether the user clicked on an item or collection, or entered keywords, a Search
- a number of keywords, the number of which does not exceed a second maximum number of keywords, Q, that have weights that pass a threshold value are designated 838 as priority search words.
- the Search Summary keywords are placed in the search query grammar protocol of the user's selected search engine 840 with grammar specific emphasis given to the priority search words.
- An example of such a protocol is including a "+" sign before a word to indicate that the word must absolutely be contained in the result.
- the Search Summary keywords are then displayed 842 to the user with the priority search words marked to the user.
- FIG. 9B An example of such a list of Summary Keywords 963 is shown in Figure 9B.
- the plus “+” signs before “vacation” 960 and “Map” 966 indicates that they are suggested as priority search words to the user.
- the user may select any of the words listed under "More Suggestions” by simply selecting from the Search Summary those words that she would like to add to the query.
- Figure 9A illustrates a user interface window in which the user has clicked on
- Kenya 920 would not affect the weighting at all, in the context of Kenya's Masai Mara 918.
- the Context Summary for the this click may include the following: Travel (a keyword associated with every collection under Trips),
- Africa a keyword associated with Masai Mara, Elephant and the name of a collection
- Masai Mara Keyword for Elephants and the name of a collection
- Kenya Keyword for Masai Mara, Elephant and the name of a collection
- Trips Keyword for Africa, Kenya, Masai Mara and the name of a collection
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Strategic Management (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- General Health & Medical Sciences (AREA)
- Finance (AREA)
- Tourism & Hospitality (AREA)
- Databases & Information Systems (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Game Theory and Decision Science (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Entrepreneurship & Innovation (AREA)
- Data Mining & Analysis (AREA)
- Human Resources & Organizations (AREA)
- Primary Health Care (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06019863A EP1760613A3 (de) | 1999-07-02 | 2000-06-30 | System und Verfahren zur Erfassung und Verwaltung von Information aus einer digitalen Quelle |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14223799P | 1999-07-02 | 1999-07-02 | |
US142237P | 1999-07-02 | ||
PCT/US2000/018111 WO2001002984A2 (en) | 1999-07-02 | 2000-06-30 | System and method for capturing and managing information from digital source |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP06019863A Division EP1760613A3 (de) | 1999-07-02 | 2000-06-30 | System und Verfahren zur Erfassung und Verwaltung von Information aus einer digitalen Quelle |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1282864A2 true EP1282864A2 (de) | 2003-02-12 |
Family
ID=22499112
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP00946945A Withdrawn EP1282864A2 (de) | 1999-07-02 | 2000-06-30 | System und verfahren um informationen von digitalen quellen zu erfassen und zu verwalten |
Country Status (8)
Country | Link |
---|---|
EP (1) | EP1282864A2 (de) |
JP (1) | JP2003524823A (de) |
KR (1) | KR20020075359A (de) |
AU (1) | AU6062800A (de) |
BR (1) | BR0011779A (de) |
CA (1) | CA2377576A1 (de) |
NZ (1) | NZ516340A (de) |
WO (1) | WO2001002984A2 (de) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU5868801A (en) | 2000-05-29 | 2001-12-11 | Kaisha Saora Kabushiki | System and method for saving browsed data |
KR100955693B1 (ko) * | 2001-11-27 | 2010-05-03 | 강민수 | 텍스트 콘텐츠를 이용한 광고 방법. |
GB2378775A (en) * | 2001-05-03 | 2003-02-19 | Connextra Ltd | Provision of Secondary Information based on Context of Primary Information |
US20050114198A1 (en) * | 2003-11-24 | 2005-05-26 | Ross Koningstein | Using concepts for ad targeting |
US7664733B2 (en) * | 2003-04-11 | 2010-02-16 | Ricoh Company, Ltd. | Techniques for performing operations on a source symbolic document |
US20070022110A1 (en) * | 2003-05-19 | 2007-01-25 | Saora Kabushiki Kaisha | Method for processing information, apparatus therefor and program therefor |
KR100948256B1 (ko) | 2004-06-24 | 2010-03-18 | 한국전자통신연구원 | 타겟팅 지원을 위한 확장된 서술구조와 이를 적용한티비애니타임 서비스 방법 및 시스템 |
US7689915B2 (en) | 2004-07-29 | 2010-03-30 | Canon Kabushiki Kaisha | Image processing apparatus and image processing method using image attribute information and thumbnail displays for display control |
EP1826685B1 (de) * | 2006-02-03 | 2009-05-13 | Hurra Communications GmbH | Verfahren zur Auswahl und Darstellung mindestens einer Zusatzinformation |
WO2009120004A2 (ko) * | 2008-03-24 | 2009-10-01 | Kang Min Soo | 디지털 콘텐츠 관련 메타 정보 활용 키워드 광고 방법 및 그 관련 시스템 |
KR101061806B1 (ko) | 2008-03-24 | 2011-09-05 | 강민수 | 유무선 네트워크를 통하여 미디어 서비스를 수행하는 미디어 서비스 시스템의 정보 처리 방법 |
KR100926123B1 (ko) * | 2008-07-21 | 2009-11-11 | 강민수 | 디지털 콘텐츠에 포함된 텍스트 콘텐츠를 이용한 광고 방법 |
CN102129632A (zh) * | 2010-01-13 | 2011-07-20 | 阿里巴巴集团控股有限公司 | 一种抓取网页信息的方法、装置及系统 |
CN103971244B (zh) | 2013-01-30 | 2018-08-17 | 阿里巴巴集团控股有限公司 | 一种商品信息的发布与浏览方法、装置及系统 |
CN111079377B (zh) * | 2019-12-03 | 2022-12-13 | 哈尔滨工程大学 | 一种面向中文医疗文本命名实体识别的方法 |
KR102468630B1 (ko) | 2022-08-05 | 2022-11-22 | 주식회사 레이첼블루 | 아이템 가치평가를 통한 아이템 바꿔쓰기 매칭 플랫폼 서비스 제공 장치 및 방법 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995034051A1 (en) * | 1994-06-06 | 1995-12-14 | Spectragraphics Corporation | Method and apparatus for capturing and distributing graphical data |
-
2000
- 2000-06-30 JP JP2001508705A patent/JP2003524823A/ja active Pending
- 2000-06-30 AU AU60628/00A patent/AU6062800A/en not_active Abandoned
- 2000-06-30 BR BR0011779-0A patent/BR0011779A/pt not_active Application Discontinuation
- 2000-06-30 NZ NZ516340A patent/NZ516340A/en unknown
- 2000-06-30 EP EP00946945A patent/EP1282864A2/de not_active Withdrawn
- 2000-06-30 KR KR1020027000027A patent/KR20020075359A/ko not_active Application Discontinuation
- 2000-06-30 WO PCT/US2000/018111 patent/WO2001002984A2/en not_active Application Discontinuation
- 2000-06-30 CA CA002377576A patent/CA2377576A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See references of WO0102984A3 * |
Also Published As
Publication number | Publication date |
---|---|
JP2003524823A (ja) | 2003-08-19 |
KR20020075359A (ko) | 2002-10-04 |
WO2001002984A3 (en) | 2002-11-28 |
NZ516340A (en) | 2004-07-30 |
CA2377576A1 (en) | 2001-01-11 |
BR0011779A (pt) | 2003-07-01 |
WO2001002984A2 (en) | 2001-01-11 |
AU6062800A (en) | 2001-01-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7086002B2 (en) | System and method for creating and editing, an on-line publication | |
US9348872B2 (en) | Method and system for assessing relevant properties of work contexts for use by information services | |
US6836768B1 (en) | Method and apparatus for improved information representation | |
US7895595B2 (en) | Automatic method and system for formulating and transforming representations of context used by information services | |
US7882115B2 (en) | Method and apparatus for improved information representation | |
US7370061B2 (en) | Method for querying XML documents using a weighted navigational index | |
US7774333B2 (en) | System and method for associating queries and documents with contextual advertisements | |
US20060155728A1 (en) | Browser application and search engine integration | |
US20090144240A1 (en) | Method and systems for using community bookmark data to supplement internet search results | |
US20070078889A1 (en) | Method and system for automated knowledge extraction and organization | |
US20070185860A1 (en) | System for searching | |
KR101393839B1 (ko) | 링크된 용어들을 포함하는 활성 요약들을 제공하는 검색시스템 | |
CN100462969C (zh) | 利用互联网为公众提供和查询信息的方法 | |
WO2001050349A1 (en) | Electronic document customization and transformation utilizing user feedback | |
JP2002334106A (ja) | 話題抽出装置、方法、プログラム及びそのプログラムを記録する記録媒体 | |
EP1282864A2 (de) | System und verfahren um informationen von digitalen quellen zu erfassen und zu verwalten | |
EP1247213B1 (de) | Verfahren und gerät zum erzeugen eines auf einer formatvorlage basierten index für ein strukturiertes dokument | |
KR100403947B1 (ko) | 개인용 전자문서를 생성하는 방법, 시스템 및 컴퓨터프로그램 제조물 | |
KR100495034B1 (ko) | 인포박스를 이용한 정보제공 시스템 및 방법 | |
EP1760613A2 (de) | System und Verfahren zur Erfassung und Verwaltung von Information aus einer digitalen Quelle | |
Arnicans et al. | Intelligent integration of information from semistructured web data sources on the basis of ontology and meta-models | |
Shinagawa et al. | Dynamic generation and browsing of virtual www space based on user profiles | |
KR20100084265A (ko) | 사용자 피드백을 이용하여 평가된 컨텐츠로부터 정보를 추출하고 이를 이용하기 위한 방법 및 장치 | |
Evans | Assisting World Wide Web Navigation | |
Ellerin | The search continues: three publishers' site search solutions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20020107 |
|
AK | Designated contracting states |
Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
|
AX | Request for extension of the european patent |
Extension state: AL LT LV MK RO SI |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G06F 17/30 20060101AFI20060531BHEP |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: INTERWOVEN INC. |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
|
18W | Application withdrawn |
Effective date: 20061117 |