WO1999057656A1 - Method and apparatus for simultaneously accessing a plurality of dispersed databases - Google Patents
Method and apparatus for simultaneously accessing a plurality of dispersed databases
- Publication number
- WO1999057656A1 (application PCT/US1999/009483)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- results
- query
- database
- search
- client
- Prior art date
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9532—Query formulation
Definitions
- the present invention is related generally to the field of database searching, and more specifically to simultaneous searching for data across a wide area network such as the Internet, the network including a plurality of clients and servers and a plurality of databases.
- a wide area computer network comprises a geographically disperse, interconnected plurality of computers capable of sharing data and/or processing capacity.
- the Internet is the world's largest WAN, growing at an annual rate some estimate to be above one thousand percent.
- In March of 1998 there were an estimated 320 million pages of information posted on the World Wide Web (the graphics-capable portion of the Internet), with uncounted millions of gigabytes of additional information stored in non-Web based, though Web accessible, databases.
- the present invention addresses the need for an efficient method of finding data on a large scale WAN such as the Internet, including the visible and hidden portions of the World Wide Web, and the need to efficiently update found information as content evolves and grows.
- a number of challenges face the computer user accessing the Web and attempting to locate information about any particular subject matter.
- the immensity of the visible Web makes sorting through data found through currently available search engines difficult and time consuming.
- found data may include a substantial quantity of material not related to the sought-after material, but discovered anyway through simple boolean word association or other search mechanisms known to those skilled in the art to which the present invention pertains.
- the user must instead browse to the proper database access page and provide a boolean or other description of the desired information, in a manner which is redundant when performed in addition to a similar exercise required for searching the visible web.
- the user would be well-served by a mechanism for differentiating between newly found data and data previously discovered and analyzed by the user.
- a single subject database (e.g., a healthcare database)
- the majority of the single subject database entries comprises a hierarchical listing of hidden web databases, all entries being organized by subject matter and each including a description of database content and a search term entry interface customized for the particular database access page format.
- a user may establish a single query that the application then broadcasts to each desired hidden database to obtain indirectly accessible information.
- the results of the query are cached on the user's computer and displayed, preferably in HTML format.
- There are also listings in the database which provide an interface to search engines hosted at a dedicated search server.
- Each of these search engines includes a subject matter-limited listing of visible web sites that are particularly relevant to the database's subject.
- the user's query can be broadcast through the dedicated search server to obtain directly accessible information from the visible web.
- the search results of the visible web sites can then be displayed in HTML format similar to the results of hidden web searches.
- Each database is preferably updated at a regular interval, such as monthly or weekly, via remote download from a server on the WAN, or by other data transport means.
- Desired keywords are preferably cached and shared among database search interfaces.
- FIG. 1 illustrates a wide area computer network environment in which the method and system of the present invention may be utilized.
- FIG. 2 illustrates a user interface for use with the present invention.
- FIG. 3 illustrates a hidden database user interface as utilized with the present invention.
- FIG. 4 illustrates a feedback display provided in response to the search requested through the interface illustrated in FIG. 3.
- FIG. 5 illustrates a differentiated data feedback display of the present invention.
- FIG. 6 is a listing of the main dialogs offered in one embodiment of the invention.
- the present invention is preferably implemented as a software application 10 executed at least primarily by a client computer 12 connected to a wide area network 16 (such as the Internet) including a plurality of client computers and server computers 14.
- Application 10 stores and accesses at least one single subject database.
- the majority of the single subject database entries comprise hierarchical listings of hidden web databases or sources, all entries being organized by subject matter and each including a description of database content, URL information to locate the database and a search protocol for the database, such as a term entry interface customized for the particular database access page format.
- Application 10 obtains indirectly accessible information by issuing queries to the listed hidden web databases.
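As an illustration of the listing structure and the single-query broadcast described above, here is a minimal sketch. The class name, field names, and the assumption that each database accepts its search term as a plain URL query parameter are hypothetical; the source only says that each entry carries a description, URL information, and a search protocol.

```python
from dataclasses import dataclass
from urllib.parse import urlencode


@dataclass
class HiddenDatabaseEntry:
    """One listing in the single subject database (field names are hypothetical)."""
    subject: str      # position in the subject hierarchy, e.g. "healthcare"
    description: str  # description of the database content
    search_url: str   # URL of the database access page
    term_field: str   # name of the form field that carries the search term


def build_queries(entries, term):
    """Return one query URL per listed database for a single user query."""
    return [f"{e.search_url}?{urlencode({e.term_field: term})}" for e in entries]
```

Each entry supplies its own field name, so one user query can be formatted for several differently shaped access pages.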
- the single subject database entries also comprise listings for search engines hosted at a dedicated search server 17.
- By routing queries through the dedicated search server, application 10 obtains directly accessible information from the visible web.
- Application 10 also provides a timing interface 18, illustrated in FIG. 2, for the user to set times (such as by the hour or the day of the week) for the client to monitor the results of a specific hidden web database or visible web query (preferably executed through the search engine provided by client 12 for the desired hidden database or databases, or the user's desired visible web search terms).
- Client 12 preferably stores the user's preferred monitoring schedule on a hard drive or similar stable memory local to the client and checks the schedule every time client application 10 is activated, as well as at predetermined intervals (e.g., every 15 minutes) thereafter while application 10 is activated. If a schedule check reveals query results are due to be monitored, client 12 obtains indirectly accessible information by sending the user's desired query to the desired sites from the database and directly accessible information by sending the query to the search engine server dedicated to a specific group of visible web sites, and retrieves the results. Client 12 is then preferably directed by application 10 to compare new results to previously retrieved results using a difference algorithm, and to display the difference in HTML format on a current results viewing page.
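The schedule check described above could be sketched as follows. This is only an illustration: the `MonitorEntry` type, the weekly day-and-hour slot model, and the rule that a never-run query counts as due are assumptions, not details given in the source.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class MonitorEntry:
    query: str
    weekday: int                         # 0 = Monday ... 6 = Sunday
    hour: int                            # 0..23, local time
    last_run: Optional[datetime] = None  # None if the query was never executed


def latest_slot(entry, now):
    """Most recent scheduled time at or before `now`."""
    slot = now.replace(hour=entry.hour, minute=0, second=0, microsecond=0)
    slot -= timedelta(days=(now.weekday() - entry.weekday) % 7)
    if slot > now:
        slot -= timedelta(days=7)  # this week's slot is still in the future
    return slot


def is_due(entry, now):
    """A query is due if it has not run since its latest scheduled slot."""
    return entry.last_run is None or entry.last_run < latest_slot(entry, now)
```

The client would call `is_due` for every stored entry when the application starts and again at each periodic check (every 15 minutes in the example given in the text).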
- a server 17 dedicated to visible web search functions is preferably directed by client 12 to do a previous and current results comparison and send an HTML-formatted results page to client 12 for display to the user.
- comparisons may be accomplished exclusively on client 12 or by the client and server in combination.
- the process of query comparison and difference display for hidden web databases preferably begins with monitor set-up.
- the user preferably schedules monitor times via interface 18 after viewing results of an initial query, or before sending an initial query. If a particular query has not already been executed, client 12 formats and sends the query to a database server at the next desired time interval.
- a database query for the term "Crohn's" is illustrated in FIG. 3. There may be fewer queries made than requested by the user if the number of sites set to be monitored at a given time exceeds the available bandwidth of the data exchange connection 15 between client 12 and network 16, but as many queries as possible will be made at any given time according to system capability.
- the software is configured so that any queries not performed at a given monitoring time due to bandwidth and time constraints will be queued for execution at the next available opportunity, such as the next time application 10 is run, whether or not a scheduled monitoring event is due.
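The deferral behavior described above, where queries that do not fit in the current monitoring pass are queued for the next opportunity, might look like the sketch below. The function name, the `capacity` parameter (standing in for whatever the bandwidth allows), and the `send` callback are hypothetical.

```python
from collections import deque


def run_monitor_cycle(backlog, due_now, capacity, send):
    """Send at most `capacity` queries; keep the rest queued for the next opportunity."""
    # Queries deferred earlier stay at the front; newly due queries join the back.
    backlog.extend(q for q in due_now if q not in backlog)
    sent = []
    while backlog and len(sent) < capacity:
        query = backlog.popleft()
        send(query)
        sent.append(query)
    return sent
```

Calling the function again with an empty `due_now` list, for example the next time the application runs, drains whatever was left in `backlog` even if no scheduled monitoring event is due.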
- Client 12 retrieves the query results, preferably in HTML format, and provides a mechanism to the user to view these results within a Web browser.
- client 12 caches HTML (and related graphics) for the initial results page.
- Subsequent monitor queries triggered at the appropriate time intervals are accomplished by client 12 formatting and sending a query to a database server at a particular database-housing Web site, such as one that might be housed on server 19.
- the user may choose to limit results processing to preferably between one and twenty sites at a time, although additional simultaneity may be accommodated through the use of accelerated hardware on client 12 and a high bandwidth connection 15 to network 16.
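One way to bound simultaneous processing to between one and twenty sites is a worker pool whose size is clamped to that range. This is a sketch under assumptions: the source does not say how simultaneity is implemented, and `query_sites` and the `fetch` callback are hypothetical names.

```python
from concurrent.futures import ThreadPoolExecutor

MAX_SITES = 20  # upper bound taken from the text; the lower bound is one


def query_sites(sites, fetch, limit=MAX_SITES):
    """Run `fetch(site)` for each site, at most `limit` sites at a time."""
    workers = max(1, min(MAX_SITES, limit))  # clamp the user's choice to 1..20
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in the same order as `sites`.
        return dict(zip(sites, pool.map(fetch, sites)))
```

Here `sites` is expected to be a list, and `fetch` would be the routine that formats and sends the query to one site and returns its results page.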
- Client 12 then preferably utilizes a difference algorithm to compare the current query results to HTML results and related graphics previously cached in a long-term stable memory. Extraneous information such as advertising banners is preferably removed to allow the user to focus on new results.
- An example of a result comparison HTML display is provided in FIG. 5. If the query results have not changed, client 12 notifies the user. If the query results have changed, client 12 notifies the user and creates an HTML document which displays the differences between the old and new query results and highlights the differences in the body of the text on the most recent HTML results page.
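The source does not name a particular difference algorithm. As a stand-in, the sketch below uses Python's standard `difflib.SequenceMatcher` to compare cached and current results line by line and to wrap new or altered lines in `<mark>` tags; the function name and the choice of `<mark>` for highlighting are assumptions.

```python
import difflib
import html


def highlight_changes(old_lines, new_lines):
    """Return (changed, html_lines) with new or altered lines wrapped in <mark>."""
    matcher = difflib.SequenceMatcher(a=old_lines, b=new_lines)
    changed = False
    out = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            changed = True  # covers inserted, replaced, and deleted lines
        for line in new_lines[j1:j2]:
            text = html.escape(line)
            out.append(text if tag == "equal" else f"<mark>{text}</mark>")
    return changed, out
```

The `changed` flag corresponds to the two notification cases in the text: unchanged results only trigger a notice, while changed results also produce the highlighted page.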
- the provided results page preferably also provides link elements within the text to navigate between each of the differences and links to view previous and current results.
- Client 12 then provides the user a mechanism to view results within a browser, and replaces a previously cached HTML results document (and related graphics) with a current results document.
- the client application finally caches the most recent query results, and provides means for the user to view the most recent results.
- Client 12 will preferably only compare a newest scheduled search result to a first search or subsequent, most recently changed result.
- client 12 formats and sends the query to server 17 at a next predetermined time interval.
- Server 17 then sends an HTML results page and a results summary document back to client 12, in response to which client 12 provides the user the usual means for viewing these results in a Web browser, and client 12 caches both the HTML results page and the summary document.
- client 12 formats and sends both the query and the previous results summary document to server 17, which uses the previous summary document and the current query summary document to compare current query results to previous results, and sends an HTML-formatted changed results page back to client 12 (thus, the page displays only new or different results, not unchanged results).
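The server-side comparison could be sketched as a set difference over summary documents. The source does not define the format of a summary document; the sketch assumes it is simply the set of result URLs, and `changed_results` is a hypothetical name.

```python
def changed_results(previous_summary, current_results):
    """Return (changed, current_summary).

    `current_results` maps result URL -> title; a summary is assumed to be
    the set of result URLs seen in one run of the query.
    """
    current_summary = set(current_results)
    new_urls = current_summary - set(previous_summary)
    # Keep the server's result order while dropping unchanged entries.
    changed = {url: title for url, title in current_results.items() if url in new_urls}
    return changed, current_summary
```

The client would cache the returned `current_summary` and send it back with the next monitoring query, so the server needs no per-user state for this comparison.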
- Client 12 then provides the customary means for the user to view results in a Web browser, and caches the newest HTML results page and newest summary document for later comparison.
- Server 17 may also be configured to maintain a user's query and search preferences and run the monitoring functions automatically. Server 17 can then notify the user of any changed results by communicating directly with client 12 during the next execution of the application, by email, or by network independent methods such as paging or automated phone notification.
- the user can establish the criteria for triggering a change notification.
- Upon inquiry by the user, client 12 preferably displays the monitor times and current monitoring status for each query set to be monitored. Client 12 preferably does not save HTML for the difference results pages, but rather saves only the HTML for the most recently changed results page, which the user can access from a site monitor interface. For visible web monitored query results, the changed results page provides a link to an HTML page displaying all current results.
- server 17 can also be configured to run confidential searches. Instead of client 12 issuing the query directly to an invisible web database, the query can be routed through server 17. The result of this operation is that the invisible web database sees the query as issuing from dedicated server 17, not client 12.
- application 10 offers a variety of user configuration options to optimize the searching patterns and displayed results for a given user.
- FIG. 6 shows a listing of the dialogs prompted by the application 10 to guide the user in configuring the application and issuing the searches.
- the present invention therefore provides a method and apparatus for simultaneously and intelligently searching and accessing data from otherwise difficult to reach databases.
- the apparatus preferably includes a single subject database (e.g., a healthcare database, although a plurality of such databases may be resident on a single client) including a hierarchical listing of hidden web databases organized by subject matter.
- the apparatus further comprises means of access to search engines hosted on a dedicated service provider server, the search engines accessing an index of visible web sites that are particularly relevant to the selected subject matter. Search results are preferably cached on the user's computer, allowing for easy sorting of new and old data and differentiated display to the user.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Information Transfer Between Computers (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU38748/99A AU3874899A (en) | 1998-05-01 | 1999-04-30 | Method and apparatus for simultaneously accessing a plurality of dispersed databases |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US8385698P | 1998-05-01 | 1998-05-01 | |
US60/083,856 | 1998-05-01 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO1999057656A1 (en) | 1999-11-11 |
WO1999057656A9 (en) | 2000-01-27 |
Family
ID=22181132
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US1999/009483 WO1999057656A1 (en) | 1998-05-01 | 1999-04-30 | Method and apparatus for simultaneously accessing a plurality of dispersed databases |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU3874899A (en) |
WO (1) | WO1999057656A1 (en) |
-
1999
- 1999-04-30 WO PCT/US1999/009483 patent/WO1999057656A1/en active Application Filing
- 1999-04-30 AU AU38748/99A patent/AU3874899A/en not_active Abandoned
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5855020A (en) * | 1996-02-21 | 1998-12-29 | Infoseek Corporation | Web scan process |
US5692073A (en) * | 1996-05-03 | 1997-11-25 | Xerox Corporation | Formless forms and paper web using a reference-based mark extraction technique |
US5787470A (en) * | 1996-10-18 | 1998-07-28 | At&T Corp | Inter-cache protocol for improved WEB performance |
Cited By (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7000007B1 (en) | 2000-01-13 | 2006-02-14 | Valenti Mark E | System and method for internet broadcast searching |
WO2001052111A2 (en) * | 2000-01-13 | 2001-07-19 | Interlink Network Resources, Inc. | System and method for internet broadcast searching |
WO2001052111A3 (en) * | 2000-01-13 | 2003-12-24 | Interlink Network Resources In | System and method for internet broadcast searching |
WO2001086496A1 (en) * | 2000-05-11 | 2001-11-15 | W Start | System and method for processing data for targeted access to a server from a natural language query |
FR2808907A1 (en) * | 2000-05-11 | 2001-11-16 | Start W | Internet search engine that uses natural language to find requested data over the Internet that minimizes repeated entry of search criteria, improves the effectiveness of searching and uses natural language |
WO2002016542A1 (en) * | 2000-08-18 | 2002-02-28 | Anderson Merchandisers Lp | System and method for an interactive shopping news and price information service |
US10636058B2 (en) | 2000-08-18 | 2020-04-28 | Tensilrus Capital Nv Llc | System and method for an interactive shopping news and price information service |
WO2002016542A3 (en) * | 2000-08-18 | 2003-07-17 | Anderson Merchandisers Lp | System and method for an interactive shopping news and price information service |
US9037504B2 (en) | 2000-08-18 | 2015-05-19 | Tensilrus Capital Nv Llc | System and method for an interactive shopping news and price information service |
US8255291B1 (en) | 2000-08-18 | 2012-08-28 | Tensilrus Capital Nv Llc | System, method and apparatus for interactive and comparative shopping |
US7177818B2 (en) | 2000-08-18 | 2007-02-13 | Mark Nair | System and method for an interactive shopping news and price information service |
US6584468B1 (en) | 2000-09-29 | 2003-06-24 | Ninesigma, Inc. | Method and apparatus to retrieve information from a network |
GB2372587A (en) * | 2000-12-15 | 2002-08-28 | Hutchison Telephone Company Lt | Automatic downloading for mobile computing devices |
GB2372587B (en) * | 2000-12-15 | 2005-06-22 | Hutchison Telephone Company Lt | Automatic downloading for mobile computing devices |
DE10113902A1 (en) * | 2001-03-21 | 2002-09-26 | Matthias Jaekle | Processing program of events dates involves downloading pages from Internet, searching downloaded pages for event information, storing event information found in result table or database |
US6847974B2 (en) | 2001-03-26 | 2005-01-25 | Us Search.Com Inc | Method and apparatus for intelligent data assimilation |
WO2004102305A3 (en) * | 2003-05-16 | 2005-02-03 | Nhn Corp | A method of providing website searching service and a system thereof |
WO2004102305A2 (en) * | 2003-05-16 | 2004-11-25 | Nhn Corporation | A method of providing website searching service and a system thereof |
WO2013106423A1 (en) * | 2012-01-10 | 2013-07-18 | Google Inc. | Method and apparatus for animating transitions between search results |
Also Published As
Publication number | Publication date |
---|---|
AU3874899A (en) | 1999-11-23 |
WO1999057656A9 (en) | 2000-01-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6766315B1 (en) | Method and apparatus for simultaneously accessing a plurality of dispersed databases | |
US8429201B2 (en) | Updating a database from a browser | |
KR100799658B1 (en) | Host-based Intelligent Results Related to a Character Stream | |
US5892919A (en) | Spell checking universal resource locator (URL) by comparing the URL against a cache containing entries relating incorrect URLs submitted by users to corresponding correct URLs | |
US7949702B2 (en) | Method and apparatus for synchronizing cookies across multiple client machines | |
US7487145B1 (en) | Method and system for autocompletion using ranked results | |
US5978828A (en) | URL bookmark update notification of page content or location changes | |
US6633867B1 (en) | System and method for providing a session query within the context of a dynamic search result set | |
CN101427229B (en) | Technique for modifying presentation of information displayed to end users of a computer system | |
US8271546B2 (en) | Method and system for URL autocompletion using ranked results | |
US20020073165A1 (en) | Real-time context-sensitive customization of user-requested content | |
US20030110161A1 (en) | Method, product, and apparatus for providing search results | |
EP1208460B1 (en) | System and method of presenting channelized data | |
US6810395B1 (en) | Method and apparatus for query-specific bookmarking and data collection | |
US8020106B2 (en) | Integration of personalized portals with web content syndication | |
US7707142B1 (en) | Methods and systems for performing an offline search | |
US20080195588A1 (en) | Personalized Search Method and System for Enabling the Method | |
US20050120016A1 (en) | Searching in a computer network | |
WO2007019380A2 (en) | Enhanced favorites service for web browsers and web applications | |
WO1998045978A2 (en) | Method and apparatus for providing remote site administrators with user hits on mirrored web sites | |
JP2001503537A (en) | Identify changed data in online data repositories | |
US7120628B1 (en) | System and method for enabling a user to subscribe to updates from information sources | |
WO1999057656A1 (en) | Method and apparatus for simultaneously accessing a plurality of dispersed databases | |
US7761439B1 (en) | Systems and methods for performing a directory search | |
WO2001011443A2 (en) | Internet hosting system |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | AK | Designated states | Kind code of ref document: A1; Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW |
| | AL | Designated countries for regional patents | Kind code of ref document: A1; Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
| | AK | Designated states | Kind code of ref document: C2; Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM HU ID IL IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW |
| | AL | Designated countries for regional patents | Kind code of ref document: C2; Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| | COP | Corrected version of pamphlet | Free format text: PAGES 1/6-6/6, DRAWINGS, REPLACED BY NEW PAGES 1/7-7/7; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE |
| | DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | |
| | REG | Reference to national code | Ref country code: DE; Ref legal event code: 8642 |
| | NENP | Non-entry into the national phase | Ref country code: KR |
| | WWE | Wipo information: entry into national phase | Ref document number: 09704234; Country of ref document: US |
| | 122 | Ep: pct application non-entry in european phase | |