CN1296853C - Predictive caching and highlighting of web pages - Google Patents

Predictive caching and highlighting of web pages

Info

Publication number
CN1296853C
Authority
CN
China
Prior art keywords
user
web
interest
web document
browser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB028061012A
Other languages
Chinese (zh)
Other versions
CN1522418A (en
Inventor
Rick A. Hamilton (瑞克·A·汉密尔顿)
John S. Langford (约翰·S·兰弗得)
Steven J. Lipton (史蒂文·J.·利普顿)
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp
Publication of CN1522418A
Application granted
Publication of CN1296853C
Anticipated expiration
Expired - Fee Related

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A web browser predictively and automatically searches for web documents that are linked to a currently displayed web page and contain terms of interest to the web browser user. Linked documents containing terms of interest are automatically retrieved and stored while the user views the current document, so that if the user selects the link to a stored document, it is displayed without waiting for it to download. To further assist the user in finding the documents containing the user's interest terms, links in the current page leading to the documents of interest are highlighted, and special fast links to those pages may be created and displayed for even greater noticeability and usability.

Description

Method and system for predictive browsing of web pages
Technical field
The present invention relates to web browser and server technology, and more particularly to web browser technology that provides browsing capabilities prioritized by an individual user's interests.
Background art
The Internet and the World Wide Web have become indispensable parts of business management, personal life, and education.
The core technologies of the Internet are web browser technology and Internet server technology.
An Internet server holds "content" such as documents, image or graphics files, forms, and audio clips, all of which are available to any system that has an Internet connection and a browser.
A web browser or "client" computer can request a document from a web address, and the appropriate web server responds by sending one or more web documents, image or graphics files, forms, audio clips, and so on. The most common protocol for sending web documents and content from a server to a browser is the Hypertext Transfer Protocol ("HTTP").
Fig. 1 shows the basic client-server arrangement for communicating over the Internet and an intranet. A client browser computer (1) has Internet access (2) to the World Wide Web (3) through common means such as a dial-up telephone line and modem, a cable modem, or a local area network ("LAN"). The web browser computer (1) is also equipped with suitable web browsing software such as Netscape Navigator or Microsoft Explorer. A web server computer (5) likewise has Internet access (4) to the World Wide Web (3), through similar means or through higher-bandwidth means such as T1 and T3 data lines, and runs a web server software suite. Alternatively, the client and server may be interconnected through an intranet (6) such as a corporate LAN. These arrangements are well known in the art.
The most common type of Internet content or document is the Hypertext Markup Language ("HTML") document, but other formats, such as the Adobe Portable Document Format ("PDF"), are equally well known in the art. HTML, PDF, and other web documents provide "hyperlinks" within the document that let the user select another document or website to view. A hyperlink is a specially marked piece of text or area in a document which, when selected by the user, commands the browser software to retrieve the indicated document.
Normally, when the user selects an ordinary hyperlink, the current page displayed in the web browser's graphical user interface ("GUI") window disappears and the newly received page is displayed. If the parent page is an index, for example the IBM website www.patents.ibm.com, and the user wishes to visit each subsequent link (for example, to read a document with hints on how to use the website), the parent or index page disappears and the new page (for example, the help page) is displayed.
As the computing power of web browser computers increases and the communication bandwidth available to them grows significantly, a challenge for organizations that provide Internet websites and content is to deliver and filter that content in a way that takes these greater processing and throughput speeds into account.
This is especially true in the area of developing better, more efficient ways to use web-based information and to deliver information suited to the user to the desktop or client computer.
However, web browsers today are generally unintelligent software packages. As they currently exist, they require the user to search manually for any article or document of interest, which is often burdensome because the user frequently has to download many documents before finding one that is closely relevant.
Search engines provide a degree of "intelligence" for browsing: the user can point an unintelligent browser at a search engine address, enter some search keywords, and then review the returned documents one at a time by manually re-pointing the web browser, selecting hyperlinks or web addresses in the search results. However, search engines do not actually search the entire Internet; they search only their own index of Internet content, which the search engine operator typically builds by reviewing manual submissions from other website operators. Consequently, users often need several search engines to find information on a particular topic, because each search engine returns different results depending on its indexed content.
To partly address this problem, two other techniques well known in the art have been developed. The first is called a "metasearch engine", a search engine that searches multiple search engines. A metasearch engine does not keep its own index; instead it submits a query to several search engines simultaneously and returns to the user the highest-ranked results from each of them. Although this is more useful than manually visiting and querying each search engine in turn, the results are often less satisfactory than expected. Frequently, the few results at the top of the list of keyword matches are not the most interesting ones, so users often visit sites listed in the middle or at the end of the returned list. A metasearch engine that returns only the top 5 entries from each of 4 search engines may therefore filter out the very information that is of interest.
The second approach to this problem is the web "crawler" engine. These servers periodically contact other servers and "re-index" the content of previously indexed websites, keeping their indexes current by taking in whatever information can be obtained from a site most recently. However, because thousands of new websites come online every day, it is practically impossible for a crawler to visit all of them. Therefore, even web crawlers may not provide complete coverage of Internet content.
Various US patents have proposed other attempts, including creating "communities of intelligent agents", using server-based collaborative classification and filtering, client-side "intelligent assistants" triggered when a special tag is encountered in a web document, and automatic "bookmark" functions. In short, all of these proposed techniques and methods require some cooperation between server side and client side, which makes them difficult to deploy widely.
Several years ago, a client-side technique was introduced that downloads all web pages within one hyperlink of the page currently loaded in the browser. Because all documents directly linked from the currently viewed page have been collected, whichever document the user selects next can be obtained immediately from a cache in local storage, without waiting for the server to send the newly selected page to the browser. When the user finishes reading this next page (which is now the current page) and selects a subsequent document, that document has already been cached and can likewise be displayed without transmission delay. However, this approach has drawbacks when a "link-rich" page is visited. For example, a popular news site may have more than 60 documents linked directly from its home page. The communication network serving the web browser computer may therefore become a bottleneck or time-limiting factor, because all 60 directly linked documents must be loaded while the user reads the home page and before the user selects a hyperlink on it. As a result, only a few of the directly linked pages can be downloaded successfully in the time the reader takes to look over the home page and decide which document to view next. Unfortunately, the pages successfully downloaded while the home page is being viewed may not be of interest to the user, because the download function has no means of ranking the pages or determining which of them may or may not be of interest.
Summary of the invention
Accordingly, in a first aspect, the present invention provides a method of predictively browsing web documents of interest to a user of a web browser, the web browser having a user display, a user input device, and persistent storage, the web documents containing words and being accessible from the web browser through a link address in a current page, the method comprising the steps of: receiving a portion of a web document from a link address; determining whether the portion of the web document contains one or more predetermined words of interest to the user; and, in response to determining that one or more words of interest are present in the document portion, receiving and storing the whole of the web document; wherein the receiving, determining, and storing steps are performed while the user views the current page.
The method of the first aspect preferably further comprises repeating the steps of receiving a portion of a web document, determining whether the portion contains words of interest, and receiving and storing the whole of the web document, for a plurality of web documents accessible from a first web document within a predetermined number of link addresses.
The method of the first aspect preferably further comprises the step of highlighting, on the web browser display, a link leading to a stored web document.
The method of the first aspect preferably further comprises creating, on the browser display, a fast link leading to the stored web document.
In a second aspect, the invention provides a computer program comprising program code which, when loaded into a computer system and executed, causes the computer system to perform all the steps of the method of the first aspect.
In a third aspect, the invention provides an enhanced web browser system capable of predictively browsing web documents of interest to a user of a web browser, the web documents containing words and being accessible from the browser through a link address in a current page, the system comprising: a processor for executing program code; a user display for displaying information to the user; a user input device for receiving user input; persistent storage for storing data and information, including a user interest term list stored therein, the interest term list containing words of interest to the user; and a predictive interest-based browser executed by the processor, the browser being adapted to receive a portion of a web document from a link address, determine whether the portion of the web document contains one or more interest term words, and, in response to determining that one or more interest term words are found in the document portion, receive and store the whole of the web document; wherein the predictive interest-based browser performs the receiving, determining, and storing operations while the user views the current page.
Preferably, the predictive interest-based browser of the system of the third aspect further comprises a standard web browser having a browser plug-in, the browser plug-in being adapted to receive a portion of a web document from a link address, determine whether the portion of the web document contains one or more interest term words, and, in response to determining that one or more interest term words are found in the document portion, receive and store the whole of the web document.
Preferably, the web document comprises an HTML document.
Preferably, the system of the third aspect further comprises a link highlighter for highlighting, on the web browser display, a link leading to a stored web document.
Preferably, the system of the third aspect further comprises a fast link creator, the fast link pointing to the stored web document on the web browser display.
The invention therefore suitably and preferably enables a web browser to search predictively and automatically for web documents that are linked to the currently displayed web page and contain terms of interest to the web browser user. Linked documents containing interest terms are suitably retrieved and stored automatically while the user views the current document, so that if the user selects the link to a stored document, it can be displayed without waiting for it to download. To further help the user find documents containing the user's interest terms, links in the current page leading to documents of interest can be highlighted, and special fast links to those pages can be created and displayed for greater noticeability and ease of use.
The preferred embodiments of the invention therefore advantageously provide a web browsing method and system that can predictively retrieve information from web server computers, such as those of the World Wide Web and distributed databases, according to a list of the user's interest terms or keywords. Advantageously, the new system and method are also compatible with widely used browser technologies such as personal computers, web-enabled telephones, Internet appliances, personal digital assistants, and pocket PCs, and require little or no supporting or cooperating technology on the server side. Further advantageously, the new system and method highlight the predictively cached information, or the links leading to it, on the user's display, so that the user can view the predictively cached information conveniently and quickly.
Some preferred embodiments also preferably provide a system and method for configuring a browser to include a user interest term list. This method provides a user list of frequently searched keywords, and the list can be used by other software programs on the same client web browser computer.
Brief description of the drawings
A preferred embodiment of the invention is described below with reference to the accompanying drawings, in which:
Fig. 1 shows the well-known arrangement of an Internet client or web browser, a web server system, and a communication network;
Fig. 2 illustrates the well-known architecture of web browser and web server systems;
Fig. 3 shows a typical tree structure of hyperlinked documents on a website; and
Fig. 4 discloses the configuration of the preferred embodiment of the invention.
Detailed description of the embodiments
For the purposes of this description, it is assumed that all tasks of finding and loading web pages are performed by a web browser application such as Netscape's Navigator or Microsoft's Explorer. In practice, the embodiments described here can be implemented in software associated with the web browser, which may or may not be part of the browser itself, such as a cooperating stand-alone software application or a browser plug-in module. Those skilled in the art will therefore recognize that the creation of the interest term list as described here can be implemented by any such software, and that its results can be used by other browser-related functions and software.
Fig. 2 shows the general hardware and software architecture of typical web server and web browser computer systems. The web browser computer (20) is communicatively interconnected with the web server computer (22) through the Internet or an intranet (21). The web browser computer includes standard user interface devices (23) such as a computer display or monitor, a keyboard, and a mouse. The hardware platform of the web browser computer (20) includes a central processing unit ("CPU") (24), a disk drive (25), user interface device I/O (26), and a network interface card ("NIC") (27). The NIC may be one of several types well known in the art, including a dial-up modem, a local area network ("LAN") card, or a cable modem interface. The software executed by the web browser computer (20) may include device drivers and a basic input/output system ("BIOS") (28), an operating system (203), application programs (202), and an applet interpreter (29) with applets (201). A web browser program, such as Netscape's Navigator, is an application program executable by the CPU (24). This architecture and configuration, along with that of a web server computer, are well known in the art.
In the preferred embodiment, a standard web browser application program is modified to include certain logical and functional enhancements. These enhancements make use of existing capabilities of existing web browsers, such as:
(1) interpreting received web documents;
(2) displaying all or part of a web document in the current web browser display window;
(3) displaying user option icons, drop-down lists, or other forms of control indicators in the web browser display window;
(4) receiving the user's selection of the user option icons, drop-down lists, and other forms of control indicators displayed in the web browser display window; and
(5) creating, storing, and accessing data items such as documents, records, and cookies in system memory, in particular in persistent storage such as a hard disk drive and non-volatile RAM or ROM.
Because the general configuration and architecture of such a web browser are well known in the art, the remainder of the description of the preferred embodiment is given in terms of the steps and functions preferably implemented as a browser plug-in for Netscape's Navigator running under the Microsoft Windows [TM] operating system on an IBM-compatible computer. However, those skilled in the relevant art will recognize that, without departing from the scope of the invention, other operating systems such as UNIX, Linux, and Sun Microsystems' Solaris, other computer hardware such as IBM's RS6000, Apple's iMac (TM), personal digital assistants, and web-enabled telephones, and other software implementations such as Java scripts or compiled programs may also be employed. In still other embodiments, a servlet or program on a web server may maintain the interest term list and make it available on request to client-side programs and plug-ins.
In summary, the preferred embodiment of the invention improves on the original concept and functionality of a web browser. Preferably, the web browser determines which keywords are likely to be of interest to the web browser user. These interest terms are preferably stored in the system's persistent storage, where they can be accessed by the invention as a flat text document. Other implementations of the interest term list, such as records in a database, may also be used; all of these implementations are easily accessible to other programs, including the browser plug-in of the preferred embodiment.
Other methods or systems for building the interest term list may be used with the preferred embodiment; however, the systems and methods described above provide some useful processes for generating the interest term list.
Table 1 shows an example of an interest term list once it has been generated. The user interest term list of this example is given in comma-separated variable ("CSV") format, in which a colon ":" indicates a general category followed by a list of subcategories. If a category or term is not followed by a colon, all available subcategories and terms under that category are assumed to be of interest.
Table 1: Example user interest term list document
Politics<CR>
Sports: baseball, professional basketball, motorcycle sport<CR>
<EOF>
The user interest term list is preferably directly editable by the user, so that a user who wishes to delete an interest term added earlier can do so easily with an ordinary text editor or a database program. Likewise, a user who wishes to add an interest term later can call up the menu again or edit the document directly.
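As an illustration only, the Table 1 format could be read with a few lines of Python. The function names `parse_interest_list` and `flatten_terms` are hypothetical and do not come from the patent, and the rule that a category without a colon is searched under its own name is an assumption made for this sketch.

```python
def parse_interest_list(text):
    """Parse a Table 1 style list into {category: [subcategories]}.

    An empty subcategory list means the category has no colon, i.e. every
    subcategory under it is treated as being of interest.
    """
    interests = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        category, colon, rest = line.partition(":")
        subcategories = [s.strip() for s in rest.split(",") if s.strip()] if colon else []
        interests[category.strip()] = subcategories
    return interests


def flatten_terms(interests):
    """Return lower-case search terms (assumption: a bare category is its own term)."""
    terms = []
    for category, subcategories in interests.items():
        terms.extend(subcategories if subcategories else [category])
    return [term.lower() for term in terms]


example = "Politics\nSports: baseball, professional basketball, motorcycle sport\n"
interest_terms = flatten_terms(parse_interest_list(example))
```

Because the list is a flat text document, the same parser would work on a file the user has edited by hand.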
The preferred embodiment of the invention provides two user-selectable processes for predictively retrieving and caching information from web servers according to the user interest term list. In the first process, only the hyperlinked information specific to the "interest terms" is cached preferentially, improving on the well-known process in which the web browser caches all "1 hop" pages. The second process disclosed here highlights any hyperlink leading to information that contains the user's interest terms, drawing the user's attention to those links, for example by highlighting text or images on the web browser display, in a separate web browser window, or in a special frame within the original web browser window.
For clarity and specificity in the detailed description below, the following terms are used:
"Interest term" is self-explanatory: a word or phrase in which the end user is interested;
"N hop scan" denotes the link space within which the web browser will attempt to predictively load and examine web pages and their associated text;
"Interest links" are those hyperlinks accessible within the "N hop scan" that contain interest terms;
"Fast link" is a highly visible link, pulled out of the clutter of the general web page display, that leads directly to a discovered page containing an interest term;
"Deep linking" is a commonly accepted term for pulling web content from deep within an organization's website, or for retrieving data through a series of URLs, without loading or visiting the intermediate pages;
"Contemplation time" is defined as the time the user spends on a given web page, which is the time available to the web browser to determine and highlight any interest links of the currently loaded page; and
"TB" is the length, for example in bytes, of the text downloaded when the browser scans a web page for interest terms.
An N hop scan, as discussed above, is a predictive scan or retrieval of the documents reachable within "N" hyperlinks of a starting point. Fig. 3 shows a typical tree structure or representation of a website's content. Each web page has some pages hyperlinked from it, and these hyperlinks are shown as arrows pointing from one page to another page or pages. The variable "N" represents the depth or space of the information search relative to the starting point.
For example, a 1 hop scan (N=1) (51) retrieves all documents reachable by a single "click" or hyperlink from the current page (50), in this example pages 2, 3, and 4, and scans the content of those documents for occurrences of the user's interest terms.
Likewise, a 2 hop scan (N=2) (52) retrieves all documents reachable from the current page by two "clicks", in this example all pages of the 1 hop scan plus pages 2a, 3a, 3b, 4a, and 4b.
As the tree-like expansion of this diagram shows, the amount of data to be considered grows exponentially with the value of N. Higher-order scans become more practical as greater computer network communication bandwidth and faster web browser computer processors become available.
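The hop levels described above can be sketched as a breadth-first walk over a link graph. The following Python is a minimal illustration, not the patented implementation: the function name `pages_within_hops` is hypothetical, the current page is labelled "1" here for convenience, and the exact parent of each second-level page (2a under 2, 3a and 3b under 3, 4a and 4b under 4) is assumed from the page names in the Fig. 3 example.

```python
def pages_within_hops(links, start, max_hops):
    """Return {page: hop_level} for every page reachable within max_hops of start."""
    hop_of = {start: 0}
    frontier = [start]
    for hop in range(1, max_hops + 1):
        next_frontier = []
        for page in frontier:
            for target in links.get(page, []):
                if target not in hop_of:  # record each page at its shallowest hop level
                    hop_of[target] = hop
                    next_frontier.append(target)
        frontier = next_frontier
    del hop_of[start]  # the starting page itself is not part of the scan space
    return hop_of


# Assumed link structure for the Fig. 3 example.
FIG3_LINKS = {
    "1": ["2", "3", "4"],
    "2": ["2a"],
    "3": ["3a", "3b"],
    "4": ["4a", "4b"],
}

one_hop = pages_within_hops(FIG3_LINKS, "1", 1)
two_hop = pages_within_hops(FIG3_LINKS, "1", 2)
```

With this structure a 1 hop scan covers three pages and a 2 hop scan covers eight, which shows how quickly the scan space grows with N.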
Fig. 4 shows the implementation structure of the preferred embodiment. An interest term predictive scanner plug-in (43) operates in the environment of the web browser program (40) on the web browser computer (20) and uses the user I/O (23) of the web browser computer to present highlighted links, fast links, and the resulting displays to the user, as described below. The user interest term list (42), for example in a simple text document or in database records, is accessed from its medium (41), being stored on a hard disk drive or in persistent storage of the web browser computer (20). Alternatively, the user interest term list (42) may be accessed from a web or network server accessible to the web browser (20).
The interest term predictive scanner plug-in (43) also uses the communication capabilities of the web browser computer (20) (such as its network interface card and communication protocols such as TCP/IP) and the communication and display capabilities of the web browser program (40) (such as HTTP) to selectively retrieve portions of web documents from the Internet (3) or other computer networks.
The preferred embodiment of the invention operates during the contemplation time of a current web page, predictively retrieving hyperlinked documents within the N hop scan space according to the user's interest terms. It is assumed that the keywords of interest can be stored in the web browser and/or associated software. Knowledge of these interest terms is then exploited through predictive "read-ahead" downloading.
The predictive caching process begins as soon as the web browser loads a web page following the user's selection of a page or another page-selecting action (such as choosing a bookmark or using a navigation key). The page currently loaded and viewed is set as the starting point of the N hop scan, that is, the "current page".
The preferred embodiment then analyzes the source of the current page, such as its HTML, and begins downloading all pages directly linked from the current page, referred to as 1 hop pages. The download of each page is suspended after a predetermined amount of data has been received successfully (for example, the number of bytes or kilobytes given by TB).
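Analyzing the page source to find the 1 hop links could look like the following sketch, which uses Python's standard `html.parser` and `urllib.parse` modules. The class and function names (`LinkCollector`, `extract_links`) and the example URL are hypothetical; the patent does not prescribe any particular parser.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect the absolute target of every <a href="..."> in a page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                absolute = urljoin(self.base_url, value)  # resolve relative links
                if absolute not in self.links:            # list each target once
                    self.links.append(absolute)


def extract_links(html, base_url):
    """Return the 1 hop link addresses found in the HTML source of a page."""
    collector = LinkCollector(base_url)
    collector.feed(html)
    return collector.links


page_source = '<a href="/help.html">Help</a> <a href="news.html">News</a>'
one_hop_links = extract_links(page_source, "http://example.com/index.html")
```

Only anchor tags are considered here; a fuller implementation would presumably also handle frames, image maps, and other link-bearing elements.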
Next, the downloaded portion of each page is scanned to determine whether it contains any of the user's interest terms. If, after the predetermined number of bytes has been downloaded, no user interest term is found in the plain text or meta words of the page, the download is stopped. Because the download of the whole page is stopped, the browser saves network bandwidth and time, and the resources saved can be used to scan the next potentially interesting page. If an interest term is found, the download is resumed and completed, and the whole linked page is stored in the cache.
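The suspend-scan-resume step can be illustrated with any readable byte stream. In the sketch below, `scan_then_complete` is a hypothetical name, `tb` stands for the TB threshold defined earlier, and an in-memory `io.BytesIO` object stands in for a network connection; these are assumptions for illustration, and a simple substring match is used where a real scanner might strip markup first.

```python
import io


def scan_then_complete(stream, interest_terms, tb):
    """Read the first tb bytes; finish the download only if an interest term appears.

    Returns the whole document as bytes, or None when the portion is discarded.
    """
    portion = stream.read(tb)  # download is suspended after tb bytes
    text = portion.decode("utf-8", errors="ignore").lower()
    if any(term.lower() in text for term in interest_terms):
        return portion + stream.read()  # resume and complete the download
    return None  # no interest term found: stop and discard the portion


# Stand-in "documents" (assumed data for illustration only).
sports_page = io.BytesIO(b"<html><title>Baseball scores</title>" + b"x" * 1000)
weather_page = io.BytesIO(b"<html><title>Weather report</title>" + b"y" * 1000)

cached = scan_then_complete(sports_page, ["baseball"], tb=64)
skipped = scan_then_complete(weather_page, ["baseball"], tb=64)
```

One limitation of this simple form is that a term which first appears after the TB boundary, or straddles it, goes undetected; choosing TB is therefore a trade-off between bandwidth saved and pages missed.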
Browser is checked next 1 redirect webpage when the user continues to watch attentively current webpage of packing into, is again next then, all obtains scanning and high-speed cache up to 1 time all as required redirect webpages.
If 1 time all redirect webpages all obtains scanning before the user finishes to inspect current page, just increase time redirect level (hop level), by downloading each 2 redirect is that the webpage of 3 redirects etc. scans the web site contents of depth layer in succession then, search key, if it is find items of interest, as explained above such with regard to the whole webpage of high-speed cache.
This predictive scanning process can be described by the pseudo-code of Table 2.
Table 2: Pseudo-code of the predictive scanning process
UNTIL (user selects a link in current_page):
  FOR hop = 1 to N:
    Scan_page = current_page
    catalog all referenced_links from current_page
    randomly order from first to last all referenced_links
    FROM first TO last referenced_link:
      download document portion at referenced_link
      scan portion for occurrences of interest terms
      IF occurrences found, THEN:
        complete download of document
        store document in cache
        highlight referenced_link
        create "quick link" to cached document (optional)
      ELSE discard portion of document
    NEXT referenced_link  /* search the next linked document in this hop */
  NEXT hop  /* search the next set of documents, one more hop from current page */
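The pseudo-code of Table 2 can be turned into a small runnable sketch. The version below is an interpretation made under several assumptions, not the patented implementation: the network is replaced by a hypothetical in-memory `site` dictionary, the threshold is counted in characters rather than bytes, terms are matched as substrings, and at each hop the links are gathered from the pages reached at the previous hop.

```python
import random


def predictive_scan(site, current_page, interest_terms, max_hops,
                    threshold_chars, user_has_clicked=lambda: False):
    """Breadth-first scan up to max_hops; returns (cache, highlighted_links).

    `site` is a hypothetical stand-in for the network: it maps a URL to a
    (document_text, outgoing_links) pair.
    """
    cache = {}
    highlighted = []
    seen = {current_page}
    frontier = [current_page]  # pages whose links are scanned at this hop
    for hop in range(1, max_hops + 1):
        links = []
        for page in frontier:
            for link in site[page][1]:
                if link in site and link not in seen:
                    seen.add(link)
                    links.append(link)
        random.shuffle(links)  # "randomly order ... all referenced_links"
        for link in links:
            if user_has_clicked():  # UNTIL the user selects a link
                return cache, highlighted
            text = site[link][0]
            portion = text[:threshold_chars].lower()  # partial download
            if any(term.lower() in portion for term in interest_terms):
                cache[link] = text  # complete the download, store in cache
                highlighted.append((link, hop))
            # otherwise the portion is simply discarded
        frontier = links
    return cache, highlighted
```

Calling `predictive_scan(site, "A", ["sailing"], 2, 50)` on a small dictionary of pages returns the cached documents together with the hop level at which each was found, which a display layer could use to choose a highlighting style.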
When some 1-hop pages are found to contain the user's interest terms, they are brought to the user's attention by any of several methods. First, the hyperlinks leading to pages that contain interest terms can be highlighted in the display of the current page, for example by changing the color, font, or size in which these hyperlinks are displayed. In an enhanced embodiment of the present invention, a "quick link" can be created in a separate frame at the side, top, or bottom of the current page within the current window, or in a separate web browser window.
This gives the user an improved web browser display that highlights the links most likely to lead to documents of interest, based on the user's list of interest terms, so that the user can browse the current website more efficiently.
It should be noted that, if a quick-link display is used, a multi-hop link of interest can be presented one hop at a time: the display shows the next step on the path to the link of interest, then the step after that, and so on. Alternatively, the multi-hop link of interest can be a "deep link". In that case, the first link shown in the frame or window links directly to the item of interest, even though the item would otherwise be reachable only through several hops, and the corresponding top-level link can be shown highlighted. In a further refinement of the present invention, 1-hop links leading to documents of interest can be highlighted with one highlighting method, and multi-hop links leading to documents of interest with another. For example, 1-hop links of interest can be set to blinking red text, while multi-hop links leading to documents of interest are shown in steady red text. The HTML code for setting color, font, and blink attributes is well known, so the browser plug-in of the preferred embodiment need only change these attributes in the relevant part of the browser's display of the current page.
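As an illustration of the two highlighting styles, the sketch below tags matching anchors in the page source with one of two class names. The class names `interest-1hop` and `interest-multihop`, the function name, and the regular-expression approach are all assumptions made for this example; the actual colors or blink effect would be defined elsewhere, for instance in a style sheet.

```python
import re

# Matches an opening <a ...> tag with a double-quoted href attribute.
ANCHOR = re.compile(r'<a\b([^>]*?)\bhref="([^"]*)"([^>]*)>', re.IGNORECASE)


def highlight_links(page_source, hops_by_url):
    """Add a class to each link of interest, by hop distance.

    `hops_by_url` maps a link address on the current page to the number of
    hops needed to reach the document of interest through that link.
    """
    def restyle(match):
        before, url, after = match.groups()
        hops = hops_by_url.get(url)
        if hops is None:
            return match.group(0)  # not a link of interest: leave untouched
        css_class = "interest-1hop" if hops == 1 else "interest-multihop"
        return f'<a{before}href="{url}"{after} class="{css_class}">'

    return ANCHOR.sub(restyle, page_source)
```

This simple pattern does not merge with an existing `class` attribute or handle single-quoted attributes; a production plug-in would more likely modify the browser's document tree directly.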
Note also that the preferred embodiment of the present invention performs a "breadth-first search" rather than drilling "N hops" deep from a given starting point. Alternatively, a "depth-first search" can be performed, although in the inventors' view such a search is less practical and less efficient, because links that fall outside the initially downloaded document portions may be missed or skipped. Any search technique can be used; the principles disclosed here apply generally.
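The difference between the two search orders can be seen in a small sketch. The link graph and function names below are hypothetical; each function lists the pages in the order they would be scanned, with the hop level at which each is reached.

```python
from collections import deque


def breadth_first_order(links, start, max_hops):
    """Visit every 1-hop page before any 2-hop page, and so on."""
    order, seen = [], {start}
    queue = deque([(start, 0)])
    while queue:
        page, hop = queue.popleft()
        if hop == max_hops:
            continue
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                order.append((target, hop + 1))
                queue.append((target, hop + 1))
    return order


def depth_first_order(links, start, max_hops):
    """Follow each link as deep as allowed before trying its siblings."""
    order, seen = [], {start}

    def visit(page, hop):
        if hop == max_hops:
            return
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                order.append((target, hop + 1))
                visit(target, hop + 1)

    visit(start, 0)
    return order


links = {"A": ["B", "C"], "B": ["D"], "C": ["E"]}
print(breadth_first_order(links, "A", 2))  # [('B', 1), ('C', 1), ('D', 2), ('E', 2)]
print(depth_first_order(links, "A", 2))    # [('B', 1), ('D', 2), ('C', 1), ('E', 2)]
```

With breadth-first ordering, both pages the user can reach with a single click are scanned first, which matters when the user may leave the current page at any moment.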
It should also be understood that, if desired, a common list of "quick links" to pages of interest can be maintained in a separate window or frame, even as the user proceeds down a particular path. For example, consider a user at page "A" who is shown a list of the links of interest "B" and "C". The user can readily follow link of interest "B" while a quick link to page "C" remains in the separate frame or window. After reading "B", where further links of interest may have turned up, keeping "C" in the quick-link window lets the user jump back immediately to this other path that was not taken before.
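The persistent quick-link list in the "A", "B", "C" example could be modelled roughly as below. The class and method names are hypothetical, and removing a link once the user follows it is an assumption of this sketch rather than something the text specifies.

```python
class QuickLinks:
    """Keeps links of interest available while the user navigates."""

    def __init__(self):
        self._links = []

    def add(self, url):
        """Record a newly found link of interest, ignoring duplicates."""
        if url not in self._links:
            self._links.append(url)

    def follow(self, url):
        """The user follows one quick link; the others stay in the list."""
        self._links.remove(url)
        return url

    def pending(self):
        """Return the quick links the user has not yet visited."""
        return list(self._links)


quick = QuickLinks()
quick.add("B")
quick.add("C")        # found while the user was viewing page "A"
quick.follow("B")     # the user goes to "B"; "C" remains available
quick.add("D")        # a further link of interest found from "B"
print(quick.pending())  # ['C', 'D']
```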
Incorporating the preferred embodiment of the present invention into a web browsing system or product yields an intelligent means of browsing the vast content of the World Wide Web and its websites according to the user's interests.
Although specific examples and a description relating to a preferred embodiment have been given above, those skilled in the art will recognize that various substitutions and engineering choices can be made without departing from the scope of the present invention. These include, but are not limited to: implementing the method as an application program, a portable-language script, a server-side program or script, or a browser enhancement; using a different web browser computing platform, such as a web-enabled telephone, an Internet appliance, or a personal digital assistant; and using other operating systems, such as Windows [TM] CE.

Claims (13)

1. A method for predictively browsing to web documents of interest to a user of a web browser, the web browser having a user display, a user input device, and persistent storage, the web documents containing words and being accessible from the web browser through a link address in a current page, the method comprising the steps of:
receiving a portion of a web document from a link address;
determining whether the portion of the web document contains one or more predetermined words of interest to the user; and
in response to determining that one or more words of interest are present in the document portion, receiving and storing the entirety of the web document; wherein
the receiving, determining, and storing steps are performed while the user is viewing the current page.
2. The method as set forth in claim 1, further comprising repeating the steps of receiving a portion of a web document, determining whether the portion contains words of interest, and receiving and storing the entirety of the web document, for a plurality of web documents accessible from a first web document within a predetermined number of link addresses.
3. The method as set forth in claim 1, further comprising the step of highlighting, on the web browser display, a link leading to a web document.
4. The method as set forth in claim 1, further comprising creating, on the web browser display, a quick link to the stored web document.
5. An enhanced web browser system capable of predictively browsing to web documents of interest to a user of a web browser, the web documents containing words and being accessible from the browser through a link address in a current page, the system comprising:
a processor for executing program code;
a user display for displaying information to the user;
a user input device for receiving user input;
persistent storage for storing data and information, including a user interest list stored therein, the interest list containing words of interest to the user; and
a predictive interest-based browser executed by the processor, the browser being adapted to receive a portion of a web document from a link address, to determine whether the portion of the web document contains one or more interest-list words, and, in response to determining that one or more interest-list words are found in the document portion, to receive and store the entirety of the web document; wherein
the predictive interest-based browser performs the receiving, determining, and storing operations while the user is viewing the current page.
6. The system as set forth in claim 5, wherein the predictive interest-based browser comprises a standard web browser with a browser plug-in, the browser plug-in being adapted to receive a portion of a web document from a link address, to determine whether the portion of the web document contains one or more interest-list words, and, in response to determining that one or more interest-list words are found in the document portion, to receive and store the entirety of the web document.
7. The system as set forth in claim 5 or 6, wherein the web documents comprise HTML documents.
8. The system as set forth in claim 5 or 6, further comprising a link highlighter that highlights, on the web browser display, a link leading to a stored web document.
9. The system as set forth in claim 7, further comprising a link highlighter that highlights, on the web browser display, a link leading to a stored web document.
10. The system as set forth in claim 5 or 6, further comprising a quick link creator, the quick link pointing, on the web browser display, to the stored web document.
11. The system as set forth in claim 7, further comprising a quick link creator, the quick link pointing, on the web browser display, to the stored web document.
12. The system as set forth in claim 8, further comprising a quick link creator, the quick link pointing, on the web browser display, to the stored web document.
13. The system as set forth in claim 9, further comprising a quick link creator, the quick link pointing, on the web browser display, to the stored web document.
CNB028061012A 2001-03-08 2002-03-06 Predictive caching and highlighting of web pages Expired - Fee Related CN1296853C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/801,590 2001-03-08
US09/801,590 US6874019B2 (en) 2001-03-08 2001-03-08 Predictive caching and highlighting of web pages

Publications (2)

Publication Number Publication Date
CN1522418A CN1522418A (en) 2004-08-18
CN1296853C true CN1296853C (en) 2007-01-24

Family

ID=25181534

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB028061012A Expired - Fee Related CN1296853C (en) 2001-03-08 2002-03-06 Predictive caching and highlighting of web pages

Country Status (9)

Country Link
US (1) US6874019B2 (en)
EP (1) EP1368752A2 (en)
JP (1) JP2004531797A (en)
KR (1) KR100583874B1 (en)
CN (1) CN1296853C (en)
CA (1) CA2437933A1 (en)
IL (1) IL157679A0 (en)
TW (1) TW552521B (en)
WO (1) WO2002073460A2 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000215138A (en) * 1999-01-22 2000-08-04 Casio Comput Co Ltd Information searching device and storage medium which stores program
US6182133B1 (en) * 1998-02-06 2001-01-30 Microsoft Corporation Method and apparatus for display of information prefetching and cache status having variable visual indication based on a period of time since prefetching

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6131085A (en) * 1993-05-21 2000-10-10 Rossides; Michael T Answer collection and retrieval system governed by a pay-off meter
US5867799A (en) * 1996-04-04 1999-02-02 Lang; Andrew K. Information system and method for filtering a massive flow of information entities to meet user information classification needs
JPH1063679A (en) 1996-08-23 1998-03-06 Nippon Telegr & Teleph Corp <Ntt> Information presentation device
JPH10207901A (en) 1997-01-22 1998-08-07 Nippon Telegr & Teleph Corp <Ntt> Method and system for providing information
KR100571059B1 (en) * 1997-08-06 2006-04-14 태크욘 인코포레이티드 Distributed Systems and Methods for Prefetching
US5848410A (en) * 1997-10-08 1998-12-08 Hewlett Packard Company System and method for selective and continuous index generation
US6009410A (en) * 1997-10-16 1999-12-28 At&T Corporation Method and system for presenting customized advertising to a user on the world wide web
US6009429A (en) * 1997-11-13 1999-12-28 International Business Machines Corporation HTML guided web tour
US6078928A (en) * 1997-12-12 2000-06-20 Missouri Botanical Garden Site-specific interest profiling system
US6094649A (en) * 1997-12-22 2000-07-25 Partnet, Inc. Keyword searches of structured databases
US6085226A (en) * 1998-01-15 2000-07-04 Microsoft Corporation Method and apparatus for utility-directed prefetching of web pages into local cache using continual computation and user models
US6088731A (en) * 1998-04-24 2000-07-11 Associative Computing, Inc. Intelligent assistant for use with a local computer and with the internet
US6151630A (en) * 1998-05-15 2000-11-21 Avaya Technology Corp. Non-redundant browsing of a sequencing of web pages
US20010051927A1 (en) * 2000-06-08 2001-12-13 Blinkspeed, Inc. Increasing web page browsing efficiency by periodically physically distributing memory media on which web page data are cached
JP2002259544A (en) * 2001-03-02 2002-09-13 Willone Corp System of electronic exhibition

Also Published As

Publication number Publication date
CN1522418A (en) 2004-08-18
KR100583874B1 (en) 2006-05-26
EP1368752A2 (en) 2003-12-10
WO2002073460A2 (en) 2002-09-19
JP2004531797A (en) 2004-10-14
US6874019B2 (en) 2005-03-29
KR20030082607A (en) 2003-10-22
IL157679A0 (en) 2004-03-28
CA2437933A1 (en) 2002-09-19
WO2002073460A3 (en) 2003-09-18
US20020165925A1 (en) 2002-11-07
TW552521B (en) 2003-09-11

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20070124