CN1296853C - Predictive browsing method and system for web pages - Google Patents

Predictive browsing method and system for web pages Download PDF

Info

Publication number
CN1296853C
CN1296853C CNB028061012A CN02806101A CN1296853C CN 1296853 C CN1296853 C CN 1296853C CN B028061012 A CNB028061012 A CN B028061012A CN 02806101 A CN02806101 A CN 02806101A CN 1296853 C CN1296853 C CN 1296853C
Authority
CN
China
Prior art keywords
web
document
interest
system
user
Prior art date
Application number
CNB028061012A
Other languages
Chinese (zh)
Other versions
CN1522418A (en
Inventor
瑞克·A·汉密尔顿
约翰·S·兰弗得
史蒂文·J.·利普顿
Original Assignee
国际商业机器公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to US09/801,590 priority Critical patent/US6874019B2/en
Application filed by 国际商业机器公司 filed Critical 国际商业机器公司
Publication of CN1522418A publication Critical patent/CN1522418A/en
Application granted granted Critical
Publication of CN1296853C publication Critical patent/CN1296853C/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching

Abstract

本发明所提出的web浏览器预测性自动搜索与当前所显示的网页链接的含有web浏览器用户感兴趣的项目的web文档。 web documents containing items of interest to the user web browser provided by the present invention predictability web browser automatically searches and web links currently displayed. 所链接的含有感兴趣的项目的文档在用户查看当前文档的同时自动检索出来并予以存储,因此如果用户选择了通向所存储的文档的链接,就可以显示这个文档而不用再等待它下载。 Documents containing items of interest linked automatically retrieves the user to view the current document at the same time and be out of storage, so if the user selects a link leading to the stored documents, you can display the document without having to wait for it to download. 为了进一步帮助用户查找含有用户感兴趣的项目的文档,可以突出显示当前页内通向感兴趣的文档的链接,还可以创建和显示通向这些网页的专用快速链接,以便更好地提醒和方便用户使用。 To further help users find documents containing items of interest to the user, can highlight the link leads to a document of interest within the current page, you can also create and display special quick links leading to these pages, for better and convenient reminder users.

Description

网页的预测性浏览的方法和系统 Prediction of web browsing and systems

技术领域 FIELD

本发明与web浏览器和服务器技术有关,具体地说与提供优先考虑用户个人兴趣的浏览能力的web浏览技术有关。 The present invention relates to web browser and web server technologies related specifically to provide priority to users' personal interests of the ability to browse browsing technologies.

背景技术 Background technique

因特网和万维网已经成为商业经营、个人生活和教育过程不可或缺的组成部分。 Internet and World Wide Web has become a commercial business, personal life and an integral part of the educational process.

因特网技术的中心是web浏览器技术和因特网服务器技术。 Internet technology center is a web browser technology and the Internet server technology.

因特网服务器存有诸如文档、图像或图形文件、表格、音频剪辑之类的“内容”,这些内容都是具有因特网连接的系统和浏览器可得到的。 Internet server there, such as "Content" category of documents, images or graphics files, tables, audio clips, the content of the system, both with an Internet connection and a browser available.

web浏览器或“客户机”计算机可以向web地址请求文档,适当的web服务器对此作出响应,发送一个或多个web文档、图像或图形文件、表格、音频剪辑等。 web browser or "client" computers can request documents to web address, appropriate web server response to this, send one or more web documents, images or graphics files, tables, audio clips and so on. 从服务器向浏览器发送web文档和内容的最普通的协议是超文本传输协议(“HTTP”)。 The most common protocol for sending documents and web content from a server to browser is hypertext transfer protocol ( "HTTP").

图1示出了因特网和内部网通信的基本客户机-服务器配置情况。 FIG 1 shows a basic clients of the Internet and intranets - server configuration. 客户机浏览器计算机(1)配有通过诸如拨号电话线和调制解调器、电缆调制解调器或局域网(“LAN”)之类的普通装置到达万维网(3)的因特网接入(2)。 Browser client computer (1) with arrival web (3) by an ordinary means such as a dial-up telephone line and a modem, cable modem or a local area network ( "LAN") such Internet access (2). web浏览器计算机(1)还配有适当的诸如Netscape的Navigator或Microsoft的Explorer之类的web浏览软件。 web browser on the computer (1) web browsing software comes with a Navigator or Microsoft's Explorer appropriate, such as Netscape and the like. web服务器计算机(5)同样配有用类似的装置或者诸如T1和T3数据线之类的高带宽装置和web服务器软件套件到达万维网(3)的因特网接入(4)。 a web server computer (5) with the same or a similar means useful for high bandwidth data devices T1 and T3 lines, and the like, such as a web server software package reaches the web (3) Internet Access (4). 或者,也可以是客户机和服务器通过一个诸如社团LAN之类的内部网(6)互连。 Alternatively, a client and a server via a LAN such as an intranet or the like associations interconnect (6). 这些配置在这个技术领域内是众所周知的。 These configurations in this technical area is well known.

最普通的因特网内容或文档类型是超文本标注语言(“HTML”)文档,但是其他格式在这个技术领域内同样是众所周知的,诸如Adobe可移植文档格式(“PDF”)之类。 The most common types of Internet content or document is a hypertext markup language ( "HTML") documents, but other formats in this technical area is also well known, such as Adobe Portable Document Format ( "PDF") and the like. HTML、PDF和其他web文档在文档内提供了使用户可以选择另一个文档或者网站进行查看的“超链接”(hyperlink)。 HTML, PDF and other web document provides the user can select another document or Web site to view the "Hyperlink" (hyperlink) within the document. 超链接是文档内专门标明的文字或区域,在被用户选中时命令浏览器软件检索或取得所指出的文档。 Hyperlinks are specially highlighted text within a document or region, the command browser software to retrieve when selected by the user or obtain documentation, as indicated.

通常,在用户选中一个普通超链接时,在web浏览器的图形用户界面(“GUI”)视窗内显示的当前页消失,而显示最新接收到的页面。 In general, when the user selects a normal hyperlink, the current page displayed in the web browser-based graphical user interface ( "GUI") window disappears and display the latest received page. 如果母页是一个索引,例如IBM网站www.patents.ibm.com,而用户希望访问每个后继的链接(例如读取具有有关如何使用该站点的提示的文档),于是母页或索引页就消失,而显示这个新页(例如帮助页)。 If the mother is an index page, for example, the IBM Web site www.patents.ibm.com, and the user wants to access each subsequent link (such as reading a document with tips on how to use the site's), so the parent or index pages on disappeared, and display the new page (eg help pages).

随着web浏览器计算机的计算能力的提高和web浏览器计算机的通信带宽显著增大,对于提供因特网网站和内容的机构的一个难题是考虑到这些较大的处理和吞吐速度来传送和过滤这种内容。 With the communication bandwidth computing power web browser on your computer and web browser on your computer increases significantly, a problem for institutions to provide Internet sites and content that take into account these larger throughput and processing speed transmission and filtering it kinds of content.

在基于web应用的领域内和在开发更好、更有效的方式将适合用户的信息传送给桌面或客户机方面尤为如此。 In the field of web-based applications and the development of better, more effective ways to convey information to suit the user's desktop or client applies particularly to the case.

然而,现在的一些web浏览器通常是非智能的软件包。 However, some of today's web browsers often non-intelligent package. 作为当前存在的web浏览器,它们普遍要求用户手动搜索他们感兴趣的任何文章或文档,这通常是很累赘的,因为他们经常需要下载许多文档后才能找到一个有密切关系的文档。 As there is currently a web browser, they generally require the user to manually search for any articles or documents they are interested in, which is usually very cumbersome, because they often need to download the document to find a close relationship after many documents.

搜索引擎为浏览提供了某种程度的“智能”,用户可以将他的非智能浏览器指向一个搜索引擎地址,输入一些检索的关键字,通过选择搜索结果中的超链接或者通过手动地再将web浏览器指向所提供的web地址,一次一个地检阅每个返回的文档。 Search engine for the browser provides a certain degree of "intelligence", the user can place his non-smart browser to address a search engine, enter some search keywords, by selecting the hyperlink in the search results or by manually again web browser to the web address provided, one at a time to review each returned document. 然而,搜索引擎并没有实际对整个因特网进行搜索,它们只是对搜索引擎运营方通常通过一个检阅其他网站运营方的手动提交的过程所建立的它们自己的因特网内容索引进行搜索。 However, the search engines do not actually search the entire Internet, they are just for the search engine operator is usually through a review of their own Internet content indexing process other website operators manually submitted by the established search. 因此,用户经常需用若干个搜索引擎寻找特定主题的信息,因为每个搜索引擎将根据它们自己的索引内容返回不同的结果。 Therefore, users often need to use several search engines to find information on a particular topic, because each search engine will return different results depending on their own index content.

为了部分地解决这个问题,已经开发了在这个技术领域内众所周知的另外两种技术。 In part to address this problem have been developed well-known in this technical field the other two techniques. 第一种技术称为“终极搜索引擎(metasearchengine)”,是多个搜索引擎中的一个引擎。 The first technique is called "the ultimate search engine (metasearchengine)", it is a multiple search engine engines. 终极搜索引擎并不持有它自己的索引,而是将查询同时提交给多个搜索引擎,并将这些搜索引擎中每个搜索引擎返回的排在最高的返回给用户。 The ultimate search engine does not hold its own index, but to submit queries to multiple search engines and these search engines for each search engine returns the highest row returned to the user. 虽然这比手动地逐个访问每个所查询的搜索引擎有用,但结果通常不如所预期的那样满意。 While each of these search engine queries more useful than manually, one by one visit, but the results are usually not as satisfactory as expected. 通常,在所列与搜索关键字匹配的清单上顶部的少数返回项不是最感兴趣的,因此用户时常访问列在返回清单中间或末端的站点。 Usually, listed on the list and search keyword matching the return of a small number of top items is not the most interesting, so users often access the column in the middle or at the end return to the list of sites. 终极搜索引擎虽然可以返回来自4个搜索引擎的顶部5个条目,但可能滤除了很有可能是感兴趣的信息。 While the ultimate search engine can return to the top five entries from four search engines, but may filter out the information likely to be of interest.

解决这个问题的第二条途径称为web“履带式浏览器”(crawler)引擎。 The second way to solve this problem is called the web "crawler Browser" (crawler) engine. 这些服务器周期性地与其他服务器接触,“重新索引”以前所索引的网站内容,这使它们保持是较新的,将任何最近可从一个网站得到的信息收入它们的索引。 These servers periodically contact with other servers, "re-index" previously indexed site content, which makes them keep the newer, any recent income information available from a site they index. 然而,由于每天有数以千计新网站挂到线上,因此一个履带式浏览器要访问这些新的站点实际上是不可能的。 However, since there are thousands of new sites linked to online every day, so a crawler browser to access the new site is virtually impossible. 因此,甚至web履带式浏览器也可能提供不了因特网内容的完全覆盖。 Therefore, even web crawler browsers may not provide complete coverage of Internet content.

在各个US专利中已经提出其他一些尝试,包括创建一个“一些智能代理的共同体”,利用基于服务器的交互分类和过滤,在一个web文档内遇到专用标志时触发的客户机侧的“智能助理”,以及自动“书签”功能。 In each of US patent number of other attempts have been made, including the creation of a "community of some intelligent agent", using "intelligent assistant client-side interaction based on classification and filtering server, triggered when encountered special marks in a web document "and automatic" bookmark "feature. 概括地说,所提出的这些技术和方法都需要一定的服务器侧和客户机侧的配合,这使得这些技术很难大规模推广应用。 In a nutshell, these technologies and the proposed method requires some coordination server side and client side, which makes it difficult to large-scale application of these technologies.

几年前,引入了一种客户机侧技术,下载浏览器当前装入的网页的一个超链接内所有的网页。 A few years ago, we introduced a client-side technology, download all the pages within a hyperlink browser currently loaded web page. 由于收集了从当前所访问的网页直接链接的所有文档,因此用户接着选择无论哪个文档都立即可从本机存储器内的高速缓存器得到,从而不需要等待服务器将最新选中的网页发送给浏览器。 Since the collection of all documents from the website currently visiting a direct link, so user can then select whichever documents are available from the cache memory in the machine immediately, without waiting for the server to send the latest selected web browser . 等用户结束了读这个下一页(现在它是当前页)而选择了一个后续文档时,这个后续文档已经被高速缓存,因此也可以显示而没有传输延迟。 Etc. When the user finishes reading the next page (now it is the current page) and select a follow-up document, the follow-up document has been cached, it can also display and no transmission delay. 然而,这种处理方式在访问一个“链接丰富的”网页时具有一些缺点。 However, this approach has some drawbacks when accessing a "rich links" page. 例如,一个受欢迎的新闻站点的一个网页可能有超过60个从新业务的主页直接链接的文档。 For example, a page of a popular news site may have a document over 60 new business home page direct link. 因此,为web浏览器计算机服务的通信网络对于在用户阅读主页时并要在用户选择主页上的一个超链接前装入所有的60个直接链接文档可能呈现为一个瓶颈或时间限制因素。 Therefore, the communication network is a web browser when the computer service for the user to read the front page and to select a hyperlink on the home page to load all of the 60 direct links in the document may appear as a limiting factor or bottleneck time user. 因而,这些直接链接的网页中只有少数的网页在读者查找主页和决定查看下一个文档所用的时间内可以成功地下载。 Therefore, these direct links to web pages in only a few pages to find a home and decided to see the next time you can successfully download the document was in the audience. 不幸的是,这些在检阅主页期间成功下载的网页可能是用户并不感兴趣的,因为下载功能没有对网页进行分类或确定哪个网页可能是或可能不是感兴趣的措施。 Unfortunately, during the review of these home pages successfully downloaded the user may not be interested, because there is no download webpages are categorized to determine which page or measures may or may not be of interest.

发明内容 SUMMARY

因此本发明在第一方面提供了一种预测性地浏览一个web浏览器系统的用户感兴趣的web文档的方法,所述web浏览器系统具有一个用户显示器、一个用户输入装置和一个持久性存储装置,所述web文档含有一些词并且可通过当前页面中的一个链接地址从所述web浏览器系统访问,所述方法包括下列步骤:从一个链接地址接收一个web文档的一部分;确定一个web文档的所述部分是否含有所述用户感兴趣的一个或多个预定词;以及对确定在所述文档部分存在感兴趣的一个或多个词作出响应,接收和存储所述web文档的整体;其中所述接收、确定和存储步骤在用户查看所述当前页面的同时执行。 Thus in a first aspect the present invention provides a method of predicting web browsing a web document of interest to a user browser system, said system having a web browser user display, a user input device and a persistent store means the web document contains a number of words and is accessible from the web browser system via a link in the current page address, said method comprising the steps of: receiving a link address from a portion of the web document; determining a web document whether the portion contains one or more predetermined words interest to the user; and in response to determining one or more words of interest is present in the document section, receiving and storing the entire web document; wherein said receiving, determining and storing steps in view of the current page a user simultaneously.

本发明第一方面的这种方法最好还包括对于从一个第一web文档起、在预定数量链接地址内可访问的多个web文档,重复所述接收一个web文档的一部分、确定所述部分是否含有感兴趣的词以及接收和存储一个web文档的整体的步骤。 This method of the first aspect of the present invention preferably further comprises a first web to from the document, a predetermined number of links within a plurality of web addresses accessible documents, repeating the receiving portion of a web document, the determination section whether it contains interesting word, and receive and store a whole step document web.

本发明第一方面的这种方法最好还包括在所述web浏览器显示器上突出显示一个通向一个web文档的链接的步骤。 This method of the first aspect of the present invention preferably further comprises a link to a web document leads to the step of highlighting on the web browser display.

本发明第一方面的这种方法最好还包括在所述浏览器显示器上创建一个通向所述所存储的web文档的快速链接。 This method of the first aspect of the present invention preferably further comprises creating a quick link to the web document stored on the leading browser display.

在第二方面,本发明提供了一种算机程序,所述计算机程序包括在装入一个计算机系统执行时使所述计算机系统执行本发明第一方面的方法的所有步骤的程序代码。 In a second aspect, the present invention provides program code for all the steps of a computer program, the computer program comprising causing the computer system to perform the method of the first aspect of the present invention, when loaded into a computer system executing.

在第三方面,本发明提供了一种能预测性地浏览一个web浏览器系统的用户感兴趣的web文档的增强web浏览器系统,所述web文档含有一些词并且可通过当前页面中的一个链接地址从所述浏览器系统访问,所述系统包括:一个执行程序代码的处理器;一个为用户显示信息的用户显示器;一个接收用户输入的用户输入装置;一个存储数据和信息的持久性存储装置,包括存储于其中的用户感兴趣项目清单,所述感兴趣项目清单含有用户感兴趣的一些词;以及一个由所述处理器执行的预测性的基于兴趣的浏览器,所述浏览器用来从一个链接地址接收一个web文档的一部分,确定一个web文档的所述部分是否含有一个或多个感兴趣项目词,对确定在所述文档部分内发现一个或多个感兴趣项目词作出响应接收和存储所述web文档的整体;其中所述预测性的基于兴趣的浏览 In a third aspect, the present invention provides an enhanced system capable web browser predictively browsing a web browser of interest to users of the system web document, the web document and may contain some of the words in a page by the current accessing the link address from the browser system, said system comprising: a processor executing program code; a user display displaying the user information; a user input device for receiving user input; a persistent storage to store data and information means, including the list of items stored in the user interest therein, the list of items of interest contains some words of the user of interest; and a browser-based predictive interest executed by the processor, to the browser receiving a link address from a portion of the web document, a web document to determine whether the portion of the item of interest comprising one or more words in response to receiving a discovery to determine one or more items of interest within the document portion term and overall storage of the web document; wherein the predictability of interest-based browsing 在所述用户查看所述当前页面的同时执行所述接收、确定和存储操作。 View the current page in the user while performing the receiving, determining and storing operation.

最好,本发明第三方面的系统的预测性的基于兴趣的浏览器还包括一个带有一个浏览器插件的标准web浏览器,所述浏览器插件用来从一个链接地址接收一个web文档的一部分,确定一个web文档的所述部分是否含有一个或多个感兴趣项目词,以及对确定在所述文档部分内发现一个或多个感兴趣项目词作出响应接收和存储所述web文档的整体。 Preferably, the browser based on interest predictive system of the third aspect of the present invention further comprises a standard web browser with a browser plug-in, a browser plug-in for receiving a web document from a link address part of a web document to determine whether the portion of the item of interest comprising one or more words, and in response to receiving and storing the web document to determine the one or more items of interest found in the words of the document portion integrally .

最好,所述web文档包括HTML文档。 Preferably, the web document includes an HTML document.

最好,本发明第三方面的系统还包括一个在所述web浏览器显示器上突出显示一个通向一个所存储的web文档的链接的链接突出显示器。 Preferably, the system of the third aspect of the present invention further comprises a link that leads to a highlighting a web document stored in the link projection display of the web browser display.

最好,本发明第三方面的系统还包括一个快速链接创建器,所述快速链接在所述web浏览器显示器上指向所述所存储的web文档。 Preferably, the system of the third aspect of the present invention further comprises a quick link creator, the quick links on the web browser displays a web pointing to the stored document.

因此本发明适当和可取地使web浏览器可以预测性自动搜索与当前所显示的网页链接的含有web浏览器用户感兴趣的项目的web文档。 Thus, the present invention is suitably and desirably so that the web browser can automatically search for predictive web document containing items of interest to the user's web browser and web links currently displayed. 所链接的含有感兴趣的项目的文档在用户查看当前文档的同时适当地自动检索出来并予以存储,从而如果用户选择了通向所存储的文档的链接,就可以显示这个文档而不用再等待它下载。 The linked document contains an item of interest to properly automatically retrieves the user to view the current document at the same time come out and be stored, so that if the user selects a link leading to the stored documents, you can display the document without having to wait for it download. 为了进一步帮助用户查找含有用户感兴趣的项目的文档,可以突出显示当前页内通向感兴趣的文档的链接,还可以创建和显示通向这些网页的专用快速链接,以便更好提醒和方便用户使用。 To further help users find documents containing items of interest to the user, you can highlight the link leads to interest in the current page of the document, you can also create and display special quick links leading to these pages, in order to better alerts and user-friendly use.

因此,本发明的优选实施例有益地提供了一种web浏览方法和系统,可以根据用户感兴趣的项目或关键字清单预测性地从诸如万维网之类的计算机网络服务器和分布式数据库检索信息。 Accordingly, preferred embodiments of the present invention advantageously provide a web browsing method and system, according to the information items of interest to a user or a keyword list predictively retrieved from the World Wide Web or the like, such as a computer network server and distributed database. 此外,有益的是,这种新的系统和方法与广泛使用的诸如个人计算机、支持web的电话机、因特网设备、个人数字助理和袖珍PC之类的浏览器技术兼容,只需极少甚至不需服务器侧的支持或配合技术。 In addition, it is beneficial, such as a personal computer, the new systems and methods widely used web-enabled telephones, Internet appliances, personal digital assistants and the like pocket PC's browser technology is compatible, not even minimal with technical support or server-side needs. 此外,有益的是,这种新的系统和方法在用户的显示器上突出显示预测性高速缓存的信息或者通向这种信息的链接,使用户可以方便和快速地查看预测性高速缓存的信息。 In addition, it is beneficial, this new system and method to highlight the predictive cache of information or links leading to this information, so that users can easily and quickly view information predictive cache on the user's monitor.

一些优选实施例还可取地提供了一种将一个浏览器系统配置成包括一个用户感兴趣项目清单的系统和方法。 Some preferred embodiments also desirable to provide a system is configured to a browser system and method include a list of items of interest to the user. 这种方法提供了一个列有用户最经常搜索的关键字的清单,这个清单可以用于同一个客户机web浏览器计算机上的其他软件程序。 This method provides a list of user lists the most frequently searched keywords, this list can be used with other software programs on a client web browser on your computer.

附图说明 BRIEF DESCRIPTION

下面将结合附图举例说明本发明的一个优选实施例,在这些附图中:图1示出了在因特网客户机或web浏览器系统、web服务器系统和通信网络之间的众所周知的配置;图2例示了web浏览器和web服务器系统的众所周知的体系结构;图3示出了一个网站上的一些超链接文档的典型树形结构;以及图4揭示了本发明的优选实施例的配置。 The following drawings illustrate embodiments in conjunction with a preferred embodiment of the present invention, in the drawings: FIG 1 shows a known disposed between the Internet web browser, or client systems, web server system and the communication network; FIG. 2 illustrates a known architecture of a web browser and a web server system; FIG. 3 shows a typical tree structure hyperlinks to documents on a web site; and Figure 4 discloses a preferred configuration of an embodiment of the present invention.

具体实施方式 Detailed ways

对于本说明来说,假设所有与搜出和装入网页关联的任务都由一个诸如Netscape的Navigator或Microsoft的Explorer之类的web浏览器应用程序执行。 For the purposes of this description, it is assumed that all web pages loaded with seized and associated tasks are handled by a web browser application such as Netscape's Navigator or Microsoft's Explorer, being executed. 实际上,本发明的在这里所说明的实施例可以用与web浏览器关联的软件实现,这软件可以是也可以不是浏览器本身的一部分,诸如一个协作的独立软件应用程序或浏览器插件模块之类。 Indeed, the embodiments described herein may be implemented with the present invention associated with the web browser software, this software may be or may not be part of the browser itself, such as an independent software collaborative application or browser plug-in module such as. 因而,熟悉该技术领域的人员可以认识到,感兴趣项目清单的编制如在这里所说明的那样可以由任何软件实现,其结果可以用于其他与浏览器有关的功能和软件。 Thus, people familiar with the art may recognize that the preparation of the list of items of interest as described herein may be implemented as any software, results can be used for other functions related to the browser and software.

图2示出了典型的web服务器和web浏览器计算机系统的通用硬件和软件体系结构。 Figure 2 shows a common hardware and software architecture of a typical web server and web browser of the computer system. web浏览器计算机(20)通过因特网或内部网(21)与web服务器计算机(22)以通信方式互连。 web browser on the computer (20) communicatively interconnected via the Internet or an intranet (21) with the web server computer (22). web浏览器系统包括诸如计算机显示器或监视器、键盘和鼠标之类的标准用户接口装置(23)。 The system includes a standard web browser user interface device (23) such as a computer display or monitor, a keyboard and a mouse. web浏览器计算机(20)的硬件平台包括中央处理器(“CPU”)(24)、磁盘驱动器(25)、用户接口装置I/O(26)和网络接口卡(“NIC”)(27)。 web browser on the computer (20) The hardware platform includes a central processing unit ( "CPU") (24), a magnetic disk drive (25), user interface means I / O (26) and a network interface card ( "NIC") (27) . NIC可以是在该技术领域内若干众所周知的品种之一,包括拨号上网调制解调器、局域网(“LAN”)卡或者电缆调制解调器接口。 NIC may be in one of several well known in the art varieties, including dial-up modem, a local area network ( "LAN") card or a modem interface cable. web浏览器计算机(20)执行的软件可以包括一些设备驱动器和一个基本输入/输出系统(“BIOS”)(28),以及操作系统(203)、应用程序(202)和小应用程序解释器(29)和小应用程序(201)。 (20) software executing a web browser on the computer may include some device drivers and a basic input / output system ( "BIOS") (28), and an operating system (203), the application (202) and an applet interpreters ( 29) and applets (201). web浏览器程序,诸如Netscape的Navigator之类,是一个可以由CPU(24)执行的应用程序。 web browser program, such as Netscape's Navigator or the like, an application can be executed by the CPU (24). 这种具有一个web服务器计算机的体系结构和配置在该技术领域内是众所周知的。 This web has a configuration server computer architecture and in the art is well known.

在这个优选实施例中,标准的web浏览器应用软件程序修改成包括一些逻辑和功能增强。 In this preferred embodiment, a standard web browser software application program logic and modified to include a number of enhancements. 这些功能上的增强利用了现有web浏览器的一些现有能力,诸如:(1)解释所接收的web文档;(2)使一个web文档全部或一部分可以在当前web浏览器显示视窗内显示;(3)在web浏览器显示视窗内显示用户选项图标、下拉清单或其他模式的控制指示符;(4)接收用户对在web浏览器接收视窗内显示的用户选项图标、下拉清单和其他模式的控制指示符的选择;以及(5)建立、存储和访问系统存储器内特别是诸如硬盘驱动器和非易失RAM或ROM之类的持久性存储器内的诸如文档、记录和cookie之类的数据项。 These features enhance the ability to use some of the existing conventional web browser, such as: (1) Explanation of the received web document; (2) contacting all or part of a web document display window may be displayed on the current web browser ; (3) in the web browser displays user options icons, pull-down list or other mode indicator in the control window; (4) receives the user's user options icon displayed in the web browser receives window, drop-down lists and other modes selection control indicator; and (5) to establish, in particular, such as a hard drive in the persistent memory or non-volatile RAM and a ROM such as a document, record and the like cookie data items within the memory storage and access system .

由于上述web浏览器系统的一般配置和体系结构在该技术领域内是众所周知的,因此发明优选实施例的其余说明将就可取地作为一个在IBM兼容计算机上的Microsoft的Windows[TM]操作系统下运行的Netscape的Navigator的浏览器插件实现的步骤和功能给出。 Since the general configuration and architecture of said web browser system in the art is well known, the remaining description of the preferred embodiment of the invention will preferably as a Microsoft-compatible computer on Windows [TM] in IBM Operating System Netscape Navigator steps and functions of running a browser plug-in implementation is given. 然而,熟悉相关技术领域的人员可以认识到,在不背离本发明的范围的情况下,诸如UNIX、Linux和Sun Microsystem的Solaris之类的其他操作系统,诸如IBM的RS6000、Apple的iMac(TM)、个人数字助理和支持web的电话机之类的其他计算机硬件,以及诸如Java脚本或编泽程序之类的其他软件实施方式也可以采用。 However, persons familiar with the art may recognize that, without departing from the scope of the present invention, such as UNIX, other operating systems Linux and Sun Microsystem's Solaris or the like, such as IBM's RS6000, Apple's iMac (TM) , personal digital assistants and other web-enabled computer hardware telephones and the like, as well as other embodiments such as Java script or software for compiling a program like may also be used. 在还有一些实施例中,web服务器的小服务程序或程序可以维护感兴趣项目清单,使这个清单根据请求可为客户机侧程序和插件所用。 In some embodiments, the small service program or web server can maintain a list of items of interest, so this list may, upon request to the client-side program and the plug-ins used.

概括地说,本发明的优选实施例改进了web浏览器的原来概念和功能。 Broadly speaking, embodiments improve the original concept and function of the web browser is preferably present invention. 可取的是这种web浏览器确定哪些关键字可能是web浏览器用户感兴趣的。 It is preferable that the web browser to determine which keywords might be of interest to web browser users. 这些感兴趣项目可取地存储在系统的持久性存储器内,是可由本发明作为一个平面(flat)文本文档访问的。 The item of interest preferably stored in the persistent storage system, by the present invention as is (Flat) a text document accessed plane. 也可以采用感兴趣项目清单的其他实施方式,诸如在一个数据库内的记录之类,所有这些实施方式都是可由包括本发明优选实施例的浏览器插件的其他程序容易访问的。 Other embodiments may be employed list of items of interest, such as a record in a database or the like, all of which are embodiments may include other browser plug-in program of the preferred embodiment of the present invention is easily accessible.

可以配合本发明的优选实施例采用其他建立感兴趣项目清单的方法或系统,然而上面所说明的系统和方法提供了产生感兴趣项目清单的一些有用方法。 It can be used with the present preferred embodiment of the invention using other methods or systems of interest to establish a list of items, but the systems and methods described above provide a number of useful methods of producing the list of items of interest.

表1示出了感兴趣项目清单实施例在产生后的一个例子。 Table 1 shows a list of items of interest in the example embodiment generates embodiment. 这个例子的用户感兴趣项目清单以由逗号分隔的变量(“CSV”)格式给出,其中规定冒号“:”标明一个列有一些子类的总类。 The example given in the list of items the user is interested variable ( "the CSV") separated by commas format, wherein the predetermined colons ":" indicating a general category listed some subclasses. 如果一个类或项目后没有冒号,就假设所有在这个类下可得到的子类和项目都是感兴趣的。 If the project is not a class or a colon, it is assumed that all available in this class and subclass projects are of interest.

表1:用户感兴趣项目清单文档实例政治<CR> Table 1: list of items of interest to the user document instance political & lt; CR & gt;

体育:棒球,职业篮球,摩托车运动<CR> Sports: baseball, professional basketball, motorcycling & lt; CR & gt;

<EOF> & Lt; EOF & gt;

用户感兴趣项目清单最好是用户直接可编辑的,因此如果一个用户希望删除一个以前可能已经添加的感兴趣项目,他就可以用一个普通的文本文档编辑器或数据库程序很方便地这样做。 Users interested in the project list of the best user directly editable, so if a user wants to delete a previously might have added interest in the project, he can use a plain text document editor or database program is very easy to do so. 同样,如果一个用户稍后希望添加一个感兴趣项目,他就可以重新调用菜单或者直接编辑一个文档。 Similarly, if a user wants to add items of interest later, he can recall the menu or directly edit a document.

本发明的优选实施例提供了两个用户可选过程,用于根据用户感兴趣项目清单预测性地检索和高速缓存来自web服务器的信息。 Preferred embodiments of the present invention provides two user selectable process for information from a web server according to the list of items of interest to a user and retrieved predictive cache. 在第一过程中,只有“感兴趣项目”的专用超链接信息将优先高速缓存,从而改进了众所周知的由web浏览器高速缓存所有的“1次跳转(1hop)”的网页的过程。 In the first process, only "items of interest" special priority hyperlink information will be cached to improve the well-known by the web browser cache all "1 hops (1hop)" process web pages. 在这里揭示的第二过程对任何通向含有用户感兴趣项目的信息的超链接进行突出显示,例如通过在web浏览器显示器上突出显示文字或图像、在一个独立的web浏览器视窗内或者在原web浏览器视窗的一个专用框架内扫视之类,使用户注意这些链接。 Hyperlink information item of the second user is interested in the process disclosed herein contain any lead is highlighted, for example, by highlighting text or image on a web browser to display, in a separate web browser window or the original panning or the like within a dedicated web browser window frame, so that the user's attention these links.

为了在下面的详细说明中更为清晰和专用,采用以下术语:“感兴趣项目(interest term)”是最终用户感兴趣的自明式词或词组;“N次跳转扫描(N hop scan)”表示在其中web浏览器将试图预测性地装入和检查网页和所关联的文本的链接空间;“感兴趣链接(interest link)”是那些在含有感兴趣项目的“N次跳转扫描”内可访问的超链接;“快速链接(fast link)”是一个从含有一个通向所发现的含有感兴趣项目的网页的直接链接的普通网页显示的杂乱背景中提取的高度可见链接;“深链接(deep-linking)”是一个通常接受的术语,指从一个机构的网站深处拉出web内容,或者通过一系列URL检索数据,而不必装入或访问中间的网页;“凝视时间(contemplation time)”定义为用户在一个给出的网页上花费的时间,是可供web浏览器系统确定和突出显示当前所装入的网页的任何感兴趣的 For the following detailed description of specific and more clearly, the following terms: "item of interest (interest term)" is a word or phrase of interest of formula evident end user; "N hops scans (N hop scan)" represents wherein the web browser will attempt to link and load the spatial predictively checking pages and associated text; "link of interest (interest link)" are those containing an item of interest "N hops scans" within hyperlinks accessible; highly visible link cluttered background "Quick Links (fast link)" is a direct link from a web page containing containing items of interest are found leading to a general page that appears extracted; "deep links (deep-linking) "it is a generally accepted term for web content drawn from the depths of a body site, URL, or through a series of data retrieval, without having to access web pages or intermediate loaded;" fixation time (contemplation time ) "is defined as the time a user on a given webpage spent, is available for web browsers and the system determines that highlight any interest in the currently loaded web page 接分支使用的时间;以及“TB”是浏览器扫描一个网页搜索感兴趣项目时下载的文本的长度,例如以字节为单位。 Time then branch used; and "TB" is the length of time the browser to download a Web search to scan items of interest text, for example, in bytes.

N次跳转扫描,正如以上所讨论的,是对从出发点起在“N”个超链接内可通达的文档进行的预测性扫描或检索。 N hops scanning, as discussed above, is a predictive scanning or retrieval from the starting point in the "N" hyperlinks can access the document is. 图3示出了一个网站内容的典型树形结构或表示。 A typical tree structure Figure 3 shows the content of a website or FIG. 每个网页具有一些从它起的超链接网页,这些超链接示为从一个网页指向另一个网页或另一些网页的箭头线。 Each page has a page from its hyperlinks from these hyperlinks is shown another web page or other pointing arrow line from a web page. 变量“N”表示相对于出发点查找信息的深度或空间。 Variable "N" means with respect to the starting point to find the depth or spatial information.

例如,一个1次跳转扫描(例如,N=1)(51)检索所有通过从当前网页(50)单次“点击”或超链接可访问的超链接文档,在这个例子中即为页面2、3和4,对这些文档的网页内容进行扫描,看是否出现用户感兴趣项目。 For example, once a jump scan (e.g., N = 1) (51) retrieves all documents via hyperlinks from the current web page (50) a single "click" or hyperlink accessible, namely in this example page 2 3 and 4, the web page content of these documents are scanned to see if users are interested in the project appears.

同样,2次跳转扫描(N=2)(52)检索所有通过二次“点击”从当前网页超链接可访问的文档,例如在这个例子中为所有1次跳转扫描的页面加上页面2a、3a、3b、4a和4b。 Similarly, 2 Jump scanning (N = 2) (52) retrieves all through the second "click" from the document the current Web page hyperlink to access, such as in this case all jump once scanned pages plus page 2a, 3a, 3b, 4a and 4b.

从这个示意图的树形展开可见,需考虑的数据量可能相对N的值指数增长,高阶扫描将更实用,但需要更大的计算机网络通信带宽和更高的web浏览器计算机的处理器速度。 From this schematic the tree expansion visible, the amount of data may be considered a relative value of N exponential growth, the high-order scanning will be useful, but require greater processor speed computer network communication bandwidth and higher web browser computer .

来看图4,图中示出了优选实施例的实现结构。 Figure 4, there is shown a preferred embodiment of the implementation structure. 感兴趣项目预测性扫描器插件(43)在web浏览器计算机(20)上web浏览器程序(40)环境内运行,用web浏览器计算机的用户I/O(23)为用户显示突出显示的链接、快速链接和所产生的显示框架,如在以下说明中所述。 Predictive widget items of interest to the scanner (43 is) a web browser program on a computer web browser (20) (40) operating within the environment, with the web browser of the computer user I / O (23) displayed for the user highlighted link, quick links and displaying the generated frame, as described in the following description. 一个简单的文本文档或一些数据库记录内的用户感兴趣项目清单(42)从它的存储媒体(41)访问,例如是存储在一个硬盘驱动器上或者在web浏览器系统(20)的持久性存储器内。 A simple text document or some interest to the user the list of items (42) within a database record (41) accessed from its storage medium, for example, stored on a hard drive or (20) of a persistent memory in the web browser system Inside. 或者,用户感兴趣项目清单(42)也可以从一个由web浏览器系统(20)可访问的web或网络服务器访问。 Alternatively, a user interested in the list of items (42) can (20) is accessible from a web browser by the system's web server or network access.

感兴趣项目预测性扫描器插件(43)还利用web浏览器计算机(20)的通信能力(诸如它的网络接口卡和通信协议(TCP/IP)之类)和web浏览器程序(40)的通信和显示能力(诸如HTTP之类)有选择地从因特网(3)或其他计算机网络检索web文档的一些部分。 (20) an item of interest communications capability predictive widget scanner (43) further computer using a web browser (such as a network interface card and its communication protocol (TCP / IP) or the like) and the web browser program (40) communications and display capabilities (such as HTTP) selectively retrieves portions of the web document from the Internet (3) or other computer networks.

本发明的优选实施例在对一个当前网页的凝视时间期间进行操作,根据用户感兴趣项目预测性地检索在N次跳转扫描空间内的超链接文档。 Preferred embodiments of the present invention operates during a current time gaze page, the user hyperlinked documents retrieved item of interest predictively N hops in accordance with the scanning space. 假设感兴趣的关键字可以存储在web浏览器系统和/或所关联的软件内。 Assuming that interest keyword can be stored in the web browser system and / or the associated software. 接着,由“预读”(read-ahead)的预测性下载利用对这样的感兴趣项目的知识。 Next, the predictive "pre-reading" (read-ahead) downloads using the knowledge of such items of interest.

一旦一个web浏览器在一个用户对任何网页的选择或选择网页的其他动作(诸如选择一个书签、导航按键之类)后装入该网页,预测性高速缓存过程就立即开始。 Once loaded into the page in a web browser a user operation of any other web page selection or selection (such as selecting a bookmark, a navigation key or the like), the predictive cache process starts immediately. 当前装入和查看的网页设置为N次跳转扫描的出发点或者说“当前页”。 And view the currently loaded page is set to N times the starting point jump scan or "this page."

本发明的优选实施例于是分析当前页的源,诸如当前页的HTML之类,开始下载所有直接与当前页链接的称为1次跳转网页的网页。 Preferred embodiments of the present invention then analyzes the source of this page, such as HTML or the like of this page, begins downloading all the pages are linked directly with this page is called a page hops. 每个网页的下载在成功地接收到预定数据量例如由TB给出的字节数或千字节数)后中止。 Discontinuation download each page in the number of bytes successfully received a predetermined amount of data, for example, given by TB or a few thousand bytes).

接着,对每个网页的下载部分进行扫描,确定它们是否含有任何用户感兴趣项目。 Next, scan the download section of each page to determine whether they contain any user interested in the project. 如果在下载了预定个字节后,在这个网页的明文或元词内未发现任何用户感兴趣项目,就停止下载。 If, after downloading a predetermined number of bytes, in plain text or meta words on this page does not find any user interested in the project, will stop downloading. 由于停止整个网页的下载,浏览器就节约了网络带宽和时间,所节约的这些资源于是可以用来扫描下一个可能的感兴趣网页。 Due to stop downloading the entire page, the browser saves network bandwidth and time, saving these resources can then be used to scan the next page might be interested. 如果发现感兴趣项目,就恢复和完成下载,将链接的整个网页存储在高速缓存器内。 If you find an item of interest, and to resume the download is complete, the entire page will be a link in the storage cache.

在用户继续注视当前装入的网页时浏览器检查下一个1次跳转网页,然后是再下一个,直到根据需要所有的1次跳转网页都得到扫描和高速缓存。 The browser when the user checks continue to monitor the currently loaded page 1 next jump page, and then again the next, until all of the necessary primary and jump pages have been scanned cache.

如果所有的1次跳转网页在用户结束检阅当前页前都得到扫描,就增加次跳转层次(hop level),通过下载每个2次跳转然后是3次跳转等的网页扫描相继深度层的网站内容,搜索关键字,如果发现感兴趣项目就高速缓存整个网页,如上面所说明的那样。 If you have been before all once the user ends the jump page review of the current page scanning, increased levels of hops (hop level), by downloading each twice and then jump three jumps and other pages have been scanned depth website layer content, search keywords, if you find an item of interest to cache the entire page, as explained above.

这种预测性扫描过程可以用表2的伪码描述。 Such predictive scanning process can be described by the pseudocode in Table 2.

表2:预测性扫描过程的伪码UNTIL(用户选择current_page中的链接):FOR hop=1 to N:Scan_page=current_pagecatalog all referenced_links from current_pagerandomly order from first to last all referenced_linksFROM first TO last referenced_link:download document portion at referenced_linkscan portion far occurrences of interest termsIF occurrences found,THEN:complete download of documentstore document in cachehighlight referenced_linkcreate″fast link″to cached document(任选)ELSE discard portion of documentNEXT referenced link/*搜索在本次跳转中链接的文档的下一部分*/NEXT hop/*搜索从当前页再次跳转的下一组文档*/ Table 2: Pseudo Code UNTIL predictive scanning process (the user selects the link current_page in): FOR hop = 1 to N: Scan_page = current_pagecatalog all referenced_links from current_pagerandomly order from first to last all referenced_linksFROM first TO last referenced_link: download document portion at referenced_linkscan portion far occurrences of interest termsIF occurrences found, THEN: complete download of documentstore document in cachehighlight referenced_linkcreate "fast link" to cached document (optional) ELSE discard portion of the document documentNEXT referenced link / * Search link in the current jump in the next part * / nEXT hop / * search from the current page again to jump to the next set of documents * /

在发现一些1次跳转网页含有用户感兴趣项目时,就通过若干方法中任何一种方法使用户注意这些网页。 When it finds a number of 1 Jump page contains user interested in the project, allowing users to pay attention to these pages by any of several methods. 首先,可以在当前页的显示中突出显示通向含有感兴趣项目的网页的超链接或链接,诸如通过改变显示这些超链接的颜色、字体或大小之类。 First of all, can highlight items of interest leads to a web page containing the hyperlink or link, such as displaying the color of hyperlinks, font size, or the like by changing the display of the current page. 在本发明的一个增强实施例中,可以在当前视窗中沿着当前页的一侧、顶部或底部的一个独立框架内或在一个独立的web浏览器视窗内建立一个“快速链接”。 In one embodiment of the present invention enhances may be, in a separate frame, or the top or bottom of establishing a "Quick Link" along one side of this page in a separate web browser window in the current window.

这为用户提供了改进的web浏览器显示,可以按照用户感兴趣项目清单突出显示很可能通向用户感兴趣的文档的链接,使用户可以更为高效地浏览当前的网站。 This provides users with an improved web browser display, the user can follow the list of items of interest to highlight the link is likely to lead to a document of interest to the user, allowing users to more efficiently navigate the current site.

应指出的是,如果采用快速链接显示,多次跳转的感兴趣链接可以是1次跳转的,也就是说,示出在通向感兴趣链接的路径中的下一步,此后示出在这个路径中的下一步等等,或者,多次跳转的感兴趣链接也可以是“深链接的”。 It should be noted that, if the quick links display, repeatedly jump links may be of interest to jump once, that is, shows the next step in the path to the link of interest, after shows in the next step in this path, etc., or, repeatedly jump links of interest can also be a "deep link." 在后一种情况下,框架、视窗之类内示出的第一个链接深链接到感兴趣项目,即使它是仅通过多次跳转才可访问的,并且顶层第一个链接的显示可以是突出显示。 In the latter case, the first link shows the framework, deep window or the like linked to the item of interest, even though it is available only accessible via multiple hops, and the first link may be the top display highlight. 在本发明的一个进一步改进的实施例中,可以用一种突出显示方法突出显示通向感兴趣的文档的1次跳转链接,而可用另一种突出显示方法突出显示通向感兴趣的文档的多次跳转链接。 In a further modified embodiment of the present invention may be highlighted in a method of highlighting a link hops leading document of interest, the method can be used to highlight other documents of interest leads to the highlighted multiple jump link. 例如,1次跳转的感兴趣的链接可以设置成闪烁的红文字,而通向感兴趣的文档的多次跳转链接可以用稳定的红文字示出或突出显示。 For example, a jump of links of interest 1 can be set to flashing red text, leading to the interest of the document can be used multiple times to jump link steady red text shown or highlighted. 设置颜色、字体和闪烁属性的HTML码是众所周知的,因此这个优选实施例的浏览器插件只要改变当前网页这部分的web浏览器显示的这些属性就可以了。 Set the color, font attributes, and flashing HTML code is well known, browser plug-in this preferred embodiment as long as they are currently changing the page which web browser portion of the display on it.

还需注意的是,本发明的优选实施例进行“宽度优先搜索”,而不是从一个给出的出发点通过“N个次跳转”深钻。 Also note that the preferred embodiment of the present invention "BFS", rather than from a starting point given by deep drilling "N th hops." 或者,也可以进行“深度优先搜索”,虽然从本发明的发明者看来这种搜索不大实用,效率也不高,因为可能遗漏或跳过一些没有包含在最初下载的文档部分内的链接。 Alternatively, a "depth-first search," although the inventors of the present invention seems such a search is not practical, efficiency is not high, because it may miss or skip some links not included in the initial download of the documentation section . 可以采用任何一种搜索技术,在这里所揭示的原理是普遍适用的。 Can use any kind of search technology, the principles disclosed herein are generally applicable.

还应该理解,如果需要,可以在独立的视窗或框架内维护一个感兴趣网页“快速链接”的共同清单,即使用户向下进入一条特定的路径。 It should also be understood that, if necessary, to maintain interest in a page "Quick Links" common list in a separate window or frame, even if the user down into a specific path. 例如,考虑为一个处在网页“A”的用户列出一个列有感兴趣的链接“B”和“C”的清单的情况。 For example, consider a user in a web page "A" lists of lists a list of links of interest situation "B" and "C" is. 可以很容易看到,用户会进到感兴趣的链接“B”,同时在独立的框架或视窗内仍然保持着一个通向网页“C”的快速链接。 Can easily see, users will link into the "B" of interest, while in a separate frame or window still maintained a rapid link leads to a page of "C". 在阅读“B”后,如果在阅读时有一些感兴趣的链接到达,在快速链接视窗内保持着“C”使用户可以立即跳回这个先前没有采取的另外路径。 After reading "B", if there is some interest in reading the link to reach and maintain a "C" in the Quick Links window allows the user to instantly jump back to this other path not previously taken.

将本发明的优选实施例并入一个web浏览系统或产品,可以得到根据用户的兴趣浏览万维网及其网站的大量内容的较有智能的装置。 The embodiment incorporating a web browser or the system of the present invention, preferably the product, a lot of content browsing apparatus according to the World Wide Web site and users' interests are more intelligent can be obtained.

虽然以上给出了与一个优选实施例有关的一些具体例子和说明,但熟悉该技术领域的人员可以认识到,在不背离本发明的范围的情况下可以作出各种替换和工程选择,包括但不局限于将这种方法实现为一个应用程序、可移植语言脚本、服务器侧程序或脚本或者浏览器的增强,使用一个诸如支持web的电话机、因特网设备或个人数字助理之类的不同web浏览器计算机,以及使用诸如Windows[TM]CE之类的其他操作系统。 Although some specific examples given above and illustrated relate to a preferred embodiment, but persons skilled in the art may be appreciated without departing from the scope of the present invention may be made of various alternative and engineering choices, including but is not limited to this method is implemented as an application, portable scripting languages, server-side program or script or browser enhancements, such as using a web-enabled phone, Internet web different devices or personal digital assistants browser computer, as well as other operating systems such as Windows [TM] CE and the like.

Claims (13)

1.一种预测性地浏览一个web浏览器系统的用户感兴趣的web文档的方法,所述web浏览器系统具有一个用户显示器、一个用户输入装置和一个持久性存储装置,所述web文档含有一些词并且可通过当前页面中的一个链接地址从所述web浏览器系统访问,所述方法包括下列步骤:从一个链接地址接收一个web文档的一部分;确定一个web文档的所述部分是否含有所述用户感兴趣的一个或多个预定词;以及对确定在所述文档部分存在感兴趣的一个或多个词作出响应,接收和存储所述web文档的整体;其中所述接收、确定和存储步骤在用户查看所述当前页面的同时执行。 A predictive web browser to browse a web document of interest to the system user, said system having a web browser user display, a user input device and a persistent storage device, the web document comprising Some words and may be, said method comprising the steps of the current page from a link address to access the system through the web browser: receiving a portion of a web document from a link address; determining a web document whether the portion containing the one or more predetermined words of interest to said user; and made to determine the presence of one or more words of interest in the partial response document, receiving and storing the entire web document; wherein said receiving, determining and storing step a user to view the current page simultaneously.
2.一种如在权利要求1中所提出的方法,所述方法还包括对于从一个第一web文档起、在预定数量的链接地址内可访问的多个web文档,重复所述接收一个web文档的一部分、确定所述部分是否含有感兴趣的词以及接收和存储一个web文档的整体的步骤。 2. A method in a web set forth in claim 1, said method further comprising for a first web from the document within a predetermined number of the plurality of link address to access a web document, repeating the receiving part of the document, determining whether the portion of the entire words, and the step of receiving and storing a web document of interest contained.
3.一种如在权利要求1中所提出的方法,所述方法还包括在所述web浏览器显示器上突出显示一个通向一个web文档的链接的步骤。 3. A method steps set forth in claim 1, said method further comprising on the web browser displays a web leads to highlight a linked document.
4.一种如在权利要求1中所提出的方法,所述方法还包括在所述web浏览器显示器上创建一个通向所述所存储的web文档的快速链接。 4. A method as set forth in claim 1, the method further comprises creating a quick link to the web document stored on the leading web browser display.
5.一种能预测性地浏览一个web浏览器系统的用户感兴趣的web文档的增强web浏览器系统,所述web文档含有一些词并且可通过当前页面中的一个链接地址从所述浏览器系统访问,所述系统包括:一个执行程序代码的处理器;一个为用户显示信息的用户显示器;一个接收用户输入的用户输入装置;一个存储数据和信息的持久性存储装置,包括存储于其中的用户感兴趣项目清单,所述感兴趣项目清单含有用户感兴趣的一些词;以及一个由所述处理器执行的预测性的基于兴趣的浏览器,所述浏览器用来从一个链接地址接收一个web文档的一部分,确定一个web文档的所述部分是否含有一个或多个感兴趣项目词,对确定在所述文档部分内发现一个或多个感兴趣项目词作出响应接收和存储所述web文档的整体;其中所述预测性的基于兴趣的浏览器在所述用户查看所述当 A predictively can browse a web browser system reinforcing web browser system interest to a user a web document, the web document and may contain some of the words from the current page in the browser via a link address system access, the system comprising: a program code executed by the processor; a user display displaying the user information; receiving a user input of the user input means; storing data and a persistent storage device information stored therein comprising users interested in the project list, the list of items of interest contains some words of interest to the user; and a browser-based interest predictive executed by the processor, the browser used to receive from a link to a web address part of the document, determining the portion of a web document whether the item of interest comprising one or more words, one or more words in response item of interest to receive and store the web document to determine the document found in the portion whole; wherein the predictive interest based on the browser when the user views 页面的同时执行所述接收、确定和存储操作。 Page while performing the receiving, determining and storing operation.
6.一种如在权利要求5中所提出的系统,其中所述预测性的基于兴趣的浏览器包括一个带有一个浏览器插件的标准web浏览器,所述浏览器插件用来从一个链接地址接收一个web文档的一部分,确定一个web文档的所述部分是否含有一个或多个感兴趣项目词,以及对确定在所述文档部分发现一个或多个感兴趣项目词作出响应接收和存储所述web文档的整体。 6. A system as set forth in claim 5, wherein the predictive interest based on standard browser includes a browser plug having a web browser, a browser plug-in for a link from address of the receiving part of a web document, to determine the portion of a web document whether the item of interest comprising one or more words, and in response to receiving and storing the determined one or more items of interest found in the document portion term the overall web documents.
7.一种如在权利要求5或6中所提出的系统,其中所述web文档包括HTML文档。 A system as set forth 5 or claim 6, wherein the web document includes an HTML document.
8.一种如在权利要求5或6中所提出的系统,所述系统还包括一个在所述web浏览器显示器上突出显示一个通向一个所存储的web文档的链接的链接突出显示器。 8. A projection display system link 5 or claim 6 proposed, said system further comprises a highlighting a link leading to a web document stored on the web browser display.
9.一种如在权利要求7中所提出的系统,所述系统还包括一个在所述web浏览器显示器上突出显示一个通向一个所存储的web文档的链接的链接突出显示器。 A system as claimed in claim 7 raised, the system further comprises a link that leads to a highlighting a web document stored in the link projection display of the web browser display.
10.一种如在权利要求5或6中所提出的系统,所述系统还包括一个快速链接创建器,所述快速链接在所述web浏览器显示器上指向所述所存储的web文档。 10. A system as set forth in 5 or 6, claim, said system further comprising a quick link creator, the quick links on the web browser displays a web pointing to the stored document.
11.一种如在权利要求7中所提出的系统,所述系统还包括一个快速链接创建器,所述快速键接在所述web浏览器显示器上指向所述所存储的web文档。 11. A system as claimed in claim 7 raised, the system further comprising a quick link creator, the rapid bonded in the web browser on the display pointing to the stored web documents.
12.一种如在权利要求8中所提出的系统,所述系统还包括一个快速链接创建器,所述快速链接在所述web浏览器显示器上指向所述所存储的web文档。 12. A system as set forth in claim 8, said system further comprising a quick link creator, the quick links on the web browser displays a web pointing to the stored document.
13.一种如在权利要求9中所提出的系统,所述系统还包括一个快速链接创建器,所述快速键接在所述web浏览器显示器上指向所述所存储的web文档。 13. A system as set forth in claim 9, said system further comprising a quick link creator, the rapid bonded in the web browser on the display pointing to the stored web documents.
CNB028061012A 2001-03-08 2002-03-06 Predictive browsing method and system for web pages CN1296853C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US09/801,590 US6874019B2 (en) 2001-03-08 2001-03-08 Predictive caching and highlighting of web pages

Publications (2)

Publication Number Publication Date
CN1522418A CN1522418A (en) 2004-08-18
CN1296853C true CN1296853C (en) 2007-01-24

Family

ID=25181534

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB028061012A CN1296853C (en) 2001-03-08 2002-03-06 Predictive browsing method and system for web pages

Country Status (9)

Country Link
US (1) US6874019B2 (en)
EP (1) EP1368752A2 (en)
JP (1) JP2004531797A (en)
KR (1) KR100583874B1 (en)
CN (1) CN1296853C (en)
CA (1) CA2437933A1 (en)
IL (1) IL157679D0 (en)
TW (1) TW552521B (en)
WO (1) WO2002073460A2 (en)

Families Citing this family (67)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6981040B1 (en) * 1999-12-28 2005-12-27 Utopy, Inc. Automatic, personalized online information and product services
US7747611B1 (en) 2000-05-25 2010-06-29 Microsoft Corporation Systems and methods for enhancing search query results
US6968332B1 (en) * 2000-05-25 2005-11-22 Microsoft Corporation Facility for highlighting documents accessed through search or browsing
US7113935B2 (en) 2000-12-06 2006-09-26 Epicrealm Operating Inc. Method and system for adaptive prefetching
JP2002351736A (en) * 2001-03-23 2002-12-06 Matsushita Electric Ind Co Ltd Document data processor, server device, terminal device and document data processing system
US20030074635A1 (en) * 2001-10-11 2003-04-17 International Business Machines Corporation Method, apparatus, and program for finding and navigating to items in a set of web pages
US6877136B2 (en) * 2001-10-26 2005-04-05 United Services Automobile Association (Usaa) System and method of providing electronic access to one or more documents
AU2003212140A1 (en) * 2002-03-11 2003-09-22 Research In Motion Limited System and method for pushing data to a mobile device
US20030225855A1 (en) * 2002-05-30 2003-12-04 International Business Machines Corporation Method and apparatus for realtime provision of related subject matter across internet content providers
US7801945B1 (en) 2002-07-03 2010-09-21 Sprint Spectrum L.P. Method and system for inserting web content through intermediation between a content server and a client station
US7568002B1 (en) 2002-07-03 2009-07-28 Sprint Spectrum L.P. Method and system for embellishing web content during transmission between a content server and a client station
US7360210B1 (en) 2002-07-03 2008-04-15 Sprint Spectrum L.P. Method and system for dynamically varying intermediation functions in a communication path between a content server and a client station
EP1400903A1 (en) * 2002-09-19 2004-03-24 Sony United Kingdom Limited Information storage and retrieval
GB2393802A (en) * 2002-10-01 2004-04-07 Hewlett Packard Co Establishment of network connections
US20050177564A1 (en) * 2003-03-13 2005-08-11 Fujitsu Limited Server, method, computer product, and terminal device for searching item data
US20040221232A1 (en) * 2003-04-30 2004-11-04 International Business Machines Corporation Method for readily storing and accessing information in electronic documents
US7904585B1 (en) * 2003-09-05 2011-03-08 Skyware, Inc. Predictive browser and protocol package
US7949960B2 (en) * 2003-09-30 2011-05-24 Sap Ag Predictive rendering of user interfaces
US8234373B1 (en) 2003-10-27 2012-07-31 Sprint Spectrum L.P. Method and system for managing payment for web content based on size of the web content
US7873537B2 (en) * 2003-12-04 2011-01-18 International Business Machines Corporation Providing deep linking functions with digital rights management
US9172679B1 (en) 2004-04-14 2015-10-27 Sprint Spectrum L.P. Secure intermediation system and method
US8522131B1 (en) 2004-04-14 2013-08-27 Sprint Spectrum L.P. Intermediation system and method for enhanced rendering of data pages
US7853782B1 (en) 2004-04-14 2010-12-14 Sprint Spectrum L.P. Secure intermediation system and method
GB2415063A (en) * 2004-06-09 2005-12-14 Oracle Int Corp Data retrieval method
GB2416221A (en) * 2004-07-10 2006-01-18 Hewlett Packard Development Co Analysing a multi stage process
US7590631B2 (en) * 2004-09-02 2009-09-15 Hewlett-Packard Development Company, L.P. System and method for guiding navigation through a hypertext system
US7512973B1 (en) 2004-09-08 2009-03-31 Sprint Spectrum L.P. Wireless-access-provider intermediation to facilliate digital rights management for third party hosted content
US8327440B2 (en) 2004-11-08 2012-12-04 Bt Web Solutions, Llc Method and apparatus for enhanced browsing with security scanning
US20060069617A1 (en) * 2004-09-27 2006-03-30 Scott Milener Method and apparatus for prefetching electronic data for enhanced browsing
US8732610B2 (en) * 2004-11-10 2014-05-20 Bt Web Solutions, Llc Method and apparatus for enhanced browsing, using icons to indicate status of content and/or content retrieval
US7600011B1 (en) 2004-11-04 2009-10-06 Sprint Spectrum L.P. Use of a domain name server to direct web communications to an intermediation platform
US7496600B2 (en) * 2004-12-02 2009-02-24 Taiwan Semiconductor Manufacturing Co., Ltd. System and method for accessing web-based search services
EP1844612B1 (en) * 2005-02-04 2017-05-10 Barco NV Method and device for image and video transmission over low-bandwidth and high-latency transmission channels
US20060294223A1 (en) * 2005-06-24 2006-12-28 Microsoft Corporation Pre-fetching and DNS resolution of hyperlinked content
CN101455057A (en) * 2006-06-30 2009-06-10 国际商业机器公司 A method and apparatus for caching broadcasting information
US7660787B2 (en) * 2006-07-19 2010-02-09 International Business Machines Corporation Customized, personalized, integrated client-side search indexing of the web
US20080097979A1 (en) * 2006-10-19 2008-04-24 International Business Machines Corporation System and method of finding related documents based on activity specific meta data and users' interest profiles
JP4915219B2 (en) * 2006-11-24 2012-04-11 富士通株式会社 Hypertext conversion program, method and apparatus
CN100578502C (en) 2007-01-05 2010-01-06 中兴通讯股份有限公司 Embedded browser browsing method and system
US9021352B2 (en) * 2007-05-17 2015-04-28 Adobe Systems Incorporated Methods and apparatus for predictive document rendering
US20080301573A1 (en) * 2007-05-30 2008-12-04 Liang-Yu Chi System and method for indicating page component focus
US20080301300A1 (en) * 2007-06-01 2008-12-04 Microsoft Corporation Predictive asynchronous web pre-fetch
US7877369B2 (en) * 2007-11-02 2011-01-25 Paglo Labs, Inc. Hosted searching of private local area network information
US7877368B2 (en) * 2007-11-02 2011-01-25 Paglo Labs, Inc. Hosted searching of private local area network information with support for add-on applications
US20100162126A1 (en) * 2008-12-23 2010-06-24 Palm, Inc. Predictive cache techniques
KR101132220B1 (en) * 2008-12-30 2012-04-26 엔에이치엔(주) Method, system and computer-readable recording medium for providing web page using cache
US8250053B2 (en) * 2009-02-24 2012-08-21 Microsoft Corporation Intelligent enhancement of a search result snippet
JP5490875B2 (en) * 2009-04-14 2014-05-14 フリーダム サイエンティフィック インコーポレイテッド Document navigation method and a computer system
US20110022945A1 (en) * 2009-07-24 2011-01-27 Nokia Corporation Method and apparatus of browsing modeling
US8365064B2 (en) * 2009-08-19 2013-01-29 Yahoo! Inc. Hyperlinking web content
US20110209040A1 (en) * 2010-02-24 2011-08-25 Microsoft Corporation Explicit and non-explicit links in document
CN101777081A (en) * 2010-03-08 2010-07-14 中兴通讯股份有限公司 Method and device for improving webpage access speed
CN102238204A (en) * 2010-04-23 2011-11-09 腾讯科技(深圳)有限公司 Network data acquisition method and system
US8706854B2 (en) * 2010-06-30 2014-04-22 Raytheon Company System and method for organizing, managing and running enterprise-wide scans
US8788762B2 (en) 2010-09-30 2014-07-22 Nokia Corporation Methods and apparatuses for data resource provision
US8924873B2 (en) 2010-11-23 2014-12-30 International Business Machines Corporation Optimizing a user interface for a computing device
US20120137201A1 (en) * 2010-11-30 2012-05-31 Alcatel-Lucent Usa Inc. Enabling predictive web browsing
US9454607B1 (en) * 2010-12-10 2016-09-27 A9.Com, Inc. Image as database
US8948794B2 (en) 2011-03-14 2015-02-03 Nokia Corporation Methods and apparatuses for facilitating provision of a map resource
US8687840B2 (en) 2011-05-10 2014-04-01 Qualcomm Incorporated Smart backlights to minimize display power consumption based on desktop configurations and user eye gaze
US8612418B2 (en) * 2011-07-14 2013-12-17 Google Inc. Mobile web browser for pre-loading web pages
US9146909B2 (en) 2011-07-27 2015-09-29 Qualcomm Incorporated Web browsing enhanced by cloud computing
US10127314B2 (en) * 2012-03-21 2018-11-13 Apple Inc. Systems and methods for optimizing search engine performance
CN103067908A (en) * 2012-12-27 2013-04-24 北京小米科技有限责任公司 Data processing method, device and terminal
US20150113093A1 (en) * 2013-10-21 2015-04-23 Frank Brunswig Application-aware browser
US20160127497A1 (en) * 2014-11-03 2016-05-05 Evgeny Himmelreich Smart site preloading
US10169481B2 (en) * 2015-02-18 2019-01-01 Adobe Systems Incorporated Method for intelligent web reference preloading based on user behavior prediction

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6131085A (en) * 1993-05-21 2000-10-10 Rossides; Michael T Answer collection and retrieval system governed by a pay-off meter
US5867799A (en) * 1996-04-04 1999-02-02 Lang; Andrew K. Information system and method for filtering a massive flow of information entities to meet user information classification needs
JPH1063679A (en) 1996-08-23 1998-03-06 Nippon Telegr & Teleph Corp <Ntt> Information presentation device
JPH10207901A (en) 1997-01-22 1998-08-07 Nippon Telegr & Teleph Corp <Ntt> Method and system for providing information
JP3774807B2 (en) * 1997-08-06 2006-05-17 タキオン インコーポレイテッドTachyon,Inc. How to prefetch a distributed system and object
US5848410A (en) * 1997-10-08 1998-12-08 Hewlett Packard Company System and method for selective and continuous index generation
US6009410A (en) * 1997-10-16 1999-12-28 At&T Corporation Method and system for presenting customized advertising to a user on the world wide web
US6009429A (en) * 1997-11-13 1999-12-28 International Business Machines Corporation HTML guided web tour
US6078928A (en) * 1997-12-12 2000-06-20 Missouri Botanical Garden Site-specific interest profiling system
US6094649A (en) * 1997-12-22 2000-07-25 Partnet, Inc. Keyword searches of structured databases
US6085226A (en) * 1998-01-15 2000-07-04 Microsoft Corporation Method and apparatus for utility-directed prefetching of web pages into local cache using continual computation and user models
US6182133B1 (en) * 1998-02-06 2001-01-30 Microsoft Corporation Method and apparatus for display of information prefetching and cache status having variable visual indication based on a period of time since prefetching
US6088731A (en) * 1998-04-24 2000-07-11 Associative Computing, Inc. Intelligent assistant for use with a local computer and with the internet
US6151630A (en) * 1998-05-15 2000-11-21 Avaya Technology Corp. Non-redundant browsing of a sequencing of web pages
JP2000215138A (en) * 1999-01-22 2000-08-04 Casio Comput Co Ltd Information searching device and storage medium which stores program
US20010051927A1 (en) * 2000-06-08 2001-12-13 Blinkspeed, Inc. Increasing web page browsing efficiency by periodically physically distributing memory media on which web page data are cached
JP2002259544A (en) * 2001-03-02 2002-09-13 Willone Corp System of electronic exhibition

Also Published As

Publication number Publication date
CN1522418A (en) 2004-08-18
US6874019B2 (en) 2005-03-29
US20020165925A1 (en) 2002-11-07
WO2002073460A2 (en) 2002-09-19
TW552521B (en) 2003-09-11
JP2004531797A (en) 2004-10-14
KR100583874B1 (en) 2006-05-26
EP1368752A2 (en) 2003-12-10
IL157679D0 (en) 2004-03-28
KR20030082607A (en) 2003-10-22
WO2002073460A3 (en) 2003-09-18
CA2437933A1 (en) 2002-09-19

Similar Documents

Publication Publication Date Title
US7962466B2 (en) Automated tool for human assisted mining and capturing of precise results
US6356908B1 (en) Automatic web page thumbnail generation
US7823054B2 (en) Snapback user interface for accessing different document pages directly without going through intermediate pages
US7788248B2 (en) Immediate search feedback
US8655872B2 (en) Search systems and methods using in-line contextual queries
US8156444B1 (en) Systems and methods for determining a user interface attribute
US6310630B1 (en) Data processing system and method for internet browser history generation
US7721192B2 (en) User interface for a resource search tool
KR100266937B1 (en) Web browser method and system for display and management of server latency
US7406659B2 (en) Smart links
US9177030B2 (en) Systems and methods for providing search results
US6581056B1 (en) Information retrieval system providing secondary content analysis on collections of information objects
US7484181B2 (en) Web page display system
US7454694B2 (en) Method and system for organizing document information in a non-directed arrangement of documents
JP4587634B2 (en) How to enlarge a portion of the document in the browser, device, and program
US6751777B2 (en) Multi-target links for navigating between hypertext documents and the like
US7424510B2 (en) Methods and systems for Web-based incremental searches
US8676868B2 (en) Macro programming for resources
US7680856B2 (en) Storing searches in an e-mail folder
US6453342B1 (en) Method and apparatus for selective caching and cleaning of history pages for web browsers
KR100394544B1 (en) Network-based document reviews methods and systems tulyong
US8015259B2 (en) Multi-window internet search with webpage preload
US6981210B2 (en) Self-maintaining web browser bookmarks
US6763388B1 (en) Method and apparatus for selecting and viewing portions of web pages
US7865511B2 (en) News feed browser

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination
C14 Grant of patent or utility model
C17 Cessation of patent right