WO2002023401A2 - Systeme et procede d'acces a des pages web - Google Patents

Systeme et procede d'acces a des pages web Download PDF

Info

Publication number
WO2002023401A2
WO2002023401A2 PCT/US2001/027647 US0127647W WO0223401A2 WO 2002023401 A2 WO2002023401 A2 WO 2002023401A2 US 0127647 W US0127647 W US 0127647W WO 0223401 A2 WO0223401 A2 WO 0223401A2
Authority
WO
WIPO (PCT)
Prior art keywords
web page
content
proxy
differences
web
Prior art date
Application number
PCT/US2001/027647
Other languages
English (en)
Other versions
WO2002023401A3 (fr
Inventor
Richard Hayton
David Halls
Original Assignee
Citrix Systems, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Citrix Systems, Inc. filed Critical Citrix Systems, Inc.
Priority to AU2001288820A priority Critical patent/AU2001288820A1/en
Publication of WO2002023401A2 publication Critical patent/WO2002023401A2/fr
Publication of WO2002023401A3 publication Critical patent/WO2002023401A3/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes

Definitions

  • the invention relates in general to accessing web pages and more specifically to a
  • a user may request several web pages in sequence from a web browser. In such a
  • the browser requests the first web page from a server and the server loads
  • the browser When the user selects a second web page, the browser usually discards the
  • first web page from its local memory and requests the second web page from the server.
  • This method of accessing web pages occurs for each web page that the user selects
  • the server and browser may be higher than required when transmitting a web page that is similar to the currently displayed web page.
  • the present invention overcomes this waste
  • the invention features a system and a method that reduces the amount of data sent
  • a user utilizes a browser to request from a proxy a first
  • the web page having a first content.
  • the first content includes a first web link that invokes a
  • the proxy sends the request for a second web page having a second content.
  • the proxy sends the request for
  • the web page interface loads the first web
  • the proxy '
  • the proxy modifies the first web link to point to
  • the proxy then stores the modified first content of the first web page in
  • the 'script routine first transmits the request for the second
  • the proxy forwards this request to the web page interface and the web page interface returns the second web page having the second content to the
  • the proxy scans the second web page for web links that point to similar web
  • the proxy then obtains the differences between the first and second web pages and
  • FIG. 1 is a block diagram of an embodiment of the system used to access two
  • FIGS. 2 A and 2B are sections of a flow diagram illustrating an embodiment of the
  • the network system in one embodiment, the network system, in one embodiment,
  • server computer 50 in communication with a client computer 10
  • a user wishing to access a first web page performs an action on the client 10
  • the user may use a
  • the server 50 loads the first web page into its
  • the client 10 displays the first web page to the user.
  • the browser 20 of the client 10 similarly to the browser 20 of the client 10 .
  • first web page determines the differences between the two web pages. If the differences
  • the server 50 compresses the differences between the two web pages
  • the browser 20 then displays the
  • the server 50 is in communication with a persistent storage device
  • the server 50 further includes a web page interface 40 in communication with a
  • the proxy 30 is in communication with the browser 20 over a communication
  • the client 10 uses the browser 20 to make a first request to
  • the proxy 30 for a first web page over the communication channel 15. The proxy 30 then
  • the proxy 30 must receive the web page content in clear, that is, not
  • the proxy 30' is located on
  • the proxy 30' communicates with the server 50 over a second
  • the proxy 30 obtains the first web page, in one embodiment the proxy 30
  • the modified reference calls a script routine.
  • the script routine is software that the proxy
  • the proxy 30 embeds within the first web page. Then the proxy 30 stores a copy of the modified
  • the proxy 30 stores the first web page in its unmodified state. The proxy 30 then sends the modified first web page,
  • the client 10 then displays the first web page.
  • the client 10 then poses a second request to the server 50 for a second web page.
  • the web page interface 40 loads from the storage device 60 the
  • the proxy 30 determines the differences between the first web page
  • proxy 30 compresses the differences and transmits the compressed differences to the
  • client 10 decompresses the differences and displays a web page corresponding to the
  • the user selects (step 200) a first web page PI that the user
  • the client 10 sends (step 205) a
  • the web page interface 40 loads (step 215) the first web page PI into its memory from the storage device 60.
  • the web page interface 40 loads (step 215) the first web page PI into its memory from the storage device 60.
  • the web page interface 40 creates (step 215) the first web page PI.
  • the web page interface 40 transmits (step
  • the proxy 30 initially scans the first
  • web page PI searches for web links or other calls to other web pages (referred to
  • web links which, if selected, result in the first web page PI being replaced
  • the proxy 30 determines if it is likely
  • the heuristic program uses a predetermined criteria to determine
  • the heuristic program computes the
  • the program determines that the two web pages are similar. In another embodiment, the heuristic program uses the page names of the two web
  • the heuristic program compares the pathname of
  • a web server e.g., a servlet or Active
  • the heuristic program uses a meta tag criteria as the
  • Meta tags are a general mechanism for specifying attributes of
  • web pages are typically used by web browsers 20 and readers of HTML source code.
  • a meta tag can be added to a web page denoting whether a web page is
  • a programmer can add meta tags to web pages manually or to the scripts that
  • the proxy 30 uses meta tags to denote a .
  • tags are added to sets of web pages, such as a
  • the proxy 30 determines similar web pages by keeping a
  • meta tag such as OneOfSet'
  • OneOfSet' is included within the web pages to indicate to the heuristic
  • the proxy 30 maintains two
  • the first database 48 includes
  • 49 contains information relating two or more web pages (e.g., similar / dissimilar).
  • the remote proxy 30' determines similar web
  • the proxy 30 maps the initial web page A to the value of the
  • OneOfSet' meta tag e.g., initial web page A -> ShoppingBasket.
  • the value of the meta tag may be a null value. If the initial web page A has a web
  • the proxy 30 first consults the second database 49 to
  • the proxy 30 determines if the proxy 30 has previously deemed the initial web page A and the reference web page B to be similar. If the second database 49 contains information indicating that
  • the proxy 30 modifies
  • script routine is invoked when the browser 20 requests the reference web page B. If the
  • second database 49 contains information indicating that the initial web page A is
  • the proxy 30 does not modify
  • script routine is not invoked when the browser 20 requests the reference web page B.
  • proxy 30 consults the first database 48. If the first database 48 has no information on the
  • the proxy 30 makes no decision regarding similarity between the
  • the proxy 30 employs one of the other previously described
  • heuristics e.g., compressibility and/or page names
  • the proxy 30 deems the initial web page A similar to - li ⁇
  • the proxy 30 considers web
  • script routine is invoked when the browser 20 requests the reference web page B.
  • the proxy 30 does not consider
  • a modified initial web page A can have some modified web links to web
  • the proxy 30 can alternatively use memory data structures or files stored on a local disk
  • the proxy 30 employs
  • the proxy 30 can alternatively use a single database or
  • the heuristic program can be optimistic; that is, the
  • heuristic program on the proxy 30 assumes that a web link results in a similar web page. For example, if the heuristic program uses the page name criteria, the heuristic program
  • the heuristic program employs the page
  • the proxy 30 retrieves the second web
  • the proxy 30 updates the
  • the proxy 30 can employ the
  • the proxy 30 at this point can examine the second web page to
  • the proxy 30 can store this information in the
  • proxy 30 uses the heuristic program and determines that a web link refers to
  • the proxy 30 modifies (step 225) the first
  • the script routine is software written in JavaScript, a scripting
  • script routine can be written in any computer language so
  • goGetltQ is a JavaScript
  • the proxy 30 responds with either the second web page P2 or the
  • a Submit button (used for
  • goGetForm() is a JavaScript
  • proxy 3Q can modify these web page buttons as described above.
  • proxy 30 then stores (step 230) a copy of the modified first web page PI in its local
  • first web page PI with a second web page are more accurate because the proxy 30 does
  • the proxy 30 marks its copy of the first web page PI to indicate to which client 10 the proxy
  • the proxy 30 then sends (step 235) the first web page PI to the browser 20 over
  • the communication channel 15 and the browser 20 displays (step 240) the first web page
  • the proxy 30 compresses the first web page
  • step 245 If the user then selects (step 245) a second
  • routine transmits (step 255) the second request to the proxy 30.
  • the second web page P2 transmitted by the script routine is a different request than the
  • first request for the first web page PI For example, a first request transmitted by the
  • special name (e.g., "special name) to invoke a servlet or other software to calculate the
  • the script routine also notifies the proxy 30 to compare the currently displayed
  • the script routine also notifies the browser 20 to open a non-displayed window in which the differences between the
  • first web page PI and second web page P2 are stored. In this way, the displayed first
  • the proxy 30 again forwards (step 260) the request (e.g., the second request for the
  • the web page interface 40 creates or
  • the proxy 30 next modifies (step 275) the web links in the
  • the proxy 30 used to modify the web links in the first web page PI .
  • step 280 stores (step 280) the modified second web page P2 and deletes the previously
  • proxy 30 modifies (step 275) the second web age P2 after storing (step 280) the second
  • the proxy 30 calculates the differences between the first web
  • the proxy 30 considers the contents of the two web pages as
  • HTML elements trees of HTML elements. Examples of HTML elements are web links and characters.
  • each element in a tree is referred to as a node.
  • a parent node is a node
  • leaf node node that has one or more children nodes. Nodes that have no children are called leaf
  • the proxy 30 compares the trees for common leaves and
  • the proxy 30 then compresses (step 285) the differences between the first web
  • the proxy 30 concludes that the compressed differences are not smaller than the compressed second web page P2, then the proxy 30 sends the compressed second web page P2 to the client
  • the proxy 30 updates the
  • the proxy 30 updates (step 285) the second database
  • the proxy 30 denotes in
  • the second database 49 that the web pages are similar e.g., first web page PI, second
  • the proxy 30 denotes in the second
  • database 49 that the web pages are dissimilar e.g., first web page PI, second web page
  • the proxy 30 does not compute the differences between the first web page PI and the second web page P2 and therefore does not update
  • the proxy 30 computes the differences
  • the heuristic program uses the first and second database 49 to check the
  • the proxy 30 sends (step 295) the compressed differences between the
  • the proxy 30 also discards (step 290) the stored copy of
  • the proxy 30 sends the compressed
  • predetermined threshold such as by a predetermined number of bytes.
  • the proxy 30 does not compress the second web page P2 and therefore does
  • the proxy 30 always transmits the compressed differences to the client 10. i While the proxy 30 is implementing step 260 through step 295, the script routine
  • the browser 20 decompresses the compressed data
  • step 297 the second web page P2 by incorporating the differences between the first web
  • the first web page PI is capable of modifying itself with
  • the browser 20 stores an original, copy of the first web page PI to allow a
  • the proxy 30 performs a Unix "diff ' command to
  • the browser 20 uses a
  • the browser 20 then discards (step 298) the unneeded first
  • the proxy 30 considers the contents
  • the browser 20 modifies the displayed first web page

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

L'invention concerne un procédé et un système de réduction de la largeur de bande nécessaire à la transmission de pages web à un navigateur. Ce procédé comprend les étapes suivantes: un navigateur demande à un mandataire une première page web comportant un premier contenu, et une interface de page web charge et transmet cette première page web au mandataire lequel conserve une copie de la première page dans sa mémoire locale. Si le navigateur demande une seconde page web semblable dans son contenu à la première page, le mandataire détermine les différences entre le contenu de la première page web et celui de la seconde page web après que le mandataire ait reçu cette seconde page web à partir de l'interface de page web. Puis le mandataire transmet alors ces différences au navigateur, lequel incorpore lesdites différences dans la première page web afin de créer la seconde page web.
PCT/US2001/027647 2000-09-12 2001-09-07 Systeme et procede d'acces a des pages web WO2002023401A2 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001288820A AU2001288820A1 (en) 2000-09-12 2001-09-07 A system and method for accessing web pages

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US66001000A 2000-09-12 2000-09-12
US09/660,010 2000-09-12

Publications (2)

Publication Number Publication Date
WO2002023401A2 true WO2002023401A2 (fr) 2002-03-21
WO2002023401A3 WO2002023401A3 (fr) 2003-07-31

Family

ID=24647756

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/027647 WO2002023401A2 (fr) 2000-09-12 2001-09-07 Systeme et procede d'acces a des pages web

Country Status (2)

Country Link
AU (1) AU2001288820A1 (fr)
WO (1) WO2002023401A2 (fr)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2890815A1 (fr) * 2005-09-14 2007-03-16 Streamezzo Sa Procede de transmission d'un contenu multimedia vers un terminal de radiocommunication, programme d'ordinateur, signal, terminal de radiocommunication et serveur de diffusion correspondants
WO2007065813A1 (fr) 2005-12-06 2007-06-14 International Business Machines Corporation Procede et systeme permettant de fournir des pages de portail asynchrones
CN102298617A (zh) * 2011-08-02 2011-12-28 百度在线网络技术(北京)有限公司 一种用于获取目标页面的方法与设备
WO2013152084A1 (fr) * 2012-04-03 2013-10-10 Google Inc. Système et procédé de de distribution de contenu sur un réseau
CN103618787A (zh) * 2013-11-26 2014-03-05 优视科技有限公司 一种网页展现系统和方法
US10747951B2 (en) 2013-11-26 2020-08-18 Uc Mobile Co., Ltd. Webpage template generating method and server

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0836145A2 (fr) * 1996-10-11 1998-04-15 AT&T Corp. Méthode de transfert et d'affichage de pages de données dans un réseau de données
US6052730A (en) * 1997-01-10 2000-04-18 The Board Of Trustees Of The Leland Stanford Junior University Method for monitoring and/or modifying web browsing sessions

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0836145A2 (fr) * 1996-10-11 1998-04-15 AT&T Corp. Méthode de transfert et d'affichage de pages de données dans un réseau de données
US6052730A (en) * 1997-01-10 2000-04-18 The Board Of Trustees Of The Leland Stanford Junior University Method for monitoring and/or modifying web browsing sessions

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
FLOYD R ET AL: "MOBILE WEB ACCESS USING ENETWORK WEB EXPRESS" IEEE PERSONAL COMMUNICATIONS, IEEE COMMUNICATIONS SOCIETY, US, vol. 5, no. 5, 1 October 1998 (1998-10-01), pages 47-52, XP000786616 ISSN: 1070-9916 *
MUN CHOON CHAN ET AL: "Cache-based compaction: a new technique for optimizing Web transfer" INFOCOM '99. EIGHTEENTH ANNUAL JOINT CONFERENCE OF THE IEEE COMPUTER AND COMMUNICATIONS SOCIETIES. PROCEEDINGS. IEEE NEW YORK, NY, USA 21-25 MARCH 1999, PISCATAWAY, NJ, USA,IEEE, US, 21 March 1999 (1999-03-21), pages 117-125, XP010323762 ISBN: 0-7803-5417-6 *
WILLIAMS S: "HTTP: Delta-Encoding Notes" INTERNET, 17 January 1997 (1997-01-17), XP002157520 Retrieved from the Internet: <URL:htp://ei.cs.vt.edu/williams/DIFF/prel im.html> [retrieved on 2001-01-16] *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2890815A1 (fr) * 2005-09-14 2007-03-16 Streamezzo Sa Procede de transmission d'un contenu multimedia vers un terminal de radiocommunication, programme d'ordinateur, signal, terminal de radiocommunication et serveur de diffusion correspondants
WO2007031570A1 (fr) * 2005-09-14 2007-03-22 Streamezzo Transmission d' un contenu multimedia vers un terminal de radiocommunication
US8437690B2 (en) 2005-09-14 2013-05-07 Streamezzo Transmission of a multimedia content to a radiocommunication terminal
WO2007065813A1 (fr) 2005-12-06 2007-06-14 International Business Machines Corporation Procede et systeme permettant de fournir des pages de portail asynchrones
US8099518B2 (en) 2005-12-06 2012-01-17 International Business Machines Corporation Method and system for providing asynchronous portal pages
CN102298617A (zh) * 2011-08-02 2011-12-28 百度在线网络技术(北京)有限公司 一种用于获取目标页面的方法与设备
WO2013017009A1 (fr) * 2011-08-02 2013-02-07 百度在线网络技术(北京)有限公司 Procédé d'obtention d'une page cible et équipement associé
WO2013152084A1 (fr) * 2012-04-03 2013-10-10 Google Inc. Système et procédé de de distribution de contenu sur un réseau
CN103618787A (zh) * 2013-11-26 2014-03-05 优视科技有限公司 一种网页展现系统和方法
US10747951B2 (en) 2013-11-26 2020-08-18 Uc Mobile Co., Ltd. Webpage template generating method and server

Also Published As

Publication number Publication date
AU2001288820A1 (en) 2002-03-26
WO2002023401A3 (fr) 2003-07-31

Similar Documents

Publication Publication Date Title
RU2589306C2 (ru) Управление сеансом удаленного просмотра
US8103746B2 (en) Method and system for distributing requests for content
US20020099785A1 (en) Enhanced multimedia mobile content delivery and message system using cache management
JP4865983B2 (ja) ネットワークサーバ
US8589385B2 (en) Historical browsing session management
US7284243B2 (en) Installing content specific filename systems
US8539330B2 (en) Method and system for dynamic web page breadcrumbing using javascript
US7747782B2 (en) System and method for providing and displaying information content
US20150100631A1 (en) Proactive transmission of network content
US6470345B1 (en) Replacement of substrings in file/directory pathnames with numeric tokens
US20130080421A1 (en) Historical browsing session management
US20080201405A1 (en) Method and System for Generating a Graphical Display for a Remote Terminal Session
US20020133566A1 (en) Enhanced multimedia mobile content delivery and message system using load balancing
US20010047394A1 (en) System, method, and computer program product for executing scripts on mobile devices
EP1164473A2 (fr) Gestion d&#39;états des objets de commandes serveur
US20020165925A1 (en) Predictive caching and highlighting of web pages
US20120203873A1 (en) Dynamic content assembly on edge-of-network servers in a content delivery network
US20060020883A1 (en) Web page personalization
US20020116534A1 (en) Personalized mobile device viewing system for enhanced delivery of multimedia
US9456048B2 (en) System, method, and computer program product for server side processing in a mobile device environment
EP2761506B1 (fr) Gestion de session de navigation historique
GB2347329A (en) Converting electronic documents into a format suitable for a wireless device
EP1187040A2 (fr) Placement en antémémoire d&#39;informations personnalisées
US20020052889A1 (en) Method for managing alterations of contents
US8489644B2 (en) System and method for managing virtual tree pages

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP