WO2002023401A2 - Systeme et procede d'acces a des pages web - Google Patents
Systeme et procede d'acces a des pages web Download PDFInfo
- Publication number
- WO2002023401A2 WO2002023401A2 PCT/US2001/027647 US0127647W WO0223401A2 WO 2002023401 A2 WO2002023401 A2 WO 2002023401A2 US 0127647 W US0127647 W US 0127647W WO 0223401 A2 WO0223401 A2 WO 0223401A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- web page
- content
- proxy
- differences
- web
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/957—Browsing optimisation, e.g. caching or content distillation
- G06F16/9574—Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/10—Protocols in which an application is distributed across nodes in the network
- H04L67/1095—Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
Definitions
- the invention relates in general to accessing web pages and more specifically to a
- a user may request several web pages in sequence from a web browser. In such a
- the browser requests the first web page from a server and the server loads
- the browser When the user selects a second web page, the browser usually discards the
- first web page from its local memory and requests the second web page from the server.
- This method of accessing web pages occurs for each web page that the user selects
- the server and browser may be higher than required when transmitting a web page that is similar to the currently displayed web page.
- the present invention overcomes this waste
- the invention features a system and a method that reduces the amount of data sent
- a user utilizes a browser to request from a proxy a first
- the web page having a first content.
- the first content includes a first web link that invokes a
- the proxy sends the request for a second web page having a second content.
- the proxy sends the request for
- the web page interface loads the first web
- the proxy '
- the proxy modifies the first web link to point to
- the proxy then stores the modified first content of the first web page in
- the 'script routine first transmits the request for the second
- the proxy forwards this request to the web page interface and the web page interface returns the second web page having the second content to the
- the proxy scans the second web page for web links that point to similar web
- the proxy then obtains the differences between the first and second web pages and
- FIG. 1 is a block diagram of an embodiment of the system used to access two
- FIGS. 2 A and 2B are sections of a flow diagram illustrating an embodiment of the
- the network system in one embodiment, the network system, in one embodiment,
- server computer 50 in communication with a client computer 10
- a user wishing to access a first web page performs an action on the client 10
- the user may use a
- the server 50 loads the first web page into its
- the client 10 displays the first web page to the user.
- the browser 20 of the client 10 similarly to the browser 20 of the client 10 .
- first web page determines the differences between the two web pages. If the differences
- the server 50 compresses the differences between the two web pages
- the browser 20 then displays the
- the server 50 is in communication with a persistent storage device
- the server 50 further includes a web page interface 40 in communication with a
- the proxy 30 is in communication with the browser 20 over a communication
- the client 10 uses the browser 20 to make a first request to
- the proxy 30 for a first web page over the communication channel 15. The proxy 30 then
- the proxy 30 must receive the web page content in clear, that is, not
- the proxy 30' is located on
- the proxy 30' communicates with the server 50 over a second
- the proxy 30 obtains the first web page, in one embodiment the proxy 30
- the modified reference calls a script routine.
- the script routine is software that the proxy
- the proxy 30 embeds within the first web page. Then the proxy 30 stores a copy of the modified
- the proxy 30 stores the first web page in its unmodified state. The proxy 30 then sends the modified first web page,
- the client 10 then displays the first web page.
- the client 10 then poses a second request to the server 50 for a second web page.
- the web page interface 40 loads from the storage device 60 the
- the proxy 30 determines the differences between the first web page
- proxy 30 compresses the differences and transmits the compressed differences to the
- client 10 decompresses the differences and displays a web page corresponding to the
- the user selects (step 200) a first web page PI that the user
- the client 10 sends (step 205) a
- the web page interface 40 loads (step 215) the first web page PI into its memory from the storage device 60.
- the web page interface 40 loads (step 215) the first web page PI into its memory from the storage device 60.
- the web page interface 40 creates (step 215) the first web page PI.
- the web page interface 40 transmits (step
- the proxy 30 initially scans the first
- web page PI searches for web links or other calls to other web pages (referred to
- web links which, if selected, result in the first web page PI being replaced
- the proxy 30 determines if it is likely
- the heuristic program uses a predetermined criteria to determine
- the heuristic program computes the
- the program determines that the two web pages are similar. In another embodiment, the heuristic program uses the page names of the two web
- the heuristic program compares the pathname of
- a web server e.g., a servlet or Active
- the heuristic program uses a meta tag criteria as the
- Meta tags are a general mechanism for specifying attributes of
- web pages are typically used by web browsers 20 and readers of HTML source code.
- a meta tag can be added to a web page denoting whether a web page is
- a programmer can add meta tags to web pages manually or to the scripts that
- the proxy 30 uses meta tags to denote a .
- tags are added to sets of web pages, such as a
- the proxy 30 determines similar web pages by keeping a
- meta tag such as OneOfSet'
- OneOfSet' is included within the web pages to indicate to the heuristic
- the proxy 30 maintains two
- the first database 48 includes
- 49 contains information relating two or more web pages (e.g., similar / dissimilar).
- the remote proxy 30' determines similar web
- the proxy 30 maps the initial web page A to the value of the
- OneOfSet' meta tag e.g., initial web page A -> ShoppingBasket.
- the value of the meta tag may be a null value. If the initial web page A has a web
- the proxy 30 first consults the second database 49 to
- the proxy 30 determines if the proxy 30 has previously deemed the initial web page A and the reference web page B to be similar. If the second database 49 contains information indicating that
- the proxy 30 modifies
- script routine is invoked when the browser 20 requests the reference web page B. If the
- second database 49 contains information indicating that the initial web page A is
- the proxy 30 does not modify
- script routine is not invoked when the browser 20 requests the reference web page B.
- proxy 30 consults the first database 48. If the first database 48 has no information on the
- the proxy 30 makes no decision regarding similarity between the
- the proxy 30 employs one of the other previously described
- heuristics e.g., compressibility and/or page names
- the proxy 30 deems the initial web page A similar to - li ⁇
- the proxy 30 considers web
- script routine is invoked when the browser 20 requests the reference web page B.
- the proxy 30 does not consider
- a modified initial web page A can have some modified web links to web
- the proxy 30 can alternatively use memory data structures or files stored on a local disk
- the proxy 30 employs
- the proxy 30 can alternatively use a single database or
- the heuristic program can be optimistic; that is, the
- heuristic program on the proxy 30 assumes that a web link results in a similar web page. For example, if the heuristic program uses the page name criteria, the heuristic program
- the heuristic program employs the page
- the proxy 30 retrieves the second web
- the proxy 30 updates the
- the proxy 30 can employ the
- the proxy 30 at this point can examine the second web page to
- the proxy 30 can store this information in the
- proxy 30 uses the heuristic program and determines that a web link refers to
- the proxy 30 modifies (step 225) the first
- the script routine is software written in JavaScript, a scripting
- script routine can be written in any computer language so
- goGetltQ is a JavaScript
- the proxy 30 responds with either the second web page P2 or the
- a Submit button (used for
- goGetForm() is a JavaScript
- proxy 3Q can modify these web page buttons as described above.
- proxy 30 then stores (step 230) a copy of the modified first web page PI in its local
- first web page PI with a second web page are more accurate because the proxy 30 does
- the proxy 30 marks its copy of the first web page PI to indicate to which client 10 the proxy
- the proxy 30 then sends (step 235) the first web page PI to the browser 20 over
- the communication channel 15 and the browser 20 displays (step 240) the first web page
- the proxy 30 compresses the first web page
- step 245 If the user then selects (step 245) a second
- routine transmits (step 255) the second request to the proxy 30.
- the second web page P2 transmitted by the script routine is a different request than the
- first request for the first web page PI For example, a first request transmitted by the
- special name (e.g., "special name) to invoke a servlet or other software to calculate the
- the script routine also notifies the proxy 30 to compare the currently displayed
- the script routine also notifies the browser 20 to open a non-displayed window in which the differences between the
- first web page PI and second web page P2 are stored. In this way, the displayed first
- the proxy 30 again forwards (step 260) the request (e.g., the second request for the
- the web page interface 40 creates or
- the proxy 30 next modifies (step 275) the web links in the
- the proxy 30 used to modify the web links in the first web page PI .
- step 280 stores (step 280) the modified second web page P2 and deletes the previously
- proxy 30 modifies (step 275) the second web age P2 after storing (step 280) the second
- the proxy 30 calculates the differences between the first web
- the proxy 30 considers the contents of the two web pages as
- HTML elements trees of HTML elements. Examples of HTML elements are web links and characters.
- each element in a tree is referred to as a node.
- a parent node is a node
- leaf node node that has one or more children nodes. Nodes that have no children are called leaf
- the proxy 30 compares the trees for common leaves and
- the proxy 30 then compresses (step 285) the differences between the first web
- the proxy 30 concludes that the compressed differences are not smaller than the compressed second web page P2, then the proxy 30 sends the compressed second web page P2 to the client
- the proxy 30 updates the
- the proxy 30 updates (step 285) the second database
- the proxy 30 denotes in
- the second database 49 that the web pages are similar e.g., first web page PI, second
- the proxy 30 denotes in the second
- database 49 that the web pages are dissimilar e.g., first web page PI, second web page
- the proxy 30 does not compute the differences between the first web page PI and the second web page P2 and therefore does not update
- the proxy 30 computes the differences
- the heuristic program uses the first and second database 49 to check the
- the proxy 30 sends (step 295) the compressed differences between the
- the proxy 30 also discards (step 290) the stored copy of
- the proxy 30 sends the compressed
- predetermined threshold such as by a predetermined number of bytes.
- the proxy 30 does not compress the second web page P2 and therefore does
- the proxy 30 always transmits the compressed differences to the client 10. i While the proxy 30 is implementing step 260 through step 295, the script routine
- the browser 20 decompresses the compressed data
- step 297 the second web page P2 by incorporating the differences between the first web
- the first web page PI is capable of modifying itself with
- the browser 20 stores an original, copy of the first web page PI to allow a
- the proxy 30 performs a Unix "diff ' command to
- the browser 20 uses a
- the browser 20 then discards (step 298) the unneeded first
- the proxy 30 considers the contents
- the browser 20 modifies the displayed first web page
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2001288820A AU2001288820A1 (en) | 2000-09-12 | 2001-09-07 | A system and method for accessing web pages |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US66001000A | 2000-09-12 | 2000-09-12 | |
US09/660,010 | 2000-09-12 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2002023401A2 true WO2002023401A2 (fr) | 2002-03-21 |
WO2002023401A3 WO2002023401A3 (fr) | 2003-07-31 |
Family
ID=24647756
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2001/027647 WO2002023401A2 (fr) | 2000-09-12 | 2001-09-07 | Systeme et procede d'acces a des pages web |
Country Status (2)
Country | Link |
---|---|
AU (1) | AU2001288820A1 (fr) |
WO (1) | WO2002023401A2 (fr) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2890815A1 (fr) * | 2005-09-14 | 2007-03-16 | Streamezzo Sa | Procede de transmission d'un contenu multimedia vers un terminal de radiocommunication, programme d'ordinateur, signal, terminal de radiocommunication et serveur de diffusion correspondants |
WO2007065813A1 (fr) | 2005-12-06 | 2007-06-14 | International Business Machines Corporation | Procede et systeme permettant de fournir des pages de portail asynchrones |
CN102298617A (zh) * | 2011-08-02 | 2011-12-28 | 百度在线网络技术(北京)有限公司 | 一种用于获取目标页面的方法与设备 |
WO2013152084A1 (fr) * | 2012-04-03 | 2013-10-10 | Google Inc. | Système et procédé de de distribution de contenu sur un réseau |
CN103618787A (zh) * | 2013-11-26 | 2014-03-05 | 优视科技有限公司 | 一种网页展现系统和方法 |
US10747951B2 (en) | 2013-11-26 | 2020-08-18 | Uc Mobile Co., Ltd. | Webpage template generating method and server |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0836145A2 (fr) * | 1996-10-11 | 1998-04-15 | AT&T Corp. | Méthode de transfert et d'affichage de pages de données dans un réseau de données |
US6052730A (en) * | 1997-01-10 | 2000-04-18 | The Board Of Trustees Of The Leland Stanford Junior University | Method for monitoring and/or modifying web browsing sessions |
-
2001
- 2001-09-07 AU AU2001288820A patent/AU2001288820A1/en not_active Abandoned
- 2001-09-07 WO PCT/US2001/027647 patent/WO2002023401A2/fr active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0836145A2 (fr) * | 1996-10-11 | 1998-04-15 | AT&T Corp. | Méthode de transfert et d'affichage de pages de données dans un réseau de données |
US6052730A (en) * | 1997-01-10 | 2000-04-18 | The Board Of Trustees Of The Leland Stanford Junior University | Method for monitoring and/or modifying web browsing sessions |
Non-Patent Citations (3)
Title |
---|
FLOYD R ET AL: "MOBILE WEB ACCESS USING ENETWORK WEB EXPRESS" IEEE PERSONAL COMMUNICATIONS, IEEE COMMUNICATIONS SOCIETY, US, vol. 5, no. 5, 1 October 1998 (1998-10-01), pages 47-52, XP000786616 ISSN: 1070-9916 * |
MUN CHOON CHAN ET AL: "Cache-based compaction: a new technique for optimizing Web transfer" INFOCOM '99. EIGHTEENTH ANNUAL JOINT CONFERENCE OF THE IEEE COMPUTER AND COMMUNICATIONS SOCIETIES. PROCEEDINGS. IEEE NEW YORK, NY, USA 21-25 MARCH 1999, PISCATAWAY, NJ, USA,IEEE, US, 21 March 1999 (1999-03-21), pages 117-125, XP010323762 ISBN: 0-7803-5417-6 * |
WILLIAMS S: "HTTP: Delta-Encoding Notes" INTERNET, 17 January 1997 (1997-01-17), XP002157520 Retrieved from the Internet: <URL:htp://ei.cs.vt.edu/williams/DIFF/prel im.html> [retrieved on 2001-01-16] * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2890815A1 (fr) * | 2005-09-14 | 2007-03-16 | Streamezzo Sa | Procede de transmission d'un contenu multimedia vers un terminal de radiocommunication, programme d'ordinateur, signal, terminal de radiocommunication et serveur de diffusion correspondants |
WO2007031570A1 (fr) * | 2005-09-14 | 2007-03-22 | Streamezzo | Transmission d' un contenu multimedia vers un terminal de radiocommunication |
US8437690B2 (en) | 2005-09-14 | 2013-05-07 | Streamezzo | Transmission of a multimedia content to a radiocommunication terminal |
WO2007065813A1 (fr) | 2005-12-06 | 2007-06-14 | International Business Machines Corporation | Procede et systeme permettant de fournir des pages de portail asynchrones |
US8099518B2 (en) | 2005-12-06 | 2012-01-17 | International Business Machines Corporation | Method and system for providing asynchronous portal pages |
CN102298617A (zh) * | 2011-08-02 | 2011-12-28 | 百度在线网络技术(北京)有限公司 | 一种用于获取目标页面的方法与设备 |
WO2013017009A1 (fr) * | 2011-08-02 | 2013-02-07 | 百度在线网络技术(北京)有限公司 | Procédé d'obtention d'une page cible et équipement associé |
WO2013152084A1 (fr) * | 2012-04-03 | 2013-10-10 | Google Inc. | Système et procédé de de distribution de contenu sur un réseau |
CN103618787A (zh) * | 2013-11-26 | 2014-03-05 | 优视科技有限公司 | 一种网页展现系统和方法 |
US10747951B2 (en) | 2013-11-26 | 2020-08-18 | Uc Mobile Co., Ltd. | Webpage template generating method and server |
Also Published As
Publication number | Publication date |
---|---|
AU2001288820A1 (en) | 2002-03-26 |
WO2002023401A3 (fr) | 2003-07-31 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
RU2589306C2 (ru) | Управление сеансом удаленного просмотра | |
US8103746B2 (en) | Method and system for distributing requests for content | |
US20020099785A1 (en) | Enhanced multimedia mobile content delivery and message system using cache management | |
JP4865983B2 (ja) | ネットワークサーバ | |
US8589385B2 (en) | Historical browsing session management | |
US7284243B2 (en) | Installing content specific filename systems | |
US8539330B2 (en) | Method and system for dynamic web page breadcrumbing using javascript | |
US7747782B2 (en) | System and method for providing and displaying information content | |
US20150100631A1 (en) | Proactive transmission of network content | |
US6470345B1 (en) | Replacement of substrings in file/directory pathnames with numeric tokens | |
US20130080421A1 (en) | Historical browsing session management | |
US20080201405A1 (en) | Method and System for Generating a Graphical Display for a Remote Terminal Session | |
US20020133566A1 (en) | Enhanced multimedia mobile content delivery and message system using load balancing | |
US20010047394A1 (en) | System, method, and computer program product for executing scripts on mobile devices | |
EP1164473A2 (fr) | Gestion d'états des objets de commandes serveur | |
US20020165925A1 (en) | Predictive caching and highlighting of web pages | |
US20120203873A1 (en) | Dynamic content assembly on edge-of-network servers in a content delivery network | |
US20060020883A1 (en) | Web page personalization | |
US20020116534A1 (en) | Personalized mobile device viewing system for enhanced delivery of multimedia | |
US9456048B2 (en) | System, method, and computer program product for server side processing in a mobile device environment | |
EP2761506B1 (fr) | Gestion de session de navigation historique | |
GB2347329A (en) | Converting electronic documents into a format suitable for a wireless device | |
EP1187040A2 (fr) | Placement en antémémoire d'informations personnalisées | |
US20020052889A1 (en) | Method for managing alterations of contents | |
US8489644B2 (en) | System and method for managing virtual tree pages |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
122 | Ep: pct application non-entry in european phase | ||
NENP | Non-entry into the national phase in: |
Ref country code: JP |