CN103577433A - Intelligent page browsing method, system and device - Google Patents

Intelligent page browsing method, system and device Download PDF

Info

Publication number
CN103577433A
CN103577433A CN201210262968.2A CN201210262968A CN103577433A CN 103577433 A CN103577433 A CN 103577433A CN 201210262968 A CN201210262968 A CN 201210262968A CN 103577433 A CN103577433 A CN 103577433A
Authority
CN
China
Prior art keywords
content blocks
content
proxy server
rule
page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210262968.2A
Other languages
Chinese (zh)
Inventor
刘德超
朱晋良
徐新意
秦峰
薛晶晶
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210262968.2A priority Critical patent/CN103577433A/en
Publication of CN103577433A publication Critical patent/CN103577433A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9577Optimising the visualization of content, e.g. distillation of HTML documents

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides an intelligent page browsing method. The method includes: a proxy server receives a request of a user accessing a third party website page through a mobile terminal; the proxy server acquires the page of a third party website; the proxy server analyzes the page to acquire multiple content blocks and a frame structure in the page; the proxy server analyzes the multiple content blocks to determine attributes and/or contents of the content blocks; the proxy server select parts of the content blocks from the content blocks according to corresponding view rules of the mobile terminal and the attributes and/or contents in the content blocks, and transmits the frame structure and the parts of content blocks to the mobile terminal; the mobile terminal adds the parts of contents to the frame structure and displays the parts of content blocks. The contents of the website page are screened and reconstructed according to the view rules, so that the website page is simpler and safer, and user experience is good. The invention further discloses an intelligent page browsing system and a proxy server.

Description

The intelligent browsing method of the page, system and device
Technical field
The present invention relates to field of computer technology, particularly the intelligent browsing method of a kind of page, system and device.
Background technology
In recent years, the mobile terminals such as mobile phone emerge rapidly, and towards intellectuality, multi-functional development.The mobile terminal accessing websites such as increasing user's choice for use mobile phone.Constantly perfect along with mobile terminal function, the browser on the software of many mobile terminals, particularly mobile terminal all supports directly to open the webpage of PC version.Yet different from PC, mobile terminal is limited to the factors such as machine performance, screen size, mode of operation, therefore by mobile terminal, browses PC version webpage and still exist user to experience not good case.PC webpage be not only inconsistent with the screen of mobile terminal aspect page structure and demonstration, and comprised redundant information a large amount of and that main body is irrelevant, advertising message and dangerous link, not only affecting the page shows, and a large amount of garbage consumption take the storage space of mobile terminal, to mobile terminal, cause load pressure.
Summary of the invention
Object of the present invention is intended at least solve one of above-mentioned technological deficiency.
For this reason, first object of the present invention is to propose the intelligent browsing method of a kind of page, and the method is according to browsing rule, by proxy server, obtain the page, and web page contents is screened and reconstruct, web page contents is simplified more, thereby the demonstration being optimized on mobile terminal.
Second object of the present invention is to propose the intelligent browing system of a kind of page.
The 3rd object of the present invention is to propose a kind of proxy server.
For achieving the above object, the embodiment of first aspect present invention discloses the intelligent browsing method of a kind of page, comprising: proxy server receives user by the request of mobile terminal accessing third party site page; Described proxy server obtains the page of described third party's website; Described proxy server obtains a plurality of content blocks and the framed structure in the described page to described page analysis; Described proxy server is determined attribute and/or the content in described a plurality of content blocks to described a plurality of content blocks analyses respectively; Described proxy server is selected part content blocks according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks, and described framed structure and described partial content piece are sent to described mobile terminal; Described mobile terminal is added into described partial content piece among described framed structure, and shows described partial content piece.
According to the intelligent browsing method of the page of the embodiment of the present invention by third party's site page content is carried out to analysis and selection, content of pages is simplified and reconstruct, make it be more suitable for showing at mobile terminal, the user who effectively improves mobile terminal browsing page experiences.Can also reduce the data traffic of mobile terminal, alleviate the load pressure of mobile terminal simultaneously.
In one embodiment of the invention, the described rule of browsing comprises that main body browses rule, and described proxy server selects part content blocks further to comprise according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks: attribute and/or content that described proxy server is browsed in regular and described a plurality of content blocks according to described main body are determined the content blocks that belongs to main part in described a plurality of content blocks; The content blocks that belongs to main part described in described proxy server selection.Main body is browsed rule and can be retained and subject correlation message, filters in the page redundant information irrelevant with main body, thereby simplifies content of pages, for user saves flow.
In one embodiment of the invention, the described rule of browsing comprises without advertisement and browses rule, and described proxy server selects part content blocks further to comprise according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks: described proxy server acquisition characteristic of advertisement storehouse; Described proxy server screens described a plurality of content blocks according to the attribute in described characteristic of advertisement storehouse and described a plurality of content blocks and/or content; Described proxy server is by the partial content block delete mating with described characteristic of advertisement storehouse in described a plurality of content blocks; Described proxy server is using the content blocks after screening as the content blocks of selecting.Without advertisement, browse rule and remove irrelevant contents, reduce page layout structure by filtering advertisements information.
In one embodiment of the invention, the described rule of browsing comprises the safe rule of browsing, and described proxy server selects part content blocks further to comprise according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks: described proxy server obtains the content in described a plurality of content blocks;
Described proxy server extracts the link in described a plurality of content blocks; Described proxy server judges whether the link in described content blocks is secure link; If it is not secure link that described proxy server judges described link, described proxy server is by the content blocks using other guide piece as selection after the content blocks deletion under described dangerous link.Safety is browsed rule by the security of link is analyzed to the safety of guaranteeing web page contents, thus the safety of protection mobile terminal.
In one embodiment of the invention, also comprise: described proxy server obtains the content in described a plurality of content blocks; Described proxy server extracts the link in described a plurality of content blocks; Described proxy server records described user to the click behavior linking in described a plurality of content blocks, and according to described user's click behavior, generates described user's the custom of browsing, and browses rule according to described individual character corresponding to the custom described user of generation of browsing.
The embodiment of second aspect present invention discloses a kind of page intelligence browing system, comprises mobile terminal and proxy server.Described mobile terminal, for sending the request of access third party site page, and receive framed structure and the partial content piece of the described page that described proxy server sends, and described partial content piece is added among described framed structure, and show described partial content piece; Described proxy server, for obtain the page of described third party's website according to described request, and described page analysis is obtained to a plurality of content blocks and the framed structure in the described page, and respectively described a plurality of content blocks analyses are determined to attribute and/or the content in described a plurality of content blocks, with according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks, from described a plurality of content blocks, select part content blocks, and described framed structure and described partial content piece are sent to described mobile terminal.
According to the page intelligence browing system of the embodiment of the present invention by third party's site page content is carried out to analysis and selection, according to browsing rule, content of pages is simplified and reconstruct, make it be more suitable for showing at mobile terminal, the user who effectively improves mobile terminal browsing page experiences.This system can also reduce the data traffic of mobile terminal simultaneously, alleviates the load pressure of mobile terminal.
In one embodiment of the invention, the described rule of browsing comprises that main body browses rule, attribute and/or content that described proxy server is browsed in rule and described a plurality of content blocks according to described main body are determined the content blocks that belongs to main part in described a plurality of content blocks, and described in selecting, belong to the content blocks of main part.Main body is browsed rule and can be retained and subject correlation message, filters in the page redundant information irrelevant with main body, thereby simplifies content of pages.
In one embodiment of the invention, described proxy server carries out word segmentation processing to the content in described a plurality of content blocks, and according to the physical meaning of content in the described a plurality of content blocks of word segmentation result judgement, and when the physical meaning of content blocks and described main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
In one embodiment of the invention, the described rule of browsing comprises without advertisement and browses rule, described proxy server screens described a plurality of content blocks according to the attribute in characteristic of advertisement storehouse and described a plurality of content blocks and/or content, and by the partial content block delete mating with described characteristic of advertisement storehouse in described a plurality of content blocks, and using the content blocks after screening as the content blocks of selecting.Without advertisement, browse rule and remove irrelevant contents, reduce page layout structure by filtering advertisements information.
In one embodiment of the invention, the described rule of browsing comprises the safe rule of browsing, described proxy server judges whether the link in content blocks is secure link, and when described link is not secure link, the content blocks using other guide piece as selection after the content blocks under described dangerous link is deleted.Safety is browsed rule by the security of link is analyzed to the safety of guaranteeing web page contents, thus the safety of protection mobile terminal.
In one embodiment of the invention, described proxy server is also for extracting the link of described a plurality of content blocks, and record described user to the click behavior linking in described a plurality of content blocks, and the custom of browsing that generates described user according to described user's click behavior, and browse rule according to described individual character corresponding to the custom described user of generation of browsing.
In one embodiment of the invention, between described mobile terminal and described proxy server, by Json-rpc form, communicate.
The embodiment of third aspect present invention discloses a kind of proxy server, comprising: receiver module, for obtain the page of described third party's website according to the request of mobile terminal transmission; Analysis module, for described page analysis being obtained to a plurality of content blocks and the framed structure of the described page, and determines attribute and/or the content in described a plurality of content blocks to described a plurality of content blocks analyses respectively; Select module, for selecting part content blocks according to described mobile terminal corresponding attribute and/or the content of browsing rule and described a plurality of content blocks from described a plurality of content blocks; Sending module, for being sent to described mobile terminal by described framed structure and described partial content piece.
According to the proxy server of the embodiment of the present invention by browsing rule, third party's site page content is carried out to analysis and selection, content of pages is simplified and reconstruct, make it be more suitable for showing at mobile terminal, the user who effectively improves mobile terminal browsing page experiences.
In one embodiment of the invention, the described rule of browsing comprises that main body browses rule, attribute and/or content that described selection module is browsed in rule and described a plurality of content blocks according to described main body are determined the content blocks that belongs to main part in described a plurality of content blocks, and described in selecting, belong to the content blocks of main part.Main body is browsed rule and can be retained and subject correlation message, filters in the page redundant information irrelevant with main body, thereby simplifies content of pages.
In one embodiment of the invention, the described rule of browsing comprises without advertisement and browses rule, described selection module is screened described a plurality of content blocks according to the attribute in characteristic of advertisement storehouse and described a plurality of content blocks and/or content, and by the partial content block delete mating with described characteristic of advertisement storehouse in described a plurality of content blocks, and using the content blocks after screening as the content blocks of selecting.Without advertisement, browse rule and remove irrelevant contents, reduce page layout structure by filtering advertisements information.
In one embodiment of the invention, the described rule of browsing comprises the safe rule of browsing, described selection module judges whether the link in content blocks is secure link, and when described link is not secure link, the content blocks using other guide piece as selection after the content blocks under described dangerous link is deleted.Safety is browsed rule by the security of link is analyzed to the safety of guaranteeing web page contents, thus the safety of protection mobile terminal.
In one embodiment of the invention, also comprise generation module, for the link of extracting described a plurality of content blocks, and record described user to the click behavior linking in described a plurality of content blocks, and the custom of browsing that generates described user according to described user's click behavior, and browse rule according to described individual character corresponding to the custom described user of generation of browsing.
The aspect that the present invention is additional and advantage in the following description part provide, and part will become obviously from the following description, or recognize by practice of the present invention.
Accompanying drawing explanation
Above-mentioned and/or the additional aspect of the present invention and advantage will become from the following description of the accompanying drawings of embodiments and obviously and easily understand, wherein:
Fig. 1 is the process flow diagram intention according to the intelligent browsing method of the page of the embodiment of the present invention;
Fig. 2 is according to the structural representation of the page intelligence browing system of the embodiment of the present invention;
Fig. 3 is according to the structural representation of the proxy server of the embodiment of the present invention; With
Fig. 4 is according to the slice map of the proxy server of the embodiment of the present invention.
Embodiment
Describe embodiments of the invention below in detail, the example of described embodiment is shown in the drawings, and wherein same or similar label represents same or similar element or has the element of identical or similar functions from start to finish.Below by the embodiment being described with reference to the drawings, be exemplary, only for explaining the present invention, and can not be interpreted as limitation of the present invention.
Below with reference to Fig. 1, describe according to the intelligent browsing method of the page of the embodiment of the present invention, comprise the following steps:
Step S110: receive user by the request of mobile terminal accessing third party site page.
In one embodiment of the invention, between mobile terminal and proxy server, by Json-rpc form, communicate.For example, user can the request to third party's site page by the browser input on mobile terminal.Be understandable that, the browser on above-mentioned mobile terminal is only for exemplary purposes, rather than in order to limit the present invention.Mobile terminal can also pass through the client-access third party website of other type.
Step S120: proxy server obtains the page of third party's website.
For example, the browser on mobile terminal is by open rpc (Remote Procedure Call Protocol, the remote procedure call protocol) interface of SmartLayout in Json-rpc invokes proxy server.Proxy server asks to resolve to rpc, and the request address that obtains third party's website also sends request to third party's website, thereby obtains the page of third party's website.
Step S130: proxy server obtains a plurality of content blocks and the framed structure in the page to page analysis.
For example, proxy server cuts the page according to structure of web page, thereby extracts different page plates.
Step S140: proxy server is determined attribute and/or the content in a plurality of content blocks to a plurality of content blocks analyses respectively.For example, proxy server is analyzed a plurality of content blocks according to the tag attributes in content blocks and content.
Step S150: proxy server is selected part content blocks according to mobile terminal corresponding attribute and/or the content browsed in rule and a plurality of content blocks from a plurality of content blocks, and framed structure and partial content piece are sent to mobile terminal.
For example, proxy server, according to browsing rules selection partial content piece, returns to client according to Json-rpc response format.Proxy server is by chosen content piece, and web page contents has been carried out to screening and framework again.According to different viewing rule, simplified web page contents, deleted redundant information, make web page contents be more suitable for showing on mobile terminal.
Wherein, in one embodiment of the invention, described in browse rule and comprise following four kinds:
(1) main body is browsed rule.It is mainly that web page contents is screened that main body is browsed rule, retains and main body related content, deletes with main body and does not want the redundant information of closing.
According to theme, browse rule selects part content blocks further to comprise from a plurality of content blocks:
Step S1511: attribute and/or content that proxy server is browsed in rule and a plurality of content blocks according to main body are determined the content blocks that belongs to main part in a plurality of content blocks.
Wherein, proxy server is browsed in rule and a plurality of content blocks according to main body attribute and/or content determine that the content blocks that belongs to main part in a plurality of content blocks further comprises:
Step S15111: proxy server obtains the content in a plurality of content blocks.
Step S15112: proxy server carries out word segmentation processing to the content in a plurality of content blocks, and according to word segmentation result, judge the physical meaning of content in a plurality of content blocks.
Particularly, in one embodiment of the invention, proxy server obtains the content in a plurality of content blocks, and content is carried out to word segmentation processing, and extracts feature according to word segmentation result.Then the feature of the page is analyzed, excavated, find out the main part of the page.Be understandable that, the above-mentioned method of finding out extraction page main part is only for exemplary purposes, rather than in order to limit the present invention.Proxy server can also extract the main part of the page by alternate manner.
Step S15113: if the physical meaning of content blocks and main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
Step S1512: described proxy server is using the content blocks after screening as the content blocks of selecting.
(2) without advertisement, browse rule.Without advertisement, browse rule and use disclosed characteristic of advertisement storehouse to filter advertisement, thereby reach the object of simplifying content of pages.
Proxy server selects part content blocks further to comprise according to mobile terminal corresponding attribute and/or the content browsed in rule and a plurality of content blocks from a plurality of content blocks:
Step S1521: proxy server obtains characteristic of advertisement storehouse.
Step S1522: proxy server screens a plurality of content blocks according to the attribute in characteristic of advertisement storehouse and a plurality of content blocks and/or content.
Step S1523: proxy server is by the partial content block delete mating with characteristic of advertisement storehouse in a plurality of content blocks.
Step S1524: proxy server is using the content blocks after screening as the content blocks of selecting.
Step S1525: proxy server selects to belong to the content blocks of main part.
(3) safety is browsed rule.Safety is browsed rule by the link in verification webpage, thus the security that improves webpage and mobile terminal.
Proxy server selects part content blocks further to comprise according to mobile terminal corresponding attribute and/or the content browsed in rule and a plurality of content blocks from a plurality of content blocks:
Step S1531: proxy server obtains the content in a plurality of content blocks.
Step S1532: proxy server extracts the link in a plurality of content blocks.
Step S1533: proxy server judges whether the link in content blocks is secure link.
Step S1534: if proxy server judgement link is not secure link, proxy server is by the content blocks using other guide piece as selection after the content blocks deletion under dangerous link.
(4) cool link rule.Cool link rule, by the record to user behavior custom, obtains and retains user's conventional link, retains for the important information of user.
Proxy server selects part content blocks further to comprise according to mobile terminal corresponding attribute and/or the content browsed in rule and a plurality of content blocks from a plurality of content blocks:
Step S1541: proxy server obtains the content in a plurality of content blocks.
Step S1542: proxy server extracts the link in a plurality of content blocks.
Step S1543: proxy server recording user is to the click behavior linking in a plurality of content blocks, and generate user's the custom of browsing according to user's click behavior, and generate individual character corresponding to user and browse rule according to browsing custom.
For example, proxy server to the click behavior linking in a plurality of content blocks, links behavioural analysis, excavation according to user, and the link frequently that counting user is clicked retains and preferential demonstration described link.
Step S160: mobile terminal is added into partial content piece among framed structure, and display section content blocks.
According to the intelligent browsing method of the page of the embodiment of the present invention, by intelligence being browsed to rule application in the PC page, by considering the factors such as mobile phone function, screen size, mode of operation, can effectively improve user's experience that mobile terminal is browsed the PC page.In addition, by proxy server, content of pages is screened and integrated, saved the data traffic of mobile terminal, alleviated the load pressure of mobile terminal.
Below with reference to Fig. 2, describe the page intelligence browing system 100 according to the embodiment of the present invention, comprise mobile terminal 110 and proxy server 120.Mobile terminal 110 is for sending the request of access third party site page, and framed structure and the partial content piece of the page that sends of Receiving Agent server 120, and partial content piece is added among framed structure, and display section content blocks; Proxy server 120 is for obtaining the page of third party's website according to request, and page analysis is obtained to a plurality of content blocks and the framed structure in the page, and respectively a plurality of content blocks analyses are determined to attribute and/or the content in a plurality of content blocks, from a plurality of content blocks, select part content blocks with attribute and/or the content browsed in rule and a plurality of content blocks according to mobile terminal 110 correspondences, and framed structure and partial content piece are sent to mobile terminal.
Wherein, in one embodiment of the invention, browse rule and comprise that four kinds are browsed rule, are respectively:
(1) main body is browsed rule.It is mainly that web page contents is screened that main body is browsed rule, retains and main body related content, deletes with main body and does not want the redundant information of closing.
Attribute and/or content that proxy server 120 is browsed in rule and a plurality of content blocks according to main body are determined the content blocks that belongs to main part in a plurality of content blocks, and select to belong to the content blocks of main part.
Particularly, content in 120 pairs of a plurality of content blocks of proxy server is carried out word segmentation processing, and according to word segmentation result, judge the physical meaning of content in a plurality of content blocks, and when the physical meaning of content blocks and main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
(2) without advertisement, browse rule.Without advertisement, browse rule and use disclosed characteristic of advertisement storehouse to filter advertisement, thereby reach the object of simplifying content of pages.
Proxy server 120 screens a plurality of content blocks according to the attribute in characteristic of advertisement storehouse and a plurality of content blocks and/or content, and by the partial content block delete mating with characteristic of advertisement storehouse in a plurality of content blocks, and using the content blocks after screening as the content blocks of selecting.
(3) safety is browsed rule.Safety is browsed rule by the link in verification webpage, thus the security that improves webpage and mobile terminal,
Proxy server 120 judges whether the link in content blocks is secure link, and when link is not secure link, the content blocks using other guide piece as selection after the content blocks under dangerous link is deleted.
(4) cool link rule.Cool link rule, by the record to user behavior custom, obtains and retains user's conventional link, retains for the important information of user.
Proxy server 120 extracts the link in a plurality of content blocks, and recording user is to the click behavior linking in a plurality of content blocks, and the custom of browsing that generates user according to user's click behavior, and generates individual character corresponding to user and browse rule according to browsing custom.
In one embodiment of the invention, between mobile terminal and described proxy server, by Json-rpc form, communicate.
For example, mobile terminal 110 receives the request of user to third party's site page, and mobile terminal 110 is by the open rpc interface of SmartLayout in Json-rpc invokes proxy server 120.120 pairs of rpc requests of proxy server are resolved, and obtain the request address of third party's website and send request to third party's website, thus the page of acquisition third party website.120 pairs of page analyses of proxy server obtain a plurality of content blocks and the framed structure in the page, such as according to structure of web page, the page being cut, thereby extract different page plates.Then proxy server 120 is determined attribute and/or the content in a plurality of content blocks to a plurality of content blocks analyses respectively, according to attribute and/or the content browsed in rule and a plurality of content blocks of mobile terminal 110 correspondences, from a plurality of content blocks, select part content blocks, and framed structure and partial content piece are sent to mobile terminal 110 according to Json-rpc response format.Framed structure and the partial content piece of the page that mobile terminal 110 Receiving Agent servers 120 send, be added into partial content piece among framed structure, and display section content blocks, the page display effect after being optimized.
According to the page intelligence browing system of the embodiment of the present invention, intelligence is browsed to rule application in the PC page, by considering the factors such as mobile phone function, screen size, mode of operation, can effectively improve user's experience that mobile terminal is browsed the PC page.In addition, by proxy server, content of pages is screened and integrated, saved the data traffic of mobile terminal, alleviated the load pressure of mobile terminal.
Below with reference to Fig. 3, describe according to the proxy server 200 of the embodiment of the present invention, comprise receiver module 210, analysis module 220, select module 230, sending module 240 and generation module 250.Receiver module 210 obtains the page of third party's website for the request sending according to mobile terminal; Analysis module 220 is for page analysis being obtained to a plurality of content blocks and the framed structure of the page, and respectively a plurality of content blocks analyses determined to attribute and/or the content in a plurality of content blocks; Select module 230 for selecting part content blocks according to mobile terminal corresponding attribute and/or the content of browsing rule and a plurality of content blocks from a plurality of content blocks; Sending module 240 is for being sent to mobile terminal by framed structure and partial content piece; Generation module 250 is for extracting the link of a plurality of content blocks, and recording user is to the click behavior linking in a plurality of content blocks, and the custom of browsing that generates user according to user's click behavior, and generates individual character corresponding to user and browse rule according to browsing custom
Wherein, in one embodiment of the invention, browse rule and comprise following three kinds:
(1) main body is browsed rule.It is mainly that web page contents is screened that main body is browsed rule, retains and main body related content, deletes with main body and does not want the redundant information of closing.
Attribute and/or the content of selecting module 230 to browse in rule and a plurality of content blocks according to main body are determined the content blocks that belongs to main part in a plurality of content blocks, and select to belong to the content blocks of main part.
Concrete, select the content in 230 pairs of a plurality of content blocks of module to carry out word segmentation processing, and according to word segmentation result, judge the physical meaning of content in a plurality of content blocks, and when the physical meaning of content blocks and main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
(2) without advertisement, browse rule.Without advertisement, browse rule and use disclosed characteristic of advertisement storehouse to filter advertisement, thereby reach the object of simplifying content of pages.
Select module 230 according to the attribute in characteristic of advertisement storehouse and a plurality of content blocks and/or content, a plurality of content blocks to be screened, and by the partial content block delete mating with characteristic of advertisement storehouse in a plurality of content blocks, and using the content blocks after screening as the content blocks of selecting.
(3) safety is browsed rule.Safety is browsed rule by the link in verification webpage, thus the security that improves webpage and mobile terminal,
Select module 230 to judge whether the link in content blocks is secure link, and when link is not secure link, the content blocks using other guide piece as selection after the content blocks under dangerous link is deleted.
Fig. 4 is the slice map of an embodiment of proxy server 200, and proxy server is divided into SurfService L1 and PageAnalyser L2 layer.Parsing and the transmission of the SurfService L1 request of being mainly used in, PageAnalyser L2 is mainly used in obtaining of rule.Be understandable that, above-mentioned layering only for illustrative purposes, rather than in order to limit the present invention.Proxy server 200 can also adopt other architecture.As shown in the figure, the open rpc interface of mobile terminal invokes proxy server, SurfServiceL1 resolves rpc request, and after parameter is changed, request is sent to PageAnalyser L2.PageAnalyser L2 browses rule according to acquisition of informations such as the pages, after the page is analyzed, result is arrived to SurfService L1.SurfService L1 resolves the data that PageAnalyser L2 returns, and according to Json-rpc response format, turns back to mobile terminal.
According to the proxy server of the embodiment of the present invention, intelligence is browsed to rule application in the PC page, alleviated the complex pressure of mobile terminal, effectively improve user's experience that mobile terminal is browsed the PC page.
In the description of this instructions, the description of reference term " embodiment ", " some embodiment ", " example ", " concrete example " or " some examples " etc. means to be contained at least one embodiment of the present invention or example in conjunction with specific features, structure, material or the feature of this embodiment or example description.In this manual, the schematic statement of above-mentioned term is not necessarily referred to identical embodiment or example.And the specific features of description, structure, material or feature can be with suitable mode combinations in any one or more embodiment or example.
Although illustrated and described embodiments of the invention, for the ordinary skill in the art, be appreciated that without departing from the principles and spirit of the present invention and can carry out multiple variation, modification, replacement and modification to these embodiment, scope of the present invention is by claims and be equal to and limit.

Claims (20)

1. the intelligent browsing method of the page, is characterized in that, comprises the following steps:
Proxy server receives user by the request of mobile terminal accessing third party site page;
Described proxy server obtains the page of described third party's website;
Described proxy server obtains a plurality of content blocks and the framed structure in the described page to described page analysis;
Described proxy server is determined attribute and/or the content in described a plurality of content blocks to described a plurality of content blocks analyses respectively;
Described proxy server is selected part content blocks according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks, and described framed structure and described partial content piece are sent to described mobile terminal; And
Described mobile terminal is added into described partial content piece among described framed structure, and shows described partial content piece.
2. the page as claimed in claim 1 intelligence browsing method, it is characterized in that, the described rule of browsing comprises that main body browses rule, and described proxy server selects part content blocks further to comprise according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks:
Attribute and/or content that described proxy server is browsed in rule and described a plurality of content blocks according to described main body are determined the content blocks that belongs to main part in described a plurality of content blocks; And
The content blocks that belongs to main part described in described proxy server selection.
3. the page as claimed in claim 2 intelligence browsing method, it is characterized in that, attribute and/or content that described proxy server is browsed in rule and described a plurality of content blocks according to described main body determine that the content blocks that belongs to main part in described a plurality of content blocks further comprises:
Described proxy server obtains the content in described a plurality of content blocks;
Described proxy server carries out word segmentation processing to the content in described a plurality of content blocks, and according to the physical meaning of content in the described a plurality of content blocks of word segmentation result judgement; And
If the physical meaning of content blocks and described main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
4. the page as described in claim 1-3 any one intelligence browsing method, it is characterized in that, the described rule of browsing comprises without advertisement and browses rule, and described proxy server selects part content blocks further to comprise according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks:
Described proxy server obtains characteristic of advertisement storehouse;
Described proxy server screens described a plurality of content blocks according to the attribute in described characteristic of advertisement storehouse and described a plurality of content blocks and/or content;
Described proxy server is by the partial content block delete mating with described characteristic of advertisement storehouse in described a plurality of content blocks; And
Described proxy server is using the content blocks after screening as the content blocks of selecting.
5. the page as described in claim 1-4 any one intelligence browsing method, it is characterized in that, the described rule of browsing comprises the safe rule of browsing, and described proxy server selects part content blocks further to comprise according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks from described a plurality of content blocks:
Described proxy server obtains the content in described a plurality of content blocks;
Described proxy server extracts the link in described a plurality of content blocks;
Described proxy server judges whether the link in described content blocks is secure link;
If it is not secure link that described proxy server judges described link, described proxy server is by the content blocks using other guide piece as selection after the content blocks deletion under described dangerous link.
6. the intelligence of the page as described in claim 1-5 any one browsing method, is characterized in that, also comprises:
Described proxy server obtains the content in described a plurality of content blocks;
Described proxy server extracts the link in described a plurality of content blocks; And
Described proxy server records described user to the click behavior linking in described a plurality of content blocks, and according to described user's click behavior, generates described user's the custom of browsing, and browses rule according to described individual character corresponding to the custom described user of generation of browsing.
7. the intelligence of the page as described in claim 1-6 any one browsing method, is characterized in that, between described mobile terminal and described proxy server, by Json-rpc form, communicates.
8. a page intelligence browing system, is characterized in that, comprises mobile terminal and proxy server, wherein,
Described mobile terminal, for sending the request of access third party site page, and receive framed structure and the partial content piece of the described page that described proxy server sends, and described partial content piece is added among described framed structure, and show described partial content piece; And
Described proxy server, for obtain the page of described third party's website according to described request, and described page analysis is obtained to a plurality of content blocks and the framed structure in the described page, and respectively described a plurality of content blocks analyses are determined to attribute and/or the content in described a plurality of content blocks, with according to described mobile terminal corresponding attribute and/or the content browsed in rule and described a plurality of content blocks, from described a plurality of content blocks, select part content blocks, and described framed structure and described partial content piece are sent to described mobile terminal.
9. the page as claimed in claim 8 intelligence browing system, it is characterized in that, the described rule of browsing comprises that main body browses rule, attribute and/or content that described proxy server is browsed in rule and described a plurality of content blocks according to described main body are determined the content blocks that belongs to main part in described a plurality of content blocks, and described in selecting, belong to the content blocks of main part.
10. the page as claimed in claim 9 intelligence browing system, it is characterized in that, described proxy server carries out word segmentation processing to the content in described a plurality of content blocks, and according to the physical meaning of content in the described a plurality of content blocks of word segmentation result judgement, and when the physical meaning of content blocks and described main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
11. page intelligence browing systems as claimed in claim 8, it is characterized in that, the described rule of browsing comprises without advertisement and browses rule, described proxy server screens described a plurality of content blocks according to the attribute in characteristic of advertisement storehouse and described a plurality of content blocks and/or content, and by the partial content block delete mating with described characteristic of advertisement storehouse in described a plurality of content blocks, and using the content blocks after screening as the content blocks of selecting.
12. page intelligence browing systems as claimed in claim 8, it is characterized in that, the described rule of browsing comprises the safe rule of browsing, described proxy server judges whether the link in content blocks is secure link, and when described link is not secure link, the content blocks using other guide piece as selection after the content blocks under described dangerous link is deleted.
13. page intelligence browing systems as claimed in claim 8, it is characterized in that, described proxy server is also for extracting the link of described a plurality of content blocks, and record described user to the click behavior linking in described a plurality of content blocks, and the custom of browsing that generates described user according to described user's click behavior, and browse rule according to described individual character corresponding to the custom described user of generation of browsing.
14. page as described in claim 8-13 any one intelligence browing systems, is characterized in that, between described mobile terminal and described proxy server, by Json-rpc form, communicate.
15. 1 kinds of proxy servers, is characterized in that, comprising:
Receiver module, for obtaining the page of described third party's website according to the request of mobile terminal transmission;
Analysis module, for described page analysis being obtained to a plurality of content blocks and the framed structure of the described page, and determines attribute and/or the content in described a plurality of content blocks to described a plurality of content blocks analyses respectively;
Select module, for selecting part content blocks according to described mobile terminal corresponding attribute and/or the content of browsing rule and described a plurality of content blocks from described a plurality of content blocks; And
Sending module, for being sent to described mobile terminal by described framed structure and described partial content piece.
16. proxy servers as claimed in claim 15, it is characterized in that, the described rule of browsing comprises that main body browses rule, attribute and/or content that described selection module is browsed in rule and described a plurality of content blocks according to described main body are determined the content blocks that belongs to main part in described a plurality of content blocks, and described in selecting, belong to the content blocks of main part.
17. proxy servers as claimed in claim 16, it is characterized in that, described selection module is carried out word segmentation processing to the content in described a plurality of content blocks, and according to the physical meaning of content in the described a plurality of content blocks of word segmentation result judgement, and when the physical meaning of content blocks and described main body are browsed rule match, judge that corresponding content blocks belongs to the content blocks of main part.
18. proxy servers as claimed in claim 15, it is characterized in that, the described rule of browsing comprises without advertisement and browses rule, described selection module is screened described a plurality of content blocks according to the attribute in characteristic of advertisement storehouse and described a plurality of content blocks and/or content, and by the partial content block delete mating with described characteristic of advertisement storehouse in described a plurality of content blocks, and using the content blocks after screening as the content blocks of selecting.
19. proxy servers as claimed in claim 15, it is characterized in that, the described rule of browsing comprises the safe rule of browsing, described selection module judges whether the link in content blocks is secure link, and when described link is not secure link, the content blocks using other guide piece as selection after the content blocks under described dangerous link is deleted.
20. proxy servers as claimed in claim 15, is characterized in that, also comprise:
Generation module, for the link of extracting described a plurality of content blocks, and record described user to the click behavior linking in described a plurality of content blocks, and the custom of browsing that generates described user according to described user's click behavior, and browse rule according to described individual character corresponding to the custom described user of generation of browsing.
CN201210262968.2A 2012-07-26 2012-07-26 Intelligent page browsing method, system and device Pending CN103577433A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210262968.2A CN103577433A (en) 2012-07-26 2012-07-26 Intelligent page browsing method, system and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210262968.2A CN103577433A (en) 2012-07-26 2012-07-26 Intelligent page browsing method, system and device

Publications (1)

Publication Number Publication Date
CN103577433A true CN103577433A (en) 2014-02-12

Family

ID=50049238

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210262968.2A Pending CN103577433A (en) 2012-07-26 2012-07-26 Intelligent page browsing method, system and device

Country Status (1)

Country Link
CN (1) CN103577433A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899287A (en) * 2015-06-04 2015-09-09 百度在线网络技术(北京)有限公司 Webpage display method and device
CN105760527A (en) * 2016-03-02 2016-07-13 百度在线网络技术(北京)有限公司 Method and device for displaying third-party page
WO2017215175A1 (en) * 2016-06-16 2017-12-21 乐视控股(北京)有限公司 Page processing method and device, terminal, and server
WO2024093572A1 (en) * 2022-10-31 2024-05-10 北京字跳网络技术有限公司 Method and apparatus for presenting document, and device and storage medium

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104899287A (en) * 2015-06-04 2015-09-09 百度在线网络技术(北京)有限公司 Webpage display method and device
CN104899287B (en) * 2015-06-04 2019-04-19 百度在线网络技术(北京)有限公司 The display methods and device of webpage
CN105760527A (en) * 2016-03-02 2016-07-13 百度在线网络技术(北京)有限公司 Method and device for displaying third-party page
CN105760527B (en) * 2016-03-02 2022-09-27 百度在线网络技术(北京)有限公司 Third-party page display method and device
WO2017215175A1 (en) * 2016-06-16 2017-12-21 乐视控股(北京)有限公司 Page processing method and device, terminal, and server
WO2024093572A1 (en) * 2022-10-31 2024-05-10 北京字跳网络技术有限公司 Method and apparatus for presenting document, and device and storage medium

Similar Documents

Publication Publication Date Title
CN102646135B (en) Webpage collecting method, device and system
US20040095400A1 (en) Reconfiguration of content for display on devices of different types
CN103353886A (en) Method and system for previewing webpage
CN102455857B (en) Method and system for realizing quick links in mobile terminal browser
CN103473302A (en) Lock screen information display method, device and system
CN105095107A (en) Buffer memory data cleaning method and apparatus
CN104965838B (en) Page elements processing method and page elements processing unit
CN102724184B (en) A kind of web page storage sharing method and server
WO2016107465A1 (en) Method, device, and system for implementing card-type desktop
US20120287116A1 (en) Anchors For Displaying Image Sprites, Sub-Regions And 3D Images
CN103577433A (en) Intelligent page browsing method, system and device
CN104426985A (en) Method, device and system for displaying webpage
CN105808221A (en) Card type desktop realization method and apparatus
CN107526755B (en) Data processing method and device
CN105550179B (en) Webpage collection method and browser plug-in
CN104899212B (en) Web page display method, server and system
CN111324836B (en) Page processing method and device, computer equipment and storage medium
CN103838728B (en) The processing method and browser of info web
CN102831150A (en) Interactive method, system and terminal for browser and local application
CN103902608B (en) A kind of web page monitored picture and the method and apparatus compressed
KR20020006722A (en) Method of reformatting webpage and method of providing webpage using the same
EP2801920A1 (en) Method and apparatus for displaying web page
CN102137168A (en) Double-browsing mode supporting client, mobile internet browsing system and browsing method
CN104021126B (en) Webpage content filtering method and server
US20120072492A1 (en) Browsing information gathering system, browsing information gathering method, server, and recording medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140212

RJ01 Rejection of invention patent application after publication