CN104794118B - Web page information processing method, device and system - Google Patents

Info

Publication number
CN104794118B
Authority
CN
China
Prior art keywords
web page
webpage
information
page element
processed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410021353.XA
Other languages
Chinese (zh)
Other versions
CN104794118A (en
Inventor
张凯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Yayue Technology Co ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201410021353.XA priority Critical patent/CN104794118B/en
Publication of CN104794118A publication Critical patent/CN104794118A/en
Application granted granted Critical
Publication of CN104794118B publication Critical patent/CN104794118B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

An embodiment of the invention discloses a web page information processing method, device and system. The method comprises: parsing web page access requests and obtaining, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL); based on this correspondence, determining the access information of each page element in a web page to be processed from the URL information of that page and the URL information of each page element in it; determining an importance weight for each page element from its access information; and adjusting the layout (composition) information of the page elements in the web page to be processed according to the importance weights. Embodiments of the invention can improve the efficiency with which users obtain useful information.

Description

Web page information processing method, device and system
Technical field
This application relates to the technical field of information processing, and in particular to a web page information processing method, device and system.
Background
At present, when a user visits the pages of a website through a browser, the browser, as the web browsing tool, performs the rendering and display of those pages.
Normally, the browser obtains the web page information directly from the origin site (i.e., the web server) and, when rendering and displaying the page, follows the origin site's design as closely as possible: it does not modify the layout structure of the page, that is, the page's DOM structure, so that the page's original typesetting is preserved.
For cross-terminal access, in order to fit the same web page to terminal screens of different sizes, the browser may adjust the DOM structure of the obtained web page information according to how the terminal screen size matches the page size, so that the adjusted typesetting fits the terminal's screen.
Thus, web page information is currently either rendered and displayed directly as provided by the web server, or its DOM structure is adjusted according to how the terminal screen size matches the page size so that the adjusted typesetting fits the terminal's screen. However, the layout of the page elements in the DOM structure of the original web page information provided by the web server may not truly reflect the importance of each page element. Users therefore need to spend more time locating valuable information in the original web page information, and the efficiency with which they obtain useful information is low.
Summary of the invention
The present invention provides a web page information processing method, device and system that can improve the efficiency with which users obtain useful information.
A web page information processing method, comprising:
parsing web page access requests and obtaining, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL);
based on the correspondence, determining the access information of each page element in a web page to be processed from the URL information of the web page to be processed and the URL information of each page element in it;
determining the importance weight of each page element from the access information of each page element in the web page to be processed; and adjusting the layout (composition) information of the page elements in the web page to be processed according to the importance weights.
A web page information processing device, comprising a correspondence obtaining module, an access information determining module, an importance weight determining module and a layout information adjusting module.
The correspondence obtaining module is configured to parse web page access requests and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL).
The access information determining module is configured to determine, based on the correspondence, the access information of each page element in a web page to be processed from the URL information of the web page to be processed and the URL information of each page element in it.
The importance weight determining module is configured to determine the importance weight of each page element from the access information of each page element in the web page to be processed.
The layout information adjusting module is configured to adjust the layout (composition) information of the page elements in the web page to be processed according to the importance weights.
A web page information processing system, comprising a browser, a proxy server and a website server.
The browser is configured to send web page access requests to the proxy server, receive the web page adjustment information returned by the proxy server, and output the adjusted web page content according to the web page adjustment information.
The proxy server is configured to: receive the web page access requests sent by the browser; obtain the original web page information from the website server according to each request; parse the requests and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL); based on the correspondence, determine the access information of each page element in a web page to be processed from the URL information of the web page to be processed and the URL information of each page element in it; determine the importance weight of each page element from that access information; and return web page adjustment information to the browser according to the importance weights.
The website server is configured to receive the web page access requests sent by the proxy server and return the original web page information according to each request.
As can be seen from the above technical solution, embodiments of the invention parse web page access requests to obtain the correspondence between the URL information of the currently requested web page and the URL information of the previously requested web page (ReferURL). This correspondence reflects the relationship between the pages a user visits in succession, that is, which pages a user goes on to visit after visiting a given page. Typically, after visiting a page, a user is likely to access page elements that exist in the page's DOM structure with their own URL information, such as pictures and URL links. The correspondence obtained by parsing web page access requests therefore reflects the access information of each page element in a page, and so, based on the correspondence, the access information of each page element in the web page to be processed can be determined from the URL information of that page and the URL information of each page element in it. How often each page element is accessed in turn reflects its importance, or the value of the information it contains: in general, important page elements and more valuable information are accessed more often. Embodiments of the invention therefore determine the importance weight of each page element from its access information and adjust the layout (composition) information of the page elements in the web page to be processed according to those weights, so that more valuable, or more important, information is laid out more prominently in the page, for example by being placed in a more prominent position or displayed in a particular format.
Thus, embodiments of the invention mine the order in which users visit web pages, use that order to determine the access information of each page element in the web page to be processed, derive the importance of each page element from it, and rearrange the page elements according to that importance. More valuable information can be laid out first, or more prominently, which reduces the time needed to locate valuable information in the page and improves the efficiency with which users obtain useful information.
Brief description of the drawings
Fig. 1 is a flow diagram of the web page information processing method provided by an embodiment of the invention.
Fig. 2 is a structural diagram of the web page information processing device provided by an embodiment of the invention.
Fig. 3 is a diagram of the composition of the web page information processing system provided by an embodiment of the invention.
Detailed description of the embodiments
Fig. 1 is a flow diagram of the web page information processing method provided by an embodiment of the invention.
As shown in Fig. 1, the flow includes:
Step 101: parse web page access requests and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL).
In this step, parsing a received web page access request yields the URL information of the page the user is currently accessing and the URL information of the page the user accessed immediately before, that is, the current URL information and the ReferURL information. From these, the order in which the user visits web pages can be mined. The ReferURL information in a web page access request may be empty.
Typically, a service proxy module (service agent module) can be deployed on the server side. It receives the web page access requests sent by browsers and obtains the web page information from the web server according to each request. It also parses the requests to obtain the correspondence for use in the subsequent steps.
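The parsing in step 101 can be sketched as follows. This is a minimal illustration, not the patent's implementation: it assumes the requests are plain HTTP/1.1 text, that the ReferURL corresponds to the standard HTTP `Referer` request header, and that relative request targets use the `http` scheme. The function name `extract_correspondence` is hypothetical.

```python
from typing import Optional, Tuple


def extract_correspondence(raw_request: str) -> Tuple[str, Optional[str]]:
    """Return (current_url, refer_url) for one raw HTTP/1.1 request.

    refer_url is None when the request carries no Referer header,
    which matches the case where the ReferURL information is empty.
    """
    head = raw_request.split("\r\n\r\n", 1)[0]
    lines = head.split("\r\n")
    _method, target, _version = lines[0].split(" ", 2)

    # Collect headers with case-insensitive names.
    headers = {}
    for line in lines[1:]:
        if ":" in line:
            name, value = line.split(":", 1)
            headers[name.strip().lower()] = value.strip()

    # Requests sent to a proxy usually carry an absolute URL as the target;
    # otherwise rebuild the URL from the Host header (assumption: http scheme).
    if target.startswith(("http://", "https://")):
        current_url = target
    else:
        current_url = "http://" + headers.get("host", "") + target

    return current_url, headers.get("referer") or None
```

Each returned pair is one record of the correspondence; collecting the pairs over many requests gives the data that the later steps count over.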
Step 102: based on the correspondence, determine the access information of each page element in the web page to be processed from the URL information of the web page to be processed and the URL information of each page element in it.
After visiting the web page to be processed, a user may go on to access a page element that exists in its DOM structure with its own URL information, for example a picture or a link in the page. In the web page access request for that page element, the current URL information is the URL information of the page element, and the URL information of the previously accessed page, that is, the ReferURL information, is the URL information of the web page to be processed. Therefore, based on the correspondence, the access information of each page element in the web page to be processed can be determined from the URL information of the web page to be processed and the URL information of each page element in it. Specifically, the access information can be determined by the service proxy module deployed on the server side.
The access information generally includes access count information. Preferably, it includes both access counts and access times, so that access frequencies over different time periods can be obtained and the importance weight of a page element can be adjusted adaptively as its access count changes over time, making the weight more accurate.
To protect user privacy, the access information generally contains neither users' private identity information nor the behavior data of specific individuals.
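Step 102 can be illustrated with a short sketch. It assumes the correspondence has been collected as a list of `(current_url, refer_url, timestamp)` records; that record shape and the function name `element_access_info` are assumptions made for illustration only. No user identifier is stored, in line with the privacy note above.

```python
def element_access_info(log, page_url, element_urls):
    """Collect the access times of each page element of one page.

    log: iterable of (current_url, refer_url, timestamp) records.
    A record counts as an access to an element of page_url when its
    refer_url is the page and its current_url is one of the elements.
    Returns {element_url: [timestamps]}; the access count of an
    element is the length of its list.
    """
    wanted = set(element_urls)
    info = {url: [] for url in element_urls}
    for current_url, refer_url, timestamp in log:
        if refer_url == page_url and current_url in wanted:
            info[current_url].append(timestamp)
    return info
```

Keeping the timestamps rather than only a counter makes it possible to compute access frequencies for different time periods later.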
Step 103: determine the importance weight of each page element from the access information of each page element in the web page to be processed.
Step 104: adjust the layout (composition) information of the page elements in the web page to be processed according to the importance weights.
In steps 103 and 104, the importance weights are generally determined from the access information by the service proxy module deployed on the server side. The adjustment of the layout information according to the importance weights can be performed either by the service proxy module or by the browser.
Adjusting the layout information of the page elements in the web page to be processed rearranges the page: the page layout is adjusted in its region division, in the order of elements, and/or in display format, so that more important page elements are displayed more prominently, for example nearer the top of the page or in a special font or color.
As one implementation of steps 103 and 104, the importance weight of each individual page element can be determined from its access information, and the layout information of each individual page element in the web page to be processed can then be adjusted according to its importance weight. In other words, both the determination of importance weights and the adjustment of layout information operate on individual page elements.
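The per-element variant of steps 103 and 104 might look like the sketch below. The patent does not fix a weighting formula; using each element's share of the total access count is an assumption chosen for simplicity, and reordering is only one of the adjustments the text mentions. Both function names are hypothetical.

```python
def importance_weights(access_counts):
    """Map {element_url: access_count} to {element_url: weight in [0, 1]}.

    Assumed formula: an element's weight is its share of all accesses.
    """
    total = sum(access_counts.values())
    if total == 0:
        return {url: 0.0 for url in access_counts}
    return {url: count / total for url, count in access_counts.items()}


def reorder(element_urls, weights):
    """Return the elements with the most important ones first.

    sorted() is stable, so elements with equal weight keep the
    order they had in the original page.
    """
    return sorted(element_urls, key=lambda url: -weights.get(url, 0.0))
```

For example, with access counts of 1, 3 and 0 for elements `a`, `b` and `c`, the weights are 0.25, 0.75 and 0.0, and the reordered page lists `b`, `a`, `c`.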
As another implementation, the page elements can first be grouped according to the DOM structure relationships and/or semantic relationships between them. A composite importance weight is then determined for each group from the access information of the page elements in it, and the layout (composition) information of each group in the web page to be processed is adjusted according to these composite weights, with each group moved as a whole.
When determining the composite importance weight of each group from the access information of the page elements in it, the composite weight can be determined directly from that access information. Alternatively, the importance weight of each individual page element in the group can first be determined from its access information, and these individual weights can then be combined by weighting to obtain the group's composite weight. The composite weight can also be obtained by combining the access information of some of the page elements in the group with the individual importance weights of the remaining page elements.
In this implementation, the page elements are first grouped, a composite importance weight is determined per group, and the layout information of each whole group in the web page to be processed is adjusted according to that weight. Page elements that are closely related structurally or semantically therefore always stay together when the layout is adjusted: for example, page elements under the same Div or the same Tab, or a set of semantically closely related page elements, are always laid out together. This prevents closely related page elements from being scattered across the page, and further improves the efficiency with which users obtain useful information.
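The grouped variant can be sketched as follows. The sketch assumes each page element is described by a `(url, parent_id)` pair, where `parent_id` stands for its enclosing Div or Tab, and it takes the mean of the individual weights as the composite weight. Both the data shape and the averaging rule are illustrative assumptions; the text allows other ways of combining weights.

```python
def group_by_parent(elements):
    """Group (url, parent_id) pairs by parent, keeping page order."""
    groups = {}
    for url, parent_id in elements:
        groups.setdefault(parent_id, []).append(url)
    return groups


def composite_weights(groups, weights):
    """Assumed rule: a group's weight is the mean of its elements' weights."""
    return {
        parent_id: sum(weights.get(url, 0.0) for url in urls) / len(urls)
        for parent_id, urls in groups.items()
    }


def reorder_groups(groups, composite):
    """Order whole groups by composite weight; members stay together."""
    ordered = sorted(groups, key=lambda parent_id: -composite[parent_id])
    return [(parent_id, groups[parent_id]) for parent_id in ordered]
```

Because only whole groups are moved, elements under the same parent are never separated, which is the property this implementation is meant to preserve.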
The importance weights can be determined from the access information of the page elements in the web page to be processed either online or offline. Specifically, the importance weight of a page element can be updated in real time whenever the access information of the page elements in the web page to be processed changes; alternatively, the importance weights can be determined offline, periodically or at irregular intervals, from the access information of the page elements in the web page to be processed.
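The difference between the online and offline modes can be illustrated with a small sketch. It again assumes that a weight is an element's share of the recorded accesses; the class `OnlineWeights` and the function `offline_weights` are hypothetical names.

```python
from collections import Counter


def offline_weights(accessed_urls):
    """Batch mode: recompute all weights from a stored list of accesses."""
    counts = Counter(accessed_urls)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {url: count / total for url, count in counts.items()}


class OnlineWeights:
    """Online mode: update counts as each access arrives."""

    def __init__(self):
        self._counts = Counter()
        self._total = 0

    def record(self, url):
        # Called once per observed access to a page element.
        self._counts[url] += 1
        self._total += 1

    def weights(self):
        if self._total == 0:
            return {}
        return {url: count / self._total for url, count in self._counts.items()}
```

Both modes produce the same weights for the same accesses; they differ only in when the work is done, which is the trade-off between freshness and processing load.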
The layout (composition) information of the page elements in the web page to be processed can likewise be adjusted according to the importance weights either online or offline. Specifically, the layout information can be adjusted in real time whenever the importance weights change; alternatively, it can be adjusted offline, periodically or at irregular intervals, according to the importance weights.
The layout adjustment function can be placed on the server side or on the browser side. Specifically, the server can adjust the layout information of the page elements in the web page to be processed according to the importance weights and send the adjusted web page information to the browser, which outputs the page according to it. Alternatively, the server can send the importance weights to the browser, and the browser adjusts the layout information of the page elements according to the importance weights and outputs the page according to the adjusted web page information.
To further improve processing performance, the server side can cache either the adjusted layout information of the page elements in the web page to be processed or their importance weights. When the server side receives a web page access request for that page, it can promptly return the cached layout information or importance weights. The browser then either outputs the page according to the adjusted layout information, or adjusts the layout information according to the importance weights and then outputs the page. The layout information or importance weights cached on the server side need to be updated periodically or at irregular intervals to keep them accurate.
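A server-side cache with periodic refresh could be sketched as below. The time-to-live policy, the class name `WeightCache` and the `compute` callback are assumptions for illustration; the text only requires that cached values be refreshed from time to time.

```python
import time


class WeightCache:
    """Cache per-page results and recompute them after ttl_seconds."""

    def __init__(self, ttl_seconds, compute, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._compute = compute      # compute(page_url) -> weights or layout
        self._clock = clock          # injectable so the cache can be tested
        self._entries = {}           # page_url -> (stored_at, value)

    def get(self, page_url):
        now = self._clock()
        entry = self._entries.get(page_url)
        if entry is None or now - entry[0] >= self._ttl:
            # Missing or stale: recompute and store with the current time.
            entry = (now, self._compute(page_url))
            self._entries[page_url] = entry
        return entry[1]
```

A request for a cached page is answered without recomputation until the entry expires, at which point the next request triggers a refresh.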
Corresponding to the web page information processing method provided above, embodiments of the invention also provide a web page information processing device; see Fig. 2.
Fig. 2 is a structural diagram of the web page information processing device provided by an embodiment of the invention.
As shown in Fig. 2, the device includes a correspondence obtaining module 201, an access information determining module 202, an importance weight determining module 203 and a layout information adjusting module 204.
The correspondence obtaining module 201 is configured to parse web page access requests and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL).
The access information determining module 202 is configured to determine, based on the correspondence, the access information of each page element in the web page to be processed from the URL information of the web page to be processed and the URL information of each page element in it.
The importance weight determining module 203 is configured to determine the importance weight of each page element from the access information of each page element in the web page to be processed.
The layout information adjusting module 204 is configured to adjust the layout (composition) information of the page elements in the web page to be processed according to the importance weights.
The importance weight determining module 203 may be configured to group the page elements according to the DOM structure relationships and/or semantic relationships between them, and to determine a composite importance weight for each group from the access information of the page elements in it.
The layout information adjusting module 204 may be configured to adjust the layout information of each group in the web page to be processed according to the composite importance weight of each group, with each group moved as a whole.
And/or, the importance weight determining module 203 may be configured to determine the importance weight of each individual page element from the access information of each page element in the web page to be processed.
The layout information adjusting module 204 may be configured to adjust the layout information of each individual page element in the web page to be processed according to its importance weight.
The importance weight determining module 203 may be configured to update the importance weights in real time when the access information of the page elements in the web page to be processed changes, or to determine the importance weights offline, periodically or at irregular intervals, from that access information.
And/or, the layout information adjusting module 204 may be configured to adjust the layout information of the page elements in the web page to be processed in real time when the importance weights change, or to adjust it offline, periodically or at irregular intervals, according to the importance weights.
The correspondence obtaining module 201, the access information determining module 202 and the importance weight determining module 203 are located on the server side; the layout information adjusting module 204 may be located on the server side or on the browser side. In other words, embodiments of the invention provide a web page information processing system that adds a proxy layer (which implements at least the functions of the correspondence obtaining module 201, the access information determining module 202 and the importance weight determining module 203) to act as a server-side proxy for users visiting internet sites. The proxy layer screens, collects and analyzes user behavior: for the web page access requests sent by users, it analyzes the current URL and ReferURL in each request to obtain users' access information for each page element in a page, and then determines the importance weights of the page elements from that access information. The layout (composition) information of the page elements is then adjusted according to the importance weights, either in the proxy layer or in the browser, so that more important page elements are displayed more prominently. The web page information processing system provided by embodiments of the invention is also illustrated in Fig. 3.
Fig. 3 is a diagram of the composition of the web page information processing system provided by an embodiment of the invention.
As shown in Fig. 3, the system includes a browser 301, a proxy server 302 and a website server 303. The browser 301 sends web page access requests to the proxy server 302. On the one hand, the proxy server 302 forwards each request to the website server 303 and obtains the original web page information from it. On the other hand, the proxy server 302 screens, collects and analyzes user behavior based on the received requests: it analyzes the current URL and ReferURL in each request to obtain users' access information for each page element in a page, and determines the importance weights of the page elements from that access information. The proxy server 302 can then adjust the obtained original web page information according to the importance weights, that is, adjust the layout (composition) information of the page elements in it, and send the adjusted web page content to the browser 301, which directly outputs what it receives. Alternatively, the proxy server 302 can send the importance weight information together with the original web page information to the browser 301; the browser 301 then adjusts the original web page information according to the importance weight information, that is, adjusts the layout information of the page elements, and outputs the adjusted web page content.
Thus, the browser 301 is configured to send web page access requests to the proxy server 302, receive the web page adjustment information returned by the proxy server 302, and output the adjusted web page content according to the web page adjustment information.
The proxy server 302 is configured to: receive the web page access requests sent by the browser 301; obtain the original web page information from the website server 303 according to each request; parse the requests and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested web page and the URL information of the previously requested web page (ReferURL); based on the correspondence, determine the access information of each page element in the web page to be processed from the URL information of the web page to be processed and the URL information of each page element in it; determine the importance weight of each page element from that access information; and return web page adjustment information to the browser 301 according to the importance weights.
The web page adjustment information may be the importance weight information of the page elements, in which case the browser 301 adjusts the layout information of the page elements in the original web page information according to the importance weight information and then outputs the adjusted web page content. The web page adjustment information may also be the adjusted web page content itself: the proxy server 302 adjusts the layout information of the page elements in the original web page information according to their importance weights and sends the adjusted web page content to the browser 301, which outputs it directly.
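The two forms of web page adjustment information can be sketched as follows. The dictionary layout, the `kind` field and both function names are hypothetical; the sketch also reduces "adjusting the layout" to reordering a list of element URLs, which is a simplification of what a real proxy or browser would do to the DOM.

```python
def build_adjustment(element_urls, weights, adjust_on_proxy):
    """Proxy side: build the web page adjustment information.

    adjust_on_proxy=True  -> the proxy reorders and sends adjusted content.
    adjust_on_proxy=False -> the proxy sends the original order plus weights.
    """
    if adjust_on_proxy:
        ordered = sorted(element_urls, key=lambda url: -weights.get(url, 0.0))
        return {"kind": "adjusted_content", "elements": ordered}
    return {
        "kind": "weights",
        "elements": list(element_urls),
        "weights": dict(weights),
    }


def browser_output(adjustment):
    """Browser side: produce the element order that is finally shown."""
    if adjustment["kind"] == "adjusted_content":
        # Already adjusted by the proxy: output directly.
        return adjustment["elements"]
    weights = adjustment["weights"]
    return sorted(adjustment["elements"], key=lambda url: -weights.get(url, 0.0))
```

Either way the user sees the same result; the choice decides whether the adjustment cost falls on the proxy server or on the browser.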
The website server 303 is configured to receive the web page access requests sent by the proxy server 302 and return the original web page information according to each request.
Thus, by analyzing the correspondence between the current URL and the ReferURL in a large number of web page access requests, embodiments of the invention can mine how a large number of users access the page elements in a page. This access behavior reflects the importance of each page element. The page elements are therefore scored for importance based on the access behavior of a large number of users, and the page is rearranged according to the scores, so that subsequent users can obtain or access the information in the page more efficiently.
The foregoing describes only preferred embodiments of the present invention and is not intended to limit it. Any modification, equivalent substitution or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (13)

1. A Web information processing method, characterized in that the method comprises:
parsing web access requests and obtaining, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested webpage and the URL information of the previously requested webpage (ReferURL), wherein the previously requested webpage is the webpage to be processed;
based on the correspondence, determining the access information of each web page element in the webpage to be processed according to the URL information of the webpage to be processed and the URL information of each web page element in it, the access information comprising: access count information of multiple users for each web page element, or access count and access time information of multiple users for each web page element;
determining the importance weight of each web page element according to the access information of each web page element in the webpage to be processed; and
adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights.
2. The method according to claim 1, characterized in that determining the importance weight of each web page element according to the access information of each web page element in the webpage to be processed comprises:
grouping the web page elements in the webpage to be processed according to the DOM structure relationships and/or semantic relationships among them, and determining a composite importance weight for each group according to the access information of the web page elements in that group;
and adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights comprises:
adjusting, according to the composite importance weight of each group, the layout information of each group in the webpage to be processed, with each group treated as a whole.
3. The method according to claim 1, characterized in that determining the importance weight of each web page element according to the access information of each web page element in the webpage to be processed comprises:
determining the importance weight of each individual web page element according to the access information of each web page element in the webpage to be processed;
and adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights comprises:
adjusting the layout information of each individual web page element in the webpage to be processed according to the importance weight of that element.
4. The method according to claim 1, 2 or 3, characterized in that determining the importance weight of each web page element according to the access information of each web page element in the webpage to be processed comprises:
updating the importance weights of the web page elements in real time when the access information of the web page elements in the webpage to be processed changes; or determining the importance weights of the web page elements offline, periodically or aperiodically, according to the access information of each web page element in the webpage to be processed;
and/or adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights comprises:
adjusting the layout information of the web page elements in the webpage to be processed in real time when the importance weights change; or adjusting the layout information of the web page elements in the webpage to be processed offline, periodically or aperiodically, according to the importance weights.
5. The method according to claim 1, 2 or 3, characterized in that adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights comprises:
a server adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights and sending the adjusted information of the webpage to be processed to a browser, the browser outputting the webpage to be processed according to the adjusted information;
or the server sending the importance weights to the browser, the browser adjusting the layout information of the web page elements in the webpage to be processed according to the importance weights and outputting the webpage to be processed according to the adjusted information.
6. The method according to claim 1, 2 or 3, characterized in that the method further comprises:
caching the adjusted layout information of the web page elements in the webpage to be processed, or the importance weights of the web page elements in the webpage to be processed, and, upon receiving a web access request for the webpage to be processed, returning the cached adjusted layout information or the cached importance weights.
7. A Web information processing device, characterized in that the device comprises a correspondence obtaining module, an access information determining module, an importance weight determining module, and a layout information adjusting module;
the correspondence obtaining module is configured to parse web access requests and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested webpage and the URL information of the previously requested webpage (ReferURL), wherein the previously requested webpage is the webpage to be processed;
the access information determining module is configured to determine, based on the correspondence, the access information of each web page element in the webpage to be processed according to the URL information of the webpage to be processed and the URL information of each web page element in it, the access information comprising: access count information of multiple users for each web page element, or access count and access time information of multiple users for each web page element;
the importance weight determining module is configured to determine the importance weight of each web page element according to the access information of each web page element in the webpage to be processed;
the layout information adjusting module is configured to adjust the layout information of the web page elements in the webpage to be processed according to the importance weights.
8. The device according to claim 7, characterized in that:
the importance weight determining module is configured to group the web page elements in the webpage to be processed according to the DOM structure relationships and/or semantic relationships among them, and to determine a composite importance weight for each group according to the access information of the web page elements in that group;
the layout information adjusting module is configured to adjust, according to the composite importance weight of each group, the layout information of each group in the webpage to be processed, with each group treated as a whole.
9. The device according to claim 7, characterized in that:
the importance weight determining module is configured to determine the importance weight of each individual web page element according to the access information of each web page element in the webpage to be processed;
the layout information adjusting module is configured to adjust the layout information of each individual web page element in the webpage to be processed according to the importance weight of that element.
10. The device according to claim 7, 8 or 9, characterized in that:
the importance weight determining module is configured to update the importance weights of the web page elements in real time when the access information of the web page elements in the webpage to be processed changes, or to determine the importance weights of the web page elements offline, periodically or aperiodically, according to the access information of each web page element in the webpage to be processed;
and/or the layout information adjusting module is configured to adjust the layout information of the web page elements in the webpage to be processed in real time when the importance weights change, or to adjust that layout information offline, periodically or aperiodically, according to the importance weights.
11. The device according to claim 7, 8 or 9, characterized in that the correspondence obtaining module, the access information determining module, and the importance weight determining module are located on the server side, and the layout information adjusting module is located on the server side or on the browser side.
12. A Web information processing system, characterized in that the system comprises a browser, a proxy server, and a website server;
the browser is configured to send web access requests to the proxy server, receive the webpage adjustment information returned by the proxy server, and output the adjusted webpage content according to the webpage adjustment information;
the proxy server is configured to: receive the web access request sent by the browser and obtain original webpage information from the website server according to that request; parse the web access requests sent by the browser and obtain, from the parsing result, the correspondence between the uniform resource locator (URL) information of the currently requested webpage and the URL information of the previously requested webpage (ReferURL), wherein the previously requested webpage is the webpage to be processed; based on the correspondence, determine the access information of each web page element in the webpage to be processed according to the URL information of the webpage to be processed and the URL information of each web page element in it; determine the importance weight of each web page element according to that access information; and return webpage adjustment information to the browser according to the importance weights; the access information comprising: access count information of multiple users for each web page element, or access count and access time information of multiple users for each web page element;
the website server is configured to receive the web access request sent by the proxy server and to return original webpage information according to that request.
13. The system according to claim 12, characterized in that:
the webpage adjustment information comprises the importance weight information of the web page elements, and the browser adjusts the layout information of the web page elements in the original webpage information according to the importance weight information and outputs the adjusted webpage content;
or the webpage adjustment information comprises the adjusted webpage content, the proxy server adjusts the layout information of the web page elements in the original webpage information according to the importance weights of the web page elements and sends the adjusted webpage content to the browser, and the browser directly outputs the adjusted webpage content.
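The group-level weighting recited in claims 2 and 8 can be illustrated with a short Python sketch. It assumes that elements have already been assigned to groups (for example by a shared DOM parent; the hypothetical mapping `element_groups` stands in for that step) and that a group's composite weight is simply the sum of its elements' access counts. The claims do not fix either choice.

```python
from collections import defaultdict


def group_weights(element_groups, access_counts):
    """Sum access counts per group; element_groups maps element URL -> group id."""
    totals = defaultdict(int)
    for element_url, group_id in element_groups.items():
        totals[group_id] += access_counts.get(element_url, 0)
    return dict(totals)


def order_groups(weights):
    """Return group ids with the highest composite weight first."""
    return sorted(weights, key=weights.get, reverse=True)
```

Moving each group as a whole keeps related elements (such as a navigation bar's links) together even when their individual access counts differ.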
CN201410021353.XA 2014-01-17 2014-01-17 A kind of Web information processing methods, devices and systems Active CN104794118B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410021353.XA CN104794118B (en) 2014-01-17 2014-01-17 A kind of Web information processing methods, devices and systems

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410021353.XA CN104794118B (en) 2014-01-17 2014-01-17 A kind of Web information processing methods, devices and systems

Publications (2)

Publication Number Publication Date
CN104794118A CN104794118A (en) 2015-07-22
CN104794118B true CN104794118B (en) 2019-03-26

Family

ID=53558915

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410021353.XA Active CN104794118B (en) 2014-01-17 2014-01-17 A kind of Web information processing methods, devices and systems

Country Status (1)

Country Link
CN (1) CN104794118B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106776634A (en) * 2015-11-23 2017-05-31 北京搜狗科技发展有限公司 A kind of method for network access, device and terminal device
CN106599146B (en) * 2016-12-06 2021-05-25 腾讯科技(深圳)有限公司 Cache page processing method and device and cache page updating request processing method and device
CN110674431A (en) * 2019-08-29 2020-01-10 北京浪潮数据技术有限公司 Front-end page display method and device
CN110704787A (en) * 2019-10-15 2020-01-17 支付宝(杭州)信息技术有限公司 Page template configuration method and device and electronic equipment
CN114139078A (en) * 2021-11-29 2022-03-04 中国平安财产保险股份有限公司 Method and device for extracting elements in webpage, computer equipment and readable storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101115068A (en) * 2007-07-19 2008-01-30 潘晓梅 Method and system for adjusting web page display contents on client terminal
CN101446983A (en) * 2009-01-12 2009-06-03 腾讯科技(深圳)有限公司 Method, system and equipment for realizing web page acquisition by mobile terminal
CN101944104A (en) * 2010-08-19 2011-01-12 百度在线网络技术(北京)有限公司 Evaluation method and equipment for importance of webpage sub-blocks
CN102368193A (en) * 2011-08-26 2012-03-07 百度在线网络技术(北京)有限公司 Method and device for providing browsed pages
CN102420842A (en) * 2010-09-28 2012-04-18 腾讯科技(深圳)有限公司 Method and system for sending webpage in mobile network
CN103064845A (en) * 2011-10-20 2013-04-24 北京中搜网络技术股份有限公司 Website information processing device and website information processing method
CN103166981A (en) * 2011-12-08 2013-06-19 腾讯科技(深圳)有限公司 Wireless webpage transcoding method and device

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7058691B1 (en) * 2000-06-12 2006-06-06 Trustees Of Princeton University System for wireless push and pull based services
US20100185684A1 (en) * 2009-01-09 2010-07-22 Amit Madaan High precision multi entity extraction
CN102262629A (en) * 2010-05-24 2011-11-30 腾讯科技(深圳)有限公司 Method and equipment for determining weights of link characters in page
CN102591887B (en) * 2011-01-18 2016-07-06 腾讯科技(深圳)有限公司 Network data pre-head method and system

Also Published As

Publication number Publication date
CN104794118A (en) 2015-07-22

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20221115

Address after: 1402, Floor 14, Block A, Haina Baichuan Headquarters Building, No. 6, Baoxing Road, Haibin Community, Xin'an Street, Bao'an District, Shenzhen, Guangdong 518133

Patentee after: Shenzhen Yayue Technology Co.,Ltd.

Address before: 2, 518044, East 403 room, SEG science and Technology Park, Zhenxing Road, Shenzhen, Guangdong, Futian District

Patentee before: TENCENT TECHNOLOGY (SHENZHEN) Co.,Ltd.