CN102035883A - Method and device for optimizing webpage in network equipment - Google Patents

Method and device for optimizing webpage in network equipment Download PDF

Info

Publication number
CN102035883A
CN102035883A CN2010105697822A CN201010569782A CN102035883A CN 102035883 A CN102035883 A CN 102035883A CN 2010105697822 A CN2010105697822 A CN 2010105697822A CN 201010569782 A CN201010569782 A CN 201010569782A CN 102035883 A CN102035883 A CN 102035883A
Authority
CN
China
Prior art keywords
information unit
info web
user
information
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010105697822A
Other languages
Chinese (zh)
Other versions
CN102035883B (en
Inventor
朱晋良
邢皖甲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201010569782.2A priority Critical patent/CN102035883B/en
Publication of CN102035883A publication Critical patent/CN102035883A/en
Application granted granted Critical
Publication of CN102035883B publication Critical patent/CN102035883B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a method and a device for optimizing a webpage in network equipment. The method comprises the steps of: obtaining first webpage information to be treated; analyzing various information units included in the first webpage information to determine the class to which each information unit belongs; and based on a first preset rule, by combining with the classes of the information units, realizing the purpose of converting the first webpage information into second webpage information used for providing for the user equipment. Compared with the prior art, the invention has the advantages of: being capable of remarkably displaying the contents concerned by a user to reduce the searching time of the user, being capable of shielding the advertisement contents and the contents which are not concerned by the user to bring better webpage browsing experience, being capable of removing the redundant contents in the webpage and reducing the loading time of the webpage, and being capable of regulating the webpage structure and quickening the composing speed of the webpage.

Description

A kind of method and apparatus that in the network equipment, is used to optimize webpage
Technical field
The present invention relates to computer networking technology, relate in particular to a kind of method and apparatus that in the network equipment, is used to optimize webpage.
Background technology
Nowadays, by various subscriber equipment browsing pages, become the part in majority's life, yet, along with Internet development, the information that comprises in the webpage is more and more, make the user have to require efforts in webpage, to search the information of oneself needs, and, in order to create profit, often be mingled with more advertisement in the various webpages that the website provides, influencing browsing of user.In addition, because that the webpage of part website is write is improper, also can cause the problem that user's webpage heap(ed) capacity is bigger than normal, the webpage formation speed is slower.
In the prior art, provide the method for shielding advertising message, yet these class methods are often only carried out the advertisement shielding by shielding simple means such as unsteady element, intercepting pop-up window, not only shield effectiveness a little less than, also might shield the information that the user needs.The improper webpage heap(ed) capacity that causes is bigger than normal, webpage composing speed waits problem more slowly and write for webpage, and prior art does not provide effective solution as yet.
Summary of the invention
The purpose of this invention is to provide a kind of method and apparatus that in the network equipment, is used to optimize webpage.
According to an aspect of the present invention, provide the method that is used to optimize webpage in a kind of network equipment, wherein, this method may further comprise the steps:
A obtains the first pending info web;
B analyzes each information unit that described first info web is comprised, with the classification under definite described each information unit;
C in conjunction with the classification of described each information unit, is converted to described first info web second info web that is used to offer described subscriber equipment based on first pre-defined rule.
According to another aspect of the present invention, also provide a kind of network equipment that is used to optimize webpage, wherein, this network equipment comprises:
Deriving means, be used to obtain the described first pending info web;
The category analysis device, be used to analyze each information unit that described first info web is comprised, to determine the classification under described each information unit;
Conversion equipment, be used for,, described first info web be converted to second info web that is used to offer described subscriber equipment in conjunction with the classification of described each information unit based on first pre-defined rule.
Compared with prior art, the present invention has the following advantages: 1) can highlight the content that the user pays close attention to, reduce the time that the user searches; 2) can shield the content that ad content and user do not pay close attention to, bring better web page browsing to experience; 3) can remove redundant content in the webpage, reduce the load time of webpage; 4) can adjust structure of web page, accelerate the composing speed of webpage.
Description of drawings
By reading the detailed description of doing with reference to the following drawings that non-limiting example is done, it is more obvious that other features, objects and advantages of the present invention will become:
Fig. 1 is the grid topological diagram that is used to optimize webpage of one aspect of the invention;
Fig. 2 is the grid topological diagram that is used to optimize webpage of a preferred embodiment of the invention;
Fig. 3 is the flow chart of method that is used to optimize webpage of one aspect of the invention;
Fig. 4 is the flow chart of method that is used to optimize webpage of a preferred embodiment of the invention;
Fig. 5 is the flow chart of method that is used to optimize webpage of another preferred embodiment of the present invention;
Fig. 6 is the flow chart of method that is used to optimize webpage of another preferred embodiment of the present invention;
Fig. 7 is the network equipment structure chart that is used to optimize webpage of one aspect of the invention;
Fig. 8 is the network equipment structure chart that is used to optimize webpage of a preferred embodiment of the invention;
Fig. 9 is the network equipment structure chart that is used to optimize webpage of another preferred embodiment of the present invention;
Figure 10 is the network equipment structure chart that is used to optimize webpage of another preferred embodiment of the present invention;
Same or analogous Reference numeral is represented same or analogous parts in the accompanying drawing.
Embodiment
Below in conjunction with accompanying drawing the present invention is described in further detail.
Fig. 1 is the grid topological diagram that is used to optimize webpage of one aspect of the invention.The user is undertaken alternately by the subscriber equipment 1 and the network equipment 2, and the network equipment 2 obtains info web according to user's interbehavior, and with after this info web that obtains optimization, offers the user via subscriber equipment 1.Wherein, subscriber equipment 1 includes but not limited to: computer, smart mobile phone, PDA or IPTV.The network equipment 2 includes but not limited to: the server group that single network server, a plurality of webserver are formed or based on the cloud that is made of a large amount of computers or the webserver of cloud computing (Cloud Computing), wherein, cloud computing is a kind of of Distributed Calculation, a super virtual machine of being made up of the loosely-coupled computer collection of a group.
Fig. 2 is the grid topological diagram that is used to optimize webpage of a preferred embodiment of the invention.In the present embodiment, the network equipment 2 is further divided into web equipment and optimizing equipment.The user is undertaken alternately by subscriber equipment 1 and web equipment, web equipment is according to user's interbehavior, obtain info web, and this info web sent to optimizing equipment, optimizing equipment feeds back to web equipment after this info web is optimized, and the info web after web equipment will be optimized again offers subscriber equipment 1, so that subscriber equipment 1 is presented to the user according to this info web with webpage.Wherein, subscriber equipment 1 includes but not limited to: computer, smart mobile phone, PDA or IPTV.Web equipment and optimization are started to write and are included but be not limited to: the server group that single network server, a plurality of webserver are formed or based on the cloud that is made of a large amount of computers or the webserver of cloud computing (Cloud Computing), wherein, cloud computing is a kind of of Distributed Calculation, a super virtual machine of being made up of the loosely-coupled computer collection of a group.
See also Fig. 1 and Fig. 3, Fig. 3 is the flow chart of method that is used to optimize webpage of one aspect of the invention.
In step S1, the user can import first request with the interactive device that subscriber equipment 1 carries out man-machine interaction by any, wherein, this first request is used to subscriber equipment 1 request to handle the first pending info web, for example, be used to subscriber equipment 1 request user to wish the info web of browsing, perhaps, be used to subscriber equipment 1 request to be stored on the subscriber equipment 1 but the info web that needs to optimize etc.Wherein, this interactive device can be keyboard, mouse, remote controller, touch pad or voice-operated device etc., and the user can inform that subscriber equipment 1 sends described request by carrying out the predetermined operations mode.For example, be example with the touch plate type human-computer interaction device, the user is by the touch-control touch pad; select certain shown web page interlinkage of subscriber equipment 1, more for example, user 1 is by sliding with default track on touch pad; to open and the corresponding webpage of track that should preset, for example, homepage etc.Certainly, it should be appreciated by those skilled in the art that above-mentioned interactive device only just exemplifies, but not be used to limit the present invention, in fact, other interactive device or modes that can supply the user to be used to the request of importing also all are applicable to the present invention, and be contained in this with way of reference, do not give unnecessary details and do not do.
Then, in step S2, described subscriber equipment 1 is sent to the described network equipment 2 with first request of described user's input.Wherein, the information transmit-receive between the subscriber equipment 1 and the network equipment 2 is undertaken by network, and this network includes but not limited to: 1) cable network; 2) wireless network; 3) local area network (LAN); 4) wide area network; 5) VPN network; 6) wireless self-organization network (AdHoc network) etc.
Then, in step S3, the network equipment 2 obtains the first pending info web.The mode that the network equipment 2 obtains the first pending info web comprises multiple:
1) in first request that subscriber equipment 1 sends, comprise the full content of first info web, after then the network equipment 2 obtains this first request, direct extraction first info web from this first request;
2) in first request that subscriber equipment 1 sends, only comprise the chained address of first info web, after then the network equipment 2 obtains this first request, from described first request, extract the chained address of described pending first info web, according to described chained address, from corresponding website, obtain the described first pending info web again.
Then, in step S4, the network equipment 2 is analyzed each information unit that described first info web is comprised, with the classification under definite described each information unit.
Particularly, 2 pairs of described first info webs of the network equipment are analyzed, and identifying needs the information unit handled in first info web, and by analyzing the factor relevant with information unit, determine the classification that each information unit is affiliated.
Wherein, the network equipment 2 can be determined the classification of described information unit according to following at least one factor:
1) identifier of described information unit;
Particularly, the network equipment 2 is distinguished information unit according to the identifier that is comprised in first info web, and judges the classification under the information unit.
For example, if the network equipment 2 detects identifier "<title〉", then the network equipment 2 judges that the content between two identifiers "<title〉" is an information unit, and this information unit is a title; Again for example, if the network equipment 2 detects identifier "/* " or " // ", then the network equipment 2 judge "/* " or " // " to "; " between content be an information unit, this information unit is note unit etc.
2) content of text of described information unit;
Particularly, the network equipment 2 is distinguished information unit according to the identifier that is comprised in first info web, subsequently, according to the content of text of this information unit, judges the classification that information unit is affiliated.
For example, the network equipment 2 is the advertising words coupling that comprises in the content of text in the information unit and the default advertisement dictionary, if the match is successful, for example coupling obtains " Welcome " etc., judges that then this information unit is an advertising unit.
3) position of described information unit in described first info web;
Particularly, the network equipment 2 is distinguished information unit according to the identifier that is comprised in first info web, and subsequently, the network equipment 2 is judged the classification that information unit is affiliated by the position of this information unit in first info web;
For example, the network equipment 2 analysis obtains surpassing some, and the information unit that structure is close is positioned at 1/5 position behind first info web, judges that then this information unit is an advertising unit.
4) information of the information unit relevant with described unit;
Particularly, the network equipment 2 is distinguished information unit according to the identifier that is comprised in first info web, subsequently, the network equipment 2 by search with this information unit have identical identifier information unit classification or search the classification of information unit of and structural similarity close or the content that comprises with this information unit position, judge the classification under this information unit.Wherein, described structural similarity is meant that part identical in two information units surpasses a predetermined threshold, for example, surpasses 50% etc.At this, those skilled in the art should come to determine a rational predetermined threshold according to the actual requirements.
For example, the network equipment 2 at first finds its last information unit when judging an information unit; Subsequently, with its with treat that the information judged unit compares, when both identifiers are identical, and both text matches degree are higher than a predetermined threshold, judge that then the classification of this information unit is identical with the classification of last information unit.
Need to prove, the network equipment 2 is in the process of the classification of judging information unit, can amid all these factors judge, for example, when the network equipment 2 retrieves the content of text of information unit and the advertising words in the advertisement dictionary is complementary, then further judge position and this information unit adjacent information unit that whether have structural similarity of this information unit in first webpage again, if this information unit is positioned at 1/5 position behind first info web, and information unit with structural similarity, judge that then this information unit is an advertising unit, if this information unit is positioned at the centre position of the first info web 1/3-2/3, and this information unit does not have the adjacent information unit with its structural similarity, judges that then this information unit is not an advertising unit etc.
What need further specify is, above-mentioned for example only for technical scheme of the present invention is described better, but not to the restriction that the present invention did, those skilled in the art should understand that, anyly determine information unit class method for distinguishing by Essential Elements Of Analysis, all should be within the scope of the present invention.
Then, in step S5, the network equipment 2 in conjunction with the classification of described each information unit, is converted to described first info web second info web that is used to offer described subscriber equipment based on first pre-defined rule.
Particularly, but the network equipment 2 is carried out corresponding operating according to the classification of the information unit that is write down in first pre-defined rule and the corresponding relation between the executable operations, so that described first info web is converted to second info web.
For example, set for the css unit in first pre-defined rule, when it is positioned at the original position of first info web, do not operate on it; When it is positioned at other positions of first info web, it is moved to the original position of first info web.When then the classification that detects information unit when the network equipment 2 is the css unit,,, determine whether to carry out with of the operation of css cell moving to original position in conjunction with the current location of css unit according to the rule in first pre-defined rule.Because the structure of css cell influence webpage, and browser in generating the process of webpage normally the content according to first info web from first to last generate, therefore, by the css unit is preposition, can avoid browser after generating a part of webpage, owing to detect the css unit, therefore need regenerate the problem of webpage, accelerated the speed of browser generation webpage.
Need to prove, come the mode of adjustment information cell position according to the classification of information unit, not with the above-mentioned limit that is exemplified as, those skilled in the art should understand that, so long as according to the classification of information unit, the scheme that the information unit that influences structure of web page is preposition all should be within the scope of the present invention.
Again for example, set in first pre-defined rule, deletion note unit is when then the classification that detects information unit when the network equipment 2 is the note unit, with the note element deletion.Do not generate because note does not influence webpage, therefore,, can reduce the time of browser Web page loading content, reduced the flow that the user need download yet, accelerated the speed that webpage presents the note deletion.
Need to prove that the mode of deleting information unit according to the classification of information unit is not with the above-mentioned limit that is exemplified as, those skilled in the art should understand that, so long as according to the classification of information unit, deletion does not influence the scheme of the information unit that webpage generates, all should be within the scope of the present invention.
When the network equipment 2 finish all of first info web are handled after, first info web after this is handled is as second info web.
What need further specify is, according to first pre-defined rule, combining information unit classification, described first info web is converted to the processing method of second info web that is used to offer described subscriber equipment, not with the above-mentioned limit that is exemplified as, for example, processing method also can comprise the shielding rubbish information unit, highlight text unit and header cell or the like.
What need illustrate further is, there is no sequencing between step S4 and the step S5, and the network equipment 2 can promptly be carried out corresponding operation after information unit classification of every judgement, also can judge the classification of all information units after, carry out corresponding operation again.
In step S6, the network equipment 2 sends to subscriber equipment 1 with second info web.
In step S7, subscriber equipment 1 generates webpage to present to the user according to second info web.
See also Fig. 2 and Fig. 3, as a preferred embodiment of the present invention, the network equipment 2 can further comprise web equipment and optimizing equipment.
In the present embodiment, step S1 describes in detail in reference Fig. 1 and embodiment shown in Figure 3, and is contained in this by reference, repeats no more.
In step S2, subscriber equipment 1 is sent to web equipment with first request.Its send mode and above same or similar with reference to the corresponding steps S2 among Fig. 1 and the described embodiment of Fig. 3, and be contained in this by reference, repeat no more.
In step S3, web equipment is according to first acquisition request, first info web.Its obtain manner and same or similar with reference to the corresponding steps S3 among Fig. 1 and the described embodiment of Fig. 3, and be contained in this by reference, do not repeat them here.
Subsequently, web equipment sends to optimizing equipment with first info web, and optimizing equipment is obtained this first pending info web.
Then, optimizing equipment is carried out aforementioned with reference to step S4 and step S5 among Fig. 1 and the embodiment shown in Figure 3, and first info web is treated to second info web.
Then, optimizing equipment sends to web equipment with second info web, and web equipment is execution in step S6 again, and second info web is offered subscriber equipment 1.
At last, subscriber equipment 1 execution in step S7 according to second info web, generates webpage to present to the user.
Fig. 4 is the flow chart of method that is used to optimize webpage of a preferred embodiment of the invention.In the present embodiment.In the present embodiment, step S4 can be finished by the network equipment 2 or the optimizing equipment that is contained in the network equipment 2, and wherein, step S4 further comprises step S41 and step S42.
Step S1 is described in detail in reference Fig. 1 and Fig. 3 or Fig. 2 and embodiment shown in Figure 3 to step S3, and is contained in this by reference, repeats no more.
In step S41, the network equipment 2 carries out matching inquiry according to the chained address of described first info web in ATL, to obtain corresponding classification recognition template.
Particularly, each classification recognition template and the chained address corresponding have been comprised in the ATL with this each classification recognition template, the network equipment 2 mates the chained address of first info web and the chained address in the ATL, obtains the classification recognition template that can successfully mate.Wherein, when the network equipment 2 can successfully match a plurality of chained address, select the highest pairing classification recognition template in chained address of matching degree.
Wherein, matching degree can be calculated according to the similarity degree between the form of expression of two chained addresses, and this form of expression includes but not limited to based on http, https, ftp, the URL address of tencent agreement or IP address, MAC Address etc.For example, the chained address of first info web shows as following URL address: Http:// news.sina.com.cn/society, the network equipment 2 successfully matches a plurality of links in ATL:
www.sina.com.cn
http://finance.sina.com.cn/stock/
http://mobile.sina.com.cn/
Http:// news.sina.com.cn/s/sd/And
http://news.sina.com.cn/society
Wherein, can determine the highest being linked as of chained address form of expression matching degree with first info web according to similarity of character string Http:// news.sina.com.cn/society, this link corresponding " classification recognition template one ", then the network equipment 2 is selected " classification recognition template one " conduct and the corresponding classification recognition template of first info web.
In step S42, each information unit that the network equipment 2 is comprised according to first info web, and, determine the classification that described each information unit is affiliated in conjunction with described classification recognition template.
Particularly, on the basis of first pre-defined rule institute reference factor, the information that the network equipment 2 is further provided according to the classification recognition template comes information unit is carried out the stronger identifying operation of specific aim in conjunction with previous embodiment, below with reference to the above-mentioned reference factor, described in detail:
1) identifier of described information unit;
The network equipment 2 is judged the classification that information unit is affiliated in conjunction with the represented implication of the identifier that writes down in the classification recognition template.
For example, record in " classification recognition template one ", identifier " [ad] " expression advertisement, then the network equipment 2 judges that identifier is an advertising unit for the information unit of " [ad] ".
2) content of text of described information unit;
The network equipment 2 is judged the classification that information unit is affiliated in conjunction with the relevant information of the content of text that writes down in the classification recognition template.
For example, record in " classification recognition template one ", when a text number of words that information unit comprised surpassed a predetermined threshold value, this information unit was important information unit, then the network equipment 2 is declared this information unit for highlighting the unit.
3) position of described information unit in described first info web;
The network equipment 2 is in conjunction with the position of the information unit that writes down in the classification recognition template and the corresponding relation of classification under it, judges the classification under the information unit.
For example, record in " classification recognition template one " is positioned at that the content of 1/3 position is an advertising message behind first info web, and then the network equipment 2 judgements are positioned at that the information unit of 1/3 position is an advertising unit behind first info web.
4) information of the information unit relevant with described information unit;
For example, record in " classification recognition template one ", when existing when surpassing the close information unit in 4 structural similarities and position, this information unit is the information unit that is used for commending contents, then the network equipment 2 judges that such information units are recommendation unit.
Need to prove, the network equipment 2 is in the process of the classification of judging information unit, can amid all these factors judge, for example, record in " classification recognition template one " when existing when surpassing the close information unit in 4 structural similarities and position, needs further judge according to the residing position of information unit, if the residing position of information unit is in first info web in forward 1/2 to 3/4 the position, then this information unit is a recommendation unit; If the residing position of information unit is to lean in the position of back 1/5 in first info web, then this information unit is an advertising unit etc.
What need further specify is, above-mentioned for example only for technical scheme of the present invention is described better, but not to restriction that the present invention did, those skilled in the art should understand that, any by determine the class method for distinguishing of information unit in conjunction with classification recognition template and factor analysis, all should be within the scope of the present invention.
Step S5 is described in detail in reference Fig. 1 and embodiment shown in Figure 3 to step S7, comprises by reference at this, repeats no more.
Preferably, present embodiment also comprises feedback information and/or described second info web via described subscriber equipment transmission according to the user, determines the step of classification recognition template to be updated or to be set up.
Particularly, after subscriber equipment 1 will be presented to the user based on the webpage that second info web generates, the user can send feedback information via subscriber equipment 1 to the network equipment 2 once more by man-machine interaction, and this feedback information comprises the satisfaction that the user optimizes for webpage.The feedback information of the network equipment 2 recording users, and the classification recognition template that adopted of second info web of selecting user's evaluation of estimate to be lower than a predetermined threshold are with as classification recognition template to be updated; Perhaps, if this second info web do not adopt the classification recognition template, the chained address of the network equipment 2 these second info webs of record is then set up and the corresponding classification recognition template in this chained address in ATL determining.
Fig. 5 is the flow chart of method that is used to optimize webpage of another preferred embodiment according to the present invention.In the present embodiment, step S4 further comprises step S4 ', and step S4 ' can be finished by the network equipment 2 or the optimizing equipment that is contained in the network equipment 2.
Step S1 is described in detail in reference Fig. 1 and Fig. 3 or Fig. 2 and embodiment shown in Figure 3 to step S3, and is contained in this by reference, repeats no more.
At step S4, in, the network equipment 2 is by analyzing each information unit that described first info web is comprised in conjunction with user related information, with the classification under definite described each information unit.Wherein, the network equipment 2 obtains this user's user related information by the identification user identity, and the network equipment 2 can be discerned user identity according to following mode: the 1) unique identifier of subscriber equipment 1, for example, the hardware identification code of cell-phone number, subscriber equipment etc.; 2) user's log-on message; 3) be recorded in information among the subscriber equipment cookie etc.User related information can be kept in the network equipment 2, and perhaps, user related information is kept in the subscriber equipment 1, and is obtained by the network equipment 2, and perhaps, the network equipment 2 comprehensively is kept at the information in the subscriber equipment 1 and the network equipment 2, obtains user related information.
Wherein, described user related information can initiatively be provided by the user, or the network equipment obtains according to the user behavior supposition of writing down.The network equipment 2 can be in conjunction with following at least one user related information, the classification of coming the analytical information unit:
1) user's personal attribute comprises age, sex, identity, income, education degree of user etc.;
2) user's preference setting comprises the preference setting of shielding web page content, and the preference that highlights web page contents is provided with etc.;
3) user's historical behavior comprises that the user browses, the behavior record of webpage clicking etc.;
4) user's environmental information, positional information, user's current information of time and the subscriber equipment relevant information etc. that comprise the user place, wherein, the subscriber equipment relevant information includes but not limited to: Virtual network operator, user device type, IMEI, user facility operation system information, screen resolution, software information etc.
For example, be the women when user related information comprises this user, then the network equipment 2 judgements comprise the information unit of vocabulary such as " clothes ", " shopping " for highlighting the unit.
Again for example, highlight title when the user is provided with in preference is provided with, then the network equipment 2 is judged as detected header cell and highlights the unit.
Again for example, when only comprising this user, the user behavior that is write down clicks the behavior of opening webpage by the news pages homepage of Sina website in a Preset Time length, and there is not the behavior that this user further clicks on the webpage of opening, then the network equipment 2 can be judged the only text in the browsing page of this user based on the user behavior that is write down, so other information units beyond the text can be defined as ignoring the unit.
Again for example, the IP address that the network equipment 2 is current according to subscriber equipment 1 judges that the user position is Shanghai, and then when comprising " Shanghai " in the content of text of information unit, the network equipment 2 can determine that this information unit is for highlighting the unit.
Step S5 is described in detail in reference Fig. 1 and embodiment shown in Figure 3 to step S7, is contained in this by reference, repeats no more.
Need to prove, in step S4 ', also can further comprise abovementioned steps S41 and S42,, determine the classification that information unit is affiliated with in conjunction with classification recognition template and user related information.
What need further specify is, above-mentioned for example only for the solution of the present invention is described better, but not limitation of the present invention, those skilled in the art should understand that, judge any other mode of classification under the information unit according to any other user related information and based on user related information, all should be within the scope of the present invention.
Fig. 6 is the flow chart of method that is used to optimize webpage of another preferred embodiment of the present invention.In the present embodiment, step S5 further comprises step S5 ', and step S5 ' can be finished by the network equipment 2 or the optimizing equipment that is contained in the network equipment 2.
Step S1 is described in detail in reference Fig. 1 and Fig. 3, Fig. 2 and Fig. 3, Fig. 4 or embodiment shown in Figure 5 to step S4, and is contained in this by reference, repeats no more.
In step S5 ', the network equipment 2 is according to described first pre-defined rule, and based on the classification of described each information unit, comes described each information unit is carried out corresponding operation, so that described first info web is converted to second info web.
Wherein, described first pre-defined rule comprises with reference to following at least one factor and determines corresponding operation:
1) but default described classification and the corresponding relation between the executable operations;
Particularly, in first pre-defined rule, but stipulated the pairing executable operations of each information unit classification, but the network equipment 2 is according to the corresponding relation between information unit classification and the executable operations, come each information unit is carried out corresponding operation, after all operations was finished, first info web after then will handling was as second info web.
For example, but first pre-defined rule has stipulated that note unit and the pairing executable operations of advertising unit are deletion action, then works as the network equipment 2 and detects the note unit, with this note element deletion;
Again for example, first pre-defined rule has been stipulated to be placed on original position when the css unit is not in the original position of info web, then when the network equipment 2 detects the css unit, detect residing position, css unit, when its position is not original position, it is moved to original position;
Again for example, first pre-defined rule has been stipulated to come the content of text that highlights in the unit is highlighted with red font, then detects when highlighting the unit when the network equipment 2, and the color form that highlights the content of text of unit is changed to redness;
Again for example, first pre-defined rule has stipulated that mark can ignore the unit, then detect to ignore the unit time when the network equipment 2, carry out mark to ignoring the unit, can ignore the unit for subscriber equipment 1 identification, then subscriber equipment 1 can be created on the described unit of ignoring in the webpage according to user's selection, presents to the user; Perhaps, shield this and can ignore the unit, be not presented to the user.
2) user related information;
Particularly, comprise in first pre-defined rule:, come information unit is carried out the rule of corresponding operating according to the classification of user related information and information unit.
For example, if the user stipulates to highlight mode with gray background to highlighting the unit in user preference is provided with, then the network equipment 2 background that will highlight the unit changes to grey;
Again for example, if the user is in surpassing a pre-determined number, can ignore the unit from non-selected presenting, then the network equipment 2 transparency that can ignore the unit is adjusted into 59%, to desalinate processing to ignoring the unit.
Need to prove that the network equipment 2 also can be according to first pre-defined rule, in conjunction with above-mentioned both, first info web is converted to second info web.For example, stipulate in first pre-defined rule, but the pairing executable operations in maskable unit comprises mark, deletion and desalination, need select an operation in conjunction with user related information, then when detecting the maskable unit, the network equipment 2 is selected shielding, deletion or desalination operation according to user related information.
What need further specify is, above-mentioned for example only for content of the present invention is described better, but not limitation of the present invention, those skilled in the art should understand that, come described each information unit is carried out corresponding operation according to first pre-defined rule, described first info web is converted to the scheme of second info web, all should be within the scope of the present invention.
Fig. 7 is the network equipment structure chart that is used to optimize webpage of one aspect of the invention.In the present embodiment, the network equipment 2 comprises deriving means 21, category analysis device 22 and conversion equipment 23.
The user can import first request with the interactive device that subscriber equipment 1 carries out man-machine interaction by any, wherein, this first request is used to subscriber equipment 1 request to handle the first pending info web, for example, be used to subscriber equipment 1 request user to wish the info web of browsing, perhaps, be used to subscriber equipment 1 request to be stored on the subscriber equipment 1 but the info web that needs to optimize etc.Wherein, this interactive device can be keyboard, mouse, remote controller, touch pad or voice-operated device etc., and the user can inform that subscriber equipment 1 sends described request by carrying out the predetermined operations mode.For example, be example with the touch plate type human-computer interaction device, the user is by the touch-control touch pad; select certain shown web page interlinkage of subscriber equipment 1, more for example, user 1 is by sliding with default track on touch pad; to open and the corresponding webpage of track that should preset, for example, homepage etc.Certainly, it should be appreciated by those skilled in the art that above-mentioned interactive device only just for example, but not be used to limit the present invention, in fact, other interactive device or modes that can supply the user to be used to the request of importing also all are applicable to the present invention, and be contained in this with way of reference, do not give unnecessary details and do not do.
Described subscriber equipment 1 is sent to the described network equipment 2 with first request of described user's input.Wherein, the information transmit-receive between the subscriber equipment 1 and the network equipment 2 is undertaken by network, and this network includes but not limited to: 1) cable network; 2) wireless network; 3) local area network (LAN); 4) wide area network; 5) VPN network; 6) wireless self-organization network (Ad Hoc network) etc.
Deriving means 21 obtains the first pending info web.The mode that deriving means 21 obtains the first pending info web comprises multiple:
1) deriving means 21 comprises first sub-deriving means (figure does not show) and the second sub-deriving means (figure does not show).The full content that in first request that subscriber equipment 1 sends, comprises first info web, after then the first sub-deriving means obtained this first request, the second sub-deriving means directly extracted first info web from this first request;
2) deriving means 21 comprises first sub-deriving means (figure does not show) and the second sub-deriving means (figure does not show), and the second sub-deriving means also further comprises extraction element (figure does not show) and the 3rd sub-deriving means (figure does not show).The chained address that in first request that subscriber equipment 1 sends, only comprises first info web, after then the first sub-deriving means obtains this first request, extraction element extracts the chained address of described pending first info web from described first request, the 3rd sub-deriving means obtains the described first pending info web again according to described chained address from corresponding website.
Category analysis device 22 is analyzed each information unit that described first info web is comprised, with the classification under definite described each information unit.
Particularly, 22 pairs of described first info webs of category analysis device are analyzed, and identifying needs the information unit handled in first info web, and by analyzing the factor relevant with information unit, determine the classification that each information unit is affiliated.
Wherein, category analysis device 22 can be determined the classification of described information unit according to following at least one factor:
1) identifier of described information unit;
Particularly, category analysis device 22 is distinguished information unit according to the identifier that is comprised in first info web, and judges the classification under the information unit.
For example, if category analysis device 22 detects identifier "<title〉", judge that then the content between two identifiers "<title〉" is an information unit, this information unit is a title; Again for example, if category analysis device 22 detects identifier "/* " or " // ", then judge "/* " or " // " to "; " between content be an information unit, this information unit is note unit etc.
2) content of text of described information unit;
Particularly, category analysis device 22 is distinguished information unit according to the identifier that is comprised in first info web, subsequently, according to the content of text of this information unit, judges the classification that information unit is affiliated.
For example, category analysis device 22 is the advertising words coupling that comprises in the content of text in the information unit and the default advertisement dictionary, if the match is successful, for example coupling obtains " Welcome " etc., judges that then this information unit is an advertising unit.
3) position of described information unit in described first info web;
Particularly, category analysis device 22 is distinguished information unit according to the identifier that is comprised in first info web, and subsequently, category analysis device 22 is judged the classification that information unit is affiliated by the position of this information unit in first info web;
For example, category analysis device 22 analysis obtains surpassing some, and the information unit that structure is close is positioned at 1/5 position behind first info web, judges that then this information unit is an advertising unit.
4) information of the information unit relevant with described unit;
Particularly, category analysis device 22 is according to the identifier that is comprised in first info web, distinguish information unit, subsequently, category analysis device 22 by search with this information unit have identical identifier information unit classification or search the classification of information unit of and structural similarity close or the content that comprises with this information unit position, judge the classification under this information unit.Wherein, described structural similarity is meant that part identical in two information units surpasses a predetermined ratio threshold value, for example, surpasses 50% etc.At this, those skilled in the art should come to determine a rational predetermined threshold according to the actual requirements.
For example, category analysis device 22 at first finds its last information unit when judging an information unit; Subsequently, with its with treat that the information judged unit compares, when both identifiers are identical, and both text matches degree are higher than a predetermined threshold, judge that then the classification of this information unit is identical with the classification of last information unit.
Need to prove, category analysis device 22 is in the process of the classification of judging information unit, can amid all these factors judge, for example, when category analysis device 22 retrieves the content of text of information unit and the advertising words in the advertisement dictionary is complementary, then further judge position and this information unit adjacent information unit that whether have structural similarity of this information unit in first webpage again, if this information unit is positioned at 1/5 position behind first info web, and information unit with structural similarity, judge that then this information unit is an advertising unit, if this information unit is positioned at the centre position of the first info web 1/3-2/3, and this information unit does not have the adjacent information unit with its structural similarity, judges that then this information unit is not an advertising unit etc.
What need further specify is, above-mentioned for example only for technical scheme of the present invention is described better, but not to the restriction that the present invention did, those skilled in the art should understand that, anyly determine the class method for distinguishing of information unit by Essential Elements Of Analysis, all should be within the scope of the present invention.
Conversion equipment 23 in conjunction with the classification of described each information unit, is converted to described first info web second info web that is used to offer described subscriber equipment based on first pre-defined rule.
Particularly, but conversion equipment 23 is carried out corresponding operating according to the classification of the information unit that is write down in first pre-defined rule and the corresponding relation between the executable operations, so that described first info web is converted to second info web.
For example, set for the css unit in first pre-defined rule, when it is positioned at the original position of first info web, do not operate on it; When it is positioned at other positions of first info web, it is moved to the original position of first info web.Then when category analysis device 22 judges that the classification that obtains information unit is the css unit,,, determine whether to carry out with of the operation of css cell moving to original position in conjunction with the current location of css unit according to the rule in first pre-defined rule.Because the structure of css cell influence webpage, and browser in generating the process of webpage normally the content according to first info web from first to last generate, therefore, by the css unit is preposition, can avoid browser after generating a part of webpage, owing to detect the css unit, therefore need regenerate the problem of webpage, accelerated the speed of browser generation webpage.
Need to prove, come the mode of adjustment information cell position according to the classification of information unit, not with the above-mentioned limit that is exemplified as, those skilled in the art should understand that, so long as according to the classification of information unit, the scheme that the information unit that influences structure of web page is preposition all should be within the scope of the present invention.
Again for example, set in first pre-defined rule, deletion note unit is then when category analysis device 22 judges that the classification that obtains information unit is the note unit, with the note element deletion.Do not generate because note does not influence webpage, therefore,, can reduce the time of browser Web page loading content, reduced the flow that the user need download yet, accelerated the speed that webpage presents the note deletion.
Need to prove that the mode of deleting information unit according to the classification of information unit is not with the above-mentioned limit that is exemplified as, those skilled in the art should understand that, so long as according to the classification of information unit, deletion does not influence the scheme of the information unit that webpage generates, all should be within the scope of the present invention.
When conversion equipment 23 finish all of first info web are handled after, with first info web after handling as second info web.
What need further specify is, according to first pre-defined rule, combining information unit classification, described first info web is converted to the processing method of second info web that is used to offer described subscriber equipment, not with the above-mentioned limit that is exemplified as, for example, processing method also can comprise the shielding rubbish information unit, highlight text unit and header cell or the like.
What need illustrate further is, the performed separately operation of category analysis device 22 and conversion equipment 23 there is no absolute sequencing, category analysis device 22 is after information unit classification of every judgement, conversion equipment 23 can be carried out corresponding operation, after also can working as the classification of category analysis device 22 all information units of judgement, conversion equipment 23 is carried out corresponding operation again.
The network equipment 2 sends to subscriber equipment 1 with second info web that conversion equipment 23 generates, and subscriber equipment 1 generates webpage to present to the user according to second info web.
As a preferred embodiment of the present invention, the network equipment 2 can further comprise web equipment and optimizing equipment.Then deriving means 22 is included in the web equipment, and category analysis device 22 and conversion equipment 23 are included in the optimizing equipment.
Subscriber equipment 1 is sent to web equipment with first request.Its send mode is describing in detail with reference among the embodiment shown in Figure 7, and is contained in this by reference, repeats no more.Deriving means 22 is according to first acquisition request, first info web.Its obtain manner with reference to describing in detail among the embodiment shown in Figure 7, and be contained in this by reference, repeat no more.
Subsequently, web equipment sends to optimizing equipment with first info web, and optimizing equipment is obtained this first pending info web.
Then, category analysis device 22 and conversion equipment 23 are treated to second info web with first info web.Category analysis device 22 and conversion equipment 23 are describing the mode that first info web is treated to second info web in detail with reference among the embodiment shown in Figure 7, and are contained in this by reference, repeat no more.
Then, optimizing equipment sends to web equipment with second info web, and web equipment offers subscriber equipment 1 with second info web again, and subscriber equipment 1 generates webpage to present to the user according to second info web.
Fig. 8 is the network equipment structure chart that is used to optimize webpage of a preferred embodiment of the invention.In the present embodiment, category analysis device 22 can be contained in the network equipment 2 or be contained in the optimizing equipment of the network equipment 2, and wherein, category analysis device 22 also further comprises matching inquiry device 221 and determines device 222.
Deriving means 21 and conversion equipment 23 are being described in detail with reference among the embodiment shown in Figure 7, and are contained in this by reference, repeat no more.
Matching inquiry device 221 carries out matching inquiry according to the chained address of described first info web in ATL 24, to obtain corresponding classification recognition template.
Particularly, each classification recognition template and the chained address corresponding have been comprised in the ATL 24 with this each classification recognition template, matching inquiry device 221 mates the chained address of first info web and the chained address in the ATL, obtains the classification recognition template that can successfully mate.Wherein, when matching inquiry device 221 can successfully match a plurality of chained address, select the highest pairing classification recognition template in chained address of matching degree.
Wherein, matching degree can be calculated according to the similarity degree between the form of expression of two chained addresses, and this form of expression includes but not limited to based on http, https, ftp, the URL address of tencent agreement or IP address, MAC Address etc.For example, the chained address of first info web shows as following URL address Http:// news.sina.com.cn/society, matching inquiry device 221 successfully matches a plurality of links in ATL 24:
www.sina.com.cn
http://finance.sina.com.cn/stock/
http://mobile.sina.com.cn/
Http:// news.sina.com.cn/s/sd/And
http://news.sina.com.cn/society
Wherein, can determine the highest being linked as of chained address form of expression matching degree with first info web according to similarity of character string Http:// news.sina.com.cn/society, this link corresponding " classification recognition template one ", then matching inquiry device 221 is selected " classification recognition template one " conduct and the corresponding classification recognition template of first info web.
Determine each information unit that device 222 is comprised according to first info web, and, determine the classification that described each information unit is affiliated in conjunction with described classification recognition template.
Particularly, in in conjunction with previous embodiment, on the basis of first pre-defined rule institute reference factor, determine the information that device 222 is further provided according to the classification recognition template, come information unit to carry out the stronger identifying operation of specific aim, below with reference to the above-mentioned reference factor, described in detail:
1) identifier of described information unit;
Determine device 222 in conjunction with the represented implication of the identifier that writes down in the classification recognition template, judge the classification that information unit is affiliated.
For example, record in " classification recognition template one ", identifier " [ad] " expression advertisement determines that then device 222 judgement identifiers are advertising unit for the information unit of " [ad] ".
2) content of text of described information unit;
Determine the relevant information of device 222, judge the classification that information unit is affiliated in conjunction with the content of text that writes down in the classification recognition template.
For example, record in " classification recognition template one ", when a text number of words that information unit comprised surpassed a predetermined threshold value, this information unit was important information unit, then definite device 222 is declared this information unit for highlighting the unit.
3) position of described information unit in described first info web;
Determine device 222 in conjunction with the position of the information unit that writes down in the classification recognition template and the corresponding relation of classification under it, judge the classification under the information unit.
For example, record in " classification recognition template one " is positioned at that the content of 1/3 position is an advertising message behind first info web, determines that then device 222 judgements are positioned at that the information unit of 1/3 position is an advertising unit behind first info web.
4) information of the information unit relevant with described information unit;
For example, record in " classification recognition template one ", when existing when surpassing the close information unit in 4 structural similarities and position, this information unit is the information unit that is used for commending contents, determines that then device 222 judges that such information units are recommendation unit.
Need to prove, determine that device 222 is in the process of the classification of judging information unit, can amid all these factors judge, for example, record in " classification recognition template one " when existing when surpassing the close information unit in 4 structural similarities and position, needs further judge according to the residing position of information unit, if the residing position of information unit is in first info web in forward 1/2 to 3/4 the position, then this information unit is a recommendation unit; If the residing position of information unit is to lean in the position of back 1/5 in first info web, then this information unit is an advertising unit etc.
What need further specify is, above-mentioned for example only for technical scheme of the present invention is described better, but not to restriction that the present invention did, those skilled in the art should understand that, any by determine the class method for distinguishing of information unit in conjunction with classification recognition template and factor analysis, all should be within the scope of the present invention.
Preferably, present embodiment also comprises updating device (figure does not show), and updating device is used for feedback information and/or described second info web via described subscriber equipment transmission according to the user, determines classification recognition template to be updated or to be set up.
Particularly, after subscriber equipment 1 will be presented to the user based on the webpage that second info web generates, the user can be once more by man-machine interaction, send feedback information via subscriber equipment 1 to the network equipment 2, this feedback information comprises the satisfaction that the user optimizes for webpage, the feedback information of updating device recording user, and the classification recognition template that adopted of second info web of selecting user's evaluation of estimate to be lower than a predetermined threshold are with as classification recognition template to be updated; Perhaps, if this second info web does not adopt the classification recognition template, then updating device writes down the chained address of this second info web, to determine foundation and the corresponding classification recognition template in this chained address in ATL.
Fig. 9 is the network equipment structure chart that is used to optimize webpage of another preferred embodiment according to the present invention.In the present embodiment, category analysis device 22 can be contained in the network equipment 2 or be contained in the optimizing equipment of the network equipment 2, and wherein, category analysis device 22 also further comprises subclass analytical equipment 223.
Deriving means 21 and conversion equipment 23 are being described in detail with reference among the embodiment shown in Figure 7, and are contained in this by reference, repeat no more.
Subclass analytical equipment 223 is by analyzing each information unit that described first info web is comprised in conjunction with user related information, with the classification under definite described each information unit.Wherein, the network equipment 2 obtains this user's user related information by the identification user identity, and the network equipment 2 can be discerned user identity according to following mode: the 1) unique identifier of subscriber equipment 1, for example, the hardware identification code of cell-phone number, subscriber equipment etc.; 2) user's log-on message; 3) be recorded in information among the subscriber equipment cookie etc.User related information can be kept in the network equipment 2, and perhaps, user related information is kept in the subscriber equipment 1, and is obtained by the network equipment 2, and perhaps, the network equipment 2 comprehensively is kept at the information in the subscriber equipment 1 and the network equipment 2, obtains user related information.
Wherein, described user related information can initiatively be provided by the user, or the network equipment obtains according to the user behavior supposition of writing down.Subclass analytical equipment 223 can be in conjunction with following at least one user related information, the classification of coming the analytical information unit:
1) user's personal attribute comprises age, sex, identity, income, education degree of user etc.;
2) user's preference setting comprises the preference setting of shielding web page content, and the preference that highlights web page contents is provided with etc.;
3) user's historical behavior comprises that the user browses, the behavior record of webpage clicking etc.;
4) user's environmental information, positional information, user's current information of time and the subscriber equipment relevant information etc. that comprise the user place, wherein, the subscriber equipment relevant information includes but not limited to: Virtual network operator, user device type, IMEI, user facility operation system information, screen resolution, software information etc.
For example, be the women when user related information comprises this user, then 223 judgements of subclass analytical equipment comprise the information unit of vocabulary such as " clothes ", " shopping " for highlighting the unit.
Again for example, highlight title when the user is provided with in preference is provided with, then subclass analytical equipment 223 is judged as detected header cell and highlights the unit.
Again for example, the user behavior that is write down in a time span of presetting as the user only comprises that this user clicks the behavior of opening webpage by the news pages homepage of new net, and there is not the behavior that this user further clicks on the webpage of opening, then subclass analytical equipment 223 can be judged the only text in the browsing page of this user based on the user behavior that is write down, so other information units beyond the text can be defined as ignoring the unit.
Again for example, the IP address that subclass analytical equipment 223 is current according to subscriber equipment 1 judges that the user position is Shanghai, and then when comprising " Shanghai " in the content of text of information unit, subclass analytical equipment 223 can determine that this information unit is for highlighting the unit.
Need to prove that subclass analytical equipment 223 also can further comprise matching inquiry device 221 and determine device 222, with in conjunction with classification recognition template and user related information, determines the classification that information unit is affiliated.
What need further specify is, above-mentioned for example only for the solution of the present invention is described better, but not limitation of the present invention, those skilled in the art should understand that, judge any other mode of classification under the information unit according to any other user related information and based on user related information, all should be within the scope of the present invention.
Figure 10 is the network equipment structure chart that is used to optimize webpage of another preferred embodiment of the present invention.In the present embodiment, conversion equipment 23 can be contained in the network equipment 2 or be contained in the optimizing equipment of the network equipment 2, and wherein, conversion equipment 23 also further comprises sub-conversion equipment 231.
Deriving means 21 and category analysis device 22 are described in detail in reference Fig. 7, Fig. 8 or embodiment shown in Figure 9, and are contained in this by reference, repeat no more.
Sub-conversion equipment 231 is according to described first pre-defined rule, and based on the classification of described each information unit, comes described each information unit is carried out corresponding operation, so that described first info web is converted to second info web.
Wherein, described first pre-defined rule comprises with reference to following at least one factor and determines corresponding operation:
1) but default described classification and the corresponding relation between the executable operations;
Particularly, in first pre-defined rule, but stipulated the pairing executable operations of each information unit classification, but sub-conversion equipment 231 is according to the corresponding relation between information unit classification and the executable operations, come each information unit is carried out corresponding operation, after all operations was finished, first info web after then will handling was as second info web.
For example, but first pre-defined rule has stipulated that note unit and the pairing executable operations of advertising unit are deletion action, and then group conversion equipment 231 detects the note unit, with this note element deletion;
Again for example, first pre-defined rule has been stipulated to be placed on original position when the css unit is not in the original position of info web, when then group conversion equipment 231 detects the css unit, detect residing position, css unit, when its position is not original position, it is moved to original position;
Again for example, first pre-defined rule has been stipulated to come the content of text that highlights in the unit is highlighted with red font, and then group conversion equipment 231 detects when highlighting the unit, and the color form that highlights the content of text of unit is changed to redness;
Again for example, first pre-defined rule has stipulated that mark can ignore the unit, then group conversion equipment 231 detects in the time of can ignoring the unit, carry out mark to ignoring the unit, can ignore the unit for subscriber equipment 1 identification, then subscriber equipment 1 can be created on the described unit of ignoring in the webpage according to user's selection, presents to the user; Perhaps, shield this and can ignore the unit, be not presented to the user.
2) user related information;
Particularly, comprise in first pre-defined rule:, come information unit is carried out the rule of corresponding operating according to the classification of user related information and information unit.
For example, if the user stipulates the mode with gray background in user preference is provided with, highlight highlighting the unit, the background that then sub-conversion equipment 231 will highlight the unit changes to grey;
Again for example, if the user is in surpassing a pre-determined number, can ignore the unit from non-selected presenting, the transparency that then sub-conversion equipment 231 can be ignored the unit is adjusted into 59%, to desalinate processing to ignoring the unit.
Need to prove that sub-conversion equipment 231 also can be according to first pre-defined rule, in conjunction with above-mentioned both, first info web is converted to second info web.For example, stipulate in first pre-defined rule, but the pairing executable operations in maskable unit comprises tag delete and desalination, need select an operation in conjunction with user related information, then when detecting the maskable unit, sub-conversion equipment 231 is selected shielding, deletion or desalination operation according to user related information.
What need further specify is, above-mentioned for example only for content of the present invention is described better, but not limitation of the present invention, those skilled in the art should understand that, come described each information unit is carried out corresponding operation according to first pre-defined rule, described first info web is converted to the scheme of second info web, all should be within the scope of the present invention.
Each predetermined threshold among the present invention all can be come according to the actual requirements to determine by those skilled in the art.
To those skilled in the art, obviously the invention is not restricted to the details of above-mentioned one exemplary embodiment, and under the situation that does not deviate from spirit of the present invention or essential characteristic, can realize the present invention with other concrete form.Therefore, no matter from which point, all should regard embodiment as exemplary, and be nonrestrictive, scope of the present invention is limited by claims rather than above-mentioned explanation, therefore is intended to be included in the present invention dropping on the implication that is equal to important document of claim and all changes in the scope.Any Reference numeral in the claim should be considered as limit related claim.In addition, obviously other unit or step do not got rid of in " comprising " speech, and odd number is not got rid of plural number.A plurality of unit of stating in system's claim or device also can be realized by software or hardware by a unit or device.The first, the second word such as grade is used for representing title, and does not represent any specific order.

Claims (22)

1. method that in the network equipment, is used to optimize webpage, wherein, this method may further comprise the steps:
A obtains the first pending info web;
B analyzes each information unit that described first info web is comprised, with the classification under definite described each information unit;
C in conjunction with the classification of described each information unit, is converted to described first info web second info web that is used to offer described subscriber equipment based on first pre-defined rule.
2. method according to claim 1, wherein, according to following at least one factor, determine the classification of described information unit among the described step b:
The identifier of-described information unit;
The content of text of-described information unit;
The position of-described information unit in described first info web;
The information of-the information unit relevant with described information unit.
3. method according to claim 1 and 2, wherein, described step b may further comprise the steps:
-in ATL, carry out matching inquiry according to the chained address of described first info web, to obtain corresponding classification recognition template;
-each information unit of being comprised according to first info web, and in conjunction with described classification recognition template is determined the classification under described each information unit.
4. method according to claim 3, wherein, this method is further comprising the steps of:
-according to feedback information and/or described second info web of user, determine classification recognition template to be updated or to be set up via described subscriber equipment transmission.
5. according to each described method in the claim 1 to 4, wherein, described step b is further comprising the steps of:
-analyze each information unit that described first info web is comprised, in conjunction with user related information, with the classification under definite described each information unit.
6. according to each described method in the claim 1 to 5, wherein, described step c may further comprise the steps:
-based on described first pre-defined rule,, come described each information unit is carried out corresponding operation, so that described first info web is converted to second info web in conjunction with the classification of described each information unit.
7. according to each described method in the claim 1 to 6, wherein, described first pre-defined rule comprises with reference to following at least one factor determines corresponding operation:
But-default described classification and the corresponding relation between the executable operations;
-user related information.
8. according to claim 5 or 7 described methods, wherein, described user related information comprises following at least one:
-user's personal attribute;
-user's preference setting;
-user's historical behavior;
-user's environmental information.
9. according to each described method in the claim 1 to 8, wherein, described step a is further comprising the steps of:
-obtain first request from subscriber equipment, this first request is used to user equipment requests to handle the first pending info web;
-according to described first request, obtain the described first pending info web.
10. method according to claim 9, wherein, the described step of obtaining described pending first info web may further comprise the steps:
-from described first request, extract the chained address of described pending first info web;
-according to described chained address, obtain the described first pending info web.
11. according to each described method in the claim 1 to 10, wherein, the described network equipment comprises: network host, single network server, a plurality of webserver collection or based on the set of computers of cloud computing.
12. a network equipment that is used to optimize webpage, wherein, this network equipment comprises:
Deriving means, be used to obtain the described first pending info web;
The category analysis device, be used to analyze each information unit that described first info web is comprised, to determine the classification under described each information unit;
Conversion equipment, be used for,, described first info web be converted to second info web that is used to offer described subscriber equipment in conjunction with the classification of described each information unit based on first pre-defined rule.
13. the network equipment according to claim 12, wherein, described category analysis device is determined the classification of described information unit according to following at least one factor:
The identifier of-described information unit;
The content of text of-described information unit;
The position of-described information unit in described first info web;
The information of-the information unit relevant with described information unit.
14. according to the claim 12 or the 13 described network equipments, wherein, described category analysis device comprises:
The matching inquiry device, be used for carrying out matching inquiry at ATL, to obtain corresponding classification recognition template according to the chained address of described first info web;
Determine device, be used for each information unit of being comprised according to first info web, and in conjunction with described classification recognition template, determine the classification under described each information unit.
15. the network equipment according to claim 14, wherein, this network equipment also comprises:
Updating device, be used for according to the user determining classification recognition template to be updated or to be set up via feedback information and/or described second info web that described subscriber equipment sends.
16. according to each described network equipment in the claim 12 to 15, wherein, described category analysis device also comprises:
The subclass analytical equipment, be used to analyze each information unit that described first info web is comprised, in conjunction with user related information, to determine the classification under described each information unit.
17. according to each described network equipment in the claim 12 to 16, wherein, described conversion equipment comprises:
Sub-conversion equipment, be used for,, come described each information unit is carried out corresponding operation, so that described first info web is converted to second info web in conjunction with the classification of described each information unit based on described first pre-defined rule.
18. according to each described network equipment in the claim 12 to 17, wherein, described first pre-defined rule comprises with reference to following at least one factor determines corresponding operation:
But-default described classification and the corresponding relation between the executable operations;
-user related information.
19. according to the claim 16 or the 18 described network equipments, wherein, described user related information comprises following at least one:
-user's personal attribute;
-user's preference setting;
-user's historical behavior;
-user's environmental information.
20. according to each described network equipment in the claim 12 to 19, wherein, described deriving means is further comprising the steps of:
The first sub-deriving means, be used to obtain first request from subscriber equipment, this first request is used to user equipment requests to handle the first pending info web;
The second sub-deriving means, be used for according to described first the request, obtain the described first pending info web.
21. the network equipment according to claim 20, wherein, the described second sub-deriving means comprises:
Extraction element, be used for extracting the chained address of described pending first info web from described first request;
The 3rd sub-deriving means, be used for, obtain the described first pending info web according to described chained address.
22. according to each described network equipment in the claim 12 to 21, wherein, this network equipment comprises: network host, single network server, a plurality of webserver collection or based on the set of computers of cloud computing.
CN201010569782.2A 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment Active CN102035883B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010569782.2A CN102035883B (en) 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010569782.2A CN102035883B (en) 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment

Publications (2)

Publication Number Publication Date
CN102035883A true CN102035883A (en) 2011-04-27
CN102035883B CN102035883B (en) 2015-07-01

Family

ID=43888200

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010569782.2A Active CN102035883B (en) 2010-11-26 2010-11-26 Method and device for optimizing webpage in network equipment

Country Status (1)

Country Link
CN (1) CN102035883B (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013060065A1 (en) * 2011-10-27 2013-05-02 北京百度网讯科技有限公司 Method and device for providing target information according to terminal attribute of user equipment
CN103377233A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Webpage sharing method and corresponding system
CN103425670A (en) * 2012-05-16 2013-12-04 百度在线网络技术(北京)有限公司 Method, device and equipment for providing customers with content recommendation information
CN103838728A (en) * 2012-11-21 2014-06-04 腾讯科技(深圳)有限公司 Webpage information processing method and browser
CN103942231A (en) * 2013-01-18 2014-07-23 联想(北京)有限公司 Webpage displaying method and electronic device
CN104239559A (en) * 2014-09-26 2014-12-24 北京金山安全软件有限公司 Webpage opening method and device
CN104468740A (en) * 2014-11-21 2015-03-25 网宿科技股份有限公司 Intelligent webpage transmission processing system and method
CN104615686A (en) * 2015-01-22 2015-05-13 百度在线网络技术(北京)有限公司 Searching method and device
CN104809172A (en) * 2015-04-10 2015-07-29 百度在线网络技术(北京)有限公司 Page showing method and device
CN104850595A (en) * 2015-04-27 2015-08-19 小米科技有限责任公司 Method and device for optimizing webpage opening time
CN105138698A (en) * 2015-09-25 2015-12-09 百度在线网络技术(北京)有限公司 Dynamic layout method and device for webpages
CN105677649A (en) * 2014-11-18 2016-06-15 中国移动通信集团公司 Customized webpage composing method and device
WO2016124099A1 (en) * 2015-02-03 2016-08-11 阿里巴巴集团控股有限公司 Webpage display method and device
CN105893624A (en) * 2016-04-29 2016-08-24 珠海市魅族科技有限公司 Method and system for displaying data
CN106446156A (en) * 2016-09-22 2017-02-22 宇龙计算机通信科技(深圳)有限公司 Webpage data shielding method and system
CN106844731A (en) * 2017-02-10 2017-06-13 宇龙计算机通信科技(深圳)有限公司 Advertisement shields method and system
CN111813468A (en) * 2015-04-03 2020-10-23 阿里巴巴集团控股有限公司 Method and device for shielding webpage operation and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020198720A1 (en) * 2001-04-27 2002-12-26 Hironobu Takagi System and method for information access
CN101246494A (en) * 2008-03-19 2008-08-20 腾讯科技(深圳)有限公司 Internet web page conversion method, system and equipment
CN101615193A (en) * 2009-07-07 2009-12-30 北京大学 A kind of based on the integrated inquiry system of encyclopaedia data extract
CN101702782A (en) * 2009-11-17 2010-05-05 广州杰赛科技股份有限公司 Digital television webpage monitoring server, system and method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020198720A1 (en) * 2001-04-27 2002-12-26 Hironobu Takagi System and method for information access
CN101246494A (en) * 2008-03-19 2008-08-20 腾讯科技(深圳)有限公司 Internet web page conversion method, system and equipment
CN101615193A (en) * 2009-07-07 2009-12-30 北京大学 A kind of based on the integrated inquiry system of encyclopaedia data extract
CN101702782A (en) * 2009-11-17 2010-05-05 广州杰赛科技股份有限公司 Digital television webpage monitoring server, system and method

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013060065A1 (en) * 2011-10-27 2013-05-02 北京百度网讯科技有限公司 Method and device for providing target information according to terminal attribute of user equipment
CN103377233A (en) * 2012-04-26 2013-10-30 腾讯科技(深圳)有限公司 Webpage sharing method and corresponding system
CN103425670A (en) * 2012-05-16 2013-12-04 百度在线网络技术(北京)有限公司 Method, device and equipment for providing customers with content recommendation information
CN103838728B (en) * 2012-11-21 2018-01-09 腾讯科技(深圳)有限公司 The processing method and browser of info web
CN103838728A (en) * 2012-11-21 2014-06-04 腾讯科技(深圳)有限公司 Webpage information processing method and browser
CN103942231B (en) * 2013-01-18 2019-01-15 联想(北京)有限公司 A kind of display methods and electronic equipment of webpage
CN103942231A (en) * 2013-01-18 2014-07-23 联想(北京)有限公司 Webpage displaying method and electronic device
CN104239559A (en) * 2014-09-26 2014-12-24 北京金山安全软件有限公司 Webpage opening method and device
CN105677649B (en) * 2014-11-18 2019-04-23 中国移动通信集团公司 A kind of method and device of individualized webpage typesetting
CN105677649A (en) * 2014-11-18 2016-06-15 中国移动通信集团公司 Customized webpage composing method and device
CN104468740A (en) * 2014-11-21 2015-03-25 网宿科技股份有限公司 Intelligent webpage transmission processing system and method
CN104615686A (en) * 2015-01-22 2015-05-13 百度在线网络技术(北京)有限公司 Searching method and device
CN104615686B (en) * 2015-01-22 2018-11-09 百度在线网络技术(北京)有限公司 A kind of searching method and device
WO2016124099A1 (en) * 2015-02-03 2016-08-11 阿里巴巴集团控股有限公司 Webpage display method and device
CN105989034A (en) * 2015-02-03 2016-10-05 阿里巴巴集团控股有限公司 Webpage display method and webpage display device
CN111813468A (en) * 2015-04-03 2020-10-23 阿里巴巴集团控股有限公司 Method and device for shielding webpage operation and electronic equipment
CN104809172B (en) * 2015-04-10 2019-02-12 百度在线网络技术(北京)有限公司 A kind of webpage representation method and device
CN104809172A (en) * 2015-04-10 2015-07-29 百度在线网络技术(北京)有限公司 Page showing method and device
CN104850595B (en) * 2015-04-27 2018-07-27 小米科技有限责任公司 Optimize the method and apparatus of webpage opening time
CN104850595A (en) * 2015-04-27 2015-08-19 小米科技有限责任公司 Method and device for optimizing webpage opening time
CN105138698A (en) * 2015-09-25 2015-12-09 百度在线网络技术(北京)有限公司 Dynamic layout method and device for webpages
CN105893624A (en) * 2016-04-29 2016-08-24 珠海市魅族科技有限公司 Method and system for displaying data
CN105893624B (en) * 2016-04-29 2020-01-17 珠海市魅族科技有限公司 Data display method and system
CN106446156A (en) * 2016-09-22 2017-02-22 宇龙计算机通信科技(深圳)有限公司 Webpage data shielding method and system
CN106844731A (en) * 2017-02-10 2017-06-13 宇龙计算机通信科技(深圳)有限公司 Advertisement shields method and system

Also Published As

Publication number Publication date
CN102035883B (en) 2015-07-01

Similar Documents

Publication Publication Date Title
CN102035883B (en) Method and device for optimizing webpage in network equipment
CN102306171B (en) A kind of for providing network to access suggestion and the method and apparatus of web search suggestion
CN108566399B (en) Phishing website identification method and system
CN109190049B (en) Keyword recommendation method, system, electronic device and computer readable medium
US9710440B2 (en) Presenting fixed format documents in reflowed format
US20200104353A1 (en) Personalization of content suggestions for document creation
CN101986306B (en) Method and equipment for acquiring yellow page information based on query sequence
CN103076892A (en) Method and equipment for providing input candidate items corresponding to input character string
CN102375885A (en) Method and device for providing search suggestions corresponding to query sequence
CN102737021B (en) Search engine and realization method thereof
CN102141868B (en) Method for quickly operating information interaction page, input method system and browser plug-in
US9280522B2 (en) Highlighting of document elements
CN107193987A (en) Obtain the methods, devices and systems of the search term related to the page
CN105243058A (en) Webpage content translation method and electronic apparatus
JP5724009B2 (en) Search result ranking apparatus and method using reliability of representative
CN103713894A (en) Method and equipment for determining access demand information of user
CN104991896A (en) Method and apparatus for analyzing two-dimension codes
CN102314494A (en) Method and equipment for processing webpage contents
CN104090904A (en) Method and equipment for providing target search result
CN112699295A (en) Webpage content recommendation method and device and computer readable storage medium
CN102937975A (en) Device and method for webpage search
CN103631796A (en) Website sort management method and electronic device
CN105808561A (en) Method and device for extracting abstract from webpage
CN103902164A (en) System and method for word-capturing search in browser window by clicking left mouse button
CN105447191A (en) Intelligent abstracting method for providing graphic guidance steps and corresponding device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant