CN103164423A - Method and device for confirming browser inner core type rendering web pages - Google Patents

Method and device for confirming browser inner core type rendering web pages

Info

Publication number
CN103164423A
Authority
CN
China
Prior art keywords
webpage
browser
kernel
characteristic information
equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011104138411A
Other languages
Chinese (zh)
Other versions
CN103164423B (en)
Inventor
钱毅
应蕾
连城
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201110413841.1A
Publication of CN103164423A
Application granted
Publication of CN103164423B
Legal status: Active

Abstract

The invention provides a method and a device for determining the browser kernel type used to render web pages. The method comprises the following steps: obtaining a web page to be processed; obtaining browser-related feature information of the web page from that page; and determining, from the browser-related feature information and based on a predetermined classification rule, the browser kernel type for rendering the web page. Compared with the prior art, the invention extracts feature information about the presentation, functionality, and similar aspects of web pages, and uses classification methods such as decision trees and support vector machines (SVM) to screen and classify the pages in order to determine the browser kernel type for rendering them. This reduces the cost of manual screening, allows the browser kernel type for newly emerging web pages to be determined promptly, ensures the display effect of web pages in the browser, and improves the user's browsing experience.

Description

A method and device for determining the browser kernel type used to render web pages
Technical field
The present invention relates to the field of Internet technology, and in particular to a technique for determining the browser kernel type used to render web pages.
Background art
With the development of Internet technology, a variety of browser kernels for parsing and rendering web pages have appeared, such as the Trident kernel used by the IE browser, the Gecko kernel used by the Firefox browser, and the Webkit kernel used by the Safari browser. A browser kernel determines how the browser displays the content of a web page and its formatting. Because different browser kernels parse web content differently and support it to different degrees, the same web page can look very different in browsers that render it with different kernels. At present, browsers that support two or more browser kernels render different web pages by switching kernels, so as to ensure the display effect of each page in the browser. To switch kernels automatically, the prior art screens and classifies web pages to determine the browser kernel type for rendering each page, generally by manual screening. Manual screening, however, incurs high labor costs and has a long screening cycle, which makes rapid screening of large-scale data difficult.
Therefore, how to effectively determine the browser kernel type for rendering a web page has become one of the problems that urgently need to be solved.
Summary of the invention
The purpose of the present invention is to provide a method and device for determining the browser kernel type used to render web pages.
According to one aspect of the present invention, a method for determining the browser kernel type used to render web pages is provided, the method comprising the following steps:
a. obtaining a web page to be processed;
b. obtaining, according to the web page, browser-related feature information of the web page;
c. determining, according to the browser-related feature information and based on a predetermined classification rule, the browser kernel type for rendering the web page.
According to another aspect of the present invention, a device for determining the browser kernel type used to render web pages is also provided, the device comprising:
a first web page obtaining device, configured to obtain a web page to be processed;
a feature information obtaining device, configured to obtain, according to the web page, browser-related feature information of the web page;
a type determining device, configured to determine, according to the browser-related feature information and based on a predetermined classification rule, the browser kernel type for rendering the web page.
Compared with the prior art, the present invention extracts feature information about the presentation, functionality, and similar aspects of web pages, and uses classification methods such as decision trees and support vector machines (SVM) to screen and classify the pages in order to determine the browser kernel type for rendering them. This reduces the cost of manual screening, allows the browser kernel type for newly emerging web pages to be determined promptly, and ensures the display effect of web pages in the browser, thereby improving the user's browsing experience.
Description of drawings
Other features, objects, and advantages of the present invention will become more apparent from the following detailed description of non-limiting embodiments, read with reference to the accompanying drawings:
Fig. 1 is a schematic diagram of a device for determining the browser kernel type used to render web pages according to one aspect of the present invention;
Fig. 2 is an example diagram for determining the browser kernel type used to render web pages according to a preferred embodiment of the present invention;
Fig. 3 is an example diagram for determining the browser kernel type used to render web pages according to a preferred embodiment of the present invention;
Fig. 4 is a schematic diagram of a device for determining the browser kernel type used to render web pages according to a preferred embodiment of the present invention;
Fig. 5 is a flowchart of a method for determining the browser kernel type used to render web pages according to another aspect of the present invention;
Fig. 6 is a flowchart of a method for determining the browser kernel type used to render web pages according to a preferred embodiment of the present invention.
In the drawings, the same or similar reference numerals denote the same or similar components.
Embodiment
The present invention is described in further detail below with reference to the accompanying drawings.
Fig. 1 is a schematic diagram of a device for determining the browser kernel type used to render web pages according to one aspect of the present invention. The determining device 1 comprises a first web page obtaining device 11, a feature information obtaining device 12, and a type determining device 13.
Here, the determining device 1 is a network device, including but not limited to a computer, a network host, a single network server, a set of multiple network servers, or a cloud formed by multiple servers. The cloud consists of a large number of computers or network servers based on cloud computing, where cloud computing is a form of distributed computing: a super virtual computer composed of a group of loosely coupled computers.
Here, existing browsers include, for example, the IE browser of Microsoft, the Firefox browser of Mozilla, the Chrome browser of Google, the Safari browser of Apple, the Maxthon browser of Maxthon, the Opera browser of Opera, the 360 browser of 360, the Sogou browser of Sohu.com Inc., and the Tencent TT browser of Tencent.
Here, the browser kernel types include but are not limited to:
1) the Trident kernel used by the IE browser;
2) the Presto kernel used by the Opera browser;
3) the Webkit kernel used by the Safari browser;
4) the Gecko kernel used by the Firefox browser.
Those skilled in the art will understand that the above browser kernel types are only examples; other existing or future browser kernel types, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
An embodiment of the present invention is described in detail below with reference to Fig. 1. As shown in Fig. 1, the first web page obtaining device 11 first obtains a web page to be processed. Here, the ways of obtaining the web page to be processed include but are not limited to:
1) The first web page obtaining device 11 obtains the web page to be processed from the web page repository of the determining device 1. For example, in response to an event trigger, the first web page obtaining device 11 calls, in real time, an application programming interface (API) provided by the determining device 1 and performs a matching query in the local web page repository to obtain the web page to be processed.
2) The first web page obtaining device 11 periodically reads the web page to be processed from a third-party device using an agreed communication mode. For example, the first web page obtaining device 11 sends a request for the web page to be processed to the third-party device over a network using the agreed communication mode, and receives the web page that the third-party device returns in response to the request. As another example, the third-party device actively sends the web page to be processed to the determining device 1 over a network using the agreed communication mode, and the first web page obtaining device 11 receives the page by listening in real time. The network includes but is not limited to the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, a wireless ad hoc network, and so on. Here, the first web page obtaining device 11 and the third-party device may communicate by any communication mode, including but not limited to mobile communication based on 3GPP, LTE, or WIMAX, computer network communication based on the TCP/IP or UDP protocols, and short-range wireless transmission based on the Bluetooth or infrared transmission standards.
Those skilled in the art will understand that the above ways of obtaining the web page to be processed are only examples; other existing or future ways of obtaining the web page to be processed, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, the feature information obtaining device 12 obtains browser-related feature information of the web page, for example according to the uniform resource locator (URL) of the web page obtained by the first web page obtaining device 11 or according to its corresponding markup language file.
Here, the browser-related feature information comprises at least any one of the following:
1) Browser-related web page display feature information, which includes but is not limited to:
A) Proprietary script feature information corresponding to a particular browser kernel type; for example, the proprietary JavaScript constructors "ActiveXObject()" and "VBArray()" corresponding to the Trident kernel used by the IE browser;
B) Proprietary Cascading Style Sheets (CSS) feature information corresponding to a particular browser kernel type; for example, the proprietary CSS properties corresponding to the Trident kernel used by the IE browser, such as "layout-flow" and "line-break";
C) Web document type; for example, if the document type declaration ("doctype") in the markup language file of the page indicates Standards mode (that is, strict rendering mode), which is used to render pages that follow the latest standards, the Webkit kernel is recommended for rendering the page; if it indicates Quirks mode (that is, loose rendering mode or compatibility mode), which is used to render pages designed for legacy browsers, the Trident kernel is recommended for rendering the page;
D) Web page tags; for example, the proprietary obsolete tags corresponding to the Trident kernel used by the IE browser, such as "<bgsound>", "<marquee>", and "<layer>";
E) Page layout mode; for example, if the page uses an early layout mode such as tables (the "<table>" tag), the Trident kernel used by the IE browser is recommended for rendering the page; if the page uses the "<DIV>" tag for page layout, the Webkit kernel is recommended for rendering the page;
F) Web page topic; for example, if the URL of the page contains keywords such as "store" or "shop", the topic of the page is judged likely to be e-commerce; because e-commerce pages generally adopt newer web standards, the Webkit kernel is recommended for rendering the page.
2) Browser-related web page function feature information, which includes but is not limited to:
A) The page contains a control that requires a particular browser kernel to parse it; for example, a proprietary control corresponding to the Trident kernel used by the IE browser, such as an ActiveX control;
B) The page is implemented with asynchronous JavaScript and XML (AJAX) technology;
C) The page contains Flash functionality;
D) The page contains dynamic image effects;
E) The page contains floating windows.
Those skilled in the art will understand that the above browser-related feature information is only an example; other existing or future browser-related feature information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference.
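As an illustration of two of the display features listed above (web page topic and web document type), the following is a minimal Python sketch and not the patented implementation. The function names are hypothetical. The keyword list comes from the example in the text. The doctype check is a simplification: a page with no doctype is rendered in quirks mode, but some legacy doctypes also trigger quirks mode, and this sketch does not distinguish them.

```python
import re
from urllib.parse import urlparse

# Keywords named in the text as hints of an e-commerce page.
ECOMMERCE_KEYWORDS = ("store", "shop")

# Matches a doctype declaration at the very start of the markup.
DOCTYPE_AT_START = re.compile(r"\s*<!doctype\b", re.IGNORECASE)


def url_suggests_ecommerce(url):
    """Return True if the host or path of the URL contains a keyword."""
    parsed = urlparse(url.lower())
    haystack = parsed.netloc + parsed.path
    return any(keyword in haystack for keyword in ECOMMERCE_KEYWORDS)


def declares_doctype(markup):
    """Return True if the markup starts with a doctype declaration."""
    return DOCTYPE_AT_START.match(markup) is not None


def page_hints(url, markup):
    """Collect the two hints into one feature dictionary (hypothetical keys)."""
    return {
        "ecommerce_topic": url_suggests_ecommerce(url),
        "has_doctype": declares_doctype(markup),
    }


print(page_hints("http://shop.example.com/item", "<!DOCTYPE html><html></html>"))
```

In this sketch both hints would point toward the Webkit kernel under the recommendations given in items C) and F).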
Here, the markup language file includes but is not limited to:
A) a HyperText Markup Language (HTML) file;
B) an Extensible HyperText Markup Language (XHTML) file;
C) an Extensible Markup Language (XML) file, and so on.
Here, the ways of obtaining the browser-related feature information of a web page according to its markup language file include but are not limited to:
1) The feature information obtaining device 12 parses the markup language file of the web page to be processed into a Document Object Model (DOM) tree, and then performs a matching query in each node of the DOM tree according to each predetermined browser-related feature item, to obtain the feature values of the page that correspond to those feature items. Here, the DOM tree is the tree-structured data obtained by parsing the markup language file, and each node in the tree corresponds to a tag and its content in the markup language file.
In one example, the predetermined browser-related feature item is whether the page contains the proprietary obsolete tags "<bgsound>", "<marquee>", or "<layer>" corresponding to the Trident kernel. The feature information obtaining device 12 extracts the HTML file of the web page to be processed and parses it to generate the corresponding DOM tree, as shown in Fig. 2. It then parses the content of each node of the DOM tree and traverses the nodes according to the feature item. For example, the HTML content contained in node N4 is:
<bgsound src="http://abc/music.asp" loop="-1">
That is, the feature information obtaining device 12 finds that the tag <bgsound> in this node matches the predetermined feature item, and takes the fact that the page contains an obsolete tag specific to the Trident kernel as browser-related feature information.
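The tag matching described above can be sketched in Python as follows. This is an assumption-laden illustration rather than the device's actual parser: it uses the standard library's event-based `html.parser`, which visits each element in document order instead of building an explicit DOM tree, and the names `TRIDENT_ONLY_TAGS`, `TagFeatureParser`, and `find_trident_only_tags` are hypothetical.

```python
from html.parser import HTMLParser

# Feature item from the example: obsolete tags the text treats as Trident-specific.
TRIDENT_ONLY_TAGS = {"bgsound", "marquee", "layer"}


class TagFeatureParser(HTMLParser):
    """Visits every start tag and records the watched tags that occur."""

    def __init__(self, watched_tags):
        super().__init__()
        self.watched_tags = watched_tags
        self.found = set()

    def handle_starttag(self, tag, attrs):
        # html.parser passes tag names in lower case.
        if tag in self.watched_tags:
            self.found.add(tag)


def find_trident_only_tags(html_text):
    """Return the set of watched tags present in the markup."""
    parser = TagFeatureParser(TRIDENT_ONLY_TAGS)
    parser.feed(html_text)
    parser.close()
    return parser.found


page = '<html><body><bgsound src="http://abc/music.asp" loop="-1"><p>hi</p></body></html>'
print(find_trident_only_tags(page))
```

A non-empty result would be recorded as the feature "page contains an obsolete tag specific to the Trident kernel".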
2) The feature information obtaining device 12 performs string matching between the markup language file of the web page to be processed and the predetermined browser-related feature items, to obtain the feature information corresponding to the page.
In one example, the feature information obtaining device 12 extracts the HTML file of the web page to be processed, which contains:
<html>
<head>
<title>My web page</title>
<meta http-equiv="X-UA-Compatible" content="IE" />
</head>
<body>
<p>Suitable for display in the IE7 browser</p>
</body>
</html>
If the predetermined browser-related feature item is that the page contains the Trident-specific attribute content="IE", the feature information obtaining device 12 performs string matching in the HTML file according to this feature item. When the query finds a string in the HTML file that is identical to the feature item content="IE", the device takes the fact that the page contains the Trident-specific attribute content="IE" as browser-related feature information.
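The string matching in this example can be sketched as a plain substring test. The following Python fragment is illustrative only; the dictionary `FEATURE_ITEMS`, its key name, and the function `match_feature_items` are hypothetical, and only the search string itself comes from the example.

```python
# Hypothetical table of feature items: feature name -> string to search for.
FEATURE_ITEMS = {
    "trident_compat_meta": 'content="IE"',
}


def match_feature_items(markup, feature_items=FEATURE_ITEMS):
    """Return, for each feature item, whether its string occurs in the markup."""
    return {name: needle in markup for name, needle in feature_items.items()}


html_file = """<html>
<head>
<title>My web page</title>
<meta http-equiv="X-UA-Compatible" content="IE" />
</head>
<body>
<p>Suitable for display in the IE7 browser</p>
</body>
</html>"""

print(match_feature_items(html_file))
```

An exact substring test is sensitive to quoting and spacing, so a page that writes the attribute with single quotes would not match this particular feature item.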
3) According to the executable script of the web page, string matching is performed between the script and the predetermined browser-related feature items, to obtain the feature information corresponding to the page. Here, the executable script includes but is not limited to JavaScript, VBScript, ActionScript, and so on.
In one example, the feature information obtaining device 12 extracts the JavaScript executable script from the web page to be processed, where the script contains:
<SCRIPT LANGUAGE="JScript">
var objMyData = new ActiveXObject('this.object');
</SCRIPT>
If the predetermined browser-related feature item is that the page contains the Trident-specific JavaScript object "ActiveXObject", the feature information obtaining device 12 performs string matching in the executable script according to this feature item. When the match finds the string "ActiveXObject", which is identical to the feature item, the device takes the fact that the page contains a Trident-specific JavaScript object as browser-related feature information.
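A minimal Python sketch of this script-level matching follows. It assumes the scripts are inline and extracts them with a regular expression, which is a simplification (external script files and unusual markup are not handled). The names `SCRIPT_BODY`, `TRIDENT_SCRIPT_OBJECTS`, and `script_features` are hypothetical; the two object names are the ones the text cites as Trident-specific.

```python
import re

# Captures the body of each inline <script> element, case-insensitively.
SCRIPT_BODY = re.compile(r"<script\b[^>]*>(.*?)</script\s*>", re.IGNORECASE | re.DOTALL)

# Script objects the text names as specific to the Trident kernel.
TRIDENT_SCRIPT_OBJECTS = ("ActiveXObject", "VBArray")


def script_features(html_text):
    """Return the watched object names that occur in any inline script."""
    scripts = SCRIPT_BODY.findall(html_text)
    return [
        name
        for name in TRIDENT_SCRIPT_OBJECTS
        if any(name in body for body in scripts)
    ]


page = """<SCRIPT LANGUAGE="JScript">
var objMyData = new ActiveXObject('this.object');
</SCRIPT>"""

print(script_features(page))
```

Restricting the search to script bodies avoids counting the same word when it only appears in visible page text.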
Those skilled in the art will understand that the above ways of obtaining browser-related feature information are only examples; other existing or future ways of obtaining browser-related feature information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Subsequently, the type determining device 13 determines the browser kernel type for rendering the web page according to the browser-related feature information obtained by the feature information obtaining device 12 and based on a predetermined classification rule.
Specifically, the type determining device 13 classifies the page according to the obtained browser-related feature information, using classification methods such as decision tree classification or support vector machine (SVM) classification, to determine the browser kernel type suitable for rendering the page.
Here, the predetermined classification rule includes but is not limited to:
1) Decision tree classification. A decision tree applies the principles of probability theory and uses a tree diagram as an analysis tool. Its basic principle is to represent decision problems with decision points, alternative options with option branches, and the possible outcomes of each option with probability branches; by calculating and comparing the gains and losses of each option under each outcome, it gives the decision maker a basis for the decision.
In one example, web pages to be processed are classified only between two browser kernel types, the Trident kernel and the Webkit kernel. As shown in Fig. 3, the type determining device 13 classifies a page in the decision tree according to its feature information. First, a decision is made at decision point N1, whose decision problem is whether the page contains proprietary obsolete tags used only by the Trident kernel, such as "<bgsound>", "<marquee>", or "<layer>". If the answer is "yes", the page is determined to be rendered with the Trident kernel; otherwise, the feature information not yet used for classification is marked as first intermediate data. Next, a decision is made at decision point N2 according to the first intermediate data; the decision problem of N2 is whether the page contains a proprietary control corresponding to the Trident kernel used by the IE browser, such as an ActiveX control. If the answer is "yes", the page is determined to be rendered with the Trident kernel; otherwise, the feature information in the first intermediate data not yet used for classification is marked as second intermediate data. Finally, a decision is made at decision point N3 according to the second intermediate data; the decision problem of N3 is whether the page uses tables (the <table> tag) for page layout. If the answer is "yes", the page is determined to be rendered with the Trident kernel; otherwise, the page is determined to be rendered with the Webkit kernel.
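The three decision points of this example can be written as a chain of checks. The Python sketch below mirrors N1, N2, and N3 as described; the function name and the feature dictionary keys are hypothetical, and the tree is hard-coded here rather than learned from data.

```python
def classify_with_decision_tree(features):
    """Walk decision points N1 to N3 and return the recommended kernel.

    `features` maps hypothetical feature names to booleans; a missing key
    is treated as False.
    """
    # N1: does the page contain obsolete tags used only by the Trident kernel?
    if features.get("has_trident_only_tags", False):
        return "Trident"
    # N2: does the page contain a Trident-specific control such as ActiveX?
    if features.get("has_activex_control", False):
        return "Trident"
    # N3: does the page use <table> for page layout?
    if features.get("uses_table_layout", False):
        return "Trident"
    # No Trident indicator was found at any decision point.
    return "Webkit"


print(classify_with_decision_tree({"uses_table_layout": True}))
print(classify_with_decision_tree({"uses_div_layout": True}))
```

The "intermediate data" in the text corresponds to the features that remain unexamined after each decision point; in this sketch that is implicit in the order of the checks.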
2) Support vector machine (SVM) classification. A support vector machine is built on the VC-dimension theory of statistical learning theory and on the principle of structural risk minimization. It avoids local minima, so the extremum it finds is guaranteed to be the global optimum, and it has good classification accuracy.
In one example, web pages to be processed are classified only between two browser kernel types, the Trident kernel and the Webkit kernel. The type determining device 13 extracts the feature value corresponding to each feature item from the browser-related feature information of the page and combines these feature values into the feature vector of the page. It then uses the feature vector as the input to a preset SVM-based classification model and performs a weighted calculation, to determine the recommendation weight V1 for rendering the page with the Trident kernel and the recommendation weight V2 for rendering the page with the Webkit kernel. If V1 > V2, the page is determined to be rendered with the Trident kernel; otherwise, the page is determined to be rendered with the Webkit kernel.
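The flow of this example (feature values, feature vector, weighted calculation, comparison of V1 and V2) can be sketched as follows. This is explicitly not a trained SVM: it is a linear scoring stand-in, and the feature names and every weight in it are invented for illustration. In practice the weights would come from a model trained on labelled pages.

```python
# Hypothetical feature order used to build the feature vector.
FEATURE_ORDER = (
    "has_trident_only_tags",
    "has_activex_control",
    "uses_table_layout",
    "uses_div_layout",
    "standards_doctype",
)

# Invented weights, one per feature, for each kernel (assumption, not source data).
TRIDENT_WEIGHTS = (3.0, 3.0, 1.0, 0.0, 0.0)
WEBKIT_WEIGHTS = (0.0, 0.0, 0.0, 1.0, 2.0)


def to_vector(features):
    """Turn a dictionary of boolean features into a 0/1 feature vector."""
    return [1.0 if features.get(name, False) else 0.0 for name in FEATURE_ORDER]


def recommend(features):
    """Return (kernel, V1, V2), choosing Trident only when V1 > V2."""
    x = to_vector(features)
    v1 = sum(w * xi for w, xi in zip(TRIDENT_WEIGHTS, x))
    v2 = sum(w * xi for w, xi in zip(WEBKIT_WEIGHTS, x))
    kernel = "Trident" if v1 > v2 else "Webkit"
    return kernel, v1, v2


print(recommend({"has_activex_control": True, "standards_doctype": True}))
```

As in the text, a tie between V1 and V2 falls to the Webkit kernel because the Trident kernel is chosen only when V1 is strictly greater.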
Those skilled in the art will understand that the above predetermined classification rules are only examples; other existing or future predetermined classification rules, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Those skilled in the art will also understand that the above ways of determining the browser kernel type for rendering a web page are only examples; other existing or future ways of determining the browser kernel type for rendering a web page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Preferably, the first web page obtaining device 11, the feature information obtaining device 12, and the type determining device 13 of the determining device 1 work continuously. Specifically, the first web page obtaining device 11 continuously obtains web pages to be processed; the feature information obtaining device 12 continuously obtains the browser-related feature information of those pages; and the type determining device 13 continuously determines, according to the browser-related feature information and based on the predetermined classification rule, the browser kernel type for rendering those pages. Those skilled in the art will understand that "continuously" means that each device keeps performing, respectively, the obtaining of web pages to be processed, the obtaining of browser-related feature information, and the determining of the browser kernel type, until a predetermined stop condition is satisfied, for example until the first web page obtaining device 11 has not obtained any web page to be processed for a long period.
Preferably (with reference to Fig. 1), the type determining device 13 also determines the browser kernel type for rendering the web page by weighting, according to the browser-related feature information obtained by the feature information obtaining device 12 in combination with the historical rendering records of the page.
Specifically, the type determining device 13 classifies the page based on the predetermined classification rule according to its feature information, to determine the recommended browser kernel types for rendering the page and their corresponding recommendation weights. At the same time, according to the identification information of the page, it extracts all or a predetermined number of the page's historical rendering records from the web page browsing record store and performs statistical analysis on them; from the result of this analysis it determines the browser kernel types that have been used to render the page in the past and their cumulative usage counts. Then, according to a predetermined weighting rule, the type determining device 13 performs a weighted calculation on these two kinds of reference information to determine the browser kernel type for rendering the page.
Here, the web page browsing record store includes but is not limited to the identification information of web pages, such as the URL or web page ID, together with the corresponding historical browsing times, historical rendering records, and so on. The web page browsing record store may be, but is not limited to, a relational database, a key-value storage system, or a file system. It may be stored in the determining device 1 or in a third-party device.
In one example, web pages to be processed are classified only between two browser kernel types, the Trident kernel and the Webkit kernel. First, the type determining device 13 classifies the page according to its feature information and determines that the recommendation weight for rendering the page with the Trident kernel is 6 and the recommendation weight for rendering it with the Webkit kernel is 4. At the same time, according to the URL of the page, the type determining device 13 extracts the 10 most recent historical rendering records of the page from the web page browsing record store and analyzes them statistically, finding that users have rendered the page with the Trident kernel 2 times and with the Webkit kernel 8 times. According to the predetermined weighting rule, the weight given to the classification recommendation in the calculation is 0.6 and the weight given to the historical rendering records is 0.4. The weighted calculation therefore yields a recommendation weight of 4.4 (= 6 × 0.6 + 2 × 0.4) for the Trident kernel and 5.6 (= 4 × 0.6 + 8 × 0.4) for the Webkit kernel, and the type determining device 13 determines that the browser kernel type for rendering the page is the Webkit kernel.
It should be noted that the numerical values in the above example are illustrative only, to help the reader understand the present invention; they are not real data from practical application and should not be regarded as limiting the scope of protection of the present application. Unless otherwise stated, numerical values appearing elsewhere herein serve the same purpose, which for brevity is not repeated.
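The weighted calculation in the example above can be reproduced with a few lines of Python. The function name `combine` and its parameter names are hypothetical; the input numbers and the weights 0.6 and 0.4 are the illustrative values from the example, not real data.

```python
def combine(classifier_weights, history_counts, w_classifier=0.6, w_history=0.4):
    """Blend classifier recommendation weights with historical usage counts.

    Returns the kernel with the highest blended score and all blended scores.
    """
    kernels = sorted(set(classifier_weights) | set(history_counts))
    scores = {
        kernel: classifier_weights.get(kernel, 0) * w_classifier
        + history_counts.get(kernel, 0) * w_history
        for kernel in kernels
    }
    best = max(kernels, key=lambda kernel: scores[kernel])
    return best, scores


best, scores = combine(
    classifier_weights={"Trident": 6, "Webkit": 4},
    history_counts={"Trident": 2, "Webkit": 8},
)
print(best, {kernel: round(score, 2) for kernel, score in scores.items()})
```

The scores are rounded only for display, since binary floating point does not represent products such as 6 × 0.6 exactly. The sketch adds the classifier weight and the raw history count directly, as the example does; a real system might normalize the two scales first.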
Those skilled in the art will understand that the above ways of determining the browser kernel type for rendering a web page are only examples; other existing or future ways of determining the browser kernel type for rendering a web page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
More preferably (with reference to Fig. 1), the determining device 1 further comprises a providing device (not shown). The providing device provides the browser kernel type information determined by the type determining device 13 for rendering the web page to the browser of a user device, for use in rendering the page.
Here, the user device may be any electronic product capable of human-computer interaction with a user by means of a keyboard, mouse, remote control, touch pad, handwriting device, or the like, such as a computer, smartphone, PDA, or IPTV.
Specifically, the providing device sends the determined browser kernel type information for rendering the web page to the user device, in real time or periodically, using an agreed communication mode; the information is then written into the rendering information store of the browser. Here, the rendering information store includes but is not limited to the identification information of web pages, such as the URL or web page ID, together with the corresponding browser kernel information for rendering each page. The rendering information store may be, but is not limited to, a relational database, a key-value storage system, or a file system.
Here, the manner of providing includes, but is not limited to:
1) using the determined browser kernel type information for rendering each web page to completely overwrite the existing records in the rendering information store of the user device;
2) applying the determined browser kernel type information for rendering web pages to the records in the rendering information store of the user device as a differential overwrite; that is, browser kernel type information for a web page not yet stored in the browser of the user device is inserted into the rendering information store, and browser kernel type information for a web page that is already stored but has changed overwrites the stored entry, thereby updating the browser kernel record for that page.
In one example, the providing device sends the determined browser kernel type information for rendering each web page to the user device in real time via an agreed communication method. The user device receives this information by real-time listening and extracts the URL of each web page from it, then performs a matching query in the rendering information store by URL to find the web pages that do not yet exist in the store, and writes the browser kernel type information for those pages into the store. At the same time, for web pages that already exist in the store but whose browser kernel type has changed, the new browser kernel type information overwrites the stored entry. Based on this information, the browser can then switch to the corresponding browser kernel for different web pages.
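The differential overwrite described in this example can be sketched as follows. This is a minimal illustration only: the rendering information store is assumed to be an in-memory dictionary keyed by URL, and the function name `apply_differential_update` and the example URLs are hypothetical, not taken from the patent.

```python
from typing import Dict, Tuple


def apply_differential_update(
    store: Dict[str, str], received: Dict[str, str]
) -> Tuple[int, int]:
    """Apply received {url: kernel_type} records to the store in place.

    URLs missing from the store are inserted; URLs whose stored kernel
    type differs are overwritten. Returns (inserted, updated) counts.
    """
    inserted = 0
    updated = 0
    for url, kernel in received.items():
        if url not in store:
            store[url] = kernel      # page not yet known: insert
            inserted += 1
        elif store[url] != kernel:
            store[url] = kernel      # kernel type changed: overwrite
            updated += 1
        # identical records are left untouched
    return inserted, updated


# Hypothetical data for illustration.
store = {"http://a.example/": "Trident", "http://b.example/": "Webkit"}
received = {"http://b.example/": "Trident", "http://c.example/": "Webkit"}
counts = apply_differential_update(store, received)
print(counts, store)
```

A full overwrite, by contrast, would simply replace the whole store with the received records.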
Those skilled in the art will understand that the above manner of providing browser kernel type information is only an example; other existing or future manners of providing browser kernel type information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Fig. 4 shows a schematic diagram of a device for determining the browser kernel type for rendering a web page according to a preferred embodiment of the present invention. Here, the determining device 1 further comprises a preferred-page acquisition device 14'.
Here, the functions of devices 11' and 13' shown in Fig. 4 are the same as those of devices 11 and 13 described above with reference to Fig. 1; for brevity, they are incorporated herein by reference and are not described again.
Specifically, the preferred-page acquisition device 14' obtains preferred web pages, according to a predetermined screening rule, from the pending web pages obtained by the first page acquisition device 11'; the feature information acquisition device 12' then obtains, from these preferred web pages, the browser-related feature information of the preferred web pages.
Here, the predetermined screening rule includes, but is not limited to:
1) taking as preferred web pages the web pages whose cumulative browse count exceeds a cumulative count threshold;
2) taking as preferred web pages a first predetermined number of web pages with the highest cumulative browse counts;
3) taking as preferred web pages the web pages whose browsing frequency exceeds a frequency threshold;
4) taking as preferred web pages a second predetermined number of web pages with the highest browsing frequency.
Those skilled in the art will understand that each of the above predetermined screening rules may be used on its own by the preferred-page acquisition device 14' to obtain preferred web pages, or several of them may be combined for that purpose.
Those skilled in the art will also understand that the above predetermined screening rules are only examples; other existing or future predetermined screening rules, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
In one example, the first page acquisition device 11' obtains pending web pages. The preferred-page acquisition device 14' then extracts the URLs of these pending web pages, performs a matching query in the web page browsing record store to obtain their cumulative browse counts, and takes the web pages whose cumulative browse count exceeds the cumulative count threshold of 2000 as preferred web pages. The feature information acquisition device 12' then obtains, from these preferred web pages, the browser-related feature information of the preferred web pages.
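The threshold rule in this example can be sketched as follows. This is a minimal illustration only: the web page browsing record store is assumed to be a dictionary mapping URL to cumulative browse count, and the name `select_preferred_pages` is hypothetical.

```python
from typing import Dict, Iterable, List


def select_preferred_pages(
    pending_urls: Iterable[str],
    browse_counts: Dict[str, int],
    threshold: int = 2000,
) -> List[str]:
    """Keep the pending URLs whose cumulative browse count exceeds the threshold.

    URLs with no record in browse_counts are treated as never browsed.
    """
    return [url for url in pending_urls if browse_counts.get(url, 0) > threshold]


# Hypothetical data for illustration.
counts = {"http://a.example/": 2500, "http://b.example/": 2000}
pending = ["http://a.example/", "http://b.example/", "http://c.example/"]
print(select_preferred_pages(pending, counts))
```

Because the rule says "exceeds", a page with exactly 2000 views is not selected in this sketch.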
Those skilled in the art will understand that the above manners of obtaining preferred web pages and/or of obtaining browser-related feature information are only examples; other existing or future manners of obtaining preferred web pages and/or of obtaining browser-related feature information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Fig. 5 shows a flow chart of a method for determining the browser kernel type for rendering a web page according to one aspect of the present invention.
Here, the determining device 1 is a network device, including but not limited to a computer, a network host, a single network server, a cluster of multiple network servers, or a cloud made up of multiple servers. Here, a cloud consists of a large number of computers or network servers based on cloud computing, where cloud computing is a form of distributed computing: a super virtual computer made up of a group of loosely coupled computers.
Here, existing browsers include, for example, the IE browser of Microsoft, the Firefox browser of Mozilla, the Chrome browser of Google, the Safari browser of Apple, the Maxthon browser of Maxthon, the Opera browser of Opera, the 360 browser of 360, the Sogou browser of Sohu, and the TT browser of Tencent.
Here, the browser kernel type includes, but is not limited to:
1) the Trident kernel used by the IE browser;
2) the Presto kernel used by the Opera browser;
3) the Webkit kernel used by the Safari browser;
4) the Gecko kernel used by the Firefox browser.
Those skilled in the art will understand that the above browser kernel types are only examples; other existing or future browser kernel types, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
An embodiment of the present invention is described in detail below with reference to Fig. 5. As shown in Fig. 5, in step S1, the determining device 1 obtains pending web pages. Here, the manner of obtaining pending web pages includes, but is not limited to:
1) in step S1, the determining device 1 obtains pending web pages from its own web page repository; for example, in step S1, in response to an event trigger, the determining device 1 uses in real time an application programming interface (API) that it provides to perform a matching query in its local web page repository and so obtain the pending web pages.
2) in step S1, the determining device 1 periodically reads pending web pages from a third-party device via an agreed communication method; for example, in step S1, the determining device 1 sends a request for pending web pages to the third-party device over a network via the agreed communication method, and receives the pending web pages that the third-party device returns in response to the request. As another example, the third-party device actively sends pending web pages to the determining device 1 over the network via the agreed communication method, and in step S1 the determining device 1 receives these web pages by real-time listening. Here, the network includes, but is not limited to, the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, a wireless ad hoc network and the like. Here, the determining device 1 and the third-party device may communicate by any communication method, including but not limited to mobile communication based on 3GPP, LTE or WiMAX, computer network communication based on TCP/IP or UDP, and short-range wireless transmission based on Bluetooth or infrared transmission standards.
Those skilled in the art will understand that the above manners of obtaining pending web pages are only examples; other existing or future manners of obtaining pending web pages, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Then, in step S2, the determining device 1 obtains the browser-related feature information of the pending web pages it obtained in step S1, for example according to their Uniform Resource Locators (URLs) or their corresponding markup language files.
Here, the browser-related feature information comprises at least any one of the following:
1) browser-related web page display feature information, which includes, but is not limited to:
A) proprietary script feature information corresponding to a particular browser kernel type; for example, the proprietary JavaScript constructors "ActiveXObject()" and "VBArray()" corresponding to the Trident kernel used by the IE browser;
B) proprietary Cascading Style Sheets (CSS) feature information corresponding to a particular browser kernel type; for example, the proprietary CSS properties corresponding to the Trident kernel used by the IE browser, such as "layout-flow" and "line-break";
C) web document type; for example, if the document declaration "doctype" in the markup language file of the web page indicates standards mode (i.e. strict rendering mode), which is used to present web pages that follow the latest standards, the Webkit kernel is recommended for rendering the page; if it indicates quirks mode (i.e. loose rendering mode or compatibility mode), which is used to present web pages designed for legacy browsers, the Trident kernel is recommended for rendering the page;
D) web page tags; for example, the proprietary outdated tags corresponding to the Trident kernel used by the IE browser, such as "<bgsound>", "<marquee>" and "<layer>";
E) page layout method; for example, if the web page uses an early page layout method such as tables (tag "<table>"), the Trident kernel used by the IE browser is recommended for rendering the page; if the web page uses the tag "<DIV>" for page layout, the Webkit kernel is recommended for rendering the page;
F) web page topic; for example, if the Uniform Resource Locator (URL) of the web page contains keywords such as "store" or "shop", the topic of the page is judged likely to be e-commerce, and since e-commerce pages generally adopt newer web standards, the Webkit kernel is recommended for rendering the page.
2) browser-related web page functional feature information, which includes, but is not limited to:
A) the web page contains controls that require a particular browser kernel to parse; for example, proprietary controls corresponding to the Trident kernel used by the IE browser, such as ActiveX controls;
B) the web page is implemented using Asynchronous JavaScript and XML (AJAX) technology;
C) the web page contains Flash functionality;
D) the web page contains dynamic image effects;
E) the web page contains floating windows.
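Two of the display features listed above, the document type (item C) and the web page topic inferred from URL keywords (item F), can be sketched as follows. This is a deliberately simplified illustration: it only checks whether a doctype declaration is present (browsers render pages without one in quirks mode, although real mode selection also depends on which doctype is declared), and the keyword check is a naive substring match. All function names are hypothetical.

```python
import re
from typing import Dict
from urllib.parse import urlparse

# Keywords taken from the example in item F).
ECOMMERCE_KEYWORDS = ("store", "shop")


def has_doctype(html: str) -> bool:
    """Return True if the document starts with a doctype declaration."""
    return re.match(r"\s*<!doctype\s", html, flags=re.IGNORECASE) is not None


def looks_like_ecommerce(url: str) -> bool:
    """Naive topic hint: does the host or path contain an e-commerce keyword?"""
    parsed = urlparse(url.lower())
    text = parsed.netloc + parsed.path
    return any(keyword in text for keyword in ECOMMERCE_KEYWORDS)


def display_features(url: str, html: str) -> Dict[str, bool]:
    """Collect the two hints as named feature values."""
    return {
        "standards_mode_hint": has_doctype(html),
        "ecommerce_hint": looks_like_ecommerce(url),
    }


print(display_features("http://shop.example.com/item", "<!DOCTYPE html><html></html>"))
```

A substring match will also fire on unrelated words such as "workshop", so a practical feature extractor would need a stricter rule.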
Those skilled in the art will understand that the above browser-related feature information is only an example; other existing or future browser-related feature information, if applicable to the present invention, should also fall within the scope of protection of the present invention and is incorporated herein by reference.
Here, the markup language file includes, but is not limited to:
A) a HyperText Markup Language (HTML) file;
B) an Extensible HyperText Markup Language (XHTML) file;
C) an Extensible Markup Language (XML) file, etc.
Here, the manner of obtaining the browser-related feature information of a web page (or web pages) according to its markup language file includes, but is not limited to:
1) in step S2, the determining device 1 parses the markup language file of the pending web page into a Document Object Model (DOM) tree, then performs a matching query over the nodes of the DOM tree for each predetermined browser-related feature item, to obtain the feature values of the web page that correspond to those features. Here, the DOM tree is the tree-structured data obtained by parsing the markup language file; each node in the tree corresponds to a tag and its content in the markup language file.
In one example, the predetermined browser-related feature item is whether the web page contains the proprietary outdated tags "<bgsound>", "<marquee>" or "<layer>" corresponding to the Trident kernel. In step S2, the determining device 1 extracts the HTML file of the pending web page, parses it and generates the corresponding DOM tree, as shown in Fig. 2. It then parses the content of each node of the DOM tree and traverses the nodes looking for the feature item. For example, the HTML content of node N4 is:
<bgsound src="http://abc/music.asp" loop="-1">
That is, in step S2, the determining device 1 finds that the tag <bgsound> in this node matches the predetermined feature item, and it takes the fact that the web page contains an outdated tag specific to the Trident kernel as browser-related feature information.
2) in step S2, the determining device 1 performs string matching between the markup language file of the pending web page and the predetermined browser-related feature items, to obtain the feature information corresponding to the web page.
In one example, in step S2, the determining device 1 extracts the HTML file of the pending web page, which contains:
<html>
<head>
<title>My web page</title>
<meta http-equiv="X-UA-Compatible" content="IE" />
</head>
<body>
<p>Suitable for display in the IE7 browser</p>
</body>
</html>
If the predetermined browser-related feature item is that the web page contains the Trident-specific attribute content="IE", the determining device 1 performs string matching for this feature item in the HTML file. When the query finds a string in the HTML file that is identical to the feature item content="IE", the fact that the web page contains the Trident-specific attribute content="IE" is taken as browser-related feature information.
3) performing string matching between the executable scripts of the web page and the predetermined browser-related feature items, to obtain the feature information corresponding to the web page. Here, the executable scripts include, but are not limited to, JavaScript, VBScript, ActionScript and the like.
In one example, in step S2, the determining device 1 extracts the JavaScript script from the pending web page, where the script contains:
<SCRIPT LANGUAGE="JScript">
var objMyData = new ActiveXObject('this.object');
</SCRIPT>
If the predetermined browser-related feature item is that the web page contains the Trident-specific JavaScript script object "ActiveXObject", the determining device 1 performs string matching for this feature item in the script. When a string "ActiveXObject" identical to the feature item is matched, the fact that the web page contains a Trident-specific JavaScript script object is taken as browser-related feature information.
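The tag-based and string-based matching described in manners 1) to 3) above can be sketched together as follows. This is an illustration under stated assumptions: instead of building a full DOM tree, it uses the event-based `HTMLParser` from the Python standard library to collect tag names, and the feature items are the ones used in the examples above. The names `TagCollector` and `extract_trident_features` are hypothetical.

```python
from html.parser import HTMLParser
from typing import Dict, List

# Feature items taken from the examples in the text.
TRIDENT_TAGS = {"bgsound", "marquee", "layer"}
TRIDENT_STRINGS = ("ActiveXObject", 'content="IE"')


class TagCollector(HTMLParser):
    """Records the name of every start tag encountered while parsing."""

    def __init__(self) -> None:
        super().__init__()
        self.tags = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag names already converted to lower case.
        self.tags.add(tag)


def extract_trident_features(html: str) -> Dict[str, List[str]]:
    """Return the Trident-specific tags and literal strings found in the page."""
    collector = TagCollector()
    collector.feed(html)
    collector.close()
    return {
        "outdated_tags": sorted(collector.tags & TRIDENT_TAGS),
        "matched_strings": [s for s in TRIDENT_STRINGS if s in html],
    }


page = '<html><body><bgsound src="http://abc/music.asp" loop="-1"></body></html>'
print(extract_trident_features(page))
```

Tag matching goes through the parser so that tag-like text inside attribute values or scripts is not counted, while the string matching is a plain substring test on the raw file, as in manners 2) and 3).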
Those skilled in the art will understand that the above manners of obtaining browser-related feature information are only examples; other existing or future manners of obtaining browser-related feature information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Subsequently, in step S3, the determining device 1 determines the browser kernel type for rendering the web page, based on a predetermined classification rule, according to the browser-related feature information it obtained in step S2.
Specifically, in step S3, the determining device 1 classifies the web page according to the obtained browser-related feature information, using a classification method such as decision tree classification or support vector machine (SVM) classification, to determine the browser kernel type suitable for rendering the web page.
Here, the predetermined classification rule includes, but is not limited to:
1) decision tree classification; here, a decision tree applies principles of probability theory and uses a tree diagram as its analysis tool. Its basic principle is to represent decision problems by decision points, alternative options by option branches, and the possible outcomes of each option by probability branches; by calculating and comparing the gains and losses of each option under each outcome, it provides the decision maker with a basis for the decision.
In one example, pending web pages are classified only between two browser kernel types, the Trident kernel and the Webkit kernel. As shown in Fig. 3, in step S3, the determining device 1 classifies a web page in the decision tree according to its feature information. First, a decision is made at decision point N1 of the decision tree, whose decision problem is whether the web page contains proprietary outdated tags used only by the Trident kernel, such as "<bgsound>", "<marquee>" or "<layer>". If the answer is "yes", the web page is determined to be rendered with the Trident kernel; otherwise, the feature information not yet used for classification is marked as first intermediate data. Next, a decision is made on the first intermediate data at decision point N2, whose decision problem is whether the web page contains proprietary controls corresponding to the Trident kernel used by the IE browser, such as ActiveX controls. If the answer is "yes", the web page is determined to be rendered with the Trident kernel; otherwise, the feature information in the first intermediate data not yet used for classification is marked as second intermediate data. Finally, a decision is made on the second intermediate data at decision point N3, whose decision problem is whether the web page uses tables (tag "<table>") for page layout. If the answer is "yes", the web page is determined to be rendered with the Trident kernel; otherwise it is determined to be rendered with the Webkit kernel.
2) support vector machine (SVM) classification; here, a support vector machine is founded on the VC dimension theory of statistical learning theory and on the principle of structural risk minimization. It avoids local minima, so that the extremum found is guaranteed to be the global optimum, and it offers good classification accuracy.
In one example, pending web pages are classified only between two browser kernel types, the Trident kernel and the Webkit kernel. In step S3, the determining device 1 extracts, from the browser-related feature information of the web page, the feature value corresponding to each feature item, combines these feature values into a feature vector for the web page, and feeds the vector as an input parameter into a preset SVM-based classification model for a weighted calculation, to determine the recommendation weight V1 for rendering the page with the Trident kernel and the recommendation weight V2 for rendering it with the Webkit kernel. If V1 > V2, the web page is determined to be rendered with the Trident kernel; otherwise, it is determined to be rendered with the Webkit kernel.
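The decision-tree example of Fig. 3 (decision points N1 to N3) can be sketched as a chain of checks. This is a minimal illustration: the feature information is assumed to have already been reduced to boolean feature values, and the key names and the function name `classify_by_decision_tree` are hypothetical.

```python
from typing import Mapping


def classify_by_decision_tree(features: Mapping[str, bool]) -> str:
    """Walk decision points N1 -> N2 -> N3 and return the kernel type.

    A missing feature is treated as False.
    """
    # N1: does the page contain outdated tags used only by Trident?
    if features.get("has_trident_only_tags", False):
        return "Trident"
    # N2: does the page contain Trident-specific controls such as ActiveX?
    if features.get("has_activex_control", False):
        return "Trident"
    # N3: does the page use <table> for page layout?
    if features.get("uses_table_layout", False):
        return "Trident"
    # No Trident indicator matched.
    return "Webkit"


print(classify_by_decision_tree({"uses_table_layout": True}))
print(classify_by_decision_tree({}))
```

The order of the checks follows the example; because every "yes" branch here leads to the same result, reordering them would not change the outcome in this two-class case.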
Those skilled in the art will understand that the above predetermined classification rules are only examples; other existing or future predetermined classification rules, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Those skilled in the art will also understand that the above manner of determining the browser kernel type for rendering a web page is only an example; other existing or future manners of determining the browser kernel type for rendering a web page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Preferably, the determining device 1 operates continuously in step S1, step S2 and step S3. Specifically, in step S1, the determining device 1 continuously obtains pending web pages; in step S2, the determining device 1 continuously obtains the browser-related feature information of those web pages; in step S3, the determining device 1 continuously determines, based on the predetermined classification rule and according to the browser-related feature information, the browser kernel type for rendering those web pages. Here, those skilled in the art will understand that "continuously" means that the determining device 1 keeps performing the obtaining of pending web pages, the obtaining of browser-related feature information and the determining of the browser kernel type in the respective steps until a predetermined stop condition is met, for example until the determining device 1 has not obtained any pending web page for a long time.
Preferably (with reference to Fig. 5), in step S3, the determining device 1 further determines the browser kernel type for rendering the web page by weighting, according to the browser-related feature information it obtained in step S2 in combination with the historical rendering records of the web page.
Specifically, in step S3, the determining device 1 classifies the web page based on the predetermined classification rule according to its feature information, to determine the recommended browser kernel types for rendering the page and their corresponding recommendation weights. At the same time, according to the identification information of the web page, it extracts all or a predetermined number of the historical rendering records of the page from the web page browsing record store and performs statistical analysis on them; from the result of this analysis, the determining device 1 determines the browser kernel types historically used to render the page and their corresponding cumulative usage counts. Then, according to a predetermined weighting rule, the determining device 1 performs a weighted calculation over these two kinds of reference information to determine the browser kernel type for rendering the web page.
Here, the web page browsing record store includes, but is not limited to, identification information of each web page, such as its URL or page ID, and the corresponding historical browsing times, historical rendering records and the like. The web page browsing record store may be, without limitation, a relational database, a key-value storage system, a file system or the like. Here, the web page browsing record store may be stored in the determining device 1 or in a third-party device.
In one example, pending web pages are classified only between two browser kernel types, the Trident kernel and the Webkit kernel. First, in step S3, the determining device 1 classifies the web page according to its feature information and determines that the recommendation weight for rendering the page with the Trident kernel is 6 and that for rendering it with the Webkit kernel is 4. At the same time, according to the URL of the web page, the determining device 1 extracts the 10 most recent historical rendering records of the page from the web page browsing record store and performs statistical analysis on them, finding that users have historically rendered the page 2 times with the Trident kernel and 8 times with the Webkit kernel. According to the predetermined weighting rule, in the calculation that determines the browser kernel type for rendering the page, the decision weight of the classification recommendation is 0.6 and the decision weight of the historical rendering records is 0.4. The weighted calculation therefore gives a recommendation weight of 4.4 (= 6 × 0.6 + 2 × 0.4) for the Trident kernel and 5.6 (= 4 × 0.6 + 8 × 0.4) for the Webkit kernel, so the determining device 1 determines that the browser kernel type for rendering this web page is the Webkit kernel.
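The weighted calculation in this example can be reproduced with a short sketch. The numbers are the illustrative ones from the example; the function names `weighted_scores` and `pick_kernel` are hypothetical, and results are rounded for display because binary floating-point arithmetic does not represent these decimals exactly.

```python
from typing import Dict


def weighted_scores(
    class_weights: Dict[str, float],
    history_counts: Dict[str, int],
    w_class: float = 0.6,
    w_history: float = 0.4,
) -> Dict[str, float]:
    """Combine classification weights and historical usage counts per kernel."""
    return {
        kernel: class_weights[kernel] * w_class
        + history_counts.get(kernel, 0) * w_history
        for kernel in class_weights
    }


def pick_kernel(scores: Dict[str, float]) -> str:
    """Return the kernel type with the highest combined score."""
    return max(scores, key=scores.get)


scores = weighted_scores({"Trident": 6, "Webkit": 4}, {"Trident": 2, "Webkit": 8})
print({kernel: round(value, 1) for kernel, value in scores.items()})
print(pick_kernel(scores))
```

With the example values this gives 4.4 for Trident and 5.6 for Webkit, so Webkit is selected, matching the text.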
It should be noted here that the numerical values in the above example are illustrative only and are given to help the reader understand the invention; they are not real data from a practical application and should not be regarded as limiting the scope of protection of the present application. Unless otherwise stated, numerical values appearing elsewhere serve the same purpose, which for brevity is not repeated.
Those skilled in the art will understand that the above manner of determining the browser kernel type for rendering a web page is only an example; other existing or future manners of determining the browser kernel type for rendering a web page, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
More preferably (with reference to Fig. 5), the process further comprises step S5 (not shown). In step S5, the determining device 1 provides the browser kernel type information it determined in step S3 for rendering the web page to the browser of a user device, for use in rendering the web page.
Here, the user device may be any electronic product capable of human-machine interaction with a user via a keyboard, mouse, remote control, touch pad, handwriting device or the like, such as a computer, smartphone, PDA or IPTV.
Specifically, in step S5, the determining device 1 sends the determined browser kernel type information for rendering the web page to the user device, in real time or periodically, via an agreed communication method; the information is then written into the rendering information store of the browser. Here, the rendering information store includes, but is not limited to, identification information of each web page, such as its URL or page ID, and the corresponding browser kernel information for rendering that page. The rendering information store may be, without limitation, a relational database, a key-value storage system, a file system or the like.
Here, the manner of providing includes, but is not limited to:
1) using the determined browser kernel type information for rendering each web page to completely overwrite the existing records in the rendering information store of the user device;
2) applying the determined browser kernel type information for rendering web pages to the records in the rendering information store of the user device as a differential overwrite; that is, browser kernel type information for a web page not yet stored in the browser of the user device is inserted into the rendering information store, and browser kernel type information for a web page that is already stored but has changed overwrites the stored entry, thereby updating the browser kernel record for that page.
In one example, in step S5, the determining device 1 sends the determined browser kernel type information for rendering each web page to the user device in real time via an agreed communication method. The user device receives this information by real-time listening and extracts the URL of each web page from it, then performs a matching query in the rendering information store by URL to find the web pages that do not yet exist in the store, and writes the browser kernel type information for those pages into the store. At the same time, for web pages that already exist in the store but whose browser kernel type has changed, the new browser kernel type information overwrites the stored entry. Based on this information, the browser can then switch to the corresponding browser kernel for different web pages.
Those skilled in the art will understand that the above manner of providing browser kernel type information is only an example; other existing or future manners of providing browser kernel type information, if applicable to the present invention, should also fall within the scope of protection of the present invention and are incorporated herein by reference.
Fig. 6 illustrates the method flow diagram of browser kernel type that is used for determining to play up webpage in accordance with a preferred embodiment of the present invention.Wherein, this process also comprises step S4 '.
At this, the content of described definite equipment 1 in step S1 and step S3 is identical with reference to Fig. 5 with the front to determine the function of equipment 1 in step S1 ' and step S3 ' shown in Fig. 6, for simplicity's sake, it is contained in this with way of reference, does not give unnecessary details and do not do.
Specifically, in step S4', determining device 1 obtains preferred web pages, according to a predetermined screening rule, from the pending web pages it obtained in step S1'. In step S2', determining device 1 obtains, according to these preferred web pages, the browser-related characteristic information of these preferred web pages.
Here, the predetermined screening rule includes but is not limited to:
1) taking web pages whose cumulative browse count exceeds a cumulative count threshold as preferred web pages;
2) taking a first predetermined number of web pages with the highest cumulative browse counts as preferred web pages;
3) taking web pages whose browse frequency exceeds a frequency threshold as preferred web pages;
4) taking a second predetermined number of web pages with the highest browse frequency as preferred web pages.
Those skilled in the art will understand that each of the above screening rules can be used on its own by determining device 1 to obtain preferred web pages, and that several of them can also be combined for that purpose.
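As a rough illustration, the four screening rules reduce to two helpers: a threshold filter and a top-N selection. The data layout, function names, and sample numbers below are assumptions made for the sketch. The text does not say how rules are combined, so the set intersection shown here is only one possible choice (a union would be another).

```python
def over_threshold(stats, key, threshold):
    """Rules 1 and 3: keep URLs whose statistic exceeds the threshold."""
    return {url for url, s in stats.items() if s[key] > threshold}


def top_n(stats, key, n):
    """Rules 2 and 4: keep the n URLs with the largest statistic."""
    ranked = sorted(stats, key=lambda url: stats[url][key], reverse=True)
    return set(ranked[:n])


# Hypothetical browsing statistics: cumulative count and browse frequency.
stats = {
    "http://a.example/": {"total": 5000, "freq": 12.0},
    "http://b.example/": {"total": 1500, "freq": 40.0},
    "http://c.example/": {"total": 2500, "freq": 3.0},
}

by_total = over_threshold(stats, "total", 2000)   # rule 1
by_freq = top_n(stats, "freq", 2)                 # rule 4
preferred = by_total & by_freq                    # assumed combination: both must hold
```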
Those skilled in the art will also understand that the above predetermined screening rules are merely examples. Other existing or future predetermined screening rules, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
In one example, in step S1', determining device 1 obtains the pending web pages. Then, in step S4', determining device 1 extracts the URLs of these pending web pages, performs a matching query in the web page browsing record store to obtain the cumulative browse count of each pending web page, and takes the web pages whose cumulative browse count exceeds the cumulative count threshold of 2000 as preferred web pages. In step S2', determining device 1 obtains, according to these preferred web pages, the browser-related characteristic information of these preferred web pages.
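The example above (steps S1', S4', S2') might look like the following sketch. The threshold of 2000 comes from the text. Everything else is assumed for illustration: the patent does not enumerate the characteristic information, so the two features extracted here (a count of `<object>` tags and the presence of an `X-UA-Compatible` meta tag) are hypothetical stand-ins, as are the function names and the dictionary-based browsing record store.

```python
from html.parser import HTMLParser

CUMULATIVE_THRESHOLD = 2000  # threshold used in the example in the text


class FeatureParser(HTMLParser):
    """Collects two hypothetical browser-related features from markup."""

    def __init__(self):
        super().__init__()
        self.features = {"object_tags": 0, "has_x_ua_compatible": False}

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "object":
            self.features["object_tags"] += 1
        http_equiv = (attrs.get("http-equiv") or "").lower()
        if tag == "meta" and http_equiv == "x-ua-compatible":
            self.features["has_x_ua_compatible"] = True


def extract_features(html):
    """Step S2': derive characteristic information from the markup document."""
    parser = FeatureParser()
    parser.feed(html)
    parser.close()
    return parser.features


def process(pending, browse_log):
    """pending: {url: html}; browse_log: {url: cumulative browse count}."""
    # Step S4': keep pages whose cumulative browse count exceeds the threshold.
    preferred = {url: html for url, html in pending.items()
                 if browse_log.get(url, 0) > CUMULATIVE_THRESHOLD}
    # Step S2': extract features only for the preferred pages.
    return {url: extract_features(html) for url, html in preferred.items()}
```

Screening before feature extraction means markup is parsed only for pages that pass the threshold, which is the ordering the example describes.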
Those skilled in the art will understand that the above manner of obtaining preferred web pages and/or of obtaining browser-related characteristic information is merely an example. Other existing or future manners of obtaining preferred web pages and/or of obtaining browser-related characteristic information, if applicable to the present invention, shall also fall within the scope of protection of the present invention and are incorporated herein by reference.
To those skilled in the art, it is evident that the present invention is not limited to the details of the above exemplary embodiments, and that it can be implemented in other specific forms without departing from its spirit or essential characteristics. The embodiments should therefore be regarded in every respect as exemplary and non-restrictive. The scope of the present invention is defined by the appended claims rather than by the above description, and all changes that fall within the meaning and range of equivalents of the claims are therefore intended to be embraced within the present invention. Any reference numeral in the claims should not be construed as limiting the claim concerned. In addition, the word "comprising" clearly does not exclude other units or steps, and the singular does not exclude the plural. A plurality of units or means recited in a device claim may also be implemented by a single unit or means through software or hardware. Words such as "first" and "second" are used to denote names and do not denote any particular order.

Claims (18)

1. A computer-implemented method for determining the browser kernel type for rendering a web page, the method comprising the following steps:
a. obtaining a pending web page;
b. obtaining, according to the web page, browser-related characteristic information of the web page;
c. determining, according to the browser-related characteristic information and based on a predetermined classification rule, the browser kernel type for rendering the web page.
2. The method according to claim 1, wherein step b comprises:
- obtaining, according to the markup language document of the web page, the browser-related characteristic information of the web page.
3. method according to claim 1 and 2, wherein, described predtermined category rule comprises following any one at least:
-decision tree classification;
The classification of-support vector machine.
4. The method according to any one of claims 1 to 3, wherein step c comprises:
- determining, by weighting, the browser kernel type for rendering the web page according to the browser-related characteristic information in combination with the historical rendering records of the web page.
5. The method according to any one of claims 1 to 4, wherein the method further comprises:
- obtaining preferred web pages from the pending web pages according to a predetermined screening rule;
wherein step b comprises:
- obtaining, according to the preferred web pages, the browser-related characteristic information of the preferred web pages.
6. The method according to claim 5, wherein the predetermined screening rule includes but is not limited to at least any one of the following:
- taking web pages whose cumulative browse count exceeds a cumulative count threshold as preferred web pages;
- taking a first predetermined number of web pages with the highest cumulative browse counts as preferred web pages;
- taking web pages whose browse frequency exceeds a frequency threshold as preferred web pages;
- taking a second predetermined number of web pages with the highest browse frequency as preferred web pages.
7. The method according to any one of claims 1 to 6, wherein the browser-related characteristic information comprises at least any one of the following:
- web page display characteristic information;
- web page functional characteristic information.
8. The method according to any one of claims 1 to 7, wherein the browser kernel type comprises at least any one of the following:
- Trident kernel;
- Presto kernel;
- Webkit kernel;
- Gecko kernel.
9. The method according to any one of claims 1 to 8, wherein the method further comprises:
- providing the determined browser kernel type information for rendering the web page to the browser of a user equipment, for use in rendering the web page.
10. A device for determining the browser kernel type for rendering a web page, wherein the device comprises:
a first web page obtaining means for obtaining a pending web page;
a characteristic information obtaining means for obtaining, according to the web page, browser-related characteristic information of the web page;
a type determining means for determining, according to the browser-related characteristic information and based on a predetermined classification rule, the browser kernel type for rendering the web page.
11. The device according to claim 10, wherein the characteristic information obtaining means is configured to obtain, according to the markup language document of the web page, the browser-related characteristic information of the web page.
12. The device according to claim 10 or 11, wherein the predetermined classification rule comprises at least any one of the following:
- decision tree classification;
- support vector machine classification.
13. The device according to any one of claims 10 to 12, wherein the type determining means is configured to determine, by weighting, the browser kernel type for rendering the web page according to the browser-related characteristic information in combination with the historical rendering records of the web page.
14. The device according to any one of claims 10 to 13, wherein the device further comprises:
a preferred web page obtaining means for obtaining preferred web pages from the pending web pages according to a predetermined screening rule;
wherein the characteristic information obtaining means is configured to obtain, according to the preferred web pages, the browser-related characteristic information of the preferred web pages.
15. The device according to claim 14, wherein the predetermined screening rule includes but is not limited to at least any one of the following:
- taking web pages whose cumulative browse count exceeds a cumulative count threshold as preferred web pages;
- taking a first predetermined number of web pages with the highest cumulative browse counts as preferred web pages;
- taking web pages whose browse frequency exceeds a frequency threshold as preferred web pages;
- taking a second predetermined number of web pages with the highest browse frequency as preferred web pages.
16. The device according to any one of claims 10 to 15, wherein the browser-related characteristic information comprises at least any one of the following:
- web page display characteristic information;
- web page functional characteristic information.
17. The device according to any one of claims 10 to 16, wherein the browser kernel type comprises at least any one of the following:
- Trident kernel;
- Presto kernel;
- Webkit kernel;
- Gecko kernel.
18. The device according to any one of claims 10 to 17, wherein the device further comprises:
a providing means for providing the determined browser kernel type information for rendering the web page to the browser of a user equipment, for use in rendering the web page.
CN201110413841.1A 2011-12-09 2011-12-09 A kind of method and apparatus for being used to determine to render the browser kernel type of webpage Active CN103164423B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110413841.1A CN103164423B (en) 2011-12-09 2011-12-09 A kind of method and apparatus for being used to determine to render the browser kernel type of webpage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110413841.1A CN103164423B (en) 2011-12-09 2011-12-09 A kind of method and apparatus for being used to determine to render the browser kernel type of webpage

Publications (2)

Publication Number Publication Date
CN103164423A true CN103164423A (en) 2013-06-19
CN103164423B CN103164423B (en) 2017-11-03

Family

ID=48587518

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110413841.1A Active CN103164423B (en) 2011-12-09 2011-12-09 A kind of method and apparatus for being used to determine to render the browser kernel type of webpage

Country Status (1)

Country Link
CN (1) CN103164423B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115004180A (en) * 2020-03-17 2022-09-02 深圳市欢太科技有限公司 Information pushing method and device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101150803A (en) * 2007-10-24 2008-03-26 优视动景(北京)技术服务有限公司 Method for micro-browser to process network data, micro-browser and its server
CN101251855A (en) * 2008-03-27 2008-08-27 腾讯科技(深圳)有限公司 Equipment, system and method for cleaning internet web page
CN101789000A (en) * 2009-12-28 2010-07-28 青岛朗讯科技通讯设备有限公司 Method for classifying modes in search engine
CN101872347A (en) * 2009-04-22 2010-10-27 富士通株式会社 Method and device for judging type of webpage
CN102156709A (en) * 2011-02-28 2011-08-17 奇智软件(北京)有限公司 Browser engine mode switching method

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108052650A (en) * 2017-12-26 2018-05-18 百度在线网络技术(北京)有限公司 Information recommendation method, device and electronic equipment
CN109960531A (en) * 2017-12-26 2019-07-02 中国移动通信集团浙江有限公司 A kind of page display method and device
CN108733401A (en) * 2018-05-03 2018-11-02 北京明朝万达科技股份有限公司 A kind of method and device for realizing browser-safe
CN108733401B (en) * 2018-05-03 2021-12-14 北京明朝万达科技股份有限公司 Method and device for realizing browser compatibility
CN109684584A (en) * 2018-11-15 2019-04-26 北京海泰方圆科技股份有限公司 A kind of intelligent switch method of browser kernel, device, terminal and storage medium
CN112632417A (en) * 2019-09-24 2021-04-09 北京国双科技有限公司 Data processing method, data processing device, storage medium and electronic equipment
CN111241446A (en) * 2020-01-13 2020-06-05 杭州安恒信息技术股份有限公司 Method, device, equipment and medium for extracting text content of web page
CN111241446B (en) * 2020-01-13 2023-10-31 杭州安恒信息技术股份有限公司 Method, device, equipment and medium for extracting text content of web page
CN112887408A (en) * 2021-01-27 2021-06-01 合肥大多数信息科技有限公司 System and method for solving data state sharing of multi-kernel browser
CN113868570A (en) * 2021-08-16 2021-12-31 北京国电通网络技术有限公司 Kernel switching method of multi-core browser and related equipment

Also Published As

Publication number Publication date
CN103164423B (en) 2017-11-03

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant