CN104866527A - Dynamic webpage template matching method and device - Google Patents

Dynamic webpage template matching method and device Download PDF

Info

Publication number
CN104866527A
CN104866527A CN201510199105.9A CN201510199105A CN104866527A CN 104866527 A CN104866527 A CN 104866527A CN 201510199105 A CN201510199105 A CN 201510199105A CN 104866527 A CN104866527 A CN 104866527A
Authority
CN
China
Prior art keywords
webpage
web page
dom
page template
converted
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510199105.9A
Other languages
Chinese (zh)
Inventor
陈本峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Metong Cloud (beijing) Technology Co Ltd
Original Assignee
Metong Cloud (beijing) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Metong Cloud (beijing) Technology Co Ltd filed Critical Metong Cloud (beijing) Technology Co Ltd
Priority to CN201510199105.9A priority Critical patent/CN104866527A/en
Publication of CN104866527A publication Critical patent/CN104866527A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The application of the invention provides a dynamic webpage template matching method which comprises the following steps: obtaining a webpage to be converted; generating a document object model (DOM) of the webpage to be converted; finding a webpage template matched with the webpage to be converted; converting the webpage according to the found webpage template. With adoption of the method, the webpage suitable for the requested terminal display can be automatically generated, and the method has the advantages that the popularization is easy, data migration is not required, the content is updated in a real-time and automatic manner, the development cost is low, and the compatibility is good.

Description

A kind of method of Dynamic Matching web page template and device thereof
Technical field
The application relates to Web Page Processing field, particularly relates to a kind of method and device thereof of Dynamic Matching web page template.
Background technology
Along with popularizing of mobile device, people more and more get used to browsing traditional PC webpage on the mobile devices such as mobile phone, but bring the reading experience of non-constant because PC webpage cannot adapt to the mobile device screen less relative to PC.How existing PC webpage to be converted to the page being suitable for mobile device display, to become industry focus.
At present, the method for head it off mainly comprises: do adaptations to various main flow mobile device browser end; Adopt hawkeye window to webpage convergent-divergent thus carry out content reorientation, obtaining according to positioning action and currently browse region content concrete in webpage; Or do webpage after again obtaining webpage html source code according to webpage network address to reorder process.But it is chaotic that these solutions exist the typesetting being easy to cause webpage, content is overlapping or bulk is blank, and expends network traffics, the problems such as loading velocity is slower.
Summary of the invention
In view of this, the application provides a kind of method and device thereof of Dynamic Matching webpage, can Automatic Extraction webpage DOM, by its corresponding for webpage template Dynamic Matching, thus obtaining the webpage being suitable for mobile terminal display, the application has to be convenient to promote, without the need to Data Migration, content real time automatic update, the advantage such as cost of development is low, compatibility is good.
The application provides a kind of method of Dynamic Matching web page template, and the method comprises the steps: to obtain webpage to be converted; Generate the DOM of webpage to be converted; For the web page template that Webpage searching to be converted mates with it; According to the web page template conversion webpage found.
The web page template wherein mated with it for Webpage searching to be converted comprises: according to certain rule generating web page template base in advance, in the matching process, according to same rule, searches the web page template matched with webpage to be converted.
Further use heuristic rule is the web page template that Webpage searching to be converted mates with it, specifically comprise: set up heuristic rule list in advance, when the web page template mated with it for Webpage searching to be converted, travel through the node in webpage DOM to be identified successively, generate the web page characteristics of webpage to be identified, these web page characteristics and predefined heuristic rule list are carried out mating of rule, if the web page characteristics of webpage to be identified meets the condition of certain heuristic rule definition, then rule match success, the type of webpage of webpage to be identified is exported further according to the rule of coupling.Set up heuristic rule list in advance to comprise: the DOM obtaining webpage in web page template storehouse; Travel through the node in DOM successively, the web page characteristics of generating web page; The web page characteristics obtained is classified, obtains heuristic rule list.
Further, use webpage DOM to be the web page template that Webpage searching to be converted mates with it, comprising: obtain webpage DOM in web page template storehouse, and the webpage DOM obtained is classified, for the webpage DOM of each type sets up corresponding template; Select the DOM in web page template storehouse successively, calculate the similarity of the DOM in this web page template storehouse with the DOM of described webpage to be converted, if described similarity is greater than predetermined threshold value, determine that described webpage to be converted belongs to classification corresponding to the DOM that selects in this web page template storehouse.
The application also provides a kind of device of web page template Dynamic Matching to comprise: acquisition module: obtain webpage to be converted; DOM generation module: the DOM generating webpage to be converted; Matching module: the web page template mated with it for Webpage searching to be converted; Modular converter: according to the web page template conversion webpage found.
Matching module, according to certain rule generating web page template base in advance, in the matching process, according to same rule, searches the web page template matched with webpage to be converted.
Further, matching module uses heuristic rule to be the web page template that Webpage searching to be converted mates with it, matching module sets up heuristic rule list in advance, matching module comprises: comparing module: travel through the node in webpage DOM to be identified successively, generate the web page characteristics of webpage to be identified, these web page characteristics and predefined heuristic rule list are carried out mating of rule, if the web page characteristics of webpage to be identified meets the condition of certain heuristic rule definition, then rule match success, and the type of webpage of webpage to be identified is exported according to the rule of coupling.Matching module also comprises: template base generation module: the DOM obtaining webpage in web page template storehouse; Travel through the node in DOM successively, the web page characteristics of generating web page; Heuristic rule List Generating Module: classify to the web page characteristics obtained, obtains heuristic rule list.
Further, matching module uses webpage DOM to be the web page template that Webpage searching to be converted mates with it, specifically comprise: first matching module obtains webpage DOM in web page template storehouse, classifies to it, and set up corresponding template for the webpage DOM of each type; Matching module selects the DOM in web page template storehouse successively, calculate the similarity of the DOM in this web page template storehouse with the DOM of described webpage to be converted, if described similarity is greater than predetermined threshold value, determine that described webpage to be converted belongs to classification corresponding to the DOM that selects in this web page template storehouse.
From above technical scheme, the template that the application is mated by Dynamic Selection webpage, for terminal selects the webpage being suitable for its display.
By the application, the Dynamic Matching of webpage can be realized, thus without the need under the prerequisite of secondary development, the appearance form of dynamic conditioning webpage.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present application or technical scheme of the prior art, be briefly described to the accompanying drawing used required in embodiment or description of the prior art below, apparently, the accompanying drawing that the following describes is only some embodiments recorded in the application, for those of ordinary skill in the art, other accompanying drawing can also be obtained according to these accompanying drawings.
Fig. 1 is the system construction drawing of terminal, server and the cloud server that the method for the application's Dynamic Matching webpage and device thereof are applied;
Fig. 2 is the process flow diagram of the first embodiment of the method for the application's Dynamic Matching webpage;
Fig. 3 is the process flow diagram of the second embodiment of the method for the application's Dynamic Matching webpage;
Fig. 4 is the process flow diagram of the 3rd embodiment of the method for the application's Dynamic Matching webpage;
Fig. 5 is the pie graph of the device of the application's Dynamic Matching webpage;
Fig. 6 is comparison diagram before and after the webpage after using the dynamic matching method of the application to change; .
Embodiment
During the webpage that the application stores in terminal access server, select the template of mating with this webpage, and according to the template switch webpage selected.By the application, user can obtain the webpage being suitable for terminal demonstration dynamically within the shorter response time, thus obtains better viewing experience.
Certainly, the arbitrary technical scheme implementing the application must not necessarily need to reach above all advantages simultaneously.
Technical scheme in the application is understood better in order to make those skilled in the art, below in conjunction with the accompanying drawing in the embodiment of the present application, technical scheme in the embodiment of the present application is clearly and completely described, obviously, described embodiment is only some embodiments of the present application, instead of whole embodiments.Based on the embodiment in the application, the every other embodiment that those of ordinary skill in the art obtain, all should belong to the scope of the application's protection.
The application's specific implementation is further illustrated below in conjunction with illustrations.
The application provides a kind of method and device thereof of Dynamic Matching webpage, is applied to cloud server and various terminal.
See Fig. 1, the system that the method and apparatus of this Dynamic Matching webpage is applied comprises server 1 and multiple terminal 2 and cloud server 3, wherein server 1 can be various types of server, for interconnected with multiple terminal 2, it stores web page contents, accesses for the multiple terminals 2 be attached thereto.Multiple terminal 2 can be various types of mobile device and can with server 1, cloud server 3 communicate various wired or wireless terminals etc.Cloud server 3 provides the service of Dynamic Matching web page template, can realize wired or wireless connection with server 1 and terminal 2.
Further, described terminal 2, server 1 or cloud server 3 three include control module on hardware, memory module, load module and output module.Wherein, described control module is used for controlling each functional module.Described memory module is used for storage program and intermediate data.Described load module is for receiving the input instruction of user.Described output module is for exporting web page contents.
See Fig. 2, the method for the application's web page template Dynamic Matching comprises:
S1, obtain webpage to be converted.
Code is inserted by the webpage that provides at server 1, realize calling the various services that cloud server 3 provides, this realizes by multiple insertion technology, such as server 1 active request, or cloud server 3 pushes, inserted mode such as can be the code inserting a line dynamic coupling webpage after needing to make the <head> label of the source code of the webpage mated, the place preferably after next-door neighbour <head> label.This codes implement calls the various coupling service that cloud server provides.
When the webpage that terminal 2 access services device 1 stores, if this webpage inserts the code of dynamic coupling webpage in advance, then first this webpage calls the Dynamic Matching function that cloud server 3 provides, and Dynamic Matching function captures web page source code, and further processes.
S2, generate the DOM of webpage to be converted.
DOM is DOM Document Object Model (Document Object Model, be called for short DOM), is W3C(World Wide Web Consortium) standard programming interface of the process easily extensible markup language of Organisation recommendations.Due to the language that HTML is a kind of format, wherein text message needs to be placed in html tag, is provided modifications such as information position, display modes by label.In html format file, under root label, also have the label such as < head >, < body > and < table >, these labels top down form tree-shaped DOM.W3C DOM specification has following regulation to html tag and content of text:
● whole document is a document node
● each html tag is a node element
● the text be included in html element element is text node
● each html attribute is an attribute node
Accordingly, the tree-shaped institutional framework that the DOM of webpage is made up of text node and label node, according to the DOM of these label generating web pages.
S3, the web page template mated with it for Webpage searching to be converted.
According to certain rule generating web page template base in advance, further, in the matching process, according to same rule, search the web page template matched with webpage to be converted, such as, use the URL feature of webpage as matched rule.When generating web page template base, first choose multiple webpage, from the URL character string of these webpages, extract its URL feature, specifically comprise:
URL depth value feature: URL depth value is that the quantitative value of "/" in URL character string adds 1;
URL fullstop quantitative value feature: URL fullstop quantitative value is the quantitative value of ". " in the character string before URL first "/";
URL date feature: search the date literal in URL, if had, to extract respectively date and judge whether the date has legitimacy, if there is legal date literal, then this eigenwert is set to 1, if can not find legal date literal, then this eigenwert is set to 0.
URL type feature: be news web page, forum Web pages, blog web page, advertisement webpage, other types webpage in addition by Web page classifying, obtain the Feature Words of corresponding URL according to these types.The URL Feature Words of such as news web page comprises: " story ", " article ", " content ", " news " and/or " xinwen "; The URL Feature Words of forum Web pages comprises: " detail ", " thread-", " viewthread ", " read-", " tid ", " forum ", " luntan ", " bbs ", " tieba ", " guba ", " shequ ", " tiezi ", " huitie ", " post " and/or " showtopic " etc., can specify according to actual needs.
After setting up web page template storehouse according to the URL feature of webpage, signature analysis is carried out to the URL of webpage to be converted, judges the template of webpage to be converted
The web page template conversion webpage that S4, basis find.
The feature text node content how corresponding to layout webpage to be converted is defined in web page template.According to the rule in web page template again layout webpage to be converted, thus obtain the webpage after conversion.
Such as by extracting the URL feature of webpage to be converted, thus when judging that webpage to be converted is forum website, then extract the feature text node comprising " post people " and " issuing time " in webpage DOM to be converted, and abandon other nodes, and " post people " is placed in webpage upper left side, " issuing time " is placed in webpage upper right side, and then layout generates new web page.
Be described above the first embodiment of the method for the application's Dynamic Matching webpage, below in conjunction with Fig. 3, introduce the second embodiment of the method for the application's Dynamic Matching webpage, wherein, step S1 in step S31, S32 and S34 and Fig. 2, S2, S4 are identical, do not repeat them here, further describe step S33, be described as follows:
Step S33, use heuristic rule are the web page template that Webpage searching to be converted mates with it.
Set up heuristic rule list in advance, following steps can be used to generate heuristic rule list:
Step P1: the DOM obtaining webpage in web page template storehouse;
Step P2: travel through the node in DOM successively, the web page characteristics of generating web page;
Node in traversal DOM, thus the feature of generating web page, such as, web page characteristics can comprise high frequency words feature, architectural feature, label characteristics, chain feature and/or grammar property, can obtain dissimilar web page characteristics according to actual needs.
Step P3, the web page characteristics obtained to be classified, obtain heuristic rule list.
Webpage is presorted, such as be divided into news type webpage, blog type webpage, Forum Type webpage, adline webpage etc., the web page characteristics obtained in step P2 is associated with type of webpage, such as, is divided into news type web page characteristics, blog type web page characteristics, Forum Type web page characteristics etc.Thus form heuristic rule list.
Further, be web page template that Webpage searching to be converted mates with it.
When the web page template mated with it for Webpage searching to be converted, travel through the node in webpage DOM to be identified successively, generate the web page characteristics of webpage to be identified, these web page characteristics and predefined heuristic rule list are carried out mating of rule, if the web page characteristics of webpage to be identified meets the condition of certain heuristic rule definition, then rule match success, and the type of webpage of webpage to be identified is exported according to the rule of coupling.Such as webpage medium-high frequency Feature Words to be identified comprises: " news ", " text ", " source ", " information ", " report ", " daily paper ", " Times ", " evening paper ", " it is reported ", " reporter ", " news ", " media ", " this newspaper ", " special topic ", " according to heuristic rule, " center ", " editor ", " channel ", " important news ", " current events ", " responsible editor ", " news seniority among brothers and sisters " and/or " news seen by mobile phone ", then can judge that this webpage is news type webpage.
Be described above the second embodiment of the method for the application's Dynamic Matching webpage, below in conjunction with Fig. 4, introduce the 3rd embodiment of the method for the application's Dynamic Matching webpage, wherein, step S1 in step S41, S42, S44 and Fig. 2, S2, S4 are identical, do not repeat them here, further describe step S43, be described as follows:
Step S43, use webpage DOM are the web page template that Webpage searching to be converted mates with it.
First obtain webpage DOM in web page template storehouse, and it is classified, and set up corresponding template for the webpage DOM of each type.
For the web page template that Webpage searching to be converted mates with it.
Select the DOM in web page template storehouse successively, calculate the similarity of the DOM in this web page template storehouse with the DOM of described webpage to be converted, if described similarity is greater than predetermined threshold value, determine that described webpage to be converted belongs to classification corresponding to the DOM that selects in this web page template storehouse.
It will be understood by those skilled in the art that, heuristic rule can be used to be the web page template that Webpage searching to be converted mates with it, or use webpage DOM to be the web page template that Webpage searching to be converted mates with it, or first use heuristic rule to be the web page template that Webpage searching to be converted mates with it, if find the web page template of coupling, end process, if do not found, continue to use webpage DOM to be the web page template that Webpage searching to be converted mates with it, if or first used webpage DOM to be the web page template that web page template that Webpage searching to be converted mates with it finds coupling, would end process, if do not found, continue to use heuristic rule to be the web page template that Webpage searching to be converted mates with it.Those skilled in the art can select processing mode as required.
See Fig. 5, the device of the application's web page template Dynamic Matching comprises:
Acquisition module 501: obtain webpage to be converted.
Code is inserted by the webpage that provides at server 1, realize calling the various services that cloud server 3 provides, this realizes by multiple insertion technology, such as server 1 active request, or cloud server 3 pushes, inserted mode such as can be the code inserting a line dynamic coupling webpage after needing to make the <head> label of the source code of the webpage mated, the place preferably after next-door neighbour <head> label.This codes implement calls the various coupling service that cloud server provides.
When the webpage that terminal 2 access services device 1 stores, if this webpage inserts the code of dynamic coupling webpage in advance, then first this webpage calls the Dynamic Matching function that cloud server 3 provides, and Dynamic Matching function captures web page source code, and further processes.。
DOM generation module 502: the DOM generating webpage to be converted.
DOM is DOM Document Object Model (Document Object Model, be called for short DOM), is W3C(World Wide Web Consortium) standard programming interface of the process easily extensible markup language of Organisation recommendations.Due to the language that HTML is a kind of format, wherein text message needs to be placed in html tag, is provided modifications such as information position, display modes by label.In html format file, under root label, also have the label such as < head >, < body > and < table >, these labels top down form tree-shaped DOM.W3C DOM specification has following regulation to html tag and content of text:
● whole document is a document node
● each html tag is a node element
● the text be included in html element element is text node
● each html attribute is an attribute node
Accordingly, the tree-shaped institutional framework that the DOM of webpage is made up of text node and label node, according to the DOM of these label generating web pages.
Matching module 503: the web page template mated with it for Webpage searching to be converted.
According to certain rule generating web page template base in advance, further, in the matching process, according to same rule, search the web page template matched with webpage to be converted, such as, use the URL feature of webpage as matched rule.When generating web page template base, first choose multiple webpage, from the URL character string of these webpages, extract its URL feature, specifically comprise:
URL depth value feature: URL depth value is that the quantitative value of "/" in URL character string adds 1;
URL fullstop quantitative value feature: URL fullstop quantitative value is the quantitative value of ". " in the character string before URL first "/";
URL date feature: search the date literal in URL, if had, to extract respectively date and judge whether the date has legitimacy, if there is legal date literal, then this eigenwert is set to 1, if can not find legal date literal, then this eigenwert is set to 0.
URL type feature: be news web page, forum Web pages, blog web page, advertisement webpage, other types webpage in addition by Web page classifying, obtain the Feature Words of corresponding URL according to these types.The URL Feature Words of such as news web page comprises: " story ", " article ", " content ", " news " and/or " xinwen "; The URL Feature Words of forum Web pages comprises: " detail ", " thread-", " viewthread ", " read-", " tid ", " forum ", " luntan ", " bbs ", " tieba ", " guba ", " shequ ", " tiezi ", " huitie ", " post " and/or " showtopic " etc., can specify according to actual needs.
After setting up web page template storehouse according to the URL feature of webpage, signature analysis is carried out to the URL of webpage to be converted, judges the template of webpage to be converted
Modular converter 504: according to the web page template conversion webpage found.
The feature text node content how corresponding to layout webpage to be converted is defined in web page template.According to the rule in web page template again layout webpage to be converted, thus obtain the webpage after conversion.
Such as by extracting the URL feature of webpage to be converted, thus when judging that webpage to be converted is forum website, then extract the feature text node comprising " post people " and " issuing time " in webpage DOM to be converted, and abandon other nodes, and " post people " is placed in webpage upper left side, " issuing time " is placed in webpage upper right side, and then layout generates new web page.
Be described above the first embodiment of the device of the application's Dynamic Matching webpage, the second embodiment of the device of the application's Dynamic Matching webpage is introduced below in conjunction with Fig. 6, wherein, acquisition module, DOM generation module and modular converter and the acquisition module in Fig. 5, DOM generation module, modular converter are identical, do not repeat them here, further describe the matching module in Fig. 5, be described as follows:
Matching module: use heuristic rule to be the web page template that Webpage searching to be converted mates with it.
Matching module sets up heuristic rule list in advance, and matching module comprises:
Template base generation module: the DOM obtaining webpage in web page template storehouse; Travel through the node in DOM successively, the web page characteristics of generating web page.Travel through the node in DOM successively, the web page characteristics of generating web page comprises: the node in traversal DOM, thus the feature of generating web page, such as, web page characteristics can comprise high frequency words feature, architectural feature, label characteristics, chain feature and/or grammar property, can obtain dissimilar web page characteristics according to actual needs.
Heuristic rule List Generating Module: classify to the web page characteristics obtained, obtains heuristic rule list.
Webpage is presorted, such as be divided into news type webpage, blog type webpage, Forum Type webpage, adline webpage etc., the web page characteristics obtained in step P2 is associated with type of webpage, such as, is divided into news type web page characteristics, blog type web page characteristics, Forum Type web page characteristics etc.Thus form heuristic rule list.
Comparing module: travel through the node in webpage DOM to be identified successively, generate the web page characteristics of webpage to be identified, these web page characteristics and predefined heuristic rule list are carried out mating of rule, if the web page characteristics of webpage to be identified meets the condition of certain heuristic rule definition, then rule match success, and the type of webpage of webpage to be identified is exported according to the rule of coupling.Such as webpage medium-high frequency Feature Words to be identified comprises: " news ", " text ", " source ", " information ", " report ", " daily paper ", " Times ", " evening paper ", " it is reported ", " reporter ", " news ", " media ", " this newspaper ", " special topic ", " according to heuristic rule, " center ", " editor ", " channel ", " important news ", " current events ", " responsible editor ", " news seniority among brothers and sisters " and/or " news seen by mobile phone ", then can judge that this webpage is news type webpage.
Be described above the second embodiment of the device of the application's Dynamic Matching webpage, introduce the 3rd embodiment of the device of the application's Dynamic Matching webpage below, wherein, acquisition module, DOM generation module and modular converter are identical with the acquisition module in Fig. 5, DOM generation module and modular converter, do not repeat them here, wherein further describe the matching module in Fig. 5, be described as follows:
Matching module: use webpage DOM to be the web page template that Webpage searching to be converted mates with it.
First matching module obtains webpage DOM in web page template storehouse, and classifies to it, and sets up corresponding template for the webpage DOM of each type.Concrete coupling step is: select the DOM in web page template storehouse successively, calculate the similarity of the DOM in this web page template storehouse with the DOM of described webpage to be converted, if described similarity is greater than predetermined threshold value, determine that described webpage to be converted belongs to classification corresponding to the DOM that selects in this web page template storehouse.
It will be understood by those skilled in the art that, matching module can use heuristic rule to be the web page template that Webpage searching to be converted mates with it, or use webpage DOM to be the web page template that Webpage searching to be converted mates with it, or first use heuristic rule to be the web page template that Webpage searching to be converted mates with it, if find the web page template of coupling, end process, if do not found, continue to use webpage DOM to be the web page template that Webpage searching to be converted mates with it, if or first used webpage DOM to be the web page template that web page template that Webpage searching to be converted mates with it finds coupling, would end process, if do not found, continue to use heuristic rule to be the web page template that Webpage searching to be converted mates with it.Those skilled in the art can select processing mode as required.
Fig. 6 shows the front and back comparison diagram of the web displaying after the dynamic matching method conversion using the application, when former webpage directly loads on mobile phone, there is the situation that screen and webpage do not match, after using this technology, web page contents is according to the corresponding minimizing of screen size, thus improve website loading velocity, save flow.
It will be understood by those skilled in the art that the embodiment of the application can be provided as method, device (equipment) or computer program.Therefore, the application can adopt the form of complete hardware embodiment, completely software implementation or the embodiment in conjunction with software and hardware aspect.And the application can adopt in one or more form wherein including the upper computer program implemented of computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) of computer usable program code.
The application describes with reference to according to the process flow diagram of the method for the embodiment of the present application, device (equipment) and computer program and/or block scheme.Should understand can by the combination of the flow process in each flow process in computer program instructions realization flow figure and/or block scheme and/or square frame and process flow diagram and/or block scheme and/or square frame.These computer program instructions can being provided to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, making the instruction performed by the processor of computing machine or other programmable data processing device produce device for realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be stored in can in the computer-readable memory that works in a specific way of vectoring computer or other programmable data processing device, the instruction making to be stored in this computer-readable memory produces the manufacture comprising command device, and this command device realizes the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
These computer program instructions also can be loaded in computing machine or other programmable data processing device, make on computing machine or other programmable devices, to perform sequence of operations step to produce computer implemented process, thus the instruction performed on computing machine or other programmable devices is provided for the step realizing the function of specifying in process flow diagram flow process or multiple flow process and/or block scheme square frame or multiple square frame.
Although described the preferred embodiment of the application, those skilled in the art once obtain the basic creative concept of cicada, then can make other change and amendment to these embodiments.So claims are intended to be interpreted as comprising preferred embodiment and falling into all changes and the amendment of the application's scope.Obviously, those skilled in the art can carry out various change and modification to the application and not depart from the spirit and scope of the application.Like this, if these amendments of the application and modification belong within the scope of the application's claim and equivalent technologies thereof, then the application is also intended to comprise these change and modification.

Claims (10)

1. a method for Dynamic Matching web page template, the method comprises the steps:
Obtain webpage to be converted;
Generate the DOM of webpage to be converted;
For the web page template that Webpage searching to be converted mates with it;
According to the web page template conversion webpage found.
2. the method for Dynamic Matching web page template as claimed in claim 1, the web page template wherein mated with it for Webpage searching to be converted comprises: according to certain rule generating web page template base in advance, in the matching process, according to same rule, search the web page template matched with webpage to be converted.
3. the method for Dynamic Matching web page template as claimed in claim 1, wherein heuristic rule is used to be the web page template that Webpage searching to be converted mates with it, specifically comprise: set up heuristic rule list in advance, when the web page template mated with it for Webpage searching to be converted, travel through the node in webpage DOM to be identified successively, generate the web page characteristics of webpage to be identified, these web page characteristics and predefined heuristic rule list are carried out mating of rule, if the web page characteristics of webpage to be identified meets the condition of certain heuristic rule definition, then rule match success, the type of webpage of webpage to be identified is exported further according to the rule of coupling.
4. the method for Dynamic Matching web page template as claimed in claim 3, wherein sets up heuristic rule list in advance and comprises: the DOM obtaining webpage in web page template storehouse; Travel through the node in DOM successively, the web page characteristics of generating web page; The web page characteristics obtained is classified, obtains heuristic rule list.
5. the method for Dynamic Matching web page template as claimed in claim 1, wherein webpage DOM is used to be the web page template that Webpage searching to be converted mates with it, comprise: obtain webpage DOM in web page template storehouse, and the webpage DOM obtained is classified, for the webpage DOM of each type sets up corresponding template; Select the DOM in web page template storehouse successively, calculate the similarity of the DOM in this web page template storehouse with the DOM of described webpage to be converted, if described similarity is greater than predetermined threshold value, determine that described webpage to be converted belongs to classification corresponding to the DOM that selects in this web page template storehouse.
6. the device of a web page template Dynamic Matching comprises:
Acquisition module: obtain webpage to be converted;
DOM generation module: the DOM generating webpage to be converted;
Matching module: the web page template mated with it for Webpage searching to be converted;
Modular converter: according to the web page template conversion webpage found.
7. the device of web page template Dynamic Matching as claimed in claim 6, wherein matching module is according to certain rule generating web page template base in advance, in the matching process, according to same rule, searches the web page template matched with webpage to be converted.
8. the device of web page template Dynamic Matching as claimed in claim 6, wherein matching module uses heuristic rule to be the web page template that Webpage searching to be converted mates with it, and matching module sets up heuristic rule list in advance, and matching module comprises:
Comparing module: travel through the node in webpage DOM to be identified successively, generate the web page characteristics of webpage to be identified, these web page characteristics and predefined heuristic rule list are carried out mating of rule, if the web page characteristics of webpage to be identified meets the condition of certain heuristic rule definition, then rule match success, and the type of webpage of webpage to be identified is exported according to the rule of coupling.
9. the device of Dynamic Matching web page template as claimed in claim 8, wherein matching module also comprises:
Template base generation module: the DOM obtaining webpage in web page template storehouse; Travel through the node in DOM successively, the web page characteristics of generating web page;
Heuristic rule List Generating Module: classify to the web page characteristics obtained, obtains heuristic rule list.
10. the device of web page template Dynamic Matching as claimed in claim 6, wherein matching module uses webpage DOM to be the web page template that Webpage searching to be converted mates with it, specifically comprises:
First matching module obtains webpage DOM in web page template storehouse, classifies to it, and sets up corresponding template for the webpage DOM of each type; Matching module selects the DOM in web page template storehouse successively, calculate the similarity of the DOM in this web page template storehouse with the DOM of described webpage to be converted, if described similarity is greater than predetermined threshold value, determine that described webpage to be converted belongs to classification corresponding to the DOM that selects in this web page template storehouse.
CN201510199105.9A 2015-04-24 2015-04-24 Dynamic webpage template matching method and device Pending CN104866527A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510199105.9A CN104866527A (en) 2015-04-24 2015-04-24 Dynamic webpage template matching method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510199105.9A CN104866527A (en) 2015-04-24 2015-04-24 Dynamic webpage template matching method and device

Publications (1)

Publication Number Publication Date
CN104866527A true CN104866527A (en) 2015-08-26

Family

ID=53912358

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510199105.9A Pending CN104866527A (en) 2015-04-24 2015-04-24 Dynamic webpage template matching method and device

Country Status (1)

Country Link
CN (1) CN104866527A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105138698A (en) * 2015-09-25 2015-12-09 百度在线网络技术(北京)有限公司 Dynamic layout method and device for webpages
CN105740475A (en) * 2016-03-16 2016-07-06 中山大学 Web page transformation method and system
CN105827733A (en) * 2016-05-12 2016-08-03 北京金山安全软件有限公司 Data pushing method and device and electronic equipment
CN106326482A (en) * 2016-08-31 2017-01-11 江苏中威科技软件系统有限公司 System of visualized big data collection and analysis and file conversion and method thereof
CN107577783A (en) * 2017-09-15 2018-01-12 电子科技大学 The type of webpage automatic identifying method excavated based on Web architectural features
CN108021598A (en) * 2016-11-04 2018-05-11 广州市动景计算机科技有限公司 Page extraction template matching process, device and server
CN108519986A (en) * 2018-02-24 2018-09-11 阿里巴巴集团控股有限公司 A kind of webpage generating method, device and equipment
CN110442343A (en) * 2018-05-04 2019-11-12 阿里巴巴集团控股有限公司 A kind of conversion of page method and conversion of page device

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101276362A (en) * 2007-03-26 2008-10-01 国际商业机器公司 Apparatus and method for optimizing and differencing web page browsing
US7765236B2 (en) * 2007-08-31 2010-07-27 Microsoft Corporation Extracting data content items using template matching
CN101916285A (en) * 2010-08-20 2010-12-15 北京新岸线网络技术有限公司 Method and device for analyzing internet web page contents
CN102033944A (en) * 2010-12-21 2011-04-27 重庆新媒农信科技有限公司 Mobile terminal-based web page display system and method
CN102546793A (en) * 2011-12-31 2012-07-04 广州杰赛科技股份有限公司 Self-adaption method and system of mobile terminal based on cloud computing
CN103294788A (en) * 2013-05-20 2013-09-11 广州迅睿网络科技有限公司 Universal background processing method and system for websites
CN103412928A (en) * 2013-08-16 2013-11-27 北京乐动卓越科技有限公司 Method and device for realizing browser page intelligent response-type layout on mobile terminal
CN103685476A (en) * 2013-11-26 2014-03-26 优视科技有限公司 Method for displaying web pages by aid of web page templates and web page template server

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101276362A (en) * 2007-03-26 2008-10-01 国际商业机器公司 Apparatus and method for optimizing and differencing web page browsing
US7765236B2 (en) * 2007-08-31 2010-07-27 Microsoft Corporation Extracting data content items using template matching
CN101916285A (en) * 2010-08-20 2010-12-15 北京新岸线网络技术有限公司 Method and device for analyzing internet web page contents
CN102033944A (en) * 2010-12-21 2011-04-27 重庆新媒农信科技有限公司 Mobile terminal-based web page display system and method
CN102546793A (en) * 2011-12-31 2012-07-04 广州杰赛科技股份有限公司 Self-adaption method and system of mobile terminal based on cloud computing
CN103294788A (en) * 2013-05-20 2013-09-11 广州迅睿网络科技有限公司 Universal background processing method and system for websites
CN103412928A (en) * 2013-08-16 2013-11-27 北京乐动卓越科技有限公司 Method and device for realizing browser page intelligent response-type layout on mobile terminal
CN103685476A (en) * 2013-11-26 2014-03-26 优视科技有限公司 Method for displaying web pages by aid of web page templates and web page template server

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105138698A (en) * 2015-09-25 2015-12-09 百度在线网络技术(北京)有限公司 Dynamic layout method and device for webpages
CN105138698B (en) * 2015-09-25 2020-12-11 百度在线网络技术(北京)有限公司 Dynamic layout method and device for webpage
CN105740475A (en) * 2016-03-16 2016-07-06 中山大学 Web page transformation method and system
CN105827733A (en) * 2016-05-12 2016-08-03 北京金山安全软件有限公司 Data pushing method and device and electronic equipment
CN105827733B (en) * 2016-05-12 2019-06-07 珠海豹趣科技有限公司 A kind of method, apparatus and electronic equipment of propelling data
CN106326482A (en) * 2016-08-31 2017-01-11 江苏中威科技软件系统有限公司 System of visualized big data collection and analysis and file conversion and method thereof
CN108021598A (en) * 2016-11-04 2018-05-11 广州市动景计算机科技有限公司 Page extraction template matching process, device and server
CN108021598B (en) * 2016-11-04 2022-05-03 阿里巴巴(中国)有限公司 Page extraction template matching method and device and server
CN107577783A (en) * 2017-09-15 2018-01-12 电子科技大学 The type of webpage automatic identifying method excavated based on Web architectural features
CN108519986A (en) * 2018-02-24 2018-09-11 阿里巴巴集团控股有限公司 A kind of webpage generating method, device and equipment
CN108519986B (en) * 2018-02-24 2022-01-28 创新先进技术有限公司 Webpage generation method, device and equipment
CN110442343A (en) * 2018-05-04 2019-11-12 阿里巴巴集团控股有限公司 A kind of conversion of page method and conversion of page device

Similar Documents

Publication Publication Date Title
CN104866527A (en) Dynamic webpage template matching method and device
US10346484B2 (en) Combining website characteristics in an automatically generated website
EP3518124A1 (en) Webpage rendering method and related device
WO2018000998A1 (en) Interface generation method, apparatus and system
CN108984650B (en) Computer-readable recording medium and computer device
CN102096717A (en) Search method and search engine
CN102141868B (en) Method for quickly operating information interaction page, input method system and browser plug-in
CN103399885B (en) Mining method and device of POI (point of interest) representing images and server
CN104820589B (en) A kind of method and its device of dynamic adaptation webpage
CN105095320A (en) System for identifying, correlating, searching and displaying documents based on relationship superposition and combination
CN103166981A (en) Wireless webpage transcoding method and device
CN111240669B (en) Interface generation method and device, electronic equipment and computer storage medium
CA3061623C (en) File sending in instant messaging applications
KR20220127334A (en) Table browsing methods, devices, electronic devices and storage media in documents
CN102193798B (en) Method for automatically acquiring Open application programming interface (API) based on Internet
CN102073735A (en) Searching method and searching system
CN105653157A (en) Processing method and device for copied text
CN103914486A (en) Document search and display system
CN103377225A (en) Method and device for building knowledge base system
CN101777067B (en) System for recognizing and managing web page contents for mobile communication equipment terminals
US20230259568A1 (en) Webpage history display method and apparatus, and storage medium
CN105824790A (en) Equipment table adapting method, computer equipment and mobile terminal
CN104866545A (en) Method for searching keywords on information display page
CN105159993A (en) Search method and device
CN104850608A (en) Method for searching keywords on information exhibiting page

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150826

RJ01 Rejection of invention patent application after publication