CN101500002A - Fusion publishing method and apparatus oriented to Web content - Google Patents

Fusion publishing method and apparatus oriented to Web content Download PDF

Info

Publication number
CN101500002A
CN101500002A CNA2008100569638A CN200810056963A CN101500002A CN 101500002 A CN101500002 A CN 101500002A CN A2008100569638 A CNA2008100569638 A CN A2008100569638A CN 200810056963 A CN200810056963 A CN 200810056963A CN 101500002 A CN101500002 A CN 101500002A
Authority
CN
China
Prior art keywords
web content
client
fusion
document
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008100569638A
Other languages
Chinese (zh)
Inventor
王劲林
李晔
白鹤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Acoustics CAS
Original Assignee
Institute of Acoustics CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Acoustics CAS filed Critical Institute of Acoustics CAS
Priority to CNA2008100569638A priority Critical patent/CN101500002A/en
Publication of CN101500002A publication Critical patent/CN101500002A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a fusion and issuing method aimed at Web content and a device, wherein, the method comprises: a request aimed at certain Web content is transmitted by a client terminal to a service terminal; the requested Web content is searched by the service terminal, then format conversion, object shearing and compression, as well as cache fusion processing are carried out according to network environment at the client terminal and apparatus parameter information of the client terminal to obtain new Web content; finally the new Web content is returned to the client terminal. The fusion and issuing method aimed at Web content and the device have strong universality, stability and expandability, without installing custom-made client terminal software at the terminal, only needing an own browser of the terminal; and the serviceable range can be effectively expanded.

Description

A kind of fusion publishing method and device at web content
Technical field
The present invention relates to the web services technologies field, specifically, the present invention relates to a kind of content delivery method and device of striding computer network and mobile communications network.
Background technology
At present, the data on the Web are different with data in traditional database, and traditional database all has certain data model, can specifically describe specific data according to model.And the data on the Web are very complicated, do not have specific model description, and the data of each website are independent design separately all, and data itself have readme and dynamically changeable.Thereby the data on the Web have certain structure, but because of the existence of readme level, from but a kind of data of non-complete latticeization, this also is referred to as semi-structured data.Semi-structured is the maximum characteristics that Web goes up data.
In the across a network environment, there are the following problems for operation system: some provide the service of semi-structured content the user capture that heterogeneous networks inserts, and have waits for too long, problems such as the response time is slow, download time is long, connection failure; Problems such as the different terminals user exists terminal processing capacity limited owing to the limitation of equipment of itself, and terminal storage space is limited, and the terminal media player is incompatible.
At present, the content merchant on the Internet is most content distributed by Web.Web is exactly a kind of hypertext information system, and the main notion of Web is exactly that hypertext connects, and it makes that text is the linearity of fixing as a book no longer.But can jump to other position from a position.You can therefrom obtain more information.Can forward on other theme.Want to understand the content of some themes as long as on this theme, tap, just can jump on the document that comprises this theme.We just call Web to it this just multi-link property, and wherein, technology most widely used among the Web has XML, HTML, JavaScript, technology such as CSS.
XML represents Extensible Markup Language (abbreviation of eXtensible Markup Language means extendible SGML).XML is the rule of a cover definition semantic marker, and these marks are divided into many parts with document and these parts are labelled.It also is the meta-tag language, has promptly defined to be used to define other relevant with specific area, syntax-languages semantic, structurized SGML.The English full name of HTML is Hyper Text MarkUp Language, and Chinese is called " HTML ".With the different of general text be, a html file not only comprises content of text, also comprises some Tag, and Chinese claims " mark ".The suffix name of a html file is .htm or .html.JavaScript is a kind of based on object and event-driven and script with security performance.The purpose of using it is to be implemented in the Web page with the HTML hypertext language to carry out reciprocation with Web client.It is by realizing in the html language that embeds or be tuned in standard.Its appearance has remedied the defective of html language.JavaScript is a kind of fairly simple programming language, and using method is that JavaScript increases a script, compiling and interpreting separately to the html file of the page.When supporting that the JavaScript browser is opened this page for one, it can be read this script and carry out its instruction.Therefore JavaScript uses and is easier to conveniently, and operation is fast, is applicable to better simply application.CSS is that the abbreviation .CSS language of Cascading Style Sheets (CSS) is a kind of SGML, it does not need compiling, can be directly carry out (belonging to browser-interpreted type language) by browser. the performance .CSS file that CSS is responsible for web page contents (XHTML) in the standard webpage design also can be described as a text invention part, it has comprised number of C SS mark, the CSS file must use css to be the filename suffix. and can be by simple change CSS file, change the general performance form of webpage, can reduce our workload, CSS is by the CSS working group generation of W3C and safeguards. web content is except supporting HTML at present, also support WML, the content of XHTML grammer, WML is a kind of SGML based on XML, is used to specify the narrow-band devices content and the user interface of (comprising mobile phone and beep-pager).XHTML is the abbreviation of TheExtensible HyperText Markup Language XML (extensible Markup Language).What recommendation was followed at present is that W3C recommends XML1.0 on January 26th, 2000.Though XML data transaction ability is powerful, can substitute HTML fully, in the face of thousands of existing websites, directly adopt XML also premature.Therefore, we expand it with the rule of XML on the basis of HTML4.0, have obtained XHTML.Briefly, the purpose of setting up XHTML is exactly to realize the transition of HTML to XML.
For Web service being provided simultaneously for online user and the user on the Internet of mobile communication, exist certain methods to solve the problem of mobile terminal accessing internet web page at present, mainly be divided into following three classes:
1) method of newly-built synchronous WAP website
These class methods are commonplace, implement also simply, only need design a WAP site according to the design of original WEB website separately; Because this station is specially for the mobile device made, does not all have obstacle on network bandwidth consumption and device processes ability, user experience is fine; Yet because this station needs redesign, makes, from cost angle and standpoint of efficiency, all there is significant limitation in this method, is not suitable for large-scale application.Present external CNN, YAHOO etc., and domestic SINA, SOHU, the mobile site of nearly all websites such as 163 is all realized based on such method.
2) method of subscribing to based on RSS
This method realizes going up based on the RSS agreement, obtains up-to-date information by upgrading the RSS content of subscribing to; It is specific that this method can make the user visit targetedly, and interested web site contents has reduced bandwidth load effectively, improves the ground user experience; Then, this method only is applicable to that part has RSS to subscribe to the website of function, and the RSS content that requires the website to provide simultaneously is simpler, can not support complicated RSS file.This method is not the processing method to the weak structure content in fact, and it needs manually on the backstage HTML content to be changed into RSS XML content, inefficiency.At present, popular various RSS reader based on J2ME support this method.
3) based on the method for HTTP-PROXY framework
A) at the processing of specific website
It is AvantGo that the typical case of this method uses; The method that adopts object channel to order by the processing targetedly to Top Site, provides popular news and information to the user, makes this software become one of best mobile device off-line browsing internet solution; The problem of the existence of this method is, can not solve the problem of mobile device access internet webpage comprehensively, can only visit the popular website of fraction.
B) HTML changes the WML processing
This method is the early stage normal method that adopts of using, and the typical case uses WEST; By a HTTP Proxy who supports wap protocol, original html file is converted to the WML form, reach the purpose that adapts to mobile phone screen and suitable WAP browser.This method can make most of html pages can pass through the WAP browser access, and still, this method can not well be discerned the HTML context, that is to say, can not do to handle targetedly to the each several part content in the webpage targetedly.
C) Opera-Mini method
Opera-Mini is the software of current popular in the world mobile device access internet, simultaneously domestic popular UcWeb in addition, these two kinds of softwares all adopt the method based on HTTP-PROXY, adjust layout, the condensed document embedded object of original html document, issue by special file format then etc., reach the purpose of seamless access internet web page on mobile device, yet this method needs special client software support to discern the special file that PROXY issues, be equivalent to the page is solidificated in the backspace file, do not support general browser etc.
More than each scheme all do not consider the residing different network environments of terminal, different, independently service can only be provided simply according to the file type of the kind of terminal, model, support.The weak point of this method comprises: can not dispose business fast; The different service synchronization that guarantee; Can not effectively utilize existing magnanimity Web data resource and provide service for the user.Generally speaking, these methods are not real content fusion methods.
Summary of the invention
The objective of the invention is to overcome the deficiencies in the prior art, provide different content, improve the intelligent of content emerging system, thereby a kind of fusion publishing method and device that can the self adaptation web content be provided according to different terminals, different network bandwidth self adaptives.
The fusion publishing method at web content that the present invention proposes comprises the steps:
1) client is to the request of service end transmission at a certain web content;
2) the server side searches web content of asking is converted to XML document with this web content;
3) the unsupported object of client device described in the described web content of deletion and the residue object is compressed according to the device parameter information of the network environment of client and this client;
4) the object combination with XML document and after handling obtains new web content;
5) new web content is returned to client.
In the technique scheme, described step 2) in, also comprise: web content is carried out information Recognition and information extraction; Described web content is to be converted into XML document from html document.
In the technique scheme, in the described step 3), delete unsupported: travel through the dom tree of former html document, the unsupported Object node of deletion client device to liking.
In the technique scheme, in the described step 3), the residue object is compressed and comprises:, the multimedia object in the former html document is compressed according to the network environment of client and the device parameter information of this client.
In the technique scheme, in the described step 3), also comprise: the URL in the former html document is formatd, obtain formaing URL.
In the technique scheme, described step 4) is: the multimedia object after will compressing is saved in the specified position of described format URL, obtains final DOM document; According to the network environment of client and the device parameter information of this client, XML document and final DOM document are fused into new web content then.
The fusion distributing device at web content that the present invention proposes comprises:
Receiver module is used to receive the request at a certain web content that client sends to service end;
Format converting module is used to search the web content of being asked, and this web content is converted to XML document;
Shear compression module, be used to delete the unsupported object of client device described in the described web content and the residue object compressed according to the network environment of client and the device parameter information of this client;
Composite module is used for the object combination with XML document and after handling, and obtains new web content;
Sending module is used for new web content is sent to client.
In the technique scheme, described format converting module also is used for web content is carried out information Recognition and information extraction; Described format converting module is to be the format converting module that is converted into XML document from html document with web content.
In the technique scheme, described shearing compression module also is used for the URL of former html document is formatd, and obtains formaing URL.
In the technique scheme, described composite module also is used for the multimedia object after the compression is saved in the specified position of described format URL.
The present invention has following technique effect:
1. the present invention has universality: can under the situation that does not change original operation system web content is applied in the network environment of different bandwidth simultaneously, for example simultaneously provide Web business in the Internet, mobile radio communication, television network broadcast; Various terminals can be enjoyed this Web business, mobile phone for example, PDA, set-top box etc.
2. the present invention has stability, and the present invention has adopted the system configuration of loose coupling, can improve the resistance to overturning of delivery system.
3. the present invention has extensibility: system can be extended to other and the similar field of web content.
4. design framework thinking of the present invention is clear and definite, and it is less to implement difficulty, can be widely applied in the production system and go.
5. the present invention does not need terminal that the client software of customization is installed, and the browser that adopts terminal to carry gets final product as the Opera mobile browser, can enlarge the scope of application effectively.
Description of drawings
Below, describe embodiments of the invention in conjunction with the accompanying drawings in detail, wherein:
Fig. 1 is an operational process schematic diagram of the present invention
Fig. 2 is an enforcement system assumption diagram of the present invention
Fig. 3 is that system platform is chosen schematic diagram
Embodiment
The present invention is applied in the Web system, and its applied environment is introduced a kind of implementation method of the present invention as shown in Figure 2 below.
Embodiment 1
The system platform of the service end of present embodiment, as shown in Figure 3.Wherein,
The server that present embodiment is chosen is DELL 1850, and basic configuration is as follows: XEON 3.2G (2M), DDR 2G internal memory, 146GB hard disk, two 1000M network interface cards.
The operating system that present embodiment is chosen is the FREEBSD system that increases income, and version is: FREEBSD 6.2Release.
The WEB server that present embodiment is chosen is the Apache that increases income, and version is: Apache HTTP ServerVersion 2.2.
The HTTP request processing environment that present embodiment is chosen is PHP+MYSQL, and version is respectively: PHP4.4.5 and MYSQL5.0.
Configuration except platform necessity of above introduction also needs to realize the 5C nucleus module, will do careful introduction below.
The main processing module of present embodiment service end is as follows:
Modular converter (Convert module)
In the 5C process module, the present invention choose one increase income based on the project JTidy of JAVA as HTML Parser, it is bearing the responsibility that html document is converted to the DOM document.JTidy is a software kit of developing on the JAVA platform, and it can search the syntax error of html document, and generally speaking, it is used as the instrument of getting rid of HTML syntax error, and simultaneously, JTidy provides the HTML based on the DOM interface to resolve in full.
Except building the JTidy submodule, in the Convert module, also need to realize the request processing sub, be used for handling the HTTPPOST request; Also need the page request submodule, be used for simulating the generic browser request and download original html page and corresponding page embedded object; Also need the device attribute mapping submodule, be used for seeking the corresponding equipment attribute list.
Shear module (Cut module)
Cut module in the 5C flow process is comparatively simple, mainly comprises 2 submodules: dom tree traversal submodule and Object node chooser module; The former is mainly used to travel through the dom tree that the Convert module generates, when running into new Object node, allocating object node chooser module, Object node chooser module determines by the query facility attribute list whether this Object node is supported by equipment, if do not support, then notify dom tree traversal submodule to delete this Object node.
Compression module (Compress module)
Compress module major function is compression URL and embedded multimedia object, mainly comprises LMT calculating sub module, URL compression submodule and multimedia compression submodule.
The LMT submodule need utilize the dom tree traversal submodule in the Cut module, embedded multimedia object in the traversal html document, and extract the MIME attribute, size, and original URL, and the device attribute of utilizing the Conyert module to obtain, the MIME of calculating destination object, size, and new URL.
URL compression submodule is mapped to the short format URL with figure denote to the URL of destination object in the LMT submodule, and inserts in last of LMT table.
Multimedia compression submodule has adopted the FFMPEG (famous in the world encoding and decoding project) that increases income as CODEC, and FFMPEG is the most complete image of increasing income, looks the audio conversion instrument, has very strong autgmentability; This submodule encapsulates it by PHP, handles corresponding multimedia object according to the rule in the LMT table.
Cache module (Cache module)
Cache module major function is to preserve new multimedia object and the new DOM document FDD of generation that the Compress module generates.This module is made up of 2 submodules, and medium preserve submodule and FDD generates submodule.
Medium generate submodule and utilize the multimedia in LMT table and the Compress module to compress submodule, and new compression result is saved on the corresponding position; FDD generates submodule and utilizes dom tree traversal submodule to replace the link URL that corresponding FURL is arranged among the LMT, generates the FDD document.
Fusion Module (Converge module)
The Converge module is relatively more crucial in a native system module, and we realize by PHP, mainly are made up of two submodules: FDD is to the WML modular converter, and FDD is to the XHTML modular converter.Specifically call which module,, depend on the version of browser on the equipment by the Data Sheet decision of equipment.
FDD abbreviates the F2W module as to the WML modular converter, and present embodiment has adopted the Russian famous crossover tool LazyWap Real-time HTML2WAP converter that increases income, and generates new WML document nWML.
FDD abbreviates the F2X module as to the XHTML modular converter, and present embodiment has also adopted the Muscovite H2X transducer of increasing income, and generates new XHTML document nXHTML.
Based on the said system platform, the fusion publishing method at web content provided by the invention comprises the steps:
1) client is to the request of service end transmission at a certain web content;
2) the server side searches web content of asking carries out Convert (conversion) to this web content, Cut (shearing) then successively, Compress (compression), Cache (buffer memory), Converge (fusion) handles, promptly carry out 5C and handle the web content after obtaining handling;
3) web content after service end will be handled returns to client.
Above-mentioned steps 2) in,, handles original web content, change into and to be fit to different networks, the content of different terminals by following steps.
Step 1: conversion (Convert)
This step is one HTML is transformed into the process of XML, is appreciated that into the process of information Recognition, information extraction; In this field, deep research is all arranged both at home and abroad, there is number numerous, algorithm various in style and solution.Convert among the present invention utilizes HTML Parser to resolve html source code, generates the XML format text;
The handling process of Convert step inside is:
1. obtain the User-Agent parameter of client browser,, obtain the corresponding software and hardware parameter (Data Sheet) of terminal equipment,, then get the device parameter of acquiescence if do not have UA among the HTTP HEADER by the UA on the server (User-Agent) storehouse.
2. obtain the HTTP POST request that client browser sends, wherein target URL is included among the Body of HTTP request.
3. server calls target URL (uniform resource position mark URL, the abbreviation of English Uniform/UniversalResource Locator is also referred to as web page address, is the resource addresses of standard on the internet), obtain html source code and corresponding embedded object, and buffer memory gets up.
4. call HTML Parser, the identification html source code, the page is resolved to discernible DOM document (DOM Document Object Model DOM, the abbreviation of English Document Object Model, be a kind of standard object model that is used for representing HTML and XML document), claim among the present invention that this discernible DOM document is ODD.
Step 2: shear (Cut)
Equipment Data Sheet and source code DOM document that this step utilizes step 1 to obtain travel through whole DOM document, and unsupported object among the equipment Data Sheet of deletion step 1 gained obtains new DOM document, is called NDD.Such as original DOM document is
<channel>
<item?src=’../xx.GIF’>there?is?a?pic</item>
<item?src=’../xx.mp3’>there?is?a?audio</item>
</channel>
If equipment is not supported the Mp3 object, the new DOM document of then deleting behind the mp3 object among the DOM is:
<channel>
<itemsrc=’../xx.GIF’>there?is?a?pic</item>
</channel>
Step 3: compression (Compress)
Html document is an abundant multimedia show device, and except simple word information, multimedia messages is a prior part, and the quality that multimedia messages is handled is directly connected to the experience of user on terminal.
The new DOM document that this step utilizes equipment Data Sheet that step 1 obtains and step 2 to obtain travels through whole dom tree, generates into a Linear Mapping table LMT, as shown in Table 1, illustrates this table content below:
In this step, we utilize Principle of Statistics to calculate destination object, and SIZE is an example with the image, and computational methods are as follows:
Calculate the formula of Size:
S n=Min(W m/1024,H m/768)*S o
Wherein:
S n: new object Size,
S o: primary object Size,
W m: the mobile terminal screen width,
H m: the mobile terminal screen height
The formula of molded breadth:
W n=(W m/1024)*W
Wherein:
W n: new object width,
W o: the primary object width,
W m: the mobile terminal screen width
The formula of computed altitude:
H n=(H m/768)*H o
Wherein:
H n: new object height,
H o: the primary object width,
H m: the mobile terminal screen height
Form 1:LMT form schematic diagram
Primary object MIME Image/Jpeg ...
Primary object SIZE 150*150;120K ...
Primary object URL (OURL) /Yahoo!_files/eva_k erry139x119.jpg ...
New object MIME Image/Jpeg ...
New object SIZE 27*28;20K ...
New object URL (NURL) http://www.5cmobile? .com/yahoo10202020/ eva_kerry139x119_30 .jpg ...
Format URL (FURL) http://www.5cmobile .com/1 ...
(MIME introduces: MIME/S-MIME:Multipurpose Internet Mail Extensionsand Secure MIME has illustrated how to arrange message format that message is exchanged in different mailing systems.The form of MIME is flexible, allows to comprise in the mail file of any type.MIME message can comprise the particular data of text, image, sound, video and other application program.Each mime type is made up of two parts, and the front is the big classification of data, for example sound audio, visual image etc., the concrete kind of back definition.Common mime type HTML text .html, .htmltext/html plain text .txttext/plain RTF text .rtf application/rtfGIF figure .GIF image/gif JPEG figure .ipeg, .jpg image/jpeg au audio files .au audio/basic MIDI music file mid, .midi audio/midi, audio/x-midiRealAudio music file .ra, .ram audio/x-pn-realaudio mpeg file .mpg .mpeg video/mpeg avi file .avi video/x-msvideo GZIP file .gz appl ication/x-gzip TAR file .tar application/x-tar)
Format URL in the LMT table is for the representation formats of unified URL and the length of URL, can reduce unnecessary URL information load effectively.This format URL comes accumulation calculating by unique ID of statistics.
After obtaining mapping table LMT, the multimedia object of invocation step one buffer memory according to the result of calculation in the LMT table, calls corresponding processing module and compresses processing.
Step 4: buffer memory (Cache)
A series of compression result that step 2 is obtained are saved in the final URL that needs demonstration, the just position of FURL.The NDD that traversal step two obtains utilizes FURL in the LMT table that step 3 generates (finally need show URL) to replace corresponding OURL (original URL) among the NDD, generates final DOM document, abbreviation FDD.
Step 5: merge (Converge)
The equipment Data Sheet that this step obtains according to the first step, call the XHTML maker, or WML maker, the FDD in utilization and the 4th step generates the new XHTML page (being called for short nXHTML) or the new WML page (being called for short nWML) returns to requesting service by HTTP RESPONSE.The main effect of XHTML maker and WML maker is according to HTML, XHTML, and the grammer of WML, the respective labels in the html document, parameters such as attribute change into the form of expression of XHTML or WML; XHTML maker during this example is implemented has adopted the converter (http://www.it.uc3m.es/jaf/html2xhtml/) of increasing income, and the WML maker has adopted the converter (http://www.xmlmind.com/foconverter/license_pe.html) of increasing income;
In order to verify the feasibility of present embodiment algorithm, the English homepage (http://www.yahoo.com) of picked at random Yahoo, Yahoo a piece of news (http://news.yahoo.com/s/ap/20071121/ap_on_go_pr_wh/cia_leak_mcc lellan), CNN homepage (http://edition.cnn.com/WORLD/), CNN news (http://money.cnn.com//2007/11/20/news/companies/stem_cell/index.htm? cnn=yes) these four pages experimentize.Table 2 is some statistics to the parent page feature:
Setting parent page textual portions Size is S t, parent page inline object sub-population Size is S o, through handling later textual portions Size be
Figure A200810056963D00191
, through handling later inline object Size be , then the compression ratio computing formula is as follows:
R = S t S t + S o &times; S t &prime; S t + S o S t + S o &times; S o &prime; S o
According to the analysis of this paper first, suppose that the textual portions maximum compression ratio is 50%, with the screen size 176*144, support that the portable terminal of WML browser is an example, theoretic maximum compression rate computing formula is as follows:
R &prime; = 176 &times; 144 1024 &times; 768 &times; S o S t + S o + 50 % &times; S t S t + S o
Shown in the following formula of similarity degree of definition realistic compression ratio and theoretical maximum compression rate:
C=-log[(R-R′)/R′]
Shown in the following formula of degrees of offset of definition realistic compression ratio and theoretical maximum compression rate:
E=|C-1|
Table 2: parent page mark sheet
The webpage attribute Text Size Inline object Size
The Yahoo homepage 166,431Bytes 149,812Bytes
The CNN homepage 43,318Bytes 626,718Bytes
Yahoo news 57,287Bytes 650,690Bytes
CNN news 55,750Bytes 274,564Bytes
Chart and analysis as a result
The program running result is as shown in table 3.To the operation result of four pages, calculate R and R ' respectively.
Table 3: program running result
Figure A200810056963D0020165733QIETU
Simulation result and the contrast of Opera Mini 4 browsers, result's following (download refers to the download through overcompression in this table):
Table 4 operation result comparison sheet
This emulation platform download The Opera-Mini download
The Yahoo homepage 35K?Bytes 39K?Bytes
The CNN homepage 41K?Bytes 40K?Bytes
Yahoo news 49K?Bytes 37K?Bytes
CNN news 31K?Bytes 43K?Bytes
The result of native system is equally matched with Opera Mini browser; Yet, since OperaMini4 utilize own software to Java Script and to the page layout user experience support relatively good, but shortcoming is the requirement user must install special software, yet the system that implements according to the inventive method does not need special software support, can reach identical effect yet.

Claims (10)

1. the fusion publishing method at web content comprises the steps:
1) client is to the request of service end transmission at a certain web content;
2) the server side searches web content of asking is converted to XML document with this web content;
3) the unsupported object of client device described in the described web content of deletion and the residue object is compressed according to the device parameter information of the network environment of client and this client;
4) the object combination with XML document and after handling obtains new web content;
5) new web content is returned to client.
2. the fusion publishing method at web content according to claim 1 is characterized in that, described step 2) in, also comprise: web content is carried out information Recognition and information extraction; Described web content is to be converted into XML document from html document.
3. the fusion publishing method at web content according to claim 2 is characterized in that, in the described step 3), travels through the dom tree of former html document, deletes the unsupported Object node of described client device.
4. the fusion publishing method at web content according to claim 3 is characterized in that, in the described step 3), according to the network environment of client and the device parameter information of this client, the multimedia object in the former html document is compressed.
5. the fusion publishing method at web content according to claim 4 is characterized in that, in the described step 3), also comprises: the URL in the former html document is formatd, obtain formaing URL.
6. the fusion publishing method at web content according to claim 5 is characterized in that, described step 4) is: the multimedia object after will compressing is saved in the specified position of described format URL, obtains final DOM document; According to the network environment of client and the device parameter information of this client, XML document and final DOM document are fused into new web content then.
7. fusion distributing device at web content comprises:
Receiver module is used to receive the request at a certain web content that client sends to service end;
Format converting module is used to search the web content of being asked, and this web content is converted to XML document;
Shear compression module, be used to delete the unsupported object of client device described in the described web content and the residue object compressed according to the network environment of client and the device parameter information of this client;
Composite module is used for the object combination with XML document and after handling, and obtains new web content;
Sending module is used for new web content is sent to client.
8. the fusion distributing device at web content according to claim 7 is characterized in that it is to be converted into XML document from html document that described format converting module is used for web content, also is used for web content is carried out information Recognition and information extraction;
9. the fusion distributing device at web content according to claim 7 is characterized in that, described shearing compression module also is used for the URL of former html document is formatd, and obtains formaing URL.
10. the fusion distributing device at web content according to claim 9 is characterized in that, described composite module also is used for the multimedia object after the compression is saved in the specified position of described format URL.
CNA2008100569638A 2008-01-28 2008-01-28 Fusion publishing method and apparatus oriented to Web content Pending CN101500002A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008100569638A CN101500002A (en) 2008-01-28 2008-01-28 Fusion publishing method and apparatus oriented to Web content

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008100569638A CN101500002A (en) 2008-01-28 2008-01-28 Fusion publishing method and apparatus oriented to Web content

Publications (1)

Publication Number Publication Date
CN101500002A true CN101500002A (en) 2009-08-05

Family

ID=40946883

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008100569638A Pending CN101500002A (en) 2008-01-28 2008-01-28 Fusion publishing method and apparatus oriented to Web content

Country Status (1)

Country Link
CN (1) CN101500002A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102457528A (en) * 2010-10-19 2012-05-16 北京邮电大学 Method for adaptively issuing web content facing to mobile phone terminal and system thereof
CN102594872A (en) * 2012-01-09 2012-07-18 百度在线网络技术(北京)有限公司 Offline image optimization method and system
CN102905045A (en) * 2012-10-26 2013-01-30 北京奇虎科技有限公司 Method and server for providing picture data to computing terminal
CN102938792A (en) * 2012-11-26 2013-02-20 北京奇虎科技有限公司 Method for providing picture data for computing terminal and server
CN103067423A (en) * 2011-10-20 2013-04-24 腾讯科技(深圳)有限公司 Browser kernel adaption method and browser
CN103209162A (en) * 2012-01-16 2013-07-17 中国科学院声学研究所 Method and device for deploying Web-type business
CN103248641A (en) * 2012-02-07 2013-08-14 腾讯科技(深圳)有限公司 Network download method, device and system
CN103729382A (en) * 2012-10-16 2014-04-16 腾讯科技(深圳)有限公司 Structural display method and device for WAP page
CN103729425A (en) * 2013-12-24 2014-04-16 腾讯科技(深圳)有限公司 Operation response method, client, browser and operation response system
CN105657070A (en) * 2012-11-26 2016-06-08 北京奇虎科技有限公司 Method and server for providing picture data to computing terminal
CN112882849A (en) * 2021-03-09 2021-06-01 北京字节跳动网络技术有限公司 Information recommendation method, device, system, equipment and storage medium in cloud application

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102457528A (en) * 2010-10-19 2012-05-16 北京邮电大学 Method for adaptively issuing web content facing to mobile phone terminal and system thereof
CN103067423A (en) * 2011-10-20 2013-04-24 腾讯科技(深圳)有限公司 Browser kernel adaption method and browser
US9571556B2 (en) 2011-10-20 2017-02-14 Tencent Technology (Shenzhen) Company Limited Browser kernel adaptation method and browser therefor
CN103067423B (en) * 2011-10-20 2015-10-14 腾讯科技(深圳)有限公司 The method of browser kernel adaptation and browser
CN102594872A (en) * 2012-01-09 2012-07-18 百度在线网络技术(北京)有限公司 Offline image optimization method and system
CN103209162B (en) * 2012-01-16 2016-05-18 中国科学院声学研究所 A kind of Web class service deployment method and device
CN103209162A (en) * 2012-01-16 2013-07-17 中国科学院声学研究所 Method and device for deploying Web-type business
CN103248641A (en) * 2012-02-07 2013-08-14 腾讯科技(深圳)有限公司 Network download method, device and system
CN103729382B (en) * 2012-10-16 2018-08-03 腾讯科技(深圳)有限公司 The structured display method and device of WAP web page
CN103729382A (en) * 2012-10-16 2014-04-16 腾讯科技(深圳)有限公司 Structural display method and device for WAP page
CN102905045A (en) * 2012-10-26 2013-01-30 北京奇虎科技有限公司 Method and server for providing picture data to computing terminal
CN105657070A (en) * 2012-11-26 2016-06-08 北京奇虎科技有限公司 Method and server for providing picture data to computing terminal
CN102938792B (en) * 2012-11-26 2016-05-04 北京奇虎科技有限公司 Method and the server of image data are provided to computing terminal
CN102938792A (en) * 2012-11-26 2013-02-20 北京奇虎科技有限公司 Method for providing picture data for computing terminal and server
CN105657070B (en) * 2012-11-26 2019-02-01 北京奇虎科技有限公司 The method and server of image data are provided to computing terminal
CN103729425A (en) * 2013-12-24 2014-04-16 腾讯科技(深圳)有限公司 Operation response method, client, browser and operation response system
CN103729425B (en) * 2013-12-24 2018-11-16 腾讯科技(深圳)有限公司 Operate response method, client, browser and system
CN112882849A (en) * 2021-03-09 2021-06-01 北京字节跳动网络技术有限公司 Information recommendation method, device, system, equipment and storage medium in cloud application

Similar Documents

Publication Publication Date Title
CN101500002A (en) Fusion publishing method and apparatus oriented to Web content
KR101342067B1 (en) Displaying information on a mobile device
CN101583072B (en) Middleware product for realizing Mobile Internet and method thereof
US7505978B2 (en) Aggregating content of disparate data types from disparate data sources for single point access
US7996754B2 (en) Consolidated content management
Chang et al. XML Web Service‐based development model for Internet GIS applications
US20070192674A1 (en) Publishing content through RSS feeds
US20070192683A1 (en) Synthesizing the content of disparate data types
US20120180073A1 (en) Mobile Device Application Framework
KR20040014999A (en) Method and system for transforming an xml document to at least one xml document structured according to a subset of a set of xml grammar rules
EP1567948A2 (en) Transformation of web description documents
CN101040283A (en) Form related data reduction
CN101609415B (en) Universal service calling system and method based on middleware
CN101409937B (en) Method and apparatus for converting script into data format supported by target system
CN102184266A (en) Method for automatically generating dynamic wireless application protocol (WAP) website for separation of page from data
CN101764767A (en) Network interconnection method, gateway facility and system
CN101763423A (en) Method for realizing presentation of tree-structure data in World Wide Web page as well as system and device therefor
US9626346B2 (en) Method of implementing structured and non-structured data in an XML document
KR20030041432A (en) An XML-based method of supplying Web-pages and its system for non-PC information terminals
WO2002082355A2 (en) A system and method for remotely collecting and displaying data
US20030149745A1 (en) Method and apparatus for accessing information from a network data source
WO2002006981A1 (en) Method of reformatting web page and method of providing web page using the same
Kurz et al. FACADE-a framework for context-aware content adaptation and delivery
KR20020090475A (en) Flash Map on the Wireless Internet
Mohammadi et al. Developing Wireless GIS: Using Java and XML Technologies

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090805