CN101132446A - Web page intelligent snapping system and method thereof - Google Patents

Web page intelligent snapping system and method thereof Download PDF

Info

Publication number
CN101132446A
CN101132446A CNA2006101417366A CN200610141736A CN101132446A CN 101132446 A CN101132446 A CN 101132446A CN A2006101417366 A CNA2006101417366 A CN A2006101417366A CN 200610141736 A CN200610141736 A CN 200610141736A CN 101132446 A CN101132446 A CN 101132446A
Authority
CN
China
Prior art keywords
module
webpage
internet
snapshots
snapshot
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2006101417366A
Other languages
Chinese (zh)
Inventor
林宏
鲍劲松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WANWEI INFORMATION TECHN CO Ltd SHANGHAI
Original Assignee
WANWEI INFORMATION TECHN CO Ltd SHANGHAI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WANWEI INFORMATION TECHN CO Ltd SHANGHAI filed Critical WANWEI INFORMATION TECHN CO Ltd SHANGHAI
Priority to CNA2006101417366A priority Critical patent/CN101132446A/en
Publication of CN101132446A publication Critical patent/CN101132446A/en
Pending legal-status Critical Current

Links

Images

Abstract

This invention relates to a Web intelligent snapshot system and a method, in which, the system includes a web cut module with one input end connecting to Internet, a web snapshot module with one input connecting to the cut module, an image process module with one input end connected with the snapshot module, an intelligent control module used in carrying out instructions of users to control snapshot process and connected with the Internet, the Web cut module, the snapshot module, the image process module and terminal users, the method includes: 1, transmitting information to webs by Internet, 2, sorting and purifying the webs reasonably and transmitting webs relating to the topic to 3 by judging according to key words and transmitting the interference information not relating to the topic to the recycle bin, 3, snapshooting the webs, 4, compressing and delaminating the web snapshot to be transmitted to radio terminal users timely.

Description

Web page intelligent snapping system and method thereof
Technical field
The present invention relates to computer networking technology, particularly relate to the Web page intelligent snapping technology that is used for portable terminal in a kind of Internet service.
Background technology
Current the Internet (Internet) is very flourishing, and a few days ago, CNNIC (CNNIC) is in Beijing issue " the 18 China Internet network state of development statistical report ".Report shows that China's internet development is raised speed once more, presents the flourish impetus in many aspects, has entered another fast-developing phase.By the end of on June 30th, 2006, China's netizen's number reached 1.23 hundred million people, has compared with the same period of last year increased by 19.4%, and wherein broadband access network netizen number is 7,700 ten thousand people, and the ratio in all netizens is near 2/3.China's website sum has reached 78.84 ten thousand, has wherein increased by 90,000 the first half of this year.Simultaneously, get online without being tethered to a cable also in flourish, in May, 2000, China moved the formal mobile phone WAP (Wireless Application Protocol, WAP (wireless application protocol)) of release service on net, and the surfing Internet with cell phone business progressively becomes second " killer's level " mobile value-added service after note.According to Ai Rui market consultation corporate statistics and analysis: Chinese WAP number of users had only 9,000,000 in 2003, increased more than four times to number of users in 2004,4,600 ten thousand families have been reached, adjustment along with operator's policy in 2004, the amplification of WAP number of users in 2005 will ease up, userbase reaches 7,200 ten thousand families, mainly is that increasing of free WAP application guaranteed the speed that the user increases, and expects Chinese WAP userbase in 2008 and will reach 2.3 hundred million families.Being about to of the maturation of the development of mobile communication technology, especially 2.5G and 3G starts, and value-added service is improved having greatly aspect technical foundation and the transmission rate, will promote the upgrading of portable terminal and the appearance of various value-added services.And along with the popularizing of WiMAX, the value-added service of all kinds of the Internets also will be widely used in mobile value-added service.Present the Internet is at computer development, and personal computer is through the fast development of decades, and view Internet is a mature technique, and forms ripe market.But hand-held mobile terminals such as mobile phone are because processing unit screen advanced inadequately, equipment is little, and resolution compares to PC and seems too little, therefore to allow mobile phone connect the Internet network and obtain information, still on market, all not reach ripe stage in fact technically.Exploitation mobile device online correlation technique can promote the application of Internet on mobile device undoubtedly greatly, drives huge market.The Web page intelligent snapping technology is developed in order to promote the Internet value-added service to use in mobile value-added service.
Summary of the invention
At the defective that exists in the above-mentioned prior art, technical problem to be solved by this invention provides a kind of internet web page that makes and is fit to Web page intelligent snapping system and the method thereof that wireless terminal (mobile phone) etc. is browsed by image processing.
In order to solve the problems of the technologies described above, a kind of Web page intelligent snapping system provided by the present invention comprises:
One webpage cutting module is used for the page is rationally classified and purified, and can filter the interfere information (as advertisement etc.) of the Internet; Its input connects the Internet;
One snapshots of web pages module is used for webpage is clapped into snapshot; Its input connects described webpage cutting module;
One image processing module is used for snapshots of web pages compression back layering, passes to wireless terminal user in real time, and its input connects described snapshots of web pages module;
One intelligent control module is used to carry out user instruction, the control snapshot processes; Connect the Internet, webpage cutting module, snapshots of web pages module, image processing module and terminal use respectively.
In order to solve the problems of the technologies described above, the operation method of a kind of Web page intelligent snapping system provided by the present invention, its step comprises:
1) imports webpage into, import webpage into by the Internet;
2) webpage cutting is rationally classified and is purified the page, judges that according to keyword the webpage relevant with subject content is sent to 3); Be sent to recycle bin with the irrelevant interfere information of subject content (as advertisement etc.);
3) snapshots of web pages is clapped into snapshot to webpage;
4) image processing is compressed snapshots of web pages the back layering, is real-time transmitted to wireless terminal (mobile phone) user.
Utilize Web page intelligent snapping system provided by the invention and method thereof, owing to adopted processing such as webpage cutting module, snapshots of web pages module, image processing module, intelligent control module, internet web page is realized lightweight and multi-modeization, and portable terminals such as suitable mobile phone are browsed.Lightweight is handled and to be meant that the size of the webpage of handling compares with original Internet webpage, huge compression (can reach about tens times) is arranged when not changing effective information, download when having reduced portable terminal and browsing has been accelerated the speed of browsing and downloading.The multi-mode processing is meant the processing of carrying out different mode at the portable terminal of different model, so that the webpage of handling is fit to present diversified portable terminal, for example, the mobile phone of different model has different resolution requirement.In addition, the intelligent snapping technology of native system also has pooling feature, not only can accelerate access speed, can also play certain emergencing action, for example, the user asks the Internet webpage of visiting deleted or connected when losing efficacy, and can check web page contents by the snapshots of web pages in the visit native system buffering.
Description of drawings
Fig. 1 is the structural representation block diagram of embodiment of the invention Web page intelligent snapping system;
Fig. 2 is the operating procedure schematic diagram of Web page intelligent snapping system of the present invention.
Embodiment
Below in conjunction with description of drawings embodiments of the invention are described in further detail, but present embodiment is not limited to the present invention, every employing analog structure of the present invention, method and similar variation thereof all should be listed protection scope of the present invention in.
Referring to shown in Figure 1, a kind of Web page intelligent snapping system that the embodiment of the invention provided comprises:
One webpage cutting module is used for the page is rationally classified and purified, and can filter the interfere information (as advertisement etc.) of the Internet; Its input connects the Internet;
One snapshots of web pages module is used for webpage is clapped into snapshot; Its input connects described webpage cutting module;
One image processing module is used for snapshots of web pages compression back layering, passes to wireless terminal user in real time, and its input connects described snapshots of web pages module;
One intelligent control module is used to carry out user instruction, the control snapshot processes; Connect the Internet, webpage cutting module, snapshots of web pages module, image processing module and terminal use respectively;
The concrete function of the main functional modules of Web page intelligent snapping system of the present invention is as follows.
Page cutting module
The page of commercialization website is very complicated, not only comprises the various information of user's needs, also comprises a large amount of advertisements, menu, information such as picture.Page cutting module utilizes the intelligent Agent technology of AI to set up the filtering rule algorithm, does not damage under the situation of parent page guaranteeing, comes the page is rationally classified and purified.Classification is by classification such as contents, so that snapshot module is set up corresponding snapshot, as physical culture, news etc. sublink useful in the current page; Purification is to filter out before setting up snapshot with the irrelevant information of subject content in the current page, as advertisement etc.The filtering rule algorithm is at first judged the classification and the rank of webpage.Filtering rule can be set up according to Internet resources URL and keyword.Internet resources URL rule is to judge according to concrete Internet resources URL, for example, the homepage of Sina is (http://www.sina.com.cn/), channel for finance and economics below the homepage is (http://finance.sina.com.cn/), and the sub-channel of financing below the channel for finance and economics is (http://finance.sina.com.cn/money/index.shtml).The keyword rule is to judge according to the webpage associative key, and this rule is followed " the keyword network positions service analysis protocol standard " of CNNIC issue such as (CNNIC) tissue and carried out the keyword judgement.Therefore, judge that according to the filtering rule algorithm Flash advertisement page of opening along with other webpages can in time filter, and high level webpage (homepage, primary subnet page or leaf etc.) can carry out classifying content according to keyword etc., be that the webpage of next stage is handled and prepared.The workflow of webpage cutting module is: after internet content is dispatched to the webpage cutting module by intelligent control module, the intelligent Agent technology is set up the filtering rule algorithm in the module, do not damage under the situation of parent page guaranteeing, the page is rationally classified and purified." noise " content that purifies in the webpage of back is deleted (or entering recycle bin) automatically, effectively content continues classification as the primary subnet page or leaf and purifies after classifying, till webpage entered single theme and can't classify and purify, for example, the news column purpose is message first.The webpage that the webpage cutting module was handled enters next flow process snapshots of web pages module.
The snapshots of web pages module
Content when the snapshots of web pages module can be preserved webpage and gathered in this locality is as this webpage is taken a width of cloth snapshot with camera, so be referred to as snapshots of web pages.The snapshots of web pages technology of native system is utilized HTML grammer reconstruct page in the calculator memory pattern space, is the technology of image with the text-converted in the pattern space.The snapping technique of multipath, be exactly snapshot be not simply some pages to be grabbed, but by appropriate algorithm, the subpage frame that access path important in the homepage points to is grabbed in the lump.Pages of Internet is through the classification and the purification of webpage cutting module, effectively webpage can carry out the page snapshot processing of multipath automatically by the snapshots of web pages intelligent Agent to visit web server, for example can carry out snapshot buffer simultaneously to 50-100 the highest link page of page link weight.The snapshots of web pages module under the scheduling of intelligent control module, classification and the purification information that can handle according to the webpage cutting module, the parallel processing system (PPS) of utilization server carries out snapshot to the page of multipath simultaneously and handles, and has improved the efficient of fast photographic system.
Image processing module
Image processing module carries out advanced image processing to the snapshot of snapshots of web pages module, makes snapshots of web pages satisfy the requirement of getting online without being tethered to a cable.Mainly comprise picture lightweight technology, by advanced compress technique, guarantee the photo resolution height, picture weight is little, and for example, present technique can adopt the real (AT﹠amp of American telephone and telegraph experiment; T Labs) " DjVu " compress technique, compression back picture can accurate bitmap form up to standard the 786K picture of 1/100, one secondary 1024*768 pixel size, compression back size is about 10K.Picture real-time technique/timesharing transmission is exactly that the snapshot picture generates in real time, and the picture fragment of generation utilizes the figure laminar flow to be delivered to user terminal in real time, does not need the user to wait for that whole picture grabs fully, just can see.The picture real-time technique of native system, timesharing transmit picture (continuity for the page is downloaded also adopts continuous snapshot mode to snapshot, guarantees the real-time that the user browses).Level of detail technology (level of detail, LOD), resolution based on customer mobile terminal, set up snapshots of web pages picture buffering, along with the view procedure of user's detailed theme, on buffer server, select optimal LOD picture, for example, set up different snapshots of web pages picture bufferings according to the rank of webpage, as homepage, primary subnet page or leaf, secondary subnet page or leaf etc.Native system innovation ground is applied to the processing of 2D picture with the technology among how much of the 3D, adopts graphics plotting in limited time to satisfy cellphone subscriber's interactive operation.
Intelligent control module
Intelligent control module is the core of Web page intelligent snapping technology of the present invention, and it controls other each functional module just as people's brain.For example, page cutting module utilizes the Agent technology of intelligent control module to set up the filtering rule algorithm, could start the classification and the purification of webpage.In addition, intelligent control module also is the control centre of whole system, mainly plays Task Distribution, each intermodule forwards and each module schedules effect.User's query requests passes to the Internet by intelligent control module, and the processing through each module of Web page intelligent snapping technology feeds back to the user then.Certainly because the Agent of intelligent control module has preprocessing function, and user's most request has been stored in the buffering of snapshots of web pages, therefore, intelligent control module can be made reaction fast to user's request.
Referring to shown in Figure 2, the step of the operation method of Web page intelligent snapping system of the present invention comprises:
1) imports webpage into, import webpage into by the Internet;
2) webpage cutting is rationally classified and is purified the page, judges that according to keyword the webpage relevant with subject content is sent to 3); Be sent to recycle bin with the irrelevant interfere information of subject content (as advertisement etc.);
3) snapshots of web pages is clapped into snapshot to webpage;
4) image processing is compressed snapshots of web pages the back layering, is real-time transmitted to mobile phone (wireless terminal) user.
Web page intelligent snapping system of the present invention has used the snapshots of web pages technology of new generation of exploitations such as artificial intelligence agent skill group, snapshots of web pages technology, image processing techniques; Its related key technology is specific as follows.
Artificial intelligence technology: artificial intelligence (AI, Artificial Intelligence) is a branch of computer science, the essence of human intelligence is understood in its attempt, and produce a kind of new intelligence machine that can react in the similar mode of human intelligence, the research in this field comprises robot, language identification, image recognition, natural language processing and expert system etc.The relevant expert has introduced the notion of intelligent Agent, and with this conceptual framework, AI is defined as design and builds rational intelligent Agent as AI, and the standard of the reasonability of Agent behavior as judge intelligence.By to Agent from the perception external environment condition, to implementing action, and the overall process of at last external environment condition being exerted one's influence, the main field that is separated from each other among the AI, as problem solving, knowledge and reasoning, logical action, uncertain knowledge and reasoning, study and communication, perception and action etc. are unified under this framework of intelligent Agent, have formed an integral body that connects each other.
The snapshots of web pages technology: the content when snapshots of web pages can be preserved webpage and gathered in this locality, as this webpage is taken a width of cloth snapshot with camera, so be referred to as snapshots of web pages.All snapshots of web pages information all are to be kept on the server of website of setting, store the content that can also browse this webpage when these snapshots of web pages can temporarily break down in this website by the buffer memory of this website.If certainly because the reason of time, info web has been replaced or has can not find server, the snapshots of web pages of storage of the present invention also can be helped meet an urgent need, for example, the user asks the Internet webpage of visiting deleted or connected when losing efficacy, and can check web page contents by the snapshots of web pages in the visit native system buffering.Though the information in the snapshots of web pages may not be up-to-date, the data of searching in snapshots of web pages is than faster in real web pages, and it is to be kept on the high performance server of setting of the present invention website after all.
Image processing techniques: advanced image fast processing system, mainly comprise picture lightweight technology, by advanced compress technique, guarantee the photo resolution height, picture weight is little; Picture real-time technique, timesharing transmit picture (continuity for the page is downloaded also adopts continuous snapshot mode to snapshot, guarantees the real-time that the user browses); (level of detail LOD), based on the resolution of customer mobile terminal, sets up snapshots of web pages picture buffering to the level of detail technology, along with the view procedure of user's detailed theme, selects optimal LOD picture on buffer server.

Claims (2)

1. a Web page intelligent snapping system is characterized in that, comprising:
One webpage cutting module can filter the interfere information of the Internet; Its input connects the Internet;
One snapshots of web pages module, its input connect described webpage cutting module;
One image processing module, its input connect described snapshots of web pages module;
One intelligent control module is used to carry out user instruction, the control snapshot processes; Connect the Internet, webpage cutting module, snapshots of web pages module, image processing module and terminal use respectively.
2. the operation method of the described Web page intelligent snapping system of claim 1 is characterized in that, the step of method comprises:
1) imports webpage into, import webpage into by the Internet;
2) webpage cutting is rationally classified and is purified the page, judges that according to keyword the webpage relevant with subject content is sent to 3); The interfere information irrelevant with subject content is sent to recycle bin;
3) snapshots of web pages is clapped into snapshot to webpage;
4) image processing is compressed snapshots of web pages the back layering, is real-time transmitted to wireless terminal user.
CNA2006101417366A 2006-08-23 2006-09-28 Web page intelligent snapping system and method thereof Pending CN101132446A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2006101417366A CN101132446A (en) 2006-08-23 2006-09-28 Web page intelligent snapping system and method thereof

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN200610030305.2 2006-08-23
CN200610030305 2006-08-23
CN200610116049.9 2006-09-14
CNA2006101417366A CN101132446A (en) 2006-08-23 2006-09-28 Web page intelligent snapping system and method thereof

Publications (1)

Publication Number Publication Date
CN101132446A true CN101132446A (en) 2008-02-27

Family

ID=39129564

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2006101417366A Pending CN101132446A (en) 2006-08-23 2006-09-28 Web page intelligent snapping system and method thereof

Country Status (1)

Country Link
CN (1) CN101132446A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102057358A (en) * 2008-06-30 2011-05-11 赛门铁克公司 Systems and methods for tracking changes to a volume
CN101583072B (en) * 2008-05-15 2011-09-21 北京凯思昊鹏软件工程技术有限公司 Middleware product for realizing Mobile Internet and method thereof
CN111914201A (en) * 2020-08-07 2020-11-10 腾讯科技(深圳)有限公司 Network page processing method and device
CN112966041A (en) * 2021-02-02 2021-06-15 苍穹数码技术股份有限公司 Data processing method, device, equipment and storage medium

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101583072B (en) * 2008-05-15 2011-09-21 北京凯思昊鹏软件工程技术有限公司 Middleware product for realizing Mobile Internet and method thereof
CN102057358A (en) * 2008-06-30 2011-05-11 赛门铁克公司 Systems and methods for tracking changes to a volume
CN102057358B (en) * 2008-06-30 2013-09-11 赛门铁克公司 Systems and methods for tracking changes to a volume
CN111914201A (en) * 2020-08-07 2020-11-10 腾讯科技(深圳)有限公司 Network page processing method and device
CN111914201B (en) * 2020-08-07 2023-11-07 腾讯科技(深圳)有限公司 Processing method and device of network page
CN112966041A (en) * 2021-02-02 2021-06-15 苍穹数码技术股份有限公司 Data processing method, device, equipment and storage medium
CN112966041B (en) * 2021-02-02 2024-04-26 苍穹数码技术股份有限公司 Data processing method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN101583072B (en) Middleware product for realizing Mobile Internet and method thereof
CN101192227B (en) Log file analytical method and system based on distributed type computing network
CN101778168B (en) Method and system for optimization display of wed pages on browser of mobile terminal
CN102799610B (en) Method and system for collecting network information
CN101777068B (en) Web page pre-reading and integrally browsing system for mobile communication equipment terminals and application method thereof
CN103810176B (en) A kind of info web prefetches access method and device
CN100504879C (en) Dynamic web page segmentation method
CN107908694A (en) Public sentiment clustering method, application server and the computer-readable recording medium of internet news
CN104699736B (en) A kind of distributed larger scale data acquisition system and method based on movable equipment
CN102486794B (en) Method, device and system for acquiring rich-media file
CN106817391A (en) Document breakpoint transmission method and apparatus
CN110417873B (en) Network information extraction system for realizing recording webpage interactive operation
CN102117331B (en) Video search method and system
CN108829704A (en) A kind of big data distributed libray Analysis Service technology
CN104281619A (en) System and method for ordering search results
CN102523296B (en) Method, device and system for optimizing wireless webpage browsing resources
CN101132446A (en) Web page intelligent snapping system and method thereof
CN110362737A (en) Method for pushing, device and the server of recommendation
CN102611643A (en) Method and system for handling emails of mobile terminal
CN107463657A (en) File operation method and terminal
CN101008946A (en) Search method of Chinese mobile communication information and device thereof
CN102591887A (en) Network data pre-fetching method and network data pre-fetching system
CN101354706A (en) Method and apparatus for collecting web page information
CN106972977A (en) The long connection maintaining method of one kind and device
CN103294717A (en) Web page opening method and device based on double-kernel browser

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20080227