CN102915363A - Website storing method and system - Google Patents

Website storing method and system Download PDF

Info

Publication number
CN102915363A
CN102915363A CN2012103979819A CN201210397981A CN102915363A CN 102915363 A CN102915363 A CN 102915363A CN 2012103979819 A CN2012103979819 A CN 2012103979819A CN 201210397981 A CN201210397981 A CN 201210397981A CN 102915363 A CN102915363 A CN 102915363A
Authority
CN
China
Prior art keywords
web page
server
browser
code
network address
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012103979819A
Other languages
Chinese (zh)
Other versions
CN102915363B (en
Inventor
赵飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201210397981.9A priority Critical patent/CN102915363B/en
Publication of CN102915363A publication Critical patent/CN102915363A/en
Application granted granted Critical
Publication of CN102915363B publication Critical patent/CN102915363B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a website storing method and system. The website storing system comprises a storing request reception module located at a browser, a storing judgment module located at the browser, an updating judgment module, a storing request sending module located at the browser, a website code obtaining module located at a server, and a website snapshot obtaining module located at the server, wherein the storing judgment module located at the browser comprises an address identification code obtaining sub-module which is located at the browser and is suitable for a website corresponding to obtain a corresponding address identification code based on the website storing request, and an identification code judgment sub-module which is located at the browser and is suitable for judging if the address identification code is a stored address identification code; if the address identification code is the stored address identification code, the address corresponding to the storing request is store; and if the address identification code is not the stored address identification code, the address corresponding to the storing request is not stored. The website storing method and system, disclosed by the invention, have an advantage of guaranteeing a user to browse webpage content when the stored webpage is inaccessible. The website storing method and system, disclosed by the invention, have an advantage of guaranteeing a user to browse webpage content when the stored webpage is inaccessible.

Description

The web site collection method and system
Technical field
The present invention relates to the internet access technical field, be specifically related to a kind of web site collection method, and a kind of web site collection system.
Background technology
The user is when using the browser browsing page, can record and keep that like or commonly used network address by the favorite function that browser carries, when thinking afterwards again to browse the network address that some have collected, only need to directly open collection, click the title of network address in collection, be addressable network address, and need not again input network address or search for corresponding content.
Yet; because the webpage on the Internet is not unalterable; but constantly increase, delete, change; for example: number of site is because such or such former thereby close, and number of site can redirect to other chained addresses, therefore; along with the time of online is more and more longer; originally being collected in the network address in the collection, possibly can't access after a period of time, namely network address often can face the problem of inefficacy in the collection.In this case, the user has to again search corresponding network address or the relevant content of search, and the user experiences non-constant, and has increased the resource cost of client and server.
Therefore, those skilled in the art's technical matters in the urgent need to address is: a kind of mechanism that webpage generates of collecting is provided, and can browsed web content when the webpage of collection can't be accessed to guarantee the user.
Summary of the invention
In view of the above problems, the present invention has been proposed in order to a kind of overcome the problems referred to above or a kind of web site collection method that addresses the above problem at least in part and corresponding a kind of web site collection system are provided.
According to one aspect of the present invention, a kind of web site collection method is provided, comprising:
Browser receives the web site collection request;
Browser obtains corresponding address identifier code according to network address corresponding to described web site collection request;
Browser judges whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request;
When network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
In not collection of network address corresponding to described web site collection request, perhaps, when web page contents corresponding to described web site collection request need to upgrade, browser was sent to server with described web site collection request;
Server is according to the web page code of the corresponding network address of described web site collection acquisition request;
Server is preserved described web page code, forms snapshots of web pages.
Alternatively, described method also comprises:
When server is preserved described web page code when unsuccessful, the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages.
Alternatively, describedly preserve described web page code when unsuccessful when server, the step that the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages comprises:
The server notification browser is uploaded the web page code of described corresponding network address;
Browser obtains web page code and described web page code is uploaded onto the server;
Server is preserved described web page code, forms snapshots of web pages.
Alternatively, described server is preserved described web page code, and the step that forms snapshots of web pages comprises:
Server is preserved described web page code;
Server generates information update time of described web page code;
Server is with described web page code, update time information and address identifier code generating web page snapshot.
According to a further aspect in the invention, provide a kind of web site collection system, having comprised:
Be positioned at the collection request receiving module of browser, be suitable for receiving the web site collection request;
Be positioned at the collection judge module of browser, be suitable for judging whether network address corresponding to described web site collection request collects;
Upgrade judge module, be suitable for when network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
Be positioned at the collection request sending module of browser, be suitable for perhaps, when web page contents corresponding to described web site collection request need to upgrade, described web site collection request being sent to server in not collection of network address corresponding to described web site collection request;
Be positioned at the web page code acquisition module of server, be suitable for the web page code according to the corresponding network address of described web site collection acquisition request;
Be positioned at the snapshots of web pages acquisition module of server, be suitable for preserving described web page code, form snapshots of web pages;
Wherein, the described collection judge module that is positioned at browser comprises:
The address identifier code that is positioned at browser is obtained submodule, is suitable for obtaining corresponding address identifier code according to network address corresponding to described web site collection request;
The identification code that is positioned at browser is judged submodule, is suitable for judging whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request.
Alternatively, described system also comprises:
Be positioned at transmission module on the web page code of browser, be suitable for preserving described web page code when unsuccessful when server, the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages.
Alternatively, transmission module comprises on the described web page code that is positioned at browser:
Be positioned at the notice submodule of server, be suitable for notifying browser to upload the web page code of described corresponding network address;
The web page code that is positioned at browser obtains submodule, is suitable for obtaining web page code and described web page code is uploaded onto the server;
The web page code that is positioned at server is preserved submodule, is suitable for preserving described web page code, forms snapshots of web pages.
Alternatively, the described snapshots of web pages acquisition module that is positioned at server comprises:
Be positioned at the preservation submodule of server, be suitable for preserving described web page code;
Be positioned at server update temporal information submodule, be suitable for generating information update time of described web page code;
The snapshots of web pages that is positioned at server generates submodule, is suitable for server with described web page code, update time information and address identifier code generating web page snapshot.
A kind of web site collection method according to the present invention can provide a kind of mechanism that webpage generates of collecting, having solved thus that problem that the network address of collecting in the growth collection along with the time can't lose efficacy obtained when the network address in the collection lost efficacy still can the described network address web page contents of normal browsing, saves user resources and improves the beneficial effect that the user experiences.
Above-mentioned explanation only is the general introduction of technical solution of the present invention, for can clearer understanding technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Description of drawings
By reading hereinafter detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skills.Accompanying drawing only is used for the purpose of preferred implementation is shown, and does not think limitation of the present invention.And in whole accompanying drawing, represent identical parts with identical reference symbol.In the accompanying drawings:
Fig. 1 shows a kind of according to an embodiment of the invention flow chart of steps of web site collection embodiment of the method 1;
Fig. 2 shows a kind of according to an embodiment of the invention flow chart of steps of web site collection embodiment of the method 2;
Fig. 3 shows a kind of according to an embodiment of the invention structured flowchart of web site collection system embodiment 1;
Fig. 4 shows a kind of according to an embodiment of the invention structured flowchart of web site collection system embodiment 2.
Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in the accompanying drawing, yet should be appreciated that and to realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order to understand the disclosure more thoroughly that these embodiment are provided, and can with the scope of the present disclosure complete convey to those skilled in the art.
One of core idea of the embodiment of the invention is, when the user need to collect the website, judge whether network address corresponding to described website has been collected and judged whether web page contents corresponding to described website needs to upgrade, when network address corresponding to described network address do not have collection or web site contents corresponding to described network address to upgrade, when browser was collected described website, announcement server generated the snapshots of web pages of described website.
With reference to Fig. 1, show a kind of according to an embodiment of the invention flow chart of steps of web site collection embodiment of the method 1, specifically can may further comprise the steps:
Step 101: browser receives the web site collection request;
The user is when using the browser browsing page, can record and keep that like or commonly used network address by the favorite function that browser carries, when thinking afterwards again to browse the network address that some have collected, only need to directly open collection, click the title of network address in collection, be addressable network address, and need not again input network address or collect corresponding content.When the user need to collect network address, can be by clicking the collection request of the network address that function notice browsers such as " adding collection " need to collect.
Step 102: browser obtains corresponding address identifier code according to network address corresponding to described web site collection request;
After browser receives user's web site collection request, judge first whether network address corresponding to this web site collection request collects.
Wherein, described address identifier code, can be carried out MD5 to network address corresponding to collection request and calculate corresponding address identifier code as a kind of preferred exemplary of the present embodiment for the unique identification code of address corresponding to this collection request.
MD5 (Message-Digest Algorithm5, Message Digest Algorithm 5) is the widely used a kind of hash function of computer safety field, guarantees that with being used for communication is complete consistent in order to the integrity protection that gives information.MD5 is one of widely used hash algorithm of computing machine, and the basic principle of hash algorithm is that data (such as Chinese character) computing is another fixed-length value, the effect of MD5 is the input message of accepting random length, produces " fingerprint " or " message digest " of a 128-bit for input.At present generally existing MD5 realization of main flow programming language.
It is that a segment information (Message) is produced informative abstract (Message-Digest) that the typical case of MD5 uses, and is tampered preventing.To the concise and to the point narration of MD5 algorithm can for: MD5 processes the information of input with 512 groupings, and each grouping is divided into again 16 32 seats groupings, after having passed through a series of processing, the output of algorithm is comprised of four 32 groupings, will will generate 128 hashed values after these four 32 packet concatenation.
In the MD5 algorithm, at first need information is filled, make its long result to 512 complementations equal 448.Therefore, the position long (Bits Length) of information will be expanded to N*512+448, and N is a nonnegative integer, and N can be zero.The method of filling is as follows, fills one 1 and numerous 0 in the back of information, until just stop the filling with 0 pair of information during the condition above satisfying.Then, this as a result the back additional one with the filling of 64 binary representations before message length.Through the processing in this two steps, position long=N*512+448+64=(N+1) * 512 of present information, namely length is 512 integral multiple just, the reason of doing like this is for satisfying in the later process requirement to message length.
Four 32 numeric parameters that are known as link variable (Chaining Variable) are arranged among the MD5, and they are respectively: A=0x67452301, B=0xefcdab89, C=0x98badcfe, D=0x10325476.After setting these four link variables, just begin to enter the four-wheel loop computation of algorithm, the number of times of circulation is the number of 512 information block in the information.Top four link variables are copied in other four variablees: A is to a, and B is to b, and C is to c, and D is to d.Major cycle has four-wheel, and every to take turns circulation all very similar.The first round is carried out 16 operations, and each operation is to the nonlinear function computing of wherein three dos among a, b, c and the d; Then acquired results is added the 4th variable, a subgroup and a constant of text; Again acquired results is encircled left and move an indefinite number, and one of add among a, b, c or the d; One of replace among a, b, c or the d with this result at last.
For example: network address is carried out MD5 calculate address identifier code and can be expressed as
md5(‘http://www.1.cn’)
The address identifier code that obtains is: 3bc192b291fc8271
Need to prove, using the MD5 algorithm to calculate address identifier code corresponding to collection request among the present invention only is a kind of example, and those skilled in the art can adopt other algorithms or means to realize all being fine, and the present invention is not restricted this.
Step 103: judge whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request.
When using the collection collection network address of browser, browser will get access to described collection network address address identifier code and the form of described address identifier code with file be stored in the local cache, when browser receives the collection request, at first can calculate the address identifier code of the corresponding network address of collection request, then the address identifier code that calculates and the address identifier code in the local cache are compared, if described address identifier code is present in the local cache, then being judged as network address corresponding to described collection request collects, otherwise, then be judged as the not collection of network address corresponding to collection request.
Step 104: when network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
As a kind of preferred exemplary of the present embodiment, during browser collection folder collection network address, network address, subscriber set code, web site contents checking string and the check code of the access of described network address is stored in the local cache.Wherein, the subscriber set code refers to a string sequence number of hardware sequence number through a series of encryptions, hash formation, is the unique identification code of computer software and hardware Information generation of installing according to user software in user's registration; Web site contents checking string refers to the website code is carried out a string code of obtaining behind the hash algorithm; Check code is exactly that the network address of described access, subscriber set code, key are carried out a string code of drawing behind the MD5 algorithm, is mainly used in preventing that malice from submitting to,
For example: the calculating of check code can be expressed as md5 (http://www.1.cn+B32CD241A2F27+XXAAWWFssf)
When in step 103, being judged as network address corresponding to described collection request and having collected, can navigate to network address position corresponding to address identifier code described in the collection according to described address identifier code, and then obtain the contents such as network address, subscriber set code, web site contents checking string and check code of the access of described network address.
A kind of preferred exemplary as the present embodiment, network address, subscriber set code, web site contents checking string and the check code of described access can be submitted to a default web site contents checking interface and judge whether web page contents corresponding to described web site collection request upgrades, and its process can be expressed as follows:
(1) browser is submitted to a default web site contents checking interface with network address, subscriber set code, web site contents checking string and the check code of described access
(2) whether the more described web site contents checking string that receives of default web site contents checking interface is consistent with the web site contents checking string in being stored in web site contents checking interface server, if, then described web site contents checking interface is judged as described web site contents does not need to upgrade, if not, then described web site contents verifies that interface is judged as described web site contents and needs to upgrade.
(3) web site contents checking interface is notified browser with judged result, and when web site contents checking interface was judged as described web site contents and does not need to upgrade, browser is judged as described web site contents not to be needed to upgrade; When web site contents checking interface was judged as described web site contents and need to upgrades, browser is judged as described web site contents to be needed to upgrade.
Step 105: in not collection of network address corresponding to described web site collection request, perhaps, when web page contents corresponding to described web site collection request need to upgrade, browser was sent to server with described web site collection request;
Step 106: server is according to the web page code of the corresponding network address of described web site collection acquisition request;
Web page code need just to refer to some special " language " of using in Web Page Design, then the designer is carried out being only the effect that we finally see after " translation " to code by browser by webpage is produced in these " language " tissue layouts.Code commonly used during present Web-Designing has HTML, JavaScript, and ASP, PHP, CGI etc., wherein HTML is most basic web page code.
Server is resolved network address corresponding to described web site collection request after receiving the web site collection request of browser transmission, obtains web page code corresponding to described network address in resolving.The benefit of obtaining web page code with server is to save like this user's surfing flow, minimally consumes user bandwidth.
Step 107: server is preserved described web page code, forms snapshots of web pages.
Snapshots of web pages, English name be Web Cache, the webpage buffer memory.Search engine is when webpage, webpage is backed up, exist in the server buffer of oneself, when the user clicks " snapshots of web pages " link in search engine, the Web page content revealing that search engine grasped and preserved at that time Spider (spider) system out is called " snapshots of web pages ".Snapshots of web pages generally is to be combined with search engine, and the webpage buffer memory that search engine keeps can only work when search, and can't combine with the collection of browser.And in the collection that the present invention can be applied in snapshots of web pages with browser is combined.In specific implementation, snapshots of web pages is at some web page codes that are presented as of server side.
In a preferred embodiment of the present invention, described step 107 can comprise following substep:
Substep S21: server is preserved described web page code;
Substep S22: server generates information update time of described web page code;
Substep S23: server is with described web page code, update time information and address identifier code generating web page snapshot.
In specific implementation, when needs were used snapshots of web pages, server can the snapshots of web pages that update time is the most front send to browser.
With reference to Fig. 2, show a kind of according to an embodiment of the invention flow chart of steps of web site collection embodiment of the method 2, specifically can may further comprise the steps:
Step 201: browser receives the web site collection request;
Step 202: browser obtains corresponding address identifier code according to network address corresponding to described web site collection request;
Step 203: judge whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request.
Step 204: when network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
As a kind of preferred exemplary of the present embodiment, during browser collection folder collection network address, network address, subscriber set code, web site contents checking string and the check code of the access of described network address is stored in the local cache.Wherein, the subscriber set code refers to a string sequence number of hardware sequence number through a series of encryptions, hash formation, is the unique identification code of computer software and hardware Information generation of installing according to user software in user's registration; Web site contents checking string refers to the website code is carried out a string code of obtaining behind the hash algorithm; Check code is exactly that the network address of described access, subscriber set code, key are carried out a string code of drawing behind the MD5 algorithm, is mainly used in preventing that malice from submitting to,
For example: the calculating of check code can be expressed as md5 (http://www.1.cn+B32CD241A2F27+XXAAWWFssf)
When in step 203, being judged as network address corresponding to described collection request and having collected, can navigate to network address position corresponding to address identifier code described in the collection according to described address identifier code, and then obtain the contents such as network address, subscriber set code, web site contents checking string and check code of the access of described network address.
As a kind of preferred exemplary of the present embodiment, network address, subscriber set code, web site contents checking string and the check code of described access can be submitted to a default web site contents checking interface and judge whether web page contents corresponding to described web site collection request upgrades.
Step 205: in not collection of network address corresponding to described web site collection request, perhaps, when web page contents corresponding to described web site collection request need to upgrade, browser was sent to server with described web site collection request;
Step 206: server is according to the web page code of the corresponding network address of described web site collection acquisition request;
Step 207: server is preserved described web page code, forms snapshots of web pages.
In a preferred embodiment of the present invention, described step 207 can comprise following substep:
Substep S41: server is preserved described web page code;
Substep S42: server generates information update time of described web page code;
Substep S43: server is with described web page code, update time information and address identifier code generating web page snapshot.
Step 208: when server is preserved described web page code when unsuccessful, the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages.
It can be that number of site is maliciously usurped by other people in order to prevent own content that a kind of server is preserved the unsuccessful situation of web page code, can do some restrict access at own server, for example limit other machines to its access frequency, server just can not directly be preserved web page code like this, in specific implementation, web site contents checking string in described web site contents checking string and the preservation check of the presetting interface can be compared and judge whether server preservation web page code is successful, server is preserved the web page code success if described web site contents checking string is present in the default preservation check interface, otherwise it is unsuccessful that server is preserved code.Those skilled in the art adopt other modes all to be fine, and the present invention is not restricted this.
In a preferred embodiment of the present invention, described step 208 can comprise following substep:
S51: the server notification browser is uploaded the web page code of described corresponding network address;
S52: browser obtains web page code and described web page code is uploaded onto the server;
S53: server is preserved described web page code, forms snapshots of web pages.
In specific implementation, snapshots of web pages is at some web page codes that are presented as of server side, and described web page code can directly be obtained when resolving the request message of browser by server; In another aspect of this invention, described web page code also can obtain when the response message of browser resolves server, then web page code is uploaded onto the server.The benefit of obtaining web page code with server is to save user's surfing flow, minimally consume user bandwidth, when server is preserved the web page code failure, can notify browser to obtain web page code uploads, server is preserved described web page code again, can adopt the mode of compressed code that described web page code is uploaded when browser is uploaded described web page code, so also can reduce the roam of uploading, reduce bandwidth.
For the embodiment of Fig. 2 because itself and the embodiment of the method basic simlarity of Fig. 1, so describe fairly simple, relevant part gets final product referring to the part explanation of embodiment of the method.
Need to prove, for embodiment of the method, for simple description, therefore it all is expressed as a series of combination of actions, but those skilled in the art should know, the present invention is not subjected to the restriction of described sequence of movement, because according to the present invention, some step can adopt other orders or carry out simultaneously.Secondly, those skilled in the art also should know, the embodiment described in the instructions all belongs to preferred embodiment, and related action and module might not be that the present invention is necessary.
Show a kind of according to an embodiment of the invention web site collection system embodiment 1 block diagram with reference to Fig. 3, specifically can comprise with lower module:
Be positioned at the collection request receiving module 301 of browser, be suitable for receiving the web site collection request;
Be positioned at the collection judge module 302 of browser, be suitable for judging whether network address corresponding to described web site collection request collects;
In a preferred embodiment of the present invention, the described collection judge module 302 that is positioned at browser can comprise following submodule:
The address identifier code that is positioned at browser is obtained submodule, is suitable for obtaining corresponding address identifier code according to network address corresponding to described web site collection request;
The identification code that is positioned at browser is judged submodule, is suitable for judging whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request.
Upgrade judge module 303, be suitable for when network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
Be positioned at the collection request sending module 304 of browser, be suitable for perhaps, when web page contents corresponding to described web site collection request need to upgrade, described web site collection request being sent to server in not collection of network address corresponding to described web site collection request;
Be positioned at the web page code acquisition module 305 of server, be suitable for the web page code according to the corresponding network address of described web site collection acquisition request;
Be positioned at the snapshots of web pages acquisition module 306 of server, be suitable for preserving described web page code, form snapshots of web pages.
In a preferred embodiment of the present invention, the described snapshots of web pages acquisition module 306 that is positioned at server can comprise:
Be positioned at the preservation submodule of server, be suitable for preserving described web page code;
Be positioned at server update temporal information submodule, be suitable for generating information update time of described web page code;
The snapshots of web pages that is positioned at server generates submodule, is suitable for server with described web page code, update time information and address identifier code generating web page snapshot.
For Fig. 3 embodiment because itself and the embodiment of the method basic simlarity of Fig. 1, so describe fairly simple, relevant part gets final product referring to the part explanation of embodiment of the method
Show a kind of according to an embodiment of the invention structured flowchart of web site collection system embodiment with reference to Fig. 4, specifically can comprise with lower module:
Be positioned at the collection request receiving module 401 of browser, be suitable for receiving the web site collection request;
Be positioned at the collection judge module 402 of browser, be suitable for judging whether network address corresponding to described web site collection request collects;
In a preferred embodiment of the present invention, the described collection judge module 402 that is positioned at browser can comprise following submodule:
The address identifier code that is positioned at browser is obtained submodule, is suitable for obtaining corresponding address identifier code according to network address corresponding to described web site collection request;
The identification code that is positioned at browser is judged submodule, is suitable for judging whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request.
Upgrade judge module 403, be suitable for when network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
Be positioned at the collection request sending module 404 of browser, be suitable for perhaps, when web page contents corresponding to described web site collection request need to upgrade, described web site collection request being sent to server in not collection of network address corresponding to described web site collection request;
Be positioned at the web page code acquisition module 405 of server, be suitable for the web page code according to the corresponding network address of described web site collection acquisition request;
Be positioned at the snapshots of web pages acquisition module 406 of server, be suitable for preserving described web page code, form snapshots of web pages.
In a preferred embodiment of the present invention, the described snapshots of web pages acquisition module 406 that is positioned at server can comprise following submodule:
Be positioned at the preservation submodule of server, be suitable for preserving described web page code;
Be positioned at server update temporal information submodule, be suitable for generating information update time of described web page code;
The snapshots of web pages that is positioned at server generates submodule, is suitable for server with described web page code, update time information and address identifier code generating web page snapshot.
Be positioned at transmission module 407 on the web page code of browser: server is preserved described web page code when unsuccessful, and the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages.
In a preferred embodiment of the present invention, transmission module 407 can comprise following submodule on the described web page code that is positioned at browser:
Be positioned at the notice submodule of server, be suitable for notifying browser to upload the web page code of described corresponding network address;
The web page code that is positioned at browser obtains submodule, is suitable for obtaining web page code and described web page code is uploaded onto the server;
The web page code that is positioned at server is preserved submodule, is suitable for preserving described web page code, forms snapshots of web pages.
For Fig. 4 embodiment because itself and the embodiment of the method basic simlarity of Fig. 1 and Fig. 2, so describe fairly simple, relevant part gets final product referring to the part explanation of embodiment of the method
Intrinsic not relevant with any certain computer, virtual system or miscellaneous equipment with demonstration at this algorithm that provides.Various general-purpose systems also can be with using based on the teaching at this.According to top description, it is apparent constructing the desired structure of this type systematic.In addition, the present invention is not also for any certain programmed language.Should be understood that and to utilize various programming languages to realize content of the present invention described here, and the top description that language-specific is done is in order to disclose preferred forms of the present invention.
In the instructions that provides herein, a large amount of details have been described.Yet, can understand, embodiments of the invention can be in the situation that there be these details to put into practice.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the description to exemplary embodiment of the present invention, each feature of the present invention is grouped together in single embodiment, figure or the description to it sometimes in the above.Yet the method for the disclosure should be construed to the following intention of reflection: namely the present invention for required protection requires the more feature of feature clearly put down in writing than institute in each claim.Or rather, as following claims reflected, inventive aspect was to be less than all features of the disclosed single embodiment in front.Therefore, follow claims of embodiment and incorporate clearly thus this embodiment into, wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can adaptively change and they are arranged in one or more equipment different from this embodiment the module in the equipment among the embodiment.Can be combined into a module or unit or assembly to the module among the embodiment or unit or assembly, and can be divided into a plurality of submodules or subelement or sub-component to them in addition.In such feature and/or process or unit at least some are mutually repelling, and can adopt any combination to disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and so all processes or the unit of disclosed any method or equipment make up.Unless in addition clearly statement, disclosed each feature can be by providing identical, being equal to or the alternative features of similar purpose replaces in this instructions (comprising claim, summary and the accompanying drawing followed).
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature rather than further feature included among other embodiment, the combination of the feature of different embodiment means and is within the scope of the present invention and forms different embodiment.For example, in the following claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, perhaps realizes with the software module of moving at one or more processor, and perhaps the combination with them realizes.It will be understood by those of skill in the art that and to use in practice microprocessor or digital signal processor (DSP) to realize according to some or all some or repertoire of parts in the collection content generation equipment of the embodiment of the invention.The present invention can also be embodied as be used to part or all equipment or the device program (for example, computer program and computer program) of carrying out method as described herein.Such realization program of the present invention can be stored on the computer-readable medium, perhaps can have the form of one or more signal.Such signal can be downloaded from internet website and obtain, and perhaps provides at carrier signal, perhaps provides with any other form.
It should be noted above-described embodiment the present invention will be described rather than limit the invention, and those skilled in the art can design alternative embodiment in the situation of the scope that does not break away from claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed in element or step in the claim.Being positioned at word " " before the element or " one " does not get rid of and has a plurality of such elements.The present invention can realize by means of the hardware that includes some different elements and by means of the computing machine of suitably programming.In having enumerated the unit claim of some devices, several in these devices can be to come imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title with these word explanations.

Claims (8)

1. web site collection method comprises:
Browser receives the web site collection request;
Browser obtains corresponding address identifier code according to network address corresponding to described web site collection request;
Browser judges whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request;
When network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
In not collection of network address corresponding to described web site collection request, perhaps, when web page contents corresponding to described web site collection request need to upgrade, browser was sent to server with described web site collection request;
Server is according to the web page code of the corresponding network address of described web site collection acquisition request;
Server is preserved described web page code, forms snapshots of web pages.
2. the method for claim 1 also comprises:
When server is preserved described web page code when unsuccessful, the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages.
3. method as claimed in claim 2 is describedly preserved described web page code when unsuccessful when server, and the step that the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages comprises:
The server notification browser is uploaded the web page code of described corresponding network address;
Browser obtains web page code and described web page code is uploaded onto the server;
Server is preserved described web page code, forms snapshots of web pages.
4. such as each described method in the claims 1 to 3, described server is preserved described web page code, and the step that forms snapshots of web pages comprises:
Server is preserved described web page code;
Server generates information update time of described web page code;
Server is with described web page code, update time information and address identifier code generating web page snapshot.
5. web site collection system comprises:
Be positioned at the collection request receiving module of browser, be suitable for receiving the web site collection request;
Be positioned at the collection judge module of browser, be suitable for judging whether network address corresponding to described web site collection request collects;
Upgrade judge module, be suitable for when network address corresponding to described web site collection request collected, judge whether web page contents corresponding to described web site collection request upgrades;
Be positioned at the collection request sending module of browser, be suitable for perhaps, when web page contents corresponding to described web site collection request need to upgrade, described web site collection request being sent to server in not collection of network address corresponding to described web site collection request;
Be positioned at the web page code acquisition module of server, be suitable for the web page code according to the corresponding network address of described web site collection acquisition request;
Be positioned at the snapshots of web pages acquisition module of server, be suitable for preserving described web page code, form snapshots of web pages;
Wherein, the described collection judge module that is positioned at browser comprises:
The address identifier code that is positioned at browser is obtained submodule, is suitable for obtaining corresponding address identifier code according to network address corresponding to described web site collection request;
The identification code that is positioned at browser is judged submodule, is suitable for judging whether described address identifier code is the address identifier code of storage, if then network address corresponding to described collection request collected; If not, then not collection of network address corresponding to described collection request.
6. system as claimed in claim 5 also comprises:
Be positioned at transmission module on the web page code of browser, be suitable for preserving described web page code when unsuccessful when server, the web page code that the notice browser is uploaded described corresponding network address forms snapshots of web pages.
7. system as claimed in claim 6, transmission module comprises on the described web page code that is positioned at browser:
Be positioned at the notice submodule of server, be suitable for notifying browser to upload the web page code of described corresponding network address;
The web page code that is positioned at browser obtains submodule, is suitable for obtaining web page code and described web page code is uploaded onto the server;
The web page code that is positioned at server is preserved submodule, is suitable for preserving described web page code, forms snapshots of web pages.
8. such as each described system in the claim 5 to 7, the described snapshots of web pages acquisition module that is positioned at server comprises:
Be positioned at the preservation submodule of server, be suitable for preserving described web page code;
Be positioned at server update temporal information submodule, be suitable for generating information update time of described web page code;
The snapshots of web pages that is positioned at server generates submodule, is suitable for server with described web page code, update time information and address identifier code generating web page snapshot.
CN201210397981.9A 2012-10-18 2012-10-18 Web site collection method and system Expired - Fee Related CN102915363B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210397981.9A CN102915363B (en) 2012-10-18 2012-10-18 Web site collection method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210397981.9A CN102915363B (en) 2012-10-18 2012-10-18 Web site collection method and system

Publications (2)

Publication Number Publication Date
CN102915363A true CN102915363A (en) 2013-02-06
CN102915363B CN102915363B (en) 2015-12-09

Family

ID=47613729

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210397981.9A Expired - Fee Related CN102915363B (en) 2012-10-18 2012-10-18 Web site collection method and system

Country Status (1)

Country Link
CN (1) CN102915363B (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103399876A (en) * 2013-07-11 2013-11-20 杭州瑞网广通信息技术有限公司 Distributed file system and file positioning method thereof
CN103645968A (en) * 2013-12-02 2014-03-19 北京奇虎科技有限公司 Browser status restoration method and device
CN104580433A (en) * 2014-12-26 2015-04-29 北京奇虎科技有限公司 Method and device for calling favorite data
CN104601671A (en) * 2014-12-26 2015-05-06 北京奇虎科技有限公司 Favorite data storing and obtaining method and device of mobile terminal
CN104679862A (en) * 2015-02-28 2015-06-03 百度在线网络技术(北京)有限公司 Code processing method and device used for webpage
CN106372251A (en) * 2016-09-28 2017-02-01 北京京东尚科信息技术有限公司 Method and device for returning to page display position
CN106557584A (en) * 2016-11-29 2017-04-05 青岛海信移动通信技术股份有限公司 A kind of web site collection method and device
CN106991117A (en) * 2013-11-08 2017-07-28 北京奇虎科技有限公司 Snap processing method, snapshot display method, server, browser and system
CN103685514B (en) * 2013-12-13 2017-11-07 北京奇虎科技有限公司 The store method and browser of the page in web page storage folder
CN109189588A (en) * 2018-08-07 2019-01-11 武汉斗鱼网络科技有限公司 A kind of browser function implementation method, device, terminal and storage medium
CN103761310B (en) * 2014-01-23 2019-04-12 贝壳网际(北京)安全技术有限公司 A kind of web page access record processing method, device and browser
CN110909271A (en) * 2019-11-11 2020-03-24 青岛海信移动通信技术股份有限公司 Communication terminal, server interconnected with communication terminal and control method
CN111428105A (en) * 2020-03-05 2020-07-17 广东睿江云计算股份有限公司 Webpage bookmark management method and system based on crawler cache

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030126560A1 (en) * 2001-12-28 2003-07-03 Koninklijke Philips Electronics N.V. Adaptive bookmarking of often-visited web sites
CN1912869A (en) * 2005-08-11 2007-02-14 腾讯科技(深圳)有限公司 Implementing method of network profile
CN101178736A (en) * 2007-12-11 2008-05-14 腾讯科技(深圳)有限公司 Web page collecting method and web page collecting server
CN101291367A (en) * 2008-05-22 2008-10-22 德信无线通讯科技(北京)有限公司 Browser bookmark displaying method of mobile communication terminal, and mobile communication terminal thereof
CN101382954A (en) * 2008-09-25 2009-03-11 北京搜狗科技发展有限公司 Method and system for providing web site collection name
CN102111267A (en) * 2009-12-28 2011-06-29 北京安码科技有限公司 Website safety protection method based on digital signature and system adopting same
CN102724192A (en) * 2012-06-14 2012-10-10 北京奇乐客科技有限公司 Collaborative editing based network collection method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030126560A1 (en) * 2001-12-28 2003-07-03 Koninklijke Philips Electronics N.V. Adaptive bookmarking of often-visited web sites
CN1912869A (en) * 2005-08-11 2007-02-14 腾讯科技(深圳)有限公司 Implementing method of network profile
CN101178736A (en) * 2007-12-11 2008-05-14 腾讯科技(深圳)有限公司 Web page collecting method and web page collecting server
CN101291367A (en) * 2008-05-22 2008-10-22 德信无线通讯科技(北京)有限公司 Browser bookmark displaying method of mobile communication terminal, and mobile communication terminal thereof
CN101382954A (en) * 2008-09-25 2009-03-11 北京搜狗科技发展有限公司 Method and system for providing web site collection name
CN102111267A (en) * 2009-12-28 2011-06-29 北京安码科技有限公司 Website safety protection method based on digital signature and system adopting same
CN102724192A (en) * 2012-06-14 2012-10-10 北京奇乐客科技有限公司 Collaborative editing based network collection method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
小痛: "百度收藏 让我的网络收藏更实在", 《电脑迷》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103399876A (en) * 2013-07-11 2013-11-20 杭州瑞网广通信息技术有限公司 Distributed file system and file positioning method thereof
CN106991117B (en) * 2013-11-08 2020-08-14 北京奇虎科技有限公司 Snapshot processing method, snapshot display method, server, browser and system
CN106991117A (en) * 2013-11-08 2017-07-28 北京奇虎科技有限公司 Snap processing method, snapshot display method, server, browser and system
CN103645968B (en) * 2013-12-02 2017-03-15 北京奇虎科技有限公司 A kind of browser status restored method and device
CN103645968A (en) * 2013-12-02 2014-03-19 北京奇虎科技有限公司 Browser status restoration method and device
CN103685514B (en) * 2013-12-13 2017-11-07 北京奇虎科技有限公司 The store method and browser of the page in web page storage folder
CN103761310B (en) * 2014-01-23 2019-04-12 贝壳网际(北京)安全技术有限公司 A kind of web page access record processing method, device and browser
CN104580433B (en) * 2014-12-26 2019-02-26 北京奇虎科技有限公司 Favorites data transfers method and apparatus
CN104601671A (en) * 2014-12-26 2015-05-06 北京奇虎科技有限公司 Favorite data storing and obtaining method and device of mobile terminal
CN104580433A (en) * 2014-12-26 2015-04-29 北京奇虎科技有限公司 Method and device for calling favorite data
CN104679862A (en) * 2015-02-28 2015-06-03 百度在线网络技术(北京)有限公司 Code processing method and device used for webpage
CN106372251A (en) * 2016-09-28 2017-02-01 北京京东尚科信息技术有限公司 Method and device for returning to page display position
CN106372251B (en) * 2016-09-28 2020-03-03 北京京东尚科信息技术有限公司 Method and device for returning page display position
CN106557584A (en) * 2016-11-29 2017-04-05 青岛海信移动通信技术股份有限公司 A kind of web site collection method and device
CN109189588A (en) * 2018-08-07 2019-01-11 武汉斗鱼网络科技有限公司 A kind of browser function implementation method, device, terminal and storage medium
CN109189588B (en) * 2018-08-07 2020-12-15 武汉斗鱼网络科技有限公司 Browser function implementation method, device, terminal and storage medium
CN110909271A (en) * 2019-11-11 2020-03-24 青岛海信移动通信技术股份有限公司 Communication terminal, server interconnected with communication terminal and control method
CN111428105A (en) * 2020-03-05 2020-07-17 广东睿江云计算股份有限公司 Webpage bookmark management method and system based on crawler cache

Also Published As

Publication number Publication date
CN102915363B (en) 2015-12-09

Similar Documents

Publication Publication Date Title
CN102915363B (en) Web site collection method and system
CN102833258B (en) Network address access method and system
CN112073405B (en) Webpage data loading method and device, computer equipment and storage medium
US10015226B2 (en) Methods for making AJAX web applications bookmarkable and crawlable and devices thereof
US9690568B2 (en) Client-side script bundle management system
JP5420087B2 (en) Method and system for providing a message including a universal resource locator
CN102970284B (en) User profile processing method and server
CN107181779B (en) Method, device and system for processing access request
CN104063460A (en) Method and device for loading webpage in browser
CN103002010A (en) Method, device and system for updating data based on incremental data
CN102929984A (en) Website failure searching method and device
CN106126693A (en) The sending method of the related data of a kind of webpage and device
CN105354337A (en) Web crawler implementation method and web crawler system
CN105471665A (en) Website function testing method, website function testing system and website server
CN103810268A (en) Search result recommendation information loading method, device and system and URL detection method, device and system
CN103324713A (en) Data processing method and device in multistage server and data processing system
CN102945259A (en) Searching method and device based on favorites
CN102955907A (en) Password management method device
CN102932469A (en) Method for achieving client browser and client browser
CN102957696A (en) Data processing method and device
CN104065736A (en) URL redirection method, device, and system
CN103793508A (en) Method, device and system for loading recommend information and detecting websites
CN107819748A (en) A kind of anti-identifying code implementation method cracked and device
CN102937982A (en) Method and system for creating collection contents
CN103034692A (en) Method for sharing feedback information of webpage, client end and server

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20151209

CF01 Termination of patent right due to non-payment of annual fee