CN104899320A - Webpage repair method, terminal, server and system - Google Patents


Info

Publication number
CN104899320A
Authority
CN
China
Prior art keywords
webpage
server
terminal
snapshot
coupling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510342371.2A
Other languages
Chinese (zh)
Inventor
郭俊杰
陈庆伟
李华冈
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anyi Hengtong Beijing Technology Co Ltd
Original Assignee
Anyi Hengtong Beijing Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anyi Hengtong Beijing Technology Co Ltd filed Critical Anyi Hengtong Beijing Technology Co Ltd
Priority to CN201510342371.2A priority Critical patent/CN104899320A/en
Publication of CN104899320A publication Critical patent/CN104899320A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/955 Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9566 URL specific, e.g. using aliases, detecting broken or misspelled links
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/958 Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention discloses a webpage repair method, a terminal, a server and a system. A specific implementation of the webpage repair method includes: if the currently accessed webpage is a failed webpage, sending snapshot request information to a snapshot server so that the snapshot server searches for a matching webpage; receiving the matching webpage fed back by the snapshot server; sending the matching webpage to an intelligent server so that the intelligent server performs security filtering on it; detecting whether the filtered webpage content received from the intelligent server contains effective information; and, if it does, providing the user with an interface for repairing the webpage. In this implementation, information from the failed webpage is obtained by snapshot lookup and security filtering, which improves the browser's ability to obtain information and helps ensure that the information obtained is safe and effective.

Description

Webpage repair method, terminal, server and system
Technical field
The present application relates to the field of computer technology, specifically to the field of terminal technology, and in particular to a webpage repair method, terminal, server and system.
Background
In the prior art, when a user browses and searches with a browser, webpages often fail to display for reasons such as excessive network load, and the user cannot learn what the webpage contains. When the webpage the user wants to access has failed (for example, because its address has changed or because it contains risky content), the user likewise cannot obtain information through the original address. Existing browsers only return an error code to the user and cannot provide information related to the webpage content, which reduces the efficiency of searching and browsing with the browser.
Summary of the invention
In view of the above defects or deficiencies in the prior art, it is desirable to provide a method for repairing webpages that cannot be accessed. It is further desirable that the repaired webpage provide more effective information. To achieve one or more of these objects, the present application provides a webpage repair method, terminal, server and system.
In a first aspect, the present application provides a webpage repair method, comprising: if the currently accessed webpage is a failed webpage, sending snapshot request information to a snapshot server so that the snapshot server searches for a matching webpage; receiving the matching webpage fed back by the snapshot server; sending the matching webpage to an intelligent server so that the intelligent server performs security filtering on the matching webpage; detecting whether the filtered webpage content received from the intelligent server contains effective information; and, if the filtered webpage content received from the intelligent server contains effective information, providing the user with an interface for repairing the webpage.
In some implementations, the snapshot request information includes at least the uniform resource locator (URL) and the access time of the currently accessed webpage.
In some implementations, the webpage repair method further comprises: repairing the webpage in response to the user's operation of repairing the webpage.
In a second aspect, the present application provides a webpage repair method, comprising: receiving snapshot request information sent by a terminal; searching for a matching webpage based on the snapshot request; and sending the query result to the terminal so that the terminal sends the matching webpage to an intelligent server for security filtering.
In some implementations, the snapshot request information includes at least the uniform resource locator (URL) and the access time of the currently accessed webpage.
In some implementations, searching for a matching webpage based on the snapshot request comprises: querying the stored webpages for the webpage that has the same URL as the currently accessed webpage and whose save time is closest to the access time.
In a third aspect, the present application provides a webpage repair method, comprising: receiving a webpage sent by a terminal; performing security filtering on the webpage; and sending the filtered webpage to the terminal. Here, the webpage is the matching webpage that a snapshot server found based on the terminal's snapshot request information, and the snapshot request information is sent by the terminal in response to determining that the currently accessed webpage is a failed webpage.
In some implementations, the snapshot request information includes at least the uniform resource locator (URL) and the access time of the currently accessed webpage.
In some implementations, performing security filtering on the webpage comprises: calculating a safety value of the webpage based on a pre-trained model; determining whether the safety value exceeds a preset safety threshold; and, if it does not, filtering the webpage based on a keyword library.
In a fourth aspect, the present application provides a terminal, comprising: a first sending unit, configured to send snapshot request information to a snapshot server when the currently accessed webpage is a failed webpage, so that the snapshot server searches for a matching webpage; a receiving unit, configured to receive the matching webpage fed back by the snapshot server; a second sending unit, configured to send the matching webpage to an intelligent server so that the intelligent server performs security filtering on it; a detecting unit, configured to detect whether the filtered webpage content received from the intelligent server contains effective information; and a processing unit, configured to provide the user with an interface for repairing the webpage in response to the filtered webpage content received from the intelligent server containing effective information.
In some implementations, the snapshot request information includes at least the uniform resource locator (URL) and the access time of the currently accessed webpage.
In some implementations, the terminal further comprises a repair unit, configured to repair the webpage in response to the user's operation of repairing the webpage.
In a fifth aspect, the present application provides a server, comprising: a receiving unit, configured to receive snapshot request information sent by a terminal; a search unit, configured to search for a matching webpage based on the snapshot request; and a sending unit, configured to send the query result to the terminal so that the terminal sends the matching webpage to a server with a filtering function for security filtering.
In some implementations, the snapshot request information includes at least the uniform resource locator (URL) and the access time of the currently accessed webpage; the search unit searches for the matching webpage by querying the stored webpages for the webpage that has the same URL as the currently accessed webpage and whose save time is closest to the access time.
In a sixth aspect, the present application provides a server, comprising: a receiving unit, configured to receive a webpage sent by a terminal; a filtering unit, configured to perform security filtering on the webpage; and a sending unit, configured to send the filtered webpage to the terminal. Here, the webpage is the matching webpage found based on the terminal's snapshot request information, and the snapshot request information is sent by the terminal in response to determining that the currently accessed webpage is a failed webpage.
In some implementations, the snapshot request information includes at least the uniform resource locator (URL) and the access time of the currently accessed webpage.
In some implementations, the filtering unit performs security filtering on the webpage as follows: calculating a safety value of the webpage based on a pre-trained model; determining whether the safety value exceeds a preset safety threshold; and, if it does not, filtering the webpage based on a keyword library.
In a seventh aspect, the present application provides a webpage repair system, comprising the terminal provided in the fourth aspect, the server provided in the fifth aspect and the server provided in the sixth aspect of the present application.
The webpage repair method, terminal, server and system provided by the present application obtain the information of a failed webpage by performing snapshot lookup and security filtering on it. This improves the browser's ability to obtain information, helps ensure that the information obtained is safe and effective, and in turn improves the efficiency of searching and browsing with the browser.
Brief description of the drawings
Other features, objects and advantages of the present application will become more apparent from the following detailed description of non-limiting embodiments, read with reference to the accompanying drawings:
Fig. 1 shows an exemplary system architecture to which embodiments of the present application can be applied;
Fig. 2 shows an exemplary flowchart of a webpage repair method according to an embodiment of the present application;
Fig. 3 shows an exemplary flowchart of a webpage repair method according to another embodiment of the present application;
Fig. 4 shows an exemplary flowchart of a webpage repair method according to another embodiment of the present application;
Fig. 5 shows a schematic diagram of the effect of applying the webpage repair method in a browser;
Fig. 6 shows a schematic structural diagram of a terminal according to an embodiment of the present application;
Fig. 7 shows a schematic structural diagram of a server according to an embodiment of the present application;
Fig. 8 shows a schematic structural diagram of a server according to another embodiment of the present application;
Fig. 9 shows a schematic structural diagram of a webpage repair system according to an embodiment of the present application;
Fig. 10 shows a schematic diagram of the data interaction in a webpage repair system based on an embodiment of the present application.
Detailed description
The present application is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here only explain the relevant invention and do not limit it. It should also be noted that, for ease of description, the drawings show only the parts relevant to the invention.
It should be noted that, where no conflict arises, the embodiments of the present application and the features of those embodiments may be combined with one another. The present application is described in detail below with reference to the drawings and in conjunction with the embodiments.
Fig. 1 shows an exemplary system architecture 100 to which embodiments of the present application can be applied.
As shown in Fig. 1, the system architecture 100 may include terminal devices 101 and 102, a network 103, and servers 104 and 105. The network 103 is the medium that provides communication links between the terminal devices 101, 102 and the servers 104, 105. The network 103 may include various connection types, such as wired or wireless communication links or fiber-optic cables.
A user 110 may use the terminal devices 101, 102 to interact with the servers 104, 105 through the network 103 in order to receive messages or send instructions. A browser may be installed on the terminal devices 101, 102, and the user obtains the messages transmitted over the network 103 through the browser.
The terminal devices 101, 102 may be various electronic devices, including but not limited to personal computers, smartphones, smart watches, tablet computers and personal digital assistants.
The servers 104, 105 may store and analyze the received data and feed the processing results back to the terminal devices.
It should be understood that the numbers of terminal devices, networks and servers in Fig. 1 are merely illustrative. Any number of terminal devices, networks and servers may be provided as the implementation requires.
Referring to Fig. 2, an exemplary flowchart of a webpage repair method according to an embodiment of the present application is shown. This embodiment is described with the method applied in a browser. The browser may be installed on an electronic device with a network connection function, such as a smartphone, tablet computer, laptop computer or desktop computer.
As shown in Fig. 2, in step 201, if the currently accessed webpage is a failed webpage, snapshot request information is sent to a snapshot server so that the snapshot server searches for a matching webpage.
In this embodiment, when the user initiates a request to access a webpage, the terminal may look up the corresponding webpage on the web server according to the address in the request and present the result to the user. If the webpage is valid, the terminal finds the corresponding content on the web server and provides it to the user. If the currently accessed webpage is a failed webpage, for example because its address has changed or because the web server returns no result owing to a security risk in the webpage, the terminal may send snapshot request information to the snapshot server, requesting it to query its stored webpage snapshots for a webpage that matches the failed webpage.
In some optional implementations, the snapshot request information may include at least the uniform resource locator (URL) and the access time of the currently accessed webpage. The snapshot server may then find the webpage snapshots that match the URL and, from among them, select the snapshot whose save time is closest to the access time as the matching webpage.
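The selection rule just described (same URL, save time closest to the access time) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Snapshot` record and the `find_matching_snapshot` function are hypothetical names that do not appear in the application, and the snapshot store is modeled as a plain in-memory list.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional, Sequence


@dataclass(frozen=True)
class Snapshot:
    """Hypothetical record for one stored webpage snapshot."""
    url: str
    saved_at: datetime
    html: str


def find_matching_snapshot(
    snapshots: Sequence[Snapshot], url: str, access_time: datetime
) -> Optional[Snapshot]:
    """Return the snapshot with the same URL whose save time is closest
    to the access time, or None when no snapshot has that URL."""
    candidates = [s for s in snapshots if s.url == url]
    if not candidates:
        return None
    return min(
        candidates,
        key=lambda s: abs((s.saved_at - access_time).total_seconds()),
    )
```

Returning `None` corresponds to the case in which the snapshot server finds no matching webpage and the page cannot be repaired.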
Optionally, the snapshot request information may also include a Hypertext Transfer Protocol status code (HTTP status code). An HTTP status code is a three-digit code that indicates the state of the web server's HTTP response and falls into one of five classes: informational, success, redirection, request error and server error. Of these, redirection, request error and server error indicate that the webpage cannot be accessed through the current address. The snapshot server may then use the HTTP status code to further determine why the webpage failed and refine its search for a matching webpage accordingly. For example, if the HTTP status code returned to the browser is 403, the web server refuses to process the request to access the webpage. If the returned HTTP response includes the reason for the refusal, the snapshot server may screen the snapshots it found for the URL according to that reason and return the screened result to the terminal.
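The five status-code classes can be derived from the first digit of the code. The following Python sketch shows one way a terminal or snapshot server might apply the rule stated above, under which redirection, request error and server error all mean the page cannot be reached through the current address. The function names are hypothetical and are not taken from the application.

```python
def classify_status(code: int) -> str:
    """Map a three-digit HTTP status code to its class by leading digit."""
    if not 100 <= code <= 599:
        raise ValueError(f"not a valid HTTP status code: {code}")
    classes = {
        1: "informational",
        2: "success",
        3: "redirection",
        4: "request_error",
        5: "server_error",
    }
    return classes[code // 100]


def cannot_access_by_current_url(code: int) -> bool:
    """Apply the rule from the text: redirection, request error and
    server error mean the page cannot be accessed via the current URL."""
    return classify_status(code) in {"redirection", "request_error", "server_error"}
```

For the 403 example in the text, `classify_status(403)` falls in the request-error class, so the page would be treated as inaccessible through its current address.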
In step 202, the matching webpage fed back by the snapshot server is received.
In this embodiment, the terminal may receive, through a network device, the matching webpage found by the snapshot server. Specifically, the snapshot server may send a continuous stream of data to the terminal through the network device; after receiving the data, the terminal processes it and restores it to the matching webpage found by the snapshot server.
In some optional implementations, the terminal may examine the data returned by the snapshot server. If the data contains webpage data, the terminal extracts it and restores the webpage. If the data contains no webpage data, the snapshot server found no matching webpage, and the currently accessed webpage cannot be repaired.
In step 203, the matching webpage is sent to the intelligent server so that the intelligent server performs security filtering on it.
In this embodiment, the terminal may, according to the result fed back by the snapshot server, send a filtering request to the intelligent server in order to filter out illegal content in the webpage.
Optionally, the filtering request sent by the terminal to the intelligent server may include filtering keywords, such as "gambling" or "terrorism", so that the webpage content can be filtered based on those keywords.
After receiving the matching webpage sent by the terminal, the intelligent server may first parse the webpage content. For example, the text, picture and video content of the webpage may be extracted separately. The parsed content may then be security-filtered with a method suited to each type. For example, keyword matching may be used to find the filtering keywords contained in the text; optionally, fuzzy matching may also be used to filter words in the text that are close to a keyword. As another example, feature extraction and texture analysis may be performed on the picture content, and the pictures filtered against a preset set of illegal image features. Specifically, the extracted image features may be matched against the features in the preset set, and the matching image textures, or the pictures containing them, filtered out. The intelligent server may also judge whether a video link in the webpage is legal; if it is an illegal link, it may be filtered out of the webpage. Optionally, filtering may be done by deleting or by covering.
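The text-filtering part of this step, with its two filtering modes (covering and deleting), can be illustrated with a short Python sketch. It covers only exact, case-insensitive keyword matching; fuzzy matching and the picture and video filters are not shown. The function name `filter_text` and the use of asterisks as the covering character are assumptions made for illustration.

```python
import re
from typing import Iterable


def filter_text(text: str, keywords: Iterable[str], mode: str = "cover") -> str:
    """Filter keywords out of text by covering them with '*' or deleting them."""
    if mode not in ("cover", "delete"):
        raise ValueError("mode must be 'cover' or 'delete'")
    # Longest keywords first, so a longer keyword wins over a shorter prefix.
    words = sorted((k for k in keywords if k), key=len, reverse=True)
    if not words:
        return text
    pattern = re.compile("|".join(re.escape(w) for w in words), re.IGNORECASE)
    if mode == "delete":
        return pattern.sub("", text)
    return pattern.sub(lambda m: "*" * len(m.group(0)), text)
```

For example, `filter_text("online gambling site", ["gambling"])` covers the keyword and leaves the rest of the sentence intact, while `mode="delete"` removes the keyword entirely.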
In some optional implementations, the intelligent server may use machine learning to perform security filtering on the webpage content. For example, the intelligent server may build a sample set from the legal webpages, illegal webpages and manually filtered webpages in a database, analyze the content of each of these three groups, and then train on the webpage content to obtain a filtering model. In practical application, the content of the currently accessed webpage is analyzed and input into the filtering model, which outputs safe, legal webpage content.
In step 204, it is detected whether the filtered webpage content received from the intelligent server contains effective information.
In this embodiment, the terminal may receive data from the intelligent server and detect whether the received data contains effective information from the matching webpage. Specifically, if the matching webpage found by the snapshot server in step 202 is entirely filtered out by the intelligent server in step 203, that is, all the information in the matching webpage is blocked, the intelligent server may send the terminal a message that contains no effective information from the webpage. If the matching webpage found by the snapshot server in step 202 is only partially filtered by the intelligent server in step 203, the data the terminal receives from the intelligent server may contain the filtered webpage content, which includes effective information from the webpage.
After receiving the data sent by the intelligent server, the terminal may parse it and analyze whether it contains information related to the webpage content, that is, effective information. Optionally, the data returned by the intelligent server may also include status information. The status information may indicate the filtering result for the webpage, for example whether the webpage has been completely blocked. The terminal may also use this status information to judge whether the filtered webpage content contains effective information.
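One possible reading of this check is sketched below in Python. The application does not define the response format or what counts as effective information, so the sketch makes two assumptions: the intelligent server's reply is a dictionary with a hypothetical `fully_blocked` status flag and an `html` field, and "effective information" means any visible text left after filtering.

```python
from html.parser import HTMLParser


class _VisibleText(HTMLParser):
    """Collect text that is not inside <script> or <style> elements."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def has_effective_info(response: dict) -> bool:
    """Return True when the filtered page still carries visible text.
    The 'fully_blocked' and 'html' keys are assumed, not from the source."""
    if response.get("fully_blocked"):
        return False
    parser = _VisibleText()
    parser.feed(response.get("html") or "")
    parser.close()
    return bool("".join(parser.parts).strip())
```

The status flag is checked first, which mirrors the option in the text of judging from the status information alone; the content check is the fallback when no status is supplied.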
In step 205, if the filtered webpage content received from the intelligent server contains effective information, an interface for repairing the webpage is provided to the user.
If the terminal detects in step 204 that the filtered webpage content received from the intelligent server contains effective information, it can determine that the webpage can be repaired. The terminal may then add the received webpage content to a cache and provide the user with an interface for repairing the webpage. The interface may be presented in the terminal's browser as a pop-up window, for example a "Repair webpage" pop-up placed at the edge of the browser.
In some embodiments, the above webpage repair method may further include: repairing the webpage in response to the user's operation of repairing the webpage.
Through the interface provided by the terminal, the user learns that the currently accessed failed webpage can be repaired. If the user wishes to obtain further information, the user may perform the operation of repairing the webpage through that interface. Specifically, when the user chooses to repair the webpage (for example by clicking "Repair webpage"), the terminal may display the cached webpage content to the user, thereby repairing the webpage.
In the webpage repair method provided by the above embodiment, the terminal receives the webpage that the snapshot server found to match the failed webpage and has the intelligent server perform security filtering on it. The information of the failed webpage can thus be obtained and the webpage repaired, which improves the browser's ability to obtain information and helps ensure that the information obtained is safe and effective.
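Steps 201 to 205 on the terminal side can be summarized as a small control-flow sketch in Python. The network calls are passed in as callables so that the sketch stays self-contained; `try_repair` and all parameter names are hypothetical, and the real transport, caching and pop-up interface are outside its scope.

```python
from typing import Callable, Optional


def try_repair(
    url: str,
    access_time: float,
    page_failed: bool,
    request_snapshot: Callable[[str, float], Optional[str]],
    request_filtering: Callable[[str], str],
    has_effective_info: Callable[[str], bool],
) -> Optional[str]:
    """Return filtered content that can be offered for repair, or None
    when the page is valid or cannot be repaired."""
    if not page_failed:
        return None  # valid page: nothing to repair
    snapshot = request_snapshot(url, access_time)  # steps 201 and 202
    if snapshot is None:
        return None  # no matching snapshot: cannot be repaired
    filtered = request_filtering(snapshot)  # step 203
    if not has_effective_info(filtered):  # step 204
        return None
    # Step 205: the caller would cache this content and show the
    # "Repair webpage" interface to the user.
    return filtered
```

A caller would cache the returned content and display it only after the user chooses to repair the webpage.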
With further reference to Fig. 3, an exemplary flowchart of a webpage repair method according to another embodiment of the present application is shown.
As shown in Fig. 3, in step 301, the snapshot request information sent by the terminal is received.
When a search engine collects a webpage, a backup of the webpage may be saved on the snapshot server. In this embodiment, when the terminal detects that the webpage the user is currently accessing is a failed webpage, it may send snapshot request information to the snapshot server to obtain a snapshot of that webpage. The snapshot server may receive the snapshot request information from the terminal through a network device. The network device may include data-transmission equipment such as network cables, wireless routers and fiber-optic cables.
In some implementations, the snapshot request information may include at least the URL and the access time of the currently accessed webpage. It may further include the HTTP status code. In some implementations, it may also include the Internet Protocol (IP) address of the accessed webpage.
In step 302, a matching webpage is searched for based on the snapshot request information.
In this embodiment, the snapshot server may search its stored webpage snapshots according to the snapshot request. When the snapshot request includes the URL and the access time of the currently accessed webpage, the snapshot server may look among the stored webpages for those with the same URL. Because the snapshot server may keep backups of the same webpage taken at different times, it may find several snapshots for the URL of the failed webpage. The snapshot server may then use the access time to select, from those snapshots, the one whose save time is closest to the access time as the best match.
In some implementations, the snapshot server may search the stored webpages for those whose URL is similar to that of the currently accessed webpage. URL similarity may be computed as follows: extract the site feature and the directory feature from each URL, and compute the similarity between URLs from these features. This yields the webpages whose URLs are similar to that of the currently accessed webpage. The snapshot server may then screen the similar webpages by content features, for example selecting the one that best matches the currently accessed webpage based on the page title, keywords and so on.
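The application does not give a formula for URL similarity, so the Python sketch below is only one plausible instance of the idea. It assumes that the site feature is the host name, that the directory feature is the list of path segments, and that the two scores are combined with a hypothetical weight `site_weight`; none of these choices comes from the source.

```python
from urllib.parse import urlsplit


def url_features(url: str):
    """Split a URL into a site feature (host) and directory features (path segments)."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    segments = [seg for seg in parts.path.split("/") if seg]
    return host, segments


def url_similarity(a: str, b: str, site_weight: float = 0.5) -> float:
    """Score two URLs in [0, 1] from host equality and shared leading path segments."""
    host_a, dirs_a = url_features(a)
    host_b, dirs_b = url_features(b)
    site_score = 1.0 if host_a and host_a == host_b else 0.0
    common = 0
    for seg_a, seg_b in zip(dirs_a, dirs_b):
        if seg_a != seg_b:
            break
        common += 1
    longest = max(len(dirs_a), len(dirs_b))
    dir_score = common / longest if longest else 1.0
    return site_weight * site_score + (1.0 - site_weight) * dir_score
```

Candidates scoring above some cut-off could then be passed to the content-based screening by title and keywords that the text describes.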
In step 303, the query result is sent to the terminal so that the terminal sends the matching webpage to the intelligent server for security filtering.
In this embodiment, the snapshot server may send the query result to the terminal over the network. If the snapshot server found no matching webpage in step 302, it may send the terminal a message indicating that the query found no match. From this message the terminal can determine that the webpage cannot be repaired.
If the snapshot server found one or more matching webpages in step 302, it may send them to the terminal as the query result. After receiving the query result, the terminal may send the one or more matching webpages to the intelligent server for security filtering. Optionally, the snapshot server may also send the terminal a message informing it that a matching webpage was found, and the terminal may then send a webpage filtering request to the intelligent server in response to that message.
The intelligent server may filter the webpage sent by the terminal according to preset filtering rules such as filtering keywords. Specifically, it may first analyze the webpage content, for example by word segmentation, then match the result against a filtering keyword dictionary and filter the matched keywords out of the webpage. Filtering may be done by covering or by deleting. The pictures and videos in the webpage may also be extracted and filtered against preset picture and video libraries. In some implementations, the webpage may contain illegal links; the intelligent server may extract the links in the webpage, check whether each matches a link in an illegal-link library and, if so, filter the link out of the webpage.
Further, the intelligent server may replace illegal webpage content with legal content. For example, it may select legal text, pictures, videos and links related to the webpage content to replace the illegal text, pictures, videos and links. This recovers as much of the webpage's original information as possible while keeping the information legal and safe.
In the above embodiment of the present application, when the webpage the user is currently accessing has failed, the snapshot server can find a snapshot matching the failed webpage based on the snapshot request sent by the terminal and return it to the terminal, which sends it to the intelligent server for filtering. The failed webpage is thereby repaired, and the user obtains more information through the snapshot server's query result.
With further reference to Fig. 4, it illustrates the exemplary process diagram of the webpage restorative procedure according to another embodiment of the application.
As shown in Figure 4, in step 401, the webpage of receiving terminal transmission.
The webpage that terminal sends can for snapshot server based on terminal snapshot request information searching to the webpage of coupling.Snapshot request information can by described terminal response in determining that the webpage of current accessed is that inefficacy webpage sends.
In the present embodiment, if the webpage of user's current accessed is inefficacy webpage, then terminal can send snapshot request information to snapshot server.Snapshot request information can comprise the relevant information of inefficacy webpage and the relevant information of access, the HTTP status code, IP address etc. of the webpage that such as lost efficacy.In some implementations, snapshot request information at least can comprise the webpage that the URL(uniform resource locator) of the webpage of current accessed and access time snapshot server can mate with inefficacy webpage based on the URL in snapshot request information, time and IP address search.After finding, snapshot server can send the webpage of coupling to terminal.The webpage of coupling can be sent to intelligent server by terminal.The webpage of the coupling that intelligent server can be sent by network reception terminal.
In step 402, safety filtering is carried out to webpage.
In the present embodiment, the filtering rule that intelligent server can be preset based on filtering keys word etc. filters the webpage that terminal sends.Particularly, first can carry out analyzing and processing to web page contents, such as word segmentation processing, then mate based on filtering keys dictionary, by keyword filtering from webpage of coupling.Also can for extracting picture in webpage and video to the analyzing and processing of webpage.Then can filter picture and video based on the picture library preset and video library.Whether in some implementations, intelligent server can extract the link in webpage, detect link and mate with linking in illegal chained library, if coupling, then and can by link filtering from webpage.
In some optional implementations, carry out safety filtering to webpage can carry out in the following way: the safety value first calculating described webpage based on the model of training in advance, then judge whether safety value exceedes preset security threshold value, if do not exceeded, then based on keywords database, webpage is filtered; If exceeded, then webpage can be sent to filtering server, to carry out manual analysis.The webpage of a large amount of filtered can be utilized and train as training sample through the webpage of artificial filter.Wherein, the webpage of artificial filter can be divided into lower security grade, middle safe class and high safety grade.The webpage of lower security grade can be the webpage that risk is higher, needs depth analysis and manual analysis.The webpage of middle safe class can be the webpage that can filter based on keywords database, key picture storehouse, key video sequence storehouse and crucial chained library, and the webpage of high safety grade can be the webpage without the need to filtering.The safety value that each safe class is corresponding different is interval.When applying, first can calculate the safety value of webpage to be filtered based on the safety value computation model trained, the safe class of webpage to be filtered is determined in the interval then belonging to safety value.Alternatively, when the safety value of webpage to be filtered does not exceed preset security threshold value, can think that webpage to be filtered is the webpage of middle safe class.This preset security threshold value can draw based on the sample training of big data quantity.Row can be tapped into based on keywords database, key picture storehouse, key video sequence storehouse and crucial chained library to the keyword in webpage, key picture, key video sequence and chaining key to filter, delete or cover the keyword in webpage, key picture, key video sequence and crucial link.
When the safety value of the webpage to be filtered exceeds the preset safety threshold, the webpage can be regarded as a webpage of the low security level; it can then be sent to the back-end filtering server for manual analysis.
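The threshold decision described above can be sketched as follows. This is a sketch under stated assumptions: the scoring model is passed in as a plain callable because the text does not specify the trained model, and the names `FilterDecision` and `route_webpage` are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class FilterDecision:
    action: str          # "library_filter" or "manual_analysis"
    safety_value: float  # value produced by the (assumed) pre-trained model


def route_webpage(page: str, score: Callable[[str], float], threshold: float) -> FilterDecision:
    """Route a webpage according to its safety value and the preset threshold."""
    value = score(page)
    if value > threshold:
        # Exceeds the threshold: treated as low security level, sent for manual analysis.
        return FilterDecision("manual_analysis", value)
    # Does not exceed the threshold: filtered against the keyword and other libraries.
    return FilterDecision("library_filter", value)
```

A value exactly equal to the threshold does not exceed it, so in this sketch it is routed to library-based filtering.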
Further, the intelligent server can choose other words, pictures, videos and links related to the webpage content to replace the keywords, key pictures, key videos and key links in the webpage, thereby recovering the original information of the webpage as far as possible while ensuring the legitimacy and security of the information.
In step 403, the filtered webpage is sent to the terminal.
In this embodiment, after performing safety filtering on the webpage, the intelligent server can send the filtered webpage to the terminal over the network. The filtered webpage can contain the effective information the user is interested in and is highly secure. After receiving the webpage fed back by the intelligent server, the terminal can provide the user with an interface for repairing the webpage. When the user issues an instruction to repair the webpage through this interface, the terminal can display the filtered webpage received from the intelligent server in the browser interface, thereby providing the user with information related to the content of the failed webpage.
In the embodiment described above with reference to Fig. 4, the intelligent server performs safety filtering on the snapshot of the failed webpage that the terminal obtained from the snapshot server. This repairs the failed webpage while ensuring the security and legitimacy of the repaired webpage information.
It should be noted that, in the embodiments described above with reference to Fig. 2, Fig. 3 and Fig. 4, the snapshot server and the intelligent server can be the same server, which integrates the snapshot lookup function and the safety filtering function. In this case, after finding the matching webpage based on the snapshot request, the server can send the matching webpage to the terminal; the terminal then detects whether the content of the webpage is valid and feeds the webpage back to the server. In some implementations, the server can also directly perform safety filtering on the matching webpage it finds and send the filtered webpage to the terminal, without feeding the lookup result back to the terminal.
With further reference to Fig. 5, a schematic diagram of the effect of applying the webpage repair method in a browser is shown. As shown in Fig. 5, when browser 510 opens a webpage, the web server returns connection status 511 to browser 510. In connection status 511, the HTTP status code of the currently accessed webpage is "404 Not Found", that is, browser 510 did not obtain the requested webpage information from the web server. The webpage requested by the user can then be regarded as a failed webpage. Browser 510 can initiate a request to the snapshot server, which can look up the webpage most similar to the one the user requested and feed the lookup result back to browser 510. The browser can then send the webpage returned by the snapshot server to the intelligent server for filtering and receive the filtered webpage. At this point, browser 510 can generate pop-up window 512 in the webpage, prompting the user that this webpage can be repaired and asking whether to view the repaired webpage. When the user selects "Yes", browser 510 can display the webpage received from the intelligent server in the current page.
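As a rough illustration of how a terminal might decide that a page has failed from its HTTP status code, consider the sketch below. Only "404 Not Found" is named in the text; the configurable set `FAILED_STATUSES` and the function names are assumptions made for illustration, and an implementation could add other status codes to the set.

```python
from http import HTTPStatus

# Only 404 is confirmed by the description; other codes would be an assumption.
FAILED_STATUSES = frozenset({HTTPStatus.NOT_FOUND})


def is_failed_page(status_code: int, failed_statuses=FAILED_STATUSES) -> bool:
    """Return True when the status code marks the page as a failed webpage."""
    return status_code in failed_statuses


def describe_status(status_code: int) -> str:
    """Format a status line such as the one shown in connection status 511."""
    return f"{status_code} {HTTPStatus(status_code).phrase}"
```

Keeping the set of failure codes as a parameter lets the same check cover other failure causes without changing the function.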
With further reference to Fig. 6, a schematic structural diagram of a terminal according to an embodiment of the present application is shown. As shown in Fig. 6, the terminal can include a first sending unit 601, a receiving unit 602, a second sending unit 603, a detection unit 604 and a processing unit 605. The first sending unit 601 can be used to send snapshot request information to the snapshot server when the currently accessed webpage is a failed webpage, so that the snapshot server looks up a matching webpage. The receiving unit 602 can be used to receive the matching webpage fed back by the snapshot server. The second sending unit 603 can be used to send the matching webpage to the intelligent server, so that the intelligent server performs safety filtering on it. The detection unit 604 can be used to detect whether the filtered webpage content received from the intelligent server includes effective information. The processing unit 605 can be used to provide the user with an interface for repairing the webpage in response to the filtered webpage content received from the intelligent server including effective information. Optionally, the intelligent server and the snapshot server can be the same server, which has both the snapshot lookup function and the safety filtering function.
The snapshot request information sent by the first sending unit 601 can include information about the webpage the user is currently accessing, such as the HTTP status code and IP address. In some implementations, the snapshot request information can include at least the uniform resource locator (URL) and access time of the currently accessed webpage.
In some implementations, the receiving unit 602 can also receive, over the network, the message returned by the snapshot server and judge from the received message whether the snapshot server found a matching webpage. When it determines that the snapshot server found a matching webpage, it receives the webpage data sent by the snapshot server over the network. The processing unit 605 can configure the interface in the browser, and the user can choose to repair the webpage through the interface configured by the processing unit 605.
In some embodiments, terminal 600 can also include a repair unit 606 (not shown). The repair unit 606 can be used to repair the webpage in response to the user's operation of repairing the webpage.
The terminal provided by the above embodiment initiates a snapshot request to the snapshot server, receives the matching webpage returned by the snapshot server, sends the matching webpage to the intelligent server for safety filtering, and finally receives the filtered webpage from the intelligent server. It can thereby repair webpages that cannot be accessed for reasons such as the webpage having expired or the web server being overloaded, obtain the information of the failed webpage, improve the browser's ability to obtain information, and ensure that the obtained information is safe and effective.
With further reference to Fig. 7, a schematic structural diagram of a server according to an embodiment of the present application is shown. As shown in Fig. 7, server 700 can include a receiving unit 701, a lookup unit 702 and a sending unit 703. The receiving unit 701 can be used to receive the snapshot request information sent by the terminal. The lookup unit 702 can be used to look up a matching webpage based on the snapshot request information received by the receiving unit 701. The sending unit 703 can be used to send the query result to the terminal, so that the terminal sends the matching webpage to a server with a filtering function for safety filtering.
In some optional implementations, the snapshot request information can include at least the URL and access time of the currently accessed webpage. The lookup unit 702 can look up the matching webpage as follows: among the stored webpages, query the webpage that has the same URL as the currently accessed webpage and whose save time is closest to the access time, and take the webpage found as the matching webpage.
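The lookup rule described above (same URL, save time closest to the access time) can be sketched as follows. The in-memory list of snapshots and the names `Snapshot` and `find_matching_snapshot` are hypothetical; a real snapshot server would presumably query a database or index instead.

```python
from datetime import datetime
from typing import NamedTuple, Optional, Sequence


class Snapshot(NamedTuple):
    url: str
    saved_at: datetime  # time at which the snapshot was stored
    content: str


def find_matching_snapshot(
    snapshots: Sequence[Snapshot], url: str, access_time: datetime
) -> Optional[Snapshot]:
    """Return the stored snapshot with the same URL whose save time is closest
    to the access time, or None when no snapshot has that URL."""
    candidates = [s for s in snapshots if s.url == url]
    if not candidates:
        return None
    # abs() of a timedelta gives the distance in time regardless of direction.
    return min(candidates, key=lambda s: abs(s.saved_at - access_time))
```

Returning `None` for a missing URL corresponds to the case in which the snapshot server reports that no matching webpage was found.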
The server provided by the embodiment described above with reference to Fig. 7 can process the snapshot request sent by the terminal, look up the webpage that matches the failed webpage accessed by the user, and send the matching webpage to the terminal. Information related to the failed webpage can thus be obtained, improving the browser's ability to obtain information.
With further reference to Fig. 8, a schematic structural diagram of a server according to another embodiment of the present application is shown. As shown in Fig. 8, server 800 can include a receiving unit 801, a filtering unit 802 and a sending unit 803. The receiving unit 801 can be used to receive the webpage sent by the terminal. The filtering unit 802 can be used to perform safety filtering on the webpage. The sending unit 803 can be used to send the filtered webpage to the terminal. The webpage received by the receiving unit 801 can be the matching webpage found based on the terminal's snapshot request information, which the terminal can send in response to determining that the currently accessed webpage is a failed webpage. In some implementations, the snapshot request information can include at least the URL and access time of the currently accessed webpage.
In some implementations, the filtering unit 802 can perform safety filtering on the webpage as follows: calculate the safety value of the webpage based on a pre-trained model; judge whether the safety value exceeds a preset safety threshold; if it does not, filter the webpage based on the keyword library; if it does, send the webpage to a designated server for manual analysis. In this way, the filtered webpage can be ensured to contain effective, safe and reliable content.
The server provided by the embodiment described above with reference to Fig. 8 can repair a failed webpage by performing safety filtering on the matching webpage sent by the terminal, so that the user can obtain effective webpage information even when a webpage has failed.
It should be understood that the units recorded in terminal 600, server 700 and server 800 correspond respectively to the steps of the methods described with reference to Figs. 2, 3 and 4. The operations and features described above for the methods therefore apply equally to terminal 600, server 700 and server 800 and the units they contain, and are not repeated here.
In some implementations, the units of server 700 and server 800 can be integrated in the same server, so that snapshot lookup and safety filtering are performed by one server. This server can include the units of both server 700 and server 800.
Referring to Fig. 9, a schematic structural diagram of a webpage repair system according to an embodiment of the present application is shown. As shown in Fig. 9, webpage repair system 900 can include terminal 600, server 700 and server 800.
Terminal 600 can include a first sending unit, a receiving unit, a second sending unit, a detection unit and a processing unit. The first sending unit can be used to send snapshot request information to server 700 when the currently accessed webpage is a failed webpage, so that server 700 looks up a matching webpage. The receiving unit can be used to receive the matching webpage fed back by server 700. The second sending unit can be used to send the matching webpage to server 800, so that server 800 performs safety filtering on it. The detection unit can be used to detect whether the filtered webpage content received from server 800 includes effective information. The processing unit can be used to provide the user with an interface for repairing the webpage in response to the filtered webpage content received from server 800 including effective information.
Server 700 can include a receiving unit, a lookup unit and a sending unit. The receiving unit can be used to receive the snapshot request information sent by terminal 600. The lookup unit can be used to look up a matching webpage based on the snapshot request information received by the receiving unit. The sending unit can be used to send the query result to terminal 600, so that terminal 600 sends the matching webpage to server 800 for safety filtering.
Server 800 can include a receiving unit, a filtering unit and a sending unit. The receiving unit can be used to receive the webpage sent by terminal 600. The filtering unit can be used to perform safety filtering on the received webpage. The sending unit can be used to send the filtered webpage to terminal 600. The webpage received by the receiving unit can be the matching webpage that server 700 found based on the snapshot request information of terminal 600, which terminal 600 can send in response to determining that the currently accessed webpage is a failed webpage.
In some implementations, the units of server 700 and server 800 can be integrated in the same server, in which case the webpage repair system can include terminal 600 and a server integrating the units of server 700 and server 800.
With further reference to Fig. 10, a schematic diagram of data interaction in a webpage repair system according to an embodiment of the present application is shown. In this embodiment, snapshot server 1002 can be used to look up webpage snapshots, and intelligent server 1003 can be used to filter the webpages found by the snapshot server.
As shown in Fig. 10, when the user accesses a URL through terminal 1001 or searches in a search engine, terminal 1001 can detect the content returned by the web server. When the webpage cannot be accessed because of factors such as an expired URL, excessive server load or traffic control, terminal 1001 can initiate snapshot request 1010 to snapshot server 1002. After receiving snapshot request 1010, the snapshot server can perform a snapshot lookup according to the information about the accessed webpage in the request (such as the URL and access time). If a matching webpage snapshot is found, snapshot server 1002 can send message 1020, feeding back the matching webpage, to terminal 1001. Terminal 1001 can then initiate filter request 1030 to intelligent server 1003, sending it the matching webpage. Intelligent server 1003 can filter the webpage content based on preset rules and then return message 1040, feeding back the filter result, to terminal 1001. If message 1040 contains effective information of the webpage, the webpage can be repaired: in response to the user's request to repair the webpage, terminal 1001 can display the webpage content extracted from message 1040 to the user, thereby repairing the webpage.
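The message sequence 1010 to 1040 can be sketched from the terminal's side as follows. This is only an illustrative outline: the function `repair_page` and its callable parameters are hypothetical, the network calls are injected as plain callables, and a non-empty check stands in for the unspecified test of whether the filter result contains effective information.

```python
from typing import Callable, Optional


def repair_page(
    url: str,
    access_time,
    request_snapshot: Callable[[str, object], Optional[str]],
    request_filter: Callable[[str], str],
    user_accepts: Callable[[], bool],
) -> Optional[str]:
    """Return the repaired page content to display, or None if repair is not possible."""
    # Snapshot request 1010 and feedback message 1020.
    snapshot = request_snapshot(url, access_time)
    if snapshot is None:
        return None  # the snapshot server found no matching webpage
    # Filter request 1030 and filter result message 1040.
    filtered = request_filter(snapshot)
    # Assumed stand-in for detecting effective information: non-blank content.
    if not filtered or not filtered.strip():
        return None
    # Show the filtered page only after the user chooses to repair the webpage.
    return filtered if user_accepts() else None
```

Injecting the two server calls as parameters also covers the variant in which the snapshot server and the intelligent server are the same server.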
The webpage repair system provided by the above embodiments of the present application performs a snapshot lookup for the failed webpage and safety filtering on the webpage found, so that the information of the failed webpage can be obtained at the terminal, improving the browser's ability to obtain information while ensuring the security and validity of the obtained information.
As another aspect, the present application also provides a computer-readable storage medium. This may be the computer-readable storage medium included in the device described in the above embodiments, or it may be a computer-readable storage medium that exists separately and is not assembled into a terminal device. The computer-readable storage medium stores one or more programs, which can include program code for executing the methods shown in the flowcharts.
The flowcharts and block diagrams in the drawings illustrate the architecture, functions and operations of possible implementations of systems, devices, methods and computer program products according to various embodiments of the present invention. In this regard, each block in a flowchart or block diagram can represent a module, a program segment or a portion of code, which contains one or more executable instructions for implementing the specified logic function. It should also be noted that, in some alternative implementations, the functions marked in the blocks can occur in an order different from that marked in the drawings. For example, two blocks shown in succession can in fact be executed substantially in parallel, or sometimes in the reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flowcharts, and combinations of such blocks, can be implemented by a dedicated hardware-based system that performs the specified functions or operations, or by a combination of dedicated hardware and computer instructions.
The above description is only of the preferred embodiments of the present application and an explanation of the technical principles applied. Those skilled in the art should understand that the scope of the invention involved in the present application is not limited to technical solutions formed by the particular combination of the above technical features, and should also cover other technical solutions formed by any combination of the above technical features or their equivalents without departing from the inventive concept, for example technical solutions formed by replacing the above features with technical features having similar functions disclosed in (but not limited to) the present application.

Claims (17)

1. A webpage repair method, characterized in that the method comprises:
If the currently accessed webpage is a failed webpage, sending snapshot request information to a snapshot server, for the snapshot server to look up a matching webpage;
Receiving the matching webpage fed back by the snapshot server;
Sending the matching webpage to an intelligent server, for the intelligent server to perform safety filtering on the matching webpage;
Detecting whether the filtered webpage content received from the intelligent server includes effective information; and
If the filtered webpage content received from the intelligent server includes effective information, providing the user with an interface for repairing the webpage.
2. The method according to claim 1, characterized in that the snapshot request information comprises at least the uniform resource locator (URL) and access time of the currently accessed webpage.
3. The method according to claim 1 or 2, characterized in that the method further comprises:
Repairing the webpage in response to the user's operation of repairing the webpage.
4. A webpage repair method, characterized in that the method comprises:
Receiving snapshot request information sent by a terminal;
Looking up a matching webpage based on the snapshot request information; and
Sending the query result to the terminal, for the terminal to send the matching webpage to an intelligent server for safety filtering.
5. The method according to claim 4, characterized in that the snapshot request information comprises at least the uniform resource locator (URL) and access time of the currently accessed webpage;
The looking up of a matching webpage based on the snapshot request information comprises:
Querying, among the stored webpages, the webpage that has the same URL as the currently accessed webpage and whose save time is closest to the access time.
6. A webpage repair method, characterized in that the method comprises:
Receiving a webpage sent by a terminal;
Performing safety filtering on the webpage; and
Sending the filtered webpage to the terminal;
Wherein the webpage is a matching webpage found by a snapshot server based on snapshot request information of the terminal, and the snapshot request information is sent by the terminal in response to determining that the currently accessed webpage is a failed webpage.
7. The method according to claim 6, characterized in that the snapshot request information comprises at least the uniform resource locator (URL) and access time of the currently accessed webpage.
8. The method according to claim 6 or 7, characterized in that the performing of safety filtering on the webpage comprises:
Calculating a safety value of the webpage based on a pre-trained model;
Judging whether the safety value exceeds a preset safety threshold; and
If it does not, filtering the webpage based on a keyword library, a key picture library, a key video library and a key link library.
9. A terminal, characterized in that the terminal comprises:
A first sending unit, for sending snapshot request information to a snapshot server when the currently accessed webpage is a failed webpage, for the snapshot server to look up a matching webpage;
A receiving unit, for receiving the matching webpage fed back by the snapshot server;
A second sending unit, for sending the matching webpage to an intelligent server, for the intelligent server to perform safety filtering on the matching webpage;
A detection unit, for detecting whether the filtered webpage content received from the intelligent server includes effective information; and
A processing unit, for providing the user with an interface for repairing the webpage in response to the filtered webpage content received from the intelligent server including effective information.
10. The terminal according to claim 9, characterized in that the snapshot request information comprises at least the uniform resource locator (URL) and access time of the currently accessed webpage.
11. The terminal according to claim 9 or 10, characterized in that it further comprises:
A repair unit, for repairing the webpage in response to the user's operation of repairing the webpage.
12. A server, characterized in that the server comprises:
A receiving unit, for receiving snapshot request information sent by a terminal;
A lookup unit, for looking up a matching webpage based on the snapshot request information; and
A sending unit, for sending the query result to the terminal, for the terminal to send the matching webpage to a server with a filtering function for safety filtering.
13. The server according to claim 12, characterized in that the snapshot request information comprises at least the uniform resource locator (URL) and access time of the currently accessed webpage;
The lookup unit is used to look up the matching webpage as follows:
Querying, among the stored webpages, the webpage that has the same URL as the currently accessed webpage and whose save time is closest to the access time.
14. A server, characterized in that the server comprises:
A receiving unit, for receiving a webpage sent by a terminal;
A filtering unit, for performing safety filtering on the webpage; and
A sending unit, for sending the filtered webpage to the terminal;
Wherein the webpage is a matching webpage found based on snapshot request information of the terminal, and the snapshot request information is sent by the terminal in response to determining that the currently accessed webpage is a failed webpage.
15. The server according to claim 14, characterized in that the snapshot request information comprises at least the uniform resource locator (URL) and access time of the currently accessed webpage.
16. The server according to claim 14 or 15, characterized in that the filtering unit is used to perform safety filtering on the webpage as follows:
Calculating a safety value of the webpage based on a pre-trained model;
Judging whether the safety value exceeds a preset safety threshold; and
If it does not, filtering the webpage based on a keyword library.
17. A webpage repair system, characterized in that the system comprises the terminal according to any one of claims 9-11, the server according to claim 12 or 13, and the server according to any one of claims 14-16.
CN201510342371.2A 2015-06-18 2015-06-18 Webpage repair method, terminal, server and system Pending CN104899320A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510342371.2A CN104899320A (en) 2015-06-18 2015-06-18 Webpage repair method, terminal, server and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510342371.2A CN104899320A (en) 2015-06-18 2015-06-18 Webpage repair method, terminal, server and system

Publications (1)

Publication Number Publication Date
CN104899320A true CN104899320A (en) 2015-09-09

Family

ID=54031982

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510342371.2A Pending CN104899320A (en) 2015-06-18 2015-06-18 Webpage repair method, terminal, server and system

Country Status (1)

Country Link
CN (1) CN104899320A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106649581A (en) * 2016-11-17 2017-05-10 青岛海信电器股份有限公司 Method for repairing webpage and client side
CN107784230A (en) * 2017-02-16 2018-03-09 平安科技(深圳)有限公司 The restorative procedure and device of page leak
CN110912918A (en) * 2019-12-02 2020-03-24 泰康保险集团股份有限公司 Page repairing method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101634993A (en) * 2009-08-26 2010-01-27 中国科学院地理科学与资源研究所 Agricultural information cooperative service system based on process body and implementation method thereof
CN102402613A (en) * 2011-12-20 2012-04-04 上海电机学院 System and method for filtering text information of webpage
CN103838728A (en) * 2012-11-21 2014-06-04 腾讯科技(深圳)有限公司 Webpage information processing method and browser
CN103870606A (en) * 2014-04-08 2014-06-18 上海语天信息技术有限公司 Webpage information extracting system and extracting method
CN104504071A (en) * 2014-12-22 2015-04-08 北京奇虎科技有限公司 SE (search engine)-based web cache providing method and web search client and server

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106649581A (en) * 2016-11-17 2017-05-10 青岛海信电器股份有限公司 Method for repairing webpage and client side
CN106649581B (en) * 2016-11-17 2023-04-04 Vidaa(荷兰)国际控股有限公司 Webpage repairing method and client
CN107784230A (en) * 2017-02-16 2018-03-09 平安科技(深圳)有限公司 The restorative procedure and device of page leak
WO2018149374A1 (en) * 2017-02-16 2018-08-23 平安科技(深圳)有限公司 Page vulnerability remediation method and device
CN107784230B (en) * 2017-02-16 2019-06-07 平安科技(深圳)有限公司 The restorative procedure and device of page loophole
CN110912918A (en) * 2019-12-02 2020-03-24 泰康保险集团股份有限公司 Page repairing method and device

Similar Documents

Publication Publication Date Title
CN102200980B (en) Method and system for providing network resources
CN102436564A (en) Method and device for identifying falsified webpage
US20100268776A1 (en) System and Method for Determining Information Reliability
CN103888490A (en) Automatic WEB client man-machine identification method
CN102752288A (en) Method and device for identifying network access action
CN103618696B (en) Method and server for processing cookie information
CN114417197A (en) Access record processing method and device and storage medium
CN101441629A (en) Automatic acquiring method of non-structured web page information
CN107085549B (en) Method and device for generating fault information
CN107590236B (en) Big data acquisition method and system for building construction enterprises
CN103838862B (en) Video searching method, device and terminal
CN103823907A (en) Method, device and engine for integrating on-line video resource addresses
CN105763543A (en) Phishing site identification method and device
CN104391978A (en) Method and device for storing and processing web pages of browsers
CN105069011A (en) Webpage favorite management method, device and system
CN102185830B (en) A kind of method and system of security filtration of network television browser
CN108874802A (en) Page detection method and device
CN106547803B (en) Method and device for crawling incremental resources of website
CN105488402A (en) Dark link detection method and system
CN104899320A (en) Webpage repair method, terminal, server and system
CN108270754B (en) Detection method and device for phishing website
CN116015842A (en) Network attack detection method based on user access behaviors
CN104281629A (en) Method and device for extracting picture from webpage and client equipment
CN103246675B (en) A kind of method and apparatus for being used to capture website data
CN103605742A (en) Method and device for recognizing network resource entity content page

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20150909
