CN109560979A - Data detection method and system, server - Google Patents

Info

Publication number
CN109560979A
Authority
CN
China
Prior art keywords
address url
url
address
access
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710888265.3A
Other languages
Chinese (zh)
Inventor
程峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN201710888265.3A
Publication of CN109560979A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/02 Capturing of monitoring data
    • H04L43/028 Capturing of monitoring data by filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

This application provides a data detection method and system. One of the methods includes: determining URL addresses from the access data of URL addresses, determining the page data corresponding to each URL address, and detecting the page data corresponding to each URL address. Because the application detects only the page data of URL addresses that users have actually accessed, and does not detect the page data of URL addresses that need no detection, it can improve the detection efficiency of page data.

Description

Data detection method and system, server
Technical field
This application relates to the field of communication technology, and in particular to a data detection method and system.
Background
To detect abnormal data on a website, the server corresponding to the website (hereinafter, the client server) usually sends the URL address of the website's home page to a server that provides a detection service (hereinafter, the detection server). The detection server adds the URL address of the home page to a to-be-detected list.
For a given URL address, the detection server obtains the corresponding page data from the client server and checks whether the page data is abnormal. If the page data contains links, the detection server extracts them by crawling and adds the linked URL addresses to the to-be-detected list. The detection server processes the page data of every URL address in the to-be-detected list one by one in this manner.
Access patterns differ across the URL addresses of a website: some are accessed heavily, while others are never accessed. Detecting the page data of a URL address that nobody accesses has little value, so the to-be-detected list contains some URL addresses that need no detection.
In the existing scheme, when the detection server works through the to-be-detected list, it also detects these low-value URL addresses, which reduces the detection efficiency of page data.
Summary of the invention
In view of this, the application provides a data detection method and system that can improve the detection efficiency of page data.
To achieve the above objective, the application provides the following technical means:
A data detection system, comprising:
a client server, configured to: record access data of URL addresses while the website is running and send the access data of the URL addresses to a detection server; receive a page data acquisition instruction containing a URL address sent by the detection server; look up the page data corresponding to the URL address; and send the page data corresponding to the URL address to the detection server;
a detection server, configured to: receive the access data of URL addresses sent by the client server; determine a URL address from the access data of the URL addresses; send a page data acquisition instruction containing the URL address to the client server; receive the page data corresponding to the URL address sent by the client server; and detect the page data corresponding to the URL address.
A data detection method, comprising:
determining a URL address from the access data of URL addresses;
determining the page data corresponding to the URL address;
detecting the page data corresponding to the URL address.
Optionally, determining a URL address from the access data of URL addresses comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes multiple URL address access records;
analyzing the multiple access information sets to determine the URL addresses that appear in them.
Optionally, determining a URL address from the access data of URL addresses comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes multiple URL address access records;
analyzing the multiple access information sets to determine the URL addresses that appear in them and the access frequency of each URL address;
selecting multiple URL addresses in descending order of access frequency.
Optionally, determining a URL address from the access data of URL addresses comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes multiple URL address access records;
analyzing the multiple access information sets to determine the URL addresses that appear in them, the access frequency of each URL address, and the attribute information of each URL address;
determining a weight for each URL address based on its attribute information;
selecting multiple URL addresses in descending order of the product of each URL address's access frequency and weight.
Optionally, determining a URL address from the access data of URL addresses comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes multiple URL address access records;
analyzing the multiple access information sets to determine the URL addresses that appear in them, the access frequency of each URL address, and the attribute information of each URL address;
determining a weight for each URL address based on its attribute information and its access frequency;
selecting multiple URL addresses in descending order of the product of each URL address's access frequency and weight.
Optionally, the method further comprises:
increasing the detection frequency of the selected URL addresses.
Optionally, the method further comprises:
while detecting the page data corresponding to a URL address, recording the URL address if its page data is abnormal;
after detecting the page data corresponding to the URL addresses, sending the recorded URL addresses.
A data detection method, comprising:
recording access data of URL addresses while the website is running and sending the access data of the URL addresses to a detection server;
after receiving a page data acquisition instruction containing a URL address sent by the detection server, looking up the page data corresponding to the URL address;
sending the page data corresponding to the URL address to the detection server.
Optionally, recording access data of URL addresses while the website is running and sending the access data of the URL addresses to the detection server comprises:
recording, while the website is running, the multiple URL address access records of one sending period as one access information set;
sending the access information set to the detection server.
Optionally, the method further comprises:
receiving, from the detection server, the URL addresses whose page data is abnormal;
displaying the URL addresses whose page data is abnormal.
A data detection system, comprising:
a client server, configured to record access data of URL addresses while the website is running, determine URL addresses from the access data, look up the page data corresponding to the URL addresses, and detect that page data.
The above technical means can achieve the following beneficial effects:
The application records access data of URL addresses while the website is running and determines URL addresses from that access data. Because the access data is generated when users actually access the website, the URL addresses determined by the application are ones that users have actually accessed.
Because the application detects only the page data of URL addresses that users have actually accessed, and does not detect the page data of URL addresses that need no detection, it filters out the low-value URL addresses and can therefore improve the detection efficiency of page data.
Brief description of the drawings
To describe the technical solutions in the embodiments of the application or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show merely some embodiments of the application; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1a is a schematic structural diagram of a data detection system disclosed in an embodiment of the application;
Fig. 1b is a flowchart of a data detection method disclosed in an embodiment of the application;
Fig. 2 is a flowchart of another data detection method disclosed in an embodiment of the application;
Fig. 3 is a flowchart of yet another data detection method disclosed in an embodiment of the application.
Detailed description of the embodiments
The technical solutions in the embodiments of the application are described clearly and completely below with reference to the drawings. The described embodiments are only some, not all, of the embodiments of the application. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the application.
Explanation of terms:
URL: Uniform Resource Locator (also expanded as Universal Resource Locator), also called a URL address. A URL address is a concise representation of the location of a resource on the Internet and of the method for accessing it; each resource on the Internet corresponds to a unique URL address.
A person skilled in the art will understand that the servers in the following embodiments are one type of computing device (besides servers, computing devices also include smart terminals, PCs, mobile phones, laptops, and so on), and that the method proposed in the application can be applied to various types of computing devices.
To aid understanding, the application provides a data detection system. Referring to Fig. 1a, the data detection system includes multiple client servers 100 and a detection server 200.
Because every client server 100 follows the same procedure, the processing of the client server 100 and the detection server 200 is described below using one client server as an example.
According to an embodiment provided by the application, a data detection method is provided. Referring to Fig. 1b, the method includes the following steps:
Step S101: the client server 100 sends access data of URL addresses to the detection server 200. The access data of URL addresses is recorded while the website is running.
Case 1: the client server records access records for all website data.
The client server 100 is the server corresponding to the website. While the website is running, terminals access website data, and the client server 100 can record each such access.
Case 2: the client server records access records for part of the website data.
Depending on the needs of the application scenario, the client server may record access records only for a specified part of the website data. For example, when website data is updated on the client server, the client server may focus on recording accesses to the updated website data.
In both cases, a terminal accessing the website sends an access request for a URL address to the client server 100. The client server 100 can store URL address access records in a log, or obtain and store them using a script.
A URL address access record includes the URL address, the access time, the source IP address of the access (the IP address of the terminal that sent the access request), and similar content. As terminals keep accessing the website, the client server 100 accumulates access records for many accessed URL addresses.
The client server 100 can send URL address access records to the detection server 200 in real time, or send them to the detection server 200 at a preset sending period.
To help the detection server 200 distinguish the access records sent by different client servers, the client server 100 can also send its own identifier along with the access records.
Take sending by period as an example and assume the sending period is 10 minutes. The client server 100 then records locally the URL address access records that arise within one sending period (10 minutes).
It then combines the access records of this sending period into one access information set and sends the access information set, together with the identifier of the client server, to the detection server 200.
The identifier of the client server may be its IP address, its factory identifier, an identifier assigned to it by the detection server, or the like; no limitation is imposed here.
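The periodic batching described in step S101 can be sketched as follows. This is a minimal illustration only: the class names `AccessRecord`, `AccessInfoSet`, and `AccessRecorder`, the identifier `client-100`, and the sample URL addresses and IP addresses are all hypothetical and do not come from the application. The sketch leaves out the timer that would call `flush` once per sending period and the network transfer to the detection server.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass(frozen=True)
class AccessRecord:
    url: str               # URL address that was requested
    accessed_at: datetime  # access time
    source_ip: str         # IP address of the terminal that sent the request


@dataclass
class AccessInfoSet:
    server_id: str         # identifier of the client server (format is assumed)
    records: list = field(default_factory=list)


class AccessRecorder:
    """Buffers access records locally and hands them over once per sending period."""

    def __init__(self, server_id):
        self.server_id = server_id
        self._buffer = []

    def record(self, url, accessed_at, source_ip):
        self._buffer.append(AccessRecord(url, accessed_at, source_ip))

    def flush(self):
        """Return the buffered records as one access information set and reset the buffer."""
        batch = AccessInfoSet(self.server_id, self._buffer)
        self._buffer = []
        return batch


recorder = AccessRecorder("client-100")
recorder.record("/index.html", datetime(2017, 9, 27, 12, 1), "203.0.113.5")
recorder.record("/cart", datetime(2017, 9, 27, 12, 4), "203.0.113.9")
batch = recorder.flush()  # would be called at the end of each sending period
```

Carrying the server identifier inside each set is one way to let the receiver tell client servers apart; the application allows other identifier schemes.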
Step S102: the detection server 200 receives and stores the access data of URL addresses sent by the client server.
Whether the client server 100 sends access records in real time or sends access information sets by sending period, the detection server 200 first determines the identifier of the client server, determines the storage space corresponding to that identifier, and stores the access records or access information sets in that storage space.
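One possible reading of "storage space corresponding to the identifier" is a store keyed by client server identifier. The sketch below assumes an in-memory dictionary; the class name `AccessStore` and the tuple layout of the records are hypothetical, and a real deployment would more likely use a database or per-client files.

```python
from collections import defaultdict


class AccessStore:
    """Keeps received access records in a separate bucket per client server."""

    def __init__(self):
        self._by_server = defaultdict(list)

    def store(self, server_id, records):
        # Works for a single real-time record (a one-item list) or a whole set.
        self._by_server[server_id].extend(records)

    def records_for(self, server_id):
        # Return a copy so callers cannot modify the stored bucket.
        return list(self._by_server.get(server_id, []))


store = AccessStore()
store.store("client-100", [("/index.html", "12:01")])
store.store("client-101", [("/about", "12:02")])
store.store("client-100", [("/cart", "12:04")])
```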
Step S103: the detection server 200 determines URL addresses from the access data of URL addresses.
Because page data does not change in real time and page data detection consumes considerable resources, the detection server 200 can run page data detection at a certain detection cycle.
When the detection server 200 determines that a detection cycle has been reached, it uses the access times in the stored access data to determine the access data recorded during this detection cycle.
Take a detection cycle of 1 hour as an example and assume the detection moment is 13:00. Among the stored URL address access records, the detection server looks up those whose access time falls between 12:00 and 13:00 and uses them as the access data for this detection cycle.
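The one-hour example can be sketched as a filter over stored records. The function name `records_in_cycle` and the `(url, access_time)` tuple layout are hypothetical. The application does not say whether the window boundaries are inclusive; the sketch assumes a half-open window (start included, detection moment excluded) so that consecutive cycles do not count a record twice.

```python
from datetime import datetime, timedelta


def records_in_cycle(records, detection_time, cycle=timedelta(hours=1)):
    """Return the (url, access_time) records that fall in the cycle ending at detection_time."""
    window_start = detection_time - cycle
    return [(url, ts) for url, ts in records if window_start <= ts < detection_time]


records = [
    ("/old", datetime(2017, 9, 27, 11, 59)),
    ("/index.html", datetime(2017, 9, 27, 12, 0)),
    ("/cart", datetime(2017, 9, 27, 12, 30)),
    ("/late", datetime(2017, 9, 27, 13, 0)),
]
current = records_in_cycle(records, datetime(2017, 9, 27, 13, 0))
```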
Three implementations of determining URL addresses from the access data are described below:
First implementation: determine all URL addresses from the access data.
The detection server 200 can analyze every URL address access record in the access data and obtain all URL addresses that appear in it.
All of these URL addresses can be added directly to the to-be-detected list, so that the corresponding page data can later be detected based on the list.
Alternatively, while analyzing the access data, the detection server can determine the access frequency of each URL address and add the URL addresses to the to-be-detected list in descending order of access frequency, so that URL addresses with higher access frequency are detected first.
Second implementation: determine hot URL addresses from the access data.
To improve the detection efficiency of page data, page data detection can be carried out for only some of the URL addresses. To this end, a preset quantity can be set, indicating how many URL addresses have their page data detected in one detection cycle.
The detection server 200 can analyze every URL address access record in the access data and obtain all URL addresses that appear in it together with the access frequency of each.
The URL addresses that appear in the access data differ in access frequency; those with higher access frequency are hot URL addresses. Anomaly detection can therefore be performed on the page data of the hot URL addresses.
Accordingly, after obtaining all URL addresses and their access frequencies, the detection server can sort by access frequency and select the preset quantity of URL addresses in descending order of access frequency.
The detection server 200 adds the selected URL addresses to the to-be-detected list, so that the corresponding page data can later be detected based on the list.
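Counting and ranking by access frequency, as in the first and second implementations, can be sketched with Python's standard `collections.Counter`. The function name `select_hot_urls` and the sample URL addresses are hypothetical; the input is assumed to be the plain list of URL addresses taken from the access records of one detection cycle.

```python
from collections import Counter


def select_hot_urls(accessed_urls, preset_quantity=None):
    """Rank URL addresses by access frequency, highest first.

    With preset_quantity=None every URL address is returned (first
    implementation, ordered list); with a number, only that many of the
    most frequently accessed URL addresses are returned (second implementation).
    """
    counts = Counter(accessed_urls)
    return [url for url, _ in counts.most_common(preset_quantity)]


accessed = ["/a", "/b", "/a", "/c", "/b", "/a"]
hot = select_hot_urls(accessed, preset_quantity=2)
everything = select_hot_urls(accessed)
```

Here `/a` is accessed three times, `/b` twice, and `/c` once, so a preset quantity of 2 keeps `/a` and `/b`.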
Third implementation: determine key URL addresses from the access data.
To improve the detection efficiency of page data, page data detection can be carried out for only some of the URL addresses. To this end, a preset quantity can be set, indicating how many URL addresses have their page data detected in one detection cycle.
Step 1: analyze the access data for the detection cycle to determine the URL addresses that appear in the multiple access information sets, the access frequency of each URL address, and the attribute information of each URL address.
The attribute information of a URL address includes the access time, the source IP address of the access (the IP address of the terminal that sent the access request), the page level of the URL address (home page, second-level page, third-level page, and so on), and similar information.
Step 2: determine the weight of each URL address based on its attribute information.
The weight calculation can vary with the content of the attribute information. Taking page level as an example, the smaller the page level of a URL address, the higher its weight should be.
The specific weight calculation depends on the concrete application scenario; no limitation is imposed here.
Step 3: calculate the product of each URL address's access frequency and weight.
The detection server can calculate, for each URL address, the product of its access frequency and its weight. The size of the product indicates the importance of the URL address: the larger the product, the more important the URL address; the smaller the product, the less important.
Step 4: select multiple URL addresses in descending order of the product of access frequency and weight.
The detection server can sort the URL addresses by product and then select the preset quantity of URL addresses in descending order of product.
The detection server 200 adds the selected URL addresses to the to-be-detected list, so that the corresponding page data can later be detected based on the list.
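Steps 1 to 4 can be sketched as below. The application leaves the weight calculation open, so the `LEVEL_WEIGHTS` table (3.0 for the home page, 2.0 for second-level pages, 1.0 for third-level pages) is purely an assumed example, as are the function name `select_key_urls` and the default of treating unknown pages as third-level.

```python
from collections import Counter

# Hypothetical weights: a smaller page level gets a higher weight.
LEVEL_WEIGHTS = {1: 3.0, 2: 2.0, 3: 1.0}


def select_key_urls(accessed_urls, page_levels, preset_quantity):
    """Rank URL addresses by access frequency times weight and keep the top ones."""
    counts = Counter(accessed_urls)
    scores = {
        # URL addresses with no known page level are treated as third-level pages.
        url: count * LEVEL_WEIGHTS.get(page_levels.get(url, 3), 1.0)
        for url, count in counts.items()
    }
    ranked = sorted(scores, key=lambda url: scores[url], reverse=True)
    return ranked[:preset_quantity]


accessed = ["/"] * 2 + ["/list"] * 2 + ["/item/1"] * 5
levels = {"/": 1, "/list": 2, "/item/1": 3}
key_urls = select_key_urls(accessed, levels, preset_quantity=2)
```

In this example the products are 6.0 for `/`, 4.0 for `/list`, and 5.0 for `/item/1`, so the home page outranks the more frequently accessed item page, which is the effect the weighting is meant to have.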
The three implementations can be combined freely. For example, the second and third implementations can be combined: the detection server determines the weight of each URL address from both its attribute information and its access frequency. Access frequency is thus treated as one more attribute of the URL address, and the weight is determined from more attribute information, making it more accurate.
Returning to Fig. 1b, step S104: the detection server 200 detects the page data corresponding to the URL addresses in the to-be-detected list and determines the URL addresses whose page data is abnormal.
After determining the to-be-detected list in step S103, the detection server 200 can detect the page data corresponding to the URL addresses in the list one by one.
For each URL address, the detection server 200 sends a page data acquisition instruction containing the URL address to the client server 100; the client server 100 looks up the page data corresponding to the URL address and sends it to the detection server 200; the detection server 200 detects the page data and records the URL address if the page data is abnormal.
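The per-URL loop of step S104 can be sketched as follows. The two callables are placeholders: `fetch_page` stands in for the exchange of a page data acquisition instruction and its reply, and `is_abnormal` stands in for the detection itself, which the application treats as existing technology. The empty-page check used in the demonstration is an assumption chosen only to make the sketch runnable.

```python
def detect_pages(detection_list, fetch_page, is_abnormal):
    """Fetch and check each URL address in order; return those with abnormal page data."""
    abnormal_urls = []
    for url in detection_list:
        page_data = fetch_page(url)   # placeholder for the request to the client server
        if is_abnormal(page_data):    # placeholder for the actual detection
            abnormal_urls.append(url)
    return abnormal_urls


# Stand-in page store and a toy check that flags empty pages.
pages = {"/index.html": "<h1>Welcome</h1>", "/cart": ""}
abnormal = detect_pages(
    ["/index.html", "/cart"],
    fetch_page=lambda url: pages.get(url, ""),
    is_abnormal=lambda data: not data.strip(),
)
```

Passing the fetch and check steps in as functions keeps the loop the same whether detection runs on the detection server or on the client server itself.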
For the hot URL addresses determined by the second implementation in step S103 and the key URL addresses determined by the third implementation, the detection server can increase the detection frequency as the actual scenario requires, so that anomalies in the page data of hot and key URL addresses are found promptly.
The process by which the detection server detects page data is mature technology and is not described further here.
Step S105: the detection server 200 sends the URL addresses whose page data is abnormal to the client server 100.
After detecting the page data corresponding to the URL addresses, the detection server 200 sends the URL addresses whose page data is abnormal to the client server 100.
Step S106: the client server 100 receives and displays the URL addresses whose page data is abnormal.
The client server 100 receives and displays the URL addresses whose page data is abnormal, so that the page data of those URL addresses can subsequently be corrected.
This embodiment can achieve the following beneficial effects:
The application records access data of URL addresses while the website is running and determines URL addresses from that access data. Because the access data is generated when users actually access the website, the URL addresses determined by the application are ones that users have actually accessed.
Because the application detects only the page data of URL addresses that users have actually accessed, and does not detect the page data of URL addresses that need no detection, it filters out the low-value URL addresses and can therefore improve the detection efficiency of page data.
According to another embodiment provided by the application, a data detection system is provided. In this embodiment the client server performs the detection process itself.
The system includes a client server, configured to record access data of URL addresses while the website is running, determine URL addresses from the access data, look up the page data corresponding to the URL addresses, and detect that page data.
According to another embodiment provided by the application, a data detection method is provided. Referring to Fig. 2, the method includes the following steps:
Step S201: record access data of URL addresses while the website is running.
Case 1: the client server provides detection for all website data.
The client server is the server corresponding to the website. While the website is running, terminals can access all data of the website, and the client server can record the access records of all such accesses.
Case 2: the client server provides detection for part of the website data.
Depending on the needs of the application scenario, the client server may detect only part of the website data. For example, when website data is updated on the client server, detection may focus on the updated data. In this case, the client server may record access records only for that part of the website data while the website is running.
In both cases, a terminal accessing the website sends an access request for a URL address to the client server 100. The client server 100 can store URL address access records in a log, or obtain and store them using a script.
Step S202: determine URL addresses from the access data of URL addresses.
Because page data does not change in real time and page data detection consumes considerable resources, the client server can run page data detection at a certain detection cycle.
When the client server determines that a detection cycle has been reached, it uses the access times in the stored access data to determine the access data recorded during this detection cycle.
Take a detection cycle of 1 hour as an example and assume the detection moment is 13:00. Among the stored URL address access records, the client server looks up those whose access time falls between 12:00 and 13:00 and uses them as the access data for this detection cycle.
Three implementations of determining URL addresses from the access data are described below:
First implementation: determine all URL addresses from the access data.
While the website is running, the client server records the URL address access records within the detection cycle, analyzes them, and determines the URL addresses that appear in them.
All of these URL addresses can be added directly to the to-be-detected list, so that the corresponding page data can later be detected based on the list.
Alternatively, while analyzing the access data, the client server can determine the access frequency of each URL address and add the URL addresses to the to-be-detected list in descending order of access frequency, so that URL addresses with higher access frequency are detected first.
Second implementation: determine hot URL addresses from the access data.
To improve the detection efficiency of page data, page data detection can be carried out for only some of the URL addresses. To this end, a preset quantity can be set, indicating how many URL addresses have their page data detected in one detection cycle.
While the website is running, the client server records the URL address access records within the detection cycle, analyzes them to determine the URL addresses that appear and the access frequency of each, and selects multiple URL addresses in descending order of access frequency.
The URL addresses that appear in the access data differ in access frequency; those with higher access frequency are hot URL addresses. Anomaly detection can therefore be performed on the page data of the hot URL addresses.
The client server adds the selected URL addresses to the to-be-detected list, so that the corresponding page data can later be detected based on the list.
Third implementation: key URLs are determined from the URL access data.
Step 11: record the access records of multiple URLs within the detection cycle while the website is running.
Step 12: analyze the access records of the multiple URLs to determine the URLs that appear in them, the visit frequency of each URL, and the attribute information of each URL.
Step 13: determine the weight of each URL based on the attribute information and the visit frequency of the URL, or based on the attribute information of the URL alone.
Step 14: select multiple URLs in descending order of the product of each URL's visit frequency and its weight.
For the execution process of the third implementation, refer to the execution process of the third implementation in the embodiment corresponding to Fig. 1b.
The client server adds the determined URLs to the to-be-detected list, so that the page data corresponding to those URLs can subsequently be detected based on the list.
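Steps 11 to 14 can be sketched as follows for the variant in which the weight depends on the attribute information alone. This is only an illustration under stated assumptions: the attribute is taken to be a page-type label, and the `ATTRIBUTE_WEIGHTS` table, the default weight, and the function name `select_key_urls` are all hypothetical; the application does not specify how a weight is derived from an attribute.

```python
from collections import Counter

# Hypothetical mapping from a URL attribute (here a page-type label) to a weight.
ATTRIBUTE_WEIGHTS = {"checkout": 5.0, "product": 2.0, "help": 0.5}


def select_key_urls(access_records, preset_quantity, default_weight=1.0):
    """access_records: iterable of (url, page_type) tuples from one detection cycle."""
    frequency = Counter()
    attribute = {}
    # Step 12: determine the URLs, their visit frequency, and their attribute.
    for url, page_type in access_records:
        frequency[url] += 1
        attribute[url] = page_type
    # Step 13: derive a weight from the attribute (assumed lookup table).
    # Step 14: score each URL by visit frequency multiplied by weight.
    scores = {
        url: count * ATTRIBUTE_WEIGHTS.get(attribute[url], default_weight)
        for url, count in frequency.items()
    }
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:preset_quantity]


records = [
    ("https://example.com/help", "help"),
    ("https://example.com/help", "help"),
    ("https://example.com/help", "help"),
    ("https://example.com/help", "help"),
    ("https://example.com/pay", "checkout"),
    ("https://example.com/item", "product"),
    ("https://example.com/item", "product"),
]
key_urls = select_key_urls(records, 2)
```

In this sample the most visited URL (the help page) is not selected, because its low weight gives it a smaller product than the checkout and product pages. That is the difference between this implementation and selection by visit frequency alone.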
Step S203: detect the page data corresponding to the URLs in the to-be-detected list, and determine the URLs whose page data is abnormal.
After determining the to-be-detected list in step S202, the client server can detect the page data corresponding to the URLs in the list one by one. For each URL, the client server looks up the page data corresponding to the URL and detects it; if the page data is abnormal, the client server records the URL.
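The one-by-one detection in step S203 can be sketched as a simple loop. The application does not say how page data is looked up or what counts as abnormal, so both are passed in as callables here; `fetch_page`, `is_abnormal`, and the sample data are hypothetical.

```python
def detect_pages(to_be_detected, fetch_page, is_abnormal):
    """Return the URLs in `to_be_detected` whose page data is judged abnormal."""
    abnormal_urls = []
    for url in to_be_detected:
        # Look up the page data corresponding to the URL.
        page_data = fetch_page(url)
        # Record the URL if its page data is abnormal.
        if is_abnormal(page_data):
            abnormal_urls.append(url)
    return abnormal_urls


# Hypothetical page store; an empty page body is treated as abnormal in this example.
pages = {"https://example.com/a": "<html>ok</html>", "https://example.com/b": ""}
abnormal = detect_pages(list(pages), pages.get, lambda data: not data)
```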
For the hotspot URLs determined by the second implementation in step S202 and the key URLs determined by the third implementation, the client server can increase the detection frequency according to the needs of the actual scenario, so that abnormal page data corresponding to hotspot URLs and key URLs is discovered in time.
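One way to raise the detection frequency for hotspot and key URLs is to give them a shorter detection interval than other URLs. The sketch below assumes this interval-based approach; the function name `detection_intervals`, the one-hour base interval, and the speed-up factor are illustrative values, not taken from the application.

```python
def detection_intervals(all_urls, priority_urls, base_interval_s=3600, speedup=4):
    """Map each URL to a detection interval in seconds.

    Hotspot and key URLs (`priority_urls`) are detected `speedup` times as often.
    """
    priority = set(priority_urls)
    return {
        url: base_interval_s // speedup if url in priority else base_interval_s
        for url in all_urls
    }


intervals = detection_intervals(
    ["https://example.com/a", "https://example.com/b"],
    ["https://example.com/a"],
)
```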
Step S204: display the URLs whose page data is abnormal.
The client server displays the URLs whose page data is abnormal, so that the page data corresponding to those URLs can subsequently be corrected.
This embodiment can achieve the following beneficial effects:
The application records URL access data while the website is running and determines URLs from that access data. Because the URL access data is generated while users actually visit the website, the URLs determined by the application are URLs that users have actually accessed.
The application detects only the page data corresponding to URLs actually accessed by users and does not detect page data corresponding to URLs that need no detection. Because URLs whose detection has little significance are filtered out, the detection efficiency of page data can be improved.
According to another embodiment provided by the present application, the application further provides a data detection method. Referring to Fig. 3, the method includes the following steps:
Step S301: determine the URL access data while the website is running, and determine URLs from the URL access data.
Step S302: determine the page data corresponding to the URLs.
Step S303: detect the page data corresponding to the URLs.
Step S304: display the URLs whose page data is abnormal.
For the execution process of this embodiment, refer to the execution processes of Fig. 1b and Fig. 2; details are not repeated here.
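Steps S301 to S304 can be sketched end to end as follows. This is a simplified single-process illustration that determines every URL appearing in the access data. The names `run_detection` and `page_store`, and the rule that a missing page counts as abnormal, are assumptions made for the example.

```python
def run_detection(access_records, page_store, is_abnormal):
    """Run one detection pass over the URLs found in `access_records`."""
    # S301: determine the URLs that appear in the access data (deduplicated, order kept).
    urls = list(dict.fromkeys(access_records))
    abnormal_urls = []
    for url in urls:
        # S302: determine the page data corresponding to the URL.
        page_data = page_store.get(url)
        # S303: detect the page data.
        if is_abnormal(page_data):
            abnormal_urls.append(url)
    # S304: display the URLs whose page data is abnormal.
    for url in abnormal_urls:
        print(f"abnormal page data: {url}")
    return abnormal_urls


# Hypothetical data: one URL was accessed by users but has no stored page.
store = {"https://example.com/a": "<html>ok</html>"}
result = run_detection(
    ["https://example.com/a", "https://example.com/x", "https://example.com/a"],
    store,
    lambda data: data is None,
)
```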
If the functions described in the method of this embodiment are implemented in the form of software functional units and sold or used as independent products, they may be stored in a storage medium readable by a computing device. Based on this understanding, the part of the embodiments of the present application that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions that cause a computing device (which may be a personal computer, a server, a mobile computing device, a network device, or the like) to execute all or some of the steps of the methods of the embodiments of the present application. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments; for identical or similar parts, the embodiments may be referred to one another.
The foregoing description of the disclosed embodiments enables those skilled in the art to implement or use the present application. Various modifications to these embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be implemented in other embodiments without departing from the spirit or scope of the application. Therefore, the application is not limited to the embodiments shown herein, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims (12)

1. A data detection system, characterized by comprising:
a client server, configured to record URL access data while the website is running and send the URL access data to a detection server; receive a page data acquisition instruction containing a URL sent by the detection server; look up the page data corresponding to the URL; and send the page data corresponding to the URL to the detection server;
a detection server, configured to receive the URL access data sent by the client server; determine a URL from the URL access data; send a page data acquisition instruction containing the URL to the client server; receive the page data corresponding to the URL sent by the client server; and detect the page data corresponding to the URL.
2. A data detection method, characterized by comprising:
determining a URL from URL access data;
determining the page data corresponding to the URL;
detecting the page data corresponding to the URL.
3. The method according to claim 2, characterized in that determining a URL from the URL access data comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes the access records of multiple URLs;
analyzing the multiple access information sets to determine the URLs that appear in the multiple access information sets.
4. The method according to claim 2, characterized in that determining a URL from the URL access data comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes the access records of multiple URLs;
analyzing the multiple access information sets to determine the URLs that appear in the multiple access information sets and the visit frequency of each URL;
selecting multiple URLs in descending order of visit frequency.
5. The method according to claim 2, characterized in that determining a URL from the URL access data comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes the access records of multiple URLs;
analyzing the multiple access information sets to determine the URLs that appear in the multiple access information sets, the visit frequency of each URL, and the attribute information of each URL;
determining the weight of each URL based on the attribute information of the URL;
selecting multiple URLs in descending order of the product of each URL's visit frequency and its weight.
6. The method according to claim 2, characterized in that determining a URL from the URL access data comprises:
obtaining multiple access information sets within a detection cycle, wherein each access information set includes the access records of multiple URLs;
analyzing the multiple access information sets to determine the URLs that appear in the multiple access information sets, the visit frequency of each URL, and the attribute information of each URL;
determining the weight of each URL based on the attribute information of the URL and the visit frequency of the URL;
selecting multiple URLs in descending order of the product of each URL's visit frequency and its weight.
7. The method according to claim 3, 4 or 5, characterized by further comprising:
increasing the detection frequency of the selected URLs.
8. The method according to any one of claims 2-6, characterized by further comprising:
while detecting the page data corresponding to a URL, recording the URL if the page data corresponding to the URL is abnormal;
after detecting the page data corresponding to the URLs, sending the recorded URLs.
9. A data detection method, characterized by comprising:
recording URL access data while the website is running and sending the URL access data to a detection server;
after receiving a page data acquisition instruction containing a URL sent by the detection server, looking up the page data corresponding to the URL;
sending the page data corresponding to the URL to the detection server.
10. The method according to claim 8, characterized in that recording URL access data while the website is running and sending the URL access data to the detection server comprises:
recording, while the website is running, the access records of multiple URLs within a sending cycle as one URL access information set;
sending the URL access information set to the detection server.
11. The method according to claim 8, characterized by further comprising:
receiving, from the detection server, the URLs whose page data is abnormal;
displaying the URLs whose page data is abnormal.
12. A data detection system, characterized by comprising:
a client server, configured to record URL access data while the website is running, determine a URL from the URL access data, look up the page data corresponding to the URL, and detect the page data corresponding to the URL.
CN201710888265.3A 2017-09-27 2017-09-27 Data detection method and system, server Pending CN109560979A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710888265.3A CN109560979A (en) 2017-09-27 2017-09-27 Data detection method and system, server


Publications (1)

Publication Number Publication Date
CN109560979A true CN109560979A (en) 2019-04-02

Family

ID=65863892

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710888265.3A Pending CN109560979A (en) 2017-09-27 2017-09-27 Data detection method and system, server

Country Status (1)

Country Link
CN (1) CN109560979A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008204425A (en) * 2007-01-26 2008-09-04 Yahoo Japan Corp Processing omission decision program for similarity analysis of url
CN106161427A (en) * 2016-06-08 2016-11-23 北京兰云科技有限公司 A kind of web page processing method, network analhyzer and http server
CN106899549A (en) * 2015-12-18 2017-06-27 北京奇虎科技有限公司 A kind of network security detection method and device



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190402
