CN106453207B - Advertisement material data website verification method and device - Google Patents

Advertisement material data website verification method and device

Info

Publication number
CN106453207B
Authority
CN
China
Prior art keywords
verification
website
material data
websites
domain name
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510484812.2A
Other languages
Chinese (zh)
Other versions
CN106453207A (en)
Inventor
潘青
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201510484812.2A priority Critical patent/CN106453207B/en
Publication of CN106453207A publication Critical patent/CN106453207A/en
Application granted granted Critical
Publication of CN106453207B publication Critical patent/CN106453207B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/02 Network architectures or network communication protocols for network security for separating internal from external traffic, e.g. firewalls
    • H04L63/0227 Filtering policies
    • H04L63/0236 Filtering by address, protocol, port number or service, e.g. IP-address or URL
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L63/00 Network architectures or network communication protocols for network security
    • H04L63/10 Network architectures or network communication protocols for network security for controlling access to devices or network resources
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241 Advertisements

Abstract

The invention discloses a method and a device for verifying the web addresses of advertisement material data, and relates to the technical field of advertising. The method comprises the following steps: acquiring the web addresses of unverified advertisement material data; for web addresses belonging to the same main domain name, extracting that main domain name from the web addresses; verifying each main domain name and recording the verification result; verifying the web address of each piece of advertisement material data; when a web address fails verification, looking up the verification result of the main domain name corresponding to that web address; and if that result shows that the main domain name passed verification, confirming that the web address passes verification. The method and the device reduce the probability of misjudgment caused by a website firewall intercepting verification requests while the web addresses of advertisement material data are being verified, so that the corresponding advertisement material data can go online normally.

Description

Advertisement material data website verification method and device
Technical Field
The invention relates to the technical field of advertisements, in particular to a method and a device for verifying advertisement material data websites.
Background
On an advertisement platform, each advertiser (merchant) registers an advertisement account. The merchant can then log in to the advertisement platform from its client and upload edited advertisement material data. Advertisement material data can be understood as data comprising the advertisement content, such as text and pictures, together with a corresponding URL (Uniform Resource Locator). In practice, to ensure that advertisement material data can be accessed normally once it is online, and to improve the advertisement recall rate, the advertisement platform verifies the web addresses of all uploaded advertisement material data. Only after the web address of a piece of advertisement material data is verified to be connected does the platform bring that data online, so that it can be retrieved and displayed.
However, in practice a website may be protected by a firewall, one function of which is to defend against network traffic attacks. For example, for a website at a certain IP address, if the number of requests the server receives from a client at a single IP address within a short time exceeds a first threshold, that client IP address is blocked. An advertisement platform has a large number of advertisement material URLs to verify, so it sends network requests at a high frequency, and many of those URLs may belong to websites at the same IP address. If the platform sends more access requests to a website at one IP address in a short time than the firewall allows, the platform may be blocked by that website's firewall.
If a URL is actually accessible but the platform's IP address has been blocked during verification by the server hosting the URL, the URL is judged to be disconnected and the platform does not bring the corresponding advertisement material data online. For the advertiser, the advertisement then cannot go online, cannot be retrieved on the advertisement platform, and cannot be displayed to clients; for the advertisement platform, this amounts to an erroneous verification.
Disclosure of Invention
In view of the above problems, the present invention has been made to provide an advertising material data website verification apparatus and a corresponding advertising material data website verification method that overcome or at least partially solve the above problems.
According to one aspect of the invention, the invention discloses a method for verifying the website of advertisement material data, which comprises the following steps:
acquiring websites of the unverified advertising material data;
extracting a main domain name from the websites of the advertisement material data for the websites belonging to the same main domain name;
verifying each main domain name and recording verification results;
verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website;
and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
Preferably, the verifying the website of each advertisement material data includes:
dividing the websites of the advertising material data with the same IP address into a verification group according to the IP address corresponding to the website of each advertising material data;
cyclically verifying the obtained verification groups; wherein verifying each verification group comprises: selecting a specified number of web addresses from the unverified web addresses in the verification group for verification.
Preferably, cyclically verifying the obtained verification groups includes:
judging whether any verification group that has not been fully verified exists;
if such a verification group exists, cyclically selecting the next verification group that has not been fully verified and verifying it; wherein the verification group following the last verification group is the first verification group;
and if no such verification group exists, ending the verification.
Preferably, selecting a specified number of web addresses for verification from the unverified web addresses in the verification group includes:
selecting web addresses one by one from the unverified web addresses for verification;
each time a web address has been selected, if the number of selected web addresses reaches the specified number and unverified web addresses still remain, switching to the verification process of the next verification group;
and if no unverified web address remains, removing the corresponding verification group from the cycle and switching to the verification process of the next verification group.
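The cyclic, quota-per-turn selection described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `round_robin_verify`, the `verify` callback, and the `batch_size` parameter (standing in for the "specified number") are hypothetical.

```python
from collections import deque
from typing import Callable, Dict, List


def round_robin_verify(
    groups: Dict[str, List[str]],
    verify: Callable[[str], bool],
    batch_size: int = 2,
) -> Dict[str, bool]:
    """Verify URLs group by group, at most batch_size per turn (hypothetical helper)."""
    results: Dict[str, bool] = {}
    # Each entry pairs a group key with its remaining unverified URLs, in order.
    pending = deque((key, deque(urls)) for key, urls in groups.items() if urls)
    while pending:  # at least one group still has unverified URLs
        key, urls = pending.popleft()
        for _ in range(batch_size):
            if not urls:
                break
            url = urls.popleft()
            results[url] = verify(url)
        if urls:
            # Quota reached with URLs left: the group rejoins the cycle at the end.
            pending.append((key, urls))
        # Otherwise the group is exhausted and drops out of the cycle.
    return results
```

Because a group is revisited only after every other unfinished group has had its turn, requests to any single IP address are spread out in time rather than sent in one burst.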
Preferably, dividing the web addresses of advertisement material data having the same IP address into one verification group, according to the IP address corresponding to the web address of each piece of advertisement material data, includes:
acquiring, for the web address of each piece of advertisement material data, the IP address corresponding to that web address;
dividing the web addresses corresponding to the same IP address into one verification group.
Preferably, dividing the web addresses of advertisement material data having the same IP address into one verification group, according to the IP address corresponding to the web address of each piece of advertisement material data, includes:
dividing the web addresses with the same main domain name into a first group according to the main domain name in the web address of each piece of advertisement material data;
acquiring the IP address corresponding to each main domain name;
merging the first groups corresponding to the same IP address into one verification group.
According to another aspect of the present invention, the present invention discloses an advertisement material data website verification apparatus, comprising:
the acquisition module is suitable for acquiring websites of the unverified advertising material data;
the main domain name extraction module is suitable for extracting a main domain name from websites of the advertisement material data for the websites belonging to the same main domain name;
the main domain name verification module is suitable for verifying each main domain name and recording a verification result;
the website verification module is suitable for verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
Preferably, the website verification module includes:
the IP grouping module is suitable for dividing the websites of the advertising material data with the same IP address into a verification group according to the IP address corresponding to the website of each advertising material data;
the cyclic verification module is suitable for circularly verifying each verification group according to each obtained verification group; wherein, when verifying each verification group, the method comprises the following steps: and selecting a specified number of website data from the unverified websites in the verification group for verification.
Preferably, the loop verification module includes:
the verification group judgment module is suitable for judging whether any verification group that has not been fully verified exists;
the cycle selection module is suitable for cyclically selecting the next verification group that has not been fully verified and verifying it, if such a verification group exists; wherein the verification group following the last verification group is the first verification group;
and the ending module is suitable for ending the verification if no verification group that has not been fully verified exists.
Preferably, the loop verification module includes:
the one-by-one verification module is suitable for selecting websites one by one from unverified websites for verification;
the switching-in judgment module is suitable for switching in the verification process of the next verification group if the selected websites reach the specified number and unverified websites exist after the websites are selected each time;
and the quitting module is suitable for quitting the corresponding verification group from the circulation process and switching to the verification process of the next verification group if the unverified website does not exist.
Preferably, the IP grouping module includes:
the IP address acquisition module is suitable for acquiring an IP address corresponding to the website according to the website of each advertisement material data;
the first IP grouping module is suitable for dividing the web addresses corresponding to the same IP address into one verification group.
Preferably, the IP grouping module includes:
the main domain name grouping module is suitable for dividing the websites with the same main domain name into a first group according to the main domain name in the websites of each advertisement material data;
the main domain name IP acquisition module is suitable for acquiring an IP address corresponding to each main domain name;
and the second IP grouping module is suitable for combining the first groups corresponding to the same IP address into a verification group.
According to the advertisement material data website verification method, for unverified web addresses belonging to the same main domain name, that main domain name can be extracted; connectivity verification is then performed on the main domain name and the result is recorded. During the subsequent verification of the web addresses of the advertisement material data, if a web address fails verification, the verification result of its main domain name is looked up. If the main domain name passed verification, the website hosting the web address is connected and the web address was misjudged as disconnected, so the web address is treated as having passed verification and the corresponding advertisement material data can go online. This addresses the problem that web addresses under the main domain name of an actually accessible website are intercepted by the server's firewall during verification, fail connectivity verification, are misjudged, and keep their advertisement material data from going online. The beneficial effect is a lower probability of misjudgment caused by firewall interception during verification, so that the corresponding advertisement material data can go online normally.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
FIG. 1 is a flow chart illustrating a method for verifying a website of advertisement material data according to an embodiment of the present invention;
FIG. 2 is a flowchart illustrating a method for verifying the website of the advertisement material data according to an embodiment of the present invention;
FIG. 2A illustrates an IP verification packet example of an embodiment of the invention;
FIG. 3 is a flowchart illustrating a method for verifying the website of the advertisement material data according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram illustrating an advertisement material data website verification device according to an embodiment of the present invention;
FIG. 5 is a schematic structural diagram illustrating an apparatus for verifying a website of advertisement material data according to an embodiment of the present invention;
FIG. 6 is a schematic structural diagram illustrating an advertisement material data website verification device according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
One of the core ideas of the embodiment of the invention is that, for unverified web addresses belonging to the same main domain name, that main domain name can be extracted; connectivity verification is then performed on the main domain name and the result is recorded. During the subsequent verification of the web addresses of the advertisement material data, if a web address fails verification, the verification result of its main domain name is looked up. If the main domain name passed verification, the website hosting the web address is connected and the web address was misjudged as disconnected, so the web address is treated as having passed verification and the corresponding advertisement material data can go online. If the main domain name of a website is connected, the web addresses under that main domain name are normally connected as well; the embodiment can therefore reduce the probability of misjudgment caused by firewall interception during verification and ensure that the corresponding advertisement material data can go online normally.
Example one
Referring to fig. 1, a flow diagram illustrating a method for verifying an advertisement material data website according to an embodiment of the present invention is shown, which may specifically include:
step 110, acquiring websites of the unverified advertisement material data;
the embodiment of the invention is applied to an advertisement platform, the advertisement platform can receive the advertisement accounts registered by all advertisement putting parties, and the advertisement putting parties can be understood as merchants. Then each merchant can log in the advertisement platform through the advertisement account, and the advertisement material data is uploaded in the advertisement account.
Wherein, the advertisement platform can be understood as an advertisement server or an advertisement server cluster.
The advertisement material data may include advertisement content and a URL, and the advertisement content may include text, pictures and other data. The advertisement content is used for displaying specific content in a webpage of the client, and the URL is used for guiding the webpage to jump to a target webpage after the user clicks the advertisement content.
For advertisement material data newly uploaded by a user, once the advertisement platform brings it online it is delivered to clients for display, and a user who clicks the displayed advertisement must be able to jump normally to the page at the corresponding URL; this is what makes the advertisement material data effective. If the URL cannot be reached after the user clicks the displayed advertisement in the client, the advertisement material data is effectively invalid and wastes the user's time and effort.
Therefore, the advertisement platform needs to verify the connectivity of the website of each advertisement material data, and the website can be released online after being verified to be connected.
In the advertisement platform, newly uploaded advertisement material data is stored in a basic database, which holds the unverified advertisement material data. The advertisement account is used as the primary key under which the data is stored. If the user has set up several advertisement groups within the advertisement account and uploads advertisement material data into those groups, the database stores the data with the advertisement account as the primary key and the advertisement group as a secondary key.
The embodiment of the present invention may then extract the web addresses of the unverified advertisement material data from the basic database. Extraction proceeds per piece of advertisement material data, so the number of web addresses extracted equals the number of pieces of advertisement material data. In addition, the advertisement material data corresponding to each extracted web address is recorded.
Step 120, extracting a main domain name from the websites of the advertisement material data for the websites belonging to the same main domain name;
In practice, the URLs of the advertisement material data that a merchant uploads under one or more registered advertisement accounts may all be URLs under a single main domain name. For example, the main domain name of http://www.tuniu.com/guide/d-ouzhou-3600/, http://www.tuniu.com/g 3600/sources-bj-0/, and http://www.tuniu.com/g 3600/pkg-sh-0/ is www.tuniu.com.
In practice, if a main domain name, for example http://www.tuniu.com/, is connected, the URLs under that main domain name are normally connected as well. The invention therefore extracts the main domain name of each URL, normalizes it, and stores each main domain name.
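As a rough sketch of the extraction and normalization in step 120, assuming (as in the www.tuniu.com example) that the main domain name is simply the host part of the URL. The helper names `main_domain` and `group_by_main_domain` are hypothetical, and normalization here means only lower-casing the host and dropping a trailing dot.

```python
from typing import Dict, Iterable, List
from urllib.parse import urlsplit


def main_domain(url: str) -> str:
    """Return the normalized host part of a URL, e.g. 'www.tuniu.com'."""
    host = urlsplit(url).hostname  # lower-cased by urlsplit; None if there is no host
    if not host:
        raise ValueError(f"no host in URL: {url!r}")
    return host.rstrip(".")


def group_by_main_domain(urls: Iterable[str]) -> Dict[str, List[str]]:
    """Map each main domain name to the URLs under it, keeping input order."""
    groups: Dict[str, List[str]] = {}
    for url in urls:
        groups.setdefault(main_domain(url), []).append(url)
    return groups
```

The keys of the returned mapping are the distinct main domain names that step 130 would then verify once each.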
Step 130, verifying each main domain name and recording verification results;
performing connectivity verification on each main domain name obtained by normalization in step 120, and recording a verification result, where the verification result may include: connected and not connected. Connected means passed, and disconnected means failed.
Of course, in the embodiment of the present invention each main domain name may be re-verified at intervals, so that the recorded result keeps up with changes in connectivity caused by changes to the website hosting the main domain name. For example, main domain name A is disconnected when verified at time a, and its verification result is recorded as disconnected; when main domain name A is verified again some time later and found to be connected, its verification result is changed to connected.
The verification and recording in step 130 may be performed on a reference server, which can verify the main domain names at a low frequency, for example one every 10 seconds; a frequency this low is generally not restricted by website firewalls.
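A low-frequency pass over the main domain names might look like the sketch below. The function name, the injected `check` callback, and the injectable `sleep` are assumptions made for illustration; the 10-second default simply mirrors the example frequency given in the text.

```python
import time
from typing import Callable, Dict, Iterable


def verify_domains_slowly(
    domains: Iterable[str],
    check: Callable[[str], bool],
    interval_seconds: float = 10.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, bool]:
    """Check one main domain name at a time, pausing between checks."""
    results: Dict[str, bool] = {}
    for index, domain in enumerate(domains):
        if index > 0:
            sleep(interval_seconds)  # keep the request rate low
        results[domain] = check(domain)  # True = connected, False = not connected
    return results
```

Running the same function again later and overwriting the stored results would give the periodic re-verification described above.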
Step 140, verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
In the embodiment of the invention, for the websites of the advertisement material data, connectivity verification is carried out on the websites one by one.
The advertisement platform determines whether the verification of each web address passes. If a web address passes verification, the corresponding advertisement material data can go online. If a web address fails verification, the verification result of its main domain name is looked up, and that result decides whether the web address is confirmed as passing: if the main domain name is recorded as connected, the main domain name passed verification, and the initial result for the web address is changed to passed. If the main domain name is recorded as disconnected, the main domain name failed verification, and the web address continues to be treated as having failed.
For example, suppose the verification result recorded for www.tuniu.com in step 130 is connected. In step 140, the advertisement platform finds that http://www.tuniu.com/guide/d-ouzhou-3600/ fails verification. It then looks up the verification result of the corresponding main domain name, www.tuniu.com, finds that the recorded result is connected, and therefore determines that http://www.tuniu.com/guide/d-ouzhou-3600/ passes verification.
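The fallback decision in step 140 reduces to a small lookup. The sketch below assumes the main-domain results from step 130 are available as a dictionary keyed by host name; the name `final_verdict` and the choice to treat a missing record as "failed" are illustrative assumptions.

```python
from typing import Dict
from urllib.parse import urlsplit


def final_verdict(url: str, url_passed: bool, domain_results: Dict[str, bool]) -> bool:
    """Combine a URL's own check with the recorded result for its main domain name."""
    if url_passed:
        return True  # the URL itself is connected; no fallback needed
    host = urlsplit(url).hostname
    # A failed URL is upgraded to "passed" only if its main domain name passed.
    # Assumption: a domain with no recorded result counts as failed.
    return bool(host) and domain_results.get(host, False)


# Usage mirroring the example in the text:
recorded = {"www.tuniu.com": True}
print(final_verdict("http://www.tuniu.com/guide/d-ouzhou-3600/", False, recorded))
```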
In the embodiment of the present invention, connectivity verification of a main domain name or a URL may be performed by initiating an HTTP (Hypertext Transfer Protocol) request to the URL and then judging connectivity from the HTTP response received: the verification passes if the URL is connected and fails if it is not. For example, HTTP responses in the 4XX and 5XX series both indicate not connected, and responses in the 2XX series indicate connected.
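A minimal connectivity check along these lines, using only the Python standard library, is sketched below. The names `is_connected` and `check_url` and the 5-second timeout are assumptions. The text only classifies 2XX, 4XX, and 5XX responses; this sketch treats every non-2XX final status, and every network-level failure, as not connected. Note that `urlopen` follows redirects on its own, so the status examined is that of the final response.

```python
import urllib.error
import urllib.request


def is_connected(status: int) -> bool:
    """2XX means connected; 4XX, 5XX (and, by assumption, anything else) do not."""
    return 200 <= status < 300


def check_url(url: str, timeout: float = 5.0) -> bool:
    """Send an HTTP GET and classify the outcome as connected or not."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return is_connected(response.status)
    except urllib.error.HTTPError as err:
        # 4XX and 5XX responses are raised as HTTPError; err.code is the status.
        return is_connected(err.code)
    except (urllib.error.URLError, OSError, ValueError):
        # DNS failure, refused connection, timeout, or a malformed URL.
        return False
```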
In an embodiment of the present invention, the web addresses in step 140 may be verified on a verification server separate from the reference server mentioned in step 130. When a web address fails verification, the verification result of its main domain name is obtained from the reference server. If the main domain name passed verification, the web address is considered to pass; otherwise the web address does not pass.
In the embodiment of the invention, for unverified web addresses belonging to the same main domain name, that main domain name can be extracted; connectivity verification is then performed on the main domain name and the result is recorded. During the subsequent verification of the web addresses of the advertisement material data, if a web address fails verification, the verification result of its main domain name is looked up. If the main domain name passed verification, the website hosting the web address is connected and the web address was misjudged as disconnected, so the web address is treated as having passed verification and the corresponding advertisement material data can go online. If the main domain name of a website is connected, the web addresses under that main domain name are normally connected as well; the embodiment can therefore reduce the probability of misjudgment caused by firewall interception during verification and ensure that the corresponding advertisement material data can go online normally.
Example two
Referring to fig. 2, a flow diagram illustrating a method for verifying an advertisement material data website according to an embodiment of the present invention is shown, which may specifically include:
step 210, acquiring websites of the unverified advertising material data;
step 220, extracting a main domain name from the websites of the advertisement material data for the websites belonging to the same main domain name;
step 230, verifying each main domain name and recording verification results;
Steps 210-230 are similar to the corresponding steps of Example one and are not described again here.
Step 240, dividing the web addresses of the advertisement material data with the same IP address into a verification group according to the IP addresses corresponding to the web addresses of the advertisement material data;
In practice, each URL has a corresponding IP address, so the embodiment of the present invention may group the web addresses of the advertisement material data by the IP addresses corresponding to their URLs. In this way, the web addresses of advertisement material data pointing to the same website are, as far as possible, placed in one verification group.
Preferably, dividing the web addresses of advertisement material data having the same IP address into one verification group in step 240, according to the IP address corresponding to the web address of each piece of advertisement material data, includes:
a substep 241 of obtaining an IP address corresponding to the website according to the website of each advertisement material data;
the method for acquiring the IP address corresponding to the website can be realized by the following steps:
substep a11, for each URL, constructing a DNS request;
substep a12, sending a DNS request to a DNS server;
substep a13, receiving the IP address returned by the DNS server and associating the IP address with the URL.
To access a URL, a client must first obtain the corresponding IP address through DNS (Domain Name System); it can then send the actual access request to the server corresponding to the URL to obtain the URL's resource.
In the embodiment of the invention, the advertisement platform constructs a DNS request for each URL, and then sends the DNS request to the DNS server, so that the IP address corresponding to the URL can be obtained from the DNS server.
Of course, in the embodiment of the present invention, a URL for which no IP address is obtained need not be assigned to any group. The connectivity verification of its advertisement material data is treated as failed: because no IP address can be found for the URL, the URL cannot be accessed, and the corresponding advertisement material data cannot go online.
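Substeps a11 to a13 can be approximated with the operating system's resolver, as in the sketch below. This does not build DNS packets by hand; it delegates to `socket.gethostbyname`, which returns one IPv4 address. The function name `resolve_ip` and the injectable `lookup` parameter are assumptions for illustration, and returning `None` models the "no IP address obtained" case described above.

```python
import socket
from typing import Callable, Optional
from urllib.parse import urlsplit


def resolve_ip(
    url: str,
    lookup: Callable[[str], str] = socket.gethostbyname,
) -> Optional[str]:
    """Return the IP address for the URL's host, or None if it cannot be resolved."""
    host = urlsplit(url).hostname
    if not host:
        return None
    try:
        return lookup(host)
    except OSError:  # socket.gaierror (resolution failure) is a subclass of OSError
        return None
```

A URL for which this returns `None` would be left out of every verification group and marked as failing verification.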
Substep 242, the web address corresponding to the same IP address is divided into a verification packet.
Each URL of the advertisement material data has a corresponding IP address, and the URLs are grouped by that address: URLs that share an IP address are placed in the same verification group. As shown in fig. 2A, the verification groups are IP address 1, IP address 2, and so on; URL11, URL12, URL13, etc. belong to the verification group "IP address 1", and URL21, URL22, etc. belong to the verification group "IP address 2".
In the embodiment of the invention, the web addresses in the verification packet are also arranged in sequence.
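Sub-steps 241 and 242 can be illustrated with a minimal sketch. It is not the patented implementation: the function names `resolve_ip` and `group_by_ip` and the injectable `lookup` parameter are assumptions made for this illustration, and the standard-library call `socket.gethostbyname` stands in for the DNS request of sub-steps A11 to A13. URLs that cannot be resolved are kept out of every group, as described above.

```python
import socket
from collections import defaultdict
from urllib.parse import urlsplit


def resolve_ip(url, lookup=socket.gethostbyname):
    """Return the IP address of the URL's host, or None if the lookup fails."""
    host = urlsplit(url).hostname
    if not host:
        return None
    try:
        return lookup(host)
    except OSError:  # socket.gaierror is a subclass of OSError
        return None


def group_by_ip(urls, lookup=socket.gethostbyname):
    """Sub-steps 241-242: place URLs that share an IP address in one group.

    Returns (groups, unresolved). Each group keeps its URLs in input order;
    unresolved URLs are not grouped and would fail connectivity verification.
    """
    groups = defaultdict(list)
    unresolved = []
    for url in urls:
        ip = resolve_ip(url, lookup)
        if ip is None:
            unresolved.append(url)
        else:
            groups[ip].append(url)
    return dict(groups), unresolved
```

Passing a custom `lookup` (a hypothetical hook, not something the text prescribes) lets the grouping be exercised without network access.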
Preferably, dividing the web address of the advertisement material data of the same IP address into one verification packet according to the IP address corresponding to the web address of each advertisement material data, includes:
substep 243, dividing the websites with the same main domain name into a first group according to the main domain name in the websites of each advertisement material data;
In practical applications, each URL has a main domain name. For example, the aforementioned URLs http://www.tuniu.com/guide/d-ouzhou-3600/, http://www.tuniu.com/g3600/sources-bj-0/ and http://www.tuniu.com/g3600/pkg-sh-0/ all have the main domain name www.tuniu.com.
The present invention can place URLs that have the same main domain name into one first group, and each first group is identified by its main domain name. For example, the three URLs above may be placed in the first group www.tuniu.com.
Substep 244, obtaining an IP address corresponding to each main domain name according to the main domain name;
For the first groups described above, each first group has one main domain name, so the IP address of that main domain name can be obtained.
In practical applications, a DNS request may be constructed for the main domain name, and then sent to the DNS server, and the corresponding IP address is obtained from the DNS server.
Substep 245 combines the first packets corresponding to the same IP address into one authentication packet.
In practical applications, many main domain names may point to the same IP address, and then the embodiment of the present invention may combine the first packets of the same IP address into the same verification packet.
Sub-steps 243 to 245 reduce the number of IP address lookups. For example, for the three www.tuniu.com URLs above, obtaining the IP address of each URL directly takes 3 lookups; after the first grouping, only the IP address of www.tuniu.com needs to be obtained, so a single lookup suffices.
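The saving described for sub-steps 243 to 245 can be sketched as follows. This is an illustration under stated assumptions: the function name `group_by_domain_then_ip` is hypothetical, the main domain name is taken to be the host part of the URL (as in the www.tuniu.com example), and `lookup` is any caller-supplied function that maps a domain name to an IP address.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def group_by_domain_then_ip(urls, lookup):
    """Group URLs by main domain name first, then merge first groups by IP."""
    # Sub-step 243: URLs with the same main domain name form one first group.
    first_groups = defaultdict(list)
    for url in urls:
        first_groups[urlsplit(url).hostname].append(url)

    # Sub-steps 244-245: one lookup per main domain name; first groups that
    # resolve to the same IP address are merged into one verification group.
    verification_groups = defaultdict(list)
    for domain, members in first_groups.items():
        verification_groups[lookup(domain)].extend(members)
    return dict(verification_groups)
```

With this arrangement the number of lookups equals the number of distinct main domain names rather than the number of URLs.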
In practical applications, the same advertiser may have several IP addresses. So that, as far as possible, the IP addresses of one advertiser do not sit consecutively in the verification group queue, the verification groups of the respective IP addresses may be ordered randomly.
Step 250, cyclically verifying the obtained verification groups; wherein verifying each verification group comprises: selecting a specified number of URLs from the unverified URLs in the verification group for verification; when a URL fails verification, looking up the verification result of the main domain name corresponding to that URL; and if that verification result indicates a pass, confirming that the URL passes verification.
For example, if there are 10 verification groups, verification starts from the 1st group, and 10 unverified URLs of advertisement material data are selected from it for verification; the process then enters the 2nd verification group and selects 10 unverified URLs from it for verification; and so on. After the 10th verification group, the process loops back to the 1st verification group, and the loop continues until every unverified URL in all verification groups has been verified.
Of course, if during verification the number of unverified URLs of advertisement material data in a verification group is smaller than the specified number, the number actually remaining is selected for verification.
In the embodiment of the present invention, the specified number may be set as needed. It is of a small order of magnitude and generally does not go beyond the hundreds, so that one round of the loop over all verification groups completes quickly.
Preferably, the circularly verifying each verification packet according to the step 250 includes:
a sub-step 251 of determining whether there is a verification group that has not completed verification; if so, go to sub-step 252; if not, go to sub-step 253;
substep 252, selecting the next unverified verification packet for verification; wherein the next verification packet to the last verification packet is the first verification packet;
and sub-step 253, the verification ends.
In the embodiment of the present invention, it may be determined whether any verification group has not yet completed verification. A verification group has not completed verification if it still contains unverified URLs; once all URLs in a verification group have been verified, that group has completed verification.
When the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
In practical applications, each verification group may carry a flag during verification that indicates whether its verification is complete; for example, 0 indicates not complete and 1 indicates complete.
Initially, every verification group contains unverified URLs, so every verification group is marked 0, and verification starts from the first verification group. Each time a verification group is verified, a specified number of URLs is selected from its unverified URLs for verification.
After a verification group has been processed, its flag stays 0 if it still has unverified URLs and is changed to 1 if none remain.
Thus, in the loop, each time the specified number of URLs of one verification group has been verified, the process can return to sub-step 251 to determine whether any verification group has not completed verification. Of course, in the initial case, that is, before the first verification group is verified for the first time, this determination is unnecessary.
Preferably, the selecting a specified number of web address data for verification from the unverified web addresses in the verification packet includes:
substep 254, selecting websites one by one from the unverified websites for verification;
In the embodiment of the invention, the unverified URLs of advertisement material data in each verification group are arranged in sequence, so that each group can be understood as forming a URL queue.
When a verification group is verified, the embodiment of the invention extracts URLs from this queue one by one for verification. For example, suppose verification group A has 100 unverified URLs and the specified number is 10. The first time the loop reaches this group, URLs 1-10 are extracted for verification; the second time, URLs 11-20 are extracted. Other cases follow by analogy.
Wherein, when verifying each URL, the method comprises the following steps:
sub-step B11, determining whether the verification of the web address passes: if the verification of the web address is not passed, go to substep B12;
if the web address is verified, substep 255 is entered.
Substep B12, searching the verification result of the main domain name corresponding to the website, and judging whether the verification result shows that the verification is passed; if the verification result indicates that the verification passes, go to sub-step B13;
and a sub-step B13 of confirming that the website is verified.
After substep B13, substep 255 may be entered.
Sub-step 255, after each URL is selected, if the number of selected URLs has reached the specified number and unverified URLs remain, moving on to the verification process of the next verification group;
For one verification group, at most the specified number of unverified URLs can be selected each time the group is verified. Therefore, while sub-step 254 extracts URLs one by one for verification, the number of extracted URLs is recorded; when it reaches the specified number, it is determined whether any unverified URL follows the last extracted URL, and if so, the remaining URLs wait for a subsequent round.
As in the previous example, suppose verification group A has 100 unverified URLs, forming a queue numbered 1-100, and the specified number is 10. The first time the loop reaches this group, URLs are extracted from the queue one by one; when the 10th URL has been extracted and an 11th URL is found to exist, the process moves on to the next verification group. For example, if the next group in the verification group queue is verification group B, verification switches to verification group B.
And a substep 256 of exiting the loop process for the corresponding verification packet and proceeding to the verification process for the next verification packet if there is no unverified web address.
For example, the 10th time the loop reaches verification group A, URLs are extracted one by one starting from the 91st in the queue; when the 100th URL has been extracted and no 101st URL is found, the verification of this group is complete, so the group exits the loop process and the process moves on to the next verification group.
As another example, suppose verification group A has 98 unverified URLs and the specified number is 10. On the 10th pass, URLs are extracted one by one starting from the 91st in the queue; when the 98th URL has been extracted, no 99th URL is found. Only 8 URLs were extracted, fewer than the specified number 10, but all URLs of verification group A have been verified, so the group exits the loop process and the process moves on to the next verification group.
It is understood that in sub-step 256, whether or not the specified number has been reached (that is, the number of selected URLs is less than or equal to the specified number), once all URLs of a verification group have been verified, the group exits the loop process and the process moves on to the next verification group.
Here, a verification group exiting the loop process means that it is removed from the verification group queue. For example, if the verification group queue is originally A, B, C, D and verification group A exits the loop process after its verification is complete, the queue becomes B, C, D, and verification switches to verification group B. Subsequent verification continues to loop over the queue B, C, D. The verification group queue thus becomes shorter, which reduces the traversal of verification groups.
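The loop of sub-steps 251 to 256 can be sketched as a generator that yields the batches in the order they would be verified. The sketch is illustrative only: the name `round_robin_batches` and the use of `collections.deque` for both the group queue and each URL queue are assumptions, not details given in the text. The verification of each URL is left to the caller.

```python
from collections import deque


def round_robin_batches(groups, batch_size):
    """Yield (group_name, batch) pairs in round-robin order.

    groups maps a group name to its ordered list of unverified URLs.
    """
    queue = deque((name, deque(urls)) for name, urls in groups.items() if urls)
    while queue:                            # sub-step 251: any group left?
        name, pending = queue.popleft()     # sub-step 252: take the next group
        take = min(batch_size, len(pending))
        batch = [pending.popleft() for _ in range(take)]  # sub-step 254
        yield name, batch
        if pending:                         # sub-step 255: wait for a later round
            queue.append((name, pending))
        # sub-step 256: a group with no URLs left is not re-queued, so it
        # leaves the loop and the queue shrinks.


for group, batch in round_robin_batches({"A": ["a1", "a2", "a3"], "B": ["b1"]}, 2):
    print(group, batch)
```

Re-appending an unfinished group at the tail gives the wrap-around from the last group to the first, and dropping a finished group corresponds to the queue A, B, C, D becoming B, C, D.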
In the embodiment of the invention, for the verified website, the advertisement platform can upload the advertisement material data corresponding to the website. Then, the merchant can search the advertisement material data from the network, and the advertisement material data can also be released to each client.
In the embodiment of the invention, the advertisement platform can be provided with a plurality of server nodes that perform the verification function on verification groups, and the verification groups can be distributed among these server nodes for verification. That is, after step 120, the method further comprises: distributing each verification group to a server node. For example, verification groups A, B, C, D are verified on server node A, and verification groups C, D, E, F are verified on server node B. Each server node performs the process of step 130.
The embodiment of the invention can group all websites according to the IP addresses corresponding to the websites of the advertisement material data to obtain all verification groups, wherein each verification group comprises a series of websites of the advertisement material data; then, verifying a part of websites (for example, 10 websites) of a verification group each time, and after the part of websites of the verification group are verified, transferring to the next verification group; in the next verification group, verifying part of the website of the next verification group, and after the verification of the part of the website is finished, switching to the next verification group; and in the same way, after the last verification group is verified, the operation is circulated to the first verification group, and the operation is circulated until all verification groups have no unverified websites.
In the prior art, by contrast, the URLs of advertisement material data are extracted per advertisement account, and the URLs of the same advertisement account are queued directly in extraction order. When one advertisement account has a huge volume of advertisement material data, the URLs of the advertisement accounts queued behind it must wait a long time before their verification can start. In particular, when one advertiser uploads a large amount of advertisement material data under several advertisement accounts on the advertisement platform, the URLs of the advertisement accounts queued behind those accounts wait even longer, and the corresponding advertisers see their advertisement material data go online only after a very long time. In the verification queue of each advertisement slot unit, the prior-art process amounts to the queue being completely blocked by the advertisement accounts with larger data volumes, which delays the verification of the subsequent accounts with smaller data volumes.
The embodiment of the invention ensures that part of the URLs of the advertisement material data of every advertisement account can be verified quickly, so that part of that data can go online quickly and the time each advertisement account waits to go online is shortened. Every advertisement account sees its advertisement material data go online in a timely manner. In particular, for an advertisement account with a small volume of advertisement material data, all of its URLs can be verified within a few rounds of the loop. Overall, the waiting time for verification is reduced for every advertisement account, the verification time is spread across the advertisement accounts, and advertisement material data goes online faster. The embodiment of the invention can thus improve the fairness and friendliness of the advertisement platform and its user experience.
In addition, the embodiment of the invention can extract one main domain name for the unverified URLs that belong to the same main domain name, perform connectivity verification on that main domain name, and record the verification result. In the subsequent verification of the URLs of the advertisement material data, if a URL fails verification, the verification result of its main domain name is looked up. If the main domain name passed verification, the website hosting the URL is reachable and the URL was misjudged as unreachable, so the URL is considered to pass verification and the corresponding advertisement material data can go online. If the main domain name of a website is reachable, the URLs under that main domain name are normally reachable as well; the embodiment can therefore reduce the probability of misjudgment caused by a firewall intercepting URL verification, and ensure that the corresponding advertisement material data can go online normally.
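The main-domain fallback can be sketched in a few lines. This is a simplified illustration, not the patented implementation: `verify_domains` and `verify_url` are hypothetical names, the main domain name is taken to be the host part of the URL, and `check_domain` and `check_url` are caller-supplied connectivity checks whose details the text does not specify.

```python
from urllib.parse import urlsplit


def verify_domains(urls, check_domain):
    """Verify each distinct main domain name once and record the result."""
    results = {}
    for url in urls:
        domain = urlsplit(url).hostname
        if domain not in results:
            results[domain] = check_domain(domain)
    return results


def verify_url(url, check_url, domain_results):
    """A URL passes if its own check passes or its main domain name passed."""
    if check_url(url):
        return True
    # The direct check failed (for example, intercepted by a firewall):
    # fall back to the recorded result of the main domain name.
    return domain_results.get(urlsplit(url).hostname, False)
```

Recording the domain results up front means the fallback costs only a dictionary lookup when an individual URL fails.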
Example three
Referring to fig. 3, a flow diagram illustrating a method for verifying an advertisement material data website according to an embodiment of the present invention is shown, which may specifically include:
step 312, acquiring websites of the unverified advertising material data;
step 314, extracting a main domain name from the websites of the advertisement material data for the websites belonging to the same main domain name;
step 316, verifying each main domain name and recording the verification result;
step 318, dividing the URLs of the advertisement material data that share an IP address into one verification group, according to the IP address corresponding to the URL of each piece of advertisement material data.
Steps 312 to 318 follow principles similar to the corresponding steps of embodiment two and are not described in detail here.
Step 320, judging whether there is a verification group that has not completed verification; if so, go to step 322; if not, go to step 336.
Step 322, circularly selecting the next unverified verification group; wherein the next verification packet to the last verification packet is the first verification packet;
step 324, selecting websites one by one from unverified websites of the verification group for verification;
step 326, judging whether the verification of the website passes aiming at the website of each advertisement material data; if the verification is not passed, step 328 is entered; if the verification passes, go to step 332;
step 328, searching for a verification result of the main domain name corresponding to the website, and determining whether the verification result indicates that the verification is passed; if the verification result indicates that the verification is passed, go to step 330; if the verification result indicates that the verification is not passed, go to step 332;
step 330, confirming that the verification of the website is passed.
Step 332, after selecting the website every time, judging whether the number of the currently selected websites reaches the specified number and whether unverified websites exist; if the selected web addresses reach the designated number and unverified web addresses exist, go to step 320; if there is no unverified web address, go to step 334;
step 334, the corresponding verification group exits the loop process, and the process enters step 320.
Step 336, end verification.
For the loop process of steps 320 to 336, the following is described as an example:
For example, suppose step 318 yields the verification group queue: IP address 1, IP address 2, IP address 3, where:
There are 80 unverified URLs, in order, in IP address 1.
There are 60 unverified URLs in order in IP address 2.
There are 35 unverified URLs in order in IP address 3.
The specified number is 10.
First round of the loop: initially, step 320 determines that verification groups IP address 1, IP address 2, and IP address 3 have not completed verification. Step 322 selects the first verification group in order: IP address 1. In step 324, URLs 1-10 are extracted from IP address 1 one by one for verification.
When each URL is verified, entering step 332 when the URL passes the verification; when the URL does not verify, step 328 is entered. Step 328, searching for a verification result of the main domain name corresponding to the website, and determining whether the verification result indicates that the verification is passed; if the verification result of the main domain name indicates that the verification is passed, step 330 is entered, and the verification of the website is confirmed, step 332 is entered. If the verification result of the primary domain name indicates failure, then step 332 is entered directly.
In step 332, when the 10 th site is extracted, the 10 th site is found not to be the last site, and the process proceeds to step 320.
Step 320 continues to determine that there are verification packets that have not been verified: IP address 1, IP address 2, IP address 3. Step 322 selects the next authentication packet: IP address 2. In step 324, the websites 1-10 are extracted from the IP address 2 one by one for verification. And then through steps 326-328. After entering step 332, when the 10 th website is extracted, and the 10 th website is found not to be the last website, the process proceeds to step 320.
Step 320 continues to determine that there are verification packets that have not been verified: IP address 1, IP address 2, IP address 3. Step 322 selects the next authentication packet: IP address 3. In step 324, the addresses 1-10 are extracted from the IP address 3 one by one for verification, and then the steps of step 326-328 are performed. After entering step 332, when the 10 th website is extracted, and the 10 th website is found not to be the last website, the process proceeds to step 320. At this point IP address 3 is the last in the queue of validation packets, then its next validation packet is IP address 1. And entering a second round of circulation.
By analogy, in the fourth round, when the loop reaches IP address 3, step 332 finds that the 35th URL, once extracted, is the last URL; the corresponding verification group exits the loop process, and the process goes to step 320. Step 320 determines that verification groups with unverified URLs remain: IP address 1 and IP address 2. The fifth round then begins.
In the sixth round, when the loop reaches IP address 2, step 332 finds that the 60th URL, once extracted, is the last URL; the corresponding verification group exits the loop process, and the process goes to step 320. Step 320 determines that a verification group with unverified URLs remains: IP address 1. The seventh round then begins.
IP address 1 is then verified in the same way until its URLs are exhausted, after which the process enters step 336.
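The round numbers in this walkthrough can be checked with a small simulation that tracks only how many URLs remain in each verification group. The function name `finishing_rounds` is an assumption made for this illustration; the group sizes 80, 60 and 35 and the specified number 10 are those of the example above.

```python
def finishing_rounds(sizes, batch_size):
    """Return {group: round in which its last URL is verified}."""
    remaining = dict(sizes)
    finished = {}
    round_no = 0
    while remaining:                       # step 320: any group left?
        round_no += 1
        for name in list(remaining):       # step 322: visit groups in order
            remaining[name] -= min(batch_size, remaining[name])  # step 324
            if remaining[name] == 0:       # step 334: group exits the loop
                finished[name] = round_no
                del remaining[name]
    return finished


rounds = finishing_rounds(
    {"IP address 1": 80, "IP address 2": 60, "IP address 3": 35}, 10)
print(rounds)
```

Under these inputs IP address 3 finishes in round 4 and IP address 2 in round 6, as in the walkthrough, and IP address 1 finishes in round 8.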
The principle of the steps of the embodiment of the invention is similar to that of the first and second embodiments, and are not described in detail herein.
Firstly, the embodiment of the invention ensures that part of the URLs of the advertisement material data of every advertisement account can be verified quickly, so that part of that data can go online quickly and the time each advertisement account waits to go online is shortened. Every advertisement account sees its advertisement material data go online in a timely manner. In particular, for an advertisement account with a small volume of advertisement material data, all of its URLs can be verified within a few rounds of the loop. Overall, the waiting time for verification is reduced for every advertisement account, the verification time is spread across the advertisement accounts, and advertisement material data goes online faster. The embodiment of the invention can thus improve the fairness and friendliness of the advertisement platform and its user experience.
Secondly, the embodiment of the invention can reduce the probability of misjudgment caused by the interception of the website verification by the firewall in the verification process of the website of the advertisement material data, so that the corresponding advertisement material data can be normally online.
Example four
Referring to fig. 4, a schematic structural diagram of an advertisement material data website verification device according to an embodiment of the present invention is shown, which may specifically include:
an obtaining module 410 adapted to obtain a website of each unverified advertising material data;
a main domain name extracting module 420 adapted to extract a main domain name for websites belonging to the same main domain name from the websites of the respective advertisement material data;
a main domain name verification module 430, adapted to verify each main domain name and record the verification result;
the website verification module 440 is suitable for verifying the website of each piece of advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
Example five
Referring to fig. 5, a schematic structural diagram of an advertisement material data website verification device according to an embodiment of the present invention is shown, which may specifically include:
an obtaining module 510 adapted to obtain a website of each unverified advertisement material data;
a main domain name extracting module 520 adapted to extract a main domain name for websites belonging to the same main domain name from the websites of the respective advertisement material data;
a main domain name verification module 530 adapted to verify each main domain name and record the verification result;
the website verifying module 540 specifically includes:
the IP grouping module 542 is adapted to divide the websites of the advertisement material data of the same IP address into a verification group according to the IP address corresponding to the website of each advertisement material data;
a loop verification module 543 adapted to circularly verify each verification group obtained; wherein, when verifying each verification group, the method comprises the following steps: selecting a specified number of website data from the unverified websites in the verification group for verification; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
Preferably, the loop verification module includes:
the verification grouping judgment module is suitable for judging whether verification grouping which is not verified exists or not;
the cycle selection module is suitable for circularly selecting the next unverified verification packet to carry out verification if the unverified verification packet exists; wherein the next verification packet to the last verification packet is the first verification packet;
and the ending module is suitable for ending the verification if the verification packet which is not verified completely does not exist.
Preferably, the loop verification module includes:
the one-by-one verification module is suitable for selecting websites one by one from unverified websites for verification;
the switching-in judgment module is suitable for switching in the verification process of the next verification group if the selected websites reach the specified number and unverified websites exist after the websites are selected each time;
and the quitting module is suitable for quitting the corresponding verification group from the circulation process and switching to the verification process of the next verification group if the unverified website does not exist.
Preferably, the IP packet module includes:
the IP address acquisition module is suitable for acquiring an IP address corresponding to the website according to the website of each advertisement material data;
the first IP grouping module is suitable for dividing the website corresponding to the same IP address into a verification grouping.
Preferably, the IP packet module includes:
the main domain name grouping module is suitable for dividing the websites with the same main domain name into a first group according to the main domain name in the websites of each advertisement material data;
the main domain name IP acquisition module is suitable for acquiring an IP address corresponding to each main domain name;
and the second IP grouping module is suitable for combining the first groups corresponding to the same IP address into a verification group.
Example six
Referring to fig. 6, which shows a schematic structural diagram of an advertisement material data website verification apparatus according to an embodiment of the present invention, specifically, the apparatus may include:
an obtaining module 610 adapted to obtain websites of each unverified advertising material data;
a main domain name extraction module 620, adapted to extract a main domain name for websites belonging to the same main domain name from the websites of each advertisement material data;
a main domain name verification module 630, adapted to verify each main domain name and record the verification result;
the website verifying module 640 specifically includes:
the IP grouping module 642 is adapted to divide the advertisement material data of the same IP address into a verification group according to the IP address corresponding to the website of each advertisement material data.
The loop verification module 643 specifically includes:
a verification group judgment module 6431 adapted to judge whether there is a verification group that has not completed verification; if such a verification group exists, entering the loop selection module 6432; otherwise, entering the end module 6439.
A loop selection module 6432 adapted to loop select a next unverified verification packet for verification if there is a verification packet that has not been verified yet; wherein the next verification packet to the last verification packet is the first verification packet; if there is no unverified advertising material data, exit module 6438 is entered.
The one-by-one verification module 6433 is suitable for selecting websites one by one from the unverified advertising material data for verification;
the verification judging module 6434 is adapted to judge, for the website of each piece of advertisement material data, whether the website passes verification; if it does not pass, the result checking module 6435 is entered; if it passes, the switching-in judgment module 6437 is entered;
the result checking module 6435 is adapted to look up the verification result of the main domain name corresponding to the website and determine whether that result indicates a pass; if it does, the pass confirmation module 6436 is entered; if it does not, the switching-in judgment module 6437 is entered;
the pass confirmation module 6436 is adapted to confirm that the website passes verification;
the switching-in judgment module 6437 is adapted to judge, each time a website is selected, whether the number of websites selected so far has reached the specified number and whether unverified websites remain; if the specified number has been reached and unverified websites remain, the verification group judgment module 6431 is entered;
the exit module 6438 is adapted to remove the corresponding verification group from the loop process and enter the verification group judgment module 6431;
the end module 6439 is adapted to end the verification if no verification group remains that has not been fully verified.
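The loop formed by modules 6431 to 6439 can be sketched as a round-robin scheduler. This is only an illustrative sketch, not the patented implementation: the function name `verify_round_robin`, the `quota` parameter (standing in for the "specified number"), and the injected `verify` callable (assumed to already include the per-website check and the main-domain fallback of modules 6434 to 6436) are all assumptions made for the example.

```python
from collections import deque
from typing import Callable, Dict, List


def verify_round_robin(
    groups: Dict[str, List[str]],
    verify: Callable[[str], bool],
    quota: int,
) -> Dict[str, bool]:
    """Verify every URL, taking at most `quota` URLs from a group per turn."""
    if quota < 1:
        raise ValueError("quota must be at least 1")
    results: Dict[str, bool] = {}
    # Each queue entry is one verification group holding its unverified URLs.
    pending = deque(deque(urls) for urls in groups.values() if urls)
    while pending:                      # judgment module 6431
        group = pending.popleft()       # loop selection module 6432
        taken = 0
        while group and taken < quota:  # one-by-one verification module 6433
            url = group.popleft()
            results[url] = verify(url)  # modules 6434 to 6436 (assumed inside verify)
            taken += 1
        if group:                       # switching-in judgment module 6437
            pending.append(group)       # group rejoins the end of the cycle
        # An exhausted group is simply not re-queued (exit module 6438).
    return results                      # end module 6439
```

With two groups of three and one URLs and a quota of 2, the scheduler takes two URLs from the first group, then the single URL of the second group, and only then returns to the first group, so consecutive requests are spread across IP addresses.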
The algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose systems may also be used with the teachings herein. The required structure for constructing such a system will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, this method of disclosure should not be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some but not other features included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. It will be appreciated by those skilled in the art that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of an advertising material data web site verification apparatus according to embodiments of the present invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, and so on does not indicate any ordering. These words may be interpreted as names.
The invention also discloses A1, an advertisement material data website verification method, comprising:
acquiring websites of the unverified advertising material data;
extracting a main domain name from the websites of the advertisement material data for the websites belonging to the same main domain name;
verifying each main domain name and recording verification results;
verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website;
and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
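The fallback rule of A1 can be illustrated with a short sketch. The names `main_domain`, `verify_with_fallback`, `check_url`, and `check_domain` are hypothetical, and the main domain name is approximated naively as the last two labels of the host, which is an assumption that fails for suffixes such as `.com.cn`; the source does not specify how the main domain name is extracted or how a website is checked.

```python
from typing import Callable, Dict, Iterable
from urllib.parse import urlsplit


def main_domain(url: str) -> str:
    """Naive assumption: the main domain is the last two host labels."""
    host = urlsplit(url).hostname or ""
    return ".".join(host.split(".")[-2:])


def verify_with_fallback(
    urls: Iterable[str],
    check_url: Callable[[str], bool],
    check_domain: Callable[[str], bool],
) -> Dict[str, bool]:
    urls = list(urls)
    # Verify each distinct main domain once and record the result.
    domain_ok = {d: check_domain(d) for d in {main_domain(u) for u in urls}}
    results = {}
    for url in urls:
        # A URL that fails on its own still passes if its main domain passed.
        results[url] = check_url(url) or domain_ok[main_domain(url)]
    return results
```

Because each main domain is checked only once, the fallback lookup for a failed website is a dictionary read rather than a second network request.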
A2, according to the method in A1, the verifying the website of each piece of advertisement material data includes:
dividing the websites of the advertising material data with the same IP address into a verification group according to the IP address corresponding to the website of each advertising material data;
cyclically verifying the obtained verification groups, wherein verifying each verification group comprises: selecting a specified number of websites from the unverified websites in the verification group for verification.
A3, according to the method of A2, the circularly verifying each verification packet includes:
judging whether verification groups which are not verified exist or not;
if a verification group that has not been fully verified exists, cyclically selecting the next such verification group for verification, wherein the verification group following the last verification group is the first verification group;
and if no verification group remains that has not been fully verified, ending the verification.
A4, according to the method of A2 or A3, wherein the selecting a specified number of websites for verification from the unverified websites in the verification group comprises:
selecting websites one by one from the unverified websites for verification;
each time a website is selected, if the number of selected websites reaches the specified number and unverified websites remain, switching to the verification process of the next verification group;
and if no unverified website remains, removing the corresponding verification group from the loop process and switching to the verification process of the next verification group.
A5, according to the method of A2, the dividing the websites of the advertisement material data with the same IP address into one verification group according to the IP address corresponding to the website of each advertisement material data includes:
acquiring the IP address corresponding to the website of each advertisement material data;
dividing the websites corresponding to the same IP address into one verification group.
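A minimal sketch of the grouping in A5 follows, assuming that "acquiring the IP address" means a DNS lookup of the host in each website address. The function name `group_by_ip` and the injectable `resolve` parameter are illustrative choices; the default `socket.gethostbyname` returns a single IPv4 address and raises `socket.gaierror` when resolution fails, which this sketch does not handle.

```python
import socket
from collections import defaultdict
from typing import Callable, Dict, Iterable, List
from urllib.parse import urlsplit


def group_by_ip(
    urls: Iterable[str],
    resolve: Callable[[str], str] = socket.gethostbyname,
) -> Dict[str, List[str]]:
    """Put URLs whose hosts resolve to the same IP into one verification group."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for url in urls:
        host = urlsplit(url).hostname
        if host is None:
            continue  # assumption: URLs without a host are skipped
        groups[resolve(host)].append(url)
    return dict(groups)
```

Passing a stub resolver, for example a dictionary lookup, keeps the grouping testable without network access.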
A6, according to the method of A2, the dividing the websites of the advertisement material data with the same IP address into one verification group according to the IP address corresponding to the website of each advertisement material data includes:
dividing the websites with the same main domain name into a first group according to the main domain name in the website of each advertisement material data;
acquiring the IP address corresponding to each main domain name;
combining the first groups corresponding to the same IP address into one verification group.
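The two-stage grouping of A6 might look like the following sketch. It assumes, as a simplification, that the main domain name is the last two labels of the host; `group_by_domain_then_ip` and `resolve_domain` are hypothetical names, and the resolver is injected so that the example makes no real DNS calls.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List
from urllib.parse import urlsplit


def group_by_domain_then_ip(
    urls: Iterable[str],
    resolve_domain: Callable[[str], str],
) -> Dict[str, List[str]]:
    # Stage 1: first groups keyed by main domain (naive last-two-labels rule).
    first_groups: Dict[str, List[str]] = defaultdict(list)
    for url in urls:
        host = urlsplit(url).hostname or ""
        first_groups[".".join(host.split(".")[-2:])].append(url)
    # Stage 2: resolve each main domain once, then merge groups sharing an IP.
    merged: Dict[str, List[str]] = defaultdict(list)
    for domain, members in first_groups.items():
        merged[resolve_domain(domain)].extend(members)
    return dict(merged)
```

Compared with resolving every website individually, this variant performs one lookup per main domain, at the cost of treating all subdomains of a main domain as sharing its IP address.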
The invention also discloses B7, an advertisement material data website verification device, comprising:
the acquisition module is suitable for acquiring websites of the unverified advertising material data;
the main domain name extraction module is suitable for extracting a main domain name from websites of the advertisement material data for the websites belonging to the same main domain name;
the main domain name verification module is suitable for verifying each main domain name and recording a verification result;
the website verification module is suitable for verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; and if the verification result shows that the verification is passed, confirming that the verification of the website is passed.
B8, the apparatus according to B7, the website verifying module comprising:
the IP grouping module is suitable for dividing the websites of the advertising material data with the same IP address into a verification group according to the IP address corresponding to the website of each advertising material data;
the cyclic verification module is suitable for cyclically verifying the obtained verification groups, wherein verifying each verification group comprises: selecting a specified number of websites from the unverified websites in the verification group for verification.
B9, the apparatus of B8, the cycle verification module comprising:
the verification grouping judgment module is suitable for judging whether verification grouping which is not verified exists or not;
the cycle selection module is suitable for cyclically selecting, if a verification group that has not been fully verified exists, the next such verification group for verification, wherein the verification group following the last verification group is the first verification group;
and the ending module is suitable for ending the verification if no verification group remains that has not been fully verified.
B10, the apparatus of B8 or B9, the cycle verification module comprising:
the one-by-one verification module is suitable for selecting websites one by one from unverified websites for verification;
the switching-in judgment module is suitable for switching in the verification process of the next verification group if the selected websites reach the specified number and unverified websites exist after the websites are selected each time;
and the quitting module is suitable for quitting the corresponding verification group from the circulation process and switching to the verification process of the next verification group if the unverified website does not exist.
B11, the apparatus of B8, the IP grouping module comprising:
the IP address acquisition module is suitable for acquiring an IP address corresponding to the website according to the website of each advertisement material data;
the first IP grouping module is suitable for dividing the website corresponding to the same IP address into a verification grouping.
B12, the apparatus of B8, the IP grouping module comprising:
the main domain name grouping module is suitable for dividing the websites with the same main domain name into a first group according to the main domain name in the websites of each advertisement material data;
the main domain name IP acquisition module is suitable for acquiring an IP address corresponding to each main domain name;
and the second IP grouping module is suitable for combining the first groups corresponding to the same IP address into a verification group.

Claims (10)

1. A method for verifying advertising material data website comprises the following steps:
acquiring websites of the unverified advertising material data;
extracting a main domain name from the websites of the advertisement material data for the websites belonging to the same main domain name;
verifying each main domain name and recording verification results;
verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website;
if the verification result shows that the verification is passed, confirming that the verification of the website is passed;
wherein, the verifying the website of each advertisement material data comprises:
dividing the websites of the advertising material data with the same IP address into a verification group according to the IP address corresponding to the website of each advertising material data;
cyclically verifying the obtained verification groups, wherein verifying each verification group comprises: selecting a specified number of websites from the unverified websites in the verification group for verification.
2. The method of claim 1, wherein the circularly verifying each verification packet obtained comprises:
judging whether verification groups which are not verified exist or not;
if a verification group that has not been fully verified exists, cyclically selecting the next such verification group for verification, wherein the verification group following the last verification group is the first verification group;
and if no verification group remains that has not been fully verified, ending the verification.
3. The method according to claim 1 or 2, wherein selecting a specified number of websites for verification from the unverified websites in the verification group comprises:
selecting websites one by one from the unverified websites for verification;
each time a website is selected, if the number of selected websites reaches the specified number and unverified websites remain, switching to the verification process of the next verification group;
and if no unverified website remains, removing the corresponding verification group from the loop process and switching to the verification process of the next verification group.
4. The method of claim 1, wherein the dividing the websites of the advertisement material data with the same IP address into one verification group according to the IP address corresponding to the website of each advertisement material data comprises:
acquiring the IP address corresponding to the website of each advertisement material data;
dividing the websites corresponding to the same IP address into one verification group.
5. The method of claim 1, wherein the dividing the websites of the advertisement material data with the same IP address into one verification group according to the IP address corresponding to the website of each advertisement material data comprises:
dividing the websites with the same main domain name into a first group according to the main domain name in the website of each advertisement material data;
acquiring the IP address corresponding to each main domain name;
combining the first groups corresponding to the same IP address into one verification group.
6. An advertisement material data website verification device, comprising:
the acquisition module is suitable for acquiring websites of the unverified advertising material data;
the main domain name extraction module is suitable for extracting a main domain name from websites of the advertisement material data for the websites belonging to the same main domain name;
the main domain name verification module is suitable for verifying each main domain name and recording a verification result;
the website verification module is suitable for verifying the website of each advertisement material data; when the verification of a website is not passed, searching a verification result of a main domain name corresponding to the website; if the verification result shows that the verification is passed, confirming that the verification of the website is passed;
wherein, the website verification module comprises:
the IP grouping module is suitable for dividing the websites of the advertising material data with the same IP address into a verification group according to the IP address corresponding to the website of each advertising material data;
the cyclic verification module is suitable for cyclically verifying the obtained verification groups, wherein verifying each verification group comprises: selecting a specified number of websites from the unverified websites in the verification group for verification.
7. The apparatus of claim 6, wherein the loop verification module comprises:
the verification grouping judgment module is suitable for judging whether verification grouping which is not verified exists or not;
the cycle selection module is suitable for cyclically selecting, if a verification group that has not been fully verified exists, the next such verification group for verification, wherein the verification group following the last verification group is the first verification group;
and the ending module is suitable for ending the verification if no verification group remains that has not been fully verified.
8. The apparatus of claim 6 or 7, wherein the loop verification module comprises:
the one-by-one verification module is suitable for selecting websites one by one from unverified websites for verification;
the switching-in judgment module is suitable for switching in the verification process of the next verification group if the selected websites reach the specified number and unverified websites exist after the websites are selected each time;
and the quitting module is suitable for quitting the corresponding verification group from the circulation process and switching to the verification process of the next verification group if the unverified website does not exist.
9. The apparatus of claim 6, wherein the IP grouping module comprises:
the IP address acquisition module is suitable for acquiring an IP address corresponding to the website according to the website of each advertisement material data;
the first IP grouping module is suitable for dividing the website corresponding to the same IP address into a verification grouping.
10. The apparatus of claim 6, wherein the IP grouping module comprises:
the main domain name grouping module is suitable for dividing the websites with the same main domain name into a first group according to the main domain name in the websites of each advertisement material data;
the main domain name IP acquisition module is suitable for acquiring an IP address corresponding to each main domain name;
and the second IP grouping module is suitable for combining the first groups corresponding to the same IP address into a verification group.
CN201510484812.2A 2015-08-07 2015-08-07 Advertisement material data website verification method and device Active CN106453207B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510484812.2A CN106453207B (en) 2015-08-07 2015-08-07 Advertisement material data website verification method and device

Publications (2)

Publication Number Publication Date
CN106453207A CN106453207A (en) 2017-02-22
CN106453207B true CN106453207B (en) 2021-01-29

Family

ID=58092409

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510484812.2A Active CN106453207B (en) 2015-08-07 2015-08-07 Advertisement material data website verification method and device

Country Status (1)

Country Link
CN (1) CN106453207B (en)

Also Published As

Publication number Publication date
CN106453207A (en) 2017-02-22

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20240116

Address after: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: Room 112, block D, No. 28, Xinjiekou outer street, Xicheng District, Beijing 100088 (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.
