CN106445935B - Misjudgment processing method and device for click data - Google Patents

Misjudgment processing method and device for click data Download PDF

Info

Publication number
CN106445935B
CN106445935B CN201510473699.8A CN201510473699A CN106445935B CN 106445935 B CN106445935 B CN 106445935B CN 201510473699 A CN201510473699 A CN 201510473699A CN 106445935 B CN106445935 B CN 106445935B
Authority
CN
China
Prior art keywords
data
click
click data
abnormal
time interval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510473699.8A
Other languages
Chinese (zh)
Other versions
CN106445935A (en
Inventor
吕超
李益群
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510473699.8A priority Critical patent/CN106445935B/en
Publication of CN106445935A publication Critical patent/CN106445935A/en
Application granted granted Critical
Publication of CN106445935B publication Critical patent/CN106445935B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention discloses a misjudgment processing method of click data, which comprises the following steps: sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data; in the traversing process, each abnormal click data marked as abnormal is marked as abnormal data within a second preset time interval before the click time point of the abnormal click data, and the click data which is the same as the parameter information of the abnormal click data is marked as abnormal data. The invention also discloses a misjudgment processing device of the click data. The invention avoids misjudgment of the click data.

Description

Misjudgment processing method and device for click data
Technical Field
The invention relates to the technical field of internet, in particular to a misjudgment processing method and device for click data.
Background
In order to prevent a user from maliciously clicking data, the existing data servers such as advertisement servers are all provided with click anti-cheating systems to perform abnormal judgment on click data, the existing technologies often judge abnormal click data through abnormal click judgment strategies (for example, the number of continuous clicks of the same user in a preset time interval exceeds the preset number), but the existing click data are reported in real time, the click data with the preset number before the abnormal click data are judged to be normal data, so that misjudgment of the click data is caused, and a system (advertisement) which charges according to the click data possibly causes mistaken fee deduction.
Disclosure of Invention
The embodiment of the invention provides a method and a device for processing misjudgment of click data, and aims to solve the problem of misjudgment of the click data.
In order to achieve the above object, an embodiment of the present invention provides a method for processing misjudgment of click data, where the method for processing misjudgment of click data includes:
sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data;
in the traversing process, each abnormal click data marked as abnormal is marked as abnormal data within a second preset time interval before the click time point of the abnormal click data, and the click data which is the same as the parameter information of the abnormal click data is marked as abnormal data.
In order to achieve the above object, an embodiment of the present invention further provides a device for processing misjudgment of click data, where the device for processing misjudgment of click data includes:
the traversal module is used for sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data;
and the marking module is used for marking the click data which is not marked as abnormal and has the same parameter information with the abnormal click data as the abnormal data in a second preset time interval before the click time point of each abnormal click data marked as abnormal in the traversal process.
According to the misjudgment processing method and device for the click data, the click data in the first preset time interval are sequentially traversed in a reverse order according to the sequence of the click time points of the click data; in the traversal process, the click data which is not marked as abnormal in a second preset time interval before the click time point of each abnormal click data marked as abnormal and has the same parameter information with the abnormal click data is marked as abnormal data again, so that misjudgment of the click data is avoided.
Drawings
Fig. 1 is a schematic diagram of a hardware architecture related to a misjudgment processing apparatus for click data according to an embodiment of the present invention;
FIG. 2 is a schematic flow chart illustrating a method for processing misjudgment of click data according to a first embodiment of the present invention;
FIG. 3 is a flowchart illustrating a detailed process of a first scheme for marking abnormal click data in the first embodiment;
FIG. 4 is a schematic view of a detailed flow of the step of marking the click data as abnormal click data when the time interval is smaller than a second preset time interval according to the present invention;
FIG. 5 is a flowchart illustrating a detailed process of a second scheme for marking abnormal click data in the first embodiment;
FIG. 6 is a flowchart illustrating a method for processing misjudgment of click data according to a second embodiment of the present invention;
FIG. 7 is a system frame diagram of the misjudgment processing method of click data for advertisement billing according to the present invention;
FIG. 8 is a functional block diagram of a device for processing misjudgment of click data according to a first embodiment of the present invention;
FIG. 9 is a schematic diagram showing a first detailed functional block of the marking module in the first embodiment;
FIG. 10 is a schematic diagram showing a detailed functional block of a first marking unit in the first embodiment;
FIG. 11 is a diagram showing a second detailed functional block of the marking module in the first embodiment;
fig. 12 is a functional module diagram of a misjudgment processing device for click data according to a second embodiment of the invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The main solution of the embodiment of the invention is as follows: sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data; in the traversing process, each abnormal click data marked as abnormal is marked as abnormal data within a second preset time interval before the click time point of the abnormal click data, and the click data which is the same as the parameter information of the abnormal click data is marked as abnormal data.
Because the existing click data abnormity judging mode is real-time judgment, part of abnormal click data is judged as normal data, and therefore misjudgment of the click data is caused.
The embodiment of the invention constructs a misjudgment processing device of click data, and the tool carries out misjudgment detection on normal data in the click data within a preset time interval and re-identifies the misjudgment data which is not marked as abnormal click data, thereby avoiding misjudgment of the click data.
The misjudgment processing device of the click data of the embodiment can be borne on the server and can also be borne on the terminal. The hardware architecture of the click data misjudgment processing device may be as shown in fig. 1.
Fig. 1 shows a hardware architecture of a misjudgment processing apparatus for click data according to an embodiment of the present invention. As shown in fig. 1, the hardware of the click data erroneous determination processing device includes: a processor 1001, such as a CPU, a network interface 1004, a user interface 1003, a memory 1005, a communication bus 1002. The communication bus 1002 is used for implementing connection communication between the components in the server. The user interface 1003 may include a Display (Display), a Keyboard (Keyboard), a mouse, and other components, and is configured to receive information input by a user and send the received information to the processor 1005 for processing. The display screen can be an LCD display screen, an LED display screen or a touch screen. The optional user interface 1003 may also include a standard wired interface, a wireless interface. The network interface 1004 may optionally include a standard wired interface, a wireless interface (e.g., WI-FI interface). The memory 1005 may be a high-speed RAM memory or a non-volatile memory (e.g., a magnetic disk memory). The memory 1005 may alternatively be a storage device separate from the processor 1001. As shown in fig. 1, a memory 1005, which is a kind of computer storage medium, may include therein an operating system, a network communication module, a user interface module, and a misjudgment processing program of click data.
Among the hardware involved in the server shown in fig. 1, the network interface 1004 is mainly used for connecting other application servers (such as a billing server and a click data server) and performing data communication with other application servers; the user interface 1003 is mainly used for connecting a client, performing data communication with the client, and receiving information and instructions input by the client; and the processor 1001 may be configured to call a misjudgment processing program of the click data stored in the memory 1005, and perform the following operations:
sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data;
in the traversing process, each abnormal click data marked as abnormal is marked as abnormal data within a second preset time interval before the click time point of the abnormal click data, and the click data which is the same as the parameter information of the abnormal click data is marked as abnormal data.
Further, in one embodiment, the processor 1001 may call the misjudgment processing program of the click data stored in the memory 1005 to perform the following operations:
updating a first click time point corresponding to the abnormal click data when the abnormal click data marked as abnormal in the click data is traversed;
when the click data which are not marked as abnormal in the click data are traversed, a second click time point of the click data is obtained;
acquiring a time interval between a first click time point and a second click time point of abnormal click data which is the same as the parameter information of the click data;
and when the time interval is smaller than a second preset time interval, marking the click data as abnormal.
Further, in one embodiment, the processor 1001 may call the misjudgment processing program of the click data stored in the memory 1005 to perform the following operations:
when the time interval is smaller than a second preset time interval, acquiring exposure data and/or request data corresponding to the parameter information;
determining whether the traversed click data is abnormal or not based on the exposure data and/or the request data corresponding to the parameter information and the click data corresponding to the parameter information;
and when the traversed click data is abnormal, marking the second data as abnormal click data.
Further, in one embodiment, the processor 1001 may call the misjudgment processing program of the click data stored in the memory 1005 to perform the following operations:
when first data marked as abnormity in click data is traversed, second data which is the same as the first data parameter information and is not marked as abnormity in a second preset time interval before the click time point of the first data is obtained, wherein the second time interval is smaller than the first time interval;
and when the second data is acquired, marking the second data as abnormal click data.
Further, in one embodiment, the processor 1001 may call the misjudgment processing program of the click data stored in the memory 1005 to perform the following operations:
when click data are received, acquiring click data parameter information;
when the number of the click data with the parameter information same as the acquired parameter information in a second preset time interval is larger than that in the second preset time interval, marking the click data as abnormal click data;
and saving the received click data.
Further, in one embodiment, the processor 1001 may call the misjudgment processing program of the click data stored in the memory 1005 to perform the following operations:
and when the time interval between the current time point and the last traversal time point is greater than or equal to the first preset time interval, executing the step of sequentially traversing each click data in the first preset time interval in a reverse order according to the sequence of the click time points of the click data.
According to the scheme, the click data in the first preset time interval are sequentially traversed in a reverse order according to the order of the click time points of the click data; in the traversal process, the click data which is not marked as abnormal in a second preset time interval before the click time point of each abnormal click data marked as abnormal and has the same parameter information with the abnormal click data is marked as abnormal data again, so that misjudgment of the click data is avoided.
Based on the hardware architecture, the embodiment of the misjudgment processing method of the click data is provided.
As shown in fig. 2, a first embodiment of a method for processing misjudgment of click data according to the present invention is provided, where the method for processing misjudgment of click data includes:
step S10, sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data;
in this embodiment, the first preset time interval may be set by the user as needed, for example, the click data in one day or the click data in 2h, and the first preset time interval may be changed as needed after being set, or may be determined according to the capability of the server where the misjudgment processing device of the click data is located, for example, based on the size of the storage space of the server.
The method comprises the steps that click data of a user, which are acquired by an application server, are generally uploaded and stored to a preset storage system in real time, the storage system is preferably a distributed storage system, and the click data in a first preset time interval are pulled from the distributed storage system every other first preset time interval.
And each click data is sequentially stored according to the sequence of the click time points, so that when the click data in the first preset time interval is obtained, each click data is arranged according to the sequence. The parameter information of each click data may be a user identifier (e.g., user IP, ID used when the user logs in to the application, such as qq number, etc.), or may be a combination of the user identifier and other parameters, such as user identifier, advertiser, user identifier, advertisement name, etc.
Step S20, in the traversal process, the click data that is not marked as abnormal within a second preset time interval before the click time point of each abnormal click data marked as abnormal and is the same as the parameter information of the abnormal click data is marked as abnormal data.
For ease of understanding, the following is described in two specific embodiments, specifically as follows:
1) referring to fig. 3, the step S20 includes the steps of:
step S21, when the abnormal click data marked as abnormal in the click data is traversed, updating a first click time point corresponding to the abnormal click data;
step S22, when the click data which is not marked as abnormal in the click data is traversed, acquiring a second click time point of the click data;
step S23, obtaining a time interval between the first click time point and the second click time point of the abnormal click data that is the same as the parameter information of the click data;
and step S24, when the time interval is smaller than a second preset time interval, marking the click data as abnormal click data.
The first time point may be a click time point of the abnormal click data, or a click time point obtained by subtracting a second preset time interval from the click time point of the abnormal click data, and the first time point is taken as an example of the click time point of the abnormal click data, and the click data in the preset first time interval is [ a ]1,A2,A3……,An]If the preset second time interval is 30min, the second time interval is determined from AnStart to traverse to A in turn1At AnMarked as abnormal and the click time point is 10:15, the click time point of the abnormal click data is updated to be 10:15, and the click data is continuously traversed at An-1Is marked as abnormal and the clicking time point is 10:10, the clicking time point of the abnormal clicking data is updated to be 10:10, and the clicking data is continuously traversed, at An-2Not marked as abnormal, and its click time point is 9:50, and its parameter information and An-1Is the same, when A is the samen-1And marking the abnormal click data until all the click data are traversed.
According to the scheme, each click data is continuously traversed forwards, data does not need to be captured for many times, and the calculation amount is small. It can be understood that, if the parameter information corresponding to the click data used in the second preset time interval is different, when the second click time point is updated, the parameter information corresponding to the current abnormal click data can be obtained, and the second time point corresponding to the parameter information is updated. It can be understood that, in order to avoid that the second time point is too much, before traversing the click data within the first preset time interval, the click data may be divided into a plurality of click parameter groups according to the corresponding parameter information, and each click parameter group is sequentially traversed, and when each parameter group is traversed, the second time point for updating the abnormal click data does not need to be directed to the parameter information.
Further, in order to improve the accuracy of the determination of the erroneous judgment data, referring to fig. 4, the step S24 includes:
step S241, when the time interval is smaller than a second preset time interval, acquiring exposure data and/or request data corresponding to the parameter information;
step S242, determining whether the traversed click data is abnormal based on the exposure data and/or the request data corresponding to the parameter information and the click data corresponding to the parameter information;
step S243, when the traversed click data is abnormal, marking the second data as abnormal click data.
In the present embodiment, taking an advertisement as an example, a user clicks the advertisement and accesses a homepage of the advertisement as one-click data; the user side terminal displays the advertisement as primary exposure data, and the user side terminal requests the advertisement as primary request data; it will be appreciated that the click data, exposure data, and request data may also be used for other information such as songs and pictures.
The misjudgment operation is not accurate enough only through the click data, exposure data and/or request data need to be combined, when whether the second data is abnormal or not is judged through the combination of the exposure data and/or the request data, the first click times of the second data in the first preset time interval corresponding to the parameter information can be obtained, and the exposure times and/or the request times of the second data in the first preset time interval corresponding to the parameter information are obtained through the combination of the exposure data and/or the request data; and acquiring a second click frequency corresponding to the exposure frequency and/or the request frequency based on a preset mapping relation between the exposure frequency and/or the request frequency and the click frequency, and judging that the second data is abnormal click data when the first click frequency is greater than the second click frequency.
2) Referring to fig. 5, the step S20 includes the steps of:
step S25, when traversing the first data marked as abnormal in the click data, acquiring second data which is the same as the first data parameter information and is not marked as abnormal in a second preset time interval before the click time point of the first data, wherein the second time interval is smaller than the first time interval;
step S26, when the second data is acquired, mark the second data as abnormal click data.
The data marked as abnormal carries abnormal marks, whether the data carries the abnormal marks is judged when traversing each data, when the data carries the abnormal marks, the click data is judged to be abnormal click data (first data), at the moment, parameter information of the click data and click data in a preset time interval before the click time point of the abnormal click data are obtained, whether the parameter information of the click data and the parameter information of the abnormal click data are the same or not is judged, when the parameter information is the same, whether the click data is not marked as abnormal or not can be judged, when the click data is not marked as abnormal, the click data is judged to be misjudged to be normal data, the abnormal marks are added to the click data, namely, the click data is marked as abnormal; or, when acquiring click data within a preset time interval before a click time point of abnormal click data (first data), judging whether the click data is marked as abnormal, and when the click data is not marked as abnormal, indicating that the click data is misjudged as normal data, adding an abnormal identifier to the click data, namely, marking the click data as abnormal. It is understood that, when traversing the click data which is not marked as abnormal in the click data, other click data may continue to be traversed in sequence until all click data within the first preset time interval is traversed.
The above traversal process is described below with specific examples, where the click data in the first time interval is preset as [ a1,A2,A3……,An]If the click time points of the click data are separated by 1s and the preset time interval is 5s, the time interval is defined as AnStart to traverse to A in turn1At AnWhen the click data A is marked as abnormal, the click data A within 5s can be acquiredn-1,An-2,An-3,An-4,An-5And obtaining An-1,An-2,An-3,An-4,An-5If A is the parameter information ofn-2And An-3Parameter (d) ofInformation and AnIf the parameter information is the same, then A is judgedn-2And An-3Whether an abnormal mark is carried or not is judged, if A isn-2When carrying an abnormal mark, An-3For misjudging data, pair An-3Adding an abnormal identifier; continue traversing click data An-1At An-1When the click data is not abnormal, continuously traversing the click data An-2At this time, click data A within 5s is acquiredn-3,An-4,An-5,An-6,An-7And continues with AnThe same misjudgment processing flow is carried out until the click data A is traversed1
According to the misjudgment processing method of the click data, according to the order of the click time points of the click data, the click data in a first preset time interval are sequentially traversed in a reverse order; in the traversal process, the click data which is not marked as abnormal in a second preset time interval before the click time point of each abnormal click data marked as abnormal and has the same parameter information with the abnormal click data is marked as abnormal data again, so that misjudgment of the click data is avoided.
Further, a second embodiment of the present invention is proposed based on the first embodiment of the method for processing misjudgment of click data. As shown in fig. 6, the misjudgment processing method of click data includes:
step S30, when receiving click data, acquiring the click data parameter information;
step S40, when the number of the click data with the parameter information same as the acquired parameter information in the second preset time interval is larger than the second preset time interval, marking the click data as abnormal click data;
and step S50, saving the received click data.
In this embodiment, each time one piece of click data is received, whether the number of the click data corresponding to the parameter information corresponding to the click data in a preset second time interval is greater than the second preset time interval is determined in real time, and if the number of the click data is greater than the second preset time interval, the click data is marked as abnormal click data and stored; and when the time interval is less than or equal to a second preset time interval, taking the click data as normal data and storing the normal data.
Further, to avoid the erroneous determination, the step S60 is followed by the step of:
when the time interval between the current time point and the last traversal time point is greater than or equal to the first preset time interval, the step S10 is executed.
It is understood that, in order to reduce the storage pressure of the server, after the obtained click data is marked as abnormal click data, the received click data and other click data except the click data within the first preset time interval are deleted.
The above-mentioned scheme is described below by taking click data as advertisement data and the click data is used for advertisement charging as an example, and with reference to fig. 7, the specific implementation process is as follows:
1: the click server transmits real-time click data to the charging server;
2: when receiving the click data, the charging server transmits the real-time click data to a real-time click anti-cheating system for anti-cheating judgment, wherein the specific anti-cheating judgment mode is as follows: when click data are received, acquiring click data parameter information; when the number of the click data with the parameter information same as the acquired parameter information in a second preset time interval is larger than the second preset time interval, marking the click data as abnormal click data, judging the click data as normal click data when the number of the click data is smaller than or equal to the second preset time interval, and feeding back the judgment result of the click data to the real-time click anti-cheating system;
clicking the anti-cheating system in real time to send the judgment result (abnormal or normal) to the charging server;
4, the charging server charges according to the result, wherein, when the click data is normal, the fee deduction is carried out, and when the click data is abnormal, the fee deduction operation is not carried out
Logjoin (data synthesis means) combines the request data, the exposure data, and the click data Join to generate click data after Join;
7 and 8, reading the click data after the join is read by the anti-cheating delay re-judgment system, performing delay re-judgment on the normal click, and generating re-judgment click data, wherein the delay re-judgment process comprises the following steps:
sequentially traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data;
in the traversal process, each abnormal click data marked as abnormal is marked as abnormal data within a second preset time interval before the click time point of the abnormal click data, and the click data which is the same as the parameter information of the abnormal click data is marked as abnormal data
9, sending the click data re-marked as abnormal by the anti-cheating delay re-judgment system to the charging server for refund operation.
Correspondingly, the invention provides a preferred embodiment of the misjudgment processing device for click data. Referring to fig. 8, the misjudgment processing apparatus for click data includes a traversal module 10 and a marking module 20, wherein:
the traversal module 10 is configured to sequentially traverse each click data within a first preset time interval in a reverse order according to the order of the click time points of the click data;
in this embodiment, the first preset time interval may be set by the user as needed, for example, the click data in one day or the click data in 2h, and the first preset time interval may be changed as needed after being set, or may be determined according to the capability of the server where the misjudgment processing device of the click data is located, for example, based on the size of the storage space of the server.
The method comprises the steps that click data of a user, which are acquired by an application server, are generally uploaded and stored to a preset storage system in real time, the storage system is preferably a distributed storage system, and the click data in a first preset time interval are pulled from the distributed storage system every other first preset time interval.
And each click data is sequentially stored according to the sequence of the click time points, so that when the click data in the first preset time interval is obtained, each click data is arranged according to the sequence. The parameter information of each click data may be a user identifier (e.g., user IP, ID used when the user logs in to the application, such as qq number, etc.), or may be a combination of the user identifier and other parameters, such as user identifier, advertiser, user identifier, advertisement name, etc.
The marking module 20 is configured to mark, in the traversal process, click data that is not marked as abnormal and is identical to parameter information of the abnormal click data as abnormal data within a second preset time interval before a click time point of each abnormal click data marked as abnormal.
For ease of understanding, the following is described in two specific embodiments, specifically as follows:
1) referring to fig. 9, the marking module 20 includes:
the updating unit 21 is configured to update a first click time point corresponding to the abnormal click data when the abnormal click data marked as abnormal in the click data is traversed;
the first obtaining unit 22 is configured to obtain a second click time point of click data when the click data which is not marked as abnormal in the click data is traversed, and obtain a time interval between a first click time point and the second click time point of the abnormal click data which is the same as parameter information of the click data;
the first marking unit 23 is configured to mark the click data as abnormal when the time interval is smaller than a second preset time interval.
The above traversal process is described below with specific examples, where the click data in the first time interval is preset as [ a1,A2,A3……,An]If the preset second time interval is 30min, the second time interval is determined from AnStart to traverse to A in turn1At AnMarked as abnormal and the click time point is 10:15, the click time point of the abnormal click data is updated to be 10:15, and the click data is continuously traversed at An-1Marked as abnormal and the click time point is 10:10, the click time point of the abnormal click data is updated to 10:10, and the process continues to passCalendar click data, in An-2Not marked as abnormal, and its click time point is 9:50, and its parameter information and An-1Is the same, when A is the samen-1And marking the abnormal click data until all the click data are traversed.
According to the scheme, each click data is continuously traversed forwards, data does not need to be captured for many times, and the calculation amount is small. It can be understood that, if the parameter information corresponding to the click data used in the second preset time interval is different, when the second click time point is updated, the parameter information corresponding to the current abnormal click data can be obtained, and the second time point corresponding to the parameter information is updated. It can be understood that, to avoid the second time point being too much, before traversing the click data within the first preset time interval, the traversal module 10 may divide the click data into a plurality of click parameter groups according to the corresponding parameter information, and sequentially traverse each click parameter group, and when traversing each parameter group, the second time point updating the abnormal click data does not need to be specific to the parameter information.
Further, in order to improve the accuracy of determining the erroneous judgment data, referring to fig. 10, the first marking means 23 includes:
an obtaining subunit 231, configured to obtain, when the time interval is smaller than a second preset time interval, exposure data and/or request data corresponding to the parameter information;
a determining subunit 232, configured to determine whether the traversed click data is abnormal based on the exposure data and/or the request data corresponding to the parameter information and the click data corresponding to the parameter information;
and a marking subunit 233, configured to mark the second data as abnormal click data when the traversed click data is abnormal.
In the present embodiment, taking an advertisement as an example, a user clicks the advertisement and accesses a homepage of the advertisement as one-click data; the user side terminal displays the advertisement as primary exposure data, and the user side terminal requests the advertisement as primary request data; it will be appreciated that the click data, exposure data, and request data may also be used for other information such as songs and pictures.
The misjudgment operation is not accurate enough only through the click data, exposure data and/or request data need to be combined, when whether the second data is abnormal or not is judged through the combination of the exposure data and/or the request data, the first click times of the second data in the first preset time interval corresponding to the parameter information can be obtained, and the exposure times and/or the request times of the second data in the first preset time interval corresponding to the parameter information are obtained through the combination of the exposure data and/or the request data; and acquiring a second click frequency corresponding to the exposure frequency and/or the request frequency based on a preset mapping relation between the exposure frequency and/or the request frequency and the click frequency, and judging that the second data is abnormal click data when the first click frequency is greater than the second click frequency.
2) Referring to fig. 11, the marking module 20 includes:
the second obtaining unit 21 is configured to, when traversing a first data marked as an exception in click data, obtain a second data that is the same as parameter information of the first data and is not marked as an exception in a second preset time interval before a click time point of the first data, where the second time interval is smaller than the first time interval;
and a second marking unit 22, configured to mark the second data as abnormal click data when the second data is acquired.
The data marked as abnormal carries abnormal marks, whether the data carries the abnormal marks is judged when traversing each data, when the data carries the abnormal marks, the click data is judged to be abnormal click data (first data), at the moment, parameter information of the click data and click data in a preset time interval before the click time point of the abnormal click data are obtained, whether the parameter information of the click data and the parameter information of the abnormal click data are the same or not is judged, when the parameter information is the same, whether the click data is not marked as abnormal or not can be judged, when the click data is not marked as abnormal, the click data is judged to be misjudged to be normal data, the abnormal marks are added to the click data, namely, the click data is marked as abnormal; or, when acquiring click data within a preset time interval before a click time point of abnormal click data (first data), judging whether the click data is marked as abnormal, and when the click data is not marked as abnormal, indicating that the click data is misjudged as normal data, adding an abnormal identifier to the click data, namely, marking the click data as abnormal. It is understood that, when traversing the click data which is not marked as abnormal in the click data, other click data may continue to be traversed in sequence until all click data within the first preset time interval is traversed.
The above traversal process is described below with specific examples, where the click data in the first time interval is preset as [ a1,A2,A3……,An]If the click time points of the click data are separated by 1s and the preset time interval is 5s, the time interval is defined as AnStart to traverse to A in turn1At AnWhen the click data A is marked as abnormal, the click data A within 5s can be acquiredn-1,An-2,An-3,An-4,An-5And obtaining An-1,An-2,An-3,An-4,An-5If A is the parameter information ofn-2And An-3Parameter information of (A)nIf the parameter information is the same, then A is judgedn-2And An-3Whether an abnormal mark is carried or not is judged, if A isn-2When carrying an abnormal mark, An-3For misjudging data, pair An-3Adding an abnormal identifier; continue traversing click data An-1At An-1When the click data is not abnormal, continuously traversing the click data An-2At this time, click data A within 5s is acquiredn-3,An-4,An-5,An-6,An-7And continues with AnThe same misjudgment processing flow is carried out until the click data A is traversed1
According to the misjudgment processing device for the click data, according to the sequence of the click time points of the click data, the click data in the first preset time interval are sequentially traversed in a reverse order; in the traversal process, the click data which is not marked as abnormal in a second preset time interval before the click time point of each abnormal click data marked as abnormal and has the same parameter information with the abnormal click data is marked as abnormal data again, so that misjudgment of the click data is avoided.
Further, a second embodiment of the present invention is proposed based on the first embodiment of the method for processing misjudgment of click data. As shown in fig. 12, the misjudgment processing apparatus for click data further includes:
the obtaining module 30 is configured to obtain click data parameter information when click data is received;
the marking module 20 is further configured to mark the click data as abnormal click data when the number of click data having parameter information that is the same as the acquired parameter information is greater than a second preset time interval;
and the storage module 40 is used for storing the received click data.
In this embodiment, each time one piece of click data is received, whether the number of the click data corresponding to the parameter information corresponding to the click data in a preset second time interval is greater than the second preset time interval is determined in real time, and if the number of the click data is greater than the second preset time interval, the click data is marked as abnormal click data and stored; and when the time interval is less than or equal to a second preset time interval, taking the click data as normal data and storing the normal data.
Further, to avoid the erroneous judgment, the traversal module 10 is further configured to sequentially traverse each click data within the first preset time interval in a reverse order according to the order of the click time points of the click data when the time interval between the current time point and the time point at which the previous traversal is performed is greater than or equal to the first preset time interval.
It is understood that, in order to reduce the storage pressure of the server, after the obtained click data is marked as abnormal click data, the received click data and other click data except the click data within the first preset time interval are deleted.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.

Claims (10)

1. A misjudgment processing method of click data is characterized by comprising the following steps:
traversing each click data in a first preset time interval in a reverse order according to the order of the click time points of the click data;
in the traversal process, dividing the click data into a plurality of click parameter groups according to the parameter information of the click data, respectively traversing each click parameter group, and marking the click data which is marked as abnormal and has the same parameter information with the abnormal click data as the abnormal click data and is not marked as abnormal in a second preset time interval before the click time point of the abnormal click data marked as abnormal;
the method comprises the following steps of marking click data which is marked as abnormal and has the same parameter information with abnormal click data as the abnormal click data as abnormal data, wherein the click data which is not marked as abnormal in a second preset time interval before the click time point of each abnormal click data marked as abnormal includes the following steps:
when first data marked as abnormal in click data is traversed, second data which is the same as the first data parameter information and is not marked as abnormal in a second preset time interval before the click time point of the first data is obtained, wherein the second preset time interval is smaller than the first preset time interval;
and when the second data is acquired, marking the second data as abnormal click data.
2. The method for processing misjudgment of click data according to claim 1, wherein the step of labeling each click data labeled as abnormal data within a second preset time interval before the click time point of the abnormal click data, and labeling the click data having the same parameter information as the abnormal click data includes:
updating a first click time point corresponding to the abnormal click data when the abnormal click data marked as abnormal in the click data is traversed;
when the click data which are not marked as abnormal in the click data are traversed, acquiring a second click time point of the click data;
acquiring a time interval between a first click time point and a second click time point of abnormal click data which is the same as the parameter information of the click data;
and when the time interval is smaller than a second preset time interval, marking the click data as abnormal click data.
3. The method for processing misjudgment of click data according to claim 2, wherein the step of marking the click data as abnormal when the time interval is smaller than a second preset time interval comprises:
when the time interval is smaller than a second preset time interval, acquiring exposure data and/or request data corresponding to the parameter information;
determining whether the traversed click data is abnormal or not based on the exposure data and/or the request data corresponding to the parameter information and the click data corresponding to the parameter information;
and when the traversed click data is abnormal, marking the second data as abnormal click data.
4. The method for processing misjudgment of click data according to any one of claims 1 to 3, wherein the method for processing misjudgment of click data further comprises the steps of:
when click data are received, acquiring click data parameter information;
when the number of the click data with the parameter information same as the acquired parameter information in a second preset time interval is larger than that in the second preset time interval, marking the click data as abnormal click data;
and saving the received click data.
5. The method for processing misjudgment of click data according to claim 4, wherein after the step of marking the click data as abnormal click data when the number of click data having parameter information identical to the acquired parameter information in a second preset time interval is greater than the second preset time interval, the method for processing misjudgment of click data comprises:
and when the time interval between the current time point and the last traversal time point is greater than or equal to the first preset time interval, executing the step of sequentially traversing each click data in the first preset time interval in a reverse order according to the sequence of the click time points of the click data.
6. A misjudgment processing device for click data is characterized by comprising:
the traversal module is used for sequentially traversing each click data in a first preset time interval according to the sequence of the click time points of the click data;
the marking module is used for dividing the click data into a plurality of click parameter groups according to the parameter information of the click data in the traversal process, marking the click data which is marked as abnormal and is the same as the parameter information of the abnormal click data as abnormal data within a second preset time interval before the click time point of each abnormal click data which is marked as abnormal;
wherein the marking module comprises:
the second obtaining unit is used for obtaining second data which is identical to the parameter information of the first data and is not marked as abnormal in a second preset time interval before the clicking time point of the first data when the first data marked as abnormal in the clicking data is traversed, wherein the second preset time interval is smaller than the first preset time interval;
and the second marking unit is used for marking the second data as abnormal click data when the second data is acquired.
7. The misjudgment processing device of click data according to claim 6, wherein the marking module includes:
the updating unit is used for updating a first click time point corresponding to the abnormal click data when the abnormal click data marked as abnormal in the click data is traversed;
the first acquisition unit is used for acquiring a second click time point of click data when the click data which is not marked as abnormal in the click data is traversed, and acquiring a time interval between a first click time point of the abnormal click data and the second click time point, wherein the first click time point and the second click time point are the same as the parameter information of the click data;
and the first marking unit is used for marking the click data as abnormal when the time interval is smaller than a second preset time interval.
8. The misjudgment processing apparatus for click data according to claim 7, wherein the first marking unit includes:
the obtaining subunit is configured to obtain, when the time interval is smaller than a second preset time interval, exposure data and/or request data corresponding to the parameter information;
the determining subunit is configured to determine whether the traversed click data is abnormal based on the exposure data and/or the request data corresponding to the parameter information and the click data corresponding to the parameter information;
and the marking subunit is used for marking the second data as abnormal click data when the traversed click data is abnormal.
9. The misjudgment processing apparatus for click data according to any one of claims 6 to 8, wherein the misjudgment processing apparatus for click data further comprises:
the acquisition module is used for acquiring the click data parameter information when receiving the click data;
the marking module is further used for marking the click data as abnormal click data when the number of the click data with the parameter information same as the acquired parameter information in a second preset time interval is larger than the second preset time interval;
and the storage module is used for storing the received click data.
10. The device for processing misjudgment of click data according to claim 9, wherein the traversal module is further configured to sequentially traverse each click data within a first preset time interval in a reverse order according to a sequence of click time points of the click data when a time interval between a current time point and a time point at which traversal was performed last time is greater than or equal to the first preset time interval.
CN201510473699.8A 2015-08-05 2015-08-05 Misjudgment processing method and device for click data Active CN106445935B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510473699.8A CN106445935B (en) 2015-08-05 2015-08-05 Misjudgment processing method and device for click data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510473699.8A CN106445935B (en) 2015-08-05 2015-08-05 Misjudgment processing method and device for click data

Publications (2)

Publication Number Publication Date
CN106445935A CN106445935A (en) 2017-02-22
CN106445935B true CN106445935B (en) 2021-06-25

Family

ID=59216446

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510473699.8A Active CN106445935B (en) 2015-08-05 2015-08-05 Misjudgment processing method and device for click data

Country Status (1)

Country Link
CN (1) CN106445935B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113867553B (en) * 2020-06-15 2023-09-26 武汉斗鱼鱼乐网络科技有限公司 Quick click processing method and device, storage medium and electronic equipment

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075908B (en) * 2006-11-08 2011-04-20 腾讯科技(深圳)有限公司 Method and system for accounting network click numbers
KR100857148B1 (en) * 2007-04-26 2008-09-05 엔에이치엔(주) Method for processing invalid click and system for executing the method
CN101393629A (en) * 2007-09-20 2009-03-25 阿里巴巴集团控股有限公司 Implementing method and apparatus for network advertisement effect monitoring
US8799069B2 (en) * 2007-12-21 2014-08-05 Yahoo! Inc. Mobile click fraud prevention
US8533825B1 (en) * 2010-02-04 2013-09-10 Adometry, Inc. System, method and computer program product for collusion detection
CN102271068B (en) * 2011-09-06 2015-07-15 电子科技大学 Method for detecting DOS/DDOS (denial of service/distributed denial of service) attack

Also Published As

Publication number Publication date
CN106445935A (en) 2017-02-22

Similar Documents

Publication Publication Date Title
CN105210051A (en) Estimating visibility of content items
CN104980446A (en) Detection method and system for malicious behavior
JP2010250795A (en) Device for measurement of meticulous reading rate, client terminal, meticulous reading rate measurement system, meticulous reading rate measuring method, and program
CN107993106B (en) Electronic invoice generation method and device
CN104834597A (en) Method and system for measuring application response duration
CN103186660A (en) Information source selection system, information source selection method, and program
CN106445935B (en) Misjudgment processing method and device for click data
CN107533688B (en) Service processing method and device
CN106998476B (en) Video viewing method and device based on geographic information system
CN109495319B (en) Method, device and equipment for determining fault information of CDN node
US8922491B2 (en) Method and apparatus enabling keystroke acceleration
CN103049445A (en) Method and device for inquiring pictorial information and picture state server
CN103198120A (en) Webpage presenting method and device and webpage content providing method and system
CN102572554B (en) A kind of information issuing system and method
JP6063056B2 (en) File trimming method, terminal, server, program, and recording medium
CN115860877A (en) Product marketing method, device, equipment and medium
CN107688951B (en) Information pushing method and device
CN106302821B (en) Data request method and equipment thereof
CN111127094B (en) Account matching method and device, electronic equipment and storage medium
CN113377473A (en) Task creation method and device, electronic equipment and computer readable storage medium
CN112364219A (en) Content distribution method and device, electronic equipment and storage medium
CN108804195B (en) Page display method and device, server and client
CN109388527B (en) Method for confirming operation validity, data processing method, device and equipment
JP2013023080A (en) Diagram line making system, time changing method of line and program
CN112686619A (en) Method and device for reminding order collection

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant