CN106294508A - A kind of brush amount tool detection method and device - Google Patents

A kind of brush amount tool detection method and device Download PDF

Info

Publication number
CN106294508A
CN106294508A CN201510317134.0A CN201510317134A CN106294508A CN 106294508 A CN106294508 A CN 106294508A CN 201510317134 A CN201510317134 A CN 201510317134A CN 106294508 A CN106294508 A CN 106294508A
Authority
CN
China
Prior art keywords
user
behavior characteristics
users
sorted group
measured
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510317134.0A
Other languages
Chinese (zh)
Other versions
CN106294508B (en
Inventor
贺海军
孔蓓蓓
熊健
熊焰
杨剑鸣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tencent Computer Systems Co Ltd
Original Assignee
Shenzhen Tencent Computer Systems Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Tencent Computer Systems Co Ltd filed Critical Shenzhen Tencent Computer Systems Co Ltd
Priority to CN201510317134.0A priority Critical patent/CN106294508B/en
Publication of CN106294508A publication Critical patent/CN106294508A/en
Application granted granted Critical
Publication of CN106294508B publication Critical patent/CN106294508B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

This application discloses a kind of brush amount tool detection method and device, method includes: obtain the behavioral data of at least one user corresponding with channel to be measured, the behavior characteristics of each user is generated according to the behavioral data of each user, the behavior characteristics utilizing each user calculates the behavior characteristics value of each user, with behavior characteristics value as class condition, user is classified, obtain several sorted group, when determining that in described sorted group, number of users meets preset decision plan, determine that described channel to be measured employs brush amount instrument.Owing to the behavior characteristics value of the user in each sorted group is similar or identical, if thus the number of users that comprises in one or more sorted group is the biggest, in then meaning this sorted group, most of user is fictitious users, namely may determine that channel to be measured employs brush amount instrument.The application method avoids existing distribution based on hardware access environment property parameters and detects the confinement problems that the method for brush amount instrument is latched in.

Description

A kind of brush amount tool detection method and device
Technical field
The application relates to networking technology area, more particularly, it relates to a kind of brush amount tool detection method and Device.
Background technology
The intelligent terminal such as smart mobile phone popularize as various types of application APP (Application, application) Provide carrier.Application provider, in order to promote oneself application, shows application by kinds of platform to user And download function is provided, each platform is all considered as a channel of application.And as feedback, After having new user to be have registered application account by certain channel, application provider needs to prop up to this channel Pay certain development expenses.
In reality, some channel (platform), can profit in order to gain the development expenses of application provider by cheating Generate false new user with brush amount instrument, process substantially: channel utilize brush amount instrument random or based on Existing subscriber's data file, generates terminal unit IMEI (International Mobile Equipment Identity, International Mobile Equipment Identity code), IMSI (International Mobile Subscriber Identification Number, international mobile subscriber identity), MAC (Media Access Control, Media interviews control) address, screen resolution, type, SIM number, cell-phone number, operator numbering or The various parameters such as title, mobile phone operating system (OS) version, then using generate these parameters as newly Customer parameter is registered.And for application vendor's end, typically merely by distinguishing that IMEI judges Whether this user is new user, as long as namely unregistered mistake before the IMEI of current registration user, then it is assumed that Current registration user is a new user, and then pays development expenses to corresponding channel.
In order to avoid the appearance of the problems referred to above, typically by detecting new registration under current channel in prior art The IMSI of user, MAC Address, screen resolution, type, etc. hardware access environment property parameters point Cloth is the most normal, differentiates and whether employs brush amount instrument under current channel.As a example by screen resolution, If the screen resolution distribution of user has the biggest with all channel sub-screen resolution distribution under current channel Difference, then mean that current channel may have been used brush amount instrument, citing is such as, new in next sky of channel 1 Registering the screen resolution of user and have 80% as screen resolution 1, remaining 20% is screen resolution 2 and screen Curtain resolution 3, and under all channels, register the screen that the full screen resolution distribution situation of user is as 50% The screen resolution 3 of resolution 2,30% and the screen resolution 1 of 20%.Obviously, the screen of channel 1 is differentiated Rate distribution has the biggest difference with full screen resolution distribution, represents channel 1 and may have been used brush amount work Tool.
But, existing distribution based on hardware access environment property parameters detects the method tool of brush amount instrument There is certain limitation, the meeting when generating hardware and accessing environment property parameters of the brush amount instrument of some high-quality Generate based on the distribution under each attribute truth, under the distribution situation of the most each parameter and normal channels Each parameter distribution is consistent.For using the channel of this type of brush amount instrument, prior art will be unable to detect.
Summary of the invention
In view of this, this application provides a kind of brush amount tool detection method and device, be used for solving existing There is circumscribed asking in the method that distribution based on hardware access environment property parameters detects brush amount instrument Topic.
To achieve these goals, it is proposed that scheme as follows:
A kind of brush amount tool detection method, including:
Obtain the behavioral data of at least one user corresponding with channel to be measured;
According to the behavioral data of each user, generate the behavior characteristics of each user;
Utilize the behavior characteristics of each user, calculate the behavior characteristics value of each user;
With behavior characteristics value as class condition, each user is classified, obtain several sorted group;
When determining that in described sorted group, number of users meets preset decision plan, determine described channel to be measured Employ brush amount instrument.
A kind of brush amount tool detection device, including:
Behavioral data acquiring unit, for obtaining the behavior number of at least one user corresponding with channel to be measured According to;
Behavior characteristics signal generating unit, for the behavioral data according to each user, generates the row of each user It is characterized;
Behavior characteristics value computing unit, for utilizing the behavior characteristics of each user, calculates each user's Behavior characteristics value;
Classification processing unit, for behavior characteristics value as class condition, classifies to each user, Obtain several sorted group;
Brush amount instrument determines unit, meets preset judgement plan for number of users in determining described sorted group Time slightly, determine that described channel to be measured employs brush amount instrument.
From above-mentioned technical scheme it can be seen that the embodiment of the present application provide brush amount tool detection method, Obtaining the behavioral data of at least one user corresponding with channel to be measured, behavioral data characterizes user and logs in Various operations performed after application, and then each user can be generated according to the behavioral data of each user Behavior characteristics, then utilize the behavior characteristics of each user to calculate the behavior characteristics value of each user, with Behavior characteristics value is class condition, classifies user, obtains several sorted group, due to each point The behavior characteristics value of the user in class group is similar or identical, if thus comprised in one or more sorted group Number of users very big, then mean that major part user is fictitious users in this sorted group, namely permissible Determine that channel to be measured employs brush amount instrument.Multiple fictitious users that the application utilizes brush amount instrument to produce exist The principle of certain similarity is possessed in behavior, by calculating the behavior characteristics value of each user and the most right User clusters, and then can judge whether channel to be measured uses brush amount instrument, keeps away according to predetermined policy Exempt from existing to access the distribution of environment property parameters based on hardware and detect what the method for brush amount instrument was latched in Confinement problems.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present application or technical scheme of the prior art, below will be to reality Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that below, Accompanying drawing in description is only embodiments herein, for those of ordinary skill in the art, not On the premise of paying creative work, it is also possible to obtain other accompanying drawing according to the accompanying drawing provided.
Fig. 1 is a kind of brush amount tool detection method flow diagram disclosed in the embodiment of the present application;
Fig. 2 is a kind of method flow determining channel to be measured use brush amount instrument disclosed in the embodiment of the present application Figure;
Fig. 3 is the another kind of method stream determining channel to be measured use brush amount instrument disclosed in the embodiment of the present application Cheng Tu;
Fig. 4 is a kind of brush amount tool detection apparatus structure schematic diagram disclosed in the embodiment of the present application;
Fig. 5 is that disclosed in the embodiment of the present application, a kind of brush amount instrument determines cellular construction schematic diagram;
Fig. 6 is that disclosed in the embodiment of the present application, another kind of brush amount instrument determines cellular construction schematic diagram;
Fig. 7 is a kind of behavior characteristics computing unit structural representation disclosed in the embodiment of the present application;
Fig. 8 is a kind of classification processing unit structural representation disclosed in the embodiment of the present application;
Fig. 9 is a kind of server hardware structural representation disclosed in the embodiment of the present application.
Detailed description of the invention
Before introducing the application scheme, first title or phrase to using in literary composition explains:
SimHash is a kind of dimensionality reduction technology, high dimension vector can be mapped to one-dimensional fingerprint, and it is the earliest Proposed by Google, for removing duplicate webpages.The input of SimHash algorithm is a vector, and output is one The fingerprint of individual f position.In order to state conveniently, it is assumed that input is the characteristic set of a user behavior, Mei Gete Levying the weight of correspondence, SimHash algorithm is as follows:
1. the vectorial V of f dimension (f takes 64 herein) is initialized as 0, at the beginning of the binary number S of f position Begin to turn to 0;
2. pair each feature: this feature produces the fingerprint b of a f position by traditional hash algorithm is right I=1 to f, if the i-th bit of b is 1, then the i-th element of V (is weighed plus the weight of this feature herein It is heavily 1);Otherwise, the i-th element of V deducts the weight (weight is 1 herein) of this feature;
3. if the i-th element of V is more than 0, then the i-th bit of S is 1, is otherwise 0;
4. output fingerprint S.
Below in conjunction with the accompanying drawing in the embodiment of the present application, the technical scheme in the embodiment of the present application is carried out Clearly and completely describe, it is clear that described embodiment is only some embodiments of the present application, and It is not all, of embodiment.Based on the embodiment in the application, those of ordinary skill in the art are not doing Go out the every other embodiment obtained under creative work premise, broadly fall into the scope of the application protection.
Seeing Fig. 1, Fig. 1 is a kind of brush amount tool detection method flow diagram disclosed in the embodiment of the present application.
As it is shown in figure 1, the method includes:
Step S100, obtain the behavioral data of at least one user corresponding with channel to be measured;
The user corresponding with channel to be measured refers to the user registered by channel to be measured.For this type of user, Obtain it and log in the behavioral data produced after application.
Specifically, application server have recorded the behavioral data of each login user, takes in behavioral data With channel mark, this channel mark shows active user by which channel is registered.
Behavioral data can comprise multiple, particular type need according to application type depending on, such as For the application of the precious this kind of offer download function of application, its behavioral data can include off-the-shelf software scene Amount of action, renewal that lower click is downloaded push to expose under behavior record quantity, fine work scene under scene and move The quantity etc. made.
Certainly, for most of application, it generally individually can relate to following several behavioral data: uses Family access record quantity, user's access duration time, the type that starts, login type, user participate in difference The quantity of scene and the quantity etc. of user's click action.
It should be noted that we can obtain the behavior number of all users corresponding with channel to be measured According to, only obtain the behavioral data of the new user registered in the same day or a period of time by channel to be measured.
Step S110, behavioral data according to each user, generate the behavior characteristics of each user;
Specifically, after obtaining the behavioral data of user, generate each user's according to preset method Behavior characteristics.Behavior characteristics will behavioral data quantify after embodiment.User such as, is accessed by citing in advance Record quantity is divided into multiple interval, and it is interval i.e. which the access record quantity of the user of acquisition specifically falls into For a kind of behavior characteristics.
Step S120, utilize the behavior characteristics of each user, calculate the behavior characteristics value of each user;
The behavior characteristics of each user may comprise multiple, thus can use certain algorithm, based on The behavior characteristics set of user, calculates the behavior characteristics value of user.
Ratio preferably, can use SimHash algorithm.
Step S130, with behavior characteristics value as class condition, each user is classified, obtains some Individual sorted group;
Specifically, in each sorted group, the eigenvalue of user possesses certain similarity, namely represents same In sorted group, the behavior of user is similar.
Step S140, when determining that in described sorted group, number of users meets preset decision plan, determine institute State channel to be measured and employ brush amount instrument.
Specifically, preset decision plan can have multiple, referring particularly to hereafter relevant portion introduction.Due to Multiple fictitious users produced by brush amount instrument possess certain similarity in behavior, are embodied in behavior characteristics In value the most similar or identical, therefore by analyzing number of users in each sorted group, i.e. can determine that canal to be measured Whether road employs brush amount instrument.
The brush amount tool detection method that the embodiment of the present application provides, obtains at least corresponding with channel to be measured The behavioral data of individual user, behavioral data characterizes various operations performed after user logs in application, enters And the behavior characteristics of each user can be generated according to the behavioral data of each user, then utilize each to use The behavior characteristics at family calculates the behavior characteristics value of each user, with behavior characteristics value as class condition, to Classifying in family, obtains several sorted group, due to the behavior characteristics value phase of the user in each sorted group Like or identical, if thus the number of users comprised in one or more sorted group is very big, then mean In this sorted group, major part user is fictitious users, namely may determine that channel to be measured employs brush amount instrument. Multiple fictitious users that the application utilizes brush amount instrument to produce possess the principle of certain similarity in behavior, By calculating the behavior characteristics value of each user, and accordingly user is clustered, and then can be according in advance Fixed strategy judges whether channel to be measured uses brush amount instrument, it is to avoid existing access environment attribute based on hardware The distribution of parameter detects the confinement problems that the method for brush amount instrument is latched in.
Optionally, with behavior characteristics value as class condition, when each user is classified, can be by The user that behavior characteristics value is identical is divided into a class.It is, of course, also possible to classify according to Else Rule, Such as when behavior characteristics value is binary system group number form formula, can be individual less than presetting by the number of different place values The user that the behavior characteristics value of number is corresponding is divided into a class, as two behavior characteristics values are respectively 1011 Hes 1111, and predetermined number is 2, the most obvious two behavior characteristics values value of the 2nd the most from left to right is different, because of Two users of behavior characteristics value 1011 and 1111 can be divided into a class by this.
In another embodiment of the application, disclose several different decision plan, to be measured to determine Whether channel employs brush amount instrument.
See Fig. 2, Fig. 2 be disclosed in the embodiment of the present application a kind of determine channel to be measured use brush amount instrument Method flow diagram.
As in figure 2 it is shown, the method includes:
Step S200, all sorted group filter out number of users exceed the sorted group of first threshold;
The total number of users amount that the sorted group that step S210, statistics filter out is comprised;
Each sorted group obtained for screening, is added the number of users in each sorted group, is used Family total quantity.
Step S220, judge in described total number of users amount and all sorted group the ratio of total number of users amount whether Exceed Second Threshold, the most then perform step S230;
Step S230, determine that described channel to be measured employs brush amount instrument.
In the present embodiment, from all users, filter out the user that similar behavior is higher, it is judged that this type of user Whether the proportion accounting for total user number exceedes certain value, if then thinking that current channel to be measured employs brush amount Instrument.
Such as, corresponding with channel A to be measured has 200 users, through uniting mass data in citing Meter, determines that first threshold is 20, and Second Threshold is 50%.6 sorted group have been obtained through classification, point Not Wei sorted group 1-6, the number of users in each sorted group is followed successively by 100,80,10,5,3,2.Aobvious So, the number in the 1st and the 2nd sorted group has exceeded first threshold, and in the two sorted group, number is total With for 100+80=180, it is judged that 180/200=90% > 50%, it is thus determined that channel A to be measured employs brush Amount instrument.
Seeing Fig. 3, Fig. 3 is that another kind disclosed in the embodiment of the present application determines that channel to be measured uses brush amount instrument Method flow diagram.
As it is shown on figure 3, the method includes:
Step S300, by many orders the most at least, described sorted group is ranked up according to comprising number of users;
The total number of users amount that before in step S310, statistical series, n sorted group is comprised;
Here n value is preset constant, can be the constants that arbitrarily user sets such as 1,2, and certain n needs Less than the total number of sorted group.
Step S320, judge in described total number of users amount and all sorted group the ratio of total number of users amount whether More than the 3rd threshold value, the most then perform step S330;
Step S330, determine that described channel to be measured employs brush amount instrument.
Present embodiments provide another method determining that channel to be measured uses brush amount instrument, compared to upper One embodiment, selects in this enforcement to comprise front n the sorted group that number of users is more, it is judged that this n point Whether the proportion of class group total number of persons and all sorted group total numbers of persons is more than the 3rd threshold value, if then determining to be measured Channel employs brush amount instrument.
Such as, corresponding with channel B to be measured has 200 users, through uniting mass data in citing Meter, determines that n is 3, and the 3rd threshold value is 60%.6 sorted group, 6 sorted group have been obtained through classification The number of users comprised is as shown in table 1 below:
Table 1
According to comprising number of users order from large to small, sorted group is ranked up: 1-3-4-6-5-2.Choosing Take front 3 sorted group, i.e. sorted group 1, sorted group 3 and sorted group 4, ask for three sorted group numbers total With for 120+50+15=185.Judge 185/200=92.5% > 60%, it is thus determined that channel B to be measured uses Brush amount instrument.
It follows that in the present embodiment as a example by the precious software of application, whole scheme is introduced.
The behavioral data of each user corresponding with channel to be measured can be obtained by the precious software server of application, Behavioral data includes the attribute shown in table 2 below:
Table 2
According to the behavioral data of each user, generate the behavior characteristics of each user, the attribute of behavior characteristics It is described as follows table 3:
Table 3
Utilize SimHash algorithm, behavior characteristics set based on each user, calculate the row of each user It is characterized value, and to gather to the user that behavior characteristics value is identical be a class, obtain several sorted group.Choose Comprise the sorted group that number of users is most, by number of users in this sorted group divided by user in all sorted group Total quantity, it is judged that whether result is more than preset value, if it is, think that current channel to be measured employs brush Amount instrument.
The brush amount tool detection device provided the embodiment of the present application below is described, brush described below Amount tool detection device can be mutually to should refer to above-described brush amount tool detection method.
Seeing Fig. 4, Fig. 4 is a kind of brush amount tool detection apparatus structure schematic diagram disclosed in the embodiment of the present application.
As shown in Figure 4, this device includes:
Behavioral data acquiring unit 41, for obtaining the behavior of at least one user corresponding with channel to be measured Data;
Behavior characteristics signal generating unit 42, for the behavioral data according to each user, generates each user's Behavior characteristics;
Behavior characteristics value computing unit 43, for utilizing the behavior characteristics of each user, calculates each user Behavior characteristics value;
Classification processing unit 44, for behavior characteristics value as class condition, classifies to each user, Obtain several sorted group;
Brush amount instrument determines unit 45, meets preset judgement for number of users in determining described sorted group During strategy, determine that described channel to be measured employs brush amount instrument.
Optionally, Fig. 5 illustrates above-mentioned brush amount instrument and determines the optional structure of one of unit 45, such as Fig. 5 institute Showing, brush amount instrument determines that unit 45 may include that
Sorted group screening unit 451, exceedes first threshold for filtering out number of users in all sorted group Sorted group;
First user quantity statistics unit 452, for adding up the total number of users that the sorted group filtered out is comprised Amount;
First ratio in judgement unit 453, is used for judging that described total number of users amount is total with user in all sorted group Whether the ratio of quantity exceedes Second Threshold, if, it is determined that described channel to be measured employs brush amount instrument.
Optionally, Fig. 6 illustrates above-mentioned brush amount instrument and determines the optional structure of another kind of unit 45, such as Fig. 6 Shown in, brush amount instrument determines that unit 45 may include that
Sorted group sequencing unit 454, for according to comprising number of users by many orders at least to described classification Group is ranked up;
Second number of users statistic unit 455, in statistical series before the user that comprised of n sorted group Total quantity;
Second ratio in judgement unit 456, is used for judging that described total number of users amount is total with user in all sorted group The ratio of quantity whether more than the 3rd threshold value, if, it is determined that described channel to be measured employs brush amount instrument.
Optionally, Fig. 7 illustrates the optional structure of one of above-mentioned behavior characteristics value computing unit 43, such as Fig. 7 Shown in, behavior characteristics value computing unit 43 may include that
The first row is characterized value computation subunit 431, is used for using SimHash algorithm, uses based on each The behavior characteristics set at family, calculates the behavior characteristics value of each user.
Optionally, Fig. 8 illustrates the optional structure of one of above-mentioned classification processing unit 44, as shown in Figure 8, Classification processing unit 44 may include that
First classification processes subelement 441, for user identical for behavior characteristics value is divided into a class.
It is, of course, also possible to classify according to Else Rule, such as, it is binary system group number when behavior characteristics value During form, user corresponding less than the behavior characteristics value of predetermined number for the number of different place values can be divided Being a class, as two behavior characteristics values are respectively 1011 and 1111, and predetermined number is 2, the most obvious two Behavior characteristics value only has the value difference of the 2nd the most from left to right, therefore can be by the two of behavior characteristics value 1011 and 1111 Individual user is divided into a class.
The brush amount tool detection device that the embodiment of the present application provides, obtains at least corresponding with channel to be measured The behavioral data of individual user, behavioral data characterizes various operations performed after user logs in application, enters And the behavior characteristics of each user can be generated according to the behavioral data of each user, then utilize each to use The behavior characteristics at family calculates the behavior characteristics value of each user, with behavior characteristics value as class condition, to Classifying in family, obtains several sorted group, due to the behavior characteristics value phase of the user in each sorted group Like or identical, if thus the number of users comprised in one or more sorted group is very big, then mean In this sorted group, major part user is fictitious users, namely may determine that channel to be measured employs brush amount instrument. Multiple fictitious users that the application utilizes brush amount instrument to produce possess the principle of certain similarity in behavior, By calculating the behavior characteristics value of each user, and accordingly user is clustered, and then can be according in advance Fixed strategy judges whether channel to be measured uses brush amount instrument, it is to avoid existing access environment attribute based on hardware The distribution of parameter detects the confinement problems that the method for brush amount instrument is latched in.
The embodiment of the present application also provides for a kind of server, and this server can include brush amount work described above Tool detection device, the description for brush amount tool detection device can refer to corresponding part description above, herein Repeat no more.
Below the hardware configuration of the server that the embodiment of the present application provides is described, relates in being described below And the part of account management can refer to corresponding part above and describes.The service that Fig. 9 provides for the embodiment of the present application The hardware architecture diagram of device, with reference to Fig. 9, this server may include that
Processor 1, communication interface 2, memorizer 3, communication bus 4, and display screen 5;
Wherein processor 1, communication interface 2, memorizer 3 and display screen 5 complete phase by communication bus 4 Communication between Hu;
Optionally, communication interface 2 can be the interface of communication module, such as the interface of gsm module;
Processor 1, is used for performing program;
Memorizer 3, is used for depositing program;
Program can include that program code, described program code include the operational order of processor.
Processor 1 is probably a central processor CPU, or specific integrated circuit ASIC (Application Specific Integrated Circuit), or be configured to implement the embodiment of the present application One or more integrated circuits.
Memorizer 3 may comprise high-speed RAM memorizer, it is also possible to also includes nonvolatile memory (non-volatile memory), for example, at least one disk memory.
Wherein, program can be specifically for:
Obtain the behavioral data of at least one user corresponding with channel to be measured;
According to the behavioral data of each user, generate the behavior characteristics of each user;
Utilize the behavior characteristics of each user, calculate the behavior characteristics value of each user;
With behavior characteristics value as class condition, each user is classified, obtain several sorted group;
When determining that in described sorted group, number of users meets preset decision plan, determine described channel to be measured Employ brush amount instrument.
Finally, in addition it is also necessary to explanation, in this article, the relational terms of such as first and second or the like It is used merely to separate an entity or operation with another entity or operating space, and not necessarily requires Or imply relation or the order that there is any this reality between these entities or operation.And, art Language " includes ", " comprising " or its any other variant are intended to comprising of nonexcludability, thus Make to include that the process of a series of key element, method, article or equipment not only include those key elements, and Also include other key elements being not expressly set out, or also include for this process, method, article or The key element that person's equipment is intrinsic.In the case of there is no more restriction, by statement " including ... " The key element limited, it is not excluded that also deposit in including the process of described key element, method, article or equipment In other identical element.
In this specification, each embodiment uses the mode gone forward one by one to describe, and each embodiment stresses Being the difference with other embodiments, between each embodiment, identical similar portion sees mutually.
Described above to the disclosed embodiments, makes professional and technical personnel in the field be capable of or uses The application.Multiple amendment to these embodiments will be aobvious and easy for those skilled in the art See, generic principles defined herein can in the case of without departing from spirit herein or scope, Realize in other embodiments.Therefore, the application is not intended to be limited to the embodiments shown herein, And it is to fit to the widest scope consistent with principles disclosed herein and features of novelty.

Claims (10)

1. a brush amount tool detection method, it is characterised in that including:
Obtain the behavioral data of at least one user corresponding with channel to be measured;
According to the behavioral data of each user, generate the behavior characteristics of each user;
Utilize the behavior characteristics of each user, calculate the behavior characteristics value of each user;
With behavior characteristics value as class condition, each user is classified, obtain several sorted group;
When determining that in described sorted group, number of users meets preset decision plan, determine described channel to be measured Employ brush amount instrument.
Method the most according to claim 1, it is characterised in that described in determining described sorted group When number of users meets preset decision plan, determine that described channel to be measured employs the process bag of brush amount instrument Include:
In all sorted group, filter out number of users exceed the sorted group of first threshold;
The total number of users amount that the sorted group that statistics filters out is comprised;
Judge that whether described total number of users amount and the ratio of total number of users amount in all sorted group are more than the second threshold Value, if, it is determined that described channel to be measured employs brush amount instrument.
Method the most according to claim 1, it is characterised in that described in determining described sorted group When number of users meets preset decision plan, determine that described channel to be measured employs the process bag of brush amount instrument Include:
Described sorted group is ranked up by many orders at least according to comprising number of users;
The total number of users amount that before in statistical series, n sorted group is comprised;
Judge that whether described total number of users amount and the ratio of total number of users amount in all sorted group are more than the 3rd threshold Value, if, it is determined that described channel to be measured employs brush amount instrument.
Method the most according to claim 1, it is characterised in that the behavior characteristics of each user is many Individual, the described behavior characteristics utilizing each user, calculate the behavior characteristics value of each user, including:
Use SimHash algorithm, behavior characteristics set based on each user, calculate the row of each user It is characterized value.
Method the most according to claim 1, it is characterised in that described with behavior characteristics value for classification Condition, classifies to each user, including:
User identical for behavior characteristics value is divided into a class.
6. according to the method described in any one of claim 1-5, it is characterised in that described user behavior number According to including that user accesses record quantity, user's access duration time, the type that starts, login type, user Participate in the different quantity of scene and the quantity of user's click action is any one or more.
7. a brush amount tool detection device, it is characterised in that including:
Behavioral data acquiring unit, for obtaining the behavior number of at least one user corresponding with channel to be measured According to;
Behavior characteristics signal generating unit, for the behavioral data according to each user, generates the row of each user It is characterized;
Behavior characteristics value computing unit, for utilizing the behavior characteristics of each user, calculates each user's Behavior characteristics value;
Classification processing unit, for behavior characteristics value as class condition, classifies to each user, Obtain several sorted group;
Brush amount instrument determines unit, meets preset judgement plan for number of users in determining described sorted group Time slightly, determine that described channel to be measured employs brush amount instrument.
Device the most according to claim 7, it is characterised in that described brush amount instrument determines unit bag Include:
Sorted group screening unit, exceedes first threshold for filtering out number of users in all sorted group Sorted group;
First user quantity statistics unit, for adding up the total number of users amount that the sorted group filtered out is comprised;
First ratio in judgement unit, is used for judging described total number of users amount and total number of users in all sorted group Whether the ratio of amount exceedes Second Threshold, if, it is determined that described channel to be measured employs brush amount instrument.
Device the most according to claim 7, it is characterised in that described brush amount instrument determines unit bag Include:
Sorted group sequencing unit, for according to comprising number of users by many orders at least to described sorted group It is ranked up;
Second number of users statistic unit, in statistical series before the user that comprised of n sorted group total Quantity;
Second ratio in judgement unit, is used for judging described total number of users amount and total number of users in all sorted group Amount ratio whether more than the 3rd threshold value, if, it is determined that described channel to be measured employs brush amount instrument.
Device the most according to claim 7, it is characterised in that the behavior characteristics of each user is Multiple, described behavior characteristics value computing unit includes:
The first row is characterized value computation subunit, is used for using SimHash algorithm, based on each user Behavior characteristics set, calculates the behavior characteristics value of each user.
CN201510317134.0A 2015-06-10 2015-06-10 Brushing amount tool detection method and device Active CN106294508B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510317134.0A CN106294508B (en) 2015-06-10 2015-06-10 Brushing amount tool detection method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510317134.0A CN106294508B (en) 2015-06-10 2015-06-10 Brushing amount tool detection method and device

Publications (2)

Publication Number Publication Date
CN106294508A true CN106294508A (en) 2017-01-04
CN106294508B CN106294508B (en) 2020-02-11

Family

ID=57659601

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510317134.0A Active CN106294508B (en) 2015-06-10 2015-06-10 Brushing amount tool detection method and device

Country Status (1)

Country Link
CN (1) CN106294508B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106844655A (en) * 2017-01-23 2017-06-13 广东欧珀移动通信有限公司 Information processing method, device and computer equipment
CN107124391A (en) * 2016-09-22 2017-09-01 北京小度信息科技有限公司 The recognition methods of abnormal behaviour and device
CN107193972A (en) * 2017-05-25 2017-09-22 山东浪潮云服务信息科技有限公司 A kind of sorted users method and device based on big data
CN107707642A (en) * 2017-09-22 2018-02-16 广东欧珀移动通信有限公司 Brush amount terminal determines method and device
CN109284380A (en) * 2018-09-25 2019-01-29 平安科技(深圳)有限公司 Illegal user's recognition methods and device, electronic equipment based on big data analysis
WO2019041200A1 (en) * 2017-08-30 2019-03-07 深圳市云中飞网络科技有限公司 Method and apparatus for determining resources for increasing downloading quantities
WO2019041199A1 (en) * 2017-08-30 2019-03-07 深圳市云中飞网络科技有限公司 Method and apparatus for determining downloading quantity increase terminal
CN110189165A (en) * 2019-05-14 2019-08-30 微梦创科网络科技(中国)有限公司 Channel abnormal user and abnormal channel recognition methods and device
CN110378112A (en) * 2019-07-08 2019-10-25 北京达佳互联信息技术有限公司 A kind of user identification method and device
CN111368169A (en) * 2018-12-25 2020-07-03 卓望数码技术(深圳)有限公司 Method, device, equipment and storage medium for detecting brushing amount behavior
CN111639681A (en) * 2020-05-09 2020-09-08 同济大学 Early warning method, system, medium and device based on education drive type fraud
CN113383362A (en) * 2019-06-24 2021-09-10 深圳市欢太科技有限公司 User identification method and related product

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102521248A (en) * 2011-11-14 2012-06-27 北京亿赞普网络技术有限公司 Network user classification method and device
CN102769851A (en) * 2011-05-06 2012-11-07 中国移动通信集团广东有限公司 Method and system for monitoring service provider services
CN103793484A (en) * 2014-01-17 2014-05-14 五八同城信息技术有限公司 Fraudulent conduct identification system based on machine learning in classified information website
CN103853841A (en) * 2014-03-19 2014-06-11 北京邮电大学 Method for analyzing abnormal behavior of user in social networking site
CN103886068A (en) * 2014-03-20 2014-06-25 北京国双科技有限公司 Data processing method and device for Internet user behavior analysis
CN104281882A (en) * 2014-09-16 2015-01-14 中国科学院信息工程研究所 Method and system for predicting social network information popularity on basis of user characteristics
CN104298719A (en) * 2014-09-23 2015-01-21 新浪网技术(中国)有限公司 Method and system for conducting user category classification and advertisement putting based on social behavior
CN104424433A (en) * 2013-08-22 2015-03-18 腾讯科技(深圳)有限公司 Anti-cheating method and anti-cheating system of application program

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102769851A (en) * 2011-05-06 2012-11-07 中国移动通信集团广东有限公司 Method and system for monitoring service provider services
CN102521248A (en) * 2011-11-14 2012-06-27 北京亿赞普网络技术有限公司 Network user classification method and device
CN104424433A (en) * 2013-08-22 2015-03-18 腾讯科技(深圳)有限公司 Anti-cheating method and anti-cheating system of application program
CN103793484A (en) * 2014-01-17 2014-05-14 五八同城信息技术有限公司 Fraudulent conduct identification system based on machine learning in classified information website
CN103853841A (en) * 2014-03-19 2014-06-11 北京邮电大学 Method for analyzing abnormal behavior of user in social networking site
CN103886068A (en) * 2014-03-20 2014-06-25 北京国双科技有限公司 Data processing method and device for Internet user behavior analysis
CN104281882A (en) * 2014-09-16 2015-01-14 中国科学院信息工程研究所 Method and system for predicting social network information popularity on basis of user characteristics
CN104298719A (en) * 2014-09-23 2015-01-21 新浪网技术(中国)有限公司 Method and system for conducting user category classification and advertisement putting based on social behavior

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107124391A (en) * 2016-09-22 2017-09-01 北京小度信息科技有限公司 The recognition methods of abnormal behaviour and device
CN107124391B (en) * 2016-09-22 2021-11-16 北京星选科技有限公司 Abnormal behavior identification method and device
CN106844655A (en) * 2017-01-23 2017-06-13 广东欧珀移动通信有限公司 Information processing method, device and computer equipment
CN107193972A (en) * 2017-05-25 2017-09-22 山东浪潮云服务信息科技有限公司 A kind of sorted users method and device based on big data
WO2019041200A1 (en) * 2017-08-30 2019-03-07 深圳市云中飞网络科技有限公司 Method and apparatus for determining resources for increasing downloading quantities
WO2019041199A1 (en) * 2017-08-30 2019-03-07 深圳市云中飞网络科技有限公司 Method and apparatus for determining downloading quantity increase terminal
CN107707642A (en) * 2017-09-22 2018-02-16 广东欧珀移动通信有限公司 Brush amount terminal determines method and device
CN109284380A (en) * 2018-09-25 2019-01-29 平安科技(深圳)有限公司 Illegal user's recognition methods and device, electronic equipment based on big data analysis
CN109284380B (en) * 2018-09-25 2023-04-25 平安科技(深圳)有限公司 Illegal user identification method and device based on big data analysis and electronic equipment
WO2020062690A1 (en) * 2018-09-25 2020-04-02 平安科技(深圳)有限公司 Method and apparatus for illegal user identification based on big data analysis, and electronic device
CN111368169B (en) * 2018-12-25 2023-08-01 卓望数码技术(深圳)有限公司 Method, device, equipment and storage medium for detecting brushing amount behavior
CN111368169A (en) * 2018-12-25 2020-07-03 卓望数码技术(深圳)有限公司 Method, device, equipment and storage medium for detecting brushing amount behavior
CN110189165B (en) * 2019-05-14 2021-07-23 微梦创科网络科技(中国)有限公司 Channel abnormal user and abnormal channel identification method and device
CN110189165A (en) * 2019-05-14 2019-08-30 微梦创科网络科技(中国)有限公司 Channel abnormal user and abnormal channel recognition methods and device
CN113383362A (en) * 2019-06-24 2021-09-10 深圳市欢太科技有限公司 User identification method and related product
CN113383362B (en) * 2019-06-24 2022-05-13 深圳市欢太科技有限公司 User identification method and related product
CN110378112A (en) * 2019-07-08 2019-10-25 北京达佳互联信息技术有限公司 A kind of user identification method and device
CN111639681A (en) * 2020-05-09 2020-09-08 同济大学 Early warning method, system, medium and device based on education drive type fraud

Also Published As

Publication number Publication date
CN106294508B (en) 2020-02-11

Similar Documents

Publication Publication Date Title
CN106294508A (en) A kind of brush amount tool detection method and device
CN107579956B (en) User behavior detection method and device
CN102724219B (en) A network data computer processing method and a system thereof
CN106560811A (en) Direct broadcasting room recommending method and system based on broadcaster style
CN107368856B (en) Malicious software clustering method and device, computer device and readable storage medium
CN104836781A (en) Method distinguishing identities of access users, and device
CN104391979A (en) Malicious web crawler recognition method and device
CN108038130A (en) Automatic cleaning method, device, equipment and the storage medium of fictitious users
CN107222511B (en) Malicious software detection method and device, computer device and readable storage medium
CN108243191A (en) Risk behavior recognition methods, storage medium, equipment and system
CN110209551B (en) Abnormal equipment identification method and device, electronic equipment and storage medium
CN106375975A (en) Policy conflict detection method and apparatus
CN110516752A (en) Clustering cluster method for evaluating quality, device, equipment and storage medium
CN106936778A (en) The abnormal detection method of website traffic and device
CN112488716A (en) Abnormal event detection system
CN108171570A (en) A kind of data screening method, apparatus and terminal
CN110222790A (en) Method for identifying ID, device and server
CN107948257B (en) The method for pushing and computer readable storage medium of APP
CN113609409A (en) Method and system for recommending browsing information, electronic device and storage medium
EP2621156B1 (en) Subscriber portfolio management system
CN107623696A (en) A kind of user ID authentication method and device based on user behavior feature
CN105989019B (en) A kind of method and device for cleaning data
CN104767739B (en) The method that unknown multi-protocols blended data frame is separated into single protocol data frame
KR101568800B1 (en) Real-time issue search word sorting method and system
CN108259363A (en) A kind of method and device of staged service traffics control

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant