Disclosure of Invention
Therefore, the application provides a method for identifying the account numbers registered in batches, which is used for accurately and effectively identifying the account numbers registered in batches and realizing the effect of real-time interception, and the method comprises the following steps:
if the registered account number initiated by an IP exceeds a set value in a preset time period, performing first conversion on all account numbers initiated and registered by the IP in the time period: expressing each letter in each account number by a first character, and expressing each number by a second character; calculating a first proportion of the number of the same account numbers after the first conversion to the account number of the IP initiated registration in the period of time;
and performing second conversion on all accounts which are initiated and registered by the IP within the period of time: converting each letter in each account into a special symbol corresponding to the letter, wherein each number is represented by a second character; calculating a second proportion of the number of the same account numbers after the second conversion to the account number of the IP initiated registration in the period of time;
and if the first proportion exceeds a first preset value and the second proportion exceeds a second preset value, determining that batch registration currently exists.
Optionally, the method includes:
when the first proportion does not exceed a first preset value or the second proportion does not exceed a second preset value, judging whether account type characters corresponding to the account types exist in all accounts within the period of time;
if the judgment result is yes, performing third conversion on all the account numbers in the period of time: converting other characters except the account type character in each account into a third character; calculating a third proportion of the number of the same account numbers after the third conversion to the account number of the IP initiated registration in the period of time;
and performing fourth conversion on all the account numbers in the period of time: converting the account type characters in each account into fourth characters; converting each letter except the account number type character in each account into a special symbol corresponding to the letter, and converting the number except the account number type character in each account into a fifth character; calculating the fourth proportion of the number of the same account numbers after the fourth conversion to the account number of the IP initiated registration in the period of time;
and if the third proportion exceeds a third preset value and the fourth proportion exceeds a fourth preset value, determining that batch registration currently exists.
Optionally, the method includes:
when the third proportion does not exceed the third preset value or the fourth proportion does not exceed the fourth preset value, judging whether the second proportion exceeds the second preset value or not and whether the third proportion exceeds the third preset value or not;
and if the second proportion exceeds a second preset value and the third proportion exceeds a third preset value, determining that batch registration currently exists.
Optionally, the method includes:
if the second proportion does not exceed a second preset value, or the third proportion does not exceed a third preset value, performing fifth conversion on all the account numbers in the period of time:
converting the account type characters in each account into fourth characters; representing each letter except the account number type character in each account by using a first character; representing each number except the account number type character in each account by using a second character;
calculating a fifth proportion of the number of the same account numbers after the fifth conversion to the account number of the IP initiated registration in the period of time;
and if the third proportion exceeds a third preset value and the fifth proportion exceeds a fifth preset value, determining that batch registration currently exists.
The application also provides equipment for identifying the account numbers registered in batches, which comprises:
the conversion module is used for performing first conversion on all accounts which initiate registration by the IP in a preset time period when the registered account initiated by the IP exceeds the set value in the preset time period: expressing each letter in each account number by a first character, and expressing each number by a second character; and performing second conversion on all accounts which are initiated and registered by the IP within the period of time: converting each letter in each account into a special symbol corresponding to the letter, wherein each number is represented by a second character;
the calculation module is used for calculating the first proportion of the number of the same account numbers after the first conversion to the account number initiated and registered by the IP in the period of time; calculating a second proportion of the number of the same account numbers after the second conversion to the account number of the IP initiated registration in the period of time;
and the identification module is used for determining that batch registration exists currently when the first proportion exceeds a first preset value and the second proportion exceeds a second preset value.
Optionally, the apparatus further comprises:
the judging module is used for judging whether account type characters corresponding to the account types exist in all accounts within the period of time or not when the first proportion does not exceed a first preset value or the second proportion does not exceed a second preset value;
the conversion module is further configured to, when the determination result is yes, perform third conversion on all the account numbers in the period of time: converting other characters except the account type character in each account into a third character; and performing fourth conversion on all the account numbers in the period of time: converting the account type characters in each account into fourth characters; converting each letter except the account number type character in each account into a special symbol corresponding to the letter, and converting the number except the account number type character in each account into a fifth character;
the calculation module is further configured to calculate a third ratio of the number of the same account numbers after the third conversion to the account number of the IP initiated registration within the period of time; calculating the fourth proportion of the number of the same account numbers after the fourth conversion to the account number of the IP initiated registration in the period of time;
the identification module is further configured to determine that batch registration currently exists when the third ratio exceeds a third preset value and the fourth ratio exceeds a fourth preset value.
Optionally, the determining module is further configured to determine whether the second ratio exceeds a second preset value and whether the third ratio exceeds a third preset value when the third ratio does not exceed the third preset value or the fourth ratio does not exceed the fourth preset value;
and the identification module is used for determining that batch registration exists currently when the second proportion exceeds a second preset value and the third proportion exceeds a third preset value.
Optionally, the conversion module is further configured to, when the second ratio does not exceed a second preset value, or the third ratio does not exceed a third preset value, perform a fifth conversion on all the account numbers in the period of time: converting the account type characters in each account into fourth characters; representing each letter except the account number type character in each account by using a first character; representing each number except the account number type character in each account by using a second character;
the calculation module is configured to calculate a fifth proportion of the number of the same account numbers after the fifth conversion to the account number of the IP-initiated registration within the period of time;
and the identification module is used for determining that batch registration exists currently when the third proportion exceeds a third preset value and the fifth proportion exceeds a fifth preset value.
Compared with the prior art, the method and the device have the advantages that characters in the account are converted, the probability obtained through calculation after conversion is compared with the corresponding preset value, whether subsequent conversion and calculation are carried out really according to the comparison result, the batch registered account is identified rapidly, accurately and effectively through successive operation, consumption of system resources is reduced, and in addition, the batch registered account can be judged in real time, so that the effect of real-time interception can be achieved.
Detailed Description
For example, in the background art, several ways in the prior art cannot accurately and effectively identify the account numbers registered in batch, and therefore, the application provides a method for identifying the account numbers registered in batch, so as to accurately and effectively identify the account numbers registered in batch. Specifically, mainly aiming at the identification of whether the account of the foreign user is the batch registered account, as shown in fig. 1, the method includes:
step 101, if a registered account initiated by an IP exceeds a set value within a preset time period, performing a first conversion on all accounts initiated and registered by the IP within the time period: expressing each letter in each account number by a first character, and expressing each number by a second character; and performing second conversion on all accounts which are initiated and registered by the IP within the period of time: each letter in each account is converted to a special symbol corresponding to that letter, with each number represented by a second character.
The registered account may be a mailbox, e.g. 1qsds @. Other forms of account numbers are of course possible, such as wez02, and so forth. When the registered account of the mailbox class is converted, the @ and the character content after the @ are ignored, and only the content before the @ is converted according to the embodiment of the invention.
In addition, since batch registration is to be identified, that is, whether a person registers a plurality of accounts in a centralized manner within a certain period of time is determined, the embodiment acquires a registered account initiated by a certain IP within a preset period of time. Referring to the example of fig. 2, the example records IP address information for initiating account registration and registration account information within 1 minute. If the number of account registrations initiated by the same IP address exceeds a set value, for example, 20 accounts within 1 minute, it is suspected that batch registration may occur. It should be noted that 1 minute is only an example, and other preset times, such as 10 minutes, 1 hour, etc., may also be adopted and may be set according to different scenarios. The setting value for the number of account registrations may be set according to a specific scenario. And when the number of the registered account numbers corresponding to a certain IP is larger than a set value, starting an identification process, and if the number of the registered account numbers corresponding to the certain IP does not exceed the set value, not performing subsequent processing. For example, if there are only 1 piece of account registration data corresponding to IP2, it is not processed.
The subsequent processing performed here mainly includes conversion of account letters, numbers, and the like. Specifically, the first conversion in step 101 is to represent each letter in each account number by a first character, and each number by a second character. Taking the account number abcdf002 as an example, assuming that the first character is &andthe second character is x, the account number abcdf002 undergoes a first conversion of & & & & & & &. Note that, & is used as the first character, and the second character is merely an example, and actually, the first character and the second character may be other characters, such as%, # and the like. For the mailbox account number zaqazys1816@ yandex.ru, the result of & & & & & & & & & & &.
And the second conversion is to convert each letter in each account number into a special symbol corresponding to the letter, and each number is represented by a second character. Still taking account abcdf002 as an example for explanation, the letters may be converted based on the correspondence table to generate special symbols corresponding to the letters one by one. Table 1 gives the conversion between letters and their corresponding special symbols:
A
|
Zm01
|
H
|
Zm07
|
N
|
Zm13
|
T
|
Zm19
|
Z
|
Zm25
|
B
|
Zm02
|
I
|
Zm08
|
O
|
Zm14
|
U
|
Zm20
|
G
|
Zm26
|
C
|
Zm03
|
J
|
Zm09
|
P
|
Zm15
|
V
|
Zm21
|
|
|
D
|
Zm04
|
K
|
Zm10
|
Q
|
Zm16
|
W
|
Zm22
|
|
|
E
|
Zm05
|
L
|
Zm11
|
R
|
Zm17
|
X
|
Zm23
|
|
|
F
|
Zm06
|
M
|
Zm12
|
S
|
Zm18
|
Y
|
Zm24
|
|
|
the account abcdf002 undergoes the second conversion to obtain Zm01Zm02Zm03Zm04Zm 06. It should be noted that the special symbol corresponding to each letter, for example, Zm01 corresponding to a, indicates the name of the 1 st special symbol in the special symbols, and may be, for example, one or two or other special symbols, as long as it is ensured that there is a difference between the special symbols.
102, calculating a first proportion of the number of the same account numbers after the first conversion to the account number of the IP initiated registration in the period of time; and calculating a second proportion of the number of the same account numbers after the second conversion to the account number of the IP initiated registration in the period of time.
Taking the account numbers stored in the table in fig. 2 as an example, there are 10 account numbers corresponding to 193.169.80.37 IP. The letter parts before @ are all zaqazys, and the number parts are all four numbers. Therefore, the 10 account numbers are all subjected to the first conversion&&&&&&&****. So the same account number&&&&&&&The total number of 10 accounts for a first percentage of 100% of the account numbers registered under IP. In the second conversion, the 10 account numbers are converted into zm25zm01zm16zm01zm25zm24zm18****So the corresponding second proportion is 100%.
And 103, if the first proportion exceeds a first preset value and the second proportion exceeds a second preset value, determining that batch registration exists currently.
Specifically, for example, the account number is abcdf002, and when the first proportion exceeds 90% and the second proportion exceeds 80%, it is determined that the abcdf002 is registered in batch. Of course, the specific preset value can be set according to historical experience and required identification precision.
Specifically, when the batch registration cannot be judged in the above steps, the judgment is continued in other ways:
firstly, when the first proportion does not exceed a first preset value or the second proportion does not exceed a second preset value, judging whether account type characters corresponding to account types exist in all accounts within the period of time;
if the judgment result is yes, performing third conversion on all the account numbers in the period of time: converting other characters except the account type character in each account into a third character; calculating a third proportion of the number of the same account numbers after the third conversion to the account number of the IP initiated registration in the period of time;
and performing fourth conversion on all the account numbers in the period of time: converting the account type characters in each account into fourth characters; converting each letter except the account number type character in each account into a special symbol corresponding to the letter, and converting the number except the account number type character in each account into a fifth character; calculating the fourth proportion of the number of the same account numbers after the fourth conversion to the account number of the IP initiated registration in the period of time;
and if the third proportion exceeds a third preset value and the fourth proportion exceeds a fourth preset value, determining that batch registration currently exists.
The account type characters described above are used to characterize the account type. For example, the mailbox is an account type, and the characters representing the mailbox type account are as follows: com @ xxx.cn, or @ xxx.net, etc. Here, xxx may be a letter, may be a number, and may be other characters. Of course, besides the mailbox, there are some other accounts in which the account type character corresponding to the account exists. The account is illustrated as a mailbox, for example, the account is zaqazys1816@ yandex.ru as shown in fig. 2, wherein the account type character is @ yandex.ru, and the other part is zaqazys1816, and when the third conversion is performed, the zaqazys1816 is converted into the third character, for example, into a #. Thus, zaqazys1816@ yandex. And calculating a third proportion, namely calculating the proportion of the number of the # yandex.ru in the account which is initiated and registered by the IP in the period of time aiming at the account # @ yandex.ru.
As for the fourth conversion, the @ yandex.ru of zaqazys1816@ yandex.ru is converted into a fourth character, for example, is%, and the remaining letter part, zaqazys, is based on the correspondence in table 1: z (Zm25), a (Zm01), q (Zm16), y (Zm24), s (Zm18), convertible to Zm25Zm01Zm16Zm01Zm25Zm24Zm18, and the numeric portion 1816 is convertible to. For this purpose zaqazys1816@ yandex. ru was subjected to a fourth conversion to Zm25Zm01Zm16Zm01Zm25Zm24Zm 18. A fourth proportion is calculated for the account, i.e. the number of Zm25Zm01 Zm25Zm24Zm 18% is calculated as a proportion of the account numbers for which the IP originated the registration during the period of time.
For example, for zaqazys1816@ yandex. ru, the corresponding third proportion is 90% and exceeds the third preset value by 80%, and the corresponding fourth proportion is 85% and exceeds the fourth preset value by 79%, then it is determined that zaqazys1816@ yandex. ru is the batch registered account number
Secondly, when the third proportion does not exceed a third preset value or the fourth proportion does not exceed a fourth preset value, judging whether the second proportion exceeds a second preset value or not and whether the third proportion exceeds the third preset value or not;
and if the second proportion exceeds a second preset value and the third proportion exceeds a third preset value, determining that batch registration currently exists.
And finally, if the second proportion does not exceed a second preset value or the third proportion does not exceed a third preset value, performing fifth conversion on all the account numbers in the period of time:
converting the account type characters in each account into fourth characters; representing each letter except the account number type character in each account by using a first character; representing each number except the account number type character in each account by using a second character;
calculating a fifth proportion of the number of the same account numbers after the fifth conversion to the account number of the IP initiated registration in the period of time;
and if the third proportion exceeds a third preset value and the fifth proportion exceeds a fifth preset value, determining that batch registration currently exists.
Specifically, still taking the account number zaqazys1816@ yandex.ru as an example, the account number type character is @ yandex.ru, @ yandex.ru is converted into a fourth character, for example, a #, the remaining number part 1816 is converted into a second character, for example,% and the remaining letter part zaqazys is converted into a first character, for example, & & & & & & & & & &. The account number zaqazys1816@ yandex.ru is converted to & & & & & & & & & & & & & &%% ##. If the account number za1qa8z1ys6@ yandex.ru is converted, the conversion is & &% & & & & & & & & & & & & & & & & & & & & & & & & & & & & & & & & &. Taking account zaqazys1816@ yandex.ru as an example, the corresponding fifth proportion is the proportion of the number of & & & & & & & & & & & & & &%% # to the account number for which the IP initiation registration is performed in the period of time. As for the fifth proportion corresponding to za1qa8z1ys6@ yandex.ru, the number of & & & &% & & & & & & & & & & & & & & # is a proportion of the account number of the IP origination registration within the period of time.
For zaqazys1816@ yandex.ru, if the corresponding third proportion is 90% and exceeds the third preset value by 80%, and the corresponding fifth proportion is 87% and exceeds the fifth preset value by 80%, determining that the zaqazys1816@ yandex.ru is a batch registered account.
The embodiment of the present application further discloses a device for identifying batch registered accounts, as shown in fig. 3, including:
a conversion module 301, configured to, when a registered account initiated by an IP exceeds a set value within a preset time period, perform a first conversion on all accounts initiated by the IP within the time period: expressing each letter in each account number by a first character, and expressing each number by a second character; and performing second conversion on all accounts which are initiated and registered by the IP within the period of time: converting each letter in each account into a special symbol corresponding to the letter, wherein each number is represented by a second character;
a calculating module 302, configured to calculate a first ratio of the number of the same account numbers after the first conversion to the account number initiated by the IP registration within the period of time; calculating a second proportion of the number of the same account numbers after the second conversion to the account number of the IP initiated registration in the period of time;
the identifying module 303 is configured to determine that batch registration currently exists when the first ratio exceeds a first preset value and the second ratio exceeds a second preset value.
The apparatus, further comprising:
the judging module is used for judging whether account type characters corresponding to the account types exist in all accounts within the period of time or not when the first proportion does not exceed a first preset value or the second proportion does not exceed a second preset value;
the converting module 301 is further configured to, when the determination result is yes, perform third conversion on all the accounts in the period of time: converting other characters except the account type character in each account into a third character; and performing fourth conversion on all the account numbers in the period of time: converting the account type characters in each account into fourth characters; converting each letter except the account number type character in each account into a special symbol corresponding to the letter, and converting the number except the account number type character in each account into a fifth character;
the calculating module 302 is further configured to calculate a third ratio of the number of the same account numbers after the third conversion to the account number of the IP initiated registration in the period of time; calculating the fourth proportion of the number of the same account numbers after the fourth conversion to the account number of the IP initiated registration in the period of time;
the identifying module 303 is further configured to determine that batch registration currently exists when the third ratio exceeds a third preset value and the fourth ratio exceeds a fourth preset value.
Specifically, the judging module is further configured to judge whether the second ratio exceeds a second preset value and whether the third ratio exceeds a third preset value when the third ratio does not exceed the third preset value or the fourth ratio does not exceed the fourth preset value;
the identifying module 303 is configured to determine that batch registration currently exists when the second ratio exceeds a second preset value and the third ratio exceeds a third preset value.
Specifically, the conversion module 301 is further configured to, when the second ratio does not exceed the second preset value, or the third ratio does not exceed the third preset value, perform fifth conversion on all the account numbers in the period of time: converting the account type characters in each account into fourth characters; representing each letter except the account number type character in each account by using a first character; representing each number except the account number type character in each account by using a second character;
the calculation module is configured to calculate a fifth proportion of the number of the same account numbers after the fifth conversion to the account number of the IP-initiated registration within the period of time;
and the identification module is used for determining that batch registration exists currently when the third proportion exceeds a third preset value and the fifth proportion exceeds a fifth preset value.
According to the method and the device, characters in the account are converted, the probability obtained through calculation after conversion is compared with the corresponding preset value, whether subsequent conversion and calculation are carried out really or not is achieved according to the comparison result, rapid, accurate and effective identification of batch registered accounts is achieved through successive operation, consumption of system resources is reduced, and in addition, the batch registered accounts can be judged in real time, so that the effect of real-time interception can be achieved.
Through the above description of the embodiments, those skilled in the art will clearly understand that the present application can be implemented by hardware, and also by software plus a necessary general hardware platform. Based on such understanding, the technical solution of the present application may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a usb disk, a removable hard disk, etc.), and includes several instructions for enabling a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method according to the implementation scenarios of the present application.
Those skilled in the art will appreciate that the figures are merely schematic representations of one preferred implementation scenario and that the blocks or flow diagrams in the figures are not necessarily required to practice the present application.
Those skilled in the art will appreciate that the modules in the devices in the implementation scenario may be distributed in the devices in the implementation scenario according to the description of the implementation scenario, or may be located in one or more devices different from the present implementation scenario with corresponding changes. The modules of the implementation scenario may be combined into one module, or may be further split into a plurality of sub-modules.
The above application serial numbers are for description purposes only and do not represent the superiority or inferiority of the implementation scenarios.
The above disclosure is only a few specific implementation scenarios of the present application, but the present application is not limited thereto, and any variations that can be made by those skilled in the art are intended to fall within the scope of the present application.