CN111340344B - Type determination method and device - Google Patents

Type determination method and device Download PDF

Info

Publication number
CN111340344B
CN111340344B CN202010104659.7A CN202010104659A CN111340344B CN 111340344 B CN111340344 B CN 111340344B CN 202010104659 A CN202010104659 A CN 202010104659A CN 111340344 B CN111340344 B CN 111340344B
Authority
CN
China
Prior art keywords
application program
user group
application
type
distribution ratio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010104659.7A
Other languages
Chinese (zh)
Other versions
CN111340344A (en
Inventor
付子圣
巩金慧
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AlipayCom Co ltd
Original Assignee
Alipay Hangzhou Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alipay Hangzhou Information Technology Co Ltd filed Critical Alipay Hangzhou Information Technology Co Ltd
Priority to CN202010104659.7A priority Critical patent/CN111340344B/en
Publication of CN111340344A publication Critical patent/CN111340344A/en
Application granted granted Critical
Publication of CN111340344B publication Critical patent/CN111340344B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/02Banking, e.g. interest calculation or account maintenance

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Educational Administration (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Operations Research (AREA)
  • Technology Law (AREA)
  • Data Mining & Analysis (AREA)
  • Stored Programmes (AREA)

Abstract

The embodiment of the specification provides a type determination method and a type determination device, wherein the method comprises the steps of obtaining a first application program set comprising a first application program of a first user group and a second application program set comprising a second application program of a second user group; determining a first distribution ratio of the first application program in the first user group and a second distribution ratio of the first application program in the second user group; calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program; determining the type of the first user group based on the difference value of the first application program and a preset type mapping table; the type of the first user group can be conveniently and accurately determined by using the acquired distribution difference of the application program in different user groups and the mapping relation between the application program and the type in the preset type mapping table.

Description

Type determination method and device
Technical Field
The embodiment of the specification relates to the technical field of data processing, in particular to a type determining method. One or more embodiments of the present specification also relate to a type determination apparatus, a computing device, and a computer-readable storage medium.
Background
With the rise of internet finance, more and more illegal users perform some risk behaviors by using the operable diversity of internet finance, and adaptive penalties are different for different types of user groups performing the risk behaviors.
Based on this, there is a need to provide a more efficient type determination scheme for a user population.
Disclosure of Invention
In view of this, the embodiments of the present specification provide a type determination method. One or more embodiments of the present specification are also directed to a type determining apparatus, a computing device, and a computer-readable storage medium, which solve the technical problems of the prior art.
According to a first aspect of embodiments herein, there is provided a type determination method including:
acquiring a first application program set comprising a first application program of a first user group and a second application program set comprising a second application program of a second user group;
calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group;
determining a second distribution ratio of the first application program in the second user group based on the second distribution ratio of the second application program in the second user group and the incidence relation between the first application program and the second application program;
calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program;
determining the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
According to a second aspect of embodiments of the present specification, there is provided a type determination apparatus including:
the application program acquisition device is configured to acquire a first application program set including a first application program of a first user group and a second application program set including a second application program of a second user group;
a computing device configured to compute a first distribution ratio of the first application to the first group of users and a second distribution ratio of the second application to the second group of users;
a distribution ratio determining device configured to determine a second distribution ratio of the first application program in the second user group based on a second distribution ratio of the second application program in the second user group and an association relationship between the first application program and the second application program;
a difference value obtaining device configured to calculate the first distribution ratio and the second distribution ratio of the first application program through a preset algorithm to obtain a difference value of the first application program;
the type determining device is configured to determine the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
According to a third aspect of embodiments herein, there is provided a computing device comprising:
a memory and a processor; the memory is to store computer-executable instructions, and the processor is to execute the computer-executable instructions to:
acquiring a first application program set comprising a first application program of a first user group and a second application program set comprising a second application program of a second user group;
calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group;
determining a second distribution ratio of the first application program in the second user group based on the second distribution ratio of the second application program in the second user group and the incidence relation between the first application program and the second application program;
calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program;
determining the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
According to a fourth aspect of embodiments herein, there is provided a computer-readable storage medium storing computer-executable instructions that, when executed by a processor, implement the steps of the type determination method.
One embodiment of the specification realizes a type determination method and a type determination device, wherein the method comprises the steps of obtaining a first application program set including a first application program of a first user group and a second application program set including a second application program of a second user group; calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group; determining a second distribution ratio of the first application program in the second user group based on the second distribution ratio of the second application program in the second user group and the incidence relation between the first application program and the second application program; calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program; determining the type of the first user group based on the difference value of the first application program and a preset type mapping table;
the type determining method can conveniently and accurately determine the type of the first user group by utilizing the acquired distribution difference of the application program in different user groups and the mapping relation between the application program and the type in the preset type mapping table.
Drawings
FIG. 1 is a flow chart of a type determination method provided by one embodiment of the present description;
fig. 2 is a first distribution ratio of a first application program in a first user group and a second distribution ratio of the first application program in a second user group in a type determination method provided in an embodiment of the present specification;
fig. 3 is a first distribution ratio of another first application in a first user group and a second distribution ratio in a second user group in a type determination method provided in an embodiment of the present specification;
fig. 4 is a schematic structural diagram of a type determining apparatus provided in an embodiment of the present specification;
fig. 5 is a block diagram of a computing device according to an embodiment of the present disclosure.
Detailed Description
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present description. This description may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein, as those skilled in the art will be able to make and use the present disclosure without departing from the spirit and scope of the present disclosure.
The terminology used in the description of the one or more embodiments is for the purpose of describing the particular embodiments only and is not intended to be limiting of the description of the one or more embodiments. As used in one or more embodiments of the present specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used in one or more embodiments of the present specification refers to and encompasses any and all possible combinations of one or more of the associated listed items.
It will be understood that, although the terms first, second, etc. may be used herein in one or more embodiments to describe various information, these information should not be limited by these terms. These terms are only used to distinguish one type of information from another. For example, a first can also be referred to as a second and, similarly, a second can also be referred to as a first without departing from the scope of one or more embodiments of the present description. The word "if" as used herein may be interpreted as "at … …" or "when … …" or "in response to a determination", depending on the context.
First, the noun terms to which one or more embodiments of the present specification relate are explained.
The application program comprises the following steps: a mobile phone application, APP.
Distribution: the abbreviation of probability distribution refers to a probability law for expressing the value of a random variable, and generally approximates probability by frequency.
Cross entropy: the method is an important concept in information theory and is mainly used for measuring the difference information between two probability distributions.
In the present specification, a type determination method is provided. One or more embodiments of the present specification relate to a type determining apparatus, a computing device, and a computer-readable storage medium, which are described in detail in the following embodiments one by one.
Referring to fig. 1, fig. 1 shows a flowchart of a type determination method provided according to an embodiment of the present specification, including steps 102 to 110.
Step 102: a first set of applications including a first application of a first user group and a second set of applications including a second application of a second user group are obtained.
The first user group consists of a plurality of users with the same characteristics, for example, the first user group consists of a plurality of users with frequent transfer and fund flow behaviors; the second group of users consists of a plurality of users having the same characteristics and the characteristics of the users in the second group of users are different from those of the users in the first group of users, e.g., the second group of users consists of users randomly drawn from a plurality of accounts that miss any risk policy within the last 30 days.
The first application set comprises but is not limited to at least two or more than three first applications, and the second application set comprises but is not limited to at least two or more than three second applications; for example, the first set of applications includes 95 applications 1, 23 applications 2, and 50 applications 3; the second set of applications includes 89 applications 1 and 35 applications 2.
Specifically, the first user group is a user group of a type to be determined, and the second user group is a user group of a type determined;
correspondingly, the acquiring a first application set including a first application program of the first user group and a second application set including a second application program of the second user group includes:
the method comprises the steps of obtaining a first application program set including a first application program of a user group of a type to be determined and a second application program set including a second application program of the user group of the type to be determined.
In practical applications, the user group of the type to be determined may be regarded as a user group with a certain security risk, and the user group of the type to be determined may be regarded as a secure user group without any risk, which is confirmed by some risk confirmation policies.
The first application program set is composed of a plurality of first application programs, the plurality of first application programs are obtained from a client of each user based on the authorization of each user in a user group of a type to be determined, for example, each user in the user group of the type to be determined reports the application program installed on the client to a security log system, and the first application program set of the user group of the type to be determined is obtained at the moment, and the first application program set of the user group of the type to be determined which is reported last time is a set of application programs installed on the client reported by each user in the user group of the type to be determined which is extracted from the security log system; such as game-type applications, financial-type applications, shopping-type applications, and the like.
The second application program set is also composed of a plurality of second application programs, and the plurality of second application programs are obtained from the client of each user based on the authorization of each user in the user group with the determined type, for example, each user in the user group with the determined type reports the application program installed on the client to the security log system, and the second application program set of the user group with the determined type is obtained at the moment, and is a set of application programs installed on the client reported by each user in the user group with the determined type reported last time is extracted from the security log system; such as game-type applications, financial-type applications, shopping-type applications, and the like.
In specific implementation, when a first application program set including first application programs of a first user group is obtained, the name of each first application program in the first application program set is also obtained at the same time; and when a second application program set including the second application program of the second user group is used, the name of each second application program in the second application program set is obtained at the same time.
Specifically, the first application program in the first application program set and the second application program in the second application program set may overlap, that is, the first application program may be the same as the second application program, and since each application program has a unique name, whether the specific first application program is the same as the second application program may be determined according to the name of the application program.
Step 104: and calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group.
Specifically, the first distribution ratio and the second distribution ratio both include an installation ratio and an uninstallation ratio;
correspondingly, the calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group comprises:
and calculating a first installation proportion and a non-installation proportion of the first application program in the first user group and a second installation proportion and a non-installation proportion of the second application program in the second user group.
In practical applications, a first installation proportion and a non-installation proportion of the first application program in the first user group and a second installation proportion and a non-installation proportion of the second application program in the second user group are calculated, and it can be understood that a first installation proportion and a non-installation proportion of each first application program in the first application program set in the first user group and a second installation proportion and a non-installation proportion of each second application program in the second application program set in the second user group are calculated.
For example, a first user group a includes 100 users, such as a1 user, a2 … user a100 user, etc., a second user group b includes 100 users, such as a b1 user, a b2 … user b100 user, etc., a first application set includes 95 applications 1, 23 applications 2, 50 applications 3, and a second application set includes 89 applications 1, 35 applications 2, 78 applications 4;
at this time, the calculated first installation proportion of each first application program in the first application program set in the first user group is: the first installation proportion of the application program 1 in the first user group a is 95%, and the first uninstallation proportion of the application program 1 in the first user group a is 5%; the first installation proportion of the application program 2 in the first user group a is 23%, and the first uninstallation proportion of the application program 2 in the first user group a is 77%; the first installation proportion of the application program 3 in the first user group a is 50%, and the first uninstallation proportion of the application program 3 in the first user group a is 50%;
the calculated second installation proportion and the calculated non-installation proportion of each second application program in the second application program set in the second user group are as follows: the second installation proportion of the application program 1 in the second user group b is 89%, and the second uninstallation proportion of the application program 1 in the second user group b is 11%; the second installation proportion of the application 2 in the second user group b is 35%, the second uninstallation proportion of the application 2 in the second user group b is 65%, the second installation proportion of the application 4 in the second user group b is 78%, and the second uninstallation proportion of the application 2 in the second user group b is 22%.
Step 106: and determining a second distribution ratio of the first application program in the second user group based on the second distribution ratio of the second application program in the second user group and the incidence relation of the first application program and the second application program.
Specifically, the determining, based on the second distribution ratio of the second application program in the second user group and the association relationship between the first application program and the second application program, that the first application program is in front of the second distribution ratio of the second user group further includes:
acquiring the name of the first application program and the name of the second application program;
comparing the name of the first application program with the name of the second application program to determine an association relationship between the first application program and the second application program, wherein the association relationship comprises that the first application program is the same as the second application program and the first application program is different from the second application program.
Taking the above as an example, the name of each first application program may be obtained when obtaining the first application set including the first application program of the first user group, and the name of each second application program may be obtained when obtaining the second application set including the second application program of the second user group; since the name of each application is unique, it can be determined whether the first application is the same as or different from the second application based on the name of the first application and the name of the second application.
For example, the number of the first applications is 3, which are application 1, application 2, and application 3; the number of the second application programs is 3, which are the application program 1, the application program 2, and the application program 4, at this time, it may be determined that the application program 1 of the first application program and the application program 1 of the second application program are the same application program, the application program 2 of the first application program and the application program 2 of the second application program are the same application program, and the application program 3 of the first application program and the application program 4 of the second application program are different application programs.
In implementation, the determining the second distribution ratio of the first application program in the second user group based on the second distribution ratio of the second application program in the second user group and the association relationship between the first application program and the second application program includes:
under the condition that the first application program is the same as the second application program, taking a second distribution proportion of the second application program in the second user group as a second distribution proportion of the corresponding first application program in the second user group;
setting a second distribution ratio of the first application to zero for the second group of users if the first application is different from the second application.
Specifically, if the first application program and the second application program are the same application program, the second distribution ratio of the second application program in the second user group may be used as the second distribution ratio of the corresponding first application program in the second user group, that is, the second installation ratio and the non-installation ratio of the second application program in the second user group are used as the second installation ratio and the non-installation ratio of the corresponding first application program in the second user group; if the first application program does not have the same second application program in the second application program set, that is, under the condition that the first application program is different from the second application program, setting the second distribution proportion of the first application program in the second user group to zero, that is, setting the second installation proportion and the non-installation proportion of the first application program in the second user group to zero.
Take the example that the first application set includes 95 applications 1, 23 applications 2 and 50 applications 3, and the second application set includes 89 applications 1, 35 applications 2 and 78 applications 4.
Since application 1 of the first application is the same as application 1 of the second application, the first application may be considered to be: the first installation proportion of the application program 1 in the first user group a is 95%, and the first uninstallation proportion of the application program 1 in the first user group a is 5%; and the first application: the second installation proportion of the application program 1 in the second user group b is 89%, and the second uninstallation proportion of the application program 1 in the second user group b is 11%;
since application 2 of the first application is the same as application 2 of the second application, the first application may be considered to be: the first installation proportion of the application program 2 in the first user group a is 23%, and the first uninstallation proportion of the application program 2 in the first user group a is 77%; and the first application: the second installation proportion of the application program 2 in the second user group b is 35%, and the second uninstallation proportion of the application program 2 in the second user group b is 65%;
furthermore, since there is no second application program identical to the second application program set, the application program 3 of the first application program may consider that: the first installation proportion of the application program 3 in the first user group a is 50%, and the first uninstallation proportion of the application program 3 in the first user group a is 50%; the second installation proportion of the application program 3 in the second user group b is 0, and the second uninstallation proportion of the application program 3 in the second user group b is 0; for example, the first user group a has a game application installed, and the second user group b has no game application installed, in this case, the enumeration is only 1/0, and the difference value of the game application can also be calculated by using the following preset algorithm formula, which is equivalent to that the difference value is infinite, and the game application is most specific.
Step 108: and calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program.
Specifically, after the distribution ratio of each first application in the first user group and the second user group is determined, the difference value of each first application can be calculated based on the first distribution ratio of each first application in the first user group and the second distribution ratio of each first application in the second user group.
In specific implementation, the preset algorithm is as follows:
Figure BDA0002388133020000121
wherein H (p, q) represents a disparity value, i represents a discrete distribution segment, p (i) and q (i) are distribution probability values on the ith segment, i represents installation or non-installation in practical application, p (i) can be understood as a first distribution proportion of the first application program in the first user group, and q (i) can be understood as a second distribution proportion of the first application program in the second user group.
Referring to fig. 2, 3, fig. 2 and 3 respectively show a first distribution ratio of two first applications in a first user group and a second distribution ratio in a second user group.
In fig. 2, the first application: the installation proportion of the application 1 in the first user group is 95%, the uninstallation proportion is 5%, the installation proportion of the application 1 in the second user group is 89%, and the uninstallation proportion is 11%.
In fig. 3, the first application: the installation proportion of the application 2 in the first user group is 99%, the uninstalled proportion is 1%, the installation proportion of the application 2 in the second user group is 17%, and the uninstalled proportion is 83%.
Based on the distribution ratios of the application program 1 in the first user group and the second user group in fig. 2, the difference value H of the application program 1 can be obtained by the above-mentioned preset algorithm, and the specific calculation process is as follows:
i-mounting, H1-0.95 log10(1/0.89)=0.048079494,
i is not mounted, H2 is 0.05 log10(1/0.11)=0.047930366,
H=H1+H2=0.096。
In specific implementation, the first distribution proportion and the second distribution proportion of each first application program are calculated by referring to the calculation process, and the difference value, namely the cross entropy, of each first application program is obtained.
Step 110: determining the type of the first user group based on the difference value of the first application program and a preset type mapping table.
The preset type mapping table comprises the name of an application program and a corresponding type.
Referring to table 1, table 1 is a preset type mapping table, including names of preset applications and corresponding types.
TABLE 1
Application program Type (B)
Application program a Type 1
Application b Type 2
Application program c Type 2
Application program n Type n
In practical application, since the type determining method may be applied to determine the types of risk groups, the types corresponding to the application programs in the preset type mapping table may be specific risk types, such as virtual coin types, running score money laundering types, and the like.
Specifically, after obtaining the difference value of the first application program, the method further includes:
and arranging the difference values of the first application programs according to a descending order, and determining the first application programs of which the difference values are greater than or equal to a preset difference threshold value.
By the method, the difference of the application program installation of the first user group and the second user group is abstracted into the problem of distribution difference measurement, and the specific application program is extracted, so that the type of the first user group can be more accurately determined based on the specific application program in the following process.
The preset difference threshold may be set according to actual requirements, for example, set to 0.3.
For example, the first application program comprises an application program a, an application program b and an application program c, the cross entropy of the application program a is 0.763, the cross entropy of the application program b is 0.865, and the cross entropy of the application program c is 0.901; at this time, the application programs a, b and c are arranged in a descending order according to the values of the cross entropy, the first application program with the value of the cross entropy being more than or equal to 0.3 is reserved, and the first application program with the value of the cross entropy being more than or equal to 0.3: application a, application b, and application c are specific applications.
In specific implementation, the determining the type of the first user group based on the difference value of the first application program and the preset type mapping table includes:
determining a type corresponding to the first application program with the difference value being greater than or equal to a preset difference threshold value based on a preset type mapping table;
and adding the difference values of the first application programs of the same type, and taking the type corresponding to the first application program with the maximum sum of the difference values as the type of the first user group.
Specifically, the type corresponding to each first application program is first found out through a preset type mapping table based on the name of the first application program, which is, for example, the type 1 corresponding to the application program a, the type 2 corresponding to the application program b, and the type 2 corresponding to the application program c.
Then, the cross entropies of the application b and the application c with the same type are added to obtain that the sum of the cross entropies corresponding to the type 2 is 0.865+ 0.901-1.766, and the sum of the cross entropies corresponding to the type 1 of the application a is 0.763 because the application a does not have other first applications with the same type, so that it can be determined that the sum of the cross entropies corresponding to the type 2 is the maximum, and then the type 2 is taken as the type of the first user group.
In practical application, for a conventional application program, the distribution difference between risk groups and random groups is small, and for a risk application program, the proportion of the random group installing the application program is small, so that the distribution difference of the application program with the risk in the risk groups and the random groups is large, therefore, the type determination method provided by the embodiment of the specification utilizes the risk groups and the application program installed by a normal group client to determine the types of the risk groups, utilizes the distribution difference of the application programs installed by the risk groups and the normal group to lock specific application programs, and can accurately and conveniently determine the risk types of the risk groups based on the incidence relation between the application programs and the types in a preset type mapping table; and the application program installation information of the user is not easy to hide and tamper, and the identification of the risk types of the risk groups is facilitated.
Corresponding to the above method embodiment, the present specification further provides an embodiment of a type determining apparatus, and fig. 4 shows a schematic structural diagram of a type determining apparatus provided in an embodiment of the present specification. As shown in fig. 4, the apparatus includes:
an application acquisition device 402 configured to acquire a first set of applications including a first application of a first user group and a second set of applications including a second application of a second user group;
a computing device 404 configured to compute a first distribution ratio of the first application to the first group of users and a second distribution ratio of the second application to the second group of users;
a distribution ratio determining device 406 configured to determine a second distribution ratio of the first application program in the second user group based on a second distribution ratio of the second application program in the second user group and an association relationship between the first application program and the second application program;
a difference value obtaining device 408 configured to calculate the first distribution ratio and the second distribution ratio of the first application program through a preset algorithm to obtain a difference value of the first application program;
a type determining device 410 configured to determine a type of the first user group based on the difference value of the first application and a preset type mapping table, wherein the preset type mapping table includes a name of the application and a corresponding type.
Optionally, the apparatus further includes:
a name acquisition means configured to acquire a name of the first application and a name of the second application;
a relationship determining device configured to compare the name of the first application with the name of the second application to determine an association relationship between the first application and the second application, wherein the association relationship includes that the first application is the same as the second application and the first application is different from the second application.
Optionally, the distribution ratio determining device 406 is further configured to:
under the condition that the first application program is the same as the second application program, taking a second distribution proportion of the second application program in the second user group as a second distribution proportion of the corresponding first application program in the second user group;
setting a second distribution ratio of the first application to zero for the second group of users if the first application is different from the second application.
Optionally, the first distribution ratio and the second distribution ratio both include an installation ratio and an uninstallation ratio;
accordingly, the computing device 404 is further configured to:
and calculating a first installation proportion and a non-installation proportion of the first application program in the first user group and a second installation proportion and a non-installation proportion of the second application program in the second user group.
Optionally, the distribution ratio determining device 406 is further configured to:
and taking the second installation proportion and the non-installation proportion of the second application program in the second user group as the corresponding second installation proportion and the non-installation proportion of the first application program in the second user group.
Optionally, the distribution ratio determining device 406 is further configured to:
and setting the second installation proportion and the non-installation proportion of the first application program in the second user group to be zero.
Optionally, the preset algorithm is as follows:
Figure BDA0002388133020000171
wherein i represents a discretely distributed segment, and p (i) and q (i) are distribution probability values on the ith segment, respectively.
Optionally, the apparatus further includes:
the specific application program determining module is configured to arrange the difference values of the first application programs in a descending order and determine the first application programs with the difference values larger than or equal to a preset difference threshold value.
Optionally, the type determining device 410 is further configured to:
determining a type corresponding to the first application program with the difference value being greater than or equal to a preset difference threshold value based on a preset type mapping table;
and adding the difference values of the first application programs of the same type, and taking the type corresponding to the first application program with the maximum sum of the difference values as the type of the first user group.
Optionally, the first user group is a user group of a type to be determined, and the second user group is a user group of a type determined;
accordingly, the application acquiring device 402 is further configured to:
the method comprises the steps of obtaining a first application program set comprising a first application program of a user group of a type to be determined and a second application program set comprising a second application program of the user group of the type to be determined.
The above is an illustrative scheme of one type of determination device of the present embodiment. It should be noted that the technical solution of the type determining apparatus and the technical solution of the type determining method belong to the same concept, and details that are not described in detail in the technical solution of the type determining apparatus can be referred to the description of the technical solution of the type determining method.
FIG. 5 illustrates a block diagram of a computing device 500 provided in accordance with one embodiment of the present description. The components of the computing device 500 include, but are not limited to, a memory 510 and a processor 520. Processor 520 is coupled to memory 510 via bus 530, and database 550 is used to store data.
Computing device 500 also includes access device 540, access device 540 enabling computing device 500 to communicate via one or more networks 560. Examples of such networks include the Public Switched Telephone Network (PSTN), a Local Area Network (LAN), a Wide Area Network (WAN), a Personal Area Network (PAN), or a combination of communication networks such as the internet. The access device 540 may include one or more of any type of network interface, e.g., a Network Interface Card (NIC), wired or wireless, such as an IEEE802.11 Wireless Local Area Network (WLAN) wireless interface, a worldwide interoperability for microwave access (Wi-MAX) interface, an ethernet interface, a Universal Serial Bus (USB) interface, a cellular network interface, a bluetooth interface, a Near Field Communication (NFC) interface, and so forth.
In one embodiment of the present description, the above-described components of computing device 500, as well as other components not shown in FIG. 5, may also be connected to each other, such as by a bus. It should be understood that the block diagram of the computing device architecture shown in FIG. 5 is for purposes of example only and is not limiting as to the scope of the present description. Those skilled in the art may add or replace other components as desired.
Computing device 500 may be any type of stationary or mobile computing device, including a mobile computer or mobile computing device (e.g., tablet, personal digital assistant, laptop, notebook, netbook, etc.), mobile phone (e.g., smartphone), wearable computing device (e.g., smartwatch, smartglasses, etc.), or other type of mobile device, or a stationary computing device such as a desktop computer or PC. Computing device 500 may also be a mobile or stationary server.
Wherein processor 520 is configured to execute the following computer-executable instructions:
acquiring a first application program set comprising a first application program of a first user group and a second application program set comprising a second application program of a second user group;
calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group;
determining a second distribution ratio of the first application program in the second user group based on the second distribution ratio of the second application program in the second user group and the incidence relation between the first application program and the second application program;
calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program;
determining the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
The above is an illustrative scheme of a computing device of the present embodiment. It should be noted that the technical solution of the computing device and the technical solution of the type determination method belong to the same concept, and details that are not described in detail in the technical solution of the computing device can be referred to the description of the technical solution of the type determination method.
An embodiment of the present specification also provides a computer readable storage medium storing computer instructions which, when executed by a processor, implement the steps of the type determination method.
The above is an illustrative scheme of a computer-readable storage medium of the present embodiment. It should be noted that the technical solution of the storage medium belongs to the same concept as the technical solution of the type determination method described above, and for details that are not described in detail in the technical solution of the storage medium, reference may be made to the description of the technical solution of the type determination method described above.
The foregoing description has been directed to specific embodiments of this disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims may be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
The computer instructions comprise computer program code which may be in the form of source code, object code, an executable file or some intermediate form, or the like. The computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, usb disk, removable hard disk, magnetic disk, optical disk, computer Memory, Read-Only Memory (ROM), Random Access Memory (RAM), electrical carrier wave signals, telecommunications signals, software distribution medium, and the like. It should be noted that the computer readable medium may contain content that is subject to appropriate increase or decrease as required by legislation and patent practice in jurisdictions, for example, in some jurisdictions, computer readable media does not include electrical carrier signals and telecommunications signals as is required by legislation and patent practice.
It should be noted that, for the sake of simplicity, the foregoing method embodiments are described as a series of acts, but those skilled in the art should understand that the present embodiment is not limited by the described acts, because some steps may be performed in other sequences or simultaneously according to the present embodiment. Further, those skilled in the art should also appreciate that the embodiments described in this specification are preferred embodiments and that acts and modules referred to are not necessarily required for an embodiment of the specification.
In the above embodiments, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
The preferred embodiments of the present specification disclosed above are intended only to aid in the description of the specification. Alternative embodiments are not exhaustive and do not limit the invention to the precise embodiments described. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the embodiments and the practical application, to thereby enable others skilled in the art to best understand and utilize the embodiments. The specification is limited only by the claims and their full scope and equivalents.

Claims (15)

1. A type determination method, comprising:
acquiring a first application program set comprising a first application program of a first user group and a second application program set comprising a second application program of a second user group;
calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group;
determining a second distribution ratio of the first application program in the second user group based on a second distribution ratio of the second application program in the second user group and an association relationship between the first application program and the second application program, wherein the association relationship comprises that the first application program is the same as the second application program and the first application program is different from the second application program;
calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program;
determining the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
2. The type determination method of claim 1, the determining the first application before the second distribution ratio of the second group of users based on the second distribution ratio of the second application among the second group of users and the association relationship of the first application with the second application, further comprising:
acquiring the name of the first application program and the name of the second application program;
comparing the name of the first application program with the name of the second application program to determine the incidence relation between the first application program and the second application program.
3. The type determination method of claim 2, wherein the determining a second distribution ratio of the first application in the second user group based on a second distribution ratio of the second application in the second user group and an association relationship of the first application with the second application comprises:
under the condition that the first application program is the same as the second application program, taking a second distribution proportion of the second application program in the second user group as a second distribution proportion of the corresponding first application program in the second user group;
setting a second distribution ratio of the first application to zero for the second group of users if the first application is different from the second application.
4. The type determination method according to claim 1 or 3, the first distribution ratio and the second distribution ratio each including an installation ratio and an uninstallation ratio;
correspondingly, the calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group comprises:
and calculating a first installation proportion and a non-installation proportion of the first application program in the first user group and a second installation proportion and a non-installation proportion of the second application program in the second user group.
5. The type determination method of claim 4, the taking a second distribution ratio of the second application across the second group of users as a second distribution ratio of the corresponding first application across the second group of users comprising:
and taking the second installation proportion and the non-installation proportion of the second application program in the second user group as the corresponding second installation proportion and the non-installation proportion of the first application program in the second user group.
6. The type determination method of claim 5, the setting a second distribution proportion of the first application to zero for the second group of users comprising:
and setting the second installation proportion and the non-installation proportion of the first application program in the second user group to be zero.
7. The type determination method according to claim 1, the preset algorithm is as follows:
Figure FDA0003500429570000031
wherein i represents a discretely distributed segment, and p (i) and q (i) are distribution probability values on the ith segment, respectively.
8. The type determination method according to claim 1, further comprising, after obtaining the disparity value of the first application program:
and arranging the difference values of the first application programs according to a descending order, and determining the first application programs of which the difference values are greater than or equal to a preset difference threshold value.
9. The type determination method of claim 8, the determining the type of the first user group based on the disparity value of the first application and a preset type mapping table comprising:
determining a type corresponding to the first application program with the difference value being greater than or equal to a preset difference threshold value based on a preset type mapping table;
and adding the difference values of the first application programs of the same type, and taking the type corresponding to the first application program with the maximum sum of the difference values as the type of the first user group.
10. The type determination method according to claim 1, wherein the first user group is a user group of a type to be determined, and the second user group is a user group of a type to be determined;
correspondingly, the acquiring a first application set including a first application program of the first user group and a second application set including a second application program of the second user group includes:
the method comprises the steps of obtaining a first application program set comprising a first application program of a user group of a type to be determined and a second application program set comprising a second application program of the user group of the type to be determined.
11. A type determination apparatus comprising:
the application program acquisition device is configured to acquire a first application program set including a first application program of a first user group and a second application program set including a second application program of a second user group;
a computing device configured to compute a first distribution ratio of the first application to the first group of users and a second distribution ratio of the second application to the second group of users;
the distribution proportion determining device is configured to determine a second distribution proportion of the first application program in the second user group based on a second distribution proportion of the second application program in the second user group and an association relationship between the first application program and the second application program, wherein the association relationship comprises that the first application program is the same as the second application program and the first application program is different from the second application program;
a difference value obtaining device configured to calculate the first distribution ratio and the second distribution ratio of the first application program through a preset algorithm to obtain a difference value of the first application program;
the type determining device is configured to determine the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
12. The type determination device of claim 11, further comprising:
a name acquisition means configured to acquire a name of the first application and a name of the second application;
a relationship determination device configured to compare the name of the first application with the name of the second application to determine an association relationship of the first application with the second application.
13. The type determination device of claim 12, the distribution ratio determination device, further configured to:
under the condition that the first application program is the same as the second application program, taking a second distribution proportion of the second application program in the second user group as a second distribution proportion of the corresponding first application program in the second user group;
setting a second distribution ratio of the first application to zero for the second group of users if the first application is different from the second application.
14. A computing device, comprising:
a memory and a processor;
the memory is to store computer-executable instructions, and the processor is to execute the computer-executable instructions to:
acquiring a first application program set comprising a first application program of a first user group and a second application program set comprising a second application program of a second user group;
calculating a first distribution ratio of the first application program in the first user group and a second distribution ratio of the second application program in the second user group;
determining a second distribution ratio of the first application program in the second user group based on a second distribution ratio of the second application program in the second user group and an association relationship between the first application program and the second application program, wherein the association relationship comprises that the first application program is the same as the second application program and the first application program is different from the second application program;
calculating the first distribution proportion and the second distribution proportion of the first application program through a preset algorithm to obtain a difference value of the first application program;
determining the type of the first user group based on the difference value of the first application program and a preset type mapping table, wherein the preset type mapping table comprises the name of the application program and the corresponding type.
15. A computer readable storage medium storing computer instructions which, when executed by a processor, carry out the steps of the type determination method of any one of claims 1 to 10.
CN202010104659.7A 2020-02-20 2020-02-20 Type determination method and device Active CN111340344B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010104659.7A CN111340344B (en) 2020-02-20 2020-02-20 Type determination method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010104659.7A CN111340344B (en) 2020-02-20 2020-02-20 Type determination method and device

Publications (2)

Publication Number Publication Date
CN111340344A CN111340344A (en) 2020-06-26
CN111340344B true CN111340344B (en) 2022-04-15

Family

ID=71185488

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010104659.7A Active CN111340344B (en) 2020-02-20 2020-02-20 Type determination method and device

Country Status (1)

Country Link
CN (1) CN111340344B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1154667A2 (en) * 2000-05-11 2001-11-14 Lucent Technologies Inc. Method and apparatus for multi-user resource management in wireless communication systems
CN104899215A (en) * 2014-03-06 2015-09-09 北京搜狗科技发展有限公司 Data processing method, recommendation source information organization, information recommendation method and information recommendation device
CN107451255A (en) * 2017-07-31 2017-12-08 陕西识代运筹信息科技股份有限公司 A kind of user interest treating method and apparatus based on concern relation
CN108898218A (en) * 2018-05-24 2018-11-27 阿里巴巴集团控股有限公司 A kind of training method of neural network model, device and computer equipment
CN110580171A (en) * 2019-09-17 2019-12-17 RealMe重庆移动通信有限公司 APP classification method, related device and product

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9075665B2 (en) * 2010-06-29 2015-07-07 International Business Machines Corporation Smoothing peak system load via behavior prediction in collaborative systems with temporal data access patterns

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1154667A2 (en) * 2000-05-11 2001-11-14 Lucent Technologies Inc. Method and apparatus for multi-user resource management in wireless communication systems
CN104899215A (en) * 2014-03-06 2015-09-09 北京搜狗科技发展有限公司 Data processing method, recommendation source information organization, information recommendation method and information recommendation device
CN107451255A (en) * 2017-07-31 2017-12-08 陕西识代运筹信息科技股份有限公司 A kind of user interest treating method and apparatus based on concern relation
CN108898218A (en) * 2018-05-24 2018-11-27 阿里巴巴集团控股有限公司 A kind of training method of neural network model, device and computer equipment
CN110580171A (en) * 2019-09-17 2019-12-17 RealMe重庆移动通信有限公司 APP classification method, related device and product

Also Published As

Publication number Publication date
CN111340344A (en) 2020-06-26

Similar Documents

Publication Publication Date Title
US20230066545A1 (en) Training User-Level Differentially Private Machine-Learned Models
Bickel et al. A nonparametric view of network models and Newman–Girvan and other modularities
CN107220845B (en) User re-purchase probability prediction/user quality determination method and device and electronic equipment
AU2019101565A4 (en) User data sharing method and device
CN110827924B (en) Clustering method and device for gene expression data, computer equipment and storage medium
CN112651619A (en) Business-oriented wind control method and device
CN110929836B (en) Neural network training and image processing method and device, electronic equipment and medium
CN111428217A (en) Method and device for identifying cheat group, electronic equipment and computer readable storage medium
CN116383753B (en) Abnormal behavior prompting method, device, equipment and medium based on Internet of things
CN110717758B (en) Abnormal transaction identification method and device
CN110689136A (en) Deep learning model obtaining method, device, equipment and storage medium
CN111340344B (en) Type determination method and device
CN107871213B (en) Transaction behavior evaluation method, device, server and storage medium
CN109241249B (en) Method and device for determining burst problem
CN113891323B (en) WiFi-based user tag acquisition system
WO2020093817A1 (en) Identity verification method and device
CN113010912B (en) Desensitization method and apparatus
US11227231B2 (en) Computational efficiency in symbolic sequence analytics using random sequence embeddings
CN112905457B (en) Software testing method and device
CN114611149A (en) Data processing method and device
CN113312166A (en) Resource processing method and device
CN111552846A (en) Method and device for identifying suspicious relationship
Wang An adaptive variational mode decomposition technique with differential evolution algorithm and its application analysis
CN116755805B (en) Resource optimization method and device applied to C++, and resource optimization device applied to C++
CN117150321B (en) Equipment trust evaluation method and device, service equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20230113

Address after: 201208 Floor 15, No. 447, Nanquan North Road, Pudong New Area, Shanghai

Patentee after: Alipay.com Co.,Ltd.

Address before: 801-11, Section B, 8th floor, 556 Xixi Road, Xihu District, Hangzhou City, Zhejiang Province, 310013

Patentee before: Alipay (Hangzhou) Information Technology Co.,Ltd.

TR01 Transfer of patent right