CN112686718B - Method and device for acquiring user loss reason, computer equipment and storage medium - Google Patents

Method and device for acquiring user loss reason, computer equipment and storage medium Download PDF

Info

Publication number
CN112686718B
CN112686718B CN202110293420.3A CN202110293420A CN112686718B CN 112686718 B CN112686718 B CN 112686718B CN 202110293420 A CN202110293420 A CN 202110293420A CN 112686718 B CN112686718 B CN 112686718B
Authority
CN
China
Prior art keywords
user
loss
value
churn
characteristic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110293420.3A
Other languages
Chinese (zh)
Other versions
CN112686718A (en
Inventor
杨健颖
何悦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Suoxinda Data Technology Co ltd
Original Assignee
Shenzhen Suoxinda Data Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Suoxinda Data Technology Co ltd filed Critical Shenzhen Suoxinda Data Technology Co ltd
Priority to CN202110293420.3A priority Critical patent/CN112686718B/en
Publication of CN112686718A publication Critical patent/CN112686718A/en
Application granted granted Critical
Publication of CN112686718B publication Critical patent/CN112686718B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The application relates to a method and a device for acquiring a user loss reason, computer equipment and a storage medium. The method comprises the following steps: acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user; calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn; calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features; and acquiring the user loss reason according to the loss value ratio of each user characteristic. By adopting the method, the loss reason of the user can be accurately acquired.

Description

Method and device for acquiring user loss reason, computer equipment and storage medium
Technical Field
The present application relates to the field of computer application technologies, and in particular, to a method and an apparatus for obtaining a cause of user churn, a computer device, and a storage medium.
Background
With the development of science and technology, more and more application programs appear in order to meet the requirements of people on work and life. For developers, it is generally desirable that the developed application has high adhesion with users. After an application is online, a user is often lost. Therefore, the early warning analysis of user loss becomes an important ring in the user relationship management; the early warning analysis of the user loss firstly needs to acquire the reason of the user loss.
However, in the conventional technology, machine learning algorithms used in the user loss early warning field generally have a problem of poor interpretability, that is, the loss early warning model generally only can give a result of whether a user will be lost, and cannot accurately obtain the loss reason of each user, which increases difficulty for subsequent user saving. Therefore, how to accurately obtain the loss reason of the user becomes an urgent problem to be solved.
Disclosure of Invention
In view of the foregoing, it is desirable to provide a method, an apparatus, a computer device, and a storage medium for obtaining a user churn cause, which can accurately obtain a churn cause of a user.
A method for acquiring a user churn reason, the method comprising:
acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and acquiring the user loss reason according to the loss value ratio of each user characteristic.
In one embodiment, the method further comprises the following steps: inputting target user data into a pre-trained lost user judgment model, and outputting lost user data;
the loss user judgment model is obtained by training according to sample user data and a loss label of a sample user; the sample user data comprises not less than one user characteristic of a sample user; the attrition label is a parameter used to characterize whether the sample user is an attrition user.
In one embodiment, the churn user data is substituted into a preset churn value calculation formula to obtain a churn value of each user characteristic;
the loss value calculation formula is as follows:
Figure 187359DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 36233DEST_PATH_IMAGE002
indicating the removal of the ith bit from the F setThe feature set left after characterization, S represents a subset of F,
Figure 411851DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to the feature subset S,
Figure 19418DEST_PATH_IMAGE004
an attrition user judgment model trained based on the user feature subset S is represented.
In one embodiment, replacing the data less than 0 in the churn value of each user characteristic with 0, and calculating the sum of the churn values of each user characteristic as a total churn value;
and calculating the ratio of the loss value of each user characteristic to the total loss value, and taking the ratio as the loss value ratio of each user characteristic.
In one embodiment, the loss value ratios of the user characteristics are arranged in a descending order to generate a loss value sequence;
accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values;
when the accumulated value is larger than or equal to a preset accumulated threshold value, taking the user characteristic corresponding to the accumulated value as a user loss characteristic;
and acquiring the user loss reason according to the user loss characteristics.
In one embodiment, according to the user churn characteristics, obtaining a WOE code value corresponding to the user churn characteristics;
generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE coding value exceeds a preset WOE coding threshold value; the churn reason mapping table comprises mapping relations between user churn characteristics and user churn reasons.
In one embodiment, continuous user characteristics in the target user data are subjected to binning processing, and binning corresponding to each continuous user characteristic is obtained;
calculating the WOE code value of each sub-box and the WOE code value of discrete user characteristics in the target user data;
and setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table.
An apparatus for obtaining a reason for user churn, the apparatus comprising:
the first acquisition module is used for acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
the first calculation module is used for calculating the loss value of each user characteristic according to the loss user data through a preset loss value calculation formula; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
the second calculation module is used for calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and the second acquisition module is used for acquiring the user loss reason according to the loss value ratio of each user characteristic.
A computer device comprising a memory and a processor, the memory storing a computer program, the processor implementing the following steps when executing the computer program:
acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and acquiring the user loss reason according to the loss value ratio of each user characteristic.
A computer-readable storage medium, on which a computer program is stored which, when executed by a processor, carries out the steps of:
acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and acquiring the user loss reason according to the loss value ratio of each user characteristic.
According to the method, the device, the computer equipment and the storage medium for acquiring the user loss reason, the loss value of each user characteristic is calculated through a preset loss value calculation formula and loss user data; the attrition user data comprises at least one user characteristic of the attrition user; and calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic, and acquiring the loss reason of each user. Not only can the prediction on whether the user loses be realized, but also the accurate acquisition of the loss reason of the user is realized.
Drawings
FIG. 1 is a diagram of an exemplary embodiment of an application environment for a method for obtaining a cause of user churn;
fig. 2 is a flowchart illustrating a method for obtaining a cause of user churn in one embodiment;
FIG. 3 is a flowchart illustrating a step of obtaining a cause of user churn in one embodiment;
FIG. 4 is a block diagram of an apparatus for obtaining a cause of user churn in one embodiment;
FIG. 5 is a diagram illustrating an internal structure of a computer device according to an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
The method for acquiring the user churn reason can be applied to the application environment shown in fig. 1. Wherein the terminal 102 communicates with the server 104 via a network. The terminal 102 and the server 104 may be respectively and independently used to execute the method for acquiring the user churn reason provided by the present application. The terminal 102 and the server 104 may also be configured to cooperatively execute the method for acquiring the user churn reason provided by the present application. For example, server 104 is used to obtain attrition user data; the attrition user data comprises at least one user characteristic of the attrition user; calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn; calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features; and acquiring the user loss reason according to the loss value ratio of each user characteristic.
The terminal 102 may be, but is not limited to, a device including a data acquisition apparatus, and the server 104 may be implemented by an independent server or a server cluster formed by a plurality of servers.
In one embodiment, as shown in fig. 2, a method for extracting a video key frame is provided, which is described by taking the method as an example applied to the terminal in fig. 1, and includes the following steps:
step 202, acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user.
The lost user data is related data of lost users and comprises not less than one user characteristic; for example, including the user's occupation, age, gender, school calendar, length of the account, etc.
Specifically, firstly, the loss user data needs to be acquired, the acquisition mode of the loss user data is not specifically limited, and the loss user data can be acquired through statistical acquisition and model judgment; the attrition user data includes user characteristics of attrition users, including discrete user characteristics and/or continuous user characteristics. And finally acquiring the loss reason of the user by analyzing the user characteristics of the user.
Step 204, calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn.
Specifically, the churn value calculation formula is a formula for calculating a churn value of each user characteristic of the user according to churn user data; the churn value of a user characteristic is a parameter for characterizing the relationship between the user characteristic of a user as a churn cause and the result of user churn. For a user characteristic of a user, the larger the loss value of the user characteristic is, the stronger the user characteristic plays a role in user loss; the smaller the churn value of the user characteristic, the weaker the user characteristic will have on the user churn.
Step 206, calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values for each user characteristic.
Specifically, the churn value of each user feature can better reflect the role of each user feature on user churn in all user features. Therefore, it is necessary to obtain the loss value ratio of each user feature according to the loss value of each user feature and the total loss value obtained according to the sum of the loss values of all user features. In general, the higher the loss value of a user feature is, the greater the user feature plays a role in user loss; conversely, the lower the churn value duty of a user feature, the less the user feature will contribute to user churn.
And step 208, acquiring the loss reason of the user according to the loss value ratio of each user characteristic.
Specifically, the user characteristics with a large user loss effect can be screened out through the loss value ratio of each user characteristic; the user characteristics with a large user loss effect are analyzed, the relation between the user characteristics and the user loss reasons is searched, and the specific reason of the user loss can be found.
In the method for acquiring the user loss reason, the loss value of each user characteristic is calculated through a preset loss value calculation formula and loss user data; the attrition user data comprises at least one user characteristic of the attrition user; calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the total churn value is the sum of the churn values of the user features; finally, acquiring the loss reason of the user according to the loss value ratio of each user characteristic; the method can convert the user characteristics causing user loss into specific service language and accurately acquire the loss reason of the user.
In one embodiment, the obtaining attrition user data comprises:
inputting target user data into a pre-trained lost user judgment model, and outputting lost user data;
the loss user judgment model is obtained by training according to sample user data and a loss label of a sample user; the sample user data comprises not less than one user characteristic of a sample user; the attrition label is a parameter used to characterize whether the sample user is an attrition user.
Specifically, the target user data includes churned user data and non-churned user data, and the target user data includes user characteristics of churned users and non-churned users. The sample user data and the attrition labels of the sample users are training sets used for training the attrition user judgment model. The loss user judgment model is obtained by training based on sample user data and a loss label of a sample user, and the target user data can be screened and the loss user data can be screened out through the loss user judgment model, so that the loss user data can be analyzed and the loss reason of the loss user can be obtained.
In the embodiment, the loss user judgment model is obtained by training according to the sample user data and the loss label of the sample user as a training set, and the target user data is screened through the loss user judgment model, so that the loss user data can be rapidly screened out, and the efficiency of obtaining the loss reason of the user is improved.
In an embodiment, the calculating, by using a preset churn value calculation formula, a churn value of each user feature includes:
substituting the loss user data into a preset loss value calculation formula to obtain the loss value of each user characteristic;
the loss value calculation formula is as follows:
Figure 429671DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 231536DEST_PATH_IMAGE002
representing the set of features left after the ith feature is removed from the set of F, S represents a subset of F,
Figure 891188DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to the feature subset S,
Figure 505709DEST_PATH_IMAGE004
an attrition user judgment model trained based on the user feature subset S is represented.
Specifically, the churn value calculation formula is a calculation formula for calculating a churn value of each user characteristic; through the loss value calculation formula, the loss value of each user characteristic can be quickly obtained, so that the loss value ratio can be further obtained according to the loss value of the user. For a user characteristic of a user, the larger the loss value of the user characteristic is, the stronger the user characteristic plays a role in user loss; the smaller the churn value of the user characteristic, the weaker the user characteristic will have on the user churn.
In this embodiment, through the loss value calculation formula, the loss value corresponding to each user characteristic can be quickly obtained, and then the loss reason of the lost user can be quickly obtained according to the loss value of each user characteristic, so that the accuracy of obtaining the loss reason of the lost user is improved.
In an embodiment, the calculating the churn value ratio of each user characteristic according to the churn value of each user characteristic includes:
replacing data smaller than 0 in the loss values of the user characteristics with 0, and calculating the sum of the loss values of the user characteristics to serve as a total loss value;
and calculating the ratio of the loss value of each user characteristic to the total loss value, and taking the ratio as the loss value ratio of each user characteristic.
Specifically, after the churn value of each user characteristic is obtained, the churn value of some user characteristics may be a negative value. The sum of the user characteristics is acquired according to the loss value of each user characteristic to serve as a total loss value, and the loss value ratio of each user characteristic is further acquired according to the loss value and the total loss value of each user characteristic, so that 0 is adopted to replace the value smaller than 0 in the loss values of the user characteristics. And calculating the total loss value and the loss value ratio of each user characteristic according to the loss value of the replaced user characteristic.
The calculation formula of the loss value ratio is as follows:
Figure 504889DEST_PATH_IMAGE005
wherein the content of the first and second substances,
Figure 474725DEST_PATH_IMAGE006
an attrition value representing a jth user characteristic of an ith attrition user;
Figure 90514DEST_PATH_IMAGE007
to representThe loss value of the jth user characteristic of the ith lost user accounts for the ratio; n is the number of user features of the attrition users.
In this embodiment, the churn value of a user profile may be a positive value or a negative value, and a negative churn value indicates that the user profile does not cause churn. Therefore, in order to eliminate the influence of the user characteristics which do not cause user loss, a negative value in the loss value is replaced by 0 in the calculation process, and then a characteristic with a larger proportion is searched in a mode of calculating the proportion of the loss value from the characteristics with the positive value of the loss value, so that the important user characteristics are found.
In an embodiment, the obtaining the user churn reason according to the churn value ratio of each user feature includes:
performing descending order arrangement on the loss value ratios of the user characteristics to generate a loss value sequence;
accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values;
when the accumulated value is larger than or equal to a preset accumulated threshold value, taking the user characteristic corresponding to the accumulated value as a user loss characteristic;
and acquiring the user loss reason according to the user loss characteristics.
Specifically, among the user characteristics of each attrition user, only a small portion of the user characteristics are strongly correlated with the attrition user. User churn features that are not strongly correlated with churn users need to be removed. Therefore, after the loss values of all the user characteristics of the lost users are obtained, the loss value ratios of all the user characteristics are arranged in a descending order to generate a loss value sequence; and accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values. And obtaining the user characteristics with the top rank as the user loss characteristics according to the accumulated values and the accumulated threshold values, and obtaining the user loss reasons according to the user loss characteristics.
In this embodiment, some of the user characteristics with the positive loss values of the user characteristics are relatively large (for example, 5), some of the user characteristics are relatively small (for example, 0.005), the user characteristics with the relatively large loss values have a relatively large effect on the user loss, and similarly, the user characteristics with the small loss values have a relatively small effect on the user loss. Therefore, by calculating the churn value fraction, it is possible to find several user characteristics that have the greatest effect on user churn from all the user characteristics.
As shown in fig. 3, the obtaining the user churn reason according to the user churn feature includes:
step 302, obtaining a WOE code value corresponding to the user loss characteristic according to the user loss characteristic;
specifically, for continuous user churn features, binning processing is required to be performed, binning corresponding to each continuous user churn feature is obtained, and a WOE (Evidence Weight) coding value of each bin is calculated; for discrete user churn characteristics, the WOE code values of the discrete user churn characteristics can be calculated without binning.
Step 304, generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE coding value exceeds a preset WOE coding threshold value; the churn reason mapping table comprises mapping relations between user churn characteristics and user churn reasons.
Specifically, the WOE code value is obtained by the following formula:
Figure 305464DEST_PATH_IMAGE008
wherein the content of the first and second substances,
Figure 627992DEST_PATH_IMAGE009
is the WOE encoded value of the ith packet,
Figure 506081DEST_PATH_IMAGE010
is the number of attrition users in the current interval,
Figure 609166DEST_PATH_IMAGE011
is the number of users in the current interval that are not churned,
Figure 362227DEST_PATH_IMAGE012
indicating the number of attrition users in all samples under the user profile,
Figure 824082DEST_PATH_IMAGE013
indicating the number of users in all samples for which churn did not occur under this feature.
Acquiring a WOE (word on element) code value corresponding to user loss characteristics, judging whether the WOE code value corresponding to the user loss characteristics exceeds a preset WOE code threshold value, and if the WOE code value exceeds the preset WOE code threshold value, generating corresponding user loss reasons through a loss reason mapping table containing mapping relations between the user loss characteristics and the user loss reasons.
In the embodiment, a WOE code value corresponding to the user churn characteristic is generated according to the user churn characteristic; and generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE code value exceeds a preset WOE code threshold value. The method can convert the user characteristics causing user loss into specific service language and accurately acquire the loss reason of the user.
In an embodiment, the obtaining, according to the user churn feature, a WOE code value corresponding to the user churn feature previously further includes:
performing binning processing on continuous user characteristics in the target user data to obtain bins corresponding to the continuous user characteristics;
calculating the WOE code value of each sub-box and the WOE code value of discrete user characteristics in the target user data;
and setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table.
Specifically, the user features in the target user data include continuous user features and discrete user features, for the continuous user features, binning processing needs to be performed to obtain bins corresponding to each continuous user feature, and a WOE code value of each bin is calculated; for discrete user characteristics, the WOE code value of the discrete user characteristics is directly calculated through a formula. And finally, setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table. The reason for user churn is described in the business language.
For example, for a discrete user characteristic of "user account opening time", the relationship between the WOE interval, the WOE code value and the service language indicating the reason for user churn is shown in table 1:
TABLE 1
Figure 856760DEST_PATH_IMAGE014
The WOE interval in the table 1 is customized according to needs, and the WOE code value is calculated and obtained through an obtaining formula of the WOE code value; the value of the WOE coding threshold in table 1 is 0.1, and for a WOE interval in which the WOE coding value is greater than 0.1, a corresponding user loss reason described by a service language is set. As shown in table 1, when the account opening duration of the user is less than or equal to 188 days, the user churn reason of the churn user is a new account opening; when the account opening time of the user is more than 188 days and less than or equal to 714 days, the user churn reason of the churn user is that the account opening time is short.
In this embodiment, continuous user features in the target user data are subjected to binning processing to obtain bins corresponding to each continuous user feature, and a WOE code value of each bin and a WOE code value of discrete user features in the target user data are calculated. And finally, setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table. Therefore, the user loss reason can be acquired according to the user loss characteristic and the user loss reason mapping table.
It should be understood that although the various steps in the flow charts of fig. 2-3 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 2-3 may include multiple steps or multiple stages, which are not necessarily performed at the same time, but may be performed at different times, which are not necessarily performed in sequence, but may be performed in turn or alternately with other steps or at least some of the other steps.
In one embodiment, as shown in fig. 4, an apparatus for obtaining a cause of user churn is provided, including: a first obtaining module 401, a first calculating module 402, a second calculating module 403, and a second obtaining module 404, wherein:
a first obtaining module 401, configured to obtain churn user data; the attrition user data comprises at least one user characteristic of the attrition user.
A first calculating module 402, configured to calculate a churn value of each user feature according to the churn user data through a preset churn value calculation formula; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn.
A second calculating module 403, configured to calculate a ratio of loss values of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values for each user characteristic.
A second obtaining module 404, configured to obtain a user churn reason according to the churn value ratio of each user characteristic.
In an embodiment, the first obtaining module 401 is further configured to input target user data into a pre-trained attrition user determination model, and output attrition user data; the loss user judgment model is obtained by training according to sample user data and a loss label of a sample user; the sample user data comprises not less than one user characteristic of a sample user; the attrition label is a parameter used to characterize whether the sample user is an attrition user.
In an embodiment, the first calculating module 402 is further configured to substitute the churn user data into a preset churn value calculation formula to obtain a churn value of each user feature;
the loss value calculation formula is as follows:
Figure 696409DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 161151DEST_PATH_IMAGE002
representing the set of features left after the ith feature is removed from the set of F, S represents a subset of F,
Figure 989429DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to the feature subset S,
Figure 973435DEST_PATH_IMAGE004
an attrition user judgment model trained based on the user feature subset S is represented.
In one embodiment, the second calculating module 403 is further configured to replace data smaller than 0 in the churn value of each user feature with 0, and calculate a sum of the churn values of each user feature as a total churn value; and calculating the ratio of the loss value of each user characteristic to the total loss value, and taking the ratio as the loss value ratio of each user characteristic.
In an embodiment, the second obtaining module 404 is further configured to perform descending order arrangement on the attrition value ratios of the user features, so as to generate an attrition value sequence; accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values; when the accumulated value is larger than or equal to a preset accumulated threshold value, taking the user characteristic corresponding to the accumulated value as a user loss characteristic; and acquiring the user loss reason according to the user loss characteristics.
In an embodiment, the second obtaining module 404 is further configured to obtain, according to the user churn feature, a WOE code value corresponding to the user churn feature; generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE coding value exceeds a preset WOE coding threshold value; the churn reason mapping table comprises mapping relations between user churn characteristics and user churn reasons.
In an embodiment, the second obtaining module 404 is further configured to perform binning processing on consecutive user features in the target user data, and obtain bins corresponding to each consecutive user feature; calculating the WOE code value of each sub-box and the WOE code value of discrete user characteristics in the target user data; and setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table.
In the device for acquiring the user loss reason, the loss value of each user characteristic is calculated through a preset loss value calculation formula and loss user data; the attrition user data comprises at least one user characteristic of the attrition user; calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the total churn value is the sum of the churn values of the user features; finally, acquiring the loss reason of the user according to the loss value ratio of each user characteristic; the method can convert the user characteristics causing user loss into specific service language and accurately acquire the loss reason of the user.
For specific limitations of the apparatus for acquiring the user churn cause, reference may be made to the above limitations on the method for acquiring the user churn cause, which are not described herein again. All or part of the modules in the device for acquiring the user churn cause can be realized by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, which may be a server, the internal structure of which may be as shown in fig. 5. The computer device includes a processor, a memory, and a network interface connected by a system bus. Wherein the processor of the computer device is configured to provide computing and control capabilities. The memory of the computer device comprises a nonvolatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a computer program, and a database. The internal memory provides an environment for the operation of an operating system and computer programs in the non-volatile storage medium. The network interface of the computer device is used for communicating with an external terminal through a network connection. The computer program is executed by a processor to implement a method for acquiring a cause of user churn.
Those skilled in the art will appreciate that the architecture shown in fig. 5 is merely a block diagram of some of the structures associated with the disclosed aspects and is not intended to limit the computing devices to which the disclosed aspects apply, as particular computing devices may include more or less components than those shown, or may combine certain components, or have a different arrangement of components.
In one embodiment, a computer device is provided, comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the following steps when executing the computer program:
acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and acquiring the user loss reason according to the loss value ratio of each user characteristic.
In one embodiment, the processor, when executing the computer program, further performs the steps of: inputting target user data into a pre-trained lost user judgment model, and outputting lost user data; the loss user judgment model is obtained by training according to sample user data and a loss label of a sample user; the sample user data comprises not less than one user characteristic of a sample user; the attrition label is a parameter used to characterize whether the sample user is an attrition user.
In one embodiment, the processor, when executing the computer program, further performs the steps of: substituting the loss user data into a preset loss value calculation formula to obtain the loss value of each user characteristic;
the loss value calculation formula is as follows:
Figure 254375DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 113353DEST_PATH_IMAGE002
representing the set of features left after the ith feature is removed from the set of F, S represents a subset of F,
Figure 61717DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to the feature subset S,
Figure 13362DEST_PATH_IMAGE004
an attrition user judgment model trained based on the user feature subset S is represented.
In one embodiment, the processor, when executing the computer program, further performs the steps of: replacing data smaller than 0 in the loss values of the user characteristics with 0, and calculating the sum of the loss values of the user characteristics to serve as a total loss value; and calculating the ratio of the loss value of each user characteristic to the total loss value, and taking the ratio as the loss value ratio of each user characteristic.
In one embodiment, the processor, when executing the computer program, further performs the steps of: performing descending order arrangement on the loss value ratios of the user characteristics to generate a loss value sequence; accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values; when the accumulated value is larger than or equal to a preset accumulated threshold value, taking the user characteristic corresponding to the accumulated value as a user loss characteristic; and acquiring the user loss reason according to the user loss characteristics.
In one embodiment, the processor, when executing the computer program, further performs the steps of: acquiring a WOE (word on edge) coding value corresponding to the user loss characteristic according to the user loss characteristic; generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE coding value exceeds a preset WOE coding threshold value; the churn reason mapping table comprises mapping relations between user churn characteristics and user churn reasons.
In one embodiment, the processor, when executing the computer program, further performs the steps of: performing binning processing on continuous user characteristics in the target user data to obtain bins corresponding to the continuous user characteristics; calculating the WOE code value of each sub-box and the WOE code value of discrete user characteristics in the target user data; and setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table.
The computer equipment calculates the loss value of each user characteristic through a preset loss value calculation formula and loss user data; the attrition user data comprises at least one user characteristic of the attrition user; calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the total churn value is the sum of the churn values of the user features; finally, acquiring the loss reason of the user according to the loss value ratio of each user characteristic; the method can convert the user characteristics causing user loss into specific service language and accurately acquire the loss reason of the user.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of:
acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and acquiring the user loss reason according to the loss value ratio of each user characteristic.
In one embodiment, the computer program when executed by the processor further performs the steps of: inputting target user data into a pre-trained lost user judgment model, and outputting lost user data; the loss user judgment model is obtained by training according to sample user data and a loss label of a sample user; the sample user data comprises not less than one user characteristic of a sample user; the attrition label is a parameter used to characterize whether the sample user is an attrition user.
In one embodiment, the computer program when executed by the processor further performs the steps of: substituting the loss user data into a preset loss value calculation formula to obtain the loss value of each user characteristic;
the loss value calculation formula is as follows:
Figure 47177DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 181617DEST_PATH_IMAGE002
representing the set of features left after the ith feature is removed from the set of F, S represents a subset of F,
Figure 781226DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to the feature subset S,
Figure 310296DEST_PATH_IMAGE004
an attrition user judgment model trained based on the user feature subset S is represented.
In one embodiment, the computer program when executed by the processor further performs the steps of: replacing data smaller than 0 in the loss values of the user characteristics with 0, and calculating the sum of the loss values of the user characteristics to serve as a total loss value; and calculating the ratio of the loss value of each user characteristic to the total loss value, and taking the ratio as the loss value ratio of each user characteristic.
In one embodiment, the computer program when executed by the processor further performs the steps of: performing descending order arrangement on the loss value ratios of the user characteristics to generate a loss value sequence; accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values; when the accumulated value is larger than or equal to a preset accumulated threshold value, taking the user characteristic corresponding to the accumulated value as a user loss characteristic; and acquiring the user loss reason according to the user loss characteristics.
In one embodiment, the computer program when executed by the processor further performs the steps of: acquiring a WOE (word on edge) coding value corresponding to the user loss characteristic according to the user loss characteristic; generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE coding value exceeds a preset WOE coding threshold value; the churn reason mapping table comprises mapping relations between user churn characteristics and user churn reasons.
In one embodiment, the computer program when executed by the processor further performs the steps of: performing binning processing on continuous user characteristics in the target user data to obtain bins corresponding to the continuous user characteristics; calculating the WOE code value of each sub-box and the WOE code value of discrete user characteristics in the target user data; and setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table.
The storage medium calculates the loss value of each user characteristic through a preset loss value calculation formula and loss user data; the attrition user data comprises at least one user characteristic of the attrition user; calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the total churn value is the sum of the churn values of the user features; finally, acquiring the loss reason of the user according to the loss value ratio of each user characteristic; the method can convert the user characteristics causing user loss into specific service language and accurately acquire the loss reason of the user.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database or other medium used in the embodiments provided herein can include at least one of non-volatile and volatile memory. Non-volatile Memory may include Read-Only Memory (ROM), magnetic tape, floppy disk, flash Memory, optical storage, or the like. Volatile Memory can include Random Access Memory (RAM) or external cache Memory. By way of illustration and not limitation, RAM can take many forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM), among others.
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (9)

1. A method for acquiring a user churn reason is characterized by comprising the following steps:
acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
calculating the loss value of each user characteristic through a preset loss value calculation formula according to the loss user data; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn;
calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
acquiring a user loss reason according to the loss value ratio of each user characteristic;
the calculating the loss value of each user characteristic through a preset loss value calculation formula comprises:
substituting the loss user data into a preset loss value calculation formula to obtain the loss value of each user characteristic;
the loss value calculation formula is as follows:
Figure 832425DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 135230DEST_PATH_IMAGE002
representing the set of features left after the ith feature is removed from the set of F, S represents a subset of F,
Figure 164497DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to S,
Figure 312582DEST_PATH_IMAGE004
representing the union of S and the ith feature;
Figure 196224DEST_PATH_IMAGE005
representing an attrition user judgment model based on S training.
2. The method of claim 1, wherein obtaining attrition user data comprises:
inputting target user data into a pre-trained lost user judgment model, and outputting lost user data;
the loss user judgment model is obtained by training according to sample user data and a loss label of a sample user; the sample user data comprises not less than one user characteristic of a sample user; the attrition label is a parameter used to characterize whether the sample user is an attrition user.
3. The method of claim 1, wherein calculating the churn value ratio for each user profile based on the churn value for each user profile comprises:
replacing data smaller than 0 in the loss values of the user characteristics with 0, and calculating the sum of the loss values of the user characteristics to serve as a total loss value;
and calculating the ratio of the loss value of each user characteristic to the total loss value, and taking the ratio as the loss value ratio of each user characteristic.
4. The method according to claim 2, wherein the obtaining the user churn reason according to the churn value ratio of each user feature comprises:
performing descending order arrangement on the loss value ratios of the user characteristics to generate a loss value sequence;
accumulating the loss value sequence from the maximum value in sequence to obtain the accumulated value of the loss values;
when the accumulated value is larger than or equal to a preset accumulated threshold value, taking the user characteristic corresponding to the accumulated value as a user loss characteristic;
and acquiring the user loss reason according to the user loss characteristics.
5. The method according to claim 4, wherein the obtaining a user churn cause according to the user churn feature comprises:
acquiring a WOE (word on edge) coding value corresponding to the user loss characteristic according to the user loss characteristic;
generating a user loss reason according to a preset loss reason mapping table by using the user loss characteristics of which the WOE coding value exceeds a preset WOE coding threshold value; the churn reason mapping table comprises mapping relations between user churn characteristics and user churn reasons.
6. The method of claim 5, wherein obtaining the WOE code value corresponding to the user churn feature according to the user churn feature further comprises:
performing binning processing on continuous user characteristics in the target user data to obtain bins corresponding to the continuous user characteristics;
calculating the WOE code value of each sub-box and the WOE code value of discrete user characteristics in the target user data;
and setting a corresponding user loss reason for the user characteristics of which the WOE code value is greater than the preset WOE code threshold value, and generating a user loss reason mapping table.
7. An apparatus for obtaining a reason for user churn, the apparatus comprising:
the first acquisition module is used for acquiring lost user data; the attrition user data comprises at least one user characteristic of the attrition user;
the first calculation module is used for calculating the loss value of each user characteristic according to the loss user data through a preset loss value calculation formula; the churn value is a parameter used to quantify the relationship between the user characteristics and user churn; the calculating the loss value of each user characteristic through a preset loss value calculation formula comprises:
substituting the loss user data into a preset loss value calculation formula to obtain the loss value of each user characteristic;
the loss value calculation formula is as follows:
Figure 217401DEST_PATH_IMAGE001
wherein "! "represents factorial," | "represents the number of elements contained in a set, F represents a set of all user characteristics,
Figure 920914DEST_PATH_IMAGE002
representing the set of features left after the ith feature is removed from the set of F, S represents a subset of F,
Figure 420160DEST_PATH_IMAGE003
representing an attrition user judgment model trained after adding the ith feature to S,
Figure 423888DEST_PATH_IMAGE006
representing the union of S and the ith feature;
Figure 802917DEST_PATH_IMAGE005
representing an attrition user judgment model based on S training;
the second calculation module is used for calculating the loss value ratio of each user characteristic according to the loss value of each user characteristic; the loss value proportion is the proportion of the loss values of all user characteristics in the total loss value; the total churn value is the sum of the churn values of the user features;
and the second acquisition module is used for acquiring the user loss reason according to the loss value ratio of each user characteristic.
8. A computer device comprising a memory and a processor, the memory storing a computer program, characterized in that the processor, when executing the computer program, implements the steps of the method of any of claims 1 to 6.
9. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the method of any one of claims 1 to 6.
CN202110293420.3A 2021-03-19 2021-03-19 Method and device for acquiring user loss reason, computer equipment and storage medium Active CN112686718B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110293420.3A CN112686718B (en) 2021-03-19 2021-03-19 Method and device for acquiring user loss reason, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110293420.3A CN112686718B (en) 2021-03-19 2021-03-19 Method and device for acquiring user loss reason, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN112686718A CN112686718A (en) 2021-04-20
CN112686718B true CN112686718B (en) 2021-06-29

Family

ID=75455665

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110293420.3A Active CN112686718B (en) 2021-03-19 2021-03-19 Method and device for acquiring user loss reason, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112686718B (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105099731A (en) * 2014-04-23 2015-11-25 腾讯科技(深圳)有限公司 Method and system for finding churn factor for user churn of network application
CN108510096A (en) * 2017-02-24 2018-09-07 百度在线网络技术(北京)有限公司 Trade company's attrition prediction method, apparatus, equipment and storage medium
AU2019101198A4 (en) * 2019-10-03 2020-01-16 Gu, Jingchao MR A statistical analysis method of mobile telecom data driven user loss prediction
CN110852780A (en) * 2019-10-08 2020-02-28 百度在线网络技术(北京)有限公司 Data analysis method, device, equipment and computer storage medium
CN111652661A (en) * 2020-08-04 2020-09-11 索信达(北京)数据技术有限公司 Mobile phone client user loss early warning processing method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105335430B (en) * 2014-08-08 2019-09-17 腾讯科技(深圳)有限公司 A kind of customer churn determining method of path and device based on map scene

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105099731A (en) * 2014-04-23 2015-11-25 腾讯科技(深圳)有限公司 Method and system for finding churn factor for user churn of network application
CN108510096A (en) * 2017-02-24 2018-09-07 百度在线网络技术(北京)有限公司 Trade company's attrition prediction method, apparatus, equipment and storage medium
AU2019101198A4 (en) * 2019-10-03 2020-01-16 Gu, Jingchao MR A statistical analysis method of mobile telecom data driven user loss prediction
CN110852780A (en) * 2019-10-08 2020-02-28 百度在线网络技术(北京)有限公司 Data analysis method, device, equipment and computer storage medium
CN111652661A (en) * 2020-08-04 2020-09-11 索信达(北京)数据技术有限公司 Mobile phone client user loss early warning processing method

Also Published As

Publication number Publication date
CN112686718A (en) 2021-04-20

Similar Documents

Publication Publication Date Title
CN109598095B (en) Method and device for establishing scoring card model, computer equipment and storage medium
CN109829629B (en) Risk analysis report generation method, apparatus, computer device and storage medium
CN111222305B (en) Information structuring method and device
US20200065710A1 (en) Normalizing text attributes for machine learning models
CN108520041B (en) Industry classification method and system of text, computer equipment and storage medium
CN110263326B (en) User behavior prediction method, prediction device, storage medium and terminal equipment
CN113240510B (en) Abnormal user prediction method, device, equipment and storage medium
CN112395500A (en) Content data recommendation method and device, computer equipment and storage medium
CN110888911A (en) Sample data processing method and device, computer equipment and storage medium
CN111190946A (en) Report generation method and device, computer equipment and storage medium
CN113268403A (en) Time series analysis and prediction method, device, equipment and storage medium
CN111160959B (en) User click conversion prediction method and device
CN113254620B (en) Response method, device and equipment based on graph neural network and storage medium
CN110602207A (en) Method, device, server and storage medium for predicting push information based on off-network
CN111599382B (en) Voice analysis method, device, computer equipment and storage medium
CN114203200A (en) Voice quality inspection method and device, computer equipment and storage medium
CN112686718B (en) Method and device for acquiring user loss reason, computer equipment and storage medium
CN111104422A (en) Training method, device, equipment and storage medium of data recommendation model
US20230214451A1 (en) System and method for finding data enrichments for datasets
CN113946363A (en) Method and device for executing and configuring service data, computer equipment and storage medium
CN113438374B (en) Intelligent outbound call processing method, device, equipment and storage medium
CN111078984B (en) Network model issuing method, device, computer equipment and storage medium
CN114141235A (en) Voice corpus generation method and device, computer equipment and storage medium
CN113627514A (en) Data processing method and device of knowledge graph, electronic equipment and storage medium
CN111368044A (en) Intelligent question answering method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant