CN112862602A - User request determining method, storage medium and electronic device - Google Patents

User request determining method, storage medium and electronic device Download PDF

Info

Publication number
CN112862602A
CN112862602A CN202110336662.6A CN202110336662A CN112862602A CN 112862602 A CN112862602 A CN 112862602A CN 202110336662 A CN202110336662 A CN 202110336662A CN 112862602 A CN112862602 A CN 112862602A
Authority
CN
China
Prior art keywords
user
inventory
incremental
liability
stock
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110336662.6A
Other languages
Chinese (zh)
Other versions
CN112862602B (en
Inventor
孙羡斐
黄若愚
董媛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Citic Bank Corp Ltd
Original Assignee
China Citic Bank Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Citic Bank Corp Ltd filed Critical China Citic Bank Corp Ltd
Priority to CN202110336662.6A priority Critical patent/CN112862602B/en
Publication of CN112862602A publication Critical patent/CN112862602A/en
Application granted granted Critical
Publication of CN112862602B publication Critical patent/CN112862602B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The embodiment of the invention provides a method for determining a user request, which is used for determining the security level of a user based on the determined common debt level score of the user and determining whether to pass the user request of the user based on the security level so as to avoid the condition that the data management system fails due to unreasonable passing of the user request.

Description

User request determining method, storage medium and electronic device
Technical Field
The present invention relates to the field of data management, and in particular, to a method for determining a user request, a storage medium, and an electronic device.
Background
With the ever-increasing volume of consumer credit markets, the opportunity to consume finance is self-evident, but currently the subject of various markets participates in which the greatest challenge is the risk of multiple commons. The same customer, facing more and more easy loan channels, is very vulnerable to the chain of funds and risk resistance when each institution gives credit. The financial institution's risk management capability determines the client's choice and trust, as well as how to perform risk pricing and client location, and even for the risks and challenges that may be encountered over long periods of time.
Therefore, based on the relevant information of the client, the machine learning algorithm is utilized to quantify the common debt degree of the client, which provides an important basis for how a bank reasonably trusts to maintain data security and normal operation of a trusting system. Currently, the evaluation of the common debt degree of the customers in financial industries such as banking industry and the like still stays in a more traditional mode, and generally only external data information of the customers (for example, loan information of the customers on other financial platforms) is combined, but the information of the customers is often lost in the process, the risk is used as a target for modeling, and finally the output result is discrete (0,1), so that the obtained result is not very accurate, and an inaccurate result can cause the data management system which grants the user request to malfunction or operate abnormally through the user request (for example, the loan request), so that a new more accurate and reasonable technical scheme is urgently needed to evaluate the common debt degree of the customers more accurately, so as to avoid the situation that the data management system malfunctions due to unreasonable user request.
Disclosure of Invention
The embodiment of the invention provides a method for determining a user request, a storage medium and an electronic device, which are used for at least solving the problem of data management system failure caused by unreasonable user request in the related art.
According to an embodiment of the present invention, there is provided a method for determining a user request, including:
when a user to be determined of the balance level is a stock user, obtaining a balance level score of the stock user based on user characteristics of the stock user and a stock balance model algorithm, wherein the stock user is a user after a preset performance period;
when a user to be determined with a common debt level is an incremental user, obtaining a common debt level score of the incremental user based on the user characteristics of the incremental user, the common debt level score evaluation result of an inventory user by the inventory common debt model algorithm and the incremental common debt model algorithm, wherein the common debt level score evaluation result of the inventory user by the inventory common debt model algorithm is used for adjusting the weight of the user characteristics of the incremental user, and the incremental user is a new user who has not passed through a preset expression period;
determining a security level of the user based on the determined liability level score of the user, and determining whether to pass the user request of the user based on the security level.
According to a further embodiment of the present invention, a computer-readable storage medium is also provided, in which a computer program is stored, wherein the computer program is configured to carry out the steps of any of the above-described method embodiments when executed.
According to yet another embodiment of the present invention, there is also provided an electronic device, including a memory and a processor, the memory having a computer program stored therein, the processor being configured to execute the computer program to implement the steps in any of the above method embodiments.
According to the embodiment of the invention, the safety level of the user is determined based on the determined common debt level score of the user, and whether the user request of the user passes or not is determined based on the safety level, so that the condition that the data management system fails due to unreasonable passing of the user request is avoided.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
FIG. 1 is a flow chart of a method of determining a user request according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a modeling process according to an exemplary embodiment of the present invention;
fig. 3 is a schematic diagram of weight adjustment according to an exemplary embodiment of the present invention.
Detailed Description
The invention will be described in detail hereinafter with reference to the accompanying drawings in conjunction with embodiments. It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order.
Example 1
The method provided by the first embodiment of the present application may be executed in a mobile terminal, a computer terminal, a server, or a similar computing device. These devices have some structures that are known at present and will not be described again.
In the present embodiment, a method for determining a user request running on the computing device is provided, and fig. 1 is a flowchart of a method for determining a user request according to an embodiment of the present invention, as shown in fig. 1, the flowchart includes the following steps:
step S101, when the user to be determined the balance level is the stock user, the balance level score of the stock user is obtained based on the user characteristics of the stock user and the stock balance model algorithm, and the stock user is the user after the preset expression period;
when the user to be determined with the common debt level is an incremental user, obtaining the common debt level score of the incremental user based on the user characteristics of the incremental user, the common debt level score evaluation result of the inventory user by the inventory common debt model algorithm and the incremental common debt model algorithm, wherein the common debt level score evaluation result of the inventory user by the inventory common debt model algorithm is used for adjusting the weight of the user characteristics of the incremental user, and the incremental user is a new user who has not passed through a preset expression period;
step S103, determining the security level of the user based on the determined liability level score of the user, and determining whether to pass the user request of the user based on the security level.
According to the embodiment of the invention, the safety level of the user is determined based on the determined common debt level score of the user, and whether the user request of the user passes or not is determined based on the safety level, so that the condition that the data management system fails due to unreasonable passing of the user request is avoided. Wherein the security level may be determined according to the liability level score, if the liability level score indicates that the liability level of the user is high and the loan repayment capacity is low, the security level may be set to indicate that the user request of the user is not passed or the user request of the user is cautious passed, and the alarm may be given based on the user request. Vice versa, if the liability level score indicates that the user has a low liability level and a high loan repayment capacity, the security level may be set to indicate a user request by the user or a user request by the user more loosely, and a prompt may be made, for example, to prompt the user request.
In one example, the deriving the balance level score of the inventory consumer based on the consumer characteristics of the inventory consumer and an inventory balance model algorithm comprises:
acquiring the user characteristics of the stock users, and preprocessing the user characteristics of the stock users, wherein the user characteristics of each stock user correspond to a preprocessing mode;
based on the preprocessed user characteristics of the stock users, carrying out vector standardization to obtain a standardized matrix;
determining a liability level score for the inventory user based on the normalization matrix.
In one example, the determining the liability level score for the inventory user based on the standardized matrix comprises:
determining the distance between the stock user and the optimal scheme and the worst scheme by using a best-worst solution distance method according to the standardized matrix;
determining a liability level score for the inventory user based on the distance.
In one example, determining the distance between the inventory user and the optimal solution and the worst solution using a best solution method based on the normalization matrix comprises:
determining the distance between the inventory user and the optimal solution and the worst solution according to the following formula:
Figure BDA0002997915090000041
wherein i represents the ith sample client, j represents the jth index of the m indexes,
Figure BDA0002997915090000051
indicating the distance of the client i from the optimal solution,
Figure BDA0002997915090000052
indicates the distance, w, of the customer i from the worst casejIn order to obtain the weight of the index,
Figure BDA0002997915090000053
is the value of the jth index in the optimal solution,
Figure BDA0002997915090000054
is the value of the jth index in the worst case, zijIs the value of the jth index for client i;
determining a liability level score for the inventory user based on the distance, comprising:
determining a liability level score for the inventory consumer by the following formula:
Figure BDA0002997915090000055
wherein, CiRepresents the liability score of the customer i,
Figure BDA0002997915090000056
indicating the distance of the client i from the optimal solution,
Figure BDA0002997915090000057
indicating the distance of customer i from the worst case.
In one example, the index weight is determined by:
calculating the entropy value of each index by the following formula:
Figure BDA0002997915090000058
wherein x isijValue of j index representing i client, ejEntropy representing the jth index;
the index weight is obtained by the following formula:
Figure BDA0002997915090000059
wherein e isjEntropy, w, representing the j indexjThe index weight of the jth index is represented.
In one example, the obtaining the corporate level score of the incremental user based on the user characteristics of the incremental user, the evaluation result of the corporate level score of the inventory user by the inventory corporate model algorithm and the incremental corporate model algorithm comprises:
using the balance level score of the inventory user as a target variable of the incremental balance model algorithm;
adjusting the training sample weight of the incremental liability model using the liability level score of the inventory user;
and obtaining the liability level score of the incremental user by using the incremental liability model algorithm based on the user characteristics of the incremental user.
In one example, the method further comprises:
and verifying the balance level score of the stock user obtained by the stock balance model algorithm, and/or verifying the balance level score of the incremental user obtained by the incremental balance model algorithm.
In one example, the user characteristics of the inventory user include: the user attribute of the stock user and the behavior characteristic of the stock user; the user characteristics of the incremental user include: the user attributes of the incremental user.
Example embodiments
The following further explains the embodiment of the present invention with reference to a specific implementation scenario, and specifically explains the method for quantifying the risk of mutual debt of a client (also referred to as a user) provided in the embodiment by taking a bank scenario as an example, and it should be noted that the method provided in the embodiment is not only applicable to the bank scenario but also applicable to other financial institution scenarios, and details are not described below. The embodiment of the invention provides a method for quantifying risk of common debt of a client by combining unsupervised learning and supervised learning, which is used for risk identification control of a financial bank. The method provided by the embodiment adopts a supervised learning method in the pre-loan link and an unsupervised learning method in the middle loan link, and the two methods are combined to jointly quantify the common debt level of the customer. It should be added that "client" in this document, i.e., "user", is not described in detail below.
In one exemplary embodiment, an inventory liability model may be established for inventory customers of a bank. The new customers of the bank can become stock customers after a preset performance period, the customers can have more performance of the bank behavior in the performance period, relevant characteristic variables are extracted based on the attributes of the customers and the performance of the bank behavior, a comprehensive evaluation model is established through an unsupervised learning algorithm, and the common debt score is calculated for the common debt level of each customer. The attribute of the client may be basic information of the user, such as gender, age, industry occupation, education background, income, and the like, and may also include information of a credit inquiry result (e.g., a credit rating information) of the user.
In an exemplary embodiment, in the comprehensive evaluation process, expert experience is introduced to correct the model result, a small sample is extracted, the artificial judgment is carried out by front-end experts who give credit, examine and approve and the like according to experience, the judgment result of the experts and the result of the model are compared with each other, and the result of the stock common debt model is verified. That is, the result of scoring the liability model provided in the present embodiment may be compared with the result of manual determination for verification.
In an exemplary embodiment, for a newly added customer at a bank, an incremental liability model may be established; for the incremental common-bond model, a supervised learning algorithm can be used, the result of the stock common-bond model is provided for the incremental common-bond model training, the client common-bond scores obtained by the comprehensive evaluation model are classified into two categories and then serve as target variables of the pre-loan link, the sample weight of the incremental client model is adjusted based on the stock common-bond scores, and the incremental common-bond model is trained by using an xgboost algorithm.
The technical scheme adopted by the invention can greatly improve the identification capability of the risk of the common debt client, wherein the method combining unsupervised learning and supervised learning enables the quantitative result to be more objectively close to the actual common debt level of the client, thereby avoiding the unreasonable request of a data management system through a user.
The embodiment of the invention relates to a method for quantifying the risk of common debt of a client by combining unsupervised learning and supervised learning, which can be applied to the quantification of the common debt level of the client before, during and after loan and the like. Defining a grading threshold value of the common debt customer according to the distribution of the customer common debt grading, for example, directly giving rejection to the customer with high common debt level in credit strategies such as adjustment and management; otherwise, the customer is accepted, i.e., a low liability level is accepted.
In an exemplary embodiment, the process of establishing the model is mainly divided into the establishment of an inventory balance model (also referred to as "inventory client balance model") and the establishment of an incremental model (also referred to as "incremental client balance model") as shown in fig. 2.
1) The related index features of the stock clients (such as the attribute information of the client and the behavior features of the client at the bank in the above embodiment) are extracted, and particularly the behavior (i.e., the behavior features) of the client at the bank is extracted, so as to establish the feature variables of the stock balance model. The characteristic variables (i.e., "correlation index characteristics") are preprocessed by methods such as isotropic transformation (which may also be referred to as isotropic transformation), quantile transformation, and logarithmic transformation to construct a normalization matrix. The attribute information of the client may be basic information of the user, such as gender, age, industry occupation, education background, income, and the like, and may also include information of a credit inquiry result (e.g., a credit rating information) of the user. The behavior characteristics of the client at the bank can be transaction information, borrowing information, deposit information, amount information and the like, but not limited to the above, and the behavior characteristics can also comprise other characteristic indexes suitable for analyzing the common debt level of the client. It should be further noted that, the characteristic variables are preprocessed, one algorithm may be used to preprocess the characteristic variables, and multiple algorithms may also be used to preprocess one characteristic variable;
2) through an unsupervised learning method such as a good-poor solution method, an inventory client common debt degree score (also called as "inventory client common debt score") is obtained by establishing an inventory common debt model and considering the difference between a client and an optimal scheme and a worst scheme, wherein one client corresponds to one common debt degree score (also called as "common debt score").
3) And extracting the related attribute characteristics of the newly added client, adjusting the weight of the training sample by using the total debt score (also called as a "total debt degree score") of the stock client obtained by the total debt model of the stock client, and classifying the total debt score obtained by the total debt model of the stock client into two categories to be used as target variables of the newly added client.
4) And establishing an incremental common-liability model by using an xgboost algorithm based on the target variable and the training sample after the weight is adjusted, wherein the incremental common-liability model is used for quantifying the common-liability of the newly added client.
Wherein, the stock customer characteristic processing can use the following ideas:
basic information of the client, card information, credit inquiry data and the like are extracted, namely attribute information of the client is extracted. Since the scales and meanings of the indexes of the clients are different from each other, when the comprehensive evaluation method is used, the fact is that the distance scale is used for quantifying the difference between the clients. Using the distance scale, the index attribute is scaled.
Some of the data may use a homonymization transformation, which is applied to data in which the index value itself varies divergently from the liability level, such as the income index, and the higher the income index value, the higher the ability of the customer to pay back liabilities further indicates that the liability level of the customer may be smaller. On the other hand, for the liability index, the smaller the liability index value is, the less liability of the client is represented, so in order to unify the meaning of the index, we adopt the following processing for the anisotropic index such as income:
Figure BDA0002997915090000081
wherein x represents data before transformation (example)Such as revenue values), x' represents processed data (e.g., processed revenue values).
In addition, some information can be processed by using quantile transformation, for example, data distribution of part of indexes presents long tail characteristics, while in a model which is calculated by taking distance as a core, such as a comprehensive evaluation method, the distance scale is greatly influenced by an extreme value, the extreme value of part of customers can make the model unstable, and for the purpose of increasing robustness of the model and reducing influence of abnormal values on the model, quantile transformation is performed on the indexes presenting obvious long tail distribution of data, for example, 95 quantiles are adopted as maximum values, and abnormal data are removed (values greater than 95% of the scores are assigned as 95% quantiles of the variables):
x'=min(x,x0.95) Where x denotes data before transformation and x' denotes data after transformation.
In addition, some information can be processed by using logarithmic transformation, for example, the data distribution range of different indexes is different, for example, the dimensions of loan amount and loan number are different, and for this, logarithmic transformation is performed on indexes of amount and balance class (for example, various loan amounts and balances of customers), and the transformation can reduce the influence of different dimensions on the data as a whole to a certain extent:
x 'denotes log (1+ x), where x denotes data before transformation and x' denotes data after transformation.
Then, based on these transformed data, a normalized matrix is constructed, and a specific method is exemplified as follows:
let us have n customers, each customer has k indexes (i.e. the extracted basic information of the customer (including the attribute information of the customer and the behavior characteristics of the customer at the bank) is processed to obtain data, and one kind of data corresponds to one kind of index), then the original customer index matrix is:
Figure BDA0002997915090000091
wherein x isn1A value representing the 1 st index of the nth customer;
the metric is vector normalized, i.e., each column element is divided by the L2 norm of the current column vector,
Figure BDA0002997915090000092
xija value representing the jth index of the ith customer,
Figure BDA0002997915090000093
l2 norm, z representing the jth indexijAnd the j index of the ith client is represented as a value obtained by vector normalization transformation.
A normalized normalization matrix Z is thus obtained:
Figure BDA0002997915090000101
each client corresponds to one row in the matrix, for example, the first row is the index value of the first client, the nth row is the index value of the nth client, the row corresponds to the several clients, and the column corresponds to the several indexes.
In an exemplary embodiment, further, based on the normalized matrix Z obtained above, the distance between a certain customer and the optimal solution or the worst solution is obtained by using a best-and-worst solution distance method. The good-bad solution distance method is also called TOPSIS, and is a comprehensive evaluation method, which can fully utilize the information of original data and accurately reflect the difference between evaluation schemes. The basic process is based on the normalized homodromous feature matrix, an optimal scheme and a worst scheme in the limited schemes are found out by adopting a cosine method, then the distances between each evaluation object and the optimal scheme and the worst scheme are respectively calculated, the relative approach degree of each evaluation object and the optimal scheme is obtained and is used as a basis for evaluating the advantages and the disadvantages, and the specific algorithm flow is as follows:
inventory balance model algorithm flow:
procedure liability model (calculation Procedure):
carrying out syntropy transformation on the original index into X1;
scaling X1 to X2 within 0 to 95% quantile;
carrying out logarithmic transformation on the X2 to obtain X3;
constructing a normalized matrix Z after vector normalization, wherein the normalized matrix Z is { Z1, Z2, … …, zn };
each column Zi, do of For Z;
the ith dimension of the worst scheme Z-, the minimum value of Zi elements;
the ith dimension of the worst scheme Z +, and the maximum value of Zi elements;
End for
For zi∈Z do
proximity of Zi to optimal solution
Figure BDA0002997915090000111
Proximity of Zi to optimal solution
Figure BDA0002997915090000112
Score of Zi
Figure BDA0002997915090000113
End for
According to CiSorting the sizes of the components;
End procedure;
and outputting the comprehensive evaluation result of each client.
The normalized feature matrix has been described above as to how to determine the best solution and the worst solution, the best solution being composed of the maximum values of each column of elements in Z, which means the ideal case with the lowest degree of liabilities, which does not refer to a specific client, but the ideal case with the lowest degree of liabilities.
The worst case is composed of the minimum value of each column of elements in Z, which means the ideal case with the highest degree of common debt, and as with the best case, this case does not refer to a specific client, but the ideal case with the highest degree of common debt.
Calculating the distance between a certain customer and the optimal scheme and the worst scheme based on the index variable and the weight of the customer:
Figure BDA0002997915090000114
wherein i represents the ith sample client, j represents the jth index of the m indexes,
Figure BDA0002997915090000115
indicating the distance of the client i from the optimal solution,
Figure BDA0002997915090000116
indicates the distance, w, of the customer i from the worst casejThe weight of the j index determined by the entropy method and the like,
Figure BDA0002997915090000117
is the value of the jth index in the optimal solution,
Figure BDA0002997915090000118
is the value of the jth index in the worst case, zijIs the value of the jth index for client i;
and finally, calculating the comprehensive common debt degree of the client:
Figure BDA0002997915090000121
Cirepresents the liability score of the customer i,
Figure BDA0002997915090000122
indicating the distance of the client i from the optimal solution,
Figure BDA0002997915090000123
indicating the distance of customer i from the worst case.
CiThe degree of common debt of the client i is measured, and through the comprehensive evaluation method, the abstract degree of common debt is quantified, the quantification standard is not based on a certain fixed standard, but the various behavior characteristic information of the client is fully utilized, and the result can accurately reflect the facies between the clientsFor the liability level.
In an exemplary embodiment, determining the index weight based on the entropy weight method refers to: according to the basic principle of information theory, information is a measure of the degree of system order; while entropy is a measure of the degree of disorder of the system. The entropy weight method is an objective weighting method, and in a specific using process, the entropy weight method calculates the entropy weight of each index by using information entropy according to the variation degree of each index, and then corrects the weight of each index through the entropy weight, so that objective index weight is obtained.
Calculating the entropy value of each index:
Figure BDA0002997915090000124
xijvalue of j index representing i client, ejRepresenting the entropy value of the j-th index.
Further, we normalize the entropy values as weights for each index:
Figure BDA0002997915090000125
ejentropy, w, representing the j indexjRepresenting the entropy value of the j-th index after normalization.
wjThe larger the index is, the larger the amount of information represented by the index is, the stronger the ability to discriminate the degree of liability is.
In one exemplary embodiment, expert experience may be used to verify the effectiveness of the supervised learning approach. In order to verify the effectiveness of the supervised learning algorithm, an expert experience judgment method is introduced to correct the result of the comprehensive evaluation model. And (3) manually judging the extracted data according to experience by front-end experts such as credit granting, examination and approval and the like, for example, the experts label 1050 cases in total, wherein each case adopts a pairwise cross judgment mode. And adopting a mode of judging again for the case with large individual difference.
For the newly added customers, an incremental common bond model is established to evaluate the common bond level of the customers. Firstly, extracting relevant index characteristics (including attribute information of the client) of the newly added client, such as basic information (such as gender, age, industry occupation, education background, income and the like) and data of a pedestrian credit inquiry result and the like, and establishing a corresponding data set.
The method uses the xgboost as an increment common-liability model, the xgboost is an optimized distributed Gradient increment learning framework, the method realizes a machine learning algorithm under the Gradient Boosting idea, provides parallel tree promotion, and can quickly and accurately solve a plurality of data science problems. A highly extensible end-to-end lifting tree system is designed and constructed, a theoretically reasonable weighting quantile sketch map is provided for calculating a candidate set, a novel sparse perception algorithm is introduced for parallel tree learning, and missing values are enabled to have default directions.
xgboost is a linear addition model that sums the results of K trees as the final predictor, i.e.:
Figure BDA0002997915090000131
the first term to the right of the equality sign of the above equation is the loss function, where n is the number of training function samples, l is the loss to a single sample, assuming it is a convex function, yiThe true label value of the sample is trained for the mode,
Figure BDA0002997915090000132
for the model to the predicted value of the training sample, ft(xi) A model representing the ith tree of the ith customer.
Second term Ω (f) of the above formulat) Is a regular term, controls the complexity of the tree, and prevents overfitting. The regularization term defines the complexity of the model:
Figure BDA0002997915090000133
wherein gamma and lambda are regularization parameters set manually, omega is a vector formed by all leaf node values of the decision tree, and T is the number of the leaf nodes.
The combination of the inventory and the incremental model, the expert experience, the segmentation of the model score of the inventory common-liability model, the judgment of whether the client is a common-liability client or not and the judgment of whether the client is a target variable of the incremental common-liability model, so that the incremental model of the newly added client can learn the characteristics of the inventory client with a large amount of behavioral expressions, and the predictability of the incremental common-liability model is improved.
In this embodiment, the target variable of the model is an index that the model intends to predict or evaluate, and in this embodiment, the target variable is: whether the customer is a corporate client.
The model is trained with input target variables, i.e., the model training phase, which needs to determine what the training samples are, the model using phase, which needs to determine what the model input is, and what the output result is. For example, in the training phase: the new clients become stock clients after a certain presentation period, and the characteristic indexes of the clients in the stock stage are selected as the input of the stock balance model; and selecting the characteristic indexes of the clients in the application stage as the input of the newly added common debt model. In the model use stage: and the characteristic index of the new client in application is used as the input of the newly added common debt model. In this embodiment, not only the model score of the stock liability model is used as the target variable of the incremental liability model, but also the model score is used to adjust the weight of the training sample of the incremental liability model, as shown in fig. 3:
for customers with inventory liability scores greater than the median, their sample weight x:
x=(d-d0.5)/(1-d0.5) D is the stock balance score of the sample, d0.5Is the median of the total incremental bond model sample inventory bond scores.
For customers with inventory liability scores less than the median, their sample weights are:
x=(d-d0.5)/d0.5d is the stock balance score of the sample, d0.5Is the median of the total incremental common debt model sample stock common debt score。
Because the goal of modeling is to identify customers with very high and very low liability levels, the sample weights are adjusted by the stock liability score. The inventory balance scores were high and low for customers, with the sample weights being greater. This allows samples with inventory liability scores that deviate more from the median or average to have more weight, which in turn allows the xgboost algorithm to more easily identify customers with very high and very low liability levels.
In the embodiment, an xgboost algorithm is used for establishing an incremental common-liability model and predicting the common-liability level of a newly-added client, so that an inventory common-liability model is established for inventory clients by using an unsupervised learning method, an incremental common-liability model is established for the newly-added client by using a supervised learning method, and an evaluation system for different stages of the whole life process of the client is established by combining supervision and unsupervised learning, and the robustness of the evaluation of the overall common-liability level can be improved more.
Through the above description of the embodiments, those skilled in the art can clearly understand that the method according to the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but the former is a better implementation mode in many cases. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, or a network device) to execute the method according to the embodiments of the present invention.
Embodiments of the present invention also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to implement the steps in any of the above method embodiments when executed. Optionally, the specific examples in this embodiment may refer to the examples described in the above embodiments and optional implementation manners, and this embodiment is not described herein again.
Optionally, in this embodiment, the storage medium may include, but is not limited to: various media capable of storing computer programs, such as a usb disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk.
Embodiments of the present invention also provide an electronic device comprising a memory having a computer program stored therein and a processor configured to execute the computer program to implement the steps of any of the above method embodiments.
Optionally, the electronic apparatus may further include a transmission device and an input/output device, wherein the transmission device is connected to the processor, and the input/output device is connected to the processor.
Optionally, the specific examples in this embodiment may refer to the examples described in the above embodiments and optional implementation manners, and this embodiment is not described herein again.
It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and alternatively, they may be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, and in some cases, the steps shown or described may be performed in an order different than that described herein, or they may be separately fabricated into individual integrated circuit modules, or multiple ones of them may be fabricated into a single integrated circuit module. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

1. A method for determining a user request, comprising:
when a user to be determined of the balance level is a stock user, obtaining a balance level score of the stock user based on user characteristics of the stock user and a stock balance model algorithm, wherein the stock user is a user after a preset performance period;
when a user to be determined with a common debt level is an incremental user, obtaining a common debt level score of the incremental user based on the user characteristics of the incremental user, the common debt level score evaluation result of an inventory user by the inventory common debt model algorithm and the incremental common debt model algorithm, wherein the common debt level score evaluation result of the inventory user by the inventory common debt model algorithm is used for adjusting the weight of the user characteristics of the incremental user, and the incremental user is a new user who has not passed through a preset expression period;
determining a security level of the user based on the determined liability level score of the user, and determining whether to pass the user request of the user based on the security level.
2. The method of claim 1, wherein said deriving a balance level score for said inventory subscribers based on said subscriber characteristics of said inventory subscribers and an inventory balance model algorithm comprises:
acquiring the user characteristics of the stock users, and preprocessing the user characteristics of the stock users, wherein the user characteristics of each stock user correspond to a preprocessing mode;
based on the preprocessed user characteristics of the stock users, carrying out vector standardization to obtain a standardized matrix;
and determining the common debt level score of the inventory user according to the standardized matrix.
3. The method of claim 2, wherein said determining a liability level score for said inventory subscribers from said standardized matrix comprises:
determining the distances between the stock user and the optimal scheme and the worst scheme by using a best-worst solution distance method according to the standardized matrix;
and determining the common debt level score of the inventory user according to the distance.
4. The method of claim 3, wherein determining the distance between the inventory user and the optimal solution and the worst solution using a best-and-worst solution method according to the normalization matrix comprises:
determining the distance between the inventory user and the optimal solution and the worst solution according to the following formula:
Figure FDA0002997915080000021
wherein i represents the ith sample client, j represents the jth index of the m indexes,
Figure FDA0002997915080000028
indicating the distance of the client i from the optimal solution,
Figure FDA0002997915080000022
indicates the distance, w, of the customer i from the worst casejIn order to obtain the weight of the index,
Figure FDA0002997915080000023
is the value of the jth index in the optimal solution,
Figure FDA0002997915080000024
is the value of the jth index in the worst case, zijIs the value of the jth index for client i;
determining a liability level score for the inventory user based on the distance, comprising:
determining a liability level score for said inventory consumer by the formula:
Figure FDA0002997915080000025
wherein, CiRepresents the liability score of the customer i,
Figure FDA0002997915080000026
indicating the distance of the client i from the optimal solution,
Figure FDA0002997915080000027
indicating the distance of customer i from the worst case.
5. The method of claim 4, wherein the metric weight is determined by:
calculating the entropy value of each index by the following formula:
Figure FDA0002997915080000031
wherein x isijValue of j index representing i client, ejEntropy representing the jth index;
the index weight is obtained by the following formula:
Figure FDA0002997915080000032
wherein e isjEntropy, w, representing the j indexjThe index weight of the jth index is represented.
6. The method for determining a user request according to any one of claims 1 to 5, wherein the obtaining of the liability level score of the incremental user based on the user characteristics of the incremental user, the assessment result of the liability level score of the inventory user by the inventory liability model algorithm and the incremental liability model algorithm comprises:
using the balance level score of the inventory user as a target variable of the incremental balance model algorithm;
adjusting training sample weights for the incremental liability model using the liability level scores of the inventory users;
and obtaining the liability level score of the incremental user by using the incremental liability model algorithm based on the user characteristics of the incremental user.
7. The method of determining a user request according to claim 1, further comprising:
and verifying the balance level score of the stock user obtained by the stock balance model algorithm, and/or verifying the balance level score of the incremental user obtained by the incremental balance model algorithm.
8. The method of claim 1, wherein the user characteristics of the inventory users comprise: the user attribute of the stock user and the behavior characteristic of the stock user; the user characteristics of the incremental user include: user attributes of the incremental user.
9. A computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to carry out the method of any one of claims 1 to 8 when executed.
10. An electronic device comprising a memory and a processor, wherein the memory has stored therein a computer program, and wherein the processor is arranged to execute the computer program to implement the method of any one of claims 1 to 8.
CN202110336662.6A 2021-03-29 2021-03-29 User request determining method, storage medium and electronic device Active CN112862602B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110336662.6A CN112862602B (en) 2021-03-29 2021-03-29 User request determining method, storage medium and electronic device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110336662.6A CN112862602B (en) 2021-03-29 2021-03-29 User request determining method, storage medium and electronic device

Publications (2)

Publication Number Publication Date
CN112862602A true CN112862602A (en) 2021-05-28
CN112862602B CN112862602B (en) 2024-05-24

Family

ID=75993132

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110336662.6A Active CN112862602B (en) 2021-03-29 2021-03-29 User request determining method, storage medium and electronic device

Country Status (1)

Country Link
CN (1) CN112862602B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108648074A (en) * 2018-05-18 2018-10-12 深圳壹账通智能科技有限公司 Loan valuation method, apparatus based on support vector machines and equipment
CN109727123A (en) * 2019-01-04 2019-05-07 平安科技(深圳)有限公司 User's collage-credit data construction method, device and computer readable storage medium
CN110135972A (en) * 2019-04-23 2019-08-16 上海淇玥信息技术有限公司 A kind of method, apparatus, system and recording medium for improving user and moving branch rate
CN110728301A (en) * 2019-09-09 2020-01-24 北京镭文科技有限公司 Credit scoring method, device, terminal and storage medium for individual user

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108648074A (en) * 2018-05-18 2018-10-12 深圳壹账通智能科技有限公司 Loan valuation method, apparatus based on support vector machines and equipment
CN109727123A (en) * 2019-01-04 2019-05-07 平安科技(深圳)有限公司 User's collage-credit data construction method, device and computer readable storage medium
CN110135972A (en) * 2019-04-23 2019-08-16 上海淇玥信息技术有限公司 A kind of method, apparatus, system and recording medium for improving user and moving branch rate
CN110728301A (en) * 2019-09-09 2020-01-24 北京镭文科技有限公司 Credit scoring method, device, terminal and storage medium for individual user

Also Published As

Publication number Publication date
CN112862602B (en) 2024-05-24

Similar Documents

Publication Publication Date Title
Kozodoi et al. Fairness in credit scoring: Assessment, implementation and profit implications
Maldonado et al. Integrated framework for profit-based feature selection and SVM classification in credit scoring
Verbraken et al. Development and application of consumer credit scoring models using profit-based classification measures
US20180260891A1 (en) Systems and methods for generating and using optimized ensemble models
US20220122171A1 (en) Client server system for financial scoring with cash transactions
Bravo et al. Granting and managing loans for micro-entrepreneurs: New developments and practical experiences
Callejón et al. A System of Insolvency Prediction for industrial companies using a financial alternative model with neural networks
CN112102073A (en) Credit risk control method and system, electronic device and readable storage medium
CN115205011B (en) Bank user portrait model generation method based on LSF-FC algorithm
Eddy et al. Credit scoring models: Techniques and issues
CN112232950A (en) Loan risk assessment method and device, equipment and computer-readable storage medium
CN111061948A (en) User label recommendation method and device, computer equipment and storage medium
CN113344692B (en) Method for establishing network loan credit risk assessment model with multi-information-source fusion
CN112836750A (en) System resource allocation method, device and equipment
CN116468273A (en) Customer risk identification method and device
CN116596659A (en) Enterprise intelligent credit approval method, system and medium based on big data wind control
Ray et al. Bayesian rapid optimal adaptive design (broad): Method and application distinguishing models of risky choice
CN113177733B (en) Middle and small micro enterprise data modeling method and system based on convolutional neural network
CN112862602B (en) User request determining method, storage medium and electronic device
CN116128339A (en) Client credit evaluation method and device, storage medium and electronic equipment
CN115271442A (en) Modeling method and system for evaluating enterprise growth based on natural language
CN114170000A (en) Credit card user risk category identification method, device, computer equipment and medium
CN113052512A (en) Risk prediction method and device and electronic equipment
Lee et al. Application of machine learning in credit risk scorecard
US20230222378A1 (en) Method and system for evaluating fairness of machine learning model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant