CN113610638A - SMAA-DS-based rating system and method for matching credit rating with default loss rate - Google Patents

SMAA-DS-based rating system and method for matching credit rating with default loss rate Download PDF

Info

Publication number
CN113610638A
CN113610638A CN202110962078.1A CN202110962078A CN113610638A CN 113610638 A CN113610638 A CN 113610638A CN 202110962078 A CN202110962078 A CN 202110962078A CN 113610638 A CN113610638 A CN 113610638A
Authority
CN
China
Prior art keywords
credit
index
borrower
default
loss rate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110962078.1A
Other languages
Chinese (zh)
Other versions
CN113610638B (en
Inventor
李刚
马洪栋
刘荣月
张可心
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northeastern University Qinhuangdao Branch
Original Assignee
Northeastern University Qinhuangdao Branch
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northeastern University Qinhuangdao Branch filed Critical Northeastern University Qinhuangdao Branch
Priority to CN202110962078.1A priority Critical patent/CN113610638B/en
Publication of CN113610638A publication Critical patent/CN113610638A/en
Application granted granted Critical
Publication of CN113610638B publication Critical patent/CN113610638B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

The invention provides a system and a method for grading a credit grade matched with a default loss rate based on SMAA-DS, and relates to the technical field of credit evaluation. The method comprises a user login registration module, a user data management module and a user credit rating module, wherein a binary Logistic regression model between a single qualitative index and a default state is constructed by collecting historical data of all borrowers of credit loan services accumulated by a financial institution, and the binary Logistic regression model is divided into a discrete index and a continuous index according to index properties; performing multiple collinearity detection on the discrete index and the continuous index by using a Lasso-Logistic model, constructing a credit score index system with optimal overall default discrimination capability under two index types, and calculating the credit score of the borrower under the two index types; and establishing a credit rating division optimization model, combining the two index types in different proportions, determining the credit rating information of the borrower, and obtaining the credit rating of the user.

Description

SMAA-DS-based rating system and method for matching credit rating with default loss rate
Technical Field
The invention relates to the technical field of credit evaluation, in particular to a system and a method for rating a credit grade matched with default loss rate based on SMAA-DS.
Background
The credit score is a main tool for evaluating the credit condition of the borrower, and the financial institution or the borrower makes corresponding credit decisions according to the credit score and the credit level and effectively allocates credit resources including loan, loan interest rate, loan amount and the like. The specific method comprises the steps of analyzing a plurality of samples of historically default customers and non-default customers, mining key characteristics influencing whether the borrowers default or not from known data, establishing a mathematical model and measuring default risks of the borrowers.
In the current research on credit score, the credit score of the borrower is generally calculated by directly and comprehensively using all index information. In recent years, some scholars classify indices into hard information and soft information according to whether information can be accurately quantified and reliably delivered. The soft information can reduce the information asymmetry between the borrower and the lender to a certain extent, so that the lender and the loan platform can better evaluate the default risk of borrowing and the loan interest rate. The soft information and the hard information are distinguished, so that the problem of reverse selection brought by the financial market is solved, and the operation efficiency of the loan market is improved. In particular, the availability of loans is increased for borrowers and the investment risk is reduced for investors.
For the borrower, the soft information comprises the sex, the age, text information in loan description, photos published by the social network, online behaviors, personality, morality and other indexes of the borrower. The hard information comprises the indexes of annual income, working age, income ratio of liability, FICO score, cyclic loan utilization rate and the like. For small micro-enterprises, the hard information includes indexes such as pre-tax profit/gross assets (ROA), Short-term debt/equity, and cash/gross assets (cash/total asset), and the soft information includes intangible assets/fixed assets (intangible assets/fixed assets), development expenditure/sales (R & D/sales), and potential market situation (potential market) indexes.
The text information contains information such as the character organization ability of the borrower, the percentage of misspelled words, the borrowing purpose, the debt condition and the like, the repayment ability and the repayment willingness, so the text information is widely researched. The role of rientsgaem (2020) in crediting the loan assignment of the investor familiar with borrowers in the P2P loan is confirmed by the P2P loan data of the netherlands, which shows that the investor familiar with borrowers can invest before other P2P investors, and that the loan has a low probability of default. Weiguo Zhang et al (2020) propose a new approach to fully mining textual information in the description of loans. The results of the demonstration of loan data of the United states of America Lendingclub and Chinese people show that a combined model considering soft information and hard information of loan description text information is superior to a model only considering hard information in the indexes of AUC and G-mean of loan prediction. The statistical measurement model for distinguishing the index types considers different influences of soft information and hard information on default probability, and improves the accuracy of default judgment.
However, the existing research neglects the different influence of exploring the discrete index and the continuous index on the default probability from the angle of the index change mode and the change amplitude. Mathematically, a discrete index is a variable that has only a few values, and a continuous index is a variable that has an infinite number of values within a certain interval. In the field of credit scoring, a typical discrete indicator is the number of credit card accounts and a typical continuous indicator is annual revenue. The soft information and the hard information are crossed with the discrete type and the continuous type indexes, for example, the index of annual income belongs to the hard information from the classification of the hard information and the soft information, and belongs to the continuous type indexes from the classification of the discrete type indexes and the continuous type indexes. From the perspective of the index types, the mapping relationships between different index types and default states are different, and the default discrimination capabilities of different index types are also different.
In many practical decision problems, it is difficult for a decision maker to know the exact relevant information, or there may be a lack of information required for many decisions. Random multi-attribute acceptability analysis (SMAA) is a decision-making method to help a decision-maker make a multi-attribute decision in an uncertain situation. The method of the inverse weight space analysis can help the decision maker to find the best scheme under the condition of not knowing the accurate attribute value and the preference information of the decision maker. By the method of inverse weight space analysis, a lot of information can be obtained, and a decision maker can judge an optimal scheme according to the information.
Evidence theory was introduced in 1967 by Dempster, university of Harvard, USA, who his student Shafer promoted and perfected it, so evidence theory is also called Dempster-Shafer theory (D-S theory for short). Evidence theory is an uncertainty reasoning method that allows a problem to be treated as an evidence, the description of the problem as the proposition of supporting evidence, and the set of all possible propositions as the power set of supporting evidence. For example, in other words, the proposition of supporting evidence is a subset of the power set. Such as: if a criminal crimes a suspect, the suspect must be described as a subset of the power set { convicted } { guilty } and { indeterminate }. Evidence theory represents each proposition in the form of a confidence level due to the complexity of the problem or uncertainty of the subjective decision. For example, m (a) is 0.5, m (b) is 0.2, and m (c) is 0.3, which indicates that the confidence of the certificate to be guilty is 0.5, the confidence of the certificate to be guilty is 0.2, and the confidence of the certificate to be indeterminate is 0.3. However, in practical application, there is a conflict problem between evidences, the D-S theory sometimes cannot be fused to achieve reasonable effects, and especially in 1984, Zadeh proposes that the D-S theory has paradox phenomena, which greatly restricts the application of the D-S theory. In order to solve the conflict problem, an Evidence Reasoning (ER) method is proposed by the Yankee waves, the Wang Ying and the like, evidence weight is introduced into the method, an ER combination rule is adopted on the basis of a D-S theory, and the paradox problem during evidence synthesis is effectively solved.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a system and a method for grading a credit grade based on SMAA-DS and matching default loss rate.
In order to solve the technical problems, the invention adopts the following technical scheme:
in one aspect, a system for rating SMAA-DS based credit ratings to match default loss rates, comprising: the system comprises a user login registration module, a user data management module and a user credit rating module.
The user login registration module is used for registering personal information of a borrower through a user page, wherein the personal information comprises a mobile phone number and a name, and the borrower enters a login page to log in an account after the registration is successful;
the user data management module clicks a borrowing button to enter a borrowing detailed information page after the borrower logs in, and index data of the borrower are added and modified on the borrowing detailed information page;
and the user credit rating module clicks a credit rating button to enter a credit rating information page after the borrower logs in, performs credit rating and displays the credit rating page.
On the other hand, a credit rating adjusting method based on matching of the SMAA-DS credit rating and the default loss rate is realized based on the SMAA-DS credit rating system, and specifically comprises the following steps:
step 1: preprocessing all borrower historical data of credit loan businesses accumulated by a financial institution;
step 1.1: carrying out standardization treatment on the quantitative index value by adopting a maximum and minimum standardization method;
step 1.2: matching the scoring result of the qualitative index value with the default state according to the principle that the larger the scoring result corresponding to the qualitative index value is, the lower the default probability is, and counting the corresponding relation between the qualitative index value and the default probability by using a data perspective tool of Excel, wherein the default number corresponding to the same qualitative index value of the default probability is divided by the total number;
step 1.3: the qualitative index value is scored using the efficacy coefficient method, as shown in the following formula:
let xijRepresenting the membership value of the jth index of the ith borrower; vijThe value of the jth index of the ith borrower is represented, and the efficiency coefficient method formula (1):
Figure BDA0003222419910000031
wherein:
Figure BDA0003222419910000032
and
Figure BDA0003222419910000033
is a satisfied value and an unallowed value of the jth index, and c and d are constants;
step 2: constructing a binary Logistic regression model between a single qualitative index and a default state, screening the single index with default discrimination capability by using Wald statistic, and then dividing the reserved single index into a discrete index and a continuous index according to the index property;
in the binary Logistic regression model, a set of independent and identically distributed observation data (X, Y) is assumed, wherein Y is a dependent variable, and Y ═ Y1,y2,…,yn),ynThe default state of the nth borrower is shown, n represents a total of n observation data, namely the borrower, X is an independent variable of a J-dimensional vector, and X is (X ═ X)i1,xi2,…,xij…,xiJ),xijThe value of the jth index of the ith borrower is represented, J represents a total of J dimensions (indexes), yiE {0,1 }. Setting yi1 indicates a default by the borrower, yi0 means that the borrower has no default, P (y)i=1|xi)=piIs represented by xiKnown condition yiProbability of 1, xiIndicating the ith borrower, piAs shown in equation (2), where β is the coefficient vector.
Figure BDA0003222419910000034
The Logistic model can be obtained by transforming the equation (2), which is shown in equation (3).
Figure BDA0003222419910000041
Building Wald statistics, and performing significance test on regression coefficients of single indexes; let WjFor the Wald statistical value of the jth index,
Figure BDA0003222419910000042
is an estimate of the coefficient of the j-th index, SEβjIs a coefficient of betajSo the Wald statistic W of the j-th indexiAs shown in formula (4).
Figure BDA0003222419910000043
And step 3: performing multiple collinearity inspection on discrete indexes and continuous indexes by using a Lasso-Logistic model, constructing a credit score index system with optimal overall default discrimination capability under two index types, setting weight constraints by respectively using regression coefficients of Lasso-Logistic as the basis of the index weight, solving the optimal weights under the two index types by using a random multi-attribute acceptability analysis method SMAA, and calculating credit scores of borrowers under the two index types;
in the Lasso-Logistic model, a set of independent and identically distributed observed data (X, Y) is assumed, wherein Y is a dependent variable, and Y ═ Y1,y2,…,yn) X is an argument of a p-dimensional vector, and X is (X)i1,xi2,…,xip) And n groups of data are provided, and each group of data comprises p independent variables and 1 dependent variable. When the dependent variable is a binary classification variable, the dependent variable is a repayment state of the borrower showing default or non-default conditions, and fitting is carried out by using a Lasso-Logistic regression model;
the parameter estimation of the Lasso-Logistic regression on the coefficient beta of n independent variables X is specifically shown in the formula (5).
Figure BDA0003222419910000044
Wherein, lambda is a parameter for adjusting the compression coefficient,
Figure BDA0003222419910000045
the degree of fit of the model is represented,
Figure BDA0003222419910000046
is a penalty to the variable coefficients in the model, since each observation data is independently and identically distributed, their joint distribution can be represented by the product of each marginal distribution, i.e. the likelihood function of n observations (borrowers) is represented by L (θ).
And calculating the credit score of the borrower in a linear addition mode by using the index score and the index weight in the feature combination. Suppose that
Figure BDA0003222419910000047
Weights (J1, 2,3, …, J), (K1, 2,3, …, K,), u, representing the kth simulation of the jth indexijIndicating the normalized value of the ith borrower under the index j, the credit score of the ith borrower in the k simulation
Figure BDA0003222419910000048
As shown in formula (6):
Figure BDA0003222419910000049
credit score mean value of ith borrower in K times of simulation
Figure BDA00032224199100000410
As shown in formula (7):
Figure BDA00032224199100000411
and 4, step 4: establishing a credit grade division optimization model, wherein the optimization model consists of 1 objective function and 2 constraint conditions, the maximum square sum of default loss rate differences between adjacent credit grades is taken as the objective function, the higher the credit grade is, the lower the default loss rate is taken as the constraint condition 1, the default loss rate difference value of the adjacent grades is taken as the constraint condition 2, and the range from a to b times that the latter default loss rate difference value is the former difference value is taken as the constraint condition 2, so as to determine the credit grade of the borrower under the two index types.
Step 4.1: establishing an objective function f;
divide the borrower into L credit classes, LGDlThe penalty loss rate for the ith credit rating is the objective function as shown in equation (8).
obj:maxf=(LGDl-LGDl-1)2+(LGDl-1-LGDl-2)2+...+(LGD2-LGD1)2 (8)
Step 4.2: establishment of constraint 1 and constraint 2
Constraint 1: the default loss rate of each credit rating increases as the credit rating decreases, as shown in equation (9):
s.t.:0<LGD1<LGD2<...<LGDl≤1 (9)
constraint 2: and the default loss rate difference value of the adjacent grade is constrained, and the later default loss rate difference value is a-b times of the former difference value.
Let Δ LGDl,l+1For the ith credit level LGDlAnd the l +1 st credit level LGDl+1The default loss rate difference (d), i.e., the difference between adjacent levels, is represented by equation (10).
ΔLGDl,l+1=LGDl+1-LGDl (10)
The default loss rate difference between adjacent levels is set as shown in equation (11).
ΔLGDl,l+1=[a,b]×ΔLGDl-1,l (11)
Wherein the default loss rate LGD of the first credit ratinglAs shown in formula (12):
Figure BDA0003222419910000051
wherein, glThe number of borrower samples representing the first credit level; glTotal number of borrower samples representing first 1 credit levels, Gl=g1+g2+…+gl-1;LliIndicating the amount of default loss of the ith borrower in the ith credit rating(Loss);RliIndicating the receivable sum (Receivables) of the ith borrower in the ith credit rating.
And 5: respectively determining T credit scores of the borrowers based on an SMAA method under the two index types, and further comparing the T credit scores with threshold points of the credit grades to determine the probability that each borrower belongs to different credit grades;
taking the average value of the T-time scores as the credit score of the borrower, recording the credit score as the average value of the credit score, and determining the credit grade of the borrower according to the average value of the credit score; simulating the credit scores of T times, comparing the credit scores of T times with the threshold points of the credit grades, and determining the probability that each borrower belongs to different credit grades;
step 6: combining the two index types in different proportions, integrating the credit levels of the borrowers under the two index types by using an evidence theory ER combination rule, and comparing the default loss discrimination f under the different combination proportions by calculating, namely the target function f in the step 4.1, obtaining the optimal combination proportion of the discrete index and the continuous index as the final combination proportion of the two index types in the SMAA-DS model according to the maximum discrimination f, and inputting the proportion into the evidence theory ER combination rule to determine the credit level information of the borrowers;
the evidence theory ER combination rule is as follows:
step D1: calculating the BPA value of each evidence before the synthesis, wherein each evidence refers to the probability that the borrower belongs to all grades in the discrete index and the continuous index respectively
Let Θ be the recognition framework, 2 evidences m1、m2:2Θ→[0,1]。
Figure BDA0003222419910000061
Figure BDA0003222419910000062
Figure BDA0003222419910000063
Figure BDA0003222419910000064
Wherein woIs a relative weight of evidence o, and
Figure BDA0003222419910000065
o is the number of evidences. Beta is ao(Hn) And betao(H) Respectively represent the focal unit
Figure BDA0003222419910000066
And
Figure BDA0003222419910000067
the degree of confidence of (a) is,
Figure BDA0003222419910000068
is the BPA value for evidence o, L represents the L-th level, for a total of L levels.
Step D2: and synthesizing the evidence by using an evidence theory ER combination rule, and calculating the BPA value of each synthesized evidence.
Figure BDA0003222419910000069
Figure BDA00032224199100000610
Figure BDA00032224199100000611
Figure BDA00032224199100000612
Wherein m is12(Hl) A BPA value representing the first level to which the borrower is combined; k is an evidence conflict coefficient, and the greater K is the greater the conflict between the evidences is; b and C represent that the borrower belongs to a certain grade under the discrete index type and the continuous index type respectively;
Figure BDA00032224199100000613
is a process parameter;
Figure BDA00032224199100000614
respectively representing the belief ratios (probabilities) to be distributed in discrete type, continuous type, discrete type and continuous type indexes; m is1(B),m2(C) BPA (mass function) before combination for B and C, respectively.
Step D3: and converting the BPA value of each synthesized evidence into a confidence Bel, namely that the borrower belongs to a credit level H in the SMAA-DS modellThe probability of (c).
Figure BDA00032224199100000615
Figure BDA00032224199100000616
And 7: the user logs in at the user login module, and then fills in and modifies personal information at the user data management module. And inputting the data input by the borrower in the user data management module into the user credit rating module, and clicking the user credit rating module by the user to display the credit rating.
The invention has the following beneficial effects:
the invention provides a credit rating system and a credit rating method based on matching of SMAA-DS rating and default loss rate, which have the following beneficial effects:
1. according to the invention, from the angle of index change amplitude and mode, a credit rating model constructed by a discrete type index and a continuous type index in a nonlinear mode meets the standard of 'higher credit grade, lower default loss rate', and has stronger capacity of distinguishing borrowers with different default possibilities.
2. The invention provides important reference for credit rating and credit decision of financial institutions or borrowers, and effectively allocates credit resources including loan and credit rating.
Drawings
FIG. 1 is a block diagram of the overall architecture of a credit rating system in an embodiment of the invention;
FIG. 2 is a general flow diagram of a credit rating method in an embodiment of the invention;
FIG. 3 is a comparison chart of credit rating classification results of different models according to an embodiment of the present invention.
Detailed Description
The following detailed description of embodiments of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.
In one aspect, a system for rating SMAA-DS based credit ratings to match default loss rates, as shown in FIG. 1, comprises: the system comprises a user login registration module, a user data management module and a user credit rating module.
The user login registration module is used for registering personal information of a borrower through a user page, wherein the personal information comprises a mobile phone number and a name, and the borrower enters a login page to log in an account after the registration is successful;
the user data management module clicks a borrowing button to enter a borrowing detailed information page after the borrower logs in, and index data of the borrower are added and modified on the borrowing detailed information page;
and the user credit rating module clicks a credit rating button to enter a credit rating information page after the borrower logs in, performs credit rating and displays the credit rating page.
On the other hand, a credit rating adjustment method based on matching of the credit rating of the SMAA-DS and the default loss rate is implemented based on the aforementioned credit rating system based on matching of the credit rating of the SMAA-DS and the default loss rate, as shown in fig. 2, and specifically includes the following steps:
step 1: preprocessing all borrower historical data of credit loan businesses accumulated by a financial institution;
step 1.1: carrying out standardization treatment on the quantitative index value by adopting a maximum and minimum standardization method; wherein the index value is an index which is all numbers;
step 1.2: matching the scoring result of the qualitative index value with the default state according to the principle that the larger the scoring result corresponding to the qualitative index value is, the lower the default probability is, and counting the corresponding relation between the qualitative index value and the default probability by using a data perspective tool of Excel, wherein the default number corresponding to the same qualitative index value of the default probability is divided by the total number; for example, if the home address is 10000 people in a certain city, wherein 10000 people default, the value of the borrower in the qualitative index of the home address is the certain city, and the corresponding default probability is 0.1;
the default state is an index carried by each piece of data in the history sample;
step 1.3: the qualitative index value is scored using the efficacy coefficient method, as shown in the following formula:
let xijRepresenting the membership value of the jth index of the ith borrower; vijThe value of the jth index of the ith borrower is represented, and the efficiency coefficient method formula (1):
Figure BDA0003222419910000081
wherein:
Figure BDA0003222419910000082
and
Figure BDA0003222419910000083
is the satisfied and disallowed values for the jth index, c, d are constants, c functions to translate the transformed value, and d functions to either enlarge or reduce the transformed value. In this case, c is 0.5 and d is 0.5. According to default of each classification of qualitative indexes in the initial index system, the qualitative indexes are scored, and the 6 qualitative indexes are subjected to standardization treatment processes as shown in the table1 is shown.
TABLE 1 qualitative index standardization process
Figure BDA0003222419910000084
In this embodiment, 43471 loan data of three years of 2012 of a platform are used for demonstration, and 16574 loan data are left as a sample after data cleaning.
Step 2: constructing a binary Logistic regression model between a single qualitative index and a default state, screening the single index with default discrimination capacity by using Wald statistic, removing the index with Wald statistic smaller than 3.841, reserving the index with Wald statistic larger than 3.841, and then dividing the reserved single index into a discrete index and a continuous index according to the index property;
the discrete index includes age, academic history, marital status, house ownership status, loan record existence or not, whether unreturned bank borrowing exists or not, and the like. The continuous indexes include monthly income, monthly payment amount accounting for monthly income proportion, monthly daily expenditure and the like.
Logistic regression is a common classification model, in which dependent variables are discrete variables and binary variables are the main variables.
In the binary Logistic regression model, a set of independent and identically distributed observation data (X, Y) is assumed, wherein Y is a dependent variable, and Y ═ Y1,y2,…,yn),ynThe default state of the nth borrower is shown, n represents a total of n observation data, namely the borrower, X is an independent variable of a J-dimensional vector, and X is (X ═ X)i1,xi2,…,xij…,xiJ),xijThe value of the jth index of the ith borrower is represented, J represents a total of J dimensions (indexes), yiE {0,1 }. Setting yi1 indicates a default by the borrower, yi0 means that the borrower has no default, P (y)i=1|xi)=piIs represented by xiKnown condition yiProbability of 1, xiIndicating the ith borrower, piAs shown in formula (2)Where β is the coefficient vector.
Figure BDA0003222419910000091
The Logistic model can be obtained by transforming the equation (2), which is shown in equation (3).
Figure BDA0003222419910000092
Building Wald statistics, and performing significance test on regression coefficients of single indexes; let WjFor the Wald statistical value of the jth index,
Figure BDA0003222419910000093
is an estimate of the coefficient of the j-th index, SEβjIs a coefficient of betajSo the Wald statistic W of the j-th indexiAs shown in formula (4).
Figure BDA0003222419910000094
Because the Wald statistic test is carried out on a single index, the method obeys the chi with the degree of freedom of 12Is distributed by x2Distribution table shows that χ at 0.05 is satisfied at this time2Critical value is χ0.05 2(1) 3.841, i.e. when the Wald statistic of index i is greater than 3.841, the index is considered to have significant default discrimination ability.
And step 3: performing multiple collinearity inspection on discrete indexes and continuous indexes by using a Lasso-Logistic model, constructing a credit score index system with optimal overall default discrimination capability under two index types, setting weight constraints by respectively using regression coefficients of Lasso-Logistic as the basis of the index weight, determining an index weight reasonable interval, solving the optimal weights under the two index types by using a random multi-attribute acceptability analysis method SMAA, and calculating the credit score of a borrower under the two index types;
the reasonable interval is that the influence of the index regression coefficient on the data change is considered, so that the weight of part of indexes is too large or too small, the accuracy of credit scoring is interfered, and the weight is set for constraining
Figure BDA0003222419910000095
n is the index number to reduce the influence of the objective data change on the index weight. When the index weight is less than
Figure BDA0003222419910000096
Then, the weight of the index is adjusted to
Figure BDA0003222419910000097
When the index weight is greater than
Figure BDA0003222419910000098
Then, the weight of the index is adjusted to
Figure BDA0003222419910000099
The credit score is calculated as follows.
The Lasso-Logistic model is a technology for realizing variable selection by adding coefficient punishment to perform coefficient shrinkage on the basis of a least square method. In the Lasso-Logistic model, a set of independent and identically distributed observed data (X, Y) is assumed, wherein Y is a dependent variable, and Y ═ Y1,y2,…,yn) X is an argument of a p-dimensional vector, and X is (X)i1,xi2,…,xip) And n groups of data are provided, and each group of data comprises p independent variables and 1 dependent variable. When the dependent variable is a binary classification variable, the dependent variable is a repayment state of the borrower showing default or non-default conditions, and fitting is carried out by using a Lasso-Logistic regression model;
the parameter estimation of the Lasso-Logistic regression on the coefficient beta of n independent variables X is specifically shown in the formula (5).
Figure BDA0003222419910000101
Wherein, lambda is a parameter for adjusting the compression coefficient,
Figure BDA0003222419910000102
the degree of fit of the model is represented,
Figure BDA0003222419910000103
is a penalty to the variable coefficients in the model, since each observation data is independently and identically distributed, their joint distribution can be represented by the product of each marginal distribution, i.e. the likelihood function of n observations (borrowers) is represented by L (θ).
And calculating the credit score of the borrower in a linear addition mode by using the index score and the index weight in the feature combination. Suppose that
Figure BDA0003222419910000104
Weights (J1, 2,3, …, J), (K1, 2,3, …, K,), u, representing the kth simulation of the jth indexijIndicating the normalized value of the ith borrower under the index j, the credit score of the ith borrower in the k simulation
Figure BDA0003222419910000105
As shown in formula (6):
Figure BDA0003222419910000106
credit score mean value of ith borrower in K times of simulation
Figure BDA0003222419910000107
As shown in formula (7):
Figure BDA0003222419910000108
and 4, step 4: establishing a credit grade division optimization model, wherein the optimization model consists of 1 objective function and 2 constraint conditions, the maximum square sum of default loss rate differences between adjacent credit grades is taken as the objective function, the higher the credit grade is, the lower the default loss rate is taken as the constraint condition 1, the default loss rate difference value of the adjacent grades is taken as the constraint condition 2, and the range from a to b times that the latter default loss rate difference value is the former difference value is taken as the constraint condition 2, so as to determine the credit grade of the borrower under the two index types.
Step 4.1: establishing an objective function f;
divide the borrower into L credit classes, LGDlThe Default Loss rate Loss of the ith credit rating Loss Given by Given defaults is Given by the objective function shown in equation (8).
obj:max f=(LGDl-LGDl-1)2+(LGDl-1-LGDl-2)2+...+(LGD2-LGD1)2 (8)
The scheme with the maximum default loss value is selected from all the credit grade division schemes by calculating the sum of the square sums of default loss rate difference values between every two adjacent credit grades, so that the maximum default loss division degree of each credit grade of the screened credit grade division result is ensured, borrowers with different credit conditions are well distinguished, reasonable and effective credit grade division is realized, and then threshold points among all grades are determined.
Step 4.2: establishment of constraint 1 and constraint 2
Constraint 1: the default loss rate of each credit rating increases as the credit rating decreases, as shown in equation (9):
s.t.:0<LGD1<LGD2<...<LGDl≤1 (9)
constraint 2: and the default loss rate difference value of the adjacent grade is constrained, and the later default loss rate difference value is a-b times of the former difference value. In order to avoid the situation that the change of the default loss rate is too sensitive, and because the objective function is maximum, the blind pursuit of the default loss rate difference value between grades easily causes unreasonable division results, and through setting a reasonable interval range, the degree of distinction of the division results is guaranteed, and the rationality of the division results is also guaranteed. In this embodiment, the adjacent-level default loss rate difference interval range is set to [1,1.2 ].
Let Δ LGDl,l+1For the ith credit level LGDlAnd the l +1 st credit level LGDl+1The default loss rate difference (d), i.e., the difference between adjacent levels, is represented by equation (10).
ΔLGDl,l+1=LGDl+1-LGDl (10)
The default loss rate difference between adjacent levels is set as shown in equation (11).
ΔLGDl,l+1=[a,b]×ΔLGDl-1,l (11)
Wherein the default loss rate LGD of the first credit ratinglAs shown in formula (12):
Figure BDA0003222419910000111
wherein, glThe number of borrower samples representing the first credit level; glTotal number of borrower samples representing first 1 credit levels, Gl=g1+g2+…+gl-1;LliRepresenting a Loss of default (Loss) amount for the ith borrower in the ith credit rating; rliIndicating the receivable sum (Receivables) of the ith borrower in the ith credit rating.
The model may output a credit score threshold point, default loss amount, receivables interest, number of people, and default loss rate for each level.
And 5: respectively determining T credit scores of the borrowers based on an SMAA method under the two index types, and further comparing the T credit scores with threshold points of the credit grades to determine the probability that each borrower belongs to different credit grades;
taking the average value of the T-time scores as the credit score of the borrower, recording the credit score as the average value of the credit score, and determining the credit grade of the borrower according to the average value of the credit score; simulating the credit scores of T times, comparing the credit scores of T times with the threshold points of the credit grades, and determining the probability that each borrower belongs to different credit grades;
for example, if the number of times the 1 st borrower is classified into the first, second, third, fourth, fifth, sixth, and seventh levels is 0, 817, 183, 0, and 0, respectively, and the ratio obtained by dividing the number of times classified into each level by 1000 is 0, 0.817, 0.183, 0, and 0, respectively, the probabilities that the borrower belongs to the first, second, third, fourth, fifth, sixth, and seventh levels are 0, 0.817, 0.183, 0, and 0, respectively, by simulating 1000 credit scores. The probabilities of the borrower belonging to each level under the discrete index and the continuous index are respectively put into the 2 nd column to the 15 th column in the table 2, and the 1 st column in the table 2 is the serial number of the borrower.
TABLE 2 probability of borrower belonging to different credit classes under two index types
Figure BDA0003222419910000112
Figure BDA0003222419910000121
Step 6: combining the two index types in different proportions, integrating the credit levels of the borrowers under the two index types by using an evidence theory ER combination rule, and comparing the default loss discrimination f under the different combination proportions by calculating, namely the target function f in the step 4.1, obtaining the optimal combination proportion of the discrete index and the continuous index as the final combination proportion of the two index types in the SMAA-DS model according to the maximum discrimination f, and inputting the proportion into the evidence theory ER combination rule to determine the credit level information of the borrowers;
w corresponding to the maximum default loss discrimination f of 0.00861=0.54,w2Example 0.46 illustrates the calculation process of the ER rule in table 2, 1 st borrower credit rating in the SMAA-DS model, in which HlThe values are respectively A, B, C, D, E, F and G grades, and H is a set of A, B, C, D, E, F and G grades.
Step D1: a mass function m1, m2 is constructed, and BPA values of a discrete index type (abbreviated as 1) and a continuous index type (abbreviated as 2) are obtained
The first row of Table 2 is numbered with w1=0.54,w2Substituting 0.46 into equation 13 yields:
m1({A})=w11({A})=0.54*0=0
m1({B})=w11({B})=0.54*0=0
m1({C})=w11({C})=0.54*0.8170=0.4412
in the same way, m1({D})=0.0988,m1({E})=0,m1({F})=0,m1({G})=0。
m2({A})=w22({A})=0.46*0=0
m2({B})=w22({B})=0.46*0=0
m2({C})=w22({C})=0.46*0=0
In the same way, m2({D})=0,m2({E})=0.0069,m2({F})=0.4531,m2({G})=0。
Will w1=0.54,w2Substituting 0.46 into equation 14 yields:
Figure BDA0003222419910000122
Figure BDA0003222419910000123
will w1=0.54,w2Substituting the figure in 0.46 and the first row of table 2 into equation 15 yields:
Figure BDA0003222419910000124
Figure BDA0003222419910000131
will be provided with
Figure BDA0003222419910000132
And
Figure BDA0003222419910000133
substituting equation 16 yields:
Figure BDA0003222419910000134
Figure BDA0003222419910000135
step D2: synthesizing by using an evidence theory ER rule, and calculating BPA values of the synthesized evidence of a discrete index type (abbreviated as 1) and a continuous index type (abbreviated as 2):
substituting formula 17 to calculate the collision coefficient K
Figure BDA0003222419910000136
(1) Calculating the BPA value after the credit grades A, B, C, D, E, F and G are combined
By substituting into equation 18, the result is obtained
Figure BDA0003222419910000137
By substituting into equation 19, it is possible to obtain
Figure BDA0003222419910000138
By substituting equation 20
Figure BDA0003222419910000139
(2) Calculating BPA value after each credit grade combination
The combined BPA value for credit rating A can be obtained by substituting equation 20
Figure BDA0003222419910000141
The combined BPA value for credit rating B can be obtained by substituting equation 20
Figure BDA0003222419910000142
In the same way, m12({C})=0.3170,m12({D})=0.0710,m12({E})=0.0042,m12({F})=0.2773,m12({G})=0。
Step D3: the BPA value of the combined evidence is converted into confidence, namely the probability of the borrower belonging to each credit level in the SMAA-DS model
By substituting into equation 21, the result is obtained
Figure BDA0003222419910000143
The probability of the borrower belonging to the credit rating A in the SMAA-DS model can be obtained by substituting the formula 22
Figure BDA0003222419910000144
Probability of borrower belonging to credit class B in SMAA-DS model
Figure BDA0003222419910000145
Probability of borrower belonging to credit class C in SMAA-DS model
Figure BDA0003222419910000146
Probability of borrower belonging to credit level D in SMAA-DS model
Figure BDA0003222419910000147
For the same reason of beta12({E})=0.0063,β12({F})=0.4142,β12({G})=0。
And putting the probability of the borrower in the SMAA-DS model into the 2 nd row in the table 3, and selecting the grade with the highest probability as the credit grade of the borrower in the SMAA-DS model. The probability that the 1 st borrower belongs to level C is the greatest, and the credit rating of the borrower in the SMAA-DS model is C. Similarly, the probability distribution of the borrowers from lines 2-16574 belonging to the credit classes in the SMAA-DS model can be calculated and placed in lines 3-16575 of Table 3. When the ratio w in the formula (13)1Traversal procedure from 0 to 1 (w)2=1-w1) In, w1=0.54,w2The maximum default loss distinguishing degree f is 0.0086 when the default loss distinguishing degree f is 0.46, and borrowers with different credit levels can be distinguished to the maximum degree. Thus w1=0.54,w20.46 is the optimal combination ratio of the two index types in the SMAA-DS model.
TABLE 3 probability distribution of borrowers belonging to credit classes in SMAA-DS
Figure BDA0003222419910000148
Figure BDA0003222419910000151
And counting the data in the last column of the table 3 to obtain the number of people in each grade in the SMAA-DS model, substituting the data in the last column of the table 3 into a formula (12) to calculate the loss rate of default and the number of people in each grade of the SMAA-DS model, and determining the credit score threshold point of each grade according to the number of people in each grade.
And 7: the user logs in at the user login module, and then fills in and modifies personal information at the user data management module. And inputting the data input by the borrower in the user data management module into the user credit rating module, and clicking the user credit rating module by the user to display the credit rating.
The credit rating classification result pair in this embodiment is shown in Table 4, and as can be seen from FIG. 3, w is in the SMAA-DS model1=0.54,w2The credit rating using the ER method of evidence theory is optimal at 0.46.
TABLE 4 comparison of Credit rating results
Figure BDA0003222419910000152
Figure BDA0003222419910000161
FIG. 1 compares the results of 4 model credit rankings
Meanwhile, the SMAA-DS model has interpretability and can guide loan practices of financial institutions such as actual banks and the like. The SMAA-DS model has the advantages of 6 aspects:
1) the default loss discrimination f of the SMAA-DS model is the largest, and borrowers with different default possibilities can be furthest distinguished. The essence of credit rating classification is that the borrowers with different default possibilities can be differentiated to the greatest extent, and the larger the difference f of default loss rates between adjacent levels, the larger the difference of credit levels of the borrowers with the adjacent levels, the smaller the difference of credit levels of the borrowers with the same credit level. The default loss discrimination f of the SMAA-DS model is 0.0086, the default loss discrimination f corresponding to the binding club 43471 samples and 16574 samples, the indistinguishable index type, the discrete index and the continuous index is 0.0028, 0.0022, 0.0065, 0.0067 and 0.0032 respectively. The default loss discrimination f of the SMAA-DS model is 132.31% of the non-discrimination index type model f. Obviously, the SMAA-DS model can best distinguish borrowers with different defaults.
2) The SMAA-DS model has the lowest default loss rate of high credit rating and can retain high-quality clients with high credit rating to the maximum extent. In a more developed market of the financial industry in the united states, borrowers with higher credit ratings often have a plurality of loan sources, so that default loss rate of customers with high credit ratings is lowest, corresponding loan interest rate is also lowest, and high-quality customers with high credit ratings can be better reserved. The default loss rate of the A grade of the SMAA-DS model is 0.68 percent, and the default loss rates of the A grades corresponding to the leaving club 43471 samples and 16574 samples, the indistinguishable index types, the discrete index and the continuous index are 3.41 percent, 2.91 percent, 1.25 percent, 0.75 percent and 0.62 percent respectively. The default loss rate of the A grade of the SMAA-DS model is reduced by 45.60 percent relative to the default loss rate of the A grade of the index type which is not distinguished. Obviously, the default loss rate of the high level in the SMAA-DS model is the minimum, and the high-quality client can be reserved most.
3) The SMAA-DS model has the highest default loss rate of low level, and can reduce the default loss expected by the platform to the maximum extent. For borrowers with lower credit ratings, the primary purpose of the P2P platform is to reduce default losses, so the lowest rated borrower will be identified and given higher pricing for the loan, making up for the default losses of the P2P platform to the greatest extent possible. The loss rate of the SMAA-DS model in the G grade is 21.69%, and the loss rates of the G grade default corresponding to the drawing club 43471 samples and 16574 samples, the nondifferential index type, the discrete index and the continuous index are 8.87%, 7.27%, 20.19%, 20.53% and 13.93%, respectively. Compared with the default loss rate of the G grade without distinguishing the index type, the default loss rate of the G grade of the SMAA-DS model is improved by 7.43 percent. Obviously, the default loss rate of the lower level in the SMAA-DS model is the minimum, and the default loss can be reduced to the minimum.
4) And the minimum interval of default loss rate between adjacent levels of the SMAA-DS model is ranked second from large to small, and is only second to the discrete index. The minimum gap loss rate between adjacent levels of the SMAA-DS model is 2.41%, and the minimum gap loss rate between adjacent levels corresponding to the binding club 43471 samples and 16574 samples, the indistinguishable index types, the discrete index and the continuous index is 0.81%, 0.27%, 1.97%, 2.44% and 1.50%, respectively. Compared with the minimum interval of the default loss rate without distinguishing the index types, the SMAA-DS model is improved by 22.34 percent. Obviously, the SMAA-DS model ensures that the credit level difference of the borrowers with the same credit level is small, and the credit level difference of the borrowers among the levels is large.
5) The default loss rate difference between the A grade and the G grade in the SMAA-DS model is the largest, and the credit levels of all borrowers can be differentiated to the largest extent. The default loss rate difference between the A grade and the G grade of the SMAA-DS model is 21.01 percent, and the default loss rate minimum intervals between adjacent grades corresponding to the binding club 43471 samples and 16574 samples, the indistinguishable index types, the discrete index and the continuous index are 5.46 percent, 4.36 percent, 18.94 percent, 19.78 percent and 13.31 percent respectively. Compared with the gap of default loss rate of the index type A and the index type G which are not distinguished, the SMAA-DS model is improved by 22.34 percent. Obviously, the gap between the default loss rates of the A grade and the G grade in the SMAA-DS model is the largest.
6) The SMAA-DS model is subject to a credit rating division criterion that the higher the credit rating, the lower the loss rate of default. As can be seen from table 11, the credit rating classification results of the SMAA-DS model and the nondifferential index types, the discrete index, and the continuous index comply with the credit rating classification standard that the higher the credit rating, the lower the default loss rate; however, the binding club 43471 samples and 16574 samples partition results that do not comply with the above criteria. Therefore, the SMAA-DS model has better credit rating result.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; such modifications and substitutions do not depart from the spirit of the corresponding technical solutions and scope of the present invention as defined in the appended claims.

Claims (7)

1. A system for rating SMAA-DS based credit ratings to match default loss rates, comprising: the system comprises a user login registration module, a user data management module and a user credit rating module;
the user login registration module is used for registering personal information of a borrower through a user page, wherein the personal information comprises a mobile phone number and a name, and the borrower enters a login page to log in an account after the registration is successful;
the user data management module clicks a borrowing button to enter a borrowing detailed information page after the borrower logs in, and index data of the borrower are added and modified on the borrowing detailed information page;
and the user credit rating module clicks a credit rating button to enter a credit rating information page after the borrower logs in, performs credit rating and displays the credit rating page.
2. A credit rating adjusting method based on matching of credit rating and default loss rate of SMAA-DS is realized based on the rating system based on matching of credit rating and default loss rate of SMAA-DS, and is characterized by comprising the following steps:
step 1: preprocessing all borrower historical data of credit loan businesses accumulated by a financial institution;
step 2: constructing a binary Logistic regression model between a single qualitative index and a default state, screening the single index with default discrimination capability by using Wald statistic, and then dividing the reserved single index into a discrete index and a continuous index according to the index property;
and step 3: performing multiple collinearity inspection on discrete indexes and continuous indexes by using a Lasso-Logistic model, constructing a credit score index system with optimal overall default discrimination capability under two index types, setting weight constraints by respectively using regression coefficients of Lasso-Logistic as the basis of the index weight, solving the optimal weights under the two index types by using a random multi-attribute acceptability analysis method SMAA, and calculating credit scores of borrowers under the two index types;
and 4, step 4: establishing a credit grade division optimization model, wherein the optimization model consists of 1 objective function and 2 constraint conditions, the maximum square sum of default loss rate differences between adjacent credit grades is taken as the objective function, the higher the credit grade is, the lower the default loss rate is taken as the constraint condition 1, the default loss rate difference values of the adjacent grades are taken as the constraint condition 2, and the range from a to b times that the latter default loss rate difference value is the former difference value is taken as the constraint condition 2, so as to determine the credit grade of the borrower under the two index types;
and 5: respectively determining T credit scores of the borrowers based on an SMAA method under the two index types, and further comparing the T credit scores with threshold points of the credit grades to determine the probability that each borrower belongs to different credit grades;
taking the average value of the T-time scores as the credit score of the borrower, recording the credit score as the average value of the credit score, and determining the credit grade of the borrower according to the average value of the credit score; simulating the credit scores of T times, comparing the credit scores of T times with the threshold points of the credit grades, and determining the probability that each borrower belongs to different credit grades;
step 6: combining the two index types in different proportions, integrating the credit levels of the borrowers under the two index types by using an evidence theory ER combination rule, calculating and comparing the size of default loss discrimination f under different combination proportions, namely an objective function f, obtaining the optimal combination proportion of a discrete index and a continuous index by taking the maximum discrimination f as the basis, taking the optimal combination proportion as the final combination proportion of the two index types in an SMAA-DS model, and inputting the proportion into an evidence theory ER combination rule to determine the credit level information of the borrowers;
and 7: the user logs in the user login module, then fills in and modifies personal information in the user data management module, data input by the borrower in the user data management module is input into the user credit rating module, and the user clicks the user credit rating module to display the credit rating.
3. The SMAA-DS based credit rating matching default loss rate adjustment method as claimed in claim 2, wherein the step 1 comprises:
step 1.1: carrying out standardization treatment on the quantitative index value by adopting a maximum and minimum standardization method;
step 1.2: matching the scoring result of the qualitative index value with the default state according to the principle that the larger the scoring result corresponding to the qualitative index value is, the lower the default probability is, and counting the corresponding relation between the qualitative index value and the default probability by using a data perspective tool of Excel, wherein the default number corresponding to the same qualitative index value of the default probability is divided by the total number;
step 1.3: the qualitative index value is scored using the efficacy coefficient method, as shown in the following formula:
let xijRepresenting the membership value of the jth index of the ith borrower; vijThe value of the jth index of the ith borrower is represented, and the efficiency coefficient method formula (1):
Figure FDA0003222419900000021
wherein:
Figure FDA0003222419900000022
and
Figure FDA0003222419900000023
are the satisfied and disallowed values for the jth index, and c, d are constants.
4. The SMAA-DS-based credit rating adjustment method for matching credit rating with default loss rate as claimed in claim 2, wherein in the binary Logistic regression model in step 2, a set of independent and identically distributed observation data (X, Y) is assumed, wherein Y is a dependent variable, and Y is (Y-Y)1,y2,…,yn),ynThe default state of the nth borrower is shown, n represents a total of n observation data, namely the borrower, X is an independent variable of a J-dimensional vector, and X is (X ═ X)i1,xi2,…,xij…,xiJ),xijJ-th representing the ith borrowerThe value of each index, J represents a total of J dimensions (indexes), yiE {0,1}, and y is seti1 indicates a default by the borrower, yi0 means that the borrower has no default, P (y)i=1|xi)=piIs represented by xiKnown condition yiProbability of 1, xiIndicating the ith borrower, piAs shown in formula (2), where β is a coefficient vector;
Figure FDA0003222419900000024
the Logistic model can be obtained by transforming the pair of the formula (2), as shown in the formula (3):
Figure FDA0003222419900000025
building Wald statistics, and performing significance test on regression coefficients of single indexes; let WjFor the Wald statistical value of the jth index,
Figure FDA0003222419900000026
is an estimate of the coefficient of the j-th index, SEβjIs a coefficient of betajSo the Wald statistic W of the j-th indexiAs shown in formula (4):
Figure FDA0003222419900000027
5. the SMAA-DS-based credit rating adjustment method for matching the default loss rate according to claim 2, wherein in the Lasso-Logistic model in the step 3, a set of independent and identically distributed observation data (X, Y) is assumed, wherein Y is a dependent variable, and Y is (Y-Y)1,y2,…,yn) X is an argument of a p-dimensional vector, and X is (X)i1,xi2,…,xip) N groups of data are provided, and each group of data comprises p independent variables and 1 dependent variable; when the dependent variable is a binary classification variable, the dependent variable is a repayment state of the borrower showing default or non-default conditions, and fitting is carried out by using a Lasso-Logistic regression model;
the parameter estimation of the Lasso-Logistic regression on the coefficient beta of n independent variables X is specifically shown in the formula (5):
Figure FDA0003222419900000031
wherein, lambda is a parameter for adjusting the compression coefficient,
Figure FDA0003222419900000032
the degree of fit of the model is represented,
Figure FDA0003222419900000033
is punishment to variable coefficient in the model, because each observation data is independently and identically distributed, the joint distribution of the observation data can be represented by the product of each marginal distribution, namely the likelihood function of n observation values (borrowers) is represented by L (theta);
calculating credit score of the borrower in a linear addition mode by using the index score and the index weight in the feature combination; suppose wjK represents the weight of the kth simulation of the jth index (J ═ 1,2,3, …, J), (K ═ 1,2,3, …, K,), uijIndicating the normalized value of the ith borrower under the index j, the credit score of the ith borrower in the k simulation
Figure FDA0003222419900000034
As shown in formula (6):
Figure FDA0003222419900000035
credit score mean value of ith borrower in K times of simulation
Figure FDA0003222419900000036
As shown in formula (7):
Figure FDA0003222419900000037
6. the SMAA-DS-based credit rating adjustment method for matching credit rating with default loss rate as claimed in claim 2, wherein the step 4 comprises the following steps:
step 4.1: establishing an objective function f;
divide the borrower into L credit classes, LGDlFor the default loss rate of the ith credit rating, the objective function is shown as equation (8):
obj:maxf=(LGDl-LGDl-1)2+(LGDl-1-LGDl-2)2+...+(LGD2-LGD1)2 (8)
step 4.2: establishing a constraint condition 1 and a constraint condition 2;
constraint 1: the default loss rate of each credit rating increases as the credit rating decreases, as shown in equation (9):
s.t.:0<LGD1<LGD2<...<LGDl≤1 (9)
constraint 2: the default loss rate difference value of the adjacent levels is constrained, and the later default loss rate difference value is a-b times of the former difference value;
let Δ LGDl,l+1For the ith credit level LGDlAnd the l +1 st credit level LGDl+1The default loss rate difference, i.e. the difference between adjacent levels, is expressed as shown in equation (10):
ΔLGDl,l+1=LGDl+1-LGDl (10)
then, the default loss rate difference between adjacent levels is set as shown in equation (11):
ΔLGDl,l+1=[a,b]×ΔLGDl-1,l (11)
wherein the default loss rate LGD of the first credit ratinglAs shown in formula (12):
Figure FDA0003222419900000041
wherein, glThe number of borrower samples representing the first credit level; glTotal number of borrower samples representing first 1 credit levels, Gl=g1+g2+…+gl-1;LliRepresenting a Loss of default (Loss) amount for the ith borrower in the ith credit rating; rliIndicating the receivable sum (Receivables) of the ith borrower in the ith credit rating.
7. The SMAA-DS based credit rating matching default loss rate adjustment method as claimed in claim 2, wherein the evidence theory ER in step 6 is combined as follows:
step D1: calculating the BPA value of each evidence before the synthesis, wherein each evidence refers to the probability that the borrower belongs to all grades in the discrete index and the continuous index respectively
Let Θ be the recognition framework, 2 evidences m1、m2:2Θ→[0,1];
Figure FDA0003222419900000042
Figure FDA0003222419900000043
Figure FDA0003222419900000044
Figure FDA0003222419900000045
Wherein woIs a relative weight of evidence o, and
Figure FDA0003222419900000046
o is the number of evidences, βo(Hn) And betao(H) Respectively represent the focal unit
Figure FDA0003222419900000047
And
Figure FDA0003222419900000048
the degree of confidence of (a) is,
Figure FDA0003222419900000049
is the BPA value for evidence o, L represents the L-th level, for a total of L levels;
step D2: synthesizing the evidence by using an evidence theory ER combination rule, and solving BPA values of the synthesized evidence:
Figure FDA00032224199000000410
Figure FDA00032224199000000411
Figure FDA00032224199000000412
Figure FDA00032224199000000413
wherein m is12(Hl) A BPA value representing the first level to which the borrower is combined; k is evidence conflict coefficient, and the larger K is shownThe greater the conflict between the evidences; b and C represent that the borrower belongs to a certain grade under the discrete index type and the continuous index type respectively;
Figure FDA0003222419900000051
Figure FDA0003222419900000052
is a process parameter;
Figure FDA0003222419900000053
respectively representing the belief ratios to be distributed in discrete type, continuous type, discrete type and continuous type indexes; m is1(B),m2(C) BPA before combination for B and C, respectively;
step D3: and converting the BPA value of each synthesized evidence into a confidence Bel, namely that the borrower belongs to a credit level H in the SMAA-DS modellProbability of (c):
Figure FDA0003222419900000054
Figure FDA0003222419900000055
CN202110962078.1A 2021-08-20 2021-08-20 Rating system and method for matching credit rating with default loss rate based on SMAA-DS Active CN113610638B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110962078.1A CN113610638B (en) 2021-08-20 2021-08-20 Rating system and method for matching credit rating with default loss rate based on SMAA-DS

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110962078.1A CN113610638B (en) 2021-08-20 2021-08-20 Rating system and method for matching credit rating with default loss rate based on SMAA-DS

Publications (2)

Publication Number Publication Date
CN113610638A true CN113610638A (en) 2021-11-05
CN113610638B CN113610638B (en) 2023-11-21

Family

ID=78309039

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110962078.1A Active CN113610638B (en) 2021-08-20 2021-08-20 Rating system and method for matching credit rating with default loss rate based on SMAA-DS

Country Status (1)

Country Link
CN (1) CN113610638B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102779317A (en) * 2012-06-18 2012-11-14 大连理工大学 System and method used for credit rating and based on matching of credit rate and loss given default
CN103124105A (en) * 2012-03-27 2013-05-29 湖南大学 Wireless intelligent sensor network system for monitoring states of intelligent substation devices
CN105426624A (en) * 2015-12-04 2016-03-23 浙江工业大学 Building windowing action simulation method based on evidence reasoning iterative algorithm and entropy weight
CN108492877A (en) * 2018-03-26 2018-09-04 西安电子科技大学 A kind of cardiovascular disease auxiliary prediction technique based on DS evidence theories
CN109724617A (en) * 2017-10-31 2019-05-07 腾讯科技(深圳)有限公司 A kind of method for drafting and relevant device of navigation routine
CN109886473A (en) * 2019-01-24 2019-06-14 河海大学 A kind of basin scene water system Multiobjective Optimal Operation method considering downstream ecology

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103124105A (en) * 2012-03-27 2013-05-29 湖南大学 Wireless intelligent sensor network system for monitoring states of intelligent substation devices
CN102779317A (en) * 2012-06-18 2012-11-14 大连理工大学 System and method used for credit rating and based on matching of credit rate and loss given default
CN105426624A (en) * 2015-12-04 2016-03-23 浙江工业大学 Building windowing action simulation method based on evidence reasoning iterative algorithm and entropy weight
CN109724617A (en) * 2017-10-31 2019-05-07 腾讯科技(深圳)有限公司 A kind of method for drafting and relevant device of navigation routine
CN108492877A (en) * 2018-03-26 2018-09-04 西安电子科技大学 A kind of cardiovascular disease auxiliary prediction technique based on DS evidence theories
CN109886473A (en) * 2019-01-24 2019-06-14 河海大学 A kind of basin scene water system Multiobjective Optimal Operation method considering downstream ecology

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GANGSHU CAI 等: "The Roles of Bank and Trade Credits: Theoretical Analysis and Empirical Evidence", 《 PRODUCTION AND OPERATIONS MANAGEMENT》, vol. 23, no. 4, pages 583 - 598 *
杨星亚: "基于证据理论的中小企业信用评级研究", 《中国优秀硕士学位论文全文数据库》, no. 6, pages 152 - 1696 *

Also Published As

Publication number Publication date
CN113610638B (en) 2023-11-21

Similar Documents

Publication Publication Date Title
Chen et al. A fuzzy credit-rating approach for commercial loans: a Taiwan case
Abdou et al. Neural nets versus conventional techniques in credit scoring in Egyptian banking
US8515862B2 (en) Computer-implemented systems and methods for integrated model validation for compliance and credit risk
JP2003535387A (en) Rapid evaluation of asset portfolios such as financial products
JP2003526146A (en) Method and system for reducing risk by obtaining evaluation values
CN111401600A (en) Enterprise credit risk evaluation method and system based on incidence relation
Blanco Oliver et al. Improving bankruptcy prediction in micro-entities by using nonlinear effects and non-financial variables
Wu et al. Group decision-making using improved multi-criteria decision making methods for credit risk analysis
Nath Country risk analysis: A survey of the quantitative methods
CN113344692B (en) Method for establishing network loan credit risk assessment model with multi-information-source fusion
Yuan Research on credit risk assessment of P2P network platform: based on the logistic regression model of evidence weight
Costea et al. A new fuzzy clustering algorithm for evaluating the performance of non-banking financial institutions in Romania
Ryan et al. FinTech isn’t so different from traditional banking: trading off aggregation of soft information for transaction processing efficiency
CN113610638A (en) SMAA-DS-based rating system and method for matching credit rating with default loss rate
Adewusi et al. Residential Tenants Classification: A Test of Performance of Five Selected Artificial Neural Networks training Algorithms
Tan et al. Investment recommendation with total capital value maximization in online P2P lending
Bahabadi et al. Customer ranking based on credit risk using MCDM methods
Ekong et al. Development Of Credit Scoring Model For Borrowers Using Machine Learning Techniques
Wang Data-Driven Investment Decisions in P2P Lending: Strategies of Integrating Credit Scoring and Profit Scoring
Cuesta et al. Bank Technology Adoption and Loan Production in the US Mortgage Market
SULTANA An Evaluation of Automated Credit Scoring System for Financial Services in Developing Countries
Şakar Variable Importance Analysis in Default Prediction using Machine Learning Techniques
Tajik et al. Presenting the smart pattern of credit risk of the real banks’ customers using machine learning algorithm.
Yaghini et al. A Prediction Model for Recognition of Bad Credit Customers in Saman Bank Using Neural Networks
Nazari et al. Using the Hybrid Model for Credit Scoring (Case Study: Credit Clients of microloans, Bank Refah-Kargeran of Zanjan, Iran)

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant