CN111242191A - Credit rating method and device based on multi-classifier integration - Google Patents
Credit rating method and device based on multi-classifier integration Download PDFInfo
- Publication number
- CN111242191A CN111242191A CN202010009078.5A CN202010009078A CN111242191A CN 111242191 A CN111242191 A CN 111242191A CN 202010009078 A CN202010009078 A CN 202010009078A CN 111242191 A CN111242191 A CN 111242191A
- Authority
- CN
- China
- Prior art keywords
- credit rating
- classifier model
- historical
- classifier
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 53
- 230000010354 integration Effects 0.000 title claims abstract description 36
- 238000003066 decision tree Methods 0.000 claims abstract description 55
- 238000007637 random forest analysis Methods 0.000 claims abstract description 45
- 238000012549 training Methods 0.000 claims abstract description 43
- 230000015654 memory Effects 0.000 claims description 19
- 238000004422 calculation algorithm Methods 0.000 claims description 18
- 238000004590 computer program Methods 0.000 claims description 12
- 238000007635 classification algorithm Methods 0.000 claims description 7
- 230000006870 function Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 6
- 238000004364 calculation method Methods 0.000 description 4
- 238000003909 pattern recognition Methods 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 230000011218 segmentation Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000004891 communication Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000002996 emotional effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000007306 turnover Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/217—Validation; Performance evaluation; Active pattern learning techniques
- G06F18/2193—Validation; Performance evaluation; Active pattern learning techniques based on specific statistical tests
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
- G06F18/24155—Bayesian classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/243—Classification techniques relating to the number of classes
- G06F18/24323—Tree-organised classifiers
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/06—Asset management; Financial planning or analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Bioinformatics & Computational Biology (AREA)
- General Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Development Economics (AREA)
- Finance (AREA)
- Probability & Statistics with Applications (AREA)
- Accounting & Taxation (AREA)
- Operations Research (AREA)
- Game Theory and Decision Science (AREA)
- Human Resources & Organizations (AREA)
- Entrepreneurship & Innovation (AREA)
- Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Technology Law (AREA)
- General Business, Economics & Management (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
Abstract
The invention discloses a credit rating method and a device based on multi-classifier integration, wherein the method comprises the following steps: acquiring financial characteristic data of a bond issuer; respectively inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data; and determining the credit rating of the bond issuer according to the credit rating result output by each classifier model. The credit rating method based on multi-classifier integration improves the stability and accuracy of credit rating.
Description
Technical Field
The invention relates to the technical field of credit service, in particular to a credit rating method and device based on multi-classifier integration.
Background
In recent years, debt violations have gradually assumed an increasing situation. Bond default is bound to cause economic damage and emotional attack to bond investment workers, and bond issuers who present credit risks are related to various large enterprises such as central enterprises, national enterprises and famous enterprises at present. Therefore, there is a need for a method for determining the credit risk level of a bond issuer using public information on the market, which can protect the interests of investors and avoid property damage.
Disclosure of Invention
The invention provides a credit rating method and device based on multi-classifier integration in order to solve at least one technical problem in the background art.
In order to achieve the above object, according to one aspect of the present invention, there is provided a credit rating method based on multi-classifier integration, the method including:
acquiring financial characteristic data of a bond issuer;
respectively inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data;
and determining the credit rating of the bond issuer according to the credit rating result output by each classifier model.
Optionally, the financial characteristic data includes: an operating revenue growth rate of four quarters, a net profit growth rate of four quarters after the unusual profit is deducted, an operating revenue to fixed asset ratio, and a flowing asset to flowing liability ratio.
Optionally, the method further includes: acquiring historical financial characteristic data and historical credit rating data; training the naive Bayes classifier model according to historical financial feature data, historical credit rating data and a preset naive Bayes classification algorithm, and performing credit rating on the bond issuer according to the trained naive Bayes classifier model.
Optionally, the method further includes: acquiring historical financial characteristic data and historical credit rating data; training the gradient lifting decision tree classifier model according to historical financial feature data, historical credit rating data and a preset gradient lifting decision tree algorithm, and performing credit rating on the bond issuer according to the trained gradient lifting decision tree classifier model.
Optionally, the method further includes: acquiring historical financial characteristic data and historical credit rating data; training the random forest classifier model according to historical financial characteristic data, historical credit rating data and a preset random forest algorithm, and performing credit rating on a bond issuer according to the trained random forest classifier model.
In order to achieve the above object, according to another aspect of the present invention, there is provided a credit rating apparatus based on multi-classifier integration, the apparatus including:
the financial characteristic data determining unit is used for acquiring financial characteristic data of the bond issuer;
the multi-classifier credit rating unit is used for respectively inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data;
and the credit rating determination unit of the bond issuer is used for determining the credit rating of the bond issuer according to the credit rating result output by each classifier model.
Optionally, the financial characteristic data includes: an operating revenue growth rate of four quarters, a net profit growth rate of four quarters after the unusual profit is deducted, an operating revenue to fixed asset ratio, and a flowing asset to flowing liability ratio.
Optionally, the apparatus further comprises: the naive Bayes classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the naive Bayes classifier model according to historical financial characteristic data, historical credit rating data and a preset naive Bayes classification algorithm, and performing credit rating on the bond issuer according to the trained naive Bayes classifier model.
Optionally, the apparatus further comprises: the gradient lifting decision tree classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the gradient lifting decision tree classifier model according to historical financial characteristic data, historical credit rating data and a preset gradient lifting decision tree algorithm, so as to perform credit rating on the bond issuer according to the trained gradient lifting decision tree classifier model.
Optionally, the apparatus further comprises: the random forest classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the random forest classifier model according to historical financial characteristic data, historical credit rating data and a preset random forest algorithm, and performing credit rating on the bond issuer according to the trained random forest classifier model.
To achieve the above object, according to another aspect of the present invention, there is also provided a computer device including a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the above credit rating method based on multi-classifier integration when executing the computer program.
In order to achieve the above object, according to another aspect of the present invention, there is also provided a computer-readable storage medium storing a computer program which, when executed in a computer processor, implements the steps in the above-described credit rating method based on multi-classifier integration.
The invention has the beneficial effects that: according to financial characteristic data of bond issuers, a plurality of classifier methods including Naive Bayes (NB), Gradient Boosting Decision Trees (GBDT) and Random Forests (RF) are adopted for pattern recognition, and a credit rating result under each classifier method is obtained. And finally, a final credit rating result is obtained through a preset rule, so that the condition that a certain classifier has larger deviation of the recognition result on data of local characteristics is avoided, and the stability and the accuracy of credit rating are improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts. In the drawings:
FIG. 1 is a flow chart of a credit rating method based on multi-classifier integration according to an embodiment of the present invention;
FIG. 2 is a block diagram of a credit rating device based on multi-classifier integration according to an embodiment of the present invention;
FIG. 3 is a schematic diagram of a computer apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
It should be noted that the terms "comprises" and "comprising," and any variations thereof, in the description and claims of the present invention and the above-described drawings, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
It should be noted that the embodiments and features of the embodiments may be combined with each other without conflict. The present invention will be described in detail below with reference to the embodiments with reference to the attached drawings.
Fig. 1 is a flowchart of a credit rating method based on multi-classifier integration according to an embodiment of the present invention, and as shown in fig. 1, the credit rating method based on multi-classifier integration according to the present embodiment includes steps S101 to S103.
And step S101, acquiring financial characteristic data of the bond issuer.
The bond issuer in the present invention may be a company, business, or other organization. In an alternative embodiment of the invention said financial characteristic data comprises: an operating revenue growth rate of four quarters, a net profit growth rate of four quarters after the unusual profit is deducted, an operating revenue to fixed asset ratio, and a flowing asset to flowing liability ratio.
In an alternative embodiment of the invention the financial characteristic data is determined in particular by the following steps. The financial data of a bond issuer is first acquired and preprocessed. Financial data may include profitability, leverage, capital structure, asset quality, debt ability, financing ability, extent of expansion, etc., with specific being total revenue, gross profit, net profit due, gross profit on sale, net profit on sale, etc. The preprocessing includes removing duplicated data and data with a large amount of missing data, and complementing missing values in the data. And further, performing feature extraction or calculation on the preprocessed financial data to obtain financial feature data.
The Revenue Growth rate for four quarters can be calculated by the following formula:
wherein, Total Revenuet-1Total revenues for the Total revenues of a business' recent quarter revenuest-5The total revenue that the business was exposed five quarters ago.
The net profit Growth rate after the four quarters minus the unusual profit loss can be calculated from the following formula:
wherein, NPADNGLtFor the net profit after the unusual profit is deducted from the company's last season on day t,the net profit after the unusual profit was deducted, which was revealed by the enterprise in the last fourth season of day t.
The revenue versus Fixed Asset ratio Fixed Asset Turnover can be calculated by the following formula:
wherein, Core Revenuet-iFixed Asset for revenue that stock has been revealed in the last ith quartert-iA fixed asset that was recently disclosed in the ith quarter for an enterprise.
The flowing asset to flowing liability Ratio Current Ratio can be calculated by the following formula:
wherein Current AssettCurrentLiability, a floating asset disclosed by the enterprise in the last quarter of day ttThe floating liability revealed for the business in the last season of day t.
In the embodiment of the invention, the financial characteristic data of the bond issuer in the step is the current financial characteristic data of the bond issuer. The invention uses historical financial characteristic data of a plurality of bond issuers in training classifier models.
And S102, respectively inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data.
In the embodiment of the invention, the corresponding classifier models are trained according to historical financial characteristic data of a plurality of bond issuers and corresponding historical credit rating data respectively according to a naive Bayes classification algorithm, a gradient boosting decision tree algorithm and a random forest algorithm. And then carrying out credit rating on the target bond issuer based on the current financial feature data of the target bond issuer according to the trained three classifier models, and respectively obtaining credit rating results output by the classifier models. In an alternative embodiment of the invention, the historical credit rating data of the bond issuer (enterprise) may be rating data made by domestic and foreign rating agencies.
And step S103, determining the credit rating of the bond issuer according to the credit rating result output by each classifier model.
In an embodiment of the present invention, the credit rating result output by each classifier model may be a credit rating level. In the embodiment of the invention, the grades output by the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are I in sequence1,I2,I3. The invention firstly converts the rating grade output by the classifier model into a corresponding rating score, namely:
S1=f1(I1)
S2=f2(I2)
S3=f3(I3)
S1、S2、S3and the grading scores are respectively corresponding to the grading results output by the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model. f. of1、f2、f3、f4Is a preset relation function.
In an optional embodiment of the present invention, the average calculation may be performed on the rating scores corresponding to the rating results output by the three classifier models, that is:
in other optional embodiments of the present invention, the final credit rating score S may be obtained by performing weighted average calculation on the rating scores corresponding to the rating results output by the three classifier models according to a preset weight ratio.
Taking S as a final credit rating score and converting the score into a credit rating grade, namely:
I=f4(S)
the credit rating level I is the final credit rating result of the bond issuer.
From the above description, it can be seen that the present invention adopts multiple classifier methods including Naive Bayes (NB), Gradient Boosting Decision Trees (GBDT), and Random Forests (RF) according to financial feature data of bond issuers to perform pattern recognition, and obtain credit rating results under each classifier method. And finally, a final credit rating result is obtained through a preset rule, so that the condition that a certain classifier has larger deviation of the recognition result on data of local characteristics is avoided, and the stability and the accuracy of credit rating are improved.
The specific training process of the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model of the invention is introduced below. In the embodiment of the invention, the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to the acquired historical financial feature data of a plurality of bond issuers and the corresponding historical credit rating data as training data. The financial characteristic data comprises: revenue growth rate for four quarters, net profit growth rate for four quarters with infrequent losses deducted, revenue to fixed asset ratio, and mobile asset to mobile liability ratio, etc. In the embodiment of the present invention, the present invention may use data of one year as a sample, for example, financial characteristic data of a corporation 2000 as a financial characteristic data sample, and a credit rating level of the corporation 2000 is credit rating data corresponding to the financial characteristic data sample. From this, we obtain a set of financial feature data samples,
And a set of credit rating data samples,
where, c-22, represents the following 22 credit rating levels,representing the credit rating level in i samples, i.e. when the ith sample credit rating is j, thenyiThe other data in (1) is 0.
In the embodiment of the present invention, the credit rating data is generally expressed by a credit rating scale, which may be defined as the following table 1:
TABLE 1
Further the invention may assemble the above-mentioned financial characteristic data samples,
and a set of credit rating data samples,
and c is 22, and the training data is used for training a corresponding classifier model by combining a naive Bayes classification algorithm, a gradient boosting decision tree algorithm and a random forest algorithm. The training process for the three models is explained below separately.
1. Naive Bayes (NB) classifier model
For the above-mentioned financial characteristic data sample set xiFurther processing is carried out, and the specific processing is as follows:
wherein, the [ alpha ], [ beta ]]In order to get the whole,in order to be of a known value,n is a known positive number.
And finally, the set of the processed financial characteristic data samples is as follows:
the Naive Bayes (NB) based decision criterion is:
where c denotes a particular credit rating level class, y denotes a set of all classes, P (c) denotes a prior probability that c belongs to a particular credit rating level, P (x)i' | c) denotes the sample x of c at a certain credit rating leveli' conditional probability, xi' means the ith financial characteristic data from 1 to d financial characteristic data in the data.
In order to improve the calculation efficiency, the invention simplifies the formula, performs logarithm taking, does not affect the result, changes multiplication into addition, and avoids underflow data overflow caused by multiplication, namely:
wherein, PcAnd P (x)i' | c) is obtained by training according to known training data, and finally, a trained naive Bayes classifier model is formed. And then inputting the financial characteristic data of the target bond issuer into the trained naive Bayes classifier model to obtain a credit rating result of the naive Bayes classifier model.
2. Gradient Boosting Decision Tree (GBDT) classifier model
X and Y are input and output variables, respectively, and Y is a continuous variable, given a training data set:
D={(x1,y1),(x2,y2),…,(xN,yN)}
wherein N is the number of samples of the data set.
In an input space where a training data set is located, recursively dividing each region into two sub-regions, determining an output value on each sub-region, and constructing a binary decision tree, specifically comprising the following steps:
1) selecting an optimal segmentation j and an optimal segmentation point s, and solving:
and traversing the variable j, scanning a segmentation point s for the fixed segmentation variable j, and selecting a pair (j, s) which enables the above formula to reach the minimum value.
2) Dividing the region by the selected pair (j, s) and determining the corresponding output value:
R1(j,s)={x|x(j)≤s},R2(j,s)={x|x(j)>s}
3) and continuing to call the steps 1) and 2) for the two sub-areas until a stop condition is met.
4) Dividing an input space into M regions R1,R2,…,RMAt each sheetR is a member ofmHas a fixed output value cmThe decision tree is generated as follows:
the Gradient Boosting Decision Tree (Gradient Boosting Decision Tree) adopts an additive model (namely linear combination of basis functions) and a forward distribution algorithm. The lifting method using decision tree as basis function is a lifting tree, which is a common decision by iterating multiple regression trees. The model of the lifting tree is:
wherein T (x; theta)m) Represents a decision tree, θmM is the number of trees as a parameter of the decision tree.
In the iteration of GBDT, the gradient descent approximation method is used, i.e. the negative gradient of the penalty function is used to approximate the value of the current model, i.e.,
as a residual approximation of the lifting tree algorithm in the regression problem. The current loss function uses the mean square error, i.e.:
the negative gradient is:
i.e. the residual error.
The specific algorithm steps are as follows:
1) initialization f0(x)=0
2) For M is 1,2 …, M
a) Calculating the residual error, namely:
rmi=yi-fm-1(xi),i=1,2,…,N
b) fitting residual rmiLearning a regression tree to obtain T (x; theta)m)
c) Updating the regression tree, namely:
fm(x)=fm-1(x)+T(x;θm)
3) and obtaining an output final gradient boosting decision tree classifier model, namely:
after the gradient boosting decision tree classifier model is obtained, the financial characteristic data of the target bond issuer can be input into the classifier model, and a credit rating result output by the gradient boosting decision tree classifier model is obtained.
3. Random Forest (RF) classifier model
Firstly, performing credit rating pattern recognition on the extracted features based on a decision tree, specifically adopting a C4.5 decision tree algorithm, wherein the gain rate is as follows:
wherein,
training the model through supervised data to obtain a parameter thetatreeAnd then calculating with the new characteristic data and the known parameters to obtain the final credit rating mode result, namely:
I=f(θtree,F3)
randomly selecting samples and features, establishing N decision trees, outputting a credit rating mode result by each decision tree, and determining a final credit rating mode through a voting mechanism to obtain a random forest classifier model.
As can be seen from the above description, the present invention achieves at least the following advantageous effects:
the invention provides a novel method for pattern recognition of bond issuer credit in a classifier integrating mode based on different characteristics of a plurality of classifiers, and the core of the method utilizes different characteristics of different classifiers, a specific feature extraction mode and a mode of finally determining credit rating by the integrated classifier. Therefore, the situation that the recognition result of a certain classifier has larger deviation on the data of local characteristics is avoided, and the stability and the accuracy of recognition are improved.
It should be noted that the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowcharts, in some cases, the steps illustrated or described may be performed in an order different than presented herein.
Based on the same inventive concept, the embodiment of the present invention further provides a credit rating apparatus based on multi-classifier integration, which can be used to implement the credit rating method based on multi-classifier integration described in the above embodiments, as described in the following embodiments. Since the principle of solving the problem of the credit rating apparatus based on multi-classifier integration is similar to that of the credit rating method based on multi-classifier integration, the embodiment of the credit rating apparatus based on multi-classifier integration may refer to the embodiment of the credit rating method based on multi-classifier integration, and repeated details are omitted. As used hereinafter, the term "unit" or "module" may be a combination of software and/or hardware that implements a predetermined function. Although the means described in the embodiments below are preferably implemented in software, an implementation in hardware, or a combination of software and hardware is also possible and contemplated.
Fig. 2 is a block diagram of a credit rating apparatus based on multi-classifier integration according to an embodiment of the present invention, and as shown in fig. 2, the credit rating apparatus based on multi-classifier integration according to an embodiment of the present invention includes: a financial characteristic data determination unit 1, a multi-classifier credit rating unit 2 and a bond issuer credit rating determination unit 3.
And the financial characteristic data determining unit 1 is used for acquiring the financial characteristic data of the bond issuer.
In an alternative embodiment of the invention, the financial characteristic data comprises: an operating revenue growth rate of four quarters, a net profit growth rate of four quarters after the unusual profit is deducted, an operating revenue to fixed asset ratio, and a flowing asset to flowing liability ratio.
And the multi-classifier credit rating unit 2 is used for inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model respectively to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data.
And the credit rating determination unit 3 is used for determining the credit rating of the bond issuer according to the credit rating result output by each classifier model.
In an alternative embodiment of the present invention, the inventive credit rating apparatus based on multi-classifier integration further comprises: the naive Bayes classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the naive Bayes classifier model according to historical financial characteristic data, historical credit rating data and a preset naive Bayes classification algorithm, and performing credit rating on the bond issuer according to the trained naive Bayes classifier model.
In an alternative embodiment of the present invention, the inventive credit rating apparatus based on multi-classifier integration further comprises: the gradient lifting decision tree classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the gradient lifting decision tree classifier model according to historical financial characteristic data, historical credit rating data and a preset gradient lifting decision tree algorithm, so as to perform credit rating on the bond issuer according to the trained gradient lifting decision tree classifier model.
In an alternative embodiment of the present invention, the inventive credit rating apparatus based on multi-classifier integration further comprises: the random forest classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the random forest classifier model according to historical financial characteristic data, historical credit rating data and a preset random forest algorithm, and performing credit rating on the bond issuer according to the trained random forest classifier model.
To achieve the above object, according to another aspect of the present application, there is also provided a computer apparatus. As shown in fig. 3, the computer device comprises a memory, a processor, a communication interface and a communication bus, wherein a computer program that can be run on the processor is stored in the memory, and the steps of the method of the above embodiment are realized when the processor executes the computer program.
The processor may be a Central Processing Unit (CPU). The Processor may also be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components, or a combination thereof.
The memory, which is a non-transitory computer readable storage medium, may be used to store non-transitory software programs, non-transitory computer executable programs, and units, such as the corresponding program units in the above-described method embodiments of the present invention. The processor executes various functional applications of the processor and the processing of the work data by executing the non-transitory software programs, instructions and modules stored in the memory, that is, the method in the above method embodiment is realized.
The memory may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created by the processor, and the like. Further, the memory may include high speed random access memory, and may also include non-transitory memory, such as at least one disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory optionally includes memory located remotely from the processor, and such remote memory may be coupled to the processor via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The one or more units are stored in the memory and when executed by the processor perform the method of the above embodiments.
The specific details of the computer device may be understood by referring to the corresponding related descriptions and effects in the above embodiments, and are not described herein again.
In order to achieve the above object, according to another aspect of the present application, there is also provided a computer-readable storage medium storing a computer program which, when executed in a computer processor, implements the steps in the above-mentioned credit rating method based on multi-classifier integration. It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by a computer program, which can be stored in a computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. The storage medium may be a magnetic Disk, an optical Disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a Flash Memory (Flash Memory), a Hard Disk (Hard Disk Drive, abbreviated as HDD) or a Solid State Drive (SSD), etc.; the storage medium may also comprise a combination of memories of the kind described above.
It will be apparent to those skilled in the art that the modules or steps of the present invention described above may be implemented by a general purpose computing device, they may be centralized on a single computing device or distributed across a network of multiple computing devices, and they may alternatively be implemented by program code executable by a computing device, such that they may be stored in a storage device and executed by a computing device, or fabricated separately as individual integrated circuit modules, or fabricated as a single integrated circuit module from multiple modules or steps. Thus, the present invention is not limited to any specific combination of hardware and software.
The above description is only a preferred embodiment of the present invention and is not intended to limit the present invention, and various modifications and changes may be made by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (12)
1. A credit rating method based on multi-classifier integration, comprising:
acquiring financial characteristic data of a bond issuer;
respectively inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data;
and determining the credit rating of the bond issuer according to the credit rating result output by each classifier model.
2. The multi-classifier integration-based credit rating method of claim 1, wherein the financial characteristic data comprises: an operating revenue growth rate of four quarters, a net profit growth rate of four quarters after the unusual profit is deducted, an operating revenue to fixed asset ratio, and a flowing asset to flowing liability ratio.
3. The multi-classifier integration-based credit rating method of claim 1 or 2, further comprising:
acquiring historical financial characteristic data and historical credit rating data;
training the naive Bayes classifier model according to historical financial feature data, historical credit rating data and a preset naive Bayes classification algorithm, and performing credit rating on the bond issuer according to the trained naive Bayes classifier model.
4. The multi-classifier integration-based credit rating method of claim 1 or 2, further comprising:
acquiring historical financial characteristic data and historical credit rating data;
training the gradient lifting decision tree classifier model according to historical financial feature data, historical credit rating data and a preset gradient lifting decision tree algorithm, and performing credit rating on the bond issuer according to the trained gradient lifting decision tree classifier model.
5. The multi-classifier integration-based credit rating method of claim 1 or 2, further comprising:
acquiring historical financial characteristic data and historical credit rating data;
training the random forest classifier model according to historical financial characteristic data, historical credit rating data and a preset random forest algorithm, and performing credit rating on a bond issuer according to the trained random forest classifier model.
6. A credit rating apparatus based on multi-classifier integration, comprising:
the financial characteristic data determining unit is used for acquiring financial characteristic data of the bond issuer;
the multi-classifier credit rating unit is used for respectively inputting the financial characteristic data into a trained naive Bayes classifier model, a gradient boosting decision tree classifier model and a random forest classifier model to obtain credit rating results output by the classifier models, wherein the naive Bayes classifier model, the gradient boosting decision tree classifier model and the random forest classifier model are obtained by training according to historical financial characteristic data and historical credit rating data;
and the credit rating determination unit of the bond issuer is used for determining the credit rating of the bond issuer according to the credit rating result output by each classifier model.
7. The multi-classifier integration-based credit rating device of claim 6, wherein the financial characteristic data comprises: an operating revenue growth rate of four quarters, a net profit growth rate of four quarters after the unusual profit is deducted, an operating revenue to fixed asset ratio, and a flowing asset to flowing liability ratio.
8. The multi-classifier integration-based credit rating device of claim 6 or 7, further comprising:
the naive Bayes classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the naive Bayes classifier model according to historical financial characteristic data, historical credit rating data and a preset naive Bayes classification algorithm, and performing credit rating on the bond issuer according to the trained naive Bayes classifier model.
9. The multi-classifier integration-based credit rating device of claim 6 or 7, further comprising:
the gradient lifting decision tree classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the gradient lifting decision tree classifier model according to historical financial characteristic data, historical credit rating data and a preset gradient lifting decision tree algorithm, so as to perform credit rating on the bond issuer according to the trained gradient lifting decision tree classifier model.
10. The multi-classifier integration-based credit rating device of claim 6 or 7, further comprising:
the random forest classifier model training unit is used for acquiring historical financial characteristic data and historical credit rating data; and training the random forest classifier model according to historical financial characteristic data, historical credit rating data and a preset random forest algorithm, and performing credit rating on the bond issuer according to the trained random forest classifier model.
11. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of any of claims 1 to 5 when executing the computer program.
12. A computer-readable storage medium, in which a computer program is stored which, when executed in a computer processor, implements the method of any one of claims 1 to 5.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010009078.5A CN111242191A (en) | 2020-01-06 | 2020-01-06 | Credit rating method and device based on multi-classifier integration |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010009078.5A CN111242191A (en) | 2020-01-06 | 2020-01-06 | Credit rating method and device based on multi-classifier integration |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111242191A true CN111242191A (en) | 2020-06-05 |
Family
ID=70872449
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010009078.5A Pending CN111242191A (en) | 2020-01-06 | 2020-01-06 | Credit rating method and device based on multi-classifier integration |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111242191A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111782807A (en) * | 2020-06-19 | 2020-10-16 | 西北工业大学 | Self-acceptance technology debt detection and classification method based on multi-method ensemble learning |
CN112529695A (en) * | 2020-12-23 | 2021-03-19 | 招联消费金融有限公司 | Credit risk determination method, credit risk determination device, computer equipment and storage medium |
CN112818899A (en) * | 2021-02-22 | 2021-05-18 | 招联消费金融有限公司 | Face image processing method and device, computer equipment and storage medium |
CN112819069A (en) * | 2021-01-29 | 2021-05-18 | 中国农业银行股份有限公司 | Event grading method and device |
CN114066652A (en) * | 2021-11-30 | 2022-02-18 | 中国建设银行股份有限公司 | Bond credit rating method based on multi-dimensional model and related equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106803204A (en) * | 2017-01-17 | 2017-06-06 | 上海聚赟大数据有限公司 | A kind of floater's promise breaking analysis method based on machine learning |
CN107248030A (en) * | 2017-05-26 | 2017-10-13 | 谢首鹏 | A kind of bond Risk Forecast Method and system based on machine learning algorithm |
CN107766418A (en) * | 2017-09-08 | 2018-03-06 | 广州汪汪信息技术有限公司 | A kind of credit estimation method based on Fusion Model, electronic equipment and storage medium |
CN110490726A (en) * | 2019-07-11 | 2019-11-22 | 林宏伟 | Credit debt risk management method, device, equipment and storage medium |
CN110503459A (en) * | 2019-07-19 | 2019-11-26 | 平安科技(深圳)有限公司 | User credit degree appraisal procedure, device and storage medium based on big data |
CN110580268A (en) * | 2019-08-05 | 2019-12-17 | 西北大学 | Credit scoring integrated classification system and method based on deep learning |
-
2020
- 2020-01-06 CN CN202010009078.5A patent/CN111242191A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106803204A (en) * | 2017-01-17 | 2017-06-06 | 上海聚赟大数据有限公司 | A kind of floater's promise breaking analysis method based on machine learning |
CN107248030A (en) * | 2017-05-26 | 2017-10-13 | 谢首鹏 | A kind of bond Risk Forecast Method and system based on machine learning algorithm |
CN107766418A (en) * | 2017-09-08 | 2018-03-06 | 广州汪汪信息技术有限公司 | A kind of credit estimation method based on Fusion Model, electronic equipment and storage medium |
CN110490726A (en) * | 2019-07-11 | 2019-11-22 | 林宏伟 | Credit debt risk management method, device, equipment and storage medium |
CN110503459A (en) * | 2019-07-19 | 2019-11-26 | 平安科技(深圳)有限公司 | User credit degree appraisal procedure, device and storage medium based on big data |
CN110580268A (en) * | 2019-08-05 | 2019-12-17 | 西北大学 | Credit scoring integrated classification system and method based on deep learning |
Non-Patent Citations (3)
Title |
---|
康琦 等: "《机器学习中的不平衡分类方法》", 31 October 2017, 同济大学出版社, pages: 59 - 61 * |
李艳秋: "基于集成学习的人脸识别研究", 《中国博士学位论文全文数据库 信息科技辑》, no. 02, 15 February 2019 (2019-02-15), pages 45 - 102 * |
王冲: "《现代信息检索技术基本原理教程》", 西安电子科技大学出版社, pages: 98 - 102 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111782807A (en) * | 2020-06-19 | 2020-10-16 | 西北工业大学 | Self-acceptance technology debt detection and classification method based on multi-method ensemble learning |
CN111782807B (en) * | 2020-06-19 | 2024-05-24 | 西北工业大学 | Self-bearing technology debt detection classification method based on multiparty integrated learning |
CN112529695A (en) * | 2020-12-23 | 2021-03-19 | 招联消费金融有限公司 | Credit risk determination method, credit risk determination device, computer equipment and storage medium |
CN112819069A (en) * | 2021-01-29 | 2021-05-18 | 中国农业银行股份有限公司 | Event grading method and device |
CN112818899A (en) * | 2021-02-22 | 2021-05-18 | 招联消费金融有限公司 | Face image processing method and device, computer equipment and storage medium |
CN114066652A (en) * | 2021-11-30 | 2022-02-18 | 中国建设银行股份有限公司 | Bond credit rating method based on multi-dimensional model and related equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111242191A (en) | Credit rating method and device based on multi-classifier integration | |
Xia et al. | Cost-sensitive boosted tree for loan evaluation in peer-to-peer lending | |
CN111967971B (en) | Bank customer data processing method and device | |
CN108665366A (en) | Determine method, terminal device and the computer readable storage medium of consumer's risk grade | |
CN108898476A (en) | A kind of loan customer credit-graded approach and device | |
CN113095927B (en) | Method and equipment for identifying suspected transactions of backwashing money | |
WO2019062006A1 (en) | Time selection admission method based on machine learning, device and terminal equipment therefor | |
Khodaee et al. | Forecasting turning points in stock price by applying a novel hybrid CNN-LSTM-ResNet model fed by 2D segmented images | |
KR102342321B1 (en) | Method and apparatus for deep learning-based stock screening and its use for advancing and automating portfolio management | |
Song | Stock trend prediction: Based on machine learning methods | |
CN110796485A (en) | Method and device for improving prediction precision of prediction model | |
CN111667307B (en) | Method and device for predicting financial product sales volume | |
CN112613997A (en) | Method and apparatus for forecasting combined investment of money fund | |
CN112434862B (en) | Method and device for predicting financial dilemma of marketing enterprises | |
CN111967973B (en) | Bank customer data processing method and device | |
Sutkatti et al. | Stock market forecasting techniques: A survey | |
CN108629381A (en) | Crowd's screening technique based on big data and terminal device | |
Ochotorena et al. | Robust stock trading using fuzzy decision trees | |
CN115186898A (en) | Bank account opening amount prediction method and device | |
CN114722941A (en) | Credit default identification method, apparatus, device and medium | |
CN111191688A (en) | User staging number management method and device and electronic equipment | |
Borovkova et al. | Deep learning prediction of the eurostoxx 50 with news sentiment | |
CN111340632A (en) | Resource control method and device | |
Wang | Forecasting Credit Card Defaults Using Light Gradient Boosting Machine with Dart Algorithm | |
Zhang et al. | Large-scale uncertainty estimation and its application in revenue forecast of SMEs |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20220908 Address after: 25 Financial Street, Xicheng District, Beijing 100033 Applicant after: CHINA CONSTRUCTION BANK Corp. Address before: 25 Financial Street, Xicheng District, Beijing 100033 Applicant before: CHINA CONSTRUCTION BANK Corp. Applicant before: Jianxin Financial Science and Technology Co.,Ltd. |