A kind of personal credit's methods of risk assessment and system
Technical field
The present invention relates to computer and internet financial field, more particularly to a kind of personal credit's methods of risk assessment and it is
System.
Background technology
With developing rapidly for internet finance, personal credit is increasingly becoming a kind of mainstream production in internet financial field
The problem of product, network loan causes, also gradually shows, due to the risk assessment inefficiency of personal credit, the mistake artificially assessed
Rate remains high, and result in many network money-lenders can not repay the loan in time, and lending mechanism can not be obtained with Company capital
Effectively turnover.
Personal credit's risk control method of mainstream is all based on artificial appraisal procedure at present, artificial to assess single paragraph person's
Repaying ability is simultaneously inefficiency using certain fixed assets as guarantee, the method shortcoming, and there are commented caused by human error
Estimate inconsistency and erroneous judgement risk.
The content of the invention
The present invention is directed to problems of the prior art, there is provided a kind of personal credit's methods of risk assessment and system,
Assessment efficiency and accuracy rate can be improved.
The technical solution that the present invention is proposed with regard to above-mentioned technical problem is as follows:
On the one hand, the present invention provides a kind of personal credit's methods of risk assessment, including:
Gather user's real information sample;
Generation user's training data sample is simulated according to the distribution character of user's real information sample;
Assessment models are trained according to user's training data sample;
Obtain the real information of user to be assessed;
The real information of the user to be assessed is inputted in the assessment models to training, obtains the wind of user to be assessed
Danger assessment information.
Further, it is described that generation user's training data sample is simulated according to the distribution character of user's real information sample
This, specifically includes:
Establish and train generation network model;
Gaussian noise, simulation generation and user's real information sample are inputted into the generation network model after training
The consistent user's training data sample of distribution character.
Further, it is described to establish and train generation network model, specifically include:
Establish confrontation network model;The confrontation network model includes generation network model and differentiates network model;
Network weight is generated according to the generation error transfer factor of the generation network model, to minimize object function;
Network weight is differentiated according to the differentiation error transfer factor of the differentiation network model, to maximize the object function;
By being adjusted to the generation network weight and the alternating for differentiating network weight, make the generation error and institute
State and differentiate that error reaches dynamic equilibrium, complete the generation network model and the training for differentiating network model.
Further, the real information by the user to be assessed is inputted in the assessment models to training, is obtained
The risk assessment information of user to be assessed, specifically includes:
The real information of the user to be assessed is inputted in the assessment models to training, it is defeated to obtain the assessment models
The risk factor gone out;
Obtain the corresponding credit accrediting amount of the risk factor and interest rate value, and by the risk factor and corresponding letter
Borrow the risk assessment information of the accrediting amount and interest rate value as user to be assessed.
Further, after the risk assessment information for obtaining user to be assessed, further include:
The deviation data in risk assessment information to having assessed user is modified;
Training is re-started to the generation network model using revised risk assessment information, so that after re -training
Generation network model simulation generate new user's training data sample;
Periodically the assessment models are trained using new user's training data sample, are kept at the assessment models
In effective status.
On the other hand, the present invention provides a kind of personal credit's risk evaluating system, including:
Acquisition module, for gathering user's real information sample;
Generation module is simulated, number is trained for simulating generation user according to the distribution character of user's real information sample
According to sample;
Training module, for being trained according to user's training data sample to assessment models;
Acquisition module, for obtaining the real information of user to be assessed;And
Evaluation module, for inputting the real information of the user to be assessed in the assessment models to training, obtains
The risk assessment information of user to be assessed.
Further, the simulation generation module specifically includes:
Unit is established, for establishing and training generation network model;And
Generation unit, for inputting Gaussian noise, simulation generation and the user into the generation network model after training
The consistent user's training data sample of the distribution character of real information sample.
Further, the model foundation unit specifically includes:
Model foundation subelement, network model is resisted for establishing;The confrontation network model includes generation network model
With differentiation network model;
First adjustment subelement, for generating network weight according to the generation error transfer factor of the generation network model, with
Minimize object function;
Second adjustment subelement, for differentiating network weight according to the differentiation error transfer factor of the differentiation network model, with
Maximize the object function;And
Training subelement, for by being adjusted to the generation network weight and the alternating for differentiating network weight, making
The generation error and the differentiation error reach dynamic equilibrium, complete the generation network model and the differentiation network model
Training.
Further, the evaluation module specifically includes:
Risk factor acquiring unit, for the real information of the user to be assessed to be inputted the assessment models to training
In, obtain the risk factor that the assessment models export;And
Information acquisition unit is assessed, for obtaining the corresponding credit accrediting amount of the risk factor and interest rate value, and will
The risk assessment information of the risk factor and the corresponding credit accrediting amount and interest rate value as user to be assessed.
Further, personal credit's risk evaluating system further includes:
Correcting module, is modified for the deviation data in the risk assessment information to having assessed user;
Re -training module, for re-starting instruction to the generation network model using revised risk assessment information
Practice, so that the generation network model simulation after re -training generates new user's training data sample;And
Regular exercise module, for being periodically trained using new user's training data sample to the assessment models,
The assessment models are kept to be in effective status.
The beneficial effect that technical solution provided in an embodiment of the present invention is brought is:
User's real information sample is gathered, generation network model is trained, makes the simulation generation of generation network model big
The amount user training data sample consistent with the distribution character of user's real information sample, then sample is trained using substantial amounts of user
This, is trained assessment models, the assessment models after training is assessed according to the real information of user to be assessed, obtains
The risk assessment information of user to be assessed, without the artificial efficiency and accuracy assessed, effectively improve assessment.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment
Attached drawing is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention, for
For those of ordinary skill in the art, without creative efforts, other can also be obtained according to these attached drawings
Attached drawing.
Fig. 1 is the flow diagram for personal credit's methods of risk assessment that the embodiment of the present invention one provides;
Fig. 2 is the schematic diagram for personal credit's methods of risk assessment that the embodiment of the present invention one provides;
Fig. 3 is the structure diagram of personal credit's risk evaluating system provided by Embodiment 2 of the present invention.
Embodiment
To make the object, technical solutions and advantages of the present invention clearer, below in conjunction with attached drawing to embodiment party of the present invention
Formula is described in further detail.
Embodiment one
An embodiment of the present invention provides a kind of personal credit's methods of risk assessment, referring to Fig. 1, this method includes:
S1, collection user's real information sample;
S2, simulate generation user's training data sample according to the distribution character of user's real information sample;
S3, according to user's training data sample be trained assessment models;
S4, the real information for obtaining user to be assessed;
S5, input the real information of the user to be assessed in the assessment models to training, obtains user to be assessed
Risk assessment information.
It should be noted that in step sl, user's real information of collection mainly includes three aspects:User itself is personal
Attribute information, social information and proprietary information.Wherein, user itself personal attribute information includes gender, age, native place, education
Background, marriage and childbirth, occupation etc.;Social information includes the information such as cell-phone number replacement frequency, address replacement frequency, hobby;Proprietary information
Situation etc. is in arrears with including house property, vehicle, loan, credit card amount, credit card.
Before step S2 is performed, structuring processing is carried out to the information gathered.For example, the text information that will be collected
Vectorization expression is carried out, all information collected carry out unified scale and represent, it is unified using day to be single to be such as in arrears with the credit card date
Position etc..And then the information of structuring is stored in database, and the user data for having missing is removed, while the number that will have redundancy
According to necessarily being deleted.
Further, in step s 2, it is described that generation use is simulated according to the distribution character of user's real information sample
Family training data sample, specifically includes:
Establish and train generation network model;
Gaussian noise, simulation generation and user's real information sample are inputted into the generation network model after training
The consistent user's training data sample of distribution character.
Further, it is described to establish and train generation network model, specifically include:
Establish confrontation network model;The confrontation network model includes generation network model and differentiates network model;
Network weight is generated according to the generation error transfer factor of the generation network model, to minimize object function;
Network weight is differentiated according to the differentiation error transfer factor of the differentiation network model, to maximize the object function;
By being adjusted to the generation network weight and the alternating for differentiating network weight, make the generation error and institute
State and differentiate that error reaches dynamic equilibrium, complete the generation network model and the training for differentiating network model.
It should be noted that anti-network model is a network model based on game, wherein, generate network model
Generato is used to generate the user instruction consistent with the distribution character of user's real information by the noise simulation of Gaussian Profile
Practice data;Differentiate that network model Discriminator is used to differentiate that the data of its input to be the information generated by Generato
Or user's real information.
Generate network model and differentiate that network model is full Connection Neural Network, structure is:
Wherein, w represents weights, and b represents biasing, and W represents the matrix of weights, the transposition of T representing matrixes, and f represents activation letter
Number, generally sigmoid functions, by multi-level stacking, form a complete neutral net.
In network model is generated, one random noise vector z with Gaussian Profile of input of neutral net, its dimension
Can be any n values, and the data dimension of user's real information is m, user's real information is expressed as { x1, x2..., xm, y },
Wherein y is the label of data, i.e. risk factor, then the output dimension for generating network model is m, and generation data format is { x1,
x2..., xm}.In network model is differentiated, input dimension is m, and output dimension is 1, that is, generates the data that network model is generated
With user's truthful data as the input for differentiating network model, output label 1 represents network and judges that input data is true as user
Data, output label represent network for 0 and judge that input data makes a living into the data of network model generation.
Generation network model and the differentiation network model object function to be optimized are:
Wherein, D represents to differentiate network model, and G represents generation network model.
When optimization generates network model, object function V is minimized, and adjust generation network weight, differentiated in optimization
During network model, object function V is maximized, and adjust differentiation network weight, by alternately updating two network weights, made
Two network models are staggeredly trained by game method, to get a promotion at the same time in game, when generation error is with differentiating error
Two network models are restrained when reaching dynamic equilibrium, are finally reached the system for the data for being generated generation network model
Cloth of scoring is extremely similar to the statistical distribution of user's truthful data.
After two network model training completions reach convergence, the generation network model in confrontation network model is extracted
Part.Random Gaussian reflectivity mirrors are inputted into generation network model, generation network model is generated several and meets assessment
The training sample of model training, i.e. user's training data sample.
Further, in step s3, assessment models are an XGBoost assessment models, and wherein XGBoost is a kind of base
In the machine learning algorithm of decision tree, feature is good etc. for that can carry out end-to-end study, parallel speed block, robustness, is one
The good classification ensemble learning algorithm of kind.In the present embodiment, be adapted to according to the feature selecting of data XGBoost parameters (by
Different with dimension in data scale, optimal parameter is also different), including iterations, learning rate, object function, maximal tree depth
Deng.
After XGBoost assessment models are established, given according to a large number of users training data sample of generation and artificial standard
The risk factor gone out is trained XGBoost assessment models.
Further, in step s 5, the real information by the user to be assessed inputs the assessment to training
In model, the risk assessment information of user to be assessed is obtained, is specifically included:
The real information of the user to be assessed is inputted in the assessment models to training, it is defeated to obtain the assessment models
The risk factor gone out;
Obtain the corresponding credit accrediting amount of the risk factor and interest rate value, and by the risk factor and corresponding letter
Borrow the risk assessment information of the accrediting amount and interest rate value as user to be assessed.
It should be noted that inputting the real information of user to be assessed into assessment models, assessment models output is to be evaluated
Estimate the risk factor of user, the final credit information of user to be assessed obtained according to risk factor, such as maximum accrediting amount and
Interest rate relation etc..
Further, after step s 5, i.e., also wrapped after the risk assessment information for obtaining user to be assessed
Include:
The deviation data in risk assessment information to having assessed user is modified;
Training is re-started to the generation network model using revised risk assessment information, so that after re -training
Generation network model simulation generate new user's training data sample;
Periodically the assessment models are trained using new user's training data sample, are kept at the assessment models
In effective status.
It should be noted that after assessment is made to user to be assessed, assessment result can be also modified, and according to repairing
Result re -training generation network model after just, assessment models, further improve the accuracy of assessment.
Specifically, after assessment, the risk assessment information for having assessed user is all preserved in the database, and periodic cleaning
Partial failure historical data.Artificially the deviation data in risk assessment information is modified, such as picks out risk assessment letter
There are risk factor and data of obvious deviation etc. in breath.The training of generation network model is re-started according to revised data,
It is valid data always to keep training data, and then periodically continues trained XGBoost assessment models using new training data, is protected
Hold assessment models and be in long-time effective status.
Referring to Fig. 2, for the schematic diagram of personal credit's methods of risk assessment provided in an embodiment of the present invention.Generate network model
Gaussian noise is inputted, differentiates that network model input generates data and the user's truthful data that network model is generated, to generation
Network model and differentiation network model are alternately trained, and are made two network models in game while are improved, realize unsupervised
Study.After the training of generation network model is completed, Gaussian noise is inputted into generation network model, makes generation network model defeated
Go out training data largely consistent with the distribution character of user's truthful data.Further according to training data to XGBoost assessment models
It is trained, user information is inputted to the XGBoost assessment models to training, you can export by XGBoost assessment models
Assessment result, obtains the risk assessment information of user.
The embodiment of the present invention gathers user's real information sample, and generation network model is trained, makes generation network mould
Pattern is intended generating user's training data sample largely consistent with the distribution character of user's real information sample, then using a large amount of
User's training sample, assessment models are trained, make the assessment models after training according to the real information of user to be assessed
Assessed, obtain the risk assessment information of user to be assessed, without artificial assessment, effectively improve the efficiency of assessment and accurate
Property.
Embodiment two
An embodiment of the present invention provides a kind of personal credit's risk evaluating system, can realize that above-mentioned personal credit's risk is commented
Estimate all flows of method, referring to Fig. 3, the system comprises:
Acquisition module 1, for gathering user's real information sample;
Generation module 2 is simulated, for simulating generation user's training according to the distribution character of user's real information sample
Data sample;
Training module 3, for being trained according to user's training data sample to assessment models;
Acquisition module 4, for obtaining the real information of user to be assessed;And
Evaluation module 5, for inputting the real information of the user to be assessed in the assessment models to training, obtains
The risk assessment information of user to be assessed.
Further, the simulation generation module specifically includes:
Unit is established, for establishing and training generation network model;And
Generation unit, for inputting Gaussian noise, simulation generation and the user into the generation network model after training
The consistent user's training data sample of the distribution character of real information sample.
Further, the model foundation unit specifically includes:
Model foundation subelement, network model is resisted for establishing;The confrontation network model includes generation network model
With differentiation network model;
First adjustment subelement, for generating network weight according to the generation error transfer factor of the generation network model, with
Minimize object function;
Second adjustment subelement, for differentiating network weight according to the differentiation error transfer factor of the differentiation network model, with
Maximize the object function;And
Training subelement, for by being adjusted to the generation network weight and the alternating for differentiating network weight, making
The generation error and the differentiation error reach dynamic equilibrium, complete the generation network model and the differentiation network model
Training.
Further, the evaluation module specifically includes:
Risk factor acquiring unit, for the real information of the user to be assessed to be inputted the assessment models to training
In, obtain the risk factor that the assessment models export;And
Information acquisition unit is assessed, for obtaining the corresponding credit accrediting amount of the risk factor and interest rate value, and will
The risk assessment information of the risk factor and the corresponding credit accrediting amount and interest rate value as user to be assessed.
Further, personal credit's risk evaluating system further includes:
Correcting module, is modified for the deviation data in the risk assessment information to having assessed user;
Re -training module, for re-starting instruction to the generation network model using revised risk assessment information
Practice, so that the generation network model simulation after re -training generates new user's training data sample;And
Regular exercise module, for being periodically trained using new user's training data sample to the assessment models,
The assessment models are kept to be in effective status.
The embodiment of the present invention gathers user's real information sample, and generation network model is trained, makes generation network mould
Pattern is intended generating user's training data sample largely consistent with the distribution character of user's real information sample, then using a large amount of
User's training sample, assessment models are trained, make the assessment models after training according to the real information of user to be assessed
Assessed, obtain the risk assessment information of user to be assessed, without artificial assessment, effectively improve the efficiency of assessment and accurate
Property.
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all the present invention spirit and
Within principle, any modification, equivalent replacement, improvement and so on, should all be included in the protection scope of the present invention.