CN112508679A - Small and micro enterprise loan risk assessment method and device and storage medium - Google Patents
Small and micro enterprise loan risk assessment method and device and storage medium Download PDFInfo
- Publication number
- CN112508679A CN112508679A CN202011278327.7A CN202011278327A CN112508679A CN 112508679 A CN112508679 A CN 112508679A CN 202011278327 A CN202011278327 A CN 202011278327A CN 112508679 A CN112508679 A CN 112508679A
- Authority
- CN
- China
- Prior art keywords
- loan
- small
- enterprise
- micro
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/03—Credit; Loans; Processing thereof
Landscapes
- Business, Economics & Management (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- Engineering & Computer Science (AREA)
- Development Economics (AREA)
- Economics (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Technology Law (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
本申请公开了一种小微企业贷款风险评估方法、装置及存储介质,涉及信贷审批技术领域,解决了现有技术中对小微企业的还款能力评估不准确,并且占用的大量劳动力的问题;该方法包括:获取小微企业的用电运行状态数据和贷款数据;构建小微企业的贷款评估指标体系;对贷款评估指标体系中每个评估指标的赋予权重,确定企业综合评分,所述企业综合评分等于多个评估指标的加权和;根据企业综合评分设定监测阈值,以监测小微企业的贷前和贷后风险;该方法有效解决了工作人员对还款能力评估不准确,并且占用大量的劳动力的问题,进而实现了根据用电运行状态数据和其他数据确定小微企业的贷款风险,提高了风险评估的准确性,并且节省了大量的人员成本。
The present application discloses a small and micro enterprise loan risk assessment method, device and storage medium, which relates to the technical field of credit approval and solves the problems of inaccurate assessment of the repayment ability of small and micro enterprises in the prior art and the occupation of a large amount of labor force ; The method includes: obtaining the electricity operation status data and loan data of small and micro enterprises; constructing a loan evaluation index system for small and micro enterprises; assigning weights to each evaluation index in the loan evaluation index system, and determining the comprehensive score of the enterprise. The enterprise comprehensive score is equal to the weighted sum of multiple evaluation indicators; the monitoring threshold is set according to the enterprise comprehensive score to monitor the pre-loan and post-loan risks of small and micro enterprises; this method effectively solves the inaccurate assessment of the repayment ability of the staff, and The problem of occupying a lot of labor is realized, and the loan risk of small and micro enterprises can be determined according to the data of electricity operation status and other data, which improves the accuracy of risk assessment and saves a lot of personnel costs.
Description
技术领域technical field
本申请涉及信贷审批技术领域,尤其涉及一种小微企业贷款风险评估方法、装置及存储介质。The present application relates to the technical field of credit approval, and in particular, to a method, device and storage medium for loan risk assessment of small and micro enterprises.
背景技术Background technique
贷款是金融机构提供给消费者资金,消费者按照一定的利率和归还条件归资金的方式,金融机构通过贷款的方式将货币资金投放出去,满足社会扩大生产对补充资金的需要,促进经济发展。申请贷款,也成为了当代多数消费者的消费习惯。Loans are funds provided by financial institutions to consumers. Consumers return funds according to certain interest rates and repayment conditions. Financial institutions release monetary funds through loans to meet the needs of the society to expand production for supplementary funds and promote economic development. Applying for a loan has also become a consumption habit of most consumers today.
目前,随着社会的发展,小微企业越来越多,小微企业也成为解决就业率的主力,小微企业也成为带贷款的主要群体,对金融机构来说,如何对小微企业的还款能力进行评估是首要问题。At present, with the development of society, there are more and more small and micro enterprises, small and micro enterprises have become the main force in solving the employment rate, and small and micro enterprises have also become the main group with loans. For financial institutions, how to deal with small and micro enterprises? Assessing repayment ability is the primary issue.
现有的评估方法是基于调查人员对小微企业的资质、信用和财产状况进行考察,即人工对小微企业的贷款资料进行审查,导致工作人员对还款能力评估不准确,并且占用大量的劳动力。The existing evaluation method is based on the investigator's inspection of the qualifications, credit and property status of the small and micro enterprises, that is, the manual review of the loan information of the small and micro enterprises, resulting in inaccurate assessment of the repayment ability of the staff, and occupying a large amount of money. labor force.
发明内容SUMMARY OF THE INVENTION
本申请实施例通过提供一种小微企业贷款评估方法、装置及存储介质,解决了现有技术中对小微企业的还款能力评估不准确,并且占用的大量劳动力的问题。The embodiments of the present application solve the problems of inaccurate assessment of the repayment ability of small and micro enterprises in the prior art and a large amount of labor occupied by providing a small and micro enterprise loan evaluation method, device and storage medium.
第一方面,本发明实施例提供了一种小微企业贷款评估方法,其特征在于,包括:In a first aspect, an embodiment of the present invention provides a small and micro enterprise loan evaluation method, characterized in that it includes:
获取小微企业的用电运行状态数据和贷款数据;Obtain electricity operation status data and loan data of small and micro enterprises;
构建所述小微企业的贷款评估指标体系;constructing a loan evaluation index system for the small and micro enterprises;
对所述贷款评估指标体系中每个评估指标的赋予权重,确定企业综合评分,所述企业综合评分等于多个所述评估指标的加权和;A weight is assigned to each evaluation index in the loan evaluation index system, and a comprehensive enterprise score is determined, and the comprehensive enterprise score is equal to the weighted sum of a plurality of the evaluation indicators;
根据所述企业综合评分设定监测阈值,以监测所述小微企业的贷前和贷后风险。A monitoring threshold is set according to the comprehensive score of the enterprise to monitor the pre-loan and post-loan risks of the small and micro enterprise.
结合第一方面,在一种可能的实现方式中,所述对所述贷款评估指标体系中每个评估指标的赋予权重,包括:With reference to the first aspect, in a possible implementation manner, the weight assigned to each evaluation index in the loan evaluation index system includes:
采用层次分析法或熵值法,确定每个所述评估指标的权重。The weight of each of the evaluation indicators is determined by using the analytic hierarchy process or the entropy method.
结合第一方面,在一种可能的实现方式中,所述对所述贷款评估指标体系中每个评估指标的赋予权重,包括:With reference to the first aspect, in a possible implementation manner, the weight assigned to each evaluation index in the loan evaluation index system includes:
采用综合赋权法确定每个所述评估指标的权重,所述综合赋权法的函数为:A comprehensive weighting method is used to determine the weight of each of the evaluation indicators, and the function of the comprehensive weighting method is:
其中,Wj为综合赋权的权重值,W1j为采用层次分析法和TOPSIS法确定权重值,W2j为熵值法和TOPSIS法确定权重值。Among them, W j is the weight value of the comprehensive weighting, W 1j is the weight value determined by the AHP and TOPSIS method, and W 2j is the weight value determined by the entropy value method and the TOPSIS method.
结合第一方面,在一种可能的实现方式中,所述根据所述企业综合评分设定监测阈值,包括:With reference to the first aspect, in a possible implementation manner, the setting of the monitoring threshold according to the enterprise comprehensive score includes:
采用梯度提升树法预测所述小微企业的月用电量,进而设定所述监测阈值。The gradient boosting tree method is used to predict the monthly electricity consumption of the small and micro enterprises, and then the monitoring threshold is set.
结合第一方面,在一种可能的实现方式中,还包括:In combination with the first aspect, in a possible implementation manner, the method further includes:
关联所述用电运行状态数据和所述贷款数据,并筛选字段形成数据宽表;Associating the power consumption operating status data with the loan data, and filtering fields to form a data wide table;
在所述数据宽表的基础上对数据进行处理,以解决样本不平衡现象、处理异常值和填补缺失值。Data is processed on the basis of the data wide table to resolve sample imbalance, handle outliers and fill in missing values.
结合第一方面,在一种可能的实现方式中,所述在所述数据宽表的基础上对数据进行处理,包括:With reference to the first aspect, in a possible implementation manner, the data processing on the basis of the data wide table includes:
采用欠采样法或调整权重采样法,解决样本不平衡现象;Adopt undersampling method or adjusting weight sampling method to solve the sample imbalance phenomenon;
采用箱线法处理异常值;Outliers are handled by the box-plot method;
采用线性回归法填补缺失值。Missing values were filled with linear regression.
结合第一方面,在一种可能的实现方式中,所述评分体系包括:缴费信用度,用电稳定度,行业景气度,用电增长度,以及违约用电信息。With reference to the first aspect, in a possible implementation manner, the scoring system includes: payment credit, electricity consumption stability, industry prosperity, electricity consumption growth, and default electricity consumption information.
第二方面,本发明实施例提供了一种小微企业贷款评估装置,该装置包括:In a second aspect, an embodiment of the present invention provides a small and micro enterprise loan evaluation device, the device comprising:
数据获取单元,用于获取小微企业的用电运行状态数据和贷款数据;A data acquisition unit, which is used to acquire the power consumption operation status data and loan data of small and micro enterprises;
体系构建单元,用于构建所述小微企业的贷款评估指标体系;a system construction unit, used for constructing the loan evaluation index system of the small and micro enterprises;
综合评分单元,用于对所述贷款评估指标体系中每个评估指标的赋予权重,确定企业综合评分,所述企业综合评分等于多个所述评估指标的加权和;a comprehensive scoring unit, configured to assign a weight to each evaluation index in the loan evaluation index system, and determine a comprehensive enterprise score, where the comprehensive enterprise score is equal to the weighted sum of a plurality of the evaluation indicators;
监测方案生成单元,用于根据所述企业综合评分设定监测阈值,以监测所述小微企业的贷前和贷后风险。A monitoring plan generation unit, configured to set a monitoring threshold according to the comprehensive score of the enterprise, so as to monitor the pre-loan and post-loan risks of the small and micro enterprise.
结合第二方面,在一种可能的实现方式中,所述综合评分单元具体用于:With reference to the second aspect, in a possible implementation manner, the comprehensive scoring unit is specifically used for:
采用层次分析法和/或熵值法,确定每个所述评估指标的权重,然后确定企业综合评分。Using the AHP and/or the entropy method, determine the weight of each of the evaluation indicators, and then determine the comprehensive enterprise score.
结合第二方面,在一种可能的实现方式中,所述综合评分单元具体用于:With reference to the second aspect, in a possible implementation manner, the comprehensive scoring unit is specifically used for:
采用综合赋权法确定每个所述评估指标的权重,然后确定企业综合评分,所述综合赋权法的函数为:The weight of each of the evaluation indicators is determined by the comprehensive weighting method, and then the comprehensive score of the enterprise is determined. The function of the comprehensive weighting method is:
其中,Wj为综合赋权的权重值,W1j为层次分析法和TOPSIS法确定权重值,W2j为熵值法和TOPSIS法确定权重值。Among them, W j is the weight value of the comprehensive weighting, W 1j is the weight value determined by the AHP and TOPSIS method, and W 2j is the weight value determined by the entropy value method and the TOPSIS method.
结合第二方面,在一种可能的实现方式中,所述监测方案生成单元具体用于:采用梯度提升树法预测所述小微企业的月用电量,进而设定所述监测阈值。With reference to the second aspect, in a possible implementation manner, the monitoring scheme generating unit is specifically configured to: predict the monthly electricity consumption of the small and micro enterprise by using a gradient boosting tree method, and then set the monitoring threshold.
结合第二方面,在一种可能的实现方式中,所述装置还包括数据宽表构造单元和数据处理单元:With reference to the second aspect, in a possible implementation manner, the apparatus further includes a data wide table construction unit and a data processing unit:
所述数据宽表构造单元用于关联所述用电运行状态数据和所述贷款数据,并筛选字段形成数据宽表;The data wide table constructing unit is used for correlating the power consumption operation status data and the loan data, and filtering fields to form a data wide table;
所述数据处理单元用于在所述数据宽表的基础上对数据进行处理,以解决样本不平衡现象、处理异常值和填补缺失值。The data processing unit is configured to process the data on the basis of the data wide table, so as to solve the phenomenon of sample imbalance, process outliers and fill in missing values.
结合第二方面,在一种可能的实现方式中,所述数据处理单元具体用于:With reference to the second aspect, in a possible implementation manner, the data processing unit is specifically configured to:
采用欠采样法或调整权重采样法,解决样本不平衡现象;Adopt undersampling method or adjusting weight sampling method to solve the sample imbalance phenomenon;
采用箱线法处理异常值;Outliers are handled by the box-plot method;
采用线性回归法填补缺失值。Missing values were filled with linear regression.
结合第二方面,在一种可能的实现方式中,所述贷款评估指标体系包括:缴费信用度,用电稳定度,行业景气度,用电增长度,以及违约用电信息。With reference to the second aspect, in a possible implementation manner, the loan evaluation index system includes: payment credit, electricity consumption stability, industry prosperity, electricity consumption growth, and default electricity consumption information.
第三方面,本申请实施例提供基于层次分析的评分装置,该装置包括存储器和处理器;In a third aspect, an embodiment of the present application provides an AHP-based scoring device, the device including a memory and a processor;
所述存储器用于存储计算机可执行指令;the memory for storing computer-executable instructions;
所述处理器用于执行所述计算机可执行指令,以实现第一方面以及第一方面各种可能实现方式所述的方法。The processor is configured to execute the computer-executable instructions to implement the method of the first aspect and various possible implementations of the first aspect.
第四方面,本申请实施例提供一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有可执行指令,计算机执行所述可执行指令时能够实现第一方面以及第一方面各种可能实现方式所述的方法。In a fourth aspect, an embodiment of the present application provides a computer-readable storage medium, wherein the computer-readable storage medium stores executable instructions, and when a computer executes the executable instructions, the first aspect and the first aspect can be implemented. Aspects various possible implementations of the methods described.
本发明实施例中提供的一个技术方案,至少具有如下技术效果或优点:A technical solution provided in the embodiment of the present invention has at least the following technical effects or advantages:
本发明实施例提供了一种小微企业贷款风险评估方法,该方法通过使用用电运行状态数据和贷款数据构建小微企业的贷款评估指标体系,并对贷款评估指标体系中每个评估指标的赋予权重后确定企业综合评分,通过企业综合评分设定监测阈值后能够监测小微企业的贷前和贷后风险,进而有效解决了工作人员对还款能力评估不准确,并且占用大量的劳动力的问题,实现了根据用电运行状态数据和其他数据确定小微企业的贷款风险评估,提高了风险评估的准确性和节省了大量的人员成本。The embodiment of the present invention provides a loan risk assessment method for small and micro enterprises. The method constructs a loan evaluation index system for small and micro enterprises by using power consumption operation status data and loan data, and evaluates the value of each evaluation index in the loan evaluation index system. After the weight is assigned, the comprehensive enterprise score is determined. After setting the monitoring threshold through the enterprise comprehensive score, the pre-loan and post-loan risks of small and micro enterprises can be monitored, thereby effectively solving the problem of inaccurate assessment of the repayment ability of the staff and the occupation of a large amount of labor. It realizes the loan risk assessment of small and micro enterprises based on the data of electricity operation status and other data, which improves the accuracy of risk assessment and saves a lot of personnel costs.
附图说明Description of drawings
为了更清楚地说明本发明实施例的技术方案,下面将对本发明实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the following briefly introduces the accompanying drawings used in the description of the embodiments of the present invention. Obviously, the drawings in the following description are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without any creative effort.
图1为本申请实施例提供的小微企业贷款评估方法的流程图;FIG. 1 is a flowchart of a small and micro enterprise loan evaluation method provided by the embodiment of the present application;
图2为本申请实施例提供的小微企业贷款评估方法的分层分析法的流程图;2 is a flowchart of the hierarchical analysis method of the loan evaluation method for small and micro enterprises provided by the embodiment of the present application;
图3为本申请实施例提供的小微企业贷款评估方法的熵值法的流程图;3 is a flowchart of the entropy method of the loan evaluation method for small and micro enterprises provided by the embodiment of the present application;
图4为本申请实施例提供的小微企业贷款评估方法的TIPSIS法的流程图;4 is a flowchart of the TIPSIS method of the small and micro enterprise loan evaluation method provided in the embodiment of the present application;
图5为本申请实施例提供的小微企业贷款评估方法的综合评分法的流程图;5 is a flowchart of the comprehensive scoring method of the small and micro enterprise loan evaluation method provided by the embodiment of the present application;
图6为本申请实施例提供的小微企业贷款评估方法的XGBoost算法所做的月用电量预测折线图;6 is a line chart of monthly electricity consumption forecast made by the XGBoost algorithm of the small and micro enterprise loan evaluation method provided by the embodiment of the present application;
图7为本申请实施例提供的小微企业贷款评估方法的具体处理数据的流程图;7 is a flow chart of the specific processing data of the small and micro enterprise loan evaluation method provided by the embodiment of the present application;
图8为本申请实施例提供的小微企业贷款评估方法的评估体系示意图;FIG. 8 is a schematic diagram of the evaluation system of the loan evaluation method for small and micro enterprises provided by the embodiment of the present application;
图9为本申请实施例提供的小微企业贷款评估装置示意图;FIG. 9 is a schematic diagram of a loan evaluation device for small and micro enterprises provided by the embodiment of the present application;
图10为本申请实施例提供的小微企业贷款评估实体装置示意图。FIG. 10 is a schematic diagram of a small and micro enterprise loan evaluation entity device according to an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述。显然,所描述的实施例是本发明的一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
本申请提供一种小微企业贷款评估方法,如图1所示,该方法包括以下步骤S101至步骤S104,具体的步骤如下。This application provides a small and micro enterprise loan evaluation method. As shown in FIG. 1 , the method includes the following steps S101 to S104, and the specific steps are as follows.
步骤S101,获取小微企业的用电运行状态数据和贷款数据。Step S101 , obtaining power consumption operation status data and loan data of the small and micro enterprises.
步骤S102,构建小微企业的贷款评估指标体系。Step S102, constructing a loan evaluation index system for small and micro enterprises.
步骤S103,对贷款评估指标体系中每个评估指标的赋予权重,确定企业综合评分,企业综合评分等于多个评估指标的加权和。In step S103, a weight is assigned to each evaluation index in the loan evaluation index system, and a comprehensive enterprise score is determined, and the comprehensive enterprise score is equal to the weighted sum of multiple evaluation indicators.
步骤S104,根据企业综合评分设定监测阈值,以监测小微企业的贷前和贷后风险。In step S104, a monitoring threshold is set according to the comprehensive score of the enterprise to monitor the pre-loan and post-loan risks of the small and micro enterprises.
本发明实施例提供的小微企业贷款风险评估方法,通过使用用电运行状态数据和贷款数据构建小微企业的贷款评估指标体系,并对贷款评估指标体系中每个评估指标的赋予权重后确定企业综合评分,通过企业综合评分设定监测阈值后能够监测小微企业的贷前和贷后风险,有效解决了工作人员对还款能力评估不准确,并且占用大量的劳动力的问题,进而实现了根据用电运行状态数据和其他数据确定小微企业的贷款风险,提高了小微企业的贷款风险评估的准确性,并且节省了大量的人员成本。In the loan risk assessment method for small and micro enterprises provided by the embodiments of the present invention, a loan evaluation index system for small and micro enterprises is constructed by using power consumption operation status data and loan data, and a weight is assigned to each evaluation index in the loan evaluation index system. Enterprise comprehensive score, after setting the monitoring threshold through enterprise comprehensive score, it can monitor the pre-loan and post-loan risks of small and micro enterprises. The loan risk of small and micro enterprises is determined according to the data of electricity operation status and other data, which improves the accuracy of loan risk assessment for small and micro enterprises, and saves a lot of personnel costs.
具体的,在步骤S101实施时,获取小微企业近几年的用电运行状态数据,对用电行为、缴费行为、违约行为进行统计,得到相关指标,并且和金融机构进行相互合作,获取到小微企业的近几年来的贷款数据。Specifically, when step S101 is implemented, the data on the operation status of electricity consumption of small and micro enterprises in recent years is obtained, the electricity consumption behavior, bill payment behavior, and breach of contract behavior are counted to obtain relevant indicators, and mutual cooperation with financial institutions is carried out to obtain the Loan data of small and micro enterprises in recent years.
在步骤S102中,小微企业的贷款评估指标体系是根据步骤S101中得到的电运行状态数据和贷款数据构建的,具体的根据电运行状态数据中的用电增长度、行业景气度、用电稳定度、缴费信用度、违约用电以及贷款涨幅构建贷款评估指标体系。In step S102, the loan evaluation index system of small and micro enterprises is constructed according to the electricity operation status data and loan data obtained in step S101, and specifically according to the electricity consumption growth rate, industry prosperity, electricity consumption in the electricity operation status data Stability, payment credit, defaulted electricity consumption and loan growth to build a loan evaluation index system.
在步骤S103中,具体的,对贷款评估指标体系中每个评估指标的赋予权重,包括:In step S103, specifically, the weights assigned to each evaluation index in the loan evaluation index system include:
采用层次分析法或熵值法,确定每个评估指标的权重。The weight of each evaluation index is determined by the analytic hierarchy process or the entropy method.
在上述的采用层次分析法确定每个评估指标的权重时,如图2所示,具体包括以下几个步骤。When the above-mentioned AHP is used to determine the weight of each evaluation index, as shown in Figure 2, the following steps are specifically included.
步骤S201,建立层次递进结构模型。Step S201, establishing a hierarchical progressive structure model.
步骤S202,构造出各层次中的所有判断矩阵。Step S202, construct all judgment matrices in each level.
步骤S203,层次单排序及一致性检验。Step S203, ordering and consistency checking of the hierarchical order.
步骤S204,层次总排序及一致性检验。Step S204, total ordering of the hierarchy and consistency check.
在具体的使用层次分析法时,得到一个层次分析法的判断矩阵,构造小微企业信用评估的5个重要指标,得到如下表所示的层次分析法判断矩阵。在下文的叙述中称下表为表Ⅰ。When using AHP specifically, an AHP judgment matrix is obtained, five important indicators for credit evaluation of small and micro enterprises are constructed, and the AHP judgment matrix shown in the following table is obtained. The following table is referred to as Table I in the following description.
在表Ⅰ中的标度说明如下:1表示两个元素相比具有同样的重要性;3表示两个元素相比,前者比后者稍重要;5表示两个元素相比前者比后者明显重要。倒数表示若元素i和元素j的重要性之比为aij,那么元素j与元素i的重要性之比为 The scales in Table I are explained as follows: 1 means that the two elements are of equal importance; 3 means that the former is slightly more important than the latter; 5 means that the former is more important than the latter important. The reciprocal means that if the ratio of the importance of element i to element j is a ij , then the ratio of the importance of element j to element i is
求解各层次判断矩阵,得到层次分析法权重。经过上述步骤的计算,得到层次分析法确定的每个评估指标的权重值。Solve the judgment matrix of each level to get the weight of AHP. After the calculation in the above steps, the weight value of each evaluation index determined by the AHP is obtained.
在具体的使用熵值法时,如图3所示,分为以下步骤,计算得到熵值法确定的每个评估指标的权重。When specifically using the entropy method, as shown in Figure 3, it is divided into the following steps to calculate the weight of each evaluation index determined by the entropy method.
步骤S301,构建训练样本。Step S301, constructing training samples.
步骤S302,对训练样本进行标准化处理。Step S302, standardize the training samples.
步骤S303,计算第i项和第j项指标值的比重值。Step S303: Calculate the proportion value of the index value of the i-th item and the j-th item.
步骤S304,计算指标信息熵。Step S304, calculating the index information entropy.
步骤S305,计算信息熵冗余度。Step S305, calculating the information entropy redundancy.
步骤S306,计算指标权重。Step S306, calculate the index weight.
在上述的步骤S301中构建的训练样本为:The training samples constructed in the above step S301 are:
x{{x11,x12,x13...x1j},{x21,x22,x23...x2j},{x31,x32,x33...x3j}...{xi1,xi2,x3i...xij}}。x{{x 11 ,x 12 ,x 13 ...x 1j },{x 21 ,x 22 ,x 23 ...x 2j },{x 31 ,x 32 ,x 33 ...x 3j }. ..{x i1 ,x i2 ,x 3i ...x ij }}.
具体的,经过对步骤S301中构建的训练样本的标准化处理后,得到:正向指标:负向指标:其中xi'j表示标准化后的数据值;min{xj}表示一组元数中的最小值;max{xj}表示一组元素的最大值。Specifically, after standardizing the training samples constructed in step S301, it is obtained: the positive index: Negative indicators: where x i ' j represents the normalized data value; min{x j } represents the minimum value in a set of arity; max{x j } represents the maximum value of a set of elements.
进一步地,实施步骤S303:计算第i项和第j项指标值的比重值。计算公式为:其中,yij表示第i项指标值与第j项指标值的比重值,m表示元组的个数。然后实施步骤S304:计算指标信息熵。计算公式为:其中ej表示指标信息熵,k表示为 Further, step S303 is implemented: the proportion value of the index value of the i-th item and the j-th item is calculated. The calculation formula is: Among them, y ij represents the ratio of the i-th index value to the j-th index value, and m represents the number of tuples. Then step S304 is implemented: calculating the index information entropy. The calculation formula is: where e j represents the index information entropy, and k represents as
在步骤S305中计算信息熵冗余度的公式为:dj=1-ej。其中,dj表示信息熵冗余度。The formula for calculating the information entropy redundancy in step S305 is: d j =1-e j . Among them, d j represents the information entropy redundancy.
在步骤S306中计算指标权重的公式为:其中,Wi表示每个评估指标的权重值,n表示有n个元素。The formula for calculating the index weight in step S306 is: Among them, Wi represents the weight value of each evaluation index, and n represents that there are n elements.
经过上述步骤的计算,可以得到经过熵值法确定的指标权重值。After the calculation in the above steps, the index weight value determined by the entropy method can be obtained.
更进一步的,还可以用TOPSIS法(Technique for Order Preference bySimilarity to an Ideal Solution,优劣解距离法)确定每个评估指标的权重,如图4所示,包括以下几个步骤。Further, the TOPSIS method (Technique for Order Preference by Similarity to an Ideal Solution) can also be used to determine the weight of each evaluation index, as shown in Figure 4, including the following steps.
步骤S401,统一每个评估指标的单调性。Step S401, unify the monotonicity of each evaluation index.
步骤S402,归一化处理。Step S402, normalization processing.
步骤S403,进行加权处理。Step S403, performing weighting processing.
步骤S404,确定最优方案和最差方案。Step S404, determine the optimal solution and the worst solution.
步骤S405,计算每个评价对象到最优和最差方案的距离。Step S405, calculate the distance from each evaluation object to the optimal and worst solutions.
步骤S406,计算综合评价值。Step S406, calculate the comprehensive evaluation value.
在步骤S401中,在确定各项同一指标的单调性时,使用公式:其中,xij表示矩阵中i行j列的元素。In step S401, when determining the monotonicity of each same index, the formula is used: Among them, x ij represents the element of i row and j column in the matrix.
在步骤S402中,进行归一化处理的公式为其中xij表示矩阵中i行j列的元素,minj表示j列的最小元素,maxj表示j列的最大元素。In step S402, the formula for normalization processing is: where x ij represents the element in row i and column j in the matrix, min j represents the smallest element in column j, and max j represents the largest element in column j.
进一步地,将归一化后的数据进行加权处理,在步骤S403中,加权处理的公式为:Zij=Wij*Aij,其中,Zij表示加权的得分矩阵,Wij表示人工给定的加权值,Aij表示归一化后的评估矩阵。Further, perform weighting processing on the normalized data. In step S403, the formula for weighting processing is: Z ij =W ij *A ij , where Z ij represents a weighted score matrix, and W ij represents an artificial given The weighted value of , A ij represents the normalized evaluation matrix.
在步骤S404中,确定最优方案和最差方案,确定最优方案的公式为:确定最差方案的公式为:其中,Z+表示矩阵Zij按行计算的最大解,Z-表示矩阵Zij按行计算的最小解。In step S404, the optimal solution and the worst solution are determined, and the formula for determining the optimal solution is: The formula for determining the worst case scenario is: Among them, Z + represents the maximum solution calculated by the row of the matrix Z ij , and Z - represents the smallest solution of the matrix Z ij calculated by the row.
在步骤S405中,计算每个评价对象到最优和最差方案的距离,分别为与具体的计算公式为:其中,Xi表示对象矩阵,表示对象为最优方案距离,表示对象为最差方案距离,Z+表示最优方案矩阵,Z-表示最差方案矩阵。In step S405, the distances from each evaluation object to the optimal and worst solutions are calculated, which are respectively and The specific calculation formula is: where X i represents the object matrix, Indicates that the object is the optimal solution distance, Indicates that the object is the worst solution distance, Z + represents the optimal solution matrix, and Z - represents the worst solution matrix.
在步骤S406中,为计算综合评价值。计算综合评价值的计算公式为:其中,Bi表示综合评价价值矩阵,表示对象为最优方案的距离,表示对象为最方案距离。In step S406, the comprehensive evaluation value is calculated. The formula for calculating the comprehensive evaluation value is: Among them, B i represents the comprehensive evaluation value matrix, represents the distance at which the object is the optimal solution, Indicates that the object is the optimal distance.
进一步的,在步骤S103中,对贷款评估指标体系中每个评估指标的赋予权重,包括:Further, in step S103, the weights assigned to each evaluation index in the loan evaluation index system include:
采用综合赋权法确定每个评估指标的权重,综合赋权法的函数为:The weight of each evaluation index is determined by the comprehensive weighting method. The function of the comprehensive weighting method is:
其中,Wj表示综合赋权的权重值,W1j表示层次分析法和TOPSIS法确定的权重值,W2j表示熵值法和TOPSIS法确定的权重值。Among them, W j represents the weight value of the comprehensive weighting, W 1j represents the weight value determined by the AHP and TOPSIS method, and W 2j represents the weight value determined by the entropy value method and the TOPSIS method.
将层次分析法得到的权重值代入TOPSIS法中求出主观经验法最理想的权重值,如下表所示,以下称下表为表Ⅱ。Substitute the weight value obtained by the AHP into the TOPSIS method to obtain the most ideal weight value of the subjective experience method, as shown in the following table, hereinafter referred to as Table II.
由于根据主观经验法计算出的最理想权重值不能有很好地说服力,在指标的重要性上更倾向于人个人偏好,造成指标权重与实际存在偏差。因此,引入熵值法,通过数据自身蕴含的信息量大小去划分指标的权重分布。使用熵值法和TOPSIS法综合计算出客观最理想权重,客观权重如下表所示,以下称下表为表Ⅲ。Since the optimal weight value calculated according to the subjective experience method cannot be very convincing, the importance of the indicator is more inclined to personal preference, resulting in the deviation of the indicator weight from the actual. Therefore, the entropy value method is introduced to divide the weight distribution of the indicators according to the amount of information contained in the data itself. Use the entropy method and the TOPSIS method to comprehensively calculate the objective optimal weight. The objective weight is shown in the following table, which is hereinafter referred to as Table III.
在综合赋权的公式中,W1j表示层次分析法和TOPSIS法确定的权重值,W2j表示熵值法和TOPSIS法确定的权重值。W1j和W2j由上述的表Ⅱ和表Ⅲ得出,将上述的值代入综合赋权法的函数公式中,得到最终的权重值。如下表所示,以下称下表为表Ⅳ。In the formula of comprehensive weighting, W 1j represents the weight value determined by the AHP and TOPSIS method, and W 2j represents the weight value determined by the entropy value method and the TOPSIS method. W 1j and W 2j are obtained from the above-mentioned Table II and Table III, and the above-mentioned values are substituted into the function formula of the comprehensive weighting method to obtain the final weight value. As shown in the following table, hereinafter referred to as Table IV.
表Ⅳ中得到的权重为举例说明的几个指标的最终权重值。The weights obtained in Table IV are the final weight values of several indicators illustrated.
在步骤S103中,得出小微企业的综合评分,根据上述的表Ⅳ中的最终权重值,具体的对小微企业进行信用评分,由于各项指标的量纲不一致,所以需要进一步对所得数据进行归一化处理,消除量纲。消除量纲的公式为:In step S103, the comprehensive score of the small and micro enterprises is obtained. According to the final weight value in the above table IV, the credit score of the small and micro enterprises is specifically carried out. Since the dimensions of various indicators are inconsistent, it is necessary to further evaluate the obtained data. Perform normalization to eliminate dimensions. The formula for dimension elimination is:
由上述的公式得到企业的信用评分,得到如下表所示的结果,以下统一称下表为表Ⅴ。The credit score of the enterprise is obtained from the above formula, and the results shown in the following table are obtained, and the following table is collectively referred to as Table V.
表Ⅴ为根据上述的步骤和算法得出的企业的贷前风险评估评分。Table V is the pre-loan risk assessment score of the enterprise obtained according to the above steps and algorithms.
在步骤S104中,根据企业综合评分设定监测阈值,如图5所示,包括:采用梯度提升树法预测小微企业的月用电量,进而设定监测阈值。In step S104, the monitoring threshold is set according to the comprehensive score of the enterprise, as shown in FIG. 5, including: using the gradient boosting tree method to predict the monthly electricity consumption of the small and micro enterprises, and then setting the monitoring threshold.
进一步的,采用梯度提升树法预测小微企业贷后月用电量可以分为以下步骤。Further, using the gradient boosting tree method to predict the monthly electricity consumption of small and micro enterprises after loan can be divided into the following steps.
步骤S501,定义一棵回归树,不断地进行特征分裂来生长一棵树,每次添加一个树,学习一个新函数,去拟合上次预测的残差。Step S501, define a regression tree, continuously perform feature splitting to grow a tree, add a tree each time, learn a new function, and fit the residual of the last prediction.
步骤S502,定义XGBoost的目标函数,XGBoost即为梯度提升算法。Step S502, define an objective function of XGBoost, which is a gradient boosting algorithm.
步骤S503,用新生成的树去拟合上次预测的残差,得到预测分数的目标函数。Step S503, use the newly generated tree to fit the residual of the previous prediction to obtain the objective function of the prediction score.
步骤S504,对目标函数求一阶导和二阶导:得到每个叶节点的最优预测分数。Step S504, obtain the first-order derivative and the second-order derivative of the objective function: obtain the optimal prediction score of each leaf node.
步骤S505,最优预测分数带入目标函数,解得最小损失。Step S505, the optimal prediction score is brought into the objective function, and the minimum loss is obtained.
步骤S506,通过贪婪算法寻找最佳分支。Step S506, searching for the best branch through a greedy algorithm.
在步骤S501中,在训练完成后,得到K棵树,预测一个样本分数:In step S501, after the training is completed, K trees are obtained, and a sample score is predicted:
其中,K表示树的总数量,k表示第k棵树,fx(xi)表示一棵树上在这个叶子节点的回归值。Among them, K represents the total number of trees, k represents the kth tree, and f x (x i ) represents the regression value of a tree at this leaf node.
在步骤S502中,XGBoost的目标函数为:In step S502, the objective function of XGBoost is:
其中,表示用来衡量预测分数和真实分数的差距,表示正则化项,T表示叶子节点数,w表示叶子节点的分数,γ表示控制叶子节点的个数,λ表示控制叶子节点的分数不会过大。in, is used to measure the difference between the predicted score and the true score, Represents the regularization term, T represents the number of leaf nodes, w represents the score of leaf nodes, γ represents the number of control leaf nodes, and λ represents that the score of control leaf nodes will not be too large.
在步骤S503中,预测分数的目标函数为:In step S503, the objective function of the predicted score is:
其中,表示衡量预测分数和真实分数的差值,Ω(fk)表示正则化项,fx(xi)表示一棵树上在这个叶子节点的回归值。in, represents the difference between the predicted score and the true score, Ω(f k ) represents the regularization term, and f x (x i ) represents the regression value of a tree at this leaf node.
在步骤S504中,每个叶子节点的最优预测分数为:In step S504, the optimal prediction score of each leaf node is:
其中,gi和hi分别表示移除常量: where gi and hi represent removal constants, respectively:
在步骤S505中,解得的最小损失为:其中,obj表示树的节点分数,T表示叶子节点数,γ表示控制叶子节点的个数,λ表示控制叶子节点的分数不会过大,gi和hi分别表示,移除常量: In step S505, the minimum loss obtained is: Among them, obj indicates the node score of the tree, T indicates the number of leaf nodes, γ indicates the number of control leaf nodes, λ indicates that the score of the control leaf node will not be too large, gi and hi respectively represent, remove the constant:
在步骤S506中,寻找最佳分支,分支后的结构分数差为:其中,表示左子树分数,表示右子树分数,表示不可分割我们可以拿到的分数,γ表示加入新叶子节点引入的复杂度代价。In step S506, find the best branch, and the difference of the structure score after the branch is: in, represents the left subtree fraction, represents the right subtree score, Represents the score we can get for indivisible, and γ represents the complexity cost introduced by adding new leaf nodes.
根据小微企业近几年的用电运行状态数据,使用XGBoost算法计算贷前和贷后企业指标监测阈值。比如,图6使用XGBoost算法预测2020年7月份该小微企业的用电量,用电量同环比作参设定用电量的波动阈值。According to the data of electricity consumption operation status of small and micro enterprises in recent years, the XGBoost algorithm is used to calculate the monitoring thresholds of enterprise indicators before and after lending. For example, Figure 6 uses the XGBoost algorithm to predict the electricity consumption of the small and micro enterprise in July 2020, and the electricity consumption is compared with the chain as the parameter to set the fluctuation threshold of electricity consumption.
进一步的进行预测时,用小微企业的月用电量的同比率和环比率作为监测指标的参考标准,同比率和环比率的任意一个指标向任意方向的波动量大于等于5%小于等于10%定义为黄色预警,向任意方向波动量大于10%以上定义为黄色预警,向任意方向的波动量小于5%不发生预警。When making further forecasts, the year-on-year ratio and ring ratio of monthly electricity consumption of small and micro enterprises are used as reference standards for monitoring indicators. % is defined as a yellow warning, and if the fluctuation in any direction is greater than 10%, it is defined as a yellow warning, and if the fluctuation in any direction is less than 5%, no warning will occur.
举例如下,下表为部分企业2020年七月份的监测情况,以下称下表为表Ⅺ。For example, the following table shows the monitoring situation of some enterprises in July 2020, hereinafter referred to as Table XI.
本发明实施例提供的小微企业贷款评估方法,如图7所示,还包括步骤S701和步骤S702。The small and micro enterprise loan evaluation method provided by the embodiment of the present invention, as shown in FIG. 7 , further includes step S701 and step S702.
步骤S701,关联用电运行状态数据和贷款数据,并筛选字段形成数据宽表。Step S701, correlate the power consumption operation status data and loan data, and filter fields to form a data wide table.
步骤S702,在数据宽表的基础上对数据进行处理,以解决样本不平衡现象、处理异常值和填补缺失值。In step S702, the data is processed on the basis of the data wide table to solve the phenomenon of sample imbalance, process outliers and fill in missing values.
在步骤S701中,从现有的小微企业用电运行状态数据和贷款数据中,筛选出有可能使用到的字段形成数据宽表,为在宽表的基础上对数据进行预处理做好基础。In step S701, fields that may be used are screened out from the existing small and micro enterprise electricity consumption operation status data and loan data to form a data wide table, so as to lay a solid foundation for data preprocessing on the basis of the wide table .
进一步的,在步骤S702中,数据宽表的基础上对数据进行处理,如图8所示,包括步骤S801至步骤S803。Further, in step S702, the data is processed on the basis of the data width table, as shown in FIG. 8, including steps S801 to S803.
步骤S801,采用欠采样法或调整权重采样法,解决样本不平衡现象。Step S801, adopting the under-sampling method or the weight-adjusted sampling method to solve the sample imbalance phenomenon.
步骤S802,采用箱线法处理异常值。Step S802, using the box-line method to process outliers.
步骤S803,采用线性回归法填补缺失值。Step S803, using a linear regression method to fill in the missing values.
在步骤S801中,样本数据往往会出现不平衡现象,针对这种现象的处理办法一般分为:欠采样,调整权重以及合成少数类过采样技术,简称为SMOTE。SMOTE是针对处理数据样本不均衡的一种方法,相较于随机过采样处理技术,SMOTE的优势在于能够利用少数类的样本进行分析并根据这些少量样进行人工合成,生成新的样本并扩充到数据样本中,本次处理采用合成少数过采样技术进行数据不平衡处理。In step S801, the sample data often has an unbalanced phenomenon, and the processing methods for this phenomenon are generally divided into: under-sampling, weight adjustment and synthesis of minority over-sampling technology, referred to as SMOTE for short. SMOTE is a method for dealing with unbalanced data samples. Compared with random oversampling processing technology, the advantage of SMOTE is that it can use a minority of samples for analysis and artificially synthesize these small samples to generate new samples and expand to In the data samples, this processing adopts the synthetic minority oversampling technique to deal with the data imbalance.
在步骤S802是对异常值进行处理,在本实施例中采用的箱线法对异常值进行处理。具体地,对小微企业的用电量做箱线图,突出大于99%分位数的数据,剔除小于1%分位数的数据。In step S802, the abnormal value is processed, and the box-line method adopted in this embodiment is used to process the abnormal value. Specifically, a boxplot is made for the electricity consumption of small and micro enterprises, highlighting the data greater than the 99% quantile, and excluding the data less than 1% quantile.
步骤S803中的线性回归是应对回归问题最常用的方法之一,其实是一种线性的建模方法,可以通过凸优法进行求解,具体地,通过最小化下面的目标函数进行求解:The linear regression in step S803 is one of the most commonly used methods for dealing with regression problems. In fact, it is a linear modeling method, which can be solved by the convex optimization method. Specifically, the solution can be solved by minimizing the following objective function:
其中,J(θ)为WJX和y的函数。目标函数中包括两部分内容,正则项用于控制模型复杂度,损失项用于度量拟合误差,目标函数是同舱位W的凸函数。正则项参数λ>0为最小化误差和模型复杂度之间提供了一种折中,用来避免过于拟合。where J(θ) is a function of W J X and y. The objective function includes two parts, the regular term is used to control the complexity of the model, the loss term is used to measure the fitting error, and the objective function is a convex function of the same class W. The regularization parameter λ>0 provides a compromise between minimizing error and model complexity to avoid overfitting.
线性回归法可以采用如下步骤。The linear regression method can use the following steps.
步骤一:给定训练数据样本集D={(x1,y1),(x2,y2),...,(xm,ym)},yi∈R,选取初值θ0,给定收敛容差ε,最大迭代次数K,然后解下面优化问题。Step 1: Given a training data sample set D={(x 1 ,y 1 ),(x 2 ,y 2 ),...,(x m ,y m )}, y i ∈ R, select the initial value θ 0 , given the convergence tolerance ε, the maximum number of iterations K, and then solve the following optimization problem.
步骤二:采取下面公式更新θStep 2: Take the following formula to update θ
当|J(θ(K+1))-J(θ(k))|<ε或者当k=K,输出为0,其中k为迭代数,否则重复步骤二,直至条件成立。When |J(θ(K+1))-J(θ(k))|<ε or when k=K, the output is 0, where k is the number of iterations, otherwise step 2 is repeated until the condition is satisfied.
构造回归决策函数:f(x)=θTX,用回归决策函数进行缺失值补充,得到完整的,没有数据值缺失的数据,以便后续进行对该小微企业进行评分。Construct the regression decision function: f(x) = θ T X, and use the regression decision function to supplement the missing values to obtain complete data with no missing data values, so that the small and micro enterprises can be scored later.
本申请提供评分体系如图8所示,包括:缴费信用度,用电稳定度,行业景气度,用电增长度,违约用电信息,以及贷款涨幅。The scoring system provided in this application is shown in Figure 8, including: payment credit, electricity consumption stability, industry prosperity, electricity consumption growth, default electricity consumption information, and loan increase.
缴费信用度包括:缴费预存金额、缴费及时率、欠费金额、平均欠费时长。Payment credit includes: pre-paid payment amount, timely payment rate, arrears amount, and average arrears duration.
用电稳定度包括:企业电压稳定度、用电陡增次数。The power consumption stability includes: enterprise voltage stability and the number of times of sudden increase in power consumption.
行业景气度包括:行业景气指数、行业用电总量。Industry prosperity includes: industry prosperity index, industry total electricity consumption.
用电增长度包括:企业月用电量同比、企业月用电量环比。The growth rate of electricity consumption includes: the year-on-year monthly electricity consumption of enterprises and the month-on-month ratio of monthly electricity consumption of enterprises.
违约用电包括:违约用电次数、违约窃电金额、违约窃电次数、追补电量。The breached electricity consumption includes: the number of breached electricity consumption, the amount of breached electricity theft, the number of breached electricity theft, and the recovery of electricity.
贷款涨幅包括:贷款涨幅指数和贷款金额。The loan increase includes: loan increase index and loan amount.
根据上述的数据项构建评分体系,对小微企业贷款风险进行评估。According to the above data items, a scoring system is constructed to evaluate the loan risk of small and micro enterprises.
虽然本申请提供了如以上文字描述或流程图所述的方法操作步骤,但基于常规或者无创造性的劳动可以包括更多或者更少的操作步骤。本实施例中列举的步骤顺序仅仅为众多步骤执行顺序中的一种方式,不代表唯一的执行顺序。在实际中的装置或客户端产品执行时,可以按照本实施例或者附图所示的方法顺序执行或者并行执行(例如并行处理器或者多线程处理的环境)。Although the present application provides method operation steps as described in the above textual description or flow chart, more or less operation steps may be included based on routine or non-creative work. The sequence of steps enumerated in this embodiment is only one way among the execution sequences of many steps, and does not represent the only execution sequence. When an actual device or client product is executed, the methods shown in this embodiment or the accompanying drawings may be executed sequentially or in parallel (for example, a parallel processor or a multi-threaded processing environment).
本发明实施例还提供了一种小微企业贷款评估装置,该评估装置包括:数据获取单元901,体系构建单元902,综合评分单元903,监测方案生成单元904。The embodiment of the present invention also provides a small and micro enterprise loan evaluation device, the evaluation device includes: a data acquisition unit 901 , a system construction unit 902 , a comprehensive scoring unit 903 , and a monitoring plan generation unit 904 .
其中,数据获取单元901用于获取小微企业的用电运行状态数据和贷款数据;体系构建单元902用于构建小微企业的贷款评估指标体系;综合评分单元903用于对贷款评估指标体系中每个评估指标的赋予权重,确定企业综合评分,企业综合评分等于多个评估指标的加权和;监测方案生成单元904用于根据企业综合评分设定监测阈值,以监测小微企业的贷前和贷后风险。Among them, the data acquisition unit 901 is used to acquire the power consumption operation status data and loan data of small and micro enterprises; the system construction unit 902 is used to construct the loan evaluation index system of small and micro enterprises; the comprehensive scoring unit 903 is used to evaluate the loan evaluation index system. The weight assigned to each evaluation index determines the comprehensive enterprise score, which is equal to the weighted sum of multiple evaluation indicators; the monitoring plan generation unit 904 is configured to set a monitoring threshold according to the comprehensive enterprise score, so as to monitor the pre-loan and loan balance of small and micro enterprises. post-loan risk.
综合评分单元可以具体用于:采用层次分析法和/或熵值法,确定每个评估指标的权重,然后确定企业综合评分。The comprehensive scoring unit can be specifically used for: using the analytic hierarchy process and/or the entropy value method, to determine the weight of each evaluation index, and then to determine the comprehensive score of the enterprise.
综合评分单元还可以具体用于:采用综合赋权法确定每个评估指标的权重,然后确定企业综合评分,综合赋权法的函数为:The comprehensive scoring unit can also be specifically used to: determine the weight of each evaluation index by the comprehensive weighting method, and then determine the comprehensive score of the enterprise. The function of the comprehensive weighting method is:
其中,Wj为综合赋权的权重值,W1j为层次分析法和TOPSIS法确定权重值,W2j为熵值法和TOPSIS法确定权重值。Among them, W j is the weight value of the comprehensive weighting, W 1j is the weight value determined by the AHP and TOPSIS method, and W 2j is the weight value determined by the entropy value method and the TOPSIS method.
监测方案生成单元具体用于:采用梯度提升树法预测小微企业的月用电量,进而设定监测阈值。The monitoring plan generation unit is specifically used to: predict the monthly electricity consumption of small and micro enterprises by using the gradient boosting tree method, and then set the monitoring threshold.
装置还包括数据宽表构造单元和数据处理单元:数据宽表构造单元用于关联用电运行状态数据和贷款数据,并筛选字段形成数据宽表;数据处理单元用于在数据宽表的基础上对数据进行处理,以解决样本不平衡现象、处理异常值和填补缺失值。The device also includes a data wide table construction unit and a data processing unit: the data wide table construction unit is used for correlating the data of the electricity consumption running state and the loan data, and filtering fields to form a data wide table; the data processing unit is used for Process data to resolve sample imbalances, handle outliers, and impute missing values.
数据处理单元具体用于:采用欠采样法或调整权重采样法,解决样本不平衡现象;采用箱线法处理异常值;采用线性回归法填补缺失值。The data processing unit is specifically used for: adopting the undersampling method or adjusting the weight sampling method to solve the sample imbalance phenomenon; adopting the box-line method to deal with outliers; adopting the linear regression method to fill in the missing values.
贷款评估指标体系包括:缴费信用度,用电稳定度,行业景气度,用电增长度,以及违约用电信息。The loan evaluation index system includes: payment credit, electricity consumption stability, industry prosperity, electricity consumption growth, and default electricity consumption information.
本发明实施例阐明的装置或模块,具体可以由计算机芯片或实体实现,或者由具有某种功能的产品来实现。为了描述的方便,描述以上装置时以功能分为各种模块分别描述。在实施本申请时可以把各模块的功能在同一个或多个软件和/或硬件中实现。当然,也可以将实现某功能的模块由多个子模块或子单元组合实现。The devices or modules described in the embodiments of the present invention may be specifically implemented by computer chips or entities, or by products with certain functions. For the convenience of description, when describing the above device, the functions are divided into various modules and described respectively. When implementing the present application, the functions of each module may be implemented in one or more software and/or hardware. Of course, a module that implements a certain function can also be implemented by a combination of multiple sub-modules or sub-units.
如图10所示,本发明实施例还提供了一种小微企业贷款评估装置,包括:存储器1001和处理器1002;存储器1001用于存储计算机可执行指令;处理器1002用于执行计算机可执行指令以实现本实施例提供的小微企业贷款评估方法。As shown in FIG. 10 , an embodiment of the present invention further provides a loan evaluation device for small and micro enterprises, including: a memory 1001 and a processor 1002; the memory 1001 is used for storing computer-executable instructions; the processor 1002 is used for executing computer-executable instructions Instructions are provided to implement the loan evaluation method for small and micro enterprises provided in this embodiment.
本发明实施例还提供一种计算机可读存储介质,计算机可读存储介质存储有可执行指令,计算机执行可执行指令时以实现本实施例提供的小微企业贷款评估方法。Embodiments of the present invention further provide a computer-readable storage medium, where the computer-readable storage medium stores executable instructions, and when the computer executes the executable instructions, the small and micro enterprise loan evaluation method provided in this embodiment is implemented.
上述存储介质包括但不限于随机存取存储器(英文:Random Access Memory;简称:RAM)、只读存储器(英文:Read-Only Memory;简称:ROM)、缓存(英文:Cache)、硬盘(英文:Hard Disk Drive;简称:HDD)或者存储卡(英文:Memory Card)。所述存储器可以用于存储计算机程序指令。The above-mentioned storage medium includes but is not limited to random access memory (English: Random Access Memory; referred to as: RAM), read-only memory (English: Read-Only Memory; referred to as: ROM), cache (English: Cache), hard disk (English: Hard Disk Drive; referred to as: HDD) or memory card (English: Memory Card). The memory may be used to store computer program instructions.
本申请中所述的方法、装置或模块可以以计算机可读程序代码方式实现控制器按任何适当的方式实现,例如,控制器可以采取例如微处理器或处理器以及存储可由该(微)处理器执行的计算机可读程序代码(例如软件或固件)的计算机可读介质、逻辑门、开关、专用集成电路(英文:Application Specific Integrated Circuit;简称:ASIC)、可编程逻辑控制器和嵌入微控制器的形式,控制器的例子包括但不限于以下微控制器:ARC 625D、Atmel AT91SAM、Microchip PIC18F26K20以及Silicone Labs C8051F320,存储器控制器还可以被实现为存储器的控制逻辑的一部分。本领域技术人员也知道,除了以纯计算机可读程序代码方式实现控制器以外,完全可以通过将方法步骤进行逻辑编程来使得控制器以逻辑门、开关、专用集成电路、可编程逻辑控制器和嵌入微控制器等的形式来实现相同功能。因此这种控制器可以被认为是一种硬件部件,而对其内部包括的用于实现各种功能的装置也可以视为硬件部件内的结构。或者甚至,可以将用于实现各种功能的装置视为既可以是实现方法的软件模块又可以是硬件部件内的结构。The methods, apparatuses or modules described in this application may be implemented in computer readable program code. The controller may be implemented in any suitable manner, for example, the controller may take the form of, for example, a microprocessor or processor and the memory may be implemented by the (micro)processing computer-readable medium, logic gates, switches, application-specific integrated circuits (English: Application Specific Integrated Circuit; ASIC for short), programmable logic controllers and embedded microcontrollers Examples of controllers include, but are not limited to, the following microcontrollers: ARC 625D, Atmel AT91SAM, Microchip PIC18F26K20, and Silicon Labs C8051F320, and memory controllers can also be implemented as part of the memory's control logic. Those skilled in the art also know that, in addition to implementing the controller in the form of pure computer-readable program code, the controller can be implemented as logic gates, switches, application-specific integrated circuits, programmable logic controllers and embedded devices by logically programming the method steps. The same function can be realized in the form of a microcontroller, etc. Therefore, such a controller can be regarded as a hardware component, and the devices included therein for realizing various functions can also be regarded as a structure within the hardware component. Or even, the means for implementing various functions can be regarded as both a software module implementing a method and a structure within a hardware component.
本说明书中的各个实施方式采用递进的方式描述,各个实施方式之间相同或相似的部分互相参见即可,每个实施方式重点说明的都是与其他实施方式的不同之处。本申请的全部或者部分可用于众多通用或专用的计算机系统环境或配置中。例如:个人计算机、服务器计算机、手持设备或便携式设备、平板型设备、移动通信终端、多处理器系统、基于微处理器的系统、可编程的电子设备、网络PC、小型计算机、大型计算机、包括以上任何系统或设备的分布式计算环境等等。Each embodiment in this specification is described in a progressive manner, and the same or similar parts between the various embodiments may be referred to each other, and each embodiment focuses on the differences from other embodiments. All or part of this application may be used in numerous general purpose or special purpose computer system environments or configurations. For example: personal computers, server computers, handheld or portable devices, tablet devices, mobile communication terminals, multiprocessor systems, microprocessor-based systems, programmable electronic devices, network PCs, minicomputers, mainframe computers, including A distributed computing environment for any of the above systems or devices, and the like.
以上实施例仅用以说明本申请的技术方案,而非对本申请限制;尽管参照前述实施例对本申请进行了详细的说明,本领域普通技术人员应当理解:其依然可以对前述实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请技术方案的范围。The above embodiments are only used to illustrate the technical solutions of the present application, but not to limit the present application; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still The technical solutions are modified, or some or all of the technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the scope of the technical solutions of the present application.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011278327.7A CN112508679A (en) | 2020-11-16 | 2020-11-16 | Small and micro enterprise loan risk assessment method and device and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011278327.7A CN112508679A (en) | 2020-11-16 | 2020-11-16 | Small and micro enterprise loan risk assessment method and device and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112508679A true CN112508679A (en) | 2021-03-16 |
Family
ID=74958020
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011278327.7A Pending CN112508679A (en) | 2020-11-16 | 2020-11-16 | Small and micro enterprise loan risk assessment method and device and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112508679A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113095931A (en) * | 2021-04-25 | 2021-07-09 | 国家电网有限公司 | Post-loan risk monitoring method and device and computer equipment |
CN113095712A (en) * | 2021-04-25 | 2021-07-09 | 国家电网有限公司 | Enterprise credit granting score obtaining method and device and computer equipment |
CN113744043A (en) * | 2021-09-03 | 2021-12-03 | 国网江苏省电力有限公司常州供电分公司 | Credit risk assessment method and device for small and micro enterprises and computer equipment |
CN115034509A (en) * | 2022-06-30 | 2022-09-09 | 中国建设银行股份有限公司 | Data processing method, apparatus, equipment and storage medium |
CN115941502A (en) * | 2022-11-24 | 2023-04-07 | 中国联合网络通信集团有限公司 | Method and device for determining network operation and maintenance capabilities |
CN119417237A (en) * | 2025-01-07 | 2025-02-11 | 国网山东省电力公司营销服务中心(计量中心) | A collection system risk assessment method, system, medium, equipment and program |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100921618B1 (en) * | 2009-04-28 | 2009-10-15 | 기술신용보증기금 | Technology evaluation method for technology start-up companies |
CN106504125A (en) * | 2016-12-27 | 2017-03-15 | 北京中电普华信息技术有限公司 | A kind of method and device for building power customer overall evaluation system |
CN107194803A (en) * | 2017-05-19 | 2017-09-22 | 南京工业大学 | P2P net loan borrower credit risk assessment device |
CN109934371A (en) * | 2017-12-18 | 2019-06-25 | 普华讯光(北京)科技有限公司 | The method that solvency risk identification and prediction are carried out to enterprise based on electricity consumption data |
CN111369171A (en) * | 2020-03-19 | 2020-07-03 | 上海观安信息技术股份有限公司 | User service safety comprehensive risk assessment method based on combined empowerment |
CN111612323A (en) * | 2020-05-15 | 2020-09-01 | 国网河北省电力有限公司电力科学研究院 | A power credit evaluation method based on big data model |
-
2020
- 2020-11-16 CN CN202011278327.7A patent/CN112508679A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100921618B1 (en) * | 2009-04-28 | 2009-10-15 | 기술신용보증기금 | Technology evaluation method for technology start-up companies |
CN106504125A (en) * | 2016-12-27 | 2017-03-15 | 北京中电普华信息技术有限公司 | A kind of method and device for building power customer overall evaluation system |
CN107194803A (en) * | 2017-05-19 | 2017-09-22 | 南京工业大学 | P2P net loan borrower credit risk assessment device |
CN109934371A (en) * | 2017-12-18 | 2019-06-25 | 普华讯光(北京)科技有限公司 | The method that solvency risk identification and prediction are carried out to enterprise based on electricity consumption data |
CN111369171A (en) * | 2020-03-19 | 2020-07-03 | 上海观安信息技术股份有限公司 | User service safety comprehensive risk assessment method based on combined empowerment |
CN111612323A (en) * | 2020-05-15 | 2020-09-01 | 国网河北省电力有限公司电力科学研究院 | A power credit evaluation method based on big data model |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113095931A (en) * | 2021-04-25 | 2021-07-09 | 国家电网有限公司 | Post-loan risk monitoring method and device and computer equipment |
CN113095712A (en) * | 2021-04-25 | 2021-07-09 | 国家电网有限公司 | Enterprise credit granting score obtaining method and device and computer equipment |
CN113744043A (en) * | 2021-09-03 | 2021-12-03 | 国网江苏省电力有限公司常州供电分公司 | Credit risk assessment method and device for small and micro enterprises and computer equipment |
CN115034509A (en) * | 2022-06-30 | 2022-09-09 | 中国建设银行股份有限公司 | Data processing method, apparatus, equipment and storage medium |
CN115941502A (en) * | 2022-11-24 | 2023-04-07 | 中国联合网络通信集团有限公司 | Method and device for determining network operation and maintenance capabilities |
CN115941502B (en) * | 2022-11-24 | 2024-07-02 | 中国联合网络通信集团有限公司 | A method and device for determining network operation and maintenance capability |
CN119417237A (en) * | 2025-01-07 | 2025-02-11 | 国网山东省电力公司营销服务中心(计量中心) | A collection system risk assessment method, system, medium, equipment and program |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112508679A (en) | Small and micro enterprise loan risk assessment method and device and storage medium | |
CN107169628B (en) | A distribution network reliability assessment method based on big data mutual information attribute reduction | |
CN110704542A (en) | Data dynamic partitioning system based on node load | |
CN110990529B (en) | Industry detail dividing method and system for enterprises | |
CN113095712A (en) | Enterprise credit granting score obtaining method and device and computer equipment | |
CN111178585A (en) | Prediction method of fault reception volume based on multi-algorithm model fusion | |
CN117674119A (en) | Power grid operation risk assessment method, device, computer equipment and storage medium | |
CN104951843B (en) | Method for Sales Forecast system and method | |
CN113919162B (en) | Voltage sag risk early warning method based on simulation and multi-source measured data fusion | |
CN106407305A (en) | Data mining system and method | |
CN111950706A (en) | Data processing method, device, computer equipment and medium based on artificial intelligence | |
CN115081641A (en) | Model training method, prediction method of evaluation result, equipment and storage medium | |
CN113807469A (en) | A multi-energy user value prediction method, device, storage medium and device | |
CN117973848A (en) | Power system supply resilience analysis method, device, computer equipment and medium | |
CN116912016A (en) | Bill auditing method and device | |
CN107256461A (en) | A kind of electrically-charging equipment builds address evaluation method and system | |
CN114254857A (en) | Electric equipment inventory evaluation method and server | |
US11227288B1 (en) | Systems and methods for integration of disparate data feeds for unified data monitoring | |
CN112927012A (en) | Marketing data processing method and device and marketing model training method and device | |
CN116805182A (en) | Industrial added value growth forecast methods, systems, electronic equipment and storage media | |
AU2023275277A1 (en) | Methods, systems and computer program products for determining models for predicting reoccurring transactions | |
Li et al. | The application of internet big data and support vector machine in risk warning | |
CN116149895A (en) | Big data cluster performance prediction method and device and computer equipment | |
Rameshkhah et al. | Comparison of shuffled frog leaping algorithm and PSO in data clustering with constraint for grouping voltage control areas in power systems | |
CN107958327A (en) | A kind of project process Risk Forecast Method based on factorial analysis and SOM networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20210316 |
|
RJ01 | Rejection of invention patent application after publication |