CN111861738A - Risk control rule screening method, device, computer equipment and storage medium - Google Patents

Risk control rule screening method, device, computer equipment and storage medium Download PDF

Info

Publication number
CN111861738A
CN111861738A CN202010843248.XA CN202010843248A CN111861738A CN 111861738 A CN111861738 A CN 111861738A CN 202010843248 A CN202010843248 A CN 202010843248A CN 111861738 A CN111861738 A CN 111861738A
Authority
CN
China
Prior art keywords
rule
variable
credit evaluation
evaluation model
rules
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010843248.XA
Other languages
Chinese (zh)
Inventor
陈岚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Fuzhifu Information Technology Co ltd
Original Assignee
Shenzhen Fuzhifu Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Fuzhifu Information Technology Co ltd filed Critical Shenzhen Fuzhifu Information Technology Co ltd
Priority to CN202010843248.XA priority Critical patent/CN111861738A/en
Publication of CN111861738A publication Critical patent/CN111861738A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof

Landscapes

  • Business, Economics & Management (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

本发明提供了一种风控规则筛选方法、装置、计算机设备及存储介质,所述方法包括,将信用评估模型的各条规则的变量名逐条命名为与各条规则的名称一致,并将各条规则变量作为自变量,贷后表现变量作为因变量,生成一个一维向量;对信用评估模型的每条进件,生成一个新的一维向量;若信用评估模型的进件中命中一条或多条规则,在新的一维向量中给命中的规则变量赋值1,未命中变量赋值0;若进件贷后逾期,在新的一维向量中给贷后表现变量赋值1,否则赋值0;利用逻辑回归模型,计算出信用评估模型的各条规则的权重参数,将权重参数小于预设值的规则从信用评估模型中剔除。本发明能够更有效地筛选出真正起作用的风控规则。

Figure 202010843248

The present invention provides a method, device, computer equipment and storage medium for screening risk control rules. The method includes: naming the variable names of each rule of the credit evaluation model one by one to be consistent with the name of each rule; The rule variable is used as the independent variable, and the post-loan performance variable is used as the dependent variable to generate a one-dimensional vector; for each entry of the credit evaluation model, a new one-dimensional vector is generated; if one of the entries of the credit evaluation model is hit or For multiple rules, assign 1 to the hit rule variable in the new one-dimensional vector, and assign 0 to the miss variable; if the incoming loan is overdue, assign 1 to the post-loan performance variable in the new one-dimensional vector, otherwise assign 0 ; Using the logistic regression model, the weight parameters of each rule of the credit evaluation model are calculated, and the rules whose weight parameters are less than the preset value are eliminated from the credit evaluation model. The present invention can more effectively screen out the really effective risk control rules.

Figure 202010843248

Description

风控规则筛选方法、装置、计算机设备及存储介质Risk control rule screening method, device, computer equipment and storage medium

技术领域technical field

本发明涉及一种规则筛选方法、装置、计算机设备及存储介质,尤其是指一种风控规则筛选方法、装置、计算机设备及存储介质。The present invention relates to a rule screening method, device, computer equipment and storage medium, in particular to a risk control rule screening method, device, computer equipment and storage medium.

背景技术Background technique

风控能力是信贷行业从业机构的核心竞争力。信贷行业风控系统一般由规则引擎和评分卡组成。规则引擎由几十到几百条不等的风控规则组成,用于评估申请人的还款意愿。评分卡输出评分等级,用于评估申请人的还款能力。申请人递交申请后,先由规则引擎对进件进行筛选,命中引擎中任何一条规则(如黑名单)的进件直接被拒绝,未命中规则的进件由评分卡处理。评分卡输出不同评分等级,低等级代表还款能力弱,系统直接拒绝,高等级代表还款能力强,系统直接通过,中间等级代表还款能力不确定,由信审人员进行人工信审,通过核查申请人信息的方式来决定通过与否。Risk control capability is the core competitiveness of institutions in the credit industry. The credit industry risk control system generally consists of a rule engine and a scorecard. The rule engine consists of dozens to hundreds of risk control rules, which are used to evaluate the repayment willingness of applicants. The scorecard outputs a score grade, which is used to assess an applicant's ability to repay. After the applicant submits the application, the rules engine will first screen the entries, and the entries that hit any rule in the engine (such as the blacklist) will be rejected directly, and the entries that do not hit the rules will be processed by the scorecard. The scorecard outputs different grades. The low grade means the repayment ability is weak, and the system directly rejects it. The high grade means the repayment ability is strong, and the system directly passes. The middle grade means the repayment ability is uncertain. Check the applicant's information to decide whether to pass or not.

在实际工作中,风控规则一般由风险政策人员制定,为了尽可能覆盖欺诈人群(即甄别没有还款意愿的人群),规则往往越来越多,维护成本越来越高。更重要的是,主观堆砌的规则会给花费高昂成本引流进来的进件造成误杀,风控系统迫切需要一个能科学评估规则有效性的方法。In practice, risk control rules are generally formulated by risk policy personnel. In order to cover fraudulent people as much as possible (that is, to identify people who are not willing to repay), there are often more and more rules and higher maintenance costs. More importantly, the subjectively stacked rules will cause manslaughter for the incoming parts that are drained at a high cost. The risk control system urgently needs a method that can scientifically evaluate the effectiveness of the rules.

目前整个行业对规则筛选没有简单高效的方法。一个繁琐、成本高的方法为,将规则引擎分解为多个子系统,子系统都内嵌一条不同规则。给每个子系统随机导入不同进件,进件命中某子系统内嵌规则也不拒绝,而由下一个流程的风控评分卡或人工信审进行处理。通过统计评分卡或人工信审批准的进件的贷后表现,来评估规则的有效性,即如果某子系统进件呈现出超常规逾期比例,判定相关规则为有效规则,否则为无效。在该方法中,子系统通过了足够多的进件后,对其内嵌规则的判定才有统计意义,对测试进件的需求量很大,且规则越多测试进件的需求量越大。总之,该方法需要大量测试进件,会放行很多本可由规则引擎拒绝的潜在欺诈进件,给业绩造成较大影响。At present, there is no simple and efficient method for rule screening in the entire industry. A cumbersome and expensive approach is to decompose the rule engine into multiple subsystems, each with a different rule embedded in it. Randomly import different feeds into each subsystem, and the feeds will not be rejected if they hit the embedded rules of a certain subsystem, but will be processed by the risk control scorecard or manual review of the next process. The validity of the rules is evaluated by counting the post-credit performance of the incoming documents approved by the scorecard or manual credit review, that is, if the incoming documents of a certain subsystem show an abnormal overdue ratio, the relevant rules are determined to be valid rules, otherwise they are invalid. In this method, after the subsystem has passed enough entries, the determination of its embedded rules has statistical significance. The demand for test entries is large, and the more rules, the greater the demand for test entries. . In short, this method requires a large number of test inputs, and will release many potentially fraudulent inputs that could have been rejected by the rule engine, which will have a great impact on performance.

发明内容SUMMARY OF THE INVENTION

本发明所要解决的技术问题是:提供一种更加有效的风控规则筛选方法、装置、计算机设备及存储介质.The technical problem to be solved by the present invention is to provide a more effective screening method, device, computer equipment and storage medium for risk control rules.

为了解决上述技术问题,本发明采用的技术方案为:一种风控规则筛选方法,包括以下步骤,In order to solve the above-mentioned technical problems, the technical solution adopted in the present invention is: a method for screening risk control rules, comprising the following steps:

S10、将信用评估模型的各条规则的变量名逐条命名为与各条规则的名称一致,并将各条规则变量作为自变量,贷后表现变量作为因变量,生成一个一维向量;S10. Name the variable names of each rule of the credit evaluation model to be consistent with the name of each rule, and use each rule variable as an independent variable and the post-loan performance variable as a dependent variable to generate a one-dimensional vector;

S20、对信用评估模型的每条进件,生成一个新的一维向量;S20, generating a new one-dimensional vector for each input condition of the credit evaluation model;

S30、若信用评估模型的进件中命中一条或多条规则,在新的一维向量中给命中的规则变量赋值1,未命中变量赋值0;S30. If one or more rules are hit in the input of the credit evaluation model, assign 1 to the hit rule variable in the new one-dimensional vector, and assign 0 to the miss variable;

S40、若进件贷后逾期,在新的一维向量中给贷后表现变量赋值1,否则赋值0;S40. If the incoming loan is overdue, assign 1 to the post-loan performance variable in the new one-dimensional vector, otherwise assign 0;

S50、利用逻辑回归模型,计算出信用评估模型的各条规则的权重参数;S50, using the logistic regression model to calculate the weight parameters of each rule of the credit evaluation model;

S60、根据信用评估模型的各条规则的权重参数,将权重参数小于预设值的规则从信用评估模型中剔除。S60. According to the weight parameters of each rule of the credit evaluation model, remove the rules whose weight parameters are smaller than the preset value from the credit evaluation model.

进一步的,所述步骤S50中,逻辑回归模型预设有L1正则化项的损失函数。Further, in the step S50, the logistic regression model is preset with a loss function of the L1 regularization term.

进一步的,所述步骤S50具体包括,Further, the step S50 specifically includes:

利用逻辑回归模型的L1正则化项的稀疏化特征,计算出信用评估模型的各条规则的权重参数。Using the sparse feature of the L1 regularization term of the logistic regression model, the weight parameters of each rule of the credit evaluation model are calculated.

进一步的,所述步骤S60具体包括,Further, the step S60 specifically includes:

根据信用评估模型的各条规则的权重参数,将权重参数小于0.01的规则从信用评估模型中剔除。According to the weight parameters of each rule of the credit evaluation model, rules with a weight parameter less than 0.01 are eliminated from the credit evaluation model.

本发明还提供了一种风控规则筛选装置,包括,The present invention also provides a screening device for wind control rules, comprising:

变量生成模块,用于将信用评估模型的各条规则的变量名逐条命名为与各条规则的名称一致,并将各条规则变量作为自变量,贷后表现变量作为因变量,生成一个一维向量;The variable generation module is used to name the variable names of each rule of the credit evaluation model to be consistent with the name of each rule, and use each rule variable as an independent variable and the post-loan performance variable as a dependent variable to generate a one-dimensional vector;

进件模块,用于对信用评估模型的每条进件,生成一个新的一维向量;The entry module is used to generate a new one-dimensional vector for each entry of the credit evaluation model;

规则变量赋值模块,用于若信用评估模型的进件中命中一条或多条规则,在新的一维向量中给命中的规则变量赋值1,未命中变量赋值0;The rule variable assignment module is used to assign 1 to the hit rule variable and 0 to the miss variable in the new one-dimensional vector if one or more rules are hit in the input of the credit evaluation model;

贷后表现变量赋值模块,用于若进件贷后逾期,在新的一维向量中给贷后表现变量赋值1,否则赋值0;The post-loan performance variable assignment module is used to assign a value of 1 to the post-loan performance variable in a new one-dimensional vector if the incoming loan is overdue, otherwise it will be assigned a value of 0;

逻辑回归模块,用于利用逻辑回归模型,计算出信用评估模型的各条规则的权重参数;The logistic regression module is used to calculate the weight parameters of each rule of the credit evaluation model by using the logistic regression model;

规则剔除模块,用于根据信用评估模型的各条规则的权重参数,将权重参数小于预设值的规则从信用评估模型中剔除。The rule elimination module is used for eliminating the rules whose weight parameters are smaller than the preset value from the credit evaluation model according to the weight parameters of each rule of the credit evaluation model.

进一步的,所述逻辑回归模块中,逻辑回归模型预设有L1正则化项的损失函数。Further, in the logistic regression module, the logistic regression model is preset with a loss function of the L1 regularization term.

进一步的,所述逻辑回归模块具体用于,Further, the logistic regression module is specifically used for,

利用逻辑回归模型的L1正则化项的稀疏化特征,计算出信用评估模型的各条规则的权重参数。Using the sparse feature of the L1 regularization term of the logistic regression model, the weight parameters of each rule of the credit evaluation model are calculated.

进一步的,所述规则剔除模块具体用于,Further, the rule elimination module is specifically used for,

根据信用评估模型的各条规则的权重参数,将权重参数小于0.01的规则从信用评估模型中剔除。According to the weight parameters of each rule of the credit evaluation model, rules with a weight parameter less than 0.01 are eliminated from the credit evaluation model.

本发明还提供了一种计算机设备,所述计算机设备包括存储器及处理器,所述存储器上存储有计算机程序,所述处理器执行所述计算机程序时实现如上所述的风控规则筛选方法。The present invention also provides a computer device, the computer device includes a memory and a processor, the memory stores a computer program, and the processor implements the above-mentioned method for screening risk control rules when executing the computer program.

本发明还提供了一种存储介质,所述存储介质存储有计算机程序,所述计算机程序被处理器执行时可实现如上所述的风控规则筛选方法。The present invention also provides a storage medium, where the storage medium stores a computer program, and when the computer program is executed by the processor, the above-mentioned method for screening risk control rules can be implemented.

本发明的有益效果在于:本发明将风控规则的评估和筛选任务考虑为一个逻辑回归模型,利用该模型目标函数的L1正则化稀疏化特征,在求解规则变量的权重的过程中,将不重要的规则权重自动迭代为无限接近为零,简化了规则评估过程,将权重参数小于预设值的规则从信用评估模型中剔除,留下真正起作用的规则,避免了由于进件命中不重要的规则而被信用评估模型所拒绝,该方法极大减小了所需测试进件量,降低了潜在的逾期成本。The beneficial effects of the present invention are: the present invention considers the evaluation and screening tasks of risk control rules as a logistic regression model, and utilizes the L1 regularization and sparseness feature of the objective function of the model to solve the weights of the rule variables. The weight of important rules is automatically iterated to infinitely close to zero, which simplifies the rule evaluation process. Rules with weight parameters less than the preset value are eliminated from the credit evaluation model, leaving the rules that really work, avoiding the unimportant hit due to incoming items. This method greatly reduces the amount of test input required and reduces the potential overdue cost.

附图说明Description of drawings

下面结合附图详述本发明的具体结构。The specific structure of the present invention will be described in detail below with reference to the accompanying drawings.

图1为本发明实施例的风控规则筛选方法流程图;1 is a flowchart of a method for screening risk control rules according to an embodiment of the present invention;

图2为本发明实施例的风控规则筛选装置框图;2 is a block diagram of an apparatus for screening wind control rules according to an embodiment of the present invention;

图3为本发明实施例的计算机设备的示意性框图。FIG. 3 is a schematic block diagram of a computer device according to an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

应当理解,当在本说明书和所附权利要求书中使用时,术语“包括”和“包含”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It is to be understood that, when used in this specification and the appended claims, the terms "comprising" and "comprising" indicate the presence of the described features, integers, steps, operations, elements and/or components, but do not exclude one or The presence or addition of a number of other features, integers, steps, operations, elements, components, and/or sets thereof.

还应当理解,在此本发明说明书中所使用的术语仅仅是出于描述特定实施例的目的而并不意在限制本发明。如在本发明说明书和所附权利要求书中所使用的那样,除非上下文清楚地指明其它情况,否则单数形式的“一”、“一个”及“该”意在包括复数形式。It is also to be understood that the terminology used in this specification of the present invention is for the purpose of describing particular embodiments only and is not intended to limit the present invention. As used in this specification and the appended claims, the singular forms "a," "an," and "the" are intended to include the plural unless the context clearly dictates otherwise.

还应当进一步理解,在本发明说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It should further be understood that, as used in this specification and the appended claims, the term "and/or" refers to and including any and all possible combinations of one or more of the associated listed items .

请参阅图1,本发明的第一实施例为:一种风控规则筛选方法,包括以下步骤,Referring to FIG. 1, the first embodiment of the present invention is: a method for screening risk control rules, including the following steps:

S10、将信用评估模型的各条规则的变量名逐条命名为与各条规则的名称一致,并将各条规则变量作为自变量,贷后表现变量作为因变量,生成一个一维向量;S10. Name the variable names of each rule of the credit evaluation model to be consistent with the name of each rule, and use each rule variable as an independent variable and the post-loan performance variable as a dependent variable to generate a one-dimensional vector;

S20、对信用评估模型的每条进件,生成一个新的一维向量;S20, generating a new one-dimensional vector for each input condition of the credit evaluation model;

S30、若信用评估模型的进件中命中一条或多条规则,在新的一维向量中给命中的规则变量赋值1,未命中变量赋值0;S30. If one or more rules are hit in the input of the credit evaluation model, assign 1 to the hit rule variable in the new one-dimensional vector, and assign 0 to the miss variable;

S40、若进件贷后逾期,在新的一维向量中给贷后表现变量赋值1,否则赋值0;S40. If the incoming loan is overdue, assign 1 to the post-loan performance variable in the new one-dimensional vector, otherwise assign 0;

S50、利用逻辑回归模型,计算出信用评估模型的各条规则的权重参数;S50, using the logistic regression model to calculate the weight parameters of each rule of the credit evaluation model;

S60、根据信用评估模型的各条规则的权重参数,将权重参数小于预设值的规则从信用评估模型中剔除。S60. According to the weight parameters of each rule of the credit evaluation model, remove the rules whose weight parameters are smaller than the preset value from the credit evaluation model.

进一步的,所述步骤S50中,逻辑回归模型预设有L1正则化项的损失函数。Further, in the step S50, the logistic regression model is preset with a loss function of the L1 regularization term.

进一步的,所述步骤S50具体包括,Further, the step S50 specifically includes:

利用逻辑回归模型的L1正则化项的稀疏化特征,计算出信用评估模型的各条规则的权重参数。Using the sparse feature of the L1 regularization term of the logistic regression model, the weight parameters of each rule of the credit evaluation model are calculated.

进一步的,所述步骤S60具体包括,Further, the step S60 specifically includes:

根据信用评估模型的各条规则的权重参数,将权重参数小于0.01的规则从信用评估模型中剔除。According to the weight parameters of each rule of the credit evaluation model, rules with a weight parameter less than 0.01 are eliminated from the credit evaluation model.

本实施例中,将风控规则的评估和筛选任务考虑为一个逻辑回归模型,利用该模型目标函数的L1正则化稀疏化特征,在求解规则变量的权重的过程中,将不重要的规则权重自动迭代为无限接近为零,简化了规则评估过程,将权重参数小于预设值的规则从信用评估模型中剔除,留下真正起作用的规则,避免了由于进件命中不重要的规则而被信用评估模型所拒绝,该方法极大减小了所需测试进件量,降低了潜在的逾期成本。In this embodiment, the evaluation and screening tasks of risk control rules are considered as a logistic regression model, and the L1 regularization and sparse features of the objective function of the model are used to calculate the weights of unimportant rules in the process of solving the weights of rule variables. The automatic iteration is infinitely close to zero, which simplifies the rule evaluation process, removes the rules whose weight parameters are less than the preset value from the credit evaluation model, leaves the rules that really work, and avoids the unimportant rules that are hit by the entry. Rejected by the credit evaluation model, this method greatly reduces the amount of test input required and reduces potential overdue costs.

通过上述实施例,能够解决行业中信用评估模型中各种规则的有效性的评估问题,比如,一个信用评估模型中的某条规则拒绝了一个进件,由于该进件没有贷后表现,我们无从得知该规则是否有效,即不能判断该进件是否被误杀。一个信用评估模型的规则越多,拒绝的进件可能也越多,怎样判定哪些规则拒绝了真正该拒绝的进件,哪些规则的误杀比例过高,这个问题一直是整个行业中各种信用评估模型无法解决的问题,本发明实施例提出的量化解决方案,通过逻辑回归模型,解决了目前信用评估模型中普通存在却无简单办法来量化规则有效性的问题。具体的,一条风控规则的自变量x_i(i代表第i条规则),贷后逾期表现为应变量y。逾期表现为不逾期时,y的取值为0,逾期表现为逾期时,y的取值为1。为了量化规则评价,将规则评价问题转换为一个逻辑回归模型中自变量权重之间的大小问题,即某自变量权重的绝对值越大,所对应的相关规则重要性或有效性越高,经过一段时间的逻辑回归机器学习迭代之后,就能得到各条规则的权重参数,将权重小于0.01的规则从模型剔除,达到规则筛选优化的目的。The above embodiment can solve the problem of evaluating the validity of various rules in the credit evaluation model in the industry. For example, a certain rule in a credit evaluation model rejects an entry. Since the entry has no post-loan performance, we There is no way to know whether the rule is valid, that is, it is impossible to judge whether the entry has been killed by mistake. The more rules a credit evaluation model has, the more entries may be rejected. How to determine which rules reject the entries that should be rejected, and which rules have a high rate of manslaughter? This problem has always been a problem for various credit evaluations in the entire industry For the problem that cannot be solved by the model, the quantitative solution proposed by the embodiment of the present invention solves the problem that there is no simple method to quantify the validity of the rules in the current credit evaluation model through the logistic regression model. Specifically, the independent variable x_i (i represents the i-th rule) of a risk control rule, and the overdue expression after loan is the dependent variable y. When the overdue performance is not overdue, the value of y is 0, and when the overdue performance is overdue, the value of y is 1. In order to quantify the rule evaluation, the rule evaluation problem is transformed into a size problem between the independent variable weights in a logistic regression model. After a period of logistic regression machine learning iteration, the weight parameters of each rule can be obtained, and the rules with a weight less than 0.01 are eliminated from the model to achieve the purpose of rule screening and optimization.

请参阅图2,本发明的另一实施例为:一种风控规则筛选装置,包括,Please refer to FIG. 2, another embodiment of the present invention is: a screening device for risk control rules, comprising:

变量生成模块10,用于将信用评估模型的各条规则的变量名逐条命名为与各条规则的名称一致,并将各条规则变量作为自变量,贷后表现变量作为因变量,生成一个一维向量;The variable generation module 10 is used to name the variable names of each rule of the credit evaluation model one by one to be consistent with the name of each rule, and use each rule variable as an independent variable and a post-loan performance variable as a dependent variable to generate a variable name. dimensional vector;

进件模块20,用于对信用评估模型的每条进件,生成一个新的一维向量;The entry module 20 is used to generate a new one-dimensional vector for each entry of the credit evaluation model;

规则变量赋值模块30,用于若信用评估模型的进件中命中一条或多条规则,在新的一维向量中给命中的规则变量赋值1,未命中变量赋值0;The rule variable assignment module 30 is used to assign 1 to the hit rule variable in the new one-dimensional vector if one or more rules are hit in the entry of the credit evaluation model, and 0 to the miss variable;

贷后表现变量赋值模块40,用于若进件贷后逾期,在新的一维向量中给贷后表现变量赋值1,否则赋值0;The post-loan performance variable assignment module 40 is used to assign a value of 1 to the post-loan performance variable in a new one-dimensional vector if the incoming loan is overdue, otherwise, assign a value of 0;

逻辑回归模块50,用于利用逻辑回归模型,计算出信用评估模型的各条规则的权重参数;The logistic regression module 50 is used to calculate the weight parameters of each rule of the credit evaluation model by using the logistic regression model;

规则剔除模块60,用于根据信用评估模型的各条规则的权重参数,将权重参数小于预设值的规则从信用评估模型中剔除。The rule elimination module 60 is used for eliminating the rules whose weight parameter is smaller than the preset value from the credit evaluation model according to the weight parameters of each rule of the credit evaluation model.

进一步的,所述逻辑回归模块50中,逻辑回归模型预设有L1正则化项的损失函数。Further, in the logistic regression module 50, the logistic regression model is preset with a loss function of the L1 regularization term.

进一步的,所述逻辑回归模块50具体用于,Further, the logistic regression module 50 is specifically used to:

利用逻辑回归模型的L1正则化项的稀疏化特征,计算出信用评估模型的各条规则的权重参数。Using the sparse feature of the L1 regularization term of the logistic regression model, the weight parameters of each rule of the credit evaluation model are calculated.

进一步的,所述规则剔除模块60具体用于,Further, the rule elimination module 60 is specifically used to:

根据信用评估模型的各条规则的权重参数,将权重参数小于0.01的规则从信用评估模型中剔除。According to the weight parameters of each rule of the credit evaluation model, rules with a weight parameter less than 0.01 are eliminated from the credit evaluation model.

需要说明的是,所属领域的技术人员可以清楚地了解到,上述风控规则筛选装置的具体实现过程,可以参考前述方法实施例中的相应描述,为了描述的方便和简洁,在此不再赘述。It should be noted that those skilled in the art can clearly understand that the specific implementation process of the above-mentioned risk control rule screening device may refer to the corresponding description in the foregoing method embodiments. For the convenience and brevity of the description, it will not be repeated here. .

上述风控规则筛选装置可以实现为一种计算机程序的形式,该计算机程序可以在如图3所示的计算机设备上运行。The above-mentioned wind control rule screening apparatus can be implemented in the form of a computer program, and the computer program can be executed on the computer equipment as shown in FIG. 3 .

请参阅图3,图3是本申请实施例提供的一种计算机设备的示意性框图。该计算机设备500可以是终端,也可以是服务器,其中,终端可以是智能手机、平板电脑、笔记本电脑、台式电脑、个人数字助理和穿戴式设备等具有通信功能的电子设备。服务器可以是独立的服务器,也可以是多个服务器组成的服务器集群。Please refer to FIG. 3 , which is a schematic block diagram of a computer device provided by an embodiment of the present application. The computer device 500 may be a terminal or a server, wherein the terminal may be an electronic device with communication functions, such as a smart phone, a tablet computer, a notebook computer, a desktop computer, a personal digital assistant, and a wearable device. The server can be an independent server or a server cluster composed of multiple servers.

参阅图3,该计算机设备500包括通过系统总线501连接的处理器502、存储器和网络接口505,其中,存储器可以包括非易失性存储介质503和内存储器504。Referring to FIG. 3 , the computer device 500 includes a processor 502 , a memory and a network interface 505 connected by a system bus 501 , wherein the memory may include a non-volatile storage medium 503 and an internal memory 504 .

该非易失性存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032包括程序指令,该程序指令被执行时,可使得处理器502执行一种风控规则筛选方法。The nonvolatile storage medium 503 can store an operating system 5031 and a computer program 5032 . The computer program 5032 includes program instructions, which, when executed, can cause the processor 502 to execute a method for screening wind control rules.

该处理器502用于提供计算和控制能力,以支撑整个计算机设备500的运行。The processor 502 is used to provide computing and control capabilities to support the operation of the entire computer device 500 .

该内存储器504为非易失性存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行一种风控规则筛选方法。The internal memory 504 provides an environment for running the computer program 5032 in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can execute a risk control rule screening method.

该网络接口505用于与其它设备进行网络通信。本领域技术人员可以理解,图3中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的计算机设备500的限定,具体的计算机设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface 505 is used for network communication with other devices. Those skilled in the art can understand that the structure shown in FIG. 3 is only a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer device 500 to which the solution of the present application is applied. The specific computer device 500 may include more or fewer components than shown, or combine certain components, or have a different arrangement of components.

其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现如上的风控规则筛选方法。Wherein, the processor 502 is configured to run the computer program 5032 stored in the memory, so as to realize the above method for screening the risk control rules.

应当理解,在本申请实施例中,处理器502可以是中央处理单元(CentralProcessing Unit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(DigitalSignal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that, in this embodiment of the present application, the processor 502 may be a central processing unit (Central Processing Unit, CPU), and the processor 502 may also be other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), dedicated integrated Circuit (Application Specific Integrated Circuit, ASIC), off-the-shelf Programmable Gate Array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, and the like. Wherein, the general-purpose processor can be a microprocessor or the processor can also be any conventional processor or the like.

本领域普通技术人员可以理解的是实现上述实施例的方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成。该计算机程序包括程序指令,计算机程序可存储于一存储介质中,该存储介质为计算机可读存储介质。该程序指令被该计算机系统中的至少一个处理器执行,以实现上述方法的实施例的流程步骤。It can be understood by those skilled in the art that all or part of the processes in the methods for implementing the above embodiments can be completed by instructing relevant hardware through a computer program. The computer program includes program instructions, and the computer program can be stored in a storage medium, and the storage medium is a computer-readable storage medium. The program instructions are executed by at least one processor in the computer system to implement the flow steps of the above-described method embodiments.

因此,本发明还提供一种存储介质。该存储介质可以为计算机可读存储介质。该存储介质存储有计算机程序,其中计算机程序包括程序指令。该程序指令被处理器执行时使处理器执行如上的风控规则筛选方法。Therefore, the present invention also provides a storage medium. The storage medium may be a computer-readable storage medium. The storage medium stores a computer program, wherein the computer program includes program instructions. When the program instructions are executed by the processor, the processor executes the above-mentioned screening method for risk control rules.

所述存储介质可以是U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、磁碟或者光盘等各种可以存储程序代码的计算机可读存储介质。The storage medium may be various computer-readable storage media that can store program codes, such as a U disk, a removable hard disk, a read-only memory (Read-Only Memory, ROM), a magnetic disk, or an optical disk.

本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Those of ordinary skill in the art can realize that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two, in order to clearly illustrate the differences between hardware and software Interchangeability, the above description has generally described the components and steps of each example in terms of function. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of the present invention.

在本发明所提供的几个实施例中,应该理解到,所揭露的装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的。例如,各个单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。In the several embodiments provided by the present invention, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are merely illustrative. For example, the division of each unit is only a logical function division, and other division methods may be used in actual implementation. For example, multiple units or components may be combined or may be integrated into another system, or some features may be omitted, or not implemented.

本发明实施例方法中的步骤可以根据实际需要进行顺序调整、合并和删减。本发明实施例装置中的单元可以根据实际需要进行合并、划分和删减。另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以是两个或两个以上单元集成在一个单元中。The steps in the method of the embodiment of the present invention may be adjusted, combined and deleted in sequence according to actual needs. Units in the apparatus of the embodiment of the present invention may be combined, divided, and deleted according to actual needs. In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.

该集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分,或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,终端,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。The integrated unit, if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a storage medium. Based on this understanding, the technical solution of the present invention is essentially or a part that contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to cause a computer device (which may be a personal computer, a terminal, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention.

以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以权利要求的保护范围为准。The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited to this. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed by the present invention. Modifications or substitutions should be included within the protection scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

Claims (10)

1. A wind control rule screening method is characterized by comprising the following steps: comprises the following steps of (a) carrying out,
s10, naming the variable names of each rule of the credit evaluation model one by one to be consistent with the names of the rules, taking each rule variable as an independent variable, taking the credit expression variable as a dependent variable, and generating a one-dimensional vector;
s20, generating a new one-dimensional vector for each entry of the credit evaluation model;
s30, if one or more rules are hit in the incoming condition of the credit evaluation model, assigning 1 to the hit rule variable and assigning 0 to the miss variable in the new one-dimensional vector;
s40, if the loan is overdue, assigning 1 to the post-loan expression variable in the new one-dimensional vector, and otherwise, assigning 0;
s50, calculating the weight parameters of each rule of the credit evaluation model by using a logistic regression model;
and S60, according to the weight parameters of each rule of the credit evaluation model, eliminating the rule with the weight parameter smaller than the preset value from the credit evaluation model.
2. The method for screening wind control rules according to claim 1, wherein: in step S50, the logistic regression model is preset with a loss function of the L1 regularization term.
3. The wind control rule screening method according to claim 2, wherein: the step S50 specifically includes the steps of,
and calculating the weight parameters of each rule of the credit evaluation model by using the sparse characteristics of the L1 regularization term of the logistic regression model.
4. The wind control rule screening method according to claim 3, wherein: the step S60 specifically includes the steps of,
and according to the weight parameters of all the rules of the credit evaluation model, removing the rules with the weight parameters less than 0.01 from the credit evaluation model.
5. The utility model provides a wind accuse rule sieving mechanism which characterized in that: comprises the steps of (a) preparing a mixture of a plurality of raw materials,
the variable generation module is used for naming the variable names of the rules of the credit evaluation model one by one to be consistent with the names of the rules, taking the rule variables as independent variables, taking the credited expression variables as dependent variables and generating a one-dimensional vector;
the system comprises a component feeding module, a component extracting module and a component analyzing module, wherein the component feeding module is used for generating a new one-dimensional vector for each component fed by a credit evaluation model;
the rule variable assignment module is used for assigning 1 to a hit rule variable and assigning 0 to a missed rule variable in a new one-dimensional vector if one or more rules are hit in the incoming condition of the credit evaluation model;
the post-loan performance variable assignment module is used for assigning 1 to the post-loan performance variable in a new one-dimensional vector if the post-loan is overdue, and assigning 0 if the post-loan is overdue;
the logistic regression module is used for calculating the weight parameters of all rules of the credit evaluation model by using the logistic regression model;
and the rule removing module is used for removing the rules with the weight parameters smaller than the preset value from the credit evaluation model according to the weight parameters of all the rules of the credit evaluation model.
6. The wind control rule screening apparatus of claim 5, wherein: in the logistic regression module, a logistic regression model is preset with a loss function of an L1 regularization term.
7. The wind control rule screening apparatus of claim 6, wherein: the logistic regression module is specifically configured to,
and calculating the weight parameters of each rule of the credit evaluation model by using the sparse characteristics of the L1 regularization term of the logistic regression model.
8. The wind control rule screening apparatus of claim 7, wherein: the rule culling module is specifically configured to,
and according to the weight parameters of all the rules of the credit evaluation model, removing the rules with the weight parameters less than 0.01 from the credit evaluation model.
9. A computer device, characterized by: the computer device comprises a memory having a computer program stored thereon and a processor that, when executing the computer program, implements the method of wind control rule screening according to any one of claims 1 to 4.
10. A storage medium, characterized by: the storage medium stores a computer program which, when executed by a processor, implements the method of wind control rule screening according to any one of claims 1 to 4.
CN202010843248.XA 2020-08-20 2020-08-20 Risk control rule screening method, device, computer equipment and storage medium Pending CN111861738A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010843248.XA CN111861738A (en) 2020-08-20 2020-08-20 Risk control rule screening method, device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010843248.XA CN111861738A (en) 2020-08-20 2020-08-20 Risk control rule screening method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111861738A true CN111861738A (en) 2020-10-30

Family

ID=72970531

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010843248.XA Pending CN111861738A (en) 2020-08-20 2020-08-20 Risk control rule screening method, device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111861738A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112991034A (en) * 2020-11-30 2021-06-18 重庆誉存大数据科技有限公司 Model-based mini-enterprise credit assessment method, equipment and storage medium
CN113240259A (en) * 2021-04-30 2021-08-10 顶象科技有限公司 Method and system for generating rule policy group and electronic equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112991034A (en) * 2020-11-30 2021-06-18 重庆誉存大数据科技有限公司 Model-based mini-enterprise credit assessment method, equipment and storage medium
CN113240259A (en) * 2021-04-30 2021-08-10 顶象科技有限公司 Method and system for generating rule policy group and electronic equipment

Similar Documents

Publication Publication Date Title
US10943186B2 (en) Machine learning model training method and device, and electronic device
KR20180041174A (en) Risk Assessment Methods and Systems
CN1604040A (en) Dynamic transaction control within a host transaction processing system
CN107862425B (en) Wind control data acquisition method, device and system and readable storage medium
CN111681102A (en) Credit prediction method, apparatus, device and storage medium
CN108769163A (en) Alliance's chain common recognition reaches method, equipment and computer readable storage medium
CN111861738A (en) Risk control rule screening method, device, computer equipment and storage medium
CN112686746A (en) Credit flow configuration method, device and equipment
KR102195629B1 (en) Method for selecting workers based on capability of work in crowdsourcing based projects for artificial intelligence training data generation
CN111611786B (en) Text similarity calculation method and device
CN113240259B (en) Rule policy group generation method and system and electronic equipment
CN113487440A (en) Model generation method, health insurance claim settlement determination method, device, equipment and medium
WO2019104844A1 (en) Automatic performance testing method, apparatus and device for monetary fund system, and storage medium
CN112532625A (en) Network situation awareness evaluation data updating method and device and readable storage medium
CN116107789A (en) Method for monitoring and analyzing application fault reasons and storage medium
WO2011149608A1 (en) Identifying and using critical fields in quality management
CN113780580B (en) Data analysis method, device, equipment and storage medium based on machine learning
CN111221923B (en) User identity verification method and device, storage medium and electronic equipment
CN110781410A (en) Community detection method and device
WO2023049745A1 (en) Artificial intelligence assisted live sports data quality assurance
CN113610113B (en) Data visualization method and device
CN114595216A (en) Data verification method, device, storage medium and electronic device
CN112967127A (en) Suspicious loan checking method, system, computer equipment and storage medium
CN113191877A (en) Data feature acquisition method and system and electronic equipment
CN111681005A (en) Data interaction method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201030